Introduction
HTML is the backbone of the Web. We can build a website without JavaScript, even without CSS, but without HTML—any HTML—, no, we can’t.
As HTML is so important and yet also so easy, everyone writes HTML, and everyone says they can write HTML. They don’t just mean they’re able to write HTML, but that they write good HTML, where “good” means “high quality.”
That would be great news, because writing good HTML is actually more important than writing good CSS or good JavaScript. This is at least the view of the first paradigm of web development, which sees a benefit in separating concerns, and assumes, as the end goal, a large code base of many HTML templates and files with very few style sheets and scripts.
The problem is not that there’s also a new paradigm, in which components feature HTML and CSS and JavaScript (or ECMAScript, if we want to adjust terminology), but that not everyone writes good HTML. That has many reasons.
The most important one is that HTML isn’t actually that easy, because it really is complex. Want an example?
HTML 5.2, which is the latest HTML recommendation over at the W3C (as opposed to the WHATWG, where HTML really is written—usually, look at their spec), contains 111 elements alone. 111. How many do you know? How many does the average web developer know? What does that mean for their HTML if they don’t know all the elements?
Want another example? When people talk about HTML elements like, for example, html or a or p, then most of the time they do mean elements. But what they then talk about is “tags.” Google, as of September, 2019, finds 25 million occurrences of “html tags” alone. HTML elements, what people mean, but don’t call by name? 2.4 million hits, a tenth. This begs the question how well developers understand HTML if they’re not sure about the difference between elements and tags.
Another one? In a few years, HTML will celebrate its 30th birthday. Very cool! So we will certainly have maxed out all options to reduce HTML payload, to improve performance? Well, no. One of the major options at our disposal to reduce HTML payload is not to write HTML that can be left out without a document turning invalid (please validate, by the way). Unfortunately, almost no one uses that option. Web developers have their concerns and habits, yes, but the point is that the method is so under-utilized, we cannot speak of HTML mastery here, either.
We could go on, adding data and anecdotes to how HTML is both important and yet not well-understood, nor well-used. The route I want to take now is to take 10 examples, from the wild, to show simple improvements to respective HTML code. That code, then, has been anonymized, for this booklet is not about pointing fingers, but to simply look at HTML and see what else we can do, how we can improve it, how we can: upgrade it.
Welcome to this first and light and playful edition of Upgrade Your HTML.
—Jens Oliver Meiert (short: Jens).
PS.
The booklet really is about HTML. Not SGML, not XHTML, not XML, just HTML.