Sweet. I’ll translate it!
Whilst your waiting, here is the TOC and the first bits translated:
- What is the world wide web?
- HTTP
- Hypertext, hyperlinks, hypermedia
- HTML
- …what about other media?
- Mediatype
- Registering a media type?
- Request for Comments (RFC)
- RFC for everything
- RFC for anything
- RFC 1121
- RFC 1925
- RFC 2119
- What gives? How does this help me?
- HTTP: the why?
- Inventing the wheel
- How do I prevent re-inventing the wheel?
- Web standards for web developers
- Join the development of our www
- Final words
What is the World Wide Web?
To be able to talk about web standards, it’s good to first define what the web is and consists of, including some elaborations.
HTTP
HTTP is short for Hypertext Transfer Protocol.
A protocol is generally a collection of rules to achieve some goal. In this case we’re dealing with a communication and transfer protocol, and it describes the syntax, semantics, error handling and correction, as well as synchronization to accomplish the following:
“…where hypertext documents include hyperlinks to other resources that the user can easily access.”
Wikipedia
Hypertext, hyperlinks, hypermedia
“Hyper-” from the Greek prefix “ὑπερ-”, which means over or beyond, as in exceeding.
- Hypertext is digitally displayable text, with references (yes, the hyperlinks) to other (hyper)text, which can be accessed directly by a user.
- Hyperlink is a link (a reference) that grants a user access to the data at the other end of that link (referred data).
- Hypermedia is an extension of the hypertext term and describes non-lineair (digital) media both including plain text and hyperlinks, as well as images, audio, and video.
An example of hypermedia is the World Wide Web!
HTML
HTML is short for HyperText Markup Language.
It follows that hypertext doesn’t equal HTML. Instead, this language is a way to mark up hypertext in such a way that the data (the text, and when we talk about hypermedia also the other types of content) can be displayed or referenced (using hyperlinks).
The HTML standard is meant to describe how to annotate documents so a browser may display it. It’s a way to make hypertext interactive. Thus, the web browser is a tool to follow hyperlinks, which we know as browsing the web.
HTTP aids transferring HTML documents, as it’s the Hypertext Transfer Protocol for documents written in Hypertext Markup Language.
…what about other media?
When you’re browsing the web, you’ll often run into content that’s not just text or text-based — plain or styled — but rather things such as images, audio, video, and other non-textual content.
The smart humans that brought HTTP to life (and continued developing it) also came up with a smart way to allow HTTP to send more than just hypertext and hypertext-based documents. The way they accomplished this is by using metadata which is sent along with the content requested.
In order to support other (types of) media, HTTP commonly uses a Media type (MIME TYPE).
Mediatype
The data format of a representation is known as a media type […]
On the web we technically talk about representations of resources. The format of such a representation (meaning the syntax, the rules, the usage, the constraints, etc.) are commonly known as a media type.
Some well known examples of media types are:
text/html
image/png
application/json
There are rules about the syntax and usage of these media types. Together we came up with and agreed on how specific binary data can be interpreted (read) or arranged (written). A PNG image is a PNG image if we interpret the binary data as a PNG. How that would and should work for PNGs is written down and accepted as a standard, after which it was (publicly) registered.
Registering a media type?
Here are another three examples of media types:
application/vnd.ms-powerpoint
application/graphql
application/vnd.xpbytes.errors.v1+json
The first one describes Microsoft Powerpoint files. The second one used to be the (defacto) standard way to desrcibe GraphQL queries and responses. The last one is one out of many vendor (vnd
) specific media types that we use at work to describe error messages.
There exists a document (and standard) that defines and explains how to register a media type: RFC 6838: Media Type Specifications and Registration Procedures
“This document defines procedures for the specification and registration of media types for use in HTTP, MIME, and other Internet protocols.”
The goal of registering a media type is to make it accessible for other to use on the interwebs, and so that we may agree on how data can be interpreted or arranged. Whilst it’s not strictly mandatory to register media types starting with vnd.
, it will aid in the process of writing, sanitizing, iterating, and improving a media type specification.
For media types outside of the vnd.
(vendor) and prs.
(personal) trees — in other words: media types that don’t start with that prefix — registration is mandatory. It’s not a media type until it’s officially been registered.