The HTML 5 draft reflects an effort, started in 2004, to study contemporary HTML implementations and deployed content. The draft: <ol type="1"><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Defines a single language called HTML 5 which can be written in a “custom” HTML syntax and in XML syntax. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Defines detailed processing models to foster interoperable implementations. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Improves markup for documents. </li><div class="separator" style="clear:both;text-align:center;"></div><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Introduces markup and APIs for emerging idioms, such as Web applications. </li></ol><div class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;"> </div><h3 id="open-issues" style="margin:auto 0;">1.1. Open Issues</h3>HTML 5 is still a draft. The contents of HTML 5, as well as the contents of this document which depend on HTML 5, are still being discussed on the HTML Working Group and WHATWG mailing lists. Some of the open issues include (this list is not exhaustive): <ul type="disc"><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">De facto semantic definitions for some formerly presentational elements. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Details of accessibility and media-independence features, such as the longdesc, alt, summary, and headers attributes. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">The style attribute. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">The repetition model. </li></ul><h3 id="backwards-compatible" style="margin:auto 0;">1.2. Backwards Compatible</h3>HTML 5 is defined in a way that it is backwards compatible with the way user agents handle deployed content. To keep the authoring language relatively simple for authors several elements and attributes are not included as outlined in the other sections of this document, such as presentational elements that are better dealt with using CSS. User agents, however, will always have to support these older elements and this is why the specification clearly separates requirements for authors and user agents. This means that authors can not use the isindex or plaintext element, but user agents are required to support them in a way that is compatible with how these elements behaved previously. Since HTML 5 has separate conformance requirements for authors and user agents there is no longer a need for marking things “deprecated”. <h3 id="development-model" style="margin:auto 0;">1.3. Development Model</h3>The HTML 5 specification will not be considered finished before there are at least two complete implementations of the specification. This is a different approach than previous versions of HTML had. The goal is to ensure that the specification is implementable and usable by designers and developers once it is finished. <h2 id="syntax" style="margin:auto 0;">2. Syntax</h2>The HTML 5 language has a “custom” HTML syntax that is compatible with HTML 4 and XHTML1 documents published on the Web, but is not compatible with the more esoteric SGML features of HTML 4, such as <em/content/. Documents using this “custom” syntax must be served with the text/html MIME type. HTML 5 also defines detailed parsing rules (including “error handling”) for this syntax which are largely compatible with popular implementations. User agents will follow these rules for resources that have the text/html MIME type. Here is an example document that conforms to the HTML syntax: <pre></pre><pre></pre><pre></pre><pre></pre><pre>Example document</pre><pre></pre><pre></pre><pre><p>Example paragraph</p></pre><pre></pre><pre></pre>The other syntax that can be used for HTML 5 is XML. This syntax is compatible with XHTML1 documents and implementations. Documents using this syntax need to be served with an XML MIME type and elements need to be put in the http://www.w3.org/1999/xhtml namespace following the rules set forth by the XML specifications. [XML] Below is an example document that conforms to the XML syntax of HTML 5. Note that XML documents must have an XML MIME type such as application/xhtml+xml or application/xml. <pre></pre><pre></pre><pre></pre><pre>Example document</pre><pre></pre><pre></pre><pre><p>Example paragraph</p></pre><pre></pre><pre></pre><h3 id="character-encoding" style="margin:auto 0;">2.1. Character Encoding</h3>For the HTML syntax of HTML 5 authors have three means of setting the character encoding: <ul type="disc"><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">At the transport level. By using the HTTP Content-Type header for instance. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Using a Unicode Byte Order Mark (BOM) character at the start of the file. This character provides a signature for the encoding used. </li><li class="MsoNormal" style="color:black;line-height:normal;margin:0 0 10pt;">Using a meta element with a charset attribute that specifies the encoding as the first element child of the head element. could be used to specify the UTF-8 encoding. This replaces the need for </li></ul>For the XML syntax authors have to use the rules as set forth in the XML specifications to set the character encoding. <h3 id="doctype" style="margin:auto 0;">2.2. The DOCTYPE</h3>The HTML syntax of HTML 5 requires a DOCTYPE to be specified to ensure that the browser renders the page in standards mode. The DOCTYPE has no other purpose and is therefore optional for XML. Documents with an XML MIME type are always handled in standards mode. [DOCTYPE] The DOCTYPE declaration is and is case-insensitive in the HTML syntax. DOCTYPEs from earlier versions of HTML were longer because the HTML language was SGML based and therefore required a reference to a DTD. With HTML 5 this is no longer the case and the DOCTYPE is only needed to enable standards mode for documents written using the HTML syntax. Browsers already do this for . <h2 id="language" style="margin:auto 0;">3. Language</h2>This section is split up in several subsections to more clearly illustrate the various differences there are between HTML 4 and HTML 5. <h3 id="new-elements" style="margin:auto 0;">3.1. New Elements</h3>The following elements have been introduced for better structure: <div style="margin-left:.5in;text-indent:-.25in;">·section represents a generic document or application section. It can be used together with h1-h6 to indicate the document structure. </div><div style="margin-left:.5in;text-indent:-.25in;">·article represents an independent piece of content of a document, such as a blog entry or newspaper article. </div><div style="margin-left:.5in;text-indent:-.25in;">·aside represents a piece of content that is only slightly related to the rest of the page. </div><div style="margin-left:.5in;text-indent:-.25in;">·header represents the header of a section. </div><div style="margin-left:.5in;text-indent:-.25in;">·footer represents a footer for a section and can contain information about the author, copyright information, et cetera. </div><div style="margin-left:.5in;text-indent:-.25in;">·nav represents a section of the document intended for navigation. </div><div style="margin-left:.5in;text-indent:-.25in;">·dialog can be used to mark up a conversation like this:</div><pre style="margin-left:60pt;text-indent:-.25in;">·</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Costello</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> Look, you gotta first baseman?</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Abbott</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> Certainly.</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Costello</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> Who's playing first?</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Abbott</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> That's right.</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Costello</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> When you pay off the first baseman every month, who gets the money?</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dt> Abbott</pre><pre style="margin-left:60pt;text-indent:-.25in;">·<dd> Every dollar of it. </pre><pre style="margin-left:60pt;"></pre><div style="margin-left:.5in;text-indent:-.25in;">·figure can be used to associate a caption together with some embedded content, such as a graphic or video:</div><pre style="margin-left:60pt;text-indent:-.25in;">·