Character encoding: Character encoding and the Web
Methods for indicating the character set
- Indicating the character set by means of an HTTP Content-Type header
The server can send a Content-Type header that indicates the character set of the document.
- Indicating the character set by means of a meta element
Web developers can also use a meta element in the HTML source code to indicate the character set of a page.
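As a rough illustration of the two methods above, the following Python sketch builds a Content-Type header value and an HTML 4.01-style meta element, then reads the character set back out of the header. The function names (`content_type_header`, `meta_charset_element`, `parse_charset`) are hypothetical helpers made up for this example and are not part of any server or framework.

```python
from email.message import Message


def content_type_header(charset: str = "utf-8") -> str:
    """Return a Content-Type header value for an HTML document (hypothetical helper)."""
    return f"text/html; charset={charset}"


def meta_charset_element(charset: str = "utf-8") -> str:
    """Return an HTML 4.01-style meta element declaring the charset (hypothetical helper)."""
    return (
        '<meta http-equiv="Content-Type" '
        f'content="text/html; charset={charset}">'
    )


def parse_charset(header_value: str):
    """Extract the charset parameter from a Content-Type header value.

    Uses the standard library's email.message.Message, which parses
    MIME-style headers; the charset is returned in lowercase, or None
    if the header carries no charset parameter.
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset()


if __name__ == "__main__":
    header = content_type_header("utf-8")
    print("Content-Type:", header)
    print(meta_charset_element("utf-8"))
    print("Declared charset:", parse_charset(header))
```

When both the header and the meta element are present, they should name the same character set so that browsers do not receive conflicting information.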
Guidelines for indicating the character set
Web developers should always specify the character set in their pages.
Which character set is suitable?
- The ISO-8859-1 (Latin-1) character set
- The UTF-8 character set
UTF-8, an encoding of the Unicode character set, has the most extensive repertoire: it covers the characters of most other character sets (Western and Eastern scripts, and symbols) in a single set. Web developers are advised to specify the UTF-8 character set.
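The difference in repertoire can be checked with a short Python sketch. It assumes nothing beyond the standard library; `can_encode` is a hypothetical helper written for this example, and the sample strings are arbitrary.

```python
def can_encode(text: str, charset: str) -> bool:
    """Return True if every character in text exists in the given charset."""
    try:
        text.encode(charset)
        return True
    except UnicodeEncodeError:
        return False


# Western European text fits in both character sets.
western = "café"
# The euro sign and Japanese text fall outside ISO-8859-1.
mixed = "€ 日本語"

if __name__ == "__main__":
    for sample in (western, mixed):
        for charset in ("iso-8859-1", "utf-8"):
            print(repr(sample), charset, can_encode(sample, charset))

    # Bytes written as UTF-8 but read as ISO-8859-1 come out garbled,
    # which is why the declared character set must match the real one.
    garbled = western.encode("utf-8").decode("iso-8859-1")
    print(garbled)
```

The last step shows what a visitor sees when a page is saved as UTF-8 but declared as ISO-8859-1: accented characters turn into pairs of unrelated symbols.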
Character encoding and fonts
Character encoding is sometimes – incorrectly – associated with fonts.
Links and references
- HTML Document Representation, W3C HTML 4.01 specification
- The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!), Joel Spolsky
