How a browser displays a foreign-language Web site depends largely on how the site's text is numerically encoded. Understanding a little about encoding can help you develop foreign-language Web sites properly.
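As a rough illustration of what "numerically encoded" means, the short Python sketch below shows one character stored as different bytes under two encodings, and what happens when the bytes are read with the wrong one. The sample character and the variable names are chosen only for this example and do not come from the tutorial itself.

```python
# A minimal sketch: one character, its numeric code point, and its bytes
# under two different encodings. The sample character is an assumption
# picked for illustration only.
text = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE

code_point = ord(text)                  # the character's Unicode number
utf8_bytes = text.encode("utf-8")       # two bytes in UTF-8
latin1_bytes = text.encode("latin-1")   # one byte in ISO-8859-1 (Latin-1)

# Reading UTF-8 bytes as if they were Latin-1 produces garbled text,
# which is roughly what a browser shows when it guesses the wrong encoding.
garbled = utf8_bytes.decode("latin-1")

print(hex(code_point), utf8_bytes, latin1_bytes, garbled)
```

The same text round-trips cleanly only when the encoding used to read the bytes matches the one used to write them, which is why a page should declare its encoding.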
Much of the material in this tutorial is drawn from the following references. Links to additional references appear in the relevant sections.
Ager, Simon (1998-2005) "Omniglot: A Guide to Written Language"
Crystal, David (1997) "The Cambridge Encyclopedia of Language." Cambridge University Press.
Czyborra, Roman (2000) "Unicode in the Unix Environment"
Korpela, Jukka (2000-2004) "Theoretical Basis of Character Encoding"
Spolsky, Joel (2003) "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)"
©Penn State University, 2000-2012.
This Web page maintained by Teaching and Learning with Technology, a unit of Information Technology Services. For questions or comments on this Web page, please contact Elizabeth J. Pyatt (email@example.com).
Unicode character names and hexadecimal entity codes are taken from the public Unicode Character Charts.