Another useful Web globalization Q&A has been added to the W3C site. It has to do with the issue of character sets and encodings. I can’t even begin to describe how confusing this issue can be to Web developers as they begin tackling new languages — and new scripts. But is is something they will encounter more frequently. Fortunately, we now have Unicode.
Question:
What is the ‘Document Character Set’ for XML and HTML, and how does it relate to the encodings I use for my documents?
For the answer, go to: http://www.w3.org/International/questions/qa-doc-charset.html