Hey, I'm working on an application that allows users to submit content, but i'm having a hard time understanding how exactly the submitted information is encoded. The content is in another language I know that I have to utilize utf-8 in order to render the special characters correctly. How do I make sure that the characters inputed by the users are utf-8 compliant though? I've noticed that in windows vista, I have two options for the language which I've selected: "legacy" and "standard", and they (although output the same character), the Hex values for them are not ideantical.
How is this typically done, so that all user submitted content are subject to the same encoding. Do I need to, perhaps, have buttons to insert special character's html entities via javascript?
When storing the submitted data (assuming it was typed with utf-8 characters), do store them in the mysql db as is? or do store their html entity equivalent?