Hi,
I have an Html document that declares that it uses the utf-8 character
set. As this document is editable via a web interface I need to make
sure than high-ascii characters that may be accidentally entered are
properly represented when the document is served. My programming
language allows me to get the ascii value for any individual character
so what I am doing when a change is saved is to look at each character
in the content and if the ascii value for a character > 127 then I
replace 'character' with '&#AsciiValue;' .
I am not very well up on character sets and document encoding
mechanisms so I would like to know, is this a sensible idea?
TIA
Chandy
I have an Html document that declares that it uses the utf-8 character
set. As this document is editable via a web interface I need to make
sure than high-ascii characters that may be accidentally entered are
properly represented when the document is served. My programming
language allows me to get the ascii value for any individual character
so what I am doing when a change is saved is to look at each character
in the content and if the ascii value for a character > 127 then I
replace 'character' with '&#AsciiValue;' .
I am not very well up on character sets and document encoding
mechanisms so I would like to know, is this a sensible idea?
TIA
Chandy
Comment