[WMASTERS] Does 8-bit scheme clash with HTML standard?


I ran across this post by Michal Jankowski <michalj@fuw.edu.pl> in the
Usenet newsgroups comp.std.internat and comp.software.international
that raises potential problems with the characters in the 0x80 to 0x9F
range.  I haven't verified this yet, but I have seen similar statements
elsewhere.  Can the 8-bit Tamil font designers verify the assertions made
by this post?

Mani M. Manivannan
Fremont, CA, USA.

-------------------- Quoted article below ----------
Re: MES instead of ISO 8859-nn
From:         Michal Jankowski <michalj@fuw.edu.pl>
Date:         1997/07/07

>An example for a practical character subset frustration is ISO 8859-1
>versus Microsoft CP1252 (the Windows character set):
>Today, I already have the problem that Web page authors using
>MS-Windows use the CP1252 characters in the 0x80-0x9f range that
>are not part of ISO 8859-1 and therefore are IN THEORY not allowed
>in HTML. I can't see these characters on my fully HTML-conforming
>system, and Web authors who are unaware of the proper subset
>of their available character set frequently and inappropriately use
>QUOTATION MARK (0x91 and 0x92 in CP1252, 0x2018 and 0x2019
>in Unicode) where they really should use just QUOTATION
>MARK (0x22) if the character set announced by HTTP for this
>Web page is ISO 8859-1.
>I expect problems like this to be many orders of magnitude worse
>once Unicode starts to get widely used on the Web.  The above
>problem is at least well-defined, the people using the
>0x80-0x9f characters in HTML are clearly wrong, the HTML specification
>leaves no doubt about this.

-----------------  Quote ends ----------------------------------------------
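
One way to check a page for the bytes in question is a short script.  The
following is only a minimal Python sketch and is not part of the quoted
article: the function name find_c1_bytes and the sample bytes are made up
for illustration.  It lists every byte in the 0x80 to 0x9F range and shows
what CP1252 assigns to it; in ISO 8859-1 the same byte values are C1
control codes, not printable characters.

```python
import unicodedata


def find_c1_bytes(data: bytes):
    """Return (offset, byte value, CP1252 character or None) for every
    byte of `data` that falls in the 0x80-0x9F range."""
    hits = []
    for offset, value in enumerate(data):
        if 0x80 <= value <= 0x9F:
            try:
                char = bytes([value]).decode("cp1252")
            except UnicodeDecodeError:
                # A few positions (e.g. 0x81) are unassigned in CP1252.
                char = None
            hits.append((offset, value, char))
    return hits


# Hypothetical sample: text typed on Windows with "smart" quotes.
sample = b"It\x92s a \x93test\x94"

for offset, value, char in find_c1_bytes(sample):
    if char is None:
        meaning = "unassigned in CP1252"
    else:
        meaning = f"U+{ord(char):04X} {unicodedata.name(char)}"
    print(f"offset {offset}: byte 0x{value:02X} -> {meaning}")
```

A font that places glyphs in this range would be flagged the same way,
since the check looks only at byte values and not at what a particular
font draws there.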

Mani M. Manivannan
Fremont, CA, USA.

    Only a sculptor knows a sculpture's flaws


