Newsgroups : Borland : borland.public.delphi.internet.winsock : 2006 May : Re: Html entities... converting them e.g. unicode...
| Subject: | Re: Html entities... converting them e.g. unicode... |
| Posted by: | "Ralf Junker - http://www.yunqa.de/delphi/" (delphi.at.yunqa.dot...@) |
| Date: | Fri, 19 May 2006 10:23:49 |
The official W3C HTML entity conversion tables as part of the HTML specification
are here:
http://www.w3.org/TR/html4/sgml/entities.html
DIHtmlParser is a very fast and popular Unicode compatible HTML Parser for
Delphi which fully implements the above entities for both reading and writing:
http://www.yunqa.de/delphi/htmlparser/
Regards,
Ralf
"dk_sz" <dk_sz@hotmail.com> wrote:
>So suppose I have found e.g. following entities in some HTML:
>
> ,(CharVal: 219; CodeDec: 'Û'; CodeName: 'Û')
> ,(CharVal: 220; CodeDec: 'Ü'; CodeName: 'Ü')
>
>If I am using MBCS on western Windows...
>I can assume a one byte value of 219...
>
>But what if I wanted to convert to a WideString character..?
>(and wasn't using a MBCS / western Windows OS that...)
>Well, just something I begun puzzling about... I have not
>been able to find anything useful about this topic on the net...?
>I had kinda thought there would be some kind of tables or something?
---
The Delphi Inspiration
http://www.yunqa.de/delphi/
none