Re: character to HTML ampersand escape sequence converter
In article <hsivonen-BAB9D4.01493919 122004@news.dna internet.net>,
Henri Sivonen <hsivonen@iki.f i> writes:
[color=blue][color=green]
>> In article <hsivonen-5BCFB2.12592918 122004@news.dna internet.net>,
>> Henri Sivonen <hsivonen@iki.f i> writes:
>>[color=darkred]
>> >> Indeed. I was on the point of suggesting AN XML processor until I saw
>> >> that (libxml2 accepts HTML as well as XML input).[/color][/color]
>[color=green]
>> The HTML parser gives you either SAX or DOM, and will process either
>> HTML or XHTML input without distinction.[/color]
>
> Are the elements in the XHTML namespace or in no namespace?[/color]
They're not namespaced. At least not in the SAX parse mode, which is
where I've investigated the issue. At least, my preliminary experiments
trying to use the HTML parser in SAX2 mode were not successful, which
is not to say I won't return to the issue.
[color=blue]
> The good
> thing about TagSoup is that it allows the app internals to be written
> for XHTML, so the same app internals work for HTML, XHTML *and*
> XHTML+FooML (using an XML parser). That is, the HTML/XHTML difference is
> left on the parsing level and not carried over to higher levels as in
> browsers.[/color]
Watch this space. That's what I'd like mod_publisher to do. OTOH,
how many people mix HTML (no X) with other namespaces in real life?
The full capability is at best a pathological edge-case.
BTW, if you're interested in namespace processing on the Web,
may I refer you to my recently-published article at
--
Nick Kew
In article <hsivonen-BAB9D4.01493919 122004@news.dna internet.net>,
Henri Sivonen <hsivonen@iki.f i> writes:
[color=blue][color=green]
>> In article <hsivonen-5BCFB2.12592918 122004@news.dna internet.net>,
>> Henri Sivonen <hsivonen@iki.f i> writes:
>>[color=darkred]
>> >> Indeed. I was on the point of suggesting AN XML processor until I saw
>> >> that (libxml2 accepts HTML as well as XML input).[/color][/color]
>[color=green]
>> The HTML parser gives you either SAX or DOM, and will process either
>> HTML or XHTML input without distinction.[/color]
>
> Are the elements in the XHTML namespace or in no namespace?[/color]
They're not namespaced. At least not in the SAX parse mode, which is
where I've investigated the issue. At least, my preliminary experiments
trying to use the HTML parser in SAX2 mode were not successful, which
is not to say I won't return to the issue.
[color=blue]
> The good
> thing about TagSoup is that it allows the app internals to be written
> for XHTML, so the same app internals work for HTML, XHTML *and*
> XHTML+FooML (using an XML parser). That is, the HTML/XHTML difference is
> left on the parsing level and not carried over to higher levels as in
> browsers.[/color]
Watch this space. That's what I'd like mod_publisher to do. OTOH,
how many people mix HTML (no X) with other namespaces in real life?
The full capability is at best a pathological edge-case.
BTW, if you're interested in namespace processing on the Web,
may I refer you to my recently-published article at
--
Nick Kew
Comment