Where to look for source of HTML::Parser

  • Himanshu Garg

    Where to look for source of HTML::Parser

    Hello,

    I am trying to pinpoint an apparent bug in HTML::Parser. The
    encoding of the text seems to change incorrectly if the locale isn't
    set properly.

    However, Parser.pm in the directory
    (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
    doesn't seem to contain the "real" parsing statements.

    Could you tell me where to look for the "real" code?

    Thank You
    Himanshu.
  • Gunnar Hjalmarsson

    #2
    Re: Where to look for source of HTML::Parser

    Himanshu Garg wrote:
    > I am trying to pinpoint an apparent bug in HTML::Parser. The
    > encoding of the text seems to change incorrectly if the locale
    > isn't set properly.
    >
    > However, Parser.pm in the directory
    > (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
    > doesn't seem to contain the "real" parsing statements.
    >
    > Could you tell me where to look for the "real" code?

    HTML::Parser requires HTML::Entities, and it's probably the latter you
    want to study.

    --
    Gunnar Hjalmarsson
    Email: http://www.gunnar.cc/cgi-bin/contact.pl


    • Himanshu Garg

      #3
      Re: Where to look for source of HTML::Parser

      Gunnar Hjalmarsson <noreply@gunnar.cc> wrote in message news:<FwFSb.47096$mU6.179415@newsb.telia.net>...
      > Himanshu Garg wrote:
      > > I am trying to pinpoint an apparent bug in HTML::Parser. The
      > > encoding of the text seems to change incorrectly if the locale
      > > isn't set properly.
      > >
      > > However, Parser.pm in the directory
      > > (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
      > > doesn't seem to contain the "real" parsing statements.
      > >
      > > Could you tell me where to look for the "real" code?
      >
      > HTML::Parser requires HTML::Entities, and it's probably the latter you
      > want to study.
      No, it is not the one I am looking for. HTML/Entities.pm merely encodes
      and decodes entities.

      I am looking for where other things, like tags and their attributes,
      are handled, and I do not find them in either of these files.

      Nevertheless, I thank you very much for your reply.

      Thank You
      ++imanshu.


      • Joe Smith

        #4
        Re: Where to look for source of HTML::Parser

        Himanshu Garg wrote:

        > Hello,
        >
        > I am trying to pinpoint an apparent bug in HTML::Parser. The
        > encoding of the text seems to change incorrectly if the locale isn't
        > set properly.
        >
        > However, Parser.pm in the directory
        > (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
        > doesn't seem to contain the "real" parsing statements.
        >
        > Could you tell me where to look for the "real" code?

        Take a look at the beginning of the Perl module:
        /usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/HTML/Parser.pm
            require DynaLoader;
            @ISA = qw(DynaLoader);
            HTML::Parser->bootstrap($VERSION);
        That means Parser.pm loads its compiled code from Parser.so (and
        Parser.bs) through DynaLoader.

        /usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/.packlist
        /usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/Parser.bs
        /usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/Parser.so

        The *.so files are built from *.h and *.c files; Parser.c is itself
        generated from Parser.xs.
        ..cpan/build/HTML-Parser-3.34/hctype.h
        ..cpan/build/HTML-Parser-3.34/hparser.c
        ..cpan/build/HTML-Parser-3.34/hparser.h
        ..cpan/build/HTML-Parser-3.34/Parser.bs
        ..cpan/build/HTML-Parser-3.34/Parser.c
        ..cpan/build/HTML-Parser-3.34/Parser.o
        ..cpan/build/HTML-Parser-3.34/Parser.pm
        ..cpan/build/HTML-Parser-3.34/Parser.xs
        ..cpan/build/HTML-Parser-3.34/pfunc.h
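
A rough way to check for the compiled object on another machine is sketched below. It assumes a POSIX shell, a `perl` on the PATH, and the `.so` suffix seen in the listing above (the suffix differs on some platforms). The function names `xs_object_relpath` and `find_xs_object` are made up for this example.

```shell
#!/bin/sh
# Map a Perl module name to the relative path of its compiled XS object,
# following the auto/<Module/Path>/<LastComponent>.so layout listed above.
# ASSUMPTION: the shared-object suffix is ".so"; it varies by platform.
xs_object_relpath() {
    module=$1
    path=$(printf '%s' "$module" | sed 's|::|/|g')   # HTML::Parser -> HTML/Parser
    base=${path##*/}                                  # last component: Parser
    printf 'auto/%s/%s.so\n' "$path" "$base"
}

# Look for that object under every directory in perl's @INC and print
# each match. Prints nothing if the module has no compiled part.
find_xs_object() {
    rel=$(xs_object_relpath "$1")
    perl -e 'print "$_\n" for @INC' | while IFS= read -r dir; do
        if [ -f "$dir/$rel" ]; then
            printf '%s\n' "$dir/$rel"
        fi
    done
}
```

Running `find_xs_object HTML::Parser` should print the full path of Parser.so if the module is installed with its compiled part. If it prints a path, the parsing code lives in the C and XS sources of the distribution, not in Parser.pm.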

        --
        I love my TiVo - http://www.inwap.com/u/joe/tivo/
