parsing HTML results

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • John Raines

    parsing HTML results

    Hello-

    I am trying to parse HTML results which look like this:

    <html>
    <body>
    <span class=aaa>this is a test</span><span class=bbb>testi ng
    again</span><span class=aaa>test number 3</span>
    </body>
    </html>

    All the <spanare on one line.

    In this particular case, I want to match all the <span class=aaaso that my
    results would look like:

    this is a test
    test number 3

    I was fairly successful with a regex like /<span class=aaa>(.*)</span>/ but
    I could only get the first result.

    Is there an easy way to match <span class=aaa>*</spanand return the
    contents within the tag? And to either insert the results into an array, or
    have the ability to loop through the results?

    Thanks


  • Jürgen Exner

    #2
    Re: parsing HTML results

    John Raines wrote:
    I am trying to parse HTML results which look like this:
    [...]
    This has been discussed a gazillion of times.
    Is there an easy way to match <span class=aaa>*</spanand return the
    contents within the tag? And to either insert the results into an
    array, or have the ability to loop through the results?
    Absolutely. The best way to do it is as described in the FAQ: just use a
    parser to parse HTML.

    jue


    Comment

    Working...