I want to read out several strings from an HTML (text) file, like
everything between "<h2>" and "</h2>" to create a table-of-contents
(also, things other than tags).
I do have a function but it's slow, and sometimes doesn't finish for
larger files (600K, not much really!).
Now what would be a nice function to do this job? I suppose some regex
with preg_match_all?
It should have a parameter telling which occurrence of the string
should be used, e.g. the second, third and so on.
------------
Like:
function getTextBetween( $allText, $textBefore, $textAfter, $offset = 0)
{
// ?
}
Then I could say:
$s = getTextBetween( "<h2>foo</h2><p>Hello World</p><h2>bar</h2>",
"<h2>", "</h2>", 1);
echo $s; // ... would be "bar"
------------
Any help greatly appreciated!
everything between "<h2>" and "</h2>" to create a table-of-contents
(also, things other than tags).
I do have a function but it's slow, and sometimes doesn't finish for
larger files (600K, not much really!).
Now what would be a nice function to do this job? I suppose some regex
with preg_match_all?
It should have a parameter telling which occurrence of the string
should be used, e.g. the second, third and so on.
------------
Like:
function getTextBetween( $allText, $textBefore, $textAfter, $offset = 0)
{
// ?
}
Then I could say:
$s = getTextBetween( "<h2>foo</h2><p>Hello World</p><h2>bar</h2>",
"<h2>", "</h2>", 1);
echo $s; // ... would be "bar"
------------
Any help greatly appreciated!
Comment