Im using the below to extract the text between all the <br></br>.
But it does not prints out all text and prints the normal text which is not a part of html link tag.
Example, if you have <a href="test.html " ><b>The Testing Page is here</b></a>
<b> extrat text</b>
I want to extract only - "The Testing Page is here"
Here variable $myfile
Here variable $myfile contains the whole HTML page
Can some one help me out, what I am doing wrong here?
More Information, I am trying to extract all the text which is a link in the given HTML page.
But it does not prints out all text and prints the normal text which is not a part of html link tag.
Example, if you have <a href="test.html " ><b>The Testing Page is here</b></a>
<b> extrat text</b>
I want to extract only - "The Testing Page is here"
Here variable $myfile
Here variable $myfile contains the whole HTML page
Code:
while ($myfile =~ /<br.+?>(.*)<\/br>/xg)
{print ("a");
print $1;
}
More Information, I am trying to extract all the text which is a link in the given HTML page.
Comment