Hi ,
I am trying to parse HTML data and retrive the contents. I am facing a problem which I have explained below.
I have imported HTMLParser class and using the handle_data function. The issue here is the '<' and '>' data which is represented as &le and &ge is getting stripped off.
For eg: if the html representation is like <This> is an example which will read as <This> is an example . When I parse it, I am getting the value only as This is an example.
ie... '<' and '>' got stripped off....
Please help
I am trying to parse HTML data and retrive the contents. I am facing a problem which I have explained below.
I have imported HTMLParser class and using the handle_data function. The issue here is the '<' and '>' data which is represented as &le and &ge is getting stripped off.
For eg: if the html representation is like <This> is an example which will read as <This> is an example . When I parse it, I am getting the value only as This is an example.
ie... '<' and '>' got stripped off....
Please help
Comment