That is how to extract the word content in the html file and write it into a word file, is there a built in command where you can filter out all the code part, thanks