I have an endless number of paragraphs of data, of which only 10% are to be kept and the rest discarded. Each "entry", i.e. paragraph, has the format shown in the code below.
I guess it should be possible to search for the keyword (or the lack of it), find the previous and next <p...>, and delete the paragraph? Is this possible in a Bash (/bin/bash) script on Linux/Unix?
Hoping for helpful input! Thank you!
Code:
<parameter> text over several lines sometimes containing key word </parameter>
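For what it's worth, the idea above (find the keyword or its absence, locate the surrounding tags, drop the paragraph) could probably be sketched in Bash with awk roughly like this. The function name `filter_paragraphs` is made up for illustration. The sketch assumes that every paragraph starts on a line containing `<parameter` and ends on a line containing `</parameter>`, that paragraphs are not nested, and that the keyword is a plain string that is not split across a line break.

```shell
#!/bin/bash
# Keep only the paragraphs that contain a keyword; drop all the others.
# Assumptions (not confirmed by the post): a paragraph starts on a line
# containing "<parameter" and ends on a line containing "</parameter>",
# paragraphs are never nested, and the keyword is a fixed string.
# filter_paragraphs is a hypothetical helper name.
filter_paragraphs() {
    local keyword=$1
    shift   # remaining arguments are input files; none means read stdin
    awk -v kw="$keyword" -v open_tag="<parameter" -v close_tag="</parameter>" '
        # An opening tag starts a fresh buffer.
        !inside && index($0, open_tag) { inside = 1; buf = "" }
        inside {
            buf = buf $0 "\n"
            # At the closing tag, print the buffer only if the keyword is in it.
            if (index($0, close_tag)) {
                if (index(buf, kw)) printf "%s", buf
                inside = 0
            }
        }
        # Lines outside any paragraph are not printed at all.
    ' "$@"
}
```

It could be called as `filter_paragraphs "keyword" data.txt > kept.txt`, where `data.txt` and `kept.txt` are placeholder file names. Writing to a new file instead of editing in place leaves the original data untouched if the tags turn out to differ from what is assumed here.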