How to put HTML content with the same tag on one line using sed

  • amskape
    New Member
    • Apr 2010
    • 56

    How to put HTML content with the same tag on one line using sed

    I am looking to modify an HTML file so that it is easier to parse. I need each element that comes after the <body> tag to be on its own line.

    For example, my current HTML file is:

    Code:
    <?xml version="1.0"?>
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
    <html lang="en" xml:lang="en" xmlns="http://www.w3.org/1999/xhtml">
      <head>
        <meta content="text/html; charset=utf-8" http-equiv="Content-type" />
        <meta name="ncc:files" content="78" />
      </head>
      <body>
        <h1 class="title" id="h1"><a href="001.smil#txt4">ABOUT DAISY</a></h1>
        <h1 class="section" id="h7">
          <a href="002.smil#txt10">Cover</a>
        </h1>
        <span class="page-normal" id="p13">
          <a href="002.smil#txt15">1</a>
        </span>
        <h1 class="section" id="h18">
          <a href="003.smil#txt21">Swadesaabhimaani, K. Kelappan, Muhammad Abdul Rahiman</a>
        </h1>
        <span class="page-normal" id="p24">
          <a href="003.smil#txt26">2</a>
        </span>
        <span class="page-normal" id="p33">
          <a href="003.smil#txt35">3</a>
        </span>
        <h1 class="section" id="h38">
          <a href="004.smil#txt41">Title</a>
        </h1>
        <span class="page-normal" id="p45">
          <a href="004.smil#txt47">4</a>
        </span>
        <h1 class="section" id="h50">
          <a href="005.smil#txt53">Publication</a>
        </h1>
        <span class="page-normal" id="p69">
          <a href="005.smil#txt71">5</a>
        </span>
        <h1 class="section" id="h74">
          <a href="006.smil#txt77">K. Ramakrishnapilla</a>
        </h1>
      </body>
    </html>

    In the required HTML, nothing after the <body> tag should be split across lines.

    That is, each opening tag, its content, and its closing tag must be on the same line. Please advise how this can be done with sed.

    Hoping for a quick reply.

    Thanks

    Anes
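
One possible way to do the joining described above, as a minimal sketch rather than a definitive answer: inside the <body> ... </body> range, whenever a line opens an <h1> or <span> that is not closed yet, sed keeps appending the next line until the closing tag appears, then removes the line breaks and the indentation around them. The sample input is a cut-down copy of the file in the post, and the names `input` and `result` are made up for the illustration. It assumes the only elements to join are <h1> and <span>, that they are not nested inside each other, and that the sed in use accepts -E (GNU and BSD sed do).

```shell
# Hypothetical sample input, cut down from the file in the post.
input=$(mktemp)
cat > "$input" <<'EOF'
<html>
  <body>
    <h1 class="title" id="h1"><a href="001.smil#txt4">ABOUT DAISY</a></h1>
    <h1 class="section" id="h7">
      <a href="002.smil#txt10">Cover</a>
    </h1>
    <span class="page-normal" id="p13">
      <a href="002.smil#txt15">1</a>
    </span>
  </body>
</html>
EOF

# Between <body> and </body>: if the pattern space opens an <h1> or <span>
# but holds no closing </h1> or </span> yet, append the next line (N) and
# check again. Once the closing tag is there, delete every newline together
# with the whitespace on both sides of it.
result=$(sed -E '/<body>/,/<\/body>/{
  :a
  /<(h1|span)[ >]/{
    /<\/(h1|span)>/!{
      N
      ba
    }
    s/[[:space:]]*\n[[:space:]]*//g
  }
}' "$input")

printf '%s\n' "$result"
rm -f "$input"
```

Elements that already sit on one line, such as the first <h1>, pass through unchanged, and the leading indentation of each joined line is kept. For arbitrary HTML a real parser is usually the safer tool; a sed script like this only suits files with a regular layout like the one shown.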