Reproducing a web page and add own content to it.

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • LaundroMat

    Reproducing a web page and add own content to it.

    Hi -

    I'm working on a Django powered site where one of the required
    functionalities is the possibility of displaying the content of
    external pages, with an extra banner at the top where specific
    information is displayed. In other words, I'm looking for a way to
    reproduce an existing web page and add some HTML code to it. (I can't
    think of an example right now, but the idea is similar to sites that
    let you see an external page and have some site-specific text above it
    (often stating that the content below is not part of the site the user
    comes from)).

    To test this, I've been downloading an external page, adding some text
    to it and re-opening it in a browser (with the help of built-in
    modules such as urllib2 etc). This works of course, but the external
    page's links such as <img src="hello.png" >, or <a href="help.html ">
    are evidently no longer correct.

    Apart from parsing the whole file and trying to inject the external
    site's domain in links such as the above (with the added inconvenience
    of having to store the external page locally), is there an easier way
    of accomplishing what I want?

    Thanks,

    Mathieu

  • Diez B. Roggisch

    #2
    Re: Reproducing a web page and add own content to it.

    LaundroMat wrote:
    Hi -
    >
    I'm working on a Django powered site where one of the required
    functionalities is the possibility of displaying the content of
    external pages, with an extra banner at the top where specific
    information is displayed. In other words, I'm looking for a way to
    reproduce an existing web page and add some HTML code to it. (I can't
    think of an example right now, but the idea is similar to sites that
    let you see an external page and have some site-specific text above it
    (often stating that the content below is not part of the site the user
    comes from)).
    >
    To test this, I've been downloading an external page, adding some text
    to it and re-opening it in a browser (with the help of built-in
    modules such as urllib2 etc). This works of course, but the external
    page's links such as <img src="hello.png" >, or <a href="help.html ">
    are evidently no longer correct.
    >
    Apart from parsing the whole file and trying to inject the external
    site's domain in links such as the above (with the added inconvenience
    of having to store the external page locally), is there an easier way
    of accomplishing what I want?
    Using a frame?

    Diez

    Comment

    • LaundroMat

      #3
      Re: Reproducing a web page and add own content to it.

      On Apr 8, 2:04 pm, "Diez B. Roggisch" <de...@nospam.w eb.dewrote:
      LaundroMat wrote:
      Hi -
      >
      I'm working on a Django powered site where one of the required
      functionalities is the possibility of displaying the content of
      external pages, with an extra banner at the top where specific
      information is displayed. In other words, I'm looking for a way to
      reproduce an existing web page and add some HTML code to it. (I can't
      think of an example right now, but the idea is similar to sites that
      let you see an external page and have some site-specific text above it
      (often stating that the content below is not part of the site the user
      comes from)).
      >
      To test this, I've been downloading an external page, adding some text
      to it and re-opening it in a browser (with the help of built-in
      modules such as urllib2 etc). This works of course, but the external
      page's links such as <img src="hello.png" >, or <a href="help.html ">
      are evidently no longer correct.
      >
      Apart from parsing the whole file and trying to inject the external
      site's domain in links such as the above (with the added inconvenience
      of having to store the external page locally), is there an easier way
      of accomplishing what I want?
      >
      Using a frame?
      >
      Diez
      Ack. I was too focused on importing the external web page and
      redisplaying the information (I've just been reading up on
      BeautifulSoup) instead of looking for an HTML based approach.

      Thanks!

      Comment

      • Steve Holden

        #4
        Re: Reproducing a web page and add own content to it.

        LaundroMat wrote:
        On Apr 8, 2:04 pm, "Diez B. Roggisch" <de...@nospam.w eb.dewrote:
        >LaundroMat wrote:
        >>Hi -
        >>I'm working on a Django powered site where one of the required
        >>functionaliti es is the possibility of displaying the content of
        >>external pages, with an extra banner at the top where specific
        >>information is displayed. In other words, I'm looking for a way to
        >>reproduce an existing web page and add some HTML code to it. (I can't
        >>think of an example right now, but the idea is similar to sites that
        >>let you see an external page and have some site-specific text above it
        >>(often stating that the content below is not part of the site the user
        >>comes from)).
        >>To test this, I've been downloading an external page, adding some text
        >>to it and re-opening it in a browser (with the help of built-in
        >>modules such as urllib2 etc). This works of course, but the external
        >>page's links such as <img src="hello.png" >, or <a href="help.html ">
        >>are evidently no longer correct.
        >>Apart from parsing the whole file and trying to inject the external
        >>site's domain in links such as the above (with the added inconvenience
        >>of having to store the external page locally), is there an easier way
        >>of accomplishing what I want?
        >Using a frame?
        >>
        >Diez
        >
        Ack. I was too focused on importing the external web page and
        redisplaying the information (I've just been reading up on
        BeautifulSoup) instead of looking for an HTML based approach.
        >
        Thanks!
        You could also look at adding a <basetag to your generated page's
        <headsection.

        regards
        Steve
        --
        Steve Holden +1 571 484 6266 +1 800 494 3119
        Holden Web LLC http://www.holdenweb.com/

        Comment

        • LaundroMat

          #5
          Re: Reproducing a web page and add own content to it.

          On Apr 8, 4:11 pm, Steve Holden <st...@holdenwe b.comwrote:
          LaundroMat wrote:
          On Apr 8, 2:04 pm, "Diez B. Roggisch" <de...@nospam.w eb.dewrote:
          LaundroMat wrote:
          >Hi -
          >I'm working on a Django powered site where one of the required
          >functionalitie s is the possibility of displaying the content of
          >external pages, with an extra banner at the top where specific
          >information is displayed. In other words, I'm looking for a way to
          >reproduce an existing web page and add some HTML code to it. (I can't
          >think of an example right now, but the idea is similar to sites that
          >let you see an external page and have some site-specific text above it
          >(often stating that the content below is not part of the site the user
          >comes from)).
          >To test this, I've been downloading an external page, adding some text
          >to it and re-opening it in a browser (with the help of built-in
          >modules such as urllib2 etc). This works of course, but the external
          >page's links such as <img src="hello.png" >, or <a href="help.html ">
          >are evidently no longer correct.
          >Apart from parsing the whole file and trying to inject the external
          >site's domain in links such as the above (with the added inconvenience
          >of having to store the external page locally), is there an easier way
          >of accomplishing what I want?
          Using a frame?
          >
          Diez
          >
          Ack. I was too focused on importing the external web page and
          redisplaying the information (I've just been reading up on
          BeautifulSoup) instead of looking for an HTML based approach.
          >
          Thanks!
          >
          You could also look at adding a <basetag to your generated page's
          <headsection.
          >
          regards
          Steve
          --
          Steve Holden +1 571 484 6266 +1 800 494 3119
          Holden Web LLC http://www.holdenweb.com/
          True, but I suppose that users would no longer see the top banner
          added by me when they click on one of the links on the external site's
          page. I'm a bit hesitant about using frames however, but reading up on
          them makes me think the application I have in mind for them might be
          the generally accepted exception to the rule that frames are bad :)

          Anyway. Thanks for the help!

          Comment

          Working...