URL Component

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • bgold12

    URL Component

    Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
    followed by the hostname or domain name (i.e. en.wikipedia.or g/)
    followed by possible subfolders (i.e. /wiki/) which can then be
    followed by the name of a file to send the request to (i.e. index.html
    or index.php or something). There can also be GET data (i.e. ?
    name=value) and the anchor thing (i.e. #Chapter_1). But also,
    frequently there's just a string that finishes the URL, which doesn't
    appear to be a folder (no trailing forward slash) or a file (no
    extension), for example:



    Can someone explain to me what this string (i.e.
    "Uniform_Resour ce_Locator" in the above example) is, and how you can
    use php to get it?

    Thanks.
  • viza

    #2
    Re: URL Component

    Hi

    On Wed, 15 Oct 2008 09:03:10 -0700, bgold12 wrote:
    Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
    followed by the hostname or domain name (i.e. en.wikipedia.or g/)
    followed by possible subfolders (i.e. /wiki/) which can then be followed
    by the name of a file to send the request to (i.e. index.html or
    index.php or something). There can also be GET data (i.e. ? name=value)
    and the anchor thing (i.e. #Chapter_1). But also, frequently there's
    just a string that finishes the URL, which doesn't appear to be a folder
    (no trailing forward slash) or a file (no extension), for example:
    >
    http://en.wikipedia.org/wiki/Uniform_Resource_Locator
    1) Files do not have to have have "extensions " on modern systems.

    2) Web addresses do not have to refer to files.



    is a valid URL even on some systems where foo...bar is not a valid
    filename.

    Moreover the part after the third slash and before the final slash does
    not have to refer to a directory, although if relative URLs contain ../
    then it they are treated as if it does.

    PS: you duplicated the third slash - it is not part of the domain.
    and how you can use php to get it?
    There are all sorts of ways to open a remote resource in php. IIRC you
    can just do:

    $foo= file( 'http://en.wikipedia.or g/wiki/Uniform_Resourc e_Locator' );

    HTH
    viza

    Comment

    • =?ISO-8859-1?Q?=22=C1lvaro_G=2E_Vicario=22?=

      #3
      Re: URL Component

      bgold12 escribió:
      Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
      followed by the hostname or domain name (i.e. en.wikipedia.or g/)
      followed by possible subfolders (i.e. /wiki/) which can then be
      followed by the name of a file to send the request to (i.e. index.html
      or index.php or something).
      URLs don't need to match physical files and directories on the server
      (and often don't). A simple example is when you omit "index.html " in the
      URL and get the "index.html " file anyway. But you can configure your
      webserver to map any URL to any resource.

      So distinguishing between files and directories is normally meaningless.

      Can someone explain to me what this string (i.e.
      "Uniform_Resour ce_Locator" in the above example) is, and how you can
      use php to get it?
      The short answer is that you don't use PHP for this. Instead, you
      configure your web server to run (e.g.)
      /home/foo/fetch-article.php?que ry=Uniform_Reso urce_Locator and read
      input from good old $_GET['query']. The typical option is Apache's
      mod_rewrite:



      You might also enjoy:






      --
      -- http://alvaro.es - Álvaro G. Vicario - Burgos, Spain
      -- Mi sitio sobre programación web: http://bits.demogracia.com
      -- Mi web de humor al baño María: http://www.demogracia.com
      --

      Comment

      Working...