URL Component

**viza** · Oct 15 '08, 04:25 PM

Re: URL Component

Hi

On Wed, 15 Oct 2008 09:03:10 -0700, bgold12 wrote:

Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
followed by the hostname or domain name (i.e. en.wikipedia.or g/)
followed by possible subfolders (i.e. /wiki/) which can then be followed
by the name of a file to send the request to (i.e. index.html or
index.php or something). There can also be GET data (i.e. ? name=value)
and the anchor thing (i.e. #Chapter_1). But also, frequently there's
just a string that finishes the URL, which doesn't appear to be a folder
(no trailing forward slash) or a file (no extension), for example:
>
http://en.wikipedia.org/wiki/Uniform_Resource_Locator

1) Files do not have to have have "extensions " on modern systems.

2) Web addresses do not have to refer to files.

Example Domain

http://example.org/fooo...barrr

is a valid URL even on some systems where foo...bar is not a valid
filename.

Moreover the part after the third slash and before the final slash does
not have to refer to a directory, although if relative URLs contain ../
then it they are treated as if it does.

PS: you duplicated the third slash - it is not part of the domain.

and how you can use php to get it?

There are all sorts of ways to open a remote resource in php. IIRC you
can just do:

$foo= file( 'http://en.wikipedia.or g/wiki/Uniform_Resourc e_Locator' );

HTH
viza

**=?ISO-8859-1?Q?=22=C1lvaro_G=2E_Vicario=22?=** · Oct 15 '08, 04:25 PM

Re: URL Component

bgold12 escribió:

Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
followed by the hostname or domain name (i.e. en.wikipedia.or g/)
followed by possible subfolders (i.e. /wiki/) which can then be
followed by the name of a file to send the request to (i.e. index.html
or index.php or something).

URLs don't need to match physical files and directories on the server
(and often don't). A simple example is when you omit "index.html " in the
URL and get the "index.html " file anyway. But you can configure your
webserver to map any URL to any resource.

So distinguishing between files and directories is normally meaningless.

Can someone explain to me what this string (i.e.
"Uniform_Resour ce_Locator" in the above example) is, and how you can
use php to get it?

The short answer is that you don't use PHP for this. Instead, you
configure your web server to run (e.g.)
/home/foo/fetch-article.php?que ry=Uniform_Reso urce_Locator and read
input from good old $_GET['query']. The typical option is Apache's
mod_rewrite:

Apache mod_rewrite - Apache HTTP Server Version 2.2

http://httpd.apache.org/docs/2.2/en/rewrite/

URL Component

URL Component

Comment

Comment