Unrecognized file format prolem with valid html, please help!

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • Jeff Parker

    Unrecognized file format prolem with valid html, please help!

    I have a web application that for the real estate industry. Here is
    one of the sites using said application.

    Friendly and helpful customer support that goes above and beyond. We help you get the perfect domain name.


    As you can see if you click this link here


    This site validates just fine using the w3 validator

    the problem that i have is that google does not recognise the file
    format of this site

    check this link here



    and this link here



    I am not sure why this page which validates just fine with the w3
    validator is not reconised and spidered properly by google

    as you can imagine my clients are less than happy about this and I am
    at a loss for what to do about it.

    If anyone has any ideas for me they would be much appreciated.

    thank you

    Jeff Parker
  • C A Upsdell

    #2
    Re: Unrecognized file format prolem with valid html, please help!

    "Jeff Parker" <puffindown@hot mail.com> wrote in message
    news:329ca6e3.0 411091755.715b2 979@posting.goo gle.com...[color=blue]
    >I have a web application that for the real estate industry. Here is
    > one of the sites using said application.
    >
    > http://www.wellsre.com[/color]

    You have the HTML tag on the same line as the DOCTYPE. Most unusual. Try
    putting them on separate lines.


    Comment

    • Lars Eighner

      #3
      Re: Unrecognized file format prolem with valid html, please help!

      In our last episode,
      <329ca6e3.04110 91755.715b2979@ posting.google. com>,
      the lovely and talented Jeff Parker
      broadcast on comp.infosystem s.www.authoring.html:
      [color=blue]
      > I have a web application that for the real estate industry. Here is
      > one of the sites using said application.[/color]
      [color=blue]
      > http://www.wellsre.com[/color]

      Where is the rest of it? That is, what is the actual filename
      and why is the trailing slash missing?

      Adding a meta http-equiv with the content type might help. Are
      you certain the server is sending the correct content type for
      this file?

      --
      Lars Eighner -finger for geek code- eighner@io.com http://www.io.com/~eighner/
      If it wasn't for muscle spasms, I wouldn't get any exercise at all.

      Comment

      • Leif K-Brooks

        #4
        Re: Unrecognized file format prolem with valid html, please help!

        Jeff Parker wrote:[color=blue]
        > I have a web application that for the real estate industry. Here is
        > one of the sites using said application.
        >
        > http://www.wellsre.com
        >
        > the problem that i have is that google does not recognise the file
        > format of this site[/color]

        The "HTML" in your content-type is all-caps. Try fixing that.

        [leif@localhost leif]$ HEAD http://www.wellsre.com
        200 OK
        Cache-Control: no-cache
        Connection: close
        Date: Wed, 10 Nov 2004 04:22:16 GMT
        Server: Microsoft-IIS/5.0
        Content-Length: 20689
        Content-Type: text/HTML; Charset=ISO-8859-1
        Client-Date: Wed, 10 Nov 2004 04:25:47 GMT
        Client-Peer: 66.232.22.13:80
        Client-Response-Num: 1
        Set-Cookie: ASPSESSIONIDQCA SRDCB=PJBHCLKCP EEHHDOKJBIIIDPI ; path=/
        X-Powered-By: ASP.NET

        Comment

        • Lachlan Hunt

          #5
          Re: Unrecognized file format prolem with valid html, please help!

          Jeff Parker wrote:[color=blue]
          > I have a web application that for the real estate industry. Here is
          > one of the sites using said application.
          >
          > http://www.wellsre.com
          >
          > the problem that i have is that google does not recognise the file
          > format of this site[/color]

          The HTTP response headers [1] contain:

          Content-Type: text/HTML; Charset=ISO-8859-1

          I suspect that may be the problem. I've never seen the content type
          fields written in uppercase, they're usually written in lowercase. I
          don't know if it's invalid or not to have it in uppercase (according to
          the relevant RFCs: RFC 2616 (HTTP1.1), 2045 (MIME) or 2046 (Media
          Types)), but perhaps google doesn't recognise it like that. Fix your
          server to send:

          Content-Type: text/html; charset=ISO-8859-1


          Also, even though it is valid HTML, you should look into replacing all
          those layout tables and presentational elements/attributes with CSS, and
          use a DOCTYPE that doesn't trigger quirks mode [2] in browsers. You
          should also use <p> instead of <br><br> to create seperate paragraphs.

          eg. Write this:
          <p>paragraph 1 ...</p>
          </p>paragraph 2 ...</p>

          instead of:
          paragraph 1 ...
          <br><br>
          paragraph 2 ...

          [1] http://cgi.w3.org/cgi-bin/headers?ur...w.wellsre.com/
          [2] http://www.mozilla.org/docs/web-deve.../doctypes.html

          --
          Lachlan Hunt

          http://GetFirefox.com/ Rediscover the Web
          http://SpreadFirefox.com/ Igniting the Web

          Comment

          • Neal

            #6
            Re: Unrecognized file format prolem with valid html, please help!

            On 9 Nov 2004 17:55:41 -0800, Jeff Parker <puffindown@hot mail.com> wrote:
            [color=blue]
            > I have a web application that for the real estate industry. Here is
            > one of the sites using said application.
            >
            > http://www.wellsre.com
            >
            > As you can see if you click this link here
            > http://validator.w3.org/check?uri=ht...ww.wellsre.com
            >
            > This site validates just fine using the w3 validator
            >
            > the problem that i have is that google does not recognise the file
            > format of this site
            >
            > check this link here
            > http://www.google.com/search?sourcei...TF-8&q=wellsre[/color]

            See http://www.google.com/search?q=%22We...wellsre.com%22

            Comment

            • Neal

              #7
              Re: Unrecognized file format prolem with valid html, please help!

              On Tue, 09 Nov 2004 23:36:14 -0500, Neal <neal413@yahoo. com> wrote:
              [color=blue]
              > On 9 Nov 2004 17:55:41 -0800, Jeff Parker <puffindown@hot mail.com> wrote:
              >[color=green]
              >> I have a web application that for the real estate industry. Here is
              >> one of the sites using said application.
              >>
              >> http://www.wellsre.com
              >>
              >> As you can see if you click this link here
              >> http://validator.w3.org/check?uri=ht...ww.wellsre.com
              >>
              >> This site validates just fine using the w3 validator
              >>
              >> the problem that i have is that google does not recognise the file
              >> format of this site[/color][/color]

              Oh, never mind, I see now.

              What is the file format? You never told us.

              Comment

              • Mark Parnell

                #8
                Re: Unrecognized file format prolem with valid html, please help!

                On Wed, 10 Nov 2004 04:31:26 GMT, Lachlan Hunt <spam.my.gspot@ gmail.com>
                declared in comp.infosystem s.www.authoring.html:
                [color=blue]
                > <p>paragraph 1 ...</p>
                > </p>paragraph 2 ...</p>[/color]

                That would be:

                <p>paragraph 1 ...</p>
                <p>paragraph 2 ...</p>

                --
                Mark Parnell

                Comment

                • Neal

                  #9
                  Re: Unrecognized file format prolem with valid html, please help!

                  On 9 Nov 2004 17:55:41 -0800, Jeff Parker <puffindown@hot mail.com> wrote:
                  [color=blue]
                  > http://www.wellsre.com[/color]

                  Possibly unrelated but worth mentioning - in Opera 7.23 the page appears
                  two times if I reload. One below the other.

                  Bizarre.

                  Comment

                  • Stan Brown

                    #10
                    Re: Unrecognized file format prolem with valid html, please help!

                    "Lars Eighner" <eighner@io.com > wrote in
                    comp.infosystem s.www.authoring.html:[color=blue]
                    ><329ca6e3.0411 091755.715b2979 @posting.google .com> Jeff Parker:[/color]
                    [color=blue][color=green]
                    >> http://www.wellsre.com[/color]
                    >
                    >Where is the rest of it? That is, what is the actual filename[/color]

                    You and your browser don't need to know that.
                    [color=blue]
                    >and why is the trailing slash missing?[/color]

                    It isn't.


                    P.S. I'm a big fan of proper attributions, but four lines does seem
                    like a superabundance of riches.

                    --
                    Stan Brown, Oak Road Systems, Tompkins County, New York, USA
                    Dragon222 adalah situs slot gacor terbaru yang selalu memberikan banyak bonus menarik dan kemenangan JP untuk pemain setia selama bermain di link slot DRAGON222.

                    HTML 4.01 spec: http://www.w3.org/TR/html401/
                    validator: http://validator.w3.org/
                    CSS 2.1 spec: http://www.w3.org/TR/CSS21/
                    validator: http://jigsaw.w3.org/css-validator/

                    Comment

                    • Lars Eighner

                      #11
                      Re: Unrecognized file format prolem with valid html, please help!

                      In our last episode, <MPG.1bfc4c137d 7fd0f698ccff@ne ws.odyssey.net> ,
                      the lovely and talented Stan Brown broadcast on
                      comp.infosystem s.www.authoring.html:
                      [color=blue]
                      > "Lars Eighner" <eighner@io.com > wrote in
                      > comp.infosystem s.www.authoring.html:[/color]
                      [color=blue][color=green]
                      >><329ca6e3.041 1091755.715b297 9@posting.googl e.com> Jeff Parker:[/color][/color]
                      [color=blue][color=green][color=darkred]
                      >>> http://www.wellsre.com[/color]
                      >>
                      >>Where is the rest of it? That is, what is the actual filename[/color][/color]
                      [color=blue]
                      > You and your browser don't need to know that.[/color]

                      I'm not the one begging for help here.
                      [color=blue][color=green]
                      >>and why is the trailing slash missing?[/color][/color]
                      [color=blue]
                      > It isn't.[/color]

                      Oh, it is ont of those *invisible* trailing slashes.
                      [color=blue]
                      > P.S. I'm a big fan of proper attributions, but four lines does seem
                      > like a superabundance of riches.[/color]

                      --
                      Lars Eighner -finger for geek code- eighner@io.com http://www.io.com/~eighner/
                      If it wasn't for muscle spasms, I wouldn't get any exercise at all.

                      Comment

                      • Neal

                        #12
                        Re: Unrecognized file format prolem with valid html, please help!

                        Lars Eighner wrote:[color=blue]
                        > Stan Brown broadcast[color=green]
                        >> It isn't.[/color]
                        > Oh, it is ont of those *invisible* trailing slashes.[/color]

                        AFAIK the trailing slash is not needed at the end of a domain. It is at
                        the end of a directory.

                        Comment

                        • Brian

                          #13
                          Re: Unrecognized file format prolem with valid html, please help!

                          Lars Eighner wrote:[color=blue]
                          > Jeff Parker :
                          >[color=green]
                          >> I have a web application that for the real estate industry.
                          >>
                          >> http://www.wellsre.com[/color]
                          >
                          >
                          > Where is the rest of it? That is, what is the actual filename[/color]

                          There is no filename on the client end, only a url and a resource,
                          hopefully with a mime type.
                          [color=blue]
                          > and why is the trailing slash missing?[/color]

                          The trailing slash on that url is optional.

                          --
                          Brian (remove "invalid" to email me)

                          Comment

                          • Lars Eighner

                            #14
                            Re: Unrecognized file format prolem with valid html, please help!

                            In our last episode,
                            <opsg9t6goq6v66 56@news.individ ual.net>,
                            the lovely and talented Neal
                            broadcast on comp.infosystem s.www.authoring.html:
                            [color=blue]
                            > Lars Eighner wrote:[color=green]
                            >> Stan Brown broadcast[color=darkred]
                            >>> It isn't.[/color]
                            >> Oh, it is ont of those *invisible* trailing slashes.[/color][/color]
                            [color=blue]
                            > AFAIK the trailing slash is not needed at the end of a domain. It is at
                            > the end of a directory.[/color]

                            It is my understanding that, at least with some combinations of
                            browsers and servers, an extra http transaction is required if
                            the trailing slash is omitted. Moreover, from googling on
                            trailing slash domain, I find several reports of google handling
                            sites somewhat differently according to whether the trailing slash
                            is included.

                            The question isn't whether your browser or my browser can get the
                            page. Obviously most - if not all - modern browsers can bring up the
                            page by hook or by crook. The question was about some apparently
                            mysterious google behavior, but whether a quirk in google's spider or
                            in google's subsequent processing is involved I don't know.

                            --
                            Lars Eighner -finger for geek code- eighner@io.com http://www.io.com/~eighner/
                            If it wasn't for muscle spasms, I wouldn't get any exercise at all.

                            Comment

                            • Neal

                              #15
                              Re: Unrecognized file format prolem with valid html, please help!

                              On Wed, 10 Nov 2004 17:25:57 -0600, Lars Eighner <eighner@io.com > wrote:
                              [color=blue]
                              > The question isn't whether your browser or my browser can get the
                              > page. Obviously most - if not all - modern browsers can bring up the
                              > page by hook or by crook. The question was about some apparently
                              > mysterious google behavior, but whether a quirk in google's spider or
                              > in google's subsequent processing is involved I don't know.[/color]

                              It shouldn't be related to the slash. Likely his filetype is being
                              mis-served or is otherwise screwed up.

                              Comment

                              Working...