php and Googlebot

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • somaboy mx

    php and Googlebot

    I've created a php page which is optimized for search engine indexation: no
    images, tables or css, just plain html with relevant meta tags etc.

    The page contains a list records pulled from a database, and for each record
    there is a link to the detail view for that record in this form: <a
    href="<?=$_SERV ER['PHP_SELF'] ?>?rec=<?=$reco rdId ?>">

    I've made sure the urlstring doesn't contain a variable called 'id' or
    something, since I heard some bots will assume that is a session id and not
    follow the link.

    From my server stats I can see that the "master view" page is being indexed
    fine by Google, but the bot apparently doesn't follow the links to the
    detail pages.

    Does anyone have any idea why?


    ..soma


  • Yttrium

    #2
    Re: php and Googlebot


    "somaboy mx" <nosuch@fakemai l.fk> a écrit dans le message de news:
    42c3d3a1$1_2@x-privat.org...

    [color=blue]
    > From my server stats I can see that the "master view" page is being[/color]
    indexed[color=blue]
    > fine by Google, but the bot apparently doesn't follow the links to the
    > detail pages.
    >
    > Does anyone have any idea why?[/color]

    Be patient...


    Comment

    • ECRIA Public Mail Buffer

      #3
      Re: php and Googlebot

      Bots don't like dynamic URLs.

      ECRIA



      Comment

      • Alvaro G Vicario

        #4
        Re: php and Googlebot

        *** ECRIA Public Mail Buffer wrote/escribió (Thu, 30 Jun 2005 11:54:05
        -0400):[color=blue]
        > Bots don't like dynamic URLs.[/color]

        Since nowadays most interesting info comes from dynamic URLs, that doesn't
        say much about bots.


        --
        -- Álvaro G. Vicario - Burgos, Spain
        -- http://bits.demogracia.com - Mi sitio sobre programación web
        -- Don't e-mail me your questions, post them to the group
        --

        Comment

        • defaultuserbr@yahoo.com

          #5
          Re: php and Googlebot



          Alvaro G Vicario wrote:[color=blue]
          > *** ECRIA Public Mail Buffer wrote/escribió (Thu, 30 Jun 2005 11:54:05
          > -0400):[color=green]
          > > Bots don't like dynamic URLs.[/color]
          >
          > Since nowadays most interesting info comes from dynamic URLs, that doesn't
          > say much about bots.[/color]


          What would you have the bots do, fill out forms, click buttons? Even if
          they could, would you want a bunch webcrawler bots firing your cgis and
          PHP stuff all the time?




          Brian

          Comment

          • juglesh

            #6
            Re: php and Googlebot


            ECRIA Public Mail Buffer wrote:[color=blue]
            > Bots don't like dynamic URLs.[/color]

            They dont seem to mind mine.

            Comment

            • www.douglassdavis.com

              #7
              Re: php and Googlebot



              defaultuserbr@y ahoo.com wrote:[color=blue]
              > Alvaro G Vicario wrote:[color=green]
              > > *** ECRIA Public Mail Buffer wrote/escribió (Thu, 30 Jun 2005 11:54:05
              > > -0400):[color=darkred]
              > > > Bots don't like dynamic URLs.[/color]
              > >
              > > Since nowadays most interesting info comes from dynamic URLs, that doesn't
              > > say much about bots.[/color]
              >
              >
              > What would you have the bots do, fill out forms, click buttons? Even if
              > they could, would you want a bunch webcrawler bots firing your cgis and
              > PHP stuff all the time?[/color]

              I would... And I wouldn't mind them firing my cgis and PHP stuff a
              few times a day. Wouldn't really hurt anything.

              And they don't have to fill out forms most of the time, just follow the
              given links. As long as there are links from the home page.

              --
              Contact Us Script:


              Comment

              • Yttrium

                #8
                Re: php and Googlebot


                "ECRIA Public Mail Buffer" <ngsremovemail@ ecria.com> a écrit dans le message
                de news: da14im$b5k$1@mu rdoch.acc.Virgi nia.EDU...[color=blue]
                > Bots don't like dynamic URLs.
                >[/color]
                Completly wrong


                Comment

                • somaboy mx

                  #9
                  Re: php and Googlebot

                  "ECRIA Public Mail Buffer" <ngsremovemail@ ecria.com> wrote in message
                  news:da14im$b5k $1@murdoch.acc. Virginia.EDU...[color=blue]
                  > Bots don't like dynamic URLs.[/color]

                  I'd be surprised ifthat were the case. I've had pages with a couple of
                  variables in the urlstring which got excellent indexing.

                  This said, I believe there might be some benefit in "clean" url's, if only
                  for human readability.


                  ..soma


                  Comment

                  • ECRIA Public Mail Buffer

                    #10
                    Re: php and Googlebot

                    Google for one generally only indexes dynamic URLs for news sites and
                    forums. The reason is simple: Dynamic content changes unpredictably and
                    cannot be indexed reliably without additional credible information about the
                    content.

                    This is a discussion-based newsgroup, so disagree if you must. Feel free to
                    give us an example of your dynamic non-forum/news/blog URL that appears in
                    web search results. Doing so is a much better way to make your point than
                    anonymously posting "You don't know what you're talking about".

                    We have been in the Design/SEO business for years, and we know exactly what
                    we are talking about.

                    ECRIA



                    Comment

                    • Daniel Tryba

                      #11
                      Re: php and Googlebot

                      ECRIA Public Mail Buffer <ngsremovemail@ ecria.com> wrote:[color=blue]
                      > Google for one generally only indexes dynamic URLs for news sites and
                      > forums. The reason is simple: Dynamic content changes unpredictably and
                      > cannot be indexed reliably without additional credible information about the
                      > content.
                      >
                      > We have been in the Design/SEO business for years, and we know exactly what
                      > we are talking about.[/color]

                      Then you should know that there is no way to determine if an URL is
                      dynamically generated or not other than parsing it's contents. If a
                      spider/indexer decides to mark something as dynamic based on the form of
                      an URL the developers should be fired at once.

                      Comment

                      • Daniel Tryba

                        #12
                        Re: php and Googlebot

                        ECRIA Public Mail Buffer <ngsremovemail@ ecria.com> wrote:[color=blue]
                        > Google for one generally only indexes dynamic URLs for news sites and
                        > forums. The reason is simple: Dynamic content changes unpredictably and
                        > cannot be indexed reliably without additional credible information about the
                        > content.
                        >
                        > We have been in the Design/SEO business for years, and we know exactly what
                        > we are talking about.[/color]

                        Then you should know that there is no way to determine if an URL is
                        dynamically generated or not other than parsing it's contents. And even
                        that doesn't mean anything (an unchanged page can still be dynamically
                        generated and a changed page could be manually updated).

                        If a spider/indexer decides to mark something as dynamic based on the
                        form of an URL the developers should be fired at once.

                        Comment

                        • ECRIA Public Mail Buffer

                          #13
                          Re: php and Googlebot

                          " Then you should know that there is no way to determine if an URL is
                          dynamically generated or not other than parsing it's contents. And even that
                          doesn't mean anything (an unchanged page can still be dynamically generated
                          and a changed page could be manually updated)."

                          Agreed. Furthermore, a static page URL may actually be a dynamically
                          generated page. For example, there is not a single HTML file on
                          http://www.ecria.com - but HTML is all anyone will see.

                          However, the point is that if there are variables in a URL, robots assume
                          that the page is generated dynamically - which is a fair assumption. We're
                          not condoning this - it's just what happens.

                          There are ways to get around it, but they don't change the fact that this is
                          the way robots work.

                          Other than a blog/forum/news pages, how many web sites containing URL
                          variables show up in, say, a Google search?

                          See what I mean?

                          ECRIA



                          Comment

                          • Daniel Tryba

                            #14
                            Re: php and Googlebot

                            ECRIA Public Mail Buffer <ngsremovemail@ ecria.com> wrote:[color=blue]
                            > However, the point is that if there are variables in a URL, robots assume
                            > that the page is generated dynamically - which is a fair assumption.[/color]

                            Bad assumption, they could be used clientside.
                            [color=blue]
                            > There are ways to get around it, but they don't change the fact that this is
                            > the way robots work.[/color]

                            That's why I say they should be fired. They are to lazy to fix bad
                            assumptions and even go out of their way to insert this kind of silly
                            behavior.
                            [color=blue]
                            > Other than a blog/forum/news pages, how many web sites containing URL
                            > variables show up in, say, a Google search?
                            >
                            > See what I mean?[/color]

                            No, all pages of eg http://www.amsterdamchinafestival.nl/ appear to be
                            listed in Google. All dynamically generated by hiddeous URL without
                            trying to hide it's dynamic and it's no blog, form or news page. Even if
                            it was how would a spider know it's on of the "special" sites?

                            Comment

                            • ECRIA Public Mail Buffer

                              #15
                              Re: php and Googlebot

                              There's a difference between being LISTED and being RANKED -
                              http://www.amsterdamchinafestival.nl/ does not show up in Google results for
                              "Amsterdam china festival" - it's own title!

                              How does Google know it's a news/blog site? Beats me - ask them. They know.
                              Probably something to do with the Google Groups technology.

                              I think that will have to be my final word... this discussion is not going
                              anywhere.

                              ECRIA



                              Comment

                              Working...