spelling errors in query terms

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • Joel Rodrigues

    spelling errors in query terms

    Hi,

    I seem to recall once coming across & using functionality in PostgreSQL
    that allowed for some flexibility in the spelling of a query term. For
    example, if one meant to look for 'Honda', but typed in 'Zonda'. There
    was even a 'looseness' factor of sorts available. I've spent a lot of
    time trying to find it in the docs and various articles & tutorials on
    my hard drive, even an hour on Google in vain. It was not
    contrib/fuzzystrmatch.

    Anyone know what I'm talking about ?

    Cheers,
    Joel




    _______________ _______________ _______________ _______________ _________
    Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
    http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France


    ---------------------------(end of broadcast)---------------------------
    TIP 3: if posting/reading through Usenet, please send an appropriate
    subscribe-nomail command to majordomo@postg resql.org so that your
    message can get through to the mailing list cleanly

  • Craig O'Shannessy

    #2
    Re: spelling errors in query terms

    Hi Joel,

    If your using java, and full text, fuzzy searching is an important part of
    your app, you might want to check out Lucene. I evaluated it for a
    project, and it was very good (the project was a fuzzy search of huge
    pages of text).

    Apache Lucene™ is a high-performance, full-featured search engine library written entirely in Java. It is a technology suitable for...



    On Mon, 8 Dec 2003, Joel Rodrigues wrote:
    [color=blue]
    > Hi,
    >
    > I seem to recall once coming across & using functionality in PostgreSQL
    > that allowed for some flexibility in the spelling of a query term. For
    > example, if one meant to look for 'Honda', but typed in 'Zonda'. There
    > was even a 'looseness' factor of sorts available. I've spent a lot of
    > time trying to find it in the docs and various articles & tutorials on
    > my hard drive, even an hour on Google in vain. It was not
    > contrib/fuzzystrmatch.
    >
    > Anyone know what I'm talking about ?
    >
    > Cheers,
    > Joel
    >
    >
    >
    >
    > _______________ _______________ _______________ _______________ _________
    > Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
    > http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
    >
    >
    > ---------------------------(end of broadcast)---------------------------
    > TIP 3: if posting/reading through Usenet, please send an appropriate
    > subscribe-nomail command to majordomo@postg resql.org so that your
    > message can get through to the mailing list cleanly
    >[/color]


    ---------------------------(end of broadcast)---------------------------
    TIP 1: subscribe and unsubscribe commands go to majordomo@postg resql.org

    Comment

    • Oleg Bartunov

      #3
      Re: spelling errors in query terms

      On Mon, 8 Dec 2003, Joel Rodrigues wrote:
      [color=blue]
      > Hi,
      >
      > I seem to recall once coming across & using functionality in PostgreSQL
      > that allowed for some flexibility in the spelling of a query term. For
      > example, if one meant to look for 'Honda', but typed in 'Zonda'. There
      > was even a 'looseness' factor of sorts available. I've spent a lot of
      > time trying to find it in the docs and various articles & tutorials on
      > my hard drive, even an hour on Google in vain. It was not
      > contrib/fuzzystrmatch.[/color]

      You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
      which uses trigram.

      [color=blue]
      >
      > Anyone know what I'm talking about ?
      >
      > Cheers,
      > Joel
      >
      >
      >
      >
      > _______________ _______________ _______________ _______________ _________
      > Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
      > http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
      >
      >
      > ---------------------------(end of broadcast)---------------------------
      > TIP 3: if posting/reading through Usenet, please send an appropriate
      > subscribe-nomail command to majordomo@postg resql.org so that your
      > message can get through to the mailing list cleanly
      >[/color]

      Regards,
      Oleg
      _______________ _______________ _______________ _______________ _
      Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
      Sternberg Astronomical Institute, Moscow University (Russia)
      Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
      phone: +007(095)939-16-83, +007(095)939-23-83

      ---------------------------(end of broadcast)---------------------------
      TIP 8: explain analyze is your friend

      Comment

      • Joel Rodrigues

        #4
        Re: spelling errors in query terms

        Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

        Also, there's something wrong with 'trgm.tgz'. This is what happens if I
        try to extract it's contents :

        gzip: stdin is encrypted -- get newer version of gzip
        tar: End of archive volume 1 reached
        tar: Sorry, unable to determine archive format.


        Cheers ! - Joel


        On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:
        [color=blue]
        > On Mon, 8 Dec 2003, Joel Rodrigues wrote:
        >[color=green]
        >> Hi,
        >>
        >> I seem to recall once coming across & using functionality in PostgreSQL
        >> that allowed for some flexibility in the spelling of a query term. For
        >> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
        >> was even a 'looseness' factor of sorts available. I've spent a lot of
        >> time trying to find it in the docs and various articles & tutorials on
        >> my hard drive, even an hour on Google in vain. It was not
        >> contrib/fuzzystrmatch.[/color]
        >
        > You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
        > which uses trigram.[/color]

        _______________ _______________ _______________ _______________ _________
        Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
        http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France


        ---------------------------(end of broadcast)---------------------------
        TIP 5: Have you checked our extensive FAQ?



        Comment

        • Joel Rodrigues

          #5
          Re: spelling errors in query terms

          Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

          Also, there's something wrong with 'trgm.tgz'. This is what happens if I
          try to extract it's contents :

          gzip: stdin is encrypted -- get newer version of gzip
          tar: End of archive volume 1 reached
          tar: Sorry, unable to determine archive format.


          Cheers ! - Joel


          On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:
          [color=blue]
          > On Mon, 8 Dec 2003, Joel Rodrigues wrote:
          >[color=green]
          >> Hi,
          >>
          >> I seem to recall once coming across & using functionality in PostgreSQL
          >> that allowed for some flexibility in the spelling of a query term. For
          >> example, if one meant to look for 'Honda', but typed in 'Zonda'. There
          >> was even a 'looseness' factor of sorts available. I've spent a lot of
          >> time trying to find it in the docs and various articles & tutorials on
          >> my hard drive, even an hour on Google in vain. It was not
          >> contrib/fuzzystrmatch.[/color]
          >
          > You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
          > which uses trigram.[/color]

          _______________ _______________ _______________ _______________ _________
          Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
          http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France


          ---------------------------(end of broadcast)---------------------------
          TIP 2: you can get off all lists at once with the unregister command
          (send "unregister YourEmailAddres sHere" to majordomo@postg resql.org)

          Comment

          • Joel Rodrigues

            #6
            Re: spelling errors in query terms

            Thanks for the tip. However I use Python with psycopg to interface with
            PostgreSQL, so unfortunately it's not an option.

            It's really strange, I could swear I used a function within PostgreSQL
            that could handle small spelling errors in the query term.

            Cheers,
            Joel

            On Monday, December 8, 2003, at 11:34 , Craig O'Shannessy wrote:
            [color=blue]
            > Hi Joel,
            >
            > If your using java, and full text, fuzzy searching is an important part
            > of
            > your app, you might want to check out Lucene. I evaluated it for a
            > project, and it was very good (the project was a fuzzy search of huge
            > pages of text).
            >
            > http://jakarta.apache.org/lucene/docs/index.html[/color]

            _______________ _______________ _______________ _______________ _________
            Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
            http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France


            ---------------------------(end of broadcast)---------------------------
            TIP 8: explain analyze is your friend

            Comment

            Working...