Aho Corasick source code needed, please help

Collapse
This topic is closed.
X
X
 
  • Time
  • Show
Clear All
new posts
  • Seth

    Aho Corasick source code needed, please help

    I am in need of source code for the Aho Corasick algorithm. I have
    tried searching the web but can't seem to find any code.

    Is there a good site for c code I can search?

    Thanks in advance.
  • osmium

    #2
    Re: Aho Corasick source code needed, please help

    Seth writes:
    [color=blue]
    > I am in need of source code for the Aho Corasick algorithm. I have
    > tried searching the web but can't seem to find any code.
    >
    > Is there a good site for c code I can search?[/color]

    There are useful hits, but not necessarily source code, for < aho corasick>
    on Google.


    Comment

    • pete

      #3
      Re: Aho Corasick source code needed, please help

      osmium wrote:[color=blue]
      >
      > Seth writes:
      >[color=green]
      > > I am in need of source code for the Aho Corasick algorithm. I have
      > > tried searching the web but can't seem to find any code.
      > >
      > > Is there a good site for c code I can search?[/color]
      >
      > There are useful hits, but not necessarily source code,
      > for < aho corasick> on Google.[/color]

      The correct way to search for code,
      is to put the word "code" in your search.



      Aho Corasick algorithm code. Results 1 - 10 of about 1,100.
      Search took 0.13 seconds.

      From comp.compilers newsgroup: Techrepts & C tookit: Performance of (single,multi) keyword pattern matching algorithms

      [color=blue]
      > The Knuth-Morris-Pratt and Aho-Corasick algorithms performed linearly and
      > consistently (on widely varying keyword sets), as their theoretical
      > running time predicts. The Commentz-Walter algorithm (and its variants)
      > displayed more interesting behaviour, greatly out-performing even the best
      > Aho-Corasick variant on a large portion of the input data. The
      > recommendations section of this paper details the conditions under which a
      > particular algorithm should be chosen.
      > \end{abstract}
      >
      > The report can be obtained by anonymous ftp from:
      > ftp.win.tue.nl (also known as 131.155.70.100)
      > in directory:
      > pub/techreports/pi/pattm/bench
      > Files WARRANTY (says that there is NO warranty)
      > pattperf...
      > pattmkit.tar.gz (contains the C code)[/color]


      --
      pete

      Comment

      • osmium

        #4
        Re: Aho Corasick source code needed, please help

        pete wrote:
        [color=blue][color=green]
        > > There are useful hits, but not necessarily source code,
        > > for < aho corasick> on Google.[/color]
        >
        > The correct way to search for code,
        > is to put the word "code" in your search.[/color]

        I wasn't *trying* to find code. The only reason I tried at all was because
        I thought there might likely be a spelling error in Corasick.


        Comment

        • pete

          #5
          Re: Aho Corasick source code needed, please help

          osmium wrote:[color=blue]
          >
          > pete wrote:
          >[color=green][color=darkred]
          > > > There are useful hits, but not necessarily source code,
          > > > for < aho corasick> on Google.[/color]
          > >
          > > The correct way to search for code,
          > > is to put the word "code" in your search.[/color]
          >
          > I wasn't *trying* to find code.[/color]

          OP was.

          --
          pete

          Comment

          • Seth

            #6
            Re: Aho Corasick source code needed, please help

            pete <pfiland@mindsp ring.com> wrote in message news:<401A558C. 6ADC@mindspring .com>...[color=blue]
            > osmium wrote:[color=green]
            > >
            > > pete wrote:
            > >[color=darkred]
            > > > > There are useful hits, but not necessarily source code,
            > > > > for < aho corasick> on Google.
            > > >
            > > > The correct way to search for code,
            > > > is to put the word "code" in your search.[/color]
            > >
            > > I wasn't *trying* to find code.[/color]
            >
            > OP was.[/color]

            Thanks for the help, I checked out that code but it is all
            copyrighted, which means they might as well have not even posted it as
            it is useless.

            Is there any source where I can get free, uncopyrighted code? Or
            should I just buy a book and write the code myself? Or is that in
            violation of the book's copyright?

            I have read about 2 dozen papers on the subject of string matching.
            Every paper is about how someone CLAIMS they can beat the AC
            algorithm. No evidence is given. No proof. No source code. No
            independent tests. No way for the reader to test. Just their
            bragging, basically saying "mine is bigger than yours and you'll just
            have to believe me."

            A typical example is Mike Fisk and George Varghese and their paper
            "Applying Fast String Matching to Intrusion Detection." They, of
            course, claim they can beat AC, and even have some cute little graphs.
            They offer no proof of this claim, of course, and no source code, no
            way to test it, and no evidence they ever tested it themselves. But,
            they claim they will release the code in a library. This paper was
            written in 2001. Have they released the library yet? Nope. I
            emailed them both, and they both ignored my request.

            At best, they are useless wastes of my time that have contributed
            nothing to the world of computer science other than empty brags. At
            worst, they are liars.

            I'm starting to think I should apply for a grant, spend the money in
            Tahiti, and then write a paper claiming I can beat AC.

            Comment

            • Alan Balmer

              #7
              Re: Aho Corasick source code needed, please help

              On 4 Feb 2004 11:14:59 -0800, skintigh_spam@y ahoo.com (Seth) wrote:
              [color=blue]
              >Thanks for the help, I checked out that code but it is all
              >copyrighted, which means they might as well have not even posted it as
              >it is useless.[/color]

              Not necessarily. You can use the ideas expressed by the code, you just
              have to find your own expression of those ideas. If you want to use
              the code as written, or modify the code, contact the copyright holder.
              They may make it available with a free license.[color=blue]
              >
              >Is there any source where I can get free, uncopyrighted code?[/color]

              No. All code is copyrighted automatically as soon as it's expressed in
              tangible form (including yours;-). However, copyrighted code can be
              free, depending on the wishes of the copyright holder.
              [color=blue]
              > Or
              >should I just buy a book and write the code myself? Or is that in
              >violation of the book's copyright?[/color]

              If you use the ideas in the book and write the code yourself, there's
              no violation. If you simply copy code from the book, there probably
              is, depending on the terms set by the copyright holder.

              --
              Al Balmer
              Balmer Consulting
              removebalmercon sultingthis@att .net

              Comment

              • Mark McIntyre

                #8
                Re: Aho Corasick source code needed, please help

                On 4 Feb 2004 11:14:59 -0800, in comp.lang.c , skintigh_spam@y ahoo.com
                (Seth) wrote:
                [color=blue]
                >Is there any source where I can get free, uncopyrighted code?[/color]

                thats 2 questions:

                q1) where can I get uncopyrighted code?
                a1) you can't - anything anyone writes down is copyrighted to them,
                including this email, which is mostly copyrighted to me.

                q2) where can I get free code?
                a2) many authors make their code freely available. Its possible that
                the authors you found do this too - check with them.
                [color=blue]
                >Or should I just buy a book and write the code myself?[/color]

                that would be a better idea anyway.
                [color=blue]
                >Or is that in violation of the book's copyright?[/color]

                if you copy enough of the book verbatim, you're breaching copyright.
                "Fair use" permits you to copy reasonable extracts however.

                --
                Mark McIntyre
                CLC FAQ <http://www.eskimo.com/~scs/C-faq/top.html>
                CLC readme: <http://www.angelfire.c om/ms3/bchambless0/welcome_to_clc. html>


                ----== Posted via Newsfeed.Com - Unlimited-Uncensored-Secure Usenet News==----
                http://www.newsfeed.com The #1 Newsgroup Service in the World! >100,000 Newsgroups
                ---= 19 East/West-Coast Specialized Servers - Total Privacy via Encryption =---

                Comment

                • Keith Thompson

                  #9
                  Re: Aho Corasick source code needed, please help

                  Alan Balmer <albalmer@att.n et> writes:[color=blue]
                  > On 4 Feb 2004 11:14:59 -0800, skintigh_spam@y ahoo.com (Seth) wrote:[/color]
                  [...][color=blue][color=green]
                  > >Is there any source where I can get free, uncopyrighted code?[/color]
                  >
                  > No. All code is copyrighted automatically as soon as it's expressed in
                  > tangible form (including yours;-). However, copyrighted code can be
                  > free, depending on the wishes of the copyright holder.[/color]

                  The owner of a piece of code (or any copyrightable work) can
                  explicitly release it to the public domain. Such code is not covered
                  by copyright.

                  This is actually fairly rare; works don't become public domain unless
                  the author explicitly releases them or the copyright expires. I don't
                  think any C code has existed long enough for its copyright to expire.

                  There's a great deal of *free* code; it's covered by copyright and by
                  a license that explicitly allows certain uses.

                  One exception is Doug Gwyn's public domain Q8 library, available at
                  <http://www.lysator.liu .se/c/q8/>.

                  Disclaimer: I am not a lawyer, and this is not an appropriate forum
                  for discussing legal issues. I mention this only to clarify what's
                  already been said. Do not depend on the correctness of anything I say
                  about intellectual property law, other than this sentence.

                  --
                  Keith Thompson (The_Other_Keit h) kst-u@mib.org <http://www.ghoti.net/~kst>
                  San Diego Supercomputer Center <*> <http://www.sdsc.edu/~kst>
                  Schroedinger does Shakespeare: "To be *and* not to be"

                  Comment

                  • Mark McIntyre

                    #10
                    Re: Aho Corasick source code needed, please help

                    On Thu, 05 Feb 2004 21:40:27 GMT, in comp.lang.c , Keith Thompson
                    <kst-u@mib.org> wrote:
                    [color=blue]
                    >Alan Balmer <albalmer@att.n et> writes:[color=green]
                    >> On 4 Feb 2004 11:14:59 -0800, skintigh_spam@y ahoo.com (Seth) wrote:[/color]
                    >[...][color=green][color=darkred]
                    >> >Is there any source where I can get free, uncopyrighted code?[/color]
                    >>
                    >> No. All code is copyrighted automatically as soon as it's expressed in
                    >> tangible form (including yours;-). However, copyrighted code can be
                    >> free, depending on the wishes of the copyright holder.[/color]
                    >
                    >The owner of a piece of code (or any copyrightable work) can
                    >explicitly release it to the public domain. Such code is not covered
                    >by copyright.[/color]

                    I'm not sure thats quite right. You can release it to the PD, but you
                    still have copyright. Its an innate property of your own work.
                    [color=blue]
                    >This is actually fairly rare; works don't become public domain unless
                    >the author explicitly releases them or the copyright expires. I don't
                    >think any C code has existed long enough for its copyright to expire.[/color]

                    70 years after the author's death? Soonest would be ~2045 I guess...


                    --
                    Mark McIntyre
                    CLC FAQ <http://www.eskimo.com/~scs/C-faq/top.html>
                    CLC readme: <http://www.angelfire.c om/ms3/bchambless0/welcome_to_clc. html>


                    ----== Posted via Newsfeed.Com - Unlimited-Uncensored-Secure Usenet News==----
                    http://www.newsfeed.com The #1 Newsgroup Service in the World! >100,000 Newsgroups
                    ---= 19 East/West-Coast Specialized Servers - Total Privacy via Encryption =---

                    Comment

                    • Keith Thompson

                      #11
                      Re: Aho Corasick source code needed, please help

                      Mark McIntyre <markmcintyre@s pamcop.net> writes:[color=blue]
                      > On Thu, 05 Feb 2004 21:40:27 GMT, in comp.lang.c , Keith Thompson
                      > <kst-u@mib.org> wrote:[/color]
                      [...][color=blue][color=green]
                      > >The owner of a piece of code (or any copyrightable work) can
                      > >explicitly release it to the public domain. Such code is not covered
                      > >by copyright.[/color]
                      >
                      > I'm not sure thats quite right. You can release it to the PD, but you
                      > still have copyright. Its an innate property of your own work.[/color]

                      I think it's right, but I could easily be mistaken. Blah blah
                      disclaimer blah blah.

                      --
                      Keith Thompson (The_Other_Keit h) kst-u@mib.org <http://www.ghoti.net/~kst>
                      San Diego Supercomputer Center <*> <http://www.sdsc.edu/~kst>
                      Schroedinger does Shakespeare: "To be *and* not to be"

                      Comment

                      • Martin Dickopp

                        #12
                        [OT] Re: Aho Corasick source code needed, please help

                        Mark McIntyre <markmcintyre@s pamcop.net> writes:
                        [color=blue]
                        > On Thu, 05 Feb 2004 21:40:27 GMT, in comp.lang.c , Keith Thompson
                        > <kst-u@mib.org> wrote:
                        >[color=green]
                        >>The owner of a piece of code (or any copyrightable work) can
                        >>explicitly release it to the public domain.[/color][/color]

                        In the USA, yes. In most European countries, no. Not sure about the rest
                        of the world.
                        [color=blue][color=green]
                        >>Such code is not covered by copyright.[/color][/color]

                        Correct.
                        [color=blue]
                        > I'm not sure thats quite right. You can release it to the PD, but you
                        > still have copyright.[/color]

                        If you release a work in the public domain (assuming it's possible at
                        all in your jurisdiction), you give up your copyright by definition of
                        public domain. A work cannot simultaneously be copyrighted and in the
                        public domain.
                        [color=blue]
                        > Its an innate property of your own work.[/color]

                        In the USA, no: copyrights can be completely transfered or given up. The
                        European coypright tradition distiguishes between "author's rights" and
                        "usage rights." The latter can be transfered, sold, etc., but not the
                        former.
                        [color=blue]
                        > 70 years after the author's death? Soonest would be ~2045 I guess...[/color]

                        Yes, but only in the unlikely case that this period of time (recently
                        extended to 90 years in the USA, retroactively covering all works that
                        were about to enter the public domain, BTW) will not be further extended
                        until then.

                        Martin

                        Comment

                        • Joona I Palaste

                          #13
                          Re: [OT] Re: Aho Corasick source code needed, please help

                          Martin Dickopp <expires-2004-03-31@zero-based.org> scribbled the following:[color=blue]
                          > Mark McIntyre <markmcintyre@s pamcop.net> writes:[color=green]
                          >> On Thu, 05 Feb 2004 21:40:27 GMT, in comp.lang.c , Keith Thompson
                          >> <kst-u@mib.org> wrote:[color=darkred]
                          >>>The owner of a piece of code (or any copyrightable work) can
                          >>>explicitly release it to the public domain.[/color][/color][/color]
                          [color=blue]
                          > In the USA, yes. In most European countries, no. Not sure about the rest
                          > of the world.[/color]

                          You should see the discussion on the comp.sys.amiga. games group. People
                          claim that old Amiga games, no longer on the market, are abandonware,
                          and thus not covered by copyright. The fact that there is no such legal
                          status as "abandonwar e" does not deter them. Apparently they think that
                          because the concept "abandonwar e" has a name, it's legally recognised.

                          --
                          /-- Joona Palaste (palaste@cc.hel sinki.fi) ------------- Finland --------\
                          \-- http://www.helsinki.fi/~palaste --------------------- rules! --------/
                          "O pointy birds, O pointy-pointy. Anoint my head, anointy-nointy."
                          - Dr. Michael Hfuhruhurr

                          Comment

                          • osmium

                            #14
                            Re: [OT] Re: Aho Corasick source code needed, please help

                            Martin Dickopp writes:
                            [color=blue][color=green]
                            > > 70 years after the author's death? Soonest would be ~2045 I guess...[/color]
                            >
                            > Yes, but only in the unlikely case that this period of time (recently
                            > extended to 90 years in the USA, retroactively covering all works that
                            > were about to enter the public domain, BTW) will not be further extended
                            > until then.[/color]

                            AFAIK US copyright law is designed, and modified as necessary, so Mickey
                            Mouse will never lose its copyright. Mickey Mouse was born about 1928, so,
                            from here, it looks like anything published after 1928 will *never* lose its
                            copyright. Mickey Mouse seems to have a lot of friends in high places. But
                            one can lose copyright as a result of inattention or dying

                            Note that in the US there are at least three distinct sets of intellectual
                            property laws: trademarks, patents and copyright.

                            IANAL. Thank God!


                            Comment

                            • CBFalconer

                              #15
                              Re: [OT] Re: Aho Corasick source code needed, please help

                              Joona I Palaste wrote:[color=blue]
                              > Martin Dickopp <expires-2004-03-31@zero-based.org> scribbled[color=green]
                              > > Mark McIntyre <markmcintyre@s pamcop.net> writes:[color=darkred]
                              > >> Keith Thompson <kst-u@mib.org> wrote:[/color][/color]
                              >[color=green][color=darkred]
                              > >>> The owner of a piece of code (or any copyrightable work) can
                              > >>> explicitly release it to the public domain.[/color][/color]
                              >[color=green]
                              > > In the USA, yes. In most European countries, no. Not sure about
                              > > the rest of the world.[/color]
                              >
                              > You should see the discussion on the comp.sys.amiga. games group.
                              > People claim that old Amiga games, no longer on the market, are
                              > abandonware, and thus not covered by copyright. The fact that
                              > there is no such legal status as "abandonwar e" does not deter
                              > them. Apparently they think that because the concept
                              > "abandonwar e" has a name, it's legally recognised.[/color]

                              In practice, and until and if there is money to be made, they are
                              right. Once someone can make any money (net) all rights
                              automatically regain full force :-)

                              There are people around making a living from this concept,
                              especially in the field of patents.

                              --
                              Chuck F (cbfalconer@yah oo.com) (cbfalconer@wor ldnet.att.net)
                              Available for consulting/temporary embedded and systems.
                              <http://cbfalconer.home .att.net> USE worldnet address!


                              Comment

                              Working...