Howdy!
I am a graduate student and completely at sea in connection with a
project that is underway.
I wish to index text documents using an appropriate indexing tool (e.g.
MS Index Server). For a given query string, I wish to obtain relevant
documents from the text index. Additionally, I would like to obtain the
statistics for each of the documents retrieved - for example, the
frequency of occurrence of terms within each document. This I believe
will help me correlate documents, extracted from two separately
maintained indices (through vector or probability models).
I was planning on using MS SQL. However, I am unsure if MS SQL provides
me with the statistics I need to implement the correlation algorithm.
Kindly suggest a tool that may help my cause.
Cordially, Kini
I am a graduate student and completely at sea in connection with a
project that is underway.
I wish to index text documents using an appropriate indexing tool (e.g.
MS Index Server). For a given query string, I wish to obtain relevant
documents from the text index. Additionally, I would like to obtain the
statistics for each of the documents retrieved - for example, the
frequency of occurrence of terms within each document. This I believe
will help me correlate documents, extracted from two separately
maintained indices (through vector or probability models).
I was planning on using MS SQL. However, I am unsure if MS SQL provides
me with the statistics I need to implement the correlation algorithm.
Kindly suggest a tool that may help my cause.
Cordially, Kini