Tiedemann (2007), wrote an interesting paper about Genetic Algorithms, regarding how to improve passage retrieval in Question Answering systems. In this paper four selection strategies in evolutionary optimization of information retrieval (IR) in a question answering system are compared. The IR index has been enhanced by linguistic features to improve the retrieval performance of potential [...]
Archive for the ‘passage retrieval’ Category
Question Answering systems & Genetic Algorithms
Posted in information retrieval, Methodology, passage retrieval, question and answering, tagged Genetic Algorithms, Tiedemann on April 23, 2010 | Leave a Comment »
Tools: OpenEphyra Question Answering System
Posted in information retrieval, natural language processing, passage retrieval, question and answering, tools, trec, tagged IR, natural language processing, trec on April 23, 2010 | 3 Comments »
OpenEphyra is the first open framework for question answering (QA). It retrieves accurate answers to natural language questions from the Web and other sources. OpenEphyra is hosted at SourceForge and published under the GNU General Public License (GPL). The framework comes with implementations of a number of algorithms that proved effective in CMU’s Ephyra system, [...]
Passage Retrieval – Some Algorithms
Posted in information retrieval, Methodology, passage retrieval, question and answering, trec, tagged algorithm, alicante, bm25, Clarke, clef, hovy, ibm, information, isi, Ittycheriah, lee, Light, llopis, mitre, multitext, okapi, okapi bm25, PR, Robertson, siteq, sliding window, tellex, trec, vicedo, voting on April 19, 2010 | 2 Comments »
Research in Question Answering (QA) systems has been improved by the Text Retrieval Conference (TREC) series since 1999. Almost all QA systems fielded at TREC employ some passage retrieval technique to reduce the size of the relevant document set to a manageable number of passages. Here are a bunch of algorithms that might be useful [...]
Passage Retrieval: How to define passages
Posted in information retrieval, passage retrieval, tagged callan, information, kaszkiel, zobel on April 14, 2010 | 2 Comments »
Passage Retrieval (PR) is a typical Information Retrieval (IR) system that returns short passages in response to a user query. But how to define the size and style of that short passage? It should be the paragraph where the answer probably is? Should we retrieve the whole section of the original document? Or should we [...]
Information Retrieval: Text Compression
Posted in information retrieval, passage retrieval, text compression, tagged arithmetic, huffman, IR, modern informantion retrieval, PR, statistical, text compression, zip-lempel on April 13, 2010 | 2 Comments »
Text compression is all about finding the best way to represent an original text into fewer bytes. Text compressors try to build a reduced representation of the original source data, identifying and using the relevant structures from it. One important point is that the original source text can be restored from this reduced representation without [...]
Some blueprints…
Posted in information retrieval, Methodology, ontology, passage retrieval, prymas, tagged name-entity, penn treebank, pos-tagging, stemming, stopwords, stopwords removal, tagger, triples on April 6, 2010 | 1 Comment »
It works something like that: We need to build a system that is capable of automatically identifying highly relevant triples (pairs of concepts connected by a relation) over concepts from an existing ontology. By extracting relevant verbs and their grammatical arguments from a domain-specific text collection and computing corresponding relations through a combination of linguistic [...]
Passage Retrieval vs Information Retrieval
Posted in information retrieval, passage retrieval, Uncategorized, tagged documents, information, IR, lucene, PR on March 30, 2010 | 7 Comments »
Why choose Passage Retrieval (PR) over Information Retrieval (IR) ? Which one is the best painkiller? Information is essential. However, if we can’t find it, it really doesn’t exists. Even if we have all the documents properly stored and indexed, our effort is wasted, unless we have reliable and fast mechanisms to look up these [...]
Gold Rush
Posted in knowledge management, Methodology, passage retrieval, prymas, tagged Entity Recognition, GATE, machine learning, POS, semantic network, Steeming, Tokenization, UIMA, unstructured knowledge on March 23, 2010 | 1 Comment »
Usually X marks the spot, but the path for conversion of unstructured knowledge into a reliable and efficient knowledge database of facts isn’t straightforward. Despite the knowledge being already assembled in a machine-optimal-representation, information recovery into a English natural language answer isn’t trivial. Nowadays, the amount of information that companies deal with is overwhelming. Being [...]
