Feeds:
Posts
Comments

Archive for the ‘Methodology’ Category

Tiedemann (2007), wrote an interesting paper about Genetic Algorithms, regarding how to improve passage retrieval in Question Answering systems. In this paper four selection strategies in evolutionary optimization of information retrieval (IR) in a question answering system are compared. The IR index has been enhanced by linguistic features to improve the retrieval performance of potential [...]

Read Full Post »

Research in Question Answering (QA) systems has been improved by the Text Retrieval Conference (TREC) series since 1999. Almost all QA systems fielded at TREC employ some passage retrieval technique to reduce the size of the relevant document set to a manageable number of passages. Here are a bunch of algorithms that might be useful [...]

Read Full Post »

“… semantic networks suffer from an inherent semantic ambiguity. For example, we were unable to differentiate individuals from concepts in the resulting concept maps. Moreover, due to the direct translation of written sentences into concept map sentences, various terms were used to express synonyms, resulting in further ambiguity …” Hummm…. interesting (or not). References: Building [...]

Read Full Post »

We have seen that are many alternatives in designing an Information Retrieval (IR) system. However, how do we know which of these techniques are effective? To measure IR effectiveness in the standard way, we need an input test collection consisting of three things: A document collection A test suite of information needs, expressible as queries [...]

Read Full Post »

For this research project, instead of a plan-driven or a disciplined methodology, we use an Agile Software development framework, SCRUM. But what is SCRUM? SCRUM is an iterative, incremental framework for agile software development. Scrum encouraging co-location, and verbal communication across all team members and disciplines that are involved in the project. A key principle [...]

Read Full Post »

Extensible Markup Language (XML) is a simple and very flexible text format derived from SGML (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML also plays an increasingly important role in the store and exchange of a wide variety of data on the Web. XML is interesting to people who wish [...]

Read Full Post »

It works something like that: We need to build a system that is capable of automatically identifying highly relevant triples (pairs of concepts connected by a relation) over concepts from an existing ontology. By extracting relevant verbs and their grammatical arguments from a domain-specific text collection and computing corresponding relations through a combination of linguistic [...]

Read Full Post »

What Lucene is? Lucene is a high-performance, scalable Information Retrieval (IR) library, created originally by Doug Cutting. It provides indexing and searching features to applications. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. Lucene is a mature, open-source project implemented in Java, available for free download. Moreover, it’s [...]

Read Full Post »

Usually X marks the spot, but the path for conversion of unstructured knowledge into a reliable and efficient knowledge database of facts isn’t straightforward. Despite the knowledge being already assembled in a machine-optimal-representation, information recovery into a English natural language answer isn’t trivial. Nowadays, the amount of information that companies deal with is overwhelming. Being [...]

Read Full Post »

How to represent knowledge in a way that could be efficiently manipulated by a machine program? How to formally represent the domain of a problem? How to achieve intelligent behavior? These are the 1M $ questions… The key problem is to find a representation and a supporting system that make inferences within the constraints, appropriated [...]

Read Full Post »

Follow

Get every new post delivered to your Inbox.