The Berkeley FrameNet project is creating an on-line lexical resource for English, based on frame semantics and supported by corpus evidence.
The aim is to document the range of semantic and syntactic combinatorial possibilities of each word in each of its senses, through computer-assisted annotation of example sentences and automatic tabulation and display of the annotation results.
The major product of this work, the FrameNet lexical database, currently contains more than 11.600 lexical units, more than 6.800 of which are fully annotated, in more than 960 semantic frames, exemplified in more than 150.000 annotated sentences.
In addition to lexicographic work, FrameNet also annotates continuous texts as a demonstration of how frame semantics can contribute to text understanding.
This could be an interesting and useful tool to improve the efficiency of text extraction tools.
