Triple extraction (subject, predicate, object) is a good way to turn free-form sentences into structured knowledge.
The GATE Predicate-Argument Extractor Component (PAX) could be very useful for this task.
PAX is a GATE component for extracting predicate-argument structures (PAS). PASs are used in various contexts to represent relations within a sentence structure. Different “semantic” parsers extract relational information from sentences, but there is no common format for storing this information. The predicate-argument extractor component (PAX) takes the annotations generated by selected parsers and transforms the parsers’ results into predicate-argument structures represented as triples (subject-verb-object).
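To make the idea concrete, here is a minimal Python sketch of what turning parser output into subject-verb-object triples can look like. It is not PAX's code and it does not use GATE's API: the Triple type, the extract_triples function and the input format (a word list plus (head, relation, dependent) edges labelled with the Stanford-style relations nsubj and dobj) are all assumptions made for illustration.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """Hypothetical container for one subject-verb-object triple."""
    subject: str
    predicate: str
    obj: str


def extract_triples(tokens, dependencies):
    """Build triples from dependency edges (illustrative input format).

    tokens: list of words in the sentence.
    dependencies: list of (head_index, relation, dependent_index) tuples.
    Only "nsubj" (subject) and "dobj" (direct object) edges are used.
    """
    subjects = {}
    objects = {}
    for head, relation, dependent in dependencies:
        if relation == "nsubj":
            subjects.setdefault(head, []).append(dependent)
        elif relation == "dobj":
            objects.setdefault(head, []).append(dependent)

    triples = []
    # A verb yields a triple only if it has both a subject and an object.
    for head in sorted(subjects):
        for s in subjects[head]:
            for o in objects.get(head, []):
                triples.append(Triple(tokens[s], tokens[head], tokens[o]))
    return triples


tokens = ["PAX", "extracts", "triples"]
dependencies = [(1, "nsubj", 0), (1, "dobj", 2)]
print(extract_triples(tokens, dependencies))
```

A real extractor has to deal with much more (passives, multi-word arguments, different parsers with different label sets), which is exactly the gap a common triple format is meant to close.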
I spent the day testing this plug-in and I’m very satisfied with the first results.
By the way, in the beginning I had some configuration problems, but after a few emails with the project creators everything started working fine. Thanks, guys!
More info about this plug-in is available here.
