The DIRE development platform is based on GATE. This has two implications:
As presented in the GATE manual, “the basic business of GATE is annotating documents”. This is quite powerful in itself, given the numerous possible definitions and uses of both document (any content) and annotation (any content attached to any interval within a document). However, GATE will only process documents individually and sequentially (so if you need multi-document processing, you have to merge them into a single document).
A DIRE-base software can be divided into three sets:
All members of the DRIM team that work on documents are strongly encouraged to contribute to and benefit from the DIRE platform. Those that are not sure how to do so will find useful information here.
All plugins listed here are available through our repository, located at http://liris.cnrs.fr/dire/gate/gate-update-site.xml
.
Three plugins are currently available, providing various Processing Resources:
Corrector
means that that resource will modify the content of the document being processed. Be careful not to run any annotating resource before a Corrector: annotations are characterised by their offset from the first character of the document, and Correctors may insert or delete characters.Annotators
will add annotations to the processed document (and may modify existing ones), while File Writers
will do the eponymous activity (and will usually not modify nor add annotations).