Hi there,
UIMA does not do out-of-the-box text extraction from various document
formats.
For this task you can use TIKA ( http://tika.apache.org/).
In our project (ATLAS - http://www.atlasproject.eu/) we've developed a
text extraction framework prior UIMA wrapped NLP tools for different
languages. Do not hesitate to contact me if you need more information on
this.
greetings
Diman
On 03/04/2013 12:26 PM, Mehdi Alaoui Belghiti wrote:
> Hi,
> I was looking for a platform that can make me processing files written in
> different formats (xml, owl, rdf,...) and extract relevant information. So
> i found UIMA.
> However, I found only examples for processing natural language.
> Is UIMA limited to this, or it can allow me for example extracting classes
> or attributes from an a Ecore file?
>
> Thank you for help! I would be happy to find examples of processing more
> complex data.
>
|