Return-Path: X-Original-To: apmail-uima-user-archive@www.apache.org Delivered-To: apmail-uima-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2E8DAD785 for ; Mon, 4 Mar 2013 10:31:08 +0000 (UTC) Received: (qmail 21638 invoked by uid 500); 4 Mar 2013 10:31:07 -0000 Delivered-To: apmail-uima-user-archive@uima.apache.org Received: (qmail 21570 invoked by uid 500); 4 Mar 2013 10:31:07 -0000 Mailing-List: contact user-help@uima.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@uima.apache.org Delivered-To: mailing list user@uima.apache.org Received: (qmail 21546 invoked by uid 99); 4 Mar 2013 10:31:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Mar 2013 10:31:07 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [213.145.99.122] (HELO mail.tetracom-bg.com) (213.145.99.122) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Mar 2013 10:30:59 +0000 Received: (qmail 12835 invoked by uid 1009); 4 Mar 2013 10:30:37 -0000 Received: from 78.90.180.200 by mail.tetracom-bg.com (envelope-from , uid 1008) with qmail-scanner-1.25-st-qms (clamdscan: 0.91.2/4648. spamassassin: 3.1.3. perlscan: 1.25-st-qms. Clear:RC:1(78.90.180.200):. Processed in 0.064465 secs); 04 Mar 2013 10:30:37 -0000 X-Antivirus-MAIL.TETRACOM-BG.COM-Mail-From: diman@tetracom.com via mail.tetracom-bg.com X-Antivirus-MAIL.TETRACOM-BG.COM: 1.25-st-qms (Clear:RC:1(78.90.180.200):. Processed in 0.064465 secs Process 12829) Received: from unknown (HELO ?192.168.1.74?) (78.90.180.200) by mail.tetracom-bg.com with SMTP; 4 Mar 2013 10:30:37 -0000 Message-ID: <513477C9.6020100@tetracom.com> Date: Mon, 04 Mar 2013 12:30:33 +0200 From: Diman Karagiozov User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130106 Thunderbird/17.0.2 MIME-Version: 1.0 To: user@uima.apache.org Subject: Re: UIMA [new user] References: In-Reply-To: Content-Type: multipart/alternative; boundary="------------090209030004080202060301" X-Virus-Checked: Checked by ClamAV on apache.org --------------090209030004080202060301 Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Hi there, UIMA does not do out-of-the-box text extraction from various document formats. For this task you can use TIKA ( http://tika.apache.org/). In our project (ATLAS - http://www.atlasproject.eu/) we've developed a text extraction framework prior UIMA wrapped NLP tools for different languages. Do not hesitate to contact me if you need more information on this. greetings Diman On 03/04/2013 12:26 PM, Mehdi Alaoui Belghiti wrote: > Hi, > I was looking for a platform that can make me processing files written in > different formats (xml, owl, rdf,...) and extract relevant information. So > i found UIMA. > However, I found only examples for processing natural language. > Is UIMA limited to this, or it can allow me for example extracting classes > or attributes from an a Ecore file? > > Thank you for help! I would be happy to find examples of processing more > complex data. > --------------090209030004080202060301--