Return-Path: X-Original-To: apmail-uima-user-archive@www.apache.org Delivered-To: apmail-uima-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C1E0C9A93 for ; Tue, 17 Jul 2012 19:22:21 +0000 (UTC) Received: (qmail 99543 invoked by uid 500); 17 Jul 2012 19:22:21 -0000 Delivered-To: apmail-uima-user-archive@uima.apache.org Received: (qmail 99466 invoked by uid 500); 17 Jul 2012 19:22:20 -0000 Mailing-List: contact user-help@uima.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@uima.apache.org Delivered-To: mailing list user@uima.apache.org Delivered-To: moderator for user@uima.apache.org Received: (qmail 67833 invoked by uid 99); 13 Jul 2012 20:31:34 -0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of bplank@gmail.com designates 209.85.212.171 as permitted sender) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:references:from:content-type:x-mailer:in-reply-to :message-id:date:to:content-transfer-encoding:mime-version; bh=ARaYZlJ81SrjdTomh1KUGNwdMiRhq5FufJSXhY88ai8=; b=KDiW43nqEqgH+MaZS1P4y4RDRUoV9KkiDEvozWw8RHAmlb68CP1EDn0zPrraeaV50x yagnjTK6nygLJCxzt833eMfD5Q+lqrxMwOdKLfoFiLDlwxdgx++g7CHh+x6G4iV24INZ BpS5A3MsBhKWBD+GJJG5ou+bGPCN+O6pJTUnuULsylt1h9gutAwCVypu63qA0JJOl1eP 8znStMfFPQfoez8BdjnsVqR97iYm8BPjjkBd3VbB1eu/DWejRZMBMcYzxKHadiRUtjkx yjPNY5lrTgBjXduAqB1wfXe+f2NIUevbQfsGNiCQJI2ix1HHoFocSZ3q3z1u2rVM8NOE V2HA== Subject: Re: Using Apache UIMA for processing russian texts References: <2526B96D977292429F23E66A7D6BCB8C1E5DE52E@AMY.ukp.informatik.tu-darmstadt.de> <5000771C.9020103@schor.com> From: Barbara Plank Content-Type: text/plain; charset=utf-8 X-Mailer: iPhone Mail (8J2) In-Reply-To: <5000771C.9020103@schor.com> Message-Id: Date: Fri, 13 Jul 2012 22:29:50 +0200 To: "user@uima.apache.org" Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (iPhone Mail 8J2) On Jul 13, 2012, at 9:29 PM, Marshall Schor wrote: > yes, this is a commonly done thing. >=20 > The extermal resources can be loaded once and shared across multiple annot= ators, > for instance. You may read more about this here: >=20 > http://uima.apache.org/d/uimaj-2.4.0/tutorials_and_users_guides.html#ugr.t= ug.aae.accessing_external_resource_files >=20 >=20 > On 7/13/2012 5:52 AM, =D0=90=D0=BB=D0=B5=D0=BA=D1=81=D0=B0=D0=BD=D0=B4=D1=80= =D0=9A=D1=80=D1=8B=D0=BB=D0=BE=D0=B2 wrote: >> ok, tnank You for Your answer! >>=20 >> So, I will see DKPro Core Framework today, >>=20 >> And also i would like to ask You -- can i use external >> resources/libraries/api (etc) in my annotators? (It's may be keywords and= >> entity extractors, filters, rubricators, russian morphology, detecrots, >> etc) - i have this libraties (example: aot.ru - the Alexey Sokirko's >> morphology projects -- greatest russian morphology) >> But hight level of this project will be Apache UIMA. (All my logic -- >> incapsulated in Annotators, written by me). It's possible? >>=20 >> You faithfully, Alexander >>=20 >>=20 >> 2012/7/12 Torsten Zesch >>=20 >>> Redirected the request to UIMA userlist ... >>>=20 >>> Hi Alexander, >>>=20 >>> In addition to what you have already found, the DKPro Core Framework >>> http://code.google.com/p/dkpro-core-asl/ >>> has a POS Tagger (TreeTagger) that comes with a Russian model. >>>=20 >>> I am not aware of Russian components for detecting dates, regions etc. >>>=20 >>> -Torsten >>>=20 >>>> -----Original Message----- >>>> From: =D0=90=D0=BB=D0=B5=D0=BA=D1=81=D0=B0=D0=BD=D0=B4=D1=80 =D0=9A=D1=80= =D1=8B=D0=BB=D0=BE=D0=B2 [mailto:qblook@gmail.com] >>>> Sent: Wednesday, July 11, 2012 11:17 AM >>>> To: dev@uima.apache.org >>>> Subject: Using Apache UIMA for processing russian texts >>>>=20 >>>> Hello! >>>>=20 >>>> Sorry of my English - It's bad.. >>>> I would like to use Apache UIMA Annotators and other UIMA Tools for >>>> processing russian language texts.. It's search of statistircs term, >>> dates, >>>> regions in text documents. >>>> In examples I found only english (and some other) languages, but no >>> russian. >>>> But on Apache UIMA seb site written that Showball Annotator supports th= e >>>> russian language. >>>> So, I would like to ask - what Annotators supports russian language? Ca= n >>> I use >>>> external russian morphology systems in Annotators, created by using >>> Apache >>>> UIMA? >>>>=20 >>>> Thank You >>>> Your faithfully, >>>> Alexander >=20 >=20