From: Peter Klügl
Date: Fri, 31 Jan 2014 10:39:09 +0100
To: dev@uima.apache.org
Subject: Re: UIMA Ruta next steps

It's now the end of January. We (Martin and I) are currently quite busy, but the release process should be started soon.

Here's the list of unresolved issues for this release:
https://issues.apache.org/jira/browse/UIMA-3590?jql=project%20%3D%20UIMA%20AND%20resolution%20%3D%20Unresolved%20AND%20fixVersion%20%3D%20%222.2.0ruta%22%20AND%20component%20%3D%20ruta%20ORDER%20BY%20priority%20DESC

UIMA-3303 Add a way to alias types in RUTA (e.g. "IMPORT type AS alias")
Alexandre, do you know when you will be able to resolve this issue? We could maybe defer the missing piece to the next release.

UIMA-3495 Report ambiguous types in Ruta Editor
UIMA-3533 Support new import functionality in Workbench
I will try to implement both issues ASAP after UIMA-3303 is resolved.

UIMA-3347 Ruta: Missing False Positives in "Annotation Test" view
This issue can be resolved. We should maybe set the default for "useAllTypes" to true.

UIMA-3441 Ruta: Extend classpath for Annotation Test run
Martin, can you please update the documentation and resolve the issue?

UIMA-3309 Ruta: Filter file names in Query View
Martin, can you please update the documentation and resolve the issue?

UIMA-3469 Ruta: Annotation Browser View Extensions
Martin, can you please update the documentation and resolve the issue?

UIMA-3539 Prepare Ruta 2.2.0 release
No work left here, I think.

Best,

Peter

On 19.12.2013 15:28, Peter Klügl wrote:
> Hi,
>
> I just want to start a discussion about the next release and maybe
> interesting directions for extensions.
>
> I am planning a bugfix release for the end of January, UIMA Ruta version
> 2.1.1
>
> List of the 26 already resolved issues for 2.1.1:
> https://issues.apache.org/jira/browse/UIMA-3342?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%222.1.1ruta%22%20AND%20component%20%3D%20ruta%20AND%20status%20in%20(Resolved%2C%20Closed)%20ORDER%20BY%20priority%20DESC
>
> List of currently unresolved issues:
> https://issues.apache.org/jira/browse/UIMA-2982?jql=project%20%3D%20UIMA%20AND%20resolution%20%3D%20Unresolved%20AND%20component%20%3D%20ruta%20ORDER%20BY%20priority%20DESC
>
> I think the following issues should (at least) be resolved in addition
> for 2.1.1 (some of them are already fixed, but the documentation is not
> yet up-to-date):
> - UIMA-3137: Cleanup Ruta launch configuration tabs
> - UIMA-3471: Arrays in Annotation Browser View
> - UIMA-3347: Ruta: Missing False Positives in "Annotation Test" view
> - UIMA-3286: Start anchor after optional literal
> - UIMA-3280: Option to specify vm arguments for Ruta launch config
> - UIMA-3283: Matching reference pointing outside of current window
> - UIMA-3303: Add a way to alias types in RUTA (e.g. "IMPORT type AS alias")
> - UIMA-3495: Report ambiguous types in Ruta Editor
> - UIMA-3441: Ruta: Extend classpath for Annotation Test run
> - UIMA-3469: Ruta: Annotation Browser View Extensions
> - UIMA-3275: Minor discrepencies in license and notice files
> - UIMA-3309: Ruta: Filter file names in Query View
> - UIMA-3485: Ruta: Workbench extension point for "Script execution finished"
>
> Maybe the issues for dropins-support should also be included.
>
> Are there any wishes/opinions on which other issues should be included?
>
> ###
>
> Here are a few ideas of major changes for a 2.2.x or 3.x release:
>
> 1. Making UIMA Ruta faster
> There are four aspects that can be considered:
> a) Parallelization/Scale-Out, already supported by UIMA-AS and friends
> b) Improvements in the current implementation. I know of at least four
> code fragments that can be improved
> c) Add new language constructs that are simply faster in some
> situations. I am thinking of an FST implementation similar to JAPE Plus
> and of an extension of the dynamic anchoring towards the operator plan
> optimization of SystemT
> d) Write faster rules. Some rules are just faster than others. This
> leads to a cookbook for best practices
>
> 2. Improve support for coreference information
> There are some nice ideas of unification-based grammars that can be
> included in the rule language. It does not have to be as mature as in
> SProUT, but maybe something like in CAFETIERE. This would automatically
> solve the restriction of value assignments in actions vs conditions
>
> 3. Support arbitrary CAS collections in the Ruta Workbench
> The Workbench currently only supports normal xmi files. There is no
> concept of a collection reader or similar stuff. It would maybe be nice
> for some users if the Workbench could operate on CASs stored in a
> database or on any collection reader.
>
> 4. Actually useful rule induction algorithm
> After about six implementations of supervised rule learners, I think I
> have an idea of the layout of an actually useful algorithm for Ruta. I
> think it's also time to adapt some ideas of semi-supervised machine
> learning for rule-based systems.
>
> 5. Support generic type systems in the Workbench
> Sometimes you cannot avoid specifying the semantics of an annotation in
> the feature values instead of in the type. However, most of the tooling
> will not be as useful then, e.g., the Annotation Browser view shows only
> one type with a lot of annotations. There should be some additional,
> configurable views that support those situations.
>
>
> All opinions or wishes are welcome :-)
>
> Best,
>
> Peter
>