From user-return-8028-archive-asf-public=cust-asf.ponee.io@uima.apache.org Wed Feb 6 15:42:17 2019 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id D84D4180679 for ; Wed, 6 Feb 2019 16:42:16 +0100 (CET) Received: (qmail 42997 invoked by uid 500); 6 Feb 2019 15:42:15 -0000 Mailing-List: contact user-help@uima.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@uima.apache.org Delivered-To: mailing list user@uima.apache.org Received: (qmail 42986 invoked by uid 99); 6 Feb 2019 15:42:15 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Feb 2019 15:42:15 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 40B33180CE3 for ; Wed, 6 Feb 2019 15:42:14 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1 X-Spam-Level: * X-Spam-Status: No, score=1 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_NONE=-0.0001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id hWDIUnpkqQMZ for ; Wed, 6 Feb 2019 15:42:12 +0000 (UTC) Received: from mout.kundenserver.de (mout.kundenserver.de [212.227.17.24]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id A459560FA4 for ; Wed, 6 Feb 2019 15:42:12 +0000 (UTC) Received: from [192.168.11.106] ([212.60.243.34]) by mrelayeu.kundenserver.de (mreue109 [212.227.15.183]) with ESMTPSA (Nemesis) id 1M7auJ-1glfF0100b-0082V8 for ; Wed, 06 Feb 2019 16:42:11 +0100 Subject: Re: Issues with Ruta workbench (Permission Denied and wrong output view) To: user@uima.apache.org References: <0bf782fa-547c-f2c1-6ff8-2169601c1d54@th-koeln.de> <2ddd436d-43df-2984-f3a3-fce15a736ac6@schor.com> <4666ab55-ab52-e330-3e42-09b5857f2254@th-koeln.de> From: =?UTF-8?Q?Peter_Kl=c3=bcgl?= Openpgp: preference=signencrypt Message-ID: <43622d3d-8337-e0a0-bc0b-3065b2ef532d@averbis.com> Date: Wed, 6 Feb 2019 16:42:10 +0100 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.5.0 MIME-Version: 1.0 In-Reply-To: <4666ab55-ab52-e330-3e42-09b5857f2254@th-koeln.de> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Provags-ID: V03:K1:Jupa/0rqMzxaFpZ4EZ7ux/JrOOQgxXUHVgGBxRU/tS4iFpYFQb7 e9/er2j0U7KZCcNMXq6JUwJaWstu9rdpULzeCzcP1Fqd0BwPHEDbJ4ycsNoXfSUb7ipFFhE /j+CRTIgrmVNclM5mR2kIGtb8LuR8e9Q/pFNPCYY+4eIkZhpJYFVzitdqzWc+RNQlUVUqCT UUeuHEhS+0SosDqNdC79A== X-UI-Out-Filterresults: notjunk:1;V03:K0:jhWXgYMSin4=:LWPENUha5Bzn3/b+rRyJ7T vghiCoAMgGvn8rkeoUU3Fphum5DmmxNNKflq2d8PjdoUvQjAJ5/5P/ZM0T7CBd1Ec+GS2Z1vQ ac4QyK8IvzNqd+JrbRnYL46cBhupDtcRCD9fOSQHGgENQRTfKW+ElGebTMb2oRadOWv9lHY6z AKgepCPt3bUZ/ZUwNM91mjMpO+gA468h2628u0Cjp50buUJMybE2dD7VPEMQ1CuCVH/7R2zdm qy6ylWR8VhGqGeOagqT3O7m175M8uH55Es+tu+ZsUWbDRfQNhIAxfDTgdo2KOCAsh/S/MBIEs q9FdotvsJbYO6H623QIgSAsQvUN9j8h2HVegB5Jd1shex3l1n1fh3ZJX6cbEWYkH+41jYxX+l mjwvhe5eN3t3sZ4NgrIhP9vM5b+zWncikOOk8xQWorlz6cizW6BPQ8PMdD4PBvDZV25BeVJrT F/t1iLBdWmIFkh2lmplC+39f9RsMVwo8gBmPUkWzSRWbSE/4rrXgxO/z1/ddqEVSOVtddFORe l9zvssX1SiisyCHtvvyMsydtiSlxE3N9yXHe8YW7uLhZTKyeW29uzweJ5aDrKllbNbWbfT7da BUuTYnd/DNG89/Ui+dQAO2EBdftiZDEUcDtv0ealGpSV27xW5Wk3ZHhQQwvSMY4h4rnBemo5+ KeX95TyR1mzMApv3oKT45NiX78yhRyuKayc50mx6tYm4q8tYfdjNlE7hTlXa1bFus1zFrTy3F ls8aDSIm7jxz2R+fORGC12v2nNQfFHPplw9Oy7e6YiIDPArf0Q3/xRVBlVDnwcYiq2Xk9XgSP G1tgvJ8 Hi, does the plain vs _InitialView problem occur in the CASes in the output folder or in the converted folder? "output" should contain the result of the script processing. The _InitialView is set by the launcher, it's static and cannot be changed. "converted" should contain additional CASes where the plain view is copied to the _InitialView, which hasn't been set yet. (Although I think that I have written those rules as an example some time ago, I personally prefer to perform the HTML conversion in Java) Best, Peter Am 06.02.2019 um 16:18 schrieb Mandy Neumann: > Hi, > > after some additional digging I found this setting in the workbench > preferences where SourceDocumentInformation is used for the output > parameter. This seems to have fixed the permission issue, I get no > more exceptions. > > Unfortunately, the problem with plain vs. _InitialView still persists, > which is kind of annoying. Any ideas on that? (I'd like to also make > sure that this is not causing any further problems in my planned > workflow.) > > Best, > > Mandy > > Am 06.02.19 um 15:40 schrieb Marshall Schor: >> hi, >> >> I'm not an expert, but I'm guessing that there still is a permissions >> issue, >> perhaps on a different file or directory than the one you checked. >> >> Try having someone else take a look at your stack trace / error >> message, and >> your file system permissions.  A second pair of eyes often is helpful >> (I speak >> from personal experience). >> >> Cheers. -Marshall >> >> On 2/6/2019 5:44 AM, Mandy Neumann wrote: >>> Hi all, >>> >>> I'm just starting to get familiar with UIMA Ruta and the workbench, >>> and I'm >>> having some strange issues. >>> >>> I got a project from a co-worker who already prepared some scripts >>> for me to >>> extend. The project has .html files in the input folder, and he already >>> provided a Ruta script to convert HTML markup into annotations. The >>> script is >>> adapted from the Ruta manual: >>> >>>> ENGINE utils.HtmlAnnotator; >>>> ENGINE utils.HtmlConverter; >>>> ENGINE HtmlViewWriter; >>>> TYPESYSTEM utils.HtmlTypeSystem; >>>> TYPESYSTEM utils.SourceDocumentInformation; >>>> >>>> Document{->CONFIGURE(HtmlAnnotator, "onlyContent"=true), >>>> EXEC(HtmlAnnotator, >>>> {TAG})}; >>>> >>>> Document { -> CONFIGURE(HtmlConverter, "inputView" = "_InitialView", >>>>      "outputView" = "plain", "expandOffsets"=false, >>>> "replaceLinebreaks"=true, >>>> "skipWhitespacs"=true, "linebreakReplacement"=" ", "processAll"=true), >>>>        EXEC(HtmlConverter)}; >>>> >>>> Document{ -> CONFIGURE(HtmlViewWriter, "inputView" = "plain", >>>>      "outputView" = "_InitialView", "output" = "../converted"), >>>>      EXEC(HtmlViewWriter)}; >>> On my machine and with my settings, when I run this script, my >>> console get >>> spammed with >>> org.apache.uima.analysis_engine.AnalysisEngineProcessExceptions >>> caused by java.io.FileNotFoundException >>>   with the message "../converted (Permission denied)". I checked the >>> file >>> permissions on this directory which were 775 - I even chmodded to >>> 777 but >>> still the same issue. >>> >>> In spite of all these exceptions, the output still gets generated, >>> though. I >>> would be fine with it if there weren't another issue - although the >>> script >>> should write the annotations into _InitialView, I need to change the >>> view to >>> "plain" in the editor to get plain text with HTML annotations. The >>> _InitialView still shows the html markup. >>> >>> I think both issues are related. Any ideas? >>> >>> Cheers, >>> >>> Mandy >>> >>> >>> System Info: eclipse Oxygen.3a Release (4.7.3a), UIMA Ruta workbench >>> 2.6.1, OS >>> Kubuntu 18.04 >>> >>> -- Dr. Peter Klügl R&D Text Mining/Machine Learning Averbis GmbH Tennenbacher Str. 11 79106 Freiburg Germany Fon: +49 761 708 394 0 Fax: +49 761 708 394 10 Email: peter.kluegl@averbis.com Web: https://averbis.com Headquarters: Freiburg im Breisgau Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080 Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó