Return-Path: Delivered-To: apmail-incubator-uima-user-archive@locus.apache.org Received: (qmail 17523 invoked from network); 11 Jun 2008 13:01:52 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Jun 2008 13:01:52 -0000 Received: (qmail 40681 invoked by uid 500); 11 Jun 2008 13:01:55 -0000 Delivered-To: apmail-incubator-uima-user-archive@incubator.apache.org Received: (qmail 40464 invoked by uid 500); 11 Jun 2008 13:01:54 -0000 Mailing-List: contact uima-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: uima-user@incubator.apache.org Delivered-To: mailing list uima-user@incubator.apache.org Received: (qmail 40453 invoked by uid 99); 11 Jun 2008 13:01:54 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 11 Jun 2008 06:01:54 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of hirohanin@gmail.com designates 64.233.178.247 as permitted sender) Received: from [64.233.178.247] (HELO hs-out-0708.google.com) (64.233.178.247) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 11 Jun 2008 13:01:04 +0000 Received: by hs-out-0708.google.com with SMTP id 23so1191583hsn.12 for ; Wed, 11 Jun 2008 06:01:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type:references; bh=bOm+dgDOy4acO0JgVS2aWRcAucMpnep1SN5Fpec70iA=; b=EkbqvE2mPPeW4Q0k1v22I53M9t0OoOywrg/F4KZ2NJJ+1FkDIhnuc4vJhPI6oWNIxm WEf3qauvwdLUfz2d6Z4PwImgqyw9s47CN5tLHqjJVGVfuw2Ruo3hHi/kwl498Q53Sx2g HNWzzRQfBIq4dl3K5ikLTDtNzKHKJJXxduVns= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:references; b=Kb3TNwQbymb5k0l1dYVeEDY0XghVaibG10gNpZ4+sJCvGbhBZHMlVrt1QTv+D4VIHk OeslztsoLht2UqAtIp3OXsr8mF0yHRbbd/I7pG6fsF3EZqtSiPkfYP/OefZIh3g8yb+V 7ziZU6S1q7zeJIos36jWETEAGiN7ip+DnbGuc= Received: by 10.151.14.4 with SMTP id r4mr180162ybi.199.1213189282098; Wed, 11 Jun 2008 06:01:22 -0700 (PDT) Received: by 10.150.220.15 with HTTP; Wed, 11 Jun 2008 06:01:22 -0700 (PDT) Message-ID: <54c3312d0806110601o70f27f1an8bf33ca9ef6a3dc4@mail.gmail.com> Date: Wed, 11 Jun 2008 18:31:22 +0530 From: "rohan rai" To: uima-user@incubator.apache.org Subject: Re: import location over Hadoop In-Reply-To: <484FC759.60506@gmx.de> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_12621_6193000.1213189282081" References: <54c3312d0806110231k2b87c7b6nd5b674d26145b643@mail.gmail.com> <484FBA28.6070303@michael-baessler.de> <54c3312d0806110459l4a5e779dt2b54f46d87790eb7@mail.gmail.com> <484FC759.60506@gmx.de> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_12621_6193000.1213189282081 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Thanks Thilo. Well If do that all sorts of invalid xml exception is getting thrown org.apache.uima.util.InvalidXMLException: Invalid descriptor at . at org.apache.uima.util.impl.XMLParser_impl.parse(XMLParser_impl.java:193) at org.apache.uima.util.impl.XMLParser_impl.parseResourceSpecifier(XMLParser_impl.java:365) at org.apache.uima.util.impl.XMLParser_impl.parseResourceSpecifier(XMLParser_impl.java:346) at org.ziva.dq.hadoop.DQHadoopMain$Map.dQFile(DQHadoopMain.java:45) at org.ziva.dq.hadoop.DQHadoopMain$Map.map(DQHadoopMain.java:37) at org.ziva.dq.hadoop.DQHadoopMain$Map.map(DQHadoopMain.java:1) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:208) at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2084) Caused by: org.xml.sax.SAXParseException: Content is not allowed in prolog. at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1231) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:522) at org.apache.uima.util.impl.XMLParser_impl.parse(XMLParser_impl.java:176) ... 8 more org.apache.uima.util.InvalidXMLException: Invalid descriptor at . at org.apache.uima.util.impl.XMLParser_impl.parse(XMLParser_impl.java:193) at org.apache.uima.util.impl.XMLParser_impl.parseResourceSpecifier(XMLParser_impl.java:365) at org.apache.uima.util.impl.XMLParser_impl.parseResourceSpecifier(XMLParser_impl.java:346) at org.ziva.dq.hadoop.DQHadoopMain$Map.dQFile(DQHadoopMain.java:45) at org.ziva.dq.hadoop.DQHadoopMain$Map.map(DQHadoopMain.java:37) at org.ziva.dq.hadoop.DQHadoopMain$Map.map(DQHadoopMain.java:1) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:208) at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2084) Caused by: org.xml.sax.SAXParseException: Content is not allowed in prolog. at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1231) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:522) at org.apache.uima.util.impl.XMLParser_impl.parse(XMLParser_impl.java:176) On Wed, Jun 11, 2008 at 6:08 PM, Thilo Goetz wrote: > You need to use import by name instead of import > by location in your descriptor. Then things get > loaded via the classpath and you should be ok > (provided that you stick your descriptors in the > jar of course). I suggest you test this locally > first by moving your application to a different > machine where you don't have any descriptors > lying around. It'll be easier to debug than in > hadoop. > > --Thilo > > > rohan rai wrote: > >> Well the question is for running UIMA over hadoop? How to do that as in >> UIMA >> there are xml descriptors which have relative urls and location? Which >> throws exception >> >> But I can probably do without that answer >> >> Simplifying the problem >> >> I create a jar for my application and I am trying to run a map reduce job >> >> In the map I am trying to read an xml resource which gives this kind of >> exceprion >> >> java.io.FileNotFoundException: >> >> /tmp/hadoop-root/mapred/local/taskTracker/jobcache/job_200806102252_0028/task_200806102252_0028_m_000000_0/./descriptors/annotators/RecordCandidateAnnotator.xml >> (No such file or directory) >> at java.io.FileInputStream.open(Native Method) >> at java.io.FileInputStream.(FileInputStream.java:106) >> at java.io.FileInputStream.(FileInputStream.java:66) >> at >> sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:70) >> at >> sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:161) >> at java.net.URL.openStream(URL.java:1009) >> at >> org.apache.uima.util.XMLInputSource.(XMLInputSource.java:83) >> >> I think I require to pass on the content of the jar which contains the >> resource xml and classes(other than the JOB class) to each and every >> taskXXXXXXX getting created >> >> How can I do that >> >> REgards >> Rohan >> >> >> >> >> On Wed, Jun 11, 2008 at 5:12 PM, Michael Baessler < >> mba@michael-baessler.de> >> wrote: >> >> rohan rai wrote: >>> >>>> Hi >>>> A simple thing such as a name annotator which has an import location of >>>> type starts throwing exception when I create a jar of the application I >>>> >>> am >>> >>>> developing and run over hadoop. >>>> >>>> If I have to do it a java class file then I can use XMLInputSource in = >>>> >>> new >>> >>> XMLInputSource(ClassLoader.getSystemResourceAsStream(aeXmlDescriptor),null); >>> >>>> But the relative paths in annotators, analysis engines etc starts >>>> >>> throwing >>> >>>> exception >>>> >>>> Please Help >>>> >>>> Regards >>>> Rohan >>>> >>>> I'm not sure I understand your question, but I think you need some help >>> with the exceptions you get. >>> Can you provide the exception stack trace? >>> >>> -- Michael >>> >>> >> ------=_Part_12621_6193000.1213189282081--