any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ANY23-248) NTriplesWriter on hadoop : issue with MIME type
Date Tue, 03 Feb 2015 23:06:35 GMT

    [ https://issues.apache.org/jira/browse/ANY23-248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14304267#comment-14304267
] 

Lewis John McGibbney commented on ANY23-248:
--------------------------------------------

bq. Hi Lewis, I did not understand your question.
Sorry I should have been clearer. If you were able to send us a pull request to upgrade the
sesame dependencies within Any23 to the latest including upgrading the API to accommodate
the new dependencies then it would be excellent. Thanks in advance for any PR's that come
to Any23.

> NTriplesWriter on hadoop : issue with MIME type
> -----------------------------------------------
>
>                 Key: ANY23-248
>                 URL: https://issues.apache.org/jira/browse/ANY23-248
>             Project: Apache Any23
>          Issue Type: Bug
>    Affects Versions: 1.1
>         Environment: hadoop,linux
>            Reporter: Souri
>            Priority: Minor
>             Fix For: 1.2
>
>
> I am trying to create n-triples from an html string. I am using the following code to
do it:
> StringDocumentSource documentSource = new StringDocumentSource(html, null);
>             ByteArrayOutputStream out = new ByteArrayOutputStream();
>             final NTriplesWriter tripleHandler = new NTriplesWriter(out);
>             Any23 runner = new Any23();
>            
>             runner.extract(documentSource,tripleHandler);
>             tripleHandler.close();
>             String result = out.toString("us-ascii");
>             return result;
> This is giving me the error :
> java.lang.NullPointerException
> 	at org.apache.any23.extractor.SingleDocumentExtraction.filterExtractorsByMIMEType(SingleDocumentExtraction.java:421)
> 	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:223)
> 	at org.apache.any23.Any23.extract(Any23.java:298)
> 	at org.apache.any23.Any23.extract(Any23.java:433)
> I am running this in hadoop. When I run locally with a single file it works, but doesn't
work when run on hadoop.
> Can someone please tell me how to go about this issue?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message