any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (ANY23-304) Add extractor for OpenIE
Date Fri, 24 Feb 2017 01:32:44 GMT


ASF GitHub Bot commented on ANY23-304:

GitHub user lewismc opened a pull request:

    ANY23-304 Add extractor for OpenIE

    Hi Folks,
    This issue is a rework of #33 which takes on board @ansell 's comments to add the new
extractor as a separate module as oppose to inside of core. 
    There are a number of classes which are cleaned up for JDK1.8 compliance.
    In addition, this new functionality augments the default configuration by introducing
a threshold for OpenIE extractions of 0.5. Anything below this value is not converted into
    I run a test extraction on a reasonably testing Webpage from the [PO.DAAC](
but right now i am not asserting anything.
    As far as I can see this is working pretty well but some community review would go a long

You can merge this pull request into a Git repository by running:

    $ git pull ANY23-304

Alternatively you can review and apply these changes as the patch at:

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #34
commit 2ecfbff1dddaf57689b725feddba47c7921f726d
Author: Lewis John McGibbney <>
Date:   2017-02-24T01:26:03Z

    ANY23-304 Add extractor for OpenIE


> Add extractor for OpenIE
> ------------------------
>                 Key: ANY23-304
>                 URL:
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: core, extractors
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>             Fix For: 2.1
> I'm going to start work on an extractor which uses the OpenIE library
> This will provide us with the ability to execute structured extractions from unstructured
content essentially taking Any23 in a new direction.

This message was sent by Atlassian JIRA

View raw message