manifoldcf-dev mailing list archives

From "Aingaran Pillai (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (CONNECTORS-1219) Lucene Output Connector
Date Thu, 16 Jul 2015 19:37:04 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14630218#comment-14630218 ]

Aingaran Pillai edited comment on CONNECTORS-1219 at 7/16/15 7:36 PM:
----------------------------------------------------------------------

[~shinichiro abe] for a low latency crawl solution you may want to look at Apache Storm. Here's
a pull crawler implementation based on Apache Storm: https://github.com/DigitalPebble/storm-crawler.
It doesn't do permissions though. 


was (Author: apillaiz):
[~shinichiro abe] for a low latency crawl solution you may want to look at Apache Storm. Here's
an pull crawler implementation based on Apache Storm: https://github.com/DigitalPebble/storm-crawler.
It doesn't do permissions though. 

> Lucene Output Connector
> -----------------------
>
>                 Key: CONNECTORS-1219
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1219
>             Project: ManifoldCF
>          Issue Type: New Feature
>            Reporter: Shinichiro Abe
>            Assignee: Shinichiro Abe
>         Attachments: CONNECTORS-1219-v0.1patch.patch, CONNECTORS-1219-v0.2.patch, CONNECTORS-1219-v0.3.patch
>
>
> An output connector that writes to a local Lucene index directly, not via a remote search engine.
> It would be nice if we could use Lucene's various APIs on the index directly, even though we could
> do the same thing with a Solr or Elasticsearch index. I assume we could do things such as classification,
> categorization, and tagging, using e.g. the lucene-classification package.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
