manifoldcf-dev mailing list archives

From "Shinichiro Abe (JIRA)" <>
Subject [jira] [Commented] (CONNECTORS-1219) Lucene Output Connector
Date Sun, 02 Aug 2015 09:41:05 GMT


Shinichiro Abe commented on CONNECTORS-1219:

Committed r1693798 to the branch.
Multiprocess mode now works with HDFS indexes. I've tested the zk and file multiprocess examples.
The HDFS indexes now use one index per processId, because an IndexWriter works per process:
if I make IndexWriters index across processes, IndexWriter throws LockObtainFailedException.
With this layout, removeDocument cannot work properly, because the connection does not know
the processId; it knows only the documentURI. Please advise.
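
To make the problem concrete, here is a minimal sketch of the layout described above. It is
only a model under stated assumptions: plain Java collections stand in for the per-process
HDFS indexes, and the class and method names (PerProcessIndexes, addDocument, contains) are
hypothetical, not the connector's code. It shows that indexing can pick an index by
processId, while removal, which receives only a documentURI, has to look in every index.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical model, not the connector's code: each process writes only to
// its own index, represented here by a set of document URIs.
final class PerProcessIndexes {
    // processId -> URIs indexed by that process
    private final Map<String, Set<String>> indexes = new HashMap<>();

    // Indexing knows the processId, so it can pick the right index.
    void addDocument(String processId, String documentURI) {
        indexes.computeIfAbsent(processId, id -> new HashSet<>()).add(documentURI);
    }

    // Removal receives only the documentURI, so every per-process index has
    // to be checked. Returns how many indexes held the document.
    int removeDocument(String documentURI) {
        int removed = 0;
        for (Set<String> index : indexes.values()) {
            if (index.remove(documentURI)) {
                removed++;
            }
        }
        return removed;
    }

    // True if the index owned by processId currently holds documentURI.
    boolean contains(String processId, String documentURI) {
        Set<String> index = indexes.get(processId);
        return index != null && index.contains(documentURI);
    }

    public static void main(String[] args) {
        PerProcessIndexes indexes = new PerProcessIndexes();
        indexes.addDocument("A", "file:/docs/1.txt");
        indexes.addDocument("B", "file:/docs/2.txt");
        // The caller does not say which process indexed the document.
        System.out.println(indexes.removeDocument("file:/docs/2.txt"));
    }
}
```

The model deliberately ignores locking. With real per-process indexes, a delete issued from
one process against an index whose writer is held by another process would presumably hit
the same lock failure, so this only illustrates the lookup problem, not a fix.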

> Lucene Output Connector
> -----------------------
>                 Key: CONNECTORS-1219
>                 URL:
>             Project: ManifoldCF
>          Issue Type: New Feature
>            Reporter: Shinichiro Abe
>            Assignee: Shinichiro Abe
>         Attachments: CONNECTORS-1219-v0.1patch.patch, CONNECTORS-1219-v0.2.patch, CONNECTORS-1219-v0.3.patch
> An output connector that writes directly to a local Lucene index, not via a remote search
> engine. It would be nice if we could use the various Lucene APIs on the index directly, even
> though we could do the same thing with a Solr or Elasticsearch index. I assume we could do
> things such as classification, categorization, and tagging, using e.g. the
> lucene-classification package.

This message was sent by Atlassian JIRA
