manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shinichiro Abe (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CONNECTORS-1219) Lucene Output Connector
Date Fri, 10 Jul 2015 07:00:11 GMT

    [ https://issues.apache.org/jira/browse/CONNECTORS-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14621851#comment-14621851
] 

Shinichiro Abe commented on CONNECTORS-1219:
--------------------------------------------

Thanks Karl, I'll try to change String argument into Reader. But Reader is used if the field
is indexed [but not stored|https://svn.apache.org/viewvc/lucene/dev/branches/lucene_solr_5_2/lucene/core/src/java/org/apache/lucene/document/Field.java?view=markup#l123].
I'll add [StoredField|https://svn.apache.org/viewvc/lucene/dev/branches/lucene_solr_5_2/lucene/core/src/java/org/apache/lucene/document/StoredField.java?view=markup#l47]
when storing value, but there is byte array capacity, approximately 2g. Anyway maxDocumentLength
is required, I think.

> Lucene Output Connector
> -----------------------
>
>                 Key: CONNECTORS-1219
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1219
>             Project: ManifoldCF
>          Issue Type: New Feature
>            Reporter: Shinichiro Abe
>            Assignee: Shinichiro Abe
>         Attachments: CONNECTORS-1219-v0.1patch.patch, CONNECTORS-1219-v0.2.patch
>
>
> A output connector for Lucene local index directly, not via remote search engine. It
would be nice if we could use Lucene various API to the index directly, even though we could
do the same thing to the Solr or Elasticsearch index. I assume we can do something to classification,
categorization, and tagging, using e.g lucene-classification package.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message