lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Cao Manh Dat (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SOLR-12278) Ignore very large document on indexing
Date Tue, 01 May 2018 03:14:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-12278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16459412#comment-16459412
] 

Cao Manh Dat edited comment on SOLR-12278 at 5/1/18 3:13 AM:
-------------------------------------------------------------

[~dsmiley] some problem of that approach
 * we have to modify all other parsers,
 * each parser has its own set of parameters, which make the size of a SolrInputDocument quite
different with the number of bytes of the input (ie: SOLR-6304)
 * what happens if the users have some processor in the middle which enriches the SolrInputDocument

In short vision, IgnoreLargeDocumentProcessor might handy for users who need to filter large
documents.


was (Author: caomanhdat):
[~dsmiley] problem of that approach is we have to modify all other parsers, not mention that
each parser has its worn set of parameters.

> Ignore very large document on indexing
> --------------------------------------
>
>                 Key: SOLR-12278
>                 URL: https://issues.apache.org/jira/browse/SOLR-12278
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>            Assignee: Cao Manh Dat
>            Priority: Major
>         Attachments: SOLR-12278.patch, SOLR-12278.patch
>
>
> Solr should be able to ignore very large document, so it won't affect the index as well
as the tlog. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message