lucene-dev mailing list archives

From "Cao Manh Dat (JIRA)" <>
Subject [jira] [Comment Edited] (SOLR-12278) Ignore very large document on indexing
Date Tue, 01 May 2018 03:14:00 GMT


Cao Manh Dat edited comment on SOLR-12278 at 5/1/18 3:13 AM:

[~dsmiley] some problems with that approach:
 * we would have to modify all the other parsers,
 * each parser has its own set of parameters, which can make the size of a SolrInputDocument quite
different from the number of bytes of the input (e.g. SOLR-6304),
 * what happens if users have a processor in the middle of the chain that enriches the SolrInputDocument?

In the short term, IgnoreLargeDocumentProcessor might be handy for users who need to filter large
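
The points above can be illustrated with a small, language-neutral sketch. It is not Solr code and does not use Solr's API: `estimate_size`, `enrich`, and `filter_large` are hypothetical names, and the size heuristic is an assumption chosen only to show why a check on the parsed (and possibly enriched) document can disagree with a check on the raw input bytes.

```python
import json


def estimate_size(value):
    """Rough recursive estimate, in bytes, of a parsed document value (assumed heuristic)."""
    if value is None:
        return 0
    if isinstance(value, str):
        return len(value.encode("utf-8"))
    if isinstance(value, (bytes, bytearray)):
        return len(value)
    if isinstance(value, bool):  # checked before int, since bool is a subclass of int
        return 1
    if isinstance(value, (int, float)):
        return 8
    if isinstance(value, dict):
        return sum(estimate_size(k) + estimate_size(v) for k, v in value.items())
    if isinstance(value, (list, tuple, set)):
        return sum(estimate_size(v) for v in value)
    return len(str(value).encode("utf-8"))


def enrich(doc):
    """Hypothetical mid-chain processor that copies the body into a second field."""
    enriched = dict(doc)
    enriched["body_copy"] = doc.get("body", "")
    return enriched


def filter_large(docs, limit_bytes):
    """Keep documents within limit_bytes; collect the ids of the ones that are ignored."""
    kept, ignored = [], []
    for doc in docs:
        if estimate_size(doc) > limit_bytes:
            ignored.append(doc.get("id"))
        else:
            kept.append(doc)
    return kept, ignored


# The raw input is under the limit, but the enriched document is not.
raw = '{"id": "1", "body": "' + "x" * 600 + '"}'
doc = json.loads(raw)
input_bytes = len(raw.encode("utf-8"))
enriched = enrich(doc)
kept, ignored = filter_large([enriched], limit_bytes=1000)
```

Here a limit applied to `input_bytes` at parse time would let the document through, while the same limit applied after `enrich` ignores it, which is the case a check inside the processor chain is meant to cover.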

was (Author: caomanhdat):
[~dsmiley] problem of that approach is we have to modify all other parsers, not to mention that
each parser has its own set of parameters.

> Ignore very large document on indexing
> --------------------------------------
>                 Key: SOLR-12278
>                 URL:
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>            Assignee: Cao Manh Dat
>            Priority: Major
>         Attachments: SOLR-12278.patch, SOLR-12278.patch
> Solr should be able to ignore very large documents, so they won't affect the index or
> the tlog.

This message was sent by Atlassian JIRA
