lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Positions files analysis
Date Tue, 28 Jun 2016 05:26:35 GMT
Positions are necessary if you need to do "phrase searches".
If that's not necessary, simply turn that option off in your
schema for the fields where it's unnecessary. See the reference
guide for
termVectors
termPositions
termOffsets

I'm really not sure what you're asking by:
"Is there a way I can read/analyze index files as .pos?"

The various file extensions are a result of the options you
define on your fields, that's just the way Lucene works...

Best,
Erick

On Mon, Jun 27, 2016 at 7:25 AM, asteiner <asteiner@varonis.com> wrote:
> Hi
>
> I have a very large index and I'd like to see how can I reduce it.
> Some of the largest files in the index are the .pos files (positions).
> There are many excel files indexed with formulas, so I suspect that a large
> part of the index is used by junk terms as very long numbers.
> Is there a way I can read/analyze index files as .pos?
>
>
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Positions-files-analysis-tp4284485.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message