Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 58814 invoked from network); 16 Jan 2011 12:51:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Jan 2011 12:51:51 -0000 Received: (qmail 8986 invoked by uid 500); 16 Jan 2011 12:51:48 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 8877 invoked by uid 500); 16 Jan 2011 12:51:45 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 8869 invoked by uid 99); 16 Jan 2011 12:51:44 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 16 Jan 2011 12:51:44 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.210.176] (HELO mail-iy0-f176.google.com) (209.85.210.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 16 Jan 2011 12:51:37 +0000 Received: by iyb26 with SMTP id 26so4524879iyb.35 for ; Sun, 16 Jan 2011 04:51:15 -0800 (PST) MIME-Version: 1.0 Received: by 10.231.12.68 with SMTP id w4mr2940989ibw.50.1295182275652; Sun, 16 Jan 2011 04:51:15 -0800 (PST) Received: by 10.231.79.137 with HTTP; Sun, 16 Jan 2011 04:51:15 -0800 (PST) X-Originating-IP: [111.88.48.107] In-Reply-To: <274082.91665.qm@web130101.mail.mud.yahoo.com> References: <274082.91665.qm@web130101.mail.mud.yahoo.com> Date: Sun, 16 Jan 2011 17:51:15 +0500 Message-ID: Subject: Re: TVF file From: Salman Akram To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=00032557474e75ab7b0499f61f10 X-Virus-Checked: Checked by ClamAV on apache.org --00032557474e75ab7b0499f61f10 Content-Type: text/plain; charset=ISO-8859-1 Nops. I optimized it with Standard File Format and cleaned up Index dir through Luke. It adds upto to the total size when I optimized it with Compound File Format. On Sun, Jan 16, 2011 at 5:46 PM, Otis Gospodnetic < otis_gospodnetic@yahoo.com> wrote: > Is it possible that the tvf file you are looking at is old (i.e. not part > of > your active index)? > > Otis > ---- > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > ----- Original Message ---- > > From: Salman Akram > > To: solr-user@lucene.apache.org > > Sent: Sun, January 16, 2011 6:17:23 AM > > Subject: Re: TVF file > > > > Some more info I copied it from Luke and below is what it says for... > > > > Text Fields --> stored/uncompressed,indexed,tokenized > > String Fields --> stored/uncompressed,indexed,omitTermFreqAndPositions > > > > The main contents field is not stored so it doesn't show up on Luke but > that > > is Analyzed and Tokenized for searching. > > > > On Sun, Jan 16, 2011 at 3:50 PM, Salman Akram < > > salman.akram@northbaysolutions.net> wrote: > > > > > Hi, > > > > > > From my understanding TVF file stores the Term Vectors > (Positions/Offset) > > > so if no field has Field.TermVector set (default is NO) so it > shouldn't be > > > created, right? > > > > > > I have an index created through SOLR on which no field had any value > for > > > TermVectors so by default it shouldn't be saved. All the fields are > either > > > String or Text. All fields have just indexed and stored attributes set > to > > > True. String fields have omitNorms = true as well. > > > > > > Even in Luke it doesn't show V (Term Vector) flag but I have a big TVF > file > > > in my index. Its almost 30% of the total index (around 60% is the PRX > > > positions file). > > > > > > Also in Luke it shows 'f' (omitTF) flag for strings but not for text > > > fields. > > > > > > Any ideas what's going on? Thanks! > > > > > > -- > > > Regards, > > > > > > Salman Akram > > > Senior Software Engineer - Tech Lead > > > 80-A, Abu Bakar Block, Garden Town, Pakistan > > > Cell: +92-321-4391210 > > > > > > > > > > > -- > > Regards, > > > > Salman Akram > > > > > -- Regards, Salman Akram --00032557474e75ab7b0499f61f10--