Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ECA37EFA3 for ; Tue, 12 Feb 2013 11:19:54 +0000 (UTC) Received: (qmail 22725 invoked by uid 500); 12 Feb 2013 11:19:51 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 22390 invoked by uid 500); 12 Feb 2013 11:19:51 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 22347 invoked by uid 99); 12 Feb 2013 11:19:49 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Feb 2013 11:19:49 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,SPF_SOFTFAIL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: softfail (nike.apache.org: transitioning domain of peter0589@hotmail.com does not designate 216.139.236.26 as permitted sender) Received: from [216.139.236.26] (HELO sam.nabble.com) (216.139.236.26) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Feb 2013 11:19:42 +0000 Received: from ben.nabble.com ([192.168.236.152]) by sam.nabble.com with esmtp (Exim 4.72) (envelope-from ) id 1U5DtS-0001rE-Ag for solr-user@lucene.apache.org; Tue, 12 Feb 2013 03:19:22 -0800 Date: Tue, 12 Feb 2013 03:19:22 -0800 (PST) From: Macroman To: solr-user@lucene.apache.org Message-ID: <1360667962318-4039908.post@n3.nabble.com> In-Reply-To: <1552284961.20130207123231@alud.com.pl> References: <1360235993798-4038961.post@n3.nabble.com> <1552284961.20130207123231@alud.com.pl> Subject: Re: Maximum Number of Records In Index MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Our document ID's are most definately distinct and there are partial updates to existing records, I have run SQL queries outside of SOLR to validate records going in and only about 1% are updates to existing records. There are no deletes underway every day new records are added or updated. Example for today. Before Data Handler ran, 13,586,537 records in SOLR all distinct ID's. Records extracted from 7 different sources to go into SOLR index was , 45,345, of these 1,912 were updates to existing records. Thus 43,433 were new records each with a new ID. I made sure ID's we always distinct. Yet our index now says 13,589,646. Indicating that only 3,109 new records went into the index. Thus missing 40,324 records. I use Date Facet Range and can see that there is an increase for January and February this year. In conclusion I have to say that it must be removing earlier records somehow despite no knowing where this may be controlled/set if at all. If there is a possible configuration to remove or weed records where is this configured? Our SOLR is virtually out of the box and only SOLCONFIG and SCHEMA amended to suit the needs of our business for fields and field types indexed. We also have configured the macro.s "Velocity" to display results. So none the wiser and thank you to all whom have responded so far. -- View this message in context: http://lucene.472066.n3.nabble.com/Maximum-Number-of-Records-In-Index-tp4038961p4039908.html Sent from the Solr - User mailing list archive at Nabble.com.