Return-Path: X-Original-To: apmail-lucenenet-user-archive@www.apache.org Delivered-To: apmail-lucenenet-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8D9DAF02E for ; Thu, 21 Mar 2013 22:55:26 +0000 (UTC) Received: (qmail 30113 invoked by uid 500); 21 Mar 2013 22:55:26 -0000 Delivered-To: apmail-lucenenet-user-archive@lucenenet.apache.org Received: (qmail 30032 invoked by uid 500); 21 Mar 2013 22:55:26 -0000 Mailing-List: contact user-help@lucenenet.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@lucenenet.apache.org Delivered-To: mailing list user@lucenenet.apache.org Received: (qmail 30024 invoked by uid 99); 21 Mar 2013 22:55:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 Mar 2013 22:55:26 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of rob.cecil@gmail.com designates 209.85.210.169 as permitted sender) Received: from [209.85.210.169] (HELO mail-ia0-f169.google.com) (209.85.210.169) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 Mar 2013 22:55:20 +0000 Received: by mail-ia0-f169.google.com with SMTP id j5so3011900iaf.14 for ; Thu, 21 Mar 2013 15:54:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=toOB2kvfjnLXnugvawLkiGKGxJ7Wl0oeTiO5TBRAfVQ=; b=ChnVzeguQWzyyQq/HzE3ojYsv8QU09zNeD8ABd8Jlme7gVw3wh4MLdxthXSlJf8Q/A APUSirrXInck0PBXB4+zjMjNG8uJ+hG705HBZhlAOjcY+U83F/ljJdvrZN6pL2zKF6vd q6EtxDOUL1S92ZNlcA6ayZ5d6XdmmVONB50Tv62/2sSKyZVJfF1d+WzcfUUBDPoflKjP fmK4r9VGWRBLa/ZhZxEeH/VMd92lvEf/Q/ixDMhe2UUVbRL794apRnxNR3tSCo0zfWDF bxTLS5JAckG0rBEFjU+nujBBk1Y2O1v92Z/74OefpuoSnNxI1KjgT6mzw4Sa6bVV0a1f n4aA== X-Received: by 10.50.186.134 with SMTP id fk6mr3458224igc.9.1363906499323; Thu, 21 Mar 2013 15:54:59 -0700 (PDT) MIME-Version: 1.0 Received: by 10.42.44.196 with HTTP; Thu, 21 Mar 2013 15:54:39 -0700 (PDT) In-Reply-To: References: From: Rob Cecil Date: Thu, 21 Mar 2013 18:54:39 -0400 Message-ID: Subject: Re: Unsubscribe To: user@lucenenet.apache.org Content-Type: multipart/alternative; boundary=14dae9340f2d66366104d8773aa5 X-Virus-Checked: Checked by ClamAV on apache.org --14dae9340f2d66366104d8773aa5 Content-Type: text/plain; charset=ISO-8859-1 Sucks. Who are these Apache guys again? New to the Internets? :) On Thu, Mar 21, 2013 at 6:47 PM, Rob Vesse wrote: > The Lucene.Net website is really kinda poor in many regards > > Following the general Apache mailing list instructions from > http://www.apache.org/foundation/mailinglists.html should work > > So for this list send an email to user-unsubscribe@lucenenet.apache.org, > if you are also on the dev list send one to > dev-unsubscribe@lucenenet.apache.org > > Rob > > On 3/21/13 2:54 PM, "Kyle Jones" wrote: > > >Rob, > > > >Actually, that email address is broken. I'm not sure there is currently a > >documented way to unsubscribe. > > > >- Kyle > > > > > >On Thu, Mar 21, 2013 at 2:21 PM, Rob Cecil wrote: > > > >> Bart, > >> > >> How about reading http://lucenenet.apache.org/community.html ?? > >> > >> "To subscribe to the mailing lists, send an email to *list*- > >> subscribe@lucenenet.apache.org. To unsubscribe, send an email to > *list*- > >> unsubscribe@lucenenet.apache.org." > >> > >> Did you try that? > >> > >> > >> On Thu, Mar 21, 2013 at 3:02 PM, wrote: > >> > >> > Unsubscribe me from everything for the eight time > >> > > >> > > >> > > >> > Sent from Windows Mail > >> > > >> > > >> > From: Anthony Rodriguez > >> > Sent: March 18, 2013 8:10 PM > >> > To: user@lucenenet.apache.org > >> > Subject: lucene.net 3.0.3 indexing spatial too slow > >> > > >> > > >> > I have recently upgraded my search code from lucene.net 2.9.4 to > >>3.0.3. > >> I > >> > have noticed a change in the spatial packages and have updated my code > >> > accordingly. One drawback from the upgrade that I have noticed is much > >> > slower index times. Through process of elimination, I have been able > >>to > >> > narrow the slowness down to the new spatial code that indexes the > >> lat/long > >> > coordinates: > >> > public void AddLocation (double lat, double lng) > >> > { > >> > try > >> > { > >> > string latLongKey = lat.ToString() + "," + lng.ToString(); > >> > AbstractField[] shapeFields = null; > >> > Shape shape = null; > >> > if (HasSpatialShapes(latLongKey)) > >> > { > >> > shape = SpatialShapes[latLongKey]; > >> > } > >> > else > >> > { > >> > if (this.Strategy is BBoxStrategy) > >> > { > >> > shape = > >> > Context.MakeRectangle(DistanceUtils.NormLonDEG(lng), > >> > DistanceUtils.NormLonDEG(lng), DistanceUtils.NormLatDEG(lat), > >> > DistanceUtils.NormLatDEG(lat)); > >> > } > >> > else > >> > { > >> > shape = > >> > Context.MakePoint(DistanceUtils.NormLonDEG(lng), > >> > DistanceUtils.NormLatDEG(lat)); > >> > } > >> > > >> > AddSpatialShapes(latLongKey, shape); > >> > } > >> > > >> > shapeFields = Strategy.CreateIndexableFields(shape); > >> > //Potentially more than one shape in this field is > >>supported > >> > by some > >> > // strategies; see the javadocs of the SpatialStrategy > >>impl > >> to > >> > see. > >> > foreach (AbstractField f in shapeFields) > >> > { > >> > _document.Add(f); > >> > } > >> > //add lat long values to index too > >> > _document.Add(GetField("latitude", > >> > NumericUtils.DoubleToPrefixCoded(lat), Field.Index.NOT_ANALYZED, > >> > Field.Store.YES, 0f, false)); > >> > _document.Add(GetField("longitude", > >> > NumericUtils.DoubleToPrefixCoded(lng), Field.Index.NOT_ANALYZED, > >> > Field.Store.YES, 0f, false)); > >> > } > >> > catch (Exception e) > >> > { > >> > > >> > > >> > >>RollingFileLogger.Instance.LogException(ServiceConstants.SERVICE_INDEXER_ > >>CONST, > >> > "Document",string.Format("AddLocation({0},{1})", lat.ToString(), > >> > lng.ToString()), e, null); > >> > throw e; > >> > } > >> > } > >> > > >> > With 2.9.4, I was able to index about 300,000 rows of data with > >>lat/lng > >> > points in about 11 minutes. With this new spatial package it takes > >> upwards > >> > of 5 hours (I've killed the test before it finishes so I don't have an > >> > exact timing for it). Here is the spatial context/strategy I am using: > >> > > >> > > >> > public static SpatialContext SpatialContext > >> > > >> > { > >> > > >> > get > >> > > >> > { > >> > > >> > if (null == _spatialContext) > >> > > >> > { > >> > > >> > lock (_lockObject) > >> > > >> > { > >> > > >> > if(null==_spatialContext) _spatialContext = > >> > SpatialContext.GEO; > >> > > >> > } > >> > > >> > } > >> > > >> > return _spatialContext; > >> > > >> > } > >> > > >> > } > >> > > >> > > >> > > >> > public static SpatialStrategy SpatialStrategy > >> > > >> > { > >> > > >> > get > >> > > >> > { > >> > > >> > if (null == _spatialStrategy) > >> > > >> > { > >> > > >> > lock (_lockObject) > >> > > >> > { > >> > > >> > if (null == _spatialStrategy) > >> > > >> > { > >> > > >> > int maxLength = 9; > >> > > >> > GeohashPrefixTree geohashPrefixTree = new > >> > GeohashPrefixTree(SpatialContext, maxLength); > >> > > >> > _spatialStrategy = new > >> > RecursivePrefixTreeStrategy(geohashPrefixTree, "geoField"); > >> > > >> > } > >> > > >> > } > >> > > >> > } > >> > > >> > return _spatialStrategy; > >> > > >> > } > >> > > >> > } > >> > > >> > Is there something I am doing wrong with my indexing approach? I have > >> > cached the shapes that get created by the lat/lng points since I don't > >> need > >> > a new shape for the same coordinates. It appears to be the > >> > CreateIndexableFields() method that is taking the most time during > >> > indexing. I've tried to cache the fields generated by this method to > >> reuse > >> > but I can't create a new instance of the TokenStream from the cached > >> field > >> > to use in a new Document (in lucene.net 3.0.3 the constructor for > >> > TokenStream is protected). I've lowered the maxLevels int to 4 in the > >> > spatial strategy but I haven't seen an improvement in indexing times. > >>Any > >> > feedback would be greatly appreciated. > >> > > >> > ________________________________ > >> > Anthony Rodriguez > >> > Senior Software Developer > >> > > >> > Spark Networks | Igniting Relationships(r) > >> > 8383 Wilshire Blvd. Suite 800 | Beverly Hills, CA 90211 > >> > p. 323 658 3000 ext. 8021 | f. 866 945 5209 > >> > ________________________________ > >> > > > > > --14dae9340f2d66366104d8773aa5--