From: Upayavira
To: solr-user@lucene.apache.org
Subject: Re: query parsing
Date: Thu, 24 Sep 2015 08:04:11 +0100

Typically, the index dir is inside the data dir. Delete the index dir and
you should be good. If there is a tlog next to it, you might want to
delete that also.

If you don't have a data dir, I wonder whether you set the data dir when
creating your core or collection. Typically the instance dir and data dir
aren't needed.

Upayavira

On Wed, Sep 23, 2015, at 10:46 PM, Erick Erickson wrote:
> OK, this is bizarre. You'd have had to set up SolrCloud by specifying the
> -zkRun command when you start Solr or the -zkHost; highly unlikely.
> On the admin page there would be a "cloud" link on the left side, I really
> doubt one's there.
>
> You should have a data directory, it should be the parent of the index and
> tlog directories. As a sanity check try looking at the analysis page. Type
> a bunch of words in the left hand side indexing box and uncheck the verbose
> box. As you can tell I'm grasping at straws. I'm still puzzled why you
> don't have a "data" directory here, but that shouldn't really matter. How
> did you create this index? I don't mean data import handler, more how did
> you create the core that you're indexing to?
>
> Best,
> Erick
>
> On Wed, Sep 23, 2015 at 10:16 AM, Mark Fenbers wrote:
>
> > On 9/23/2015 12:30 PM, Erick Erickson wrote:
> >
> >> Then my next guess is you're not pointing at the index you think you are
> >> when you 'rm -rf data'
> >>
> >> Just ignore the Elall field for now I should think, although get rid of it
> >> if you don't think you need it.
> >>
> >> DIH should be irrelevant here.
> >>
> >> So let's back up.
> >> 1> go ahead and "rm -fr data" (with Solr stopped).
> >>
> > I have no "data" dir. Did you mean "index" dir? I removed 3 index
> > directories (2 for spelling):
> > cd /localapps/dev/eventLog; rm -rfv index solr/spFile solr/spIndex
> >
> >> 2> start Solr
> >> 3> do NOT re-index.
> >> 4> look at your index via the schema-browser. Of course there should be
> >> nothing there!
> >>
> > Correct! It said "there is no term info :("
> >
> >> 5> now kick off the DIH job and look again.
> >>
> > Now it shows a histogram, but most of the "terms" are long -- the full
> > texts of (the table.column) eventlogtext.logtext, including the whitespace
> > (with %0A used for newline characters)... So, it appears it is not being
> > tokenized properly, correct?
> >
> >> Your logtext field should have only single tokens.
> >> The fact that you have some very long tokens (presumably with
> >> whitespace) indicates that you aren't really blowing the index away
> >> between indexing.
> >>
> > Well, I did this time for sure. I verified that initially, because it
> > showed there was no term info until I DIH'd again.
> >
> >> Are you perhaps in Solr Cloud with more than one replica?
> >>
> > Not that I know of, but being new to Solr, there could be things going on
> > that I'm not aware of. How can I tell? I certainly didn't set anything up
> > for SolrCloud deliberately.
> >
> >> In that case you might be getting the index replicated on startup
> >> assuming you didn't blow away all replicas. If you are in SolrCloud,
> >> I'd just delete the collection and start over, after ensuring that
> >> you'd pushed the configset up to Zookeeper.
> >>
> >> BTW, I always look at the schema.xml file from the Solr admin window
> >> just as a sanity check in these situations.
> >>
> > Good idea! But the one shown in the browser is identical to the one I've
> > been editing! So that's not an issue.
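
The cleanup suggested at the top of this message (with Solr stopped, delete
the index directory and any tlog directory next to it) could be sketched in
Python roughly as follows. This is only an illustration: the function name
clear_index_dirs and the example path are hypothetical, and it assumes the
usual layout in which "index" and "tlog" sit directly under the data
directory.

```python
import shutil
from pathlib import Path


def clear_index_dirs(data_dir, names=("index", "tlog")):
    """Delete the named subdirectories of data_dir, if present.

    Hypothetical helper, not part of Solr. Run it only while Solr is
    stopped. Returns the list of directories that were removed.
    """
    removed = []
    for name in names:
        target = Path(data_dir) / name
        if target.is_dir():
            shutil.rmtree(target)  # recursive delete, like "rm -rf"
            removed.append(target)
    return removed


if __name__ == "__main__":
    # Hypothetical path; substitute the data directory of your own core.
    for path in clear_index_dirs("/path/to/core/data"):
        print("removed", path)
```

Anything else under the data directory is left alone, and a second run
removes nothing, so it is safe to repeat before re-indexing.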