jackrabbit-users mailing list archives

From "Julio Castillo" <jcasti...@edgenuity.com>
Subject Re: Search not finding documents after Populate!
Date Thu, 06 Mar 2008 22:23:31 GMT
Sorry, I sent it to the wrong thread. Sorry, sorry.

I have just finished configuring the repository to use a MySQL bundle
persistence manager.
Since then I have had problems with Search (it worked fine with Derby in the
default configuration).

Using the demo JSP pages, I followed the "populate.jsp" link on the default
index.jsp page and selected 50 documents of type MS Word and PDF.
The population finished successfully, and while it ran the following INFO
messages appeared:
06.03.2008 12:35:38 *INFO * IndexMerger: merged 94 documents in 140 ms into
_a. (IndexMerger.java, line 304)
06.03.2008 12:40:47 *INFO * IndexMerger: merged 185 documents in 172 ms into
_k. (IndexMerger.java, line 304)

When I perform a search, no documents are found. I tried several words that
I had seen in the Word and PDF documents while navigating via the Browsing
link.

After a restart, the log in DEBUG mode shows the following:

06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_0, numDocs=96
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* AbstractIndex: closing IndexWriter.
(AbstractIndex.java, line 219)
06.03.2008 13:21:25 *DEBUG* Recovery: RedoLog is empty, no recovery needed.
(Recovery.java, line 80)
06.03.2008 13:21:25 *INFO * SearchIndex: Index initialized:
C:\Temp\jackrabbit\repository/repository/index Version: 2 (SearchIndex.java,
line 454)
06.03.2008 13:21:25 *DEBUG* MLRUItemStateCache:
org.apache.jackrabbit.core.state.MLRUItemStateCache@1c63e8c size=1,
1264/4194304 (MLRUItemStateCache.java, line 148)
06.03.2008 13:21:25 *DEBUG* JackrabbitTextExtractor:
JackrabbitTextExtractor(org.apache.jackrabbit.extractor.DefaultTextExtractor
) (JackrabbitTextExtractor.java, line 108)
06.03.2008 13:21:25 *INFO * LocalFileSystem: LocalFileSystem initialized at
path C:\Temp\jackrabbit\repository\workspaces\default\index
(LocalFileSystem.java, line 166)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_k, numDocs=185
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_l, numDocs=15
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_m, numDocs=11
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_n, numDocs=6
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_o, numDocs=45
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_p, numDocs=20
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_q, numDocs=14
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_r, numDocs=27
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_s, numDocs=6
(IndexMerger.java, line 162)
06.03.2008 13:21:25 *DEBUG* AbstractIndex: closing IndexWriter.
(AbstractIndex.java, line 219)
06.03.2008 13:21:25 *DEBUG* Recovery: RedoLog is empty, no recovery needed.
(Recovery.java, line 80)
06.03.2008 13:21:25 *INFO * SearchIndex: Index initialized:
C:\Temp\jackrabbit\repository\workspaces\default/index Version: 2
(SearchIndex.java, line 454)
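
As a rough sanity check on the startup log above, the numDocs values of the
"index added" lines can be totaled to see whether the workspace index holds
any documents at all. The following is only a minimal sketch: the class name
IndexDocCount and the method totalDocs are hypothetical helpers, not part of
Jackrabbit, and the sample lines are copied from the workspace portion of the
log. A nonzero total only suggests that nodes were indexed; it does not show
that text was extracted from the Word and PDF files.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class IndexDocCount {
    // Matches the segment name and numDocs field of an IndexMerger
    // "index added" DEBUG line, e.g. "index added: name=_k, numDocs=185".
    private static final Pattern INDEX_ADDED =
            Pattern.compile("index added: name=(\\w+), numDocs=(\\d+)");

    // Sums numDocs over all matching lines; lines that do not match are ignored.
    static int totalDocs(List<String> logLines) {
        int total = 0;
        for (String line : logLines) {
            Matcher m = INDEX_ADDED.matcher(line);
            if (m.find()) {
                total += Integer.parseInt(m.group(2));
            }
        }
        return total;
    }

    public static void main(String[] args) {
        // Workspace index segments as reported in the DEBUG log after restart.
        List<String> lines = Arrays.asList(
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_k, numDocs=185",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_l, numDocs=15",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_m, numDocs=11",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_n, numDocs=6",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_o, numDocs=45",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_p, numDocs=20",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_q, numDocs=14",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_r, numDocs=27",
            "06.03.2008 13:21:25 *DEBUG* IndexMerger: index added: name=_s, numDocs=6");
        System.out.println("workspace index documents: " + totalDocs(lines));
    }
}
```

The pattern only needs the first physical line of each log entry, so it still
matches when the mail client has wrapped the "(IndexMerger.java, line 162)"
suffix onto the next line.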

I then searched for two words: first "elections" and then "Texas". The word
"elections" is part of the path to a PDF document that contains the word
Texas. Neither keyword found any document. The log is below. Is there a way
to inspect the search index? I can get to the documents by browsing, but not
via search:

06.03.2008 13:25:35 *DEBUG* QueryImpl: Executing query:
+ Root node
+ Select properties: {internal}excerpt()
  + PathQueryNode
    + LocationStepQueryNode:  NodeTest=* Descendants=true Index=NONE
      + NodeTypeQueryNode:  Prop={http://www.jcp.org/jcr/1.0}primaryType
Value={http://www.jcp.org/jcr/nt/1.0}file
      + TextsearchQueryNode:  Path={http://www.jcp.org/jcr/1.0}content
Query=elections (QueryImpl.java, line 106)
06.03.2008 13:25:35 *DEBUG* MLRUItemStateCache:
org.apache.jackrabbit.core.state.MLRUItemStateCache@14a97b size=1,
4564/4194304(MLRUItemStateCache.java, line 148)
06.03.2008 13:25:35 *DEBUG* ItemManager: created item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 750)
06.03.2008 13:25:35 *DEBUG* ItemManager: caching item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 689)
06.03.2008 13:25:35 *DEBUG* QueryResultImpl: getResults(2147483647)
(QueryResultImpl.java, line 272)
06.03.2008 13:25:35 *DEBUG* QueryImpl: executed in 0.24 s. (//element(*,
nt:file)[jcr:contains(jcr:content, 'elections')]/rep:excerpt(.))
(QueryImpl.java, line 183)
06.03.2008 13:25:35 *DEBUG* QueryImpl: Executing query:
+ Root node
+ Select properties: {internal}spellcheck()
  + PathQueryNode
    + LocationStepQueryNode:  NodeTest={} Descendants=false Index=NONE
      + RelationQueryNode: Op: spellcheck
Prop=@{http://www.jcp.org/jcr/1.0}primaryType Type=STRING Value=elections
(QueryImpl.java, line 106)
06.03.2008 13:25:35 *DEBUG* QueryResultImpl: getResults(2147483647)
(QueryResultImpl.java, line 272)
06.03.2008 13:25:35 *DEBUG* QueryImpl: executed in 0.03 s.
(/jcr:root[rep:spellcheck('elections')]/(rep:spellcheck())) (QueryImpl.java,
line 183)
06.03.2008 13:25:35 *DEBUG* DocOrderNodeIteratorImpl: 1 node(s) ordered in 0
ms (DocOrderNodeIteratorImpl.java, line 254)
06.03.2008 13:25:35 *DEBUG* ItemManager: invalidated item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 761)
06.03.2008 13:25:35 *DEBUG* ItemManager: removing item
cafebabe-cafe-babe-cafe-babecafebabe from cache (ItemManager.java, line 702)
06.03.2008 13:26:42 *DEBUG* MLRUItemStateCache:
org.apache.jackrabbit.core.state.MLRUItemStateCache@1933acb size=1,
664/4194304(MLRUItemStateCache.java, line 148)
06.03.2008 13:26:42 *DEBUG* QueryImpl: Executing query:
+ Root node
+ Select properties: {internal}excerpt()
  + PathQueryNode
    + LocationStepQueryNode:  NodeTest=* Descendants=true Index=NONE
      + NodeTypeQueryNode:  Prop={http://www.jcp.org/jcr/1.0}primaryType
Value={http://www.jcp.org/jcr/nt/1.0}file
      + TextsearchQueryNode:  Path={http://www.jcp.org/jcr/1.0}content
Query=Texas (QueryImpl.java, line 106)
06.03.2008 13:26:42 *DEBUG* MLRUItemStateCache:
org.apache.jackrabbit.core.state.MLRUItemStateCache@8c858a size=1,
4564/4194304(MLRUItemStateCache.java, line 148)
06.03.2008 13:26:42 *DEBUG* ItemManager: created item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 750)
06.03.2008 13:26:42 *DEBUG* ItemManager: caching item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 689)
06.03.2008 13:26:42 *DEBUG* QueryResultImpl: getResults(2147483647)
(QueryResultImpl.java, line 272)
06.03.2008 13:26:42 *DEBUG* QueryImpl: executed in 0.02 s. (//element(*,
nt:file)[jcr:contains(jcr:content, 'Texas')]/rep:excerpt(.))
(QueryImpl.java, line 183)
06.03.2008 13:26:42 *DEBUG* QueryImpl: Executing query:
+ Root node
+ Select properties: {internal}spellcheck()
  + PathQueryNode
    + LocationStepQueryNode:  NodeTest={} Descendants=false Index=NONE
      + RelationQueryNode: Op: spellcheck
Prop=@{http://www.jcp.org/jcr/1.0}primaryType Type=STRING Value=Texas
(QueryImpl.java, line 106)
06.03.2008 13:26:42 *DEBUG* QueryResultImpl: getResults(2147483647)
(QueryResultImpl.java, line 272)
06.03.2008 13:26:42 *DEBUG* QueryImpl: executed in 0.00 s.
(/jcr:root[rep:spellcheck('Texas')]/(rep:spellcheck())) (QueryImpl.java,
line 183)
06.03.2008 13:26:42 *DEBUG* DocOrderNodeIteratorImpl: 1 node(s) ordered in 0
ms (DocOrderNodeIteratorImpl.java, line 254)
06.03.2008 13:26:42 *DEBUG* ItemManager: invalidated item
cafebabe-cafe-babe-cafe-babecafebabe (ItemManager.java, line 761)
06.03.2008 13:26:42 *DEBUG* ItemManager: removing item
cafebabe-cafe-babe-cafe-babecafebabe from cache (ItemManager.java, line 702)

The path to the document in my environment, as shown by the browse link to
my repository on the index.jsp page, is:
/default/us/tx/state/sos/www/elections/forms/vr17.pdf

Any ideas or suggestions?

Thanks,

Julio Castillo
Edgenuity Inc.

