nutch-user mailing list archives

From "Dan Morrill" <ralph.morr...@baker.edu>
Subject RE: Merging indexes -- please help....
Date Mon, 03 Apr 2006 12:18:34 GMT
Hi,

I noticed that it didn't like it when I used the drive designation
(Windows Cygwin environment). If you run

./nutch merge -local /STG1/index /STG1/indexes

that may work better. Let me know.
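
If dropping the drive letter entirely is not an option, another thing to try is Cygwin's POSIX form of the path. The sketch below is only an illustration: to_cygdrive is a hypothetical helper name, it assumes Cygwin's default /cygdrive mount prefix, and whether bin/nutch accepts the resulting paths has not been verified here.

```shell
# Hypothetical helper (not part of Nutch or Cygwin): rewrite a Windows
# drive path such as E:/STG1/index into Cygwin's default /cygdrive form.
# Assumes the default /cygdrive mount prefix.
to_cygdrive() {
  # Normalize backslashes to forward slashes first.
  _tc_path=$(printf '%s' "$1" | tr '\\' '/')
  case "$_tc_path" in
    [A-Za-z]:/*)
      # Lower-case the drive letter and strip the "X:" prefix.
      _tc_drive=$(printf '%s' "${_tc_path%%:*}" | tr 'A-Z' 'a-z')
      printf '/cygdrive/%s%s\n' "$_tc_drive" "${_tc_path#?:}"
      ;;
    *)
      # No drive letter: return the path unchanged.
      printf '%s\n' "$_tc_path"
      ;;
  esac
}
```

The converted paths could then be passed to the merge command in place of the E:/... arguments, for example "$(to_cygdrive E:/STG1/index)". On a real Cygwin install the bundled cygpath utility performs this conversion and also respects non-default mount points.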

Cheers/r/dan

-----Original Message-----
From: Vertical Search [mailto:vertical.searchh@gmail.com] 
Sent: Sunday, April 02, 2006 7:07 PM
To: nutch-user@lucene.apache.org
Subject: Re: Merging indexes -- please help....

Okay.
I had 2 sets of crawls, E:/STG1 and E:/STG2.
I used the dedup command to remove duplicates.
Then the commands I used to merge are as follows
(based on what was available in the mail archives and the responses I got).

First I ran

 bin/nutch merge E:/STG1/index E:/STG1/indexes
 bin/nutch merge E:/STG1/index E:/STG2/indexes

In nutch-site.xml I have searcher.dir set to E:/STG1.

I get absolutely no results. The command console output is as follows.
Can someone shed some light on this please, ASAP?

INFO: creating new bean
Apr 2, 2006 8:58:36 PM org.apache.nutch.searcher.NutchBean init
INFO: opening merged index in E:\Hoodukoo\STG5\index
Apr 2, 2006 8:58:36 PM org.apache.nutch.searcher.NutchBean init
INFO: opening segments in E:\Hoodukoo\STG5\segments
Apr 2, 2006 8:58:36 PM org.apache.hadoop.conf.Configuration getConfResourceAsReader
INFO: found resource common-terms.utf8 at file:/C:/xampp/tomcat/webapps/hoodukoo/WEB-INF/classes/common-terms.utf8
Apr 2, 2006 8:58:36 PM org.apache.nutch.searcher.NutchBean init
INFO: opening linkdb in E:\Hoodukoo\STG5\linkdb
Apr 2, 2006 8:58:36 PM org.apache.jsp.search_jsp _jspService
INFO: query request from 127.0.0.1
Apr 2, 2006 8:58:36 PM org.apache.jsp.search_jsp _jspService
INFO: query: site
Apr 2, 2006 8:58:36 PM org.apache.nutch.searcher.NutchBean search
INFO: searching for 20 raw hits

