Mailing list archives: April 2008

Site index · List index
Message list« Previous · 1 · 2 · 3Thread · Author · Date
POIRIER David Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 08:58
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 09:33
Iskandar Zaynutdinov Weather I should use nutch to search Domain model? Tue, 22 Apr, 09:42
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 10:50
Richard Cyganiak Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:22
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:44
Richard Cyganiak Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:55
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 12:31
Dennis Kubes Re: Fetching inefficiency Tue, 22 Apr, 13:58
Dennis Kubes Re: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 14:04
ogjunk-nu...@yahoo.com Re: Weather I should use nutch to search Domain model? Tue, 22 Apr, 14:05
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 15:10
Jason Boss hadoop slaves Tue, 22 Apr, 16:44
Dennis Kubes Re: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 17:22
ogjunk-nu...@yahoo.com Re: Delete Urls from CrawlsDB Wed, 23 Apr, 03:46
ogjunk-nu...@yahoo.com Re: how to deal with the max number of outlinks and inlinks per page? Wed, 23 Apr, 03:48
ogjunk-nu...@yahoo.com Re: Fetching inefficiency Wed, 23 Apr, 03:59
Siddhartha Reddy Re: Fetching inefficiency Wed, 23 Apr, 04:49
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Wed, 23 Apr, 07:42
Andrzej Bialecki Re: Fetching inefficiency Wed, 23 Apr, 08:23
Hilkiah Lavinier Re: use crawl command to fetch arbitrary pages? Wed, 23 Apr, 13:35
Dennis Kubes Re: Generator: 0 records selected for fetching, exiting ... Wed, 23 Apr, 15:01
ogjunk-nu...@yahoo.com Re: Fetching inefficiency Wed, 23 Apr, 15:22
ogjunk-nu...@yahoo.com Re: Fetching inefficiency Wed, 23 Apr, 15:30
Brian Ulicny Extracting Embedded Outlinks Wed, 23 Apr, 15:45
ogjunk-nu...@yahoo.com Re: Fetching inefficiency Wed, 23 Apr, 15:49
Howie Wang RE: Extracting Embedded Outlinks Wed, 23 Apr, 17:12
Brian Ulicny RE: Extracting Embedded Outlinks Wed, 23 Apr, 17:41
ywang Re: Re: use crawl command to fetch arbitrary pages? Thu, 24 Apr, 02:28
Siddhartha Reddy Re: Fetching inefficiency Thu, 24 Apr, 04:49
Lukas Vlcek Crawling MOSS 2007 content using Nutch via GSA connector Thu, 24 Apr, 10:41
Brent Walker Searching for Quoted Phrases Thu, 24 Apr, 14:25
Bradford Stephens Running other Hadoop Tasks on Nutch Servers? Thu, 24 Apr, 18:38
Samuel Guo Nutch Performance Fri, 25 Apr, 01:13
edwinchiu crawling crashed at dedup Fri, 25 Apr, 03:17
Samuel Guo Re: crawling crashed at dedup Fri, 25 Apr, 03:44
POIRIER David crawl command & urlfilter Fri, 25 Apr, 12:24
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Fri, 25 Apr, 12:41
Hilkiah Lavinier Re: crawl command & urlfilter Fri, 25 Apr, 13:33
POIRIER David RE: crawl command & urlfilter Fri, 25 Apr, 13:54
Bradford Stephens Cache URL Rewriting Not Working... Fri, 25 Apr, 19:10
ogjunk-nu...@yahoo.com Normalizing host names (e.g. www1|www2 => www) Fri, 25 Apr, 23:09
Samuel Guo Re: Generator: 0 records selected for fetching, exiting ... Sat, 26 Apr, 06:06
Doğacan Güney Re: Normalizing host names (e.g. www1|www2 => www) Sun, 27 Apr, 09:41
Ken Krugler Re: Normalizing host names (e.g. www1|www2 => www) Sun, 27 Apr, 18:03
Euan Clark On-page javascript treated as relative link Sun, 27 Apr, 22:40
Stefan Will Re: On-page javascript treated as relative link Mon, 28 Apr, 04:28
chris sleeman nutch crawl sub-directories required for search Mon, 28 Apr, 09:59
chris sleeman nutch crawl sub-directories required for search Mon, 28 Apr, 10:04
Bradford Stephens Re: Cache URL Rewriting Not Working... Mon, 28 Apr, 17:29
v k Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 03:19
ogjunk-nu...@yahoo.com Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 03:59
ogjunk-nu...@yahoo.com Re: Nutch Performance Tue, 29 Apr, 04:01
Samuel Guo Re: Nutch Performance Tue, 29 Apr, 04:07
Samuel Guo Re: Nutch Performance Tue, 29 Apr, 04:21
vkblogger Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 06:23
Mathias Conradt Problems with encoding (UTF-8), display of search results with special characters Tue, 29 Apr, 08:09
subrat mahanty how to configure hadoop master ans slave set up Tue, 29 Apr, 08:38
subrat mahanty bash: c/bin/hadoop: No such file or directory Tue, 29 Apr, 09:51
Gene Campbell Question about adding tags or attributes to indexed info Tue, 29 Apr, 12:33
Aldarris Nutch 0.9: CMD works, web gui does not Tue, 29 Apr, 15:23
Aldarris Re: Nutch 0.9: CMD works, web gui does not Tue, 29 Apr, 15:59
ogjunk-nu...@yahoo.com Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 16:54
Bill Meltzer tika-mimetypes errors Tue, 29 Apr, 17:18
ogjunk-nu...@yahoo.com Re: tika-mimetypes errors Tue, 29 Apr, 17:22
Bill Meltzer RE: tika-mimetypes errors Tue, 29 Apr, 17:28
Miguel Costa RE: Problems with encoding (UTF-8), display of search results with special characters Tue, 29 Apr, 17:32
Gene Campbell Fwd: Question about adding tags or attributes to indexed info Tue, 29 Apr, 20:20
John Mendenhall Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 21:39
Gene Campbell Please reply Tue, 29 Apr, 22:00
vkblogger Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 22:24
Mathias Conradt RE: Problems with encoding (UTF-8), display of search results with special characters Wed, 30 Apr, 02:48
Gene Campbell Test Wed, 30 Apr, 03:06
Gene Campbell unit tests for indexing Wed, 30 Apr, 05:07
vkblogger Re: index-more problem? Wed, 30 Apr, 05:15
vkblogger Re: index-more problem? Wed, 30 Apr, 05:15
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 05:17
Gene Campbell Re: unit tests for indexing Wed, 30 Apr, 05:33
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 05:44
Gene Campbell Re: unit tests for indexing Wed, 30 Apr, 06:39
gabriele renzi score of freshly injected urls Wed, 30 Apr, 10:15
Gene Campbell Storing fields best practice question Wed, 30 Apr, 11:02
Gene Campbell Storing fields best practice question Wed, 30 Apr, 11:12
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 14:18
Rohit Potnis Searching parameterized URLs Wed, 30 Apr, 17:13
Rohit Potnis Parameterized URL search using Nutch Wed, 30 Apr, 17:32
Jasper Kamperman Re: Searching parameterized URLs Wed, 30 Apr, 17:32
ogjunk-nu...@yahoo.com Re: unit tests for indexing Wed, 30 Apr, 17:58
ogjunk-nu...@yahoo.com Re: Searching parameterized URLs Wed, 30 Apr, 18:00
ogjunk-nu...@yahoo.com Re: index-more problem? Wed, 30 Apr, 18:06
ogjunk-nu...@yahoo.com Re: score of freshly injected urls Wed, 30 Apr, 18:07
Devang - Google RE: score of freshly injected urls Wed, 30 Apr, 18:39
gabriele renzi Re: score of freshly injected urls Wed, 30 Apr, 19:06
Gene Campbell Re: unit tests for indexing Wed, 30 Apr, 20:29
Message list« Previous · 1 · 2 · 3Thread · Author · Date
Box list
Dec 200981
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167