Mailing list archives: April 2008

Site index · List index
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Hilkiah Lavinier Re: index-more problem? Thu, 17 Apr, 23:06
Hilkiah Lavinier hadoop dfs -ls and nutch generate/fetch commands Fri, 18 Apr, 00:54
Hilkiah Lavinier Re: use crawl command to fetch arbitrary pages? Wed, 23 Apr, 13:35
Hilkiah Lavinier Re: crawl command & urlfilter Fri, 25 Apr, 13:33
Howie Wang RE: Extracting Embedded Outlinks Wed, 23 Apr, 17:12
Iskandar Zaynutdinov Weather I should use nutch to search Domain model? Tue, 22 Apr, 09:42
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 05:17
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 05:44
Iskandar Zaynutdinov Re: unit tests for indexing Wed, 30 Apr, 14:18
James Moore using prefix-urlfilter instead of regular expressions Mon, 21 Apr, 23:05
Jason Boss Errors with Tomcat Sat, 19 Apr, 00:36
Jason Boss Re: Errors with Tomcat Sat, 19 Apr, 03:26
Jason Boss hadoop Mon, 21 Apr, 22:42
Jason Boss Re: hadoop Tue, 22 Apr, 00:36
Jason Boss Re: hadoop Tue, 22 Apr, 02:53
Jason Boss lightweight index Tue, 22 Apr, 02:57
Jason Boss hadoop slaves Tue, 22 Apr, 16:44
Jasper Kamperman Re: Filtering on a field Sun, 20 Apr, 22:50
Jasper Kamperman Re: Searching parameterized URLs Wed, 30 Apr, 17:32
Jeet Singh Controlling no. of pages to crawl Tue, 01 Apr, 05:30
Jeet Singh Issue in parsing the query Sat, 19 Apr, 14:00
Jeet Singh Filtering on a field Sun, 20 Apr, 12:40
Jeet Singh How to tell a custom field to searcher Tue, 22 Apr, 08:37
John Mendenhall nutch, hadoop, and windows Fri, 11 Apr, 23:48
John Mendenhall Re: Next Generation Nutch Fri, 11 Apr, 23:55
John Mendenhall nutch data on *nix and windows Thu, 17 Apr, 00:27
John Mendenhall Re: hadoop dfs -ls and nutch generate/fetch commands Fri, 18 Apr, 17:49
John Mendenhall Re: Error: Failed to get the current user's information: Login failed: Cannot run program "whoami": Tue, 29 Apr, 21:39
Ken Krugler Re: Normalizing host names (e.g. www1|www2 => www) Sun, 27 Apr, 18:03
Liu Yan Re: Reduce tasks doesn't start Wed, 02 Apr, 18:13
Liu Yan Re: Reduce tasks doesn't start Thu, 03 Apr, 14:54
Liu Yan Re: Reduce tasks doesn't start Thu, 03 Apr, 15:30
Liu Yan Re: Reduce tasks doesn't start Thu, 03 Apr, 16:13
Lukas Vlcek Crawling MOSS 2007 content using Nutch via GSA connector Thu, 24 Apr, 10:41
Lyndon Maydwell Stemming plugin problem. Tue, 01 Apr, 09:58
Lyndon Maydwell Re: Can two different urls be configured as same ? Tue, 22 Apr, 05:20
Mathias Conradt Problems with encoding (UTF-8), display of search results with special characters Tue, 29 Apr, 08:09
Mathias Conradt RE: Problems with encoding (UTF-8), display of search results with special characters Wed, 30 Apr, 02:48
Miguel Costa RE: Problems with encoding (UTF-8), display of search results with special characters Tue, 29 Apr, 17:32
Ola Daniel java.io.IOException: No input paths specified in input Tue, 15 Apr, 08:11
Otis Gospodnetic Re: Nutch or Heritrix? Mon, 07 Apr, 05:04
Otis Gospodnetic Re: Slow Crawl Speed and Tika Error Media type alias already exists: text/xml Mon, 07 Apr, 05:06
Otis Gospodnetic Re: dealing with utf-8 characters Mon, 07 Apr, 05:08
Otis Gospodnetic Re: Slow Crawl Speed and Tika Error Media type alias already exists: text/xml Wed, 09 Apr, 06:11
Otis Gospodnetic Fetch task 100% done, but still fetching Thu, 10 Apr, 21:11
Otis Gospodnetic Re: Next Generation Nutch Sat, 12 Apr, 04:08
Otis Gospodnetic Re: Next Generation Nutch Sat, 12 Apr, 04:13
Otis Gospodnetic Re: Next Generation Nutch Mon, 14 Apr, 01:19
Otis Gospodnetic Re: Next Generation Nutch Mon, 14 Apr, 17:57
Otis Gospodnetic Re: Parallel operations in fetch Wed, 16 Apr, 04:24
Otis Gospodnetic Re: Files removed from https://svn.apache.org/repos/asf/lucene/nutch/trunk/bin??? Fri, 18 Apr, 15:26
Otis Gospodnetic Re: Parser bug? Fri, 18 Apr, 15:29
POIRIER David Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 08:58
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 09:33
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Tue, 22 Apr, 15:10
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Wed, 23 Apr, 07:42
POIRIER David crawl command & urlfilter Fri, 25 Apr, 12:24
POIRIER David RE: Generator: 0 records selected for fetching, exiting ... Fri, 25 Apr, 12:41
POIRIER David RE: crawl command & urlfilter Fri, 25 Apr, 13:54
Raj Malhotra Can two different urls be configured as same ? Tue, 22 Apr, 05:05
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 10:50
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:44
Raj Malhotra Re: Can two different urls be configured as same ? Tue, 22 Apr, 12:31
Ravish Bhagdev Re: what is the best way to learn search engin technology Wed, 09 Apr, 18:50
Richard Cyganiak Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:22
Richard Cyganiak Re: Can two different urls be configured as same ? Tue, 22 Apr, 11:55
Rohit Potnis Searching parameterized URLs Wed, 30 Apr, 17:13
Rohit Potnis Parameterized URL search using Nutch Wed, 30 Apr, 17:32
Sami Siren Re: Next Generation Nutch Sat, 12 Apr, 08:20
Samuel Guo Any HDFS protocol plugin like File protocol plugin ? Wed, 16 Apr, 12:12
Samuel Guo Any HDFS protocol plugin like File protocol plugin ? Wed, 16 Apr, 12:14
Samuel Guo Re: Any HDFS protocol plugin like File protocol plugin ? Wed, 16 Apr, 12:34
Samuel Guo Nutch Performance Fri, 25 Apr, 01:13
Samuel Guo Re: crawling crashed at dedup Fri, 25 Apr, 03:44
Samuel Guo Re: Generator: 0 records selected for fetching, exiting ... Sat, 26 Apr, 06:06
Samuel Guo Re: Nutch Performance Tue, 29 Apr, 04:07
Samuel Guo Re: Nutch Performance Tue, 29 Apr, 04:21
Sandeep Tata NoSuchMethodError Thu, 10 Apr, 03:17
Sandeep Tata NoSuchMethodError Thu, 10 Apr, 03:21
Sebastian Steinmetz Re: Slow Crawl Speed and Tika Error Media type alias already exists: text/xml Tue, 08 Apr, 10:16
Siddhartha Reddy Re: Fetching inefficiency Mon, 21 Apr, 20:59
Siddhartha Reddy Re: hadoop Tue, 22 Apr, 04:06
Siddhartha Reddy Re: Fetching inefficiency Wed, 23 Apr, 04:49
Siddhartha Reddy Re: Fetching inefficiency Thu, 24 Apr, 04:49
Srinivasarao Vundavalli Problem with the index Thu, 17 Apr, 06:46
Stefan Will Hardware Requirements Mon, 07 Apr, 01:24
Stefan Will Re: On-page javascript treated as relative link Mon, 28 Apr, 04:28
Susam Pal Re: Crawl dies unexpectedly Tue, 01 Apr, 18:44
Susam Pal Re: fetching error Thu, 03 Apr, 16:57
Susam Pal Re: Nutch fetching skipped files Thu, 03 Apr, 17:04
Susam Pal Re: Nutch fetching skipped files Fri, 04 Apr, 16:34
Susam Pal Re: fetching error Thu, 10 Apr, 14:07
Svein Yngvar Willassen Nutch or Heritrix? Sat, 05 Apr, 13:35
Svein Yngvar Willassen JobStream.py Tue, 15 Apr, 11:57
Svein Yngvar Willassen Re: JobStream.py Tue, 15 Apr, 17:17
Svein Yngvar Willassen Re: JobStream.py Tue, 15 Apr, 17:30
Svein Yngvar Willassen Parser bug? Wed, 16 Apr, 19:56
Svein Yngvar Willassen Re: how to setup cluster for two system in hadoop Thu, 17 Apr, 06:39
Svein Yngvar Willassen Re: Parallel operations in fetch Thu, 17 Apr, 08:18
Svein Yngvar Willassen Re: Parser bug? Thu, 17 Apr, 11:46
Message list« Previous · 1 · 2 · 3 · Next »Thread · Author · Date
Box list
Dec 2009103
Nov 2009308
Oct 2009258
Sep 2009184
Aug 2009199
Jul 2009312
Jun 2009196
May 2009163
Apr 2009247
Mar 2009408
Feb 2009214
Jan 2009204
Dec 2008229
Nov 2008193
Oct 2008171
Sep 2008269
Aug 2008165
Jul 2008122
Jun 2008243
May 2008220
Apr 2008294
Mar 2008209
Feb 2008191
Jan 2008272
Dec 2007145
Nov 2007228
Oct 2007261
Sep 2007273
Aug 2007292
Jul 2007339
Jun 2007392
May 2007242
Apr 2007309
Mar 2007283
Feb 2007188
Jan 2007370
Dec 2006225
Nov 2006160
Oct 2006251
Sep 2006412
Aug 2006450
Jul 2006315
Jun 2006380
May 2006232
Apr 2006458
Mar 2006659
Feb 2006581
Jan 2006592
Dec 2005430
Nov 2005398
Oct 2005304
Sep 2005404
Aug 2005278
Jul 2005342
Jun 2005216
May 2005151
Apr 2005220
Mar 2005167