Mailing list archives: January 2006

Site index · List index
Message list1 · 2 · 3 · 4 · 5 · 6 · Next »Thread · Author · Date
Jérôme Charron Re: no static NutchConf Wed, 04 Jan, 17:57
Jérôme Charron Re: no static NutchConf Wed, 04 Jan, 18:14
Jérôme Charron Re: no static NutchConf Thu, 05 Jan, 10:52
Jérôme Charron Re: mapred crawling exception - Job failed! Thu, 05 Jan, 13:26
Jérôme Charron Re: [VOTE] Commiter access for Stefan Groschupf Thu, 05 Jan, 22:03
Jérôme Charron Re: problems http-client Fri, 06 Jan, 10:02
Jérôme Charron Re: test suite fails? Mon, 09 Jan, 17:50
Jérôme Charron Re: svn commit: r367137 - in /lucene/nutch/trunk/src: java/org/apache/nutch/net/protocols/ plugin/ plugin/lib-http/ plugin/lib-http/src/ plugin/lib-http/src/java/ plugin/lib-http/src/java/org/ plugin/lib-http/src/java/org/apache/ plugin/lib-http/src/ Mon, 09 Jan, 21:08
Jérôme Charron Re: HTMLMetaProcessor a bug? Tue, 10 Jan, 10:06
Jérôme Charron Re: ParserFactory test fail Tue, 10 Jan, 17:24
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Fri, 20 Jan, 16:44
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Mon, 23 Jan, 11:51
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Mon, 23 Jan, 13:18
Jérôme Charron Re: xml-parser plugin contribution Tue, 24 Jan, 09:17
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Tue, 24 Jan, 09:44
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Tue, 24 Jan, 09:51
Jérôme Charron Re: lang identifier and nutch analyzer in trunk Tue, 24 Jan, 11:17
Matthias Günter (JIRA) [jira] Created: (NUTCH-174) Problem encountered with ant during compilation Sat, 14 Jan, 15:09
Matthias Günter (JIRA) [jira] Created: (NUTCH-175) No input directories specified in: while crawing in nightly build from the 14.1.2006: sh ./nutch crawl urllist.txt -dir tmpdir Sat, 14 Jan, 20:07
Matthias Günter (JIRA) [jira] Created: (NUTCH-176) Using -dir: creates an error, when the directory already exists Sun, 15 Jan, 13:10
Matthias Günter (JIRA) [jira] Created: (NUTCH-177) Default installation seems to produce working entity of nutch Sun, 15 Jan, 13:20
Matthias Günter (JIRA) [jira] Updated: (NUTCH-177) Default installation seems to produce working entity of nutch Sun, 15 Jan, 13:22
Matthias Günter (JIRA) [jira] Updated: (NUTCH-177) Default installation seems to produce working entity of nutch Sun, 15 Jan, 13:22
AJ Chen Re: problems http-client Fri, 06 Jan, 20:33
Andrew McNabb Reporter interface Fri, 06 Jan, 23:43
Andrew McNabb Re: Reporter interface Mon, 09 Jan, 23:13
Andrew McNabb Re: Reporter interface Tue, 10 Jan, 00:34
Andrew McNabb Re: Reporter interface Tue, 10 Jan, 13:42
Andrew McNabb Re: Reporter interface Tue, 10 Jan, 17:56
Andrew McNabb Re: [bug] combiner class never used Mon, 30 Jan, 21:06
Andrzej Bialecki Re: Mega-cleanup in trunk/ Mon, 02 Jan, 12:08
Andrzej Bialecki Re: svn commit: r359822 - in /lucene/nutch/trunk: bin/ conf/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/indexer/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/segment/ src/java/org/apache/nutc... Mon, 02 Jan, 20:19
Andrzej Bialecki Re: IndexSorter optimizer Mon, 02 Jan, 22:49
Andrzej Bialecki Re: IndexSorter optimizer Tue, 03 Jan, 07:57
Andrzej Bialecki Re: NullPointerException (new as of Dec 31st) Tue, 03 Jan, 08:35
Andrzej Bialecki Re: mapred crawling exception - Job failed! Wed, 04 Jan, 07:51
Andrzej Bialecki Re: mapred crawling exception - Job failed! Wed, 04 Jan, 11:03
Andrzej Bialecki Re: no static NutchConf Wed, 04 Jan, 16:52
Andrzej Bialecki Re: no static NutchConf Wed, 04 Jan, 18:07
Andrzej Bialecki Re: IndexSorter optimizer Wed, 04 Jan, 18:24
Andrzej Bialecki Re: no static NutchConf Wed, 04 Jan, 19:10
Andrzej Bialecki Re: svn commit: r365850 - in /lucene/nutch/trunk/src/plugin/protocol-httpclient: ./ lib/ src/java/org/apache/nutch/protocol/httpclient/ Wed, 04 Jan, 19:42
Andrzej Bialecki Re: mapred crawling exception - Job failed! Thu, 05 Jan, 07:29
Andrzej Bialecki Re: mapred crawling exception - Job failed! Thu, 05 Jan, 09:49
Andrzej Bialecki Per-page crawling policy Thu, 05 Jan, 13:58
Andrzej Bialecki Re: no static NutchConf Thu, 05 Jan, 15:26
Andrzej Bialecki Re: Per-page crawling policy Thu, 05 Jan, 15:47
Andrzej Bialecki Re: Per-page crawling policy Thu, 05 Jan, 17:41
Andrzej Bialecki Re: problems http-client Thu, 05 Jan, 21:12
Andrzej Bialecki Re: problems http-client Fri, 06 Jan, 10:24
Andrzej Bialecki Re: Class Cast exception Fri, 06 Jan, 19:27
Andrzej Bialecki Re: Class Cast exception Fri, 06 Jan, 20:39
Andrzej Bialecki Re: Per-page crawling policy Fri, 06 Jan, 20:41
Andrzej Bialecki Re: Class Cast exception Fri, 06 Jan, 20:51
Andrzej Bialecki Re: Class Cast exception Fri, 06 Jan, 21:55
Andrzej Bialecki Re: Nutch Deployment Sat, 07 Jan, 08:29
Andrzej Bialecki Re: NPE in Indexer.java line 184 Sun, 08 Jan, 09:07
Andrzej Bialecki Re: NPE in Indexer.java line 184 Mon, 09 Jan, 08:43
Andrzej Bialecki Re: NPE in Indexer.java line 184 Mon, 09 Jan, 09:53
Andrzej Bialecki Re: OpenOffice and Excel parsers Tue, 10 Jan, 19:50
Andrzej Bialecki Re: Problem with latest SVN during reduce phase Wed, 11 Jan, 20:03
Andrzej Bialecki Re: MapReduce and segment merging Thu, 12 Jan, 17:56
Andrzej Bialecki Re: MapReduce and segment merging Thu, 12 Jan, 19:13
Andrzej Bialecki Generating multiple fetchlists between updates Fri, 13 Jan, 13:31
Andrzej Bialecki Re: Per-page crawling policy Mon, 16 Jan, 18:06
Andrzej Bialecki Re: tool to mount nutch filesystem Sat, 21 Jan, 16:59
Andrzej Bialecki Re: lang identifier and nutch analyzer in trunk Mon, 23 Jan, 12:11
Andrzej Bialecki Re: lang identifier and nutch analyzer in trunk Mon, 23 Jan, 22:19
Andrzej Bialecki Re: xml-parser plugin contribution Tue, 24 Jan, 07:59
Andrzej Bialecki Re: lang identifier and nutch analyzer in trunk Tue, 24 Jan, 11:11
Andrzej Bialecki Re: Two possible extensions Tue, 24 Jan, 11:20
Andrzej Bialecki Re: lang identifier and nutch analyzer in trunk Tue, 24 Jan, 11:42
Andrzej Bialecki Re: [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata Thu, 26 Jan, 21:22
Andrzej Bialecki Re: [Nutch-cvs] svn commit: r372810 - /lucene/nutch/trunk/bin/nutch Fri, 27 Jan, 11:01
Andrzej Bialecki Re: [Nutch-cvs] svn commit: r372810 - /lucene/nutch/trunk/bin/nutch Fri, 27 Jan, 12:14
Andrzej Bialecki Re: svn commit: r372810 - /lucene/nutch/trunk/bin/nutch Fri, 27 Jan, 21:09
Andrzej Bialecki Re: svn commit: r359822 - in /lucene/nutch/trunk: bin/ conf/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/indexer/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/segment/ src/java/org/apache/nutc... Sun, 29 Jan, 12:38
Andrzej Bialecki Re: where we need meta data? Mon, 30 Jan, 07:56
Andrzej Bialecki Re: indexSorter - applied to SVN or patch in Jira? Tue, 31 Jan, 15:23
Andrzej Bialecki Re: [jira] Commented: (NUTCH-169) remove static NutchConf Tue, 31 Jan, 18:39
Andrzej Bialecki Lucene's VInt for lengths/counts/sizes Tue, 31 Jan, 21:06
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata Thu, 05 Jan, 22:30
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata Sat, 07 Jan, 08:25
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-169) remove static NutchConf Wed, 11 Jan, 13:13
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-180) Performance problem with widely used keywords Mon, 16 Jan, 07:06
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-169) remove static NutchConf Wed, 18 Jan, 16:19
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata Fri, 20 Jan, 12:11
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-136) mapreduce segment generator generates 50 % less than excepted urls Tue, 24 Jan, 22:20
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-186) mapred-default.xml is over ridden by nutch-site.xml Tue, 24 Jan, 22:23
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata Wed, 25 Jan, 11:28
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-190) ParseUtil drops reason for failed parse Thu, 26 Jan, 23:49
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-95) DeleteDuplicates depends on the order of input segments Sun, 29 Jan, 02:05
Andrzej Bialecki (JIRA) [jira] Assigned: (NUTCH-169) remove static NutchConf Mon, 30 Jan, 20:50
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-192) meta data support for CrawlDatum Tue, 31 Jan, 09:04
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-169) remove static NutchConf Tue, 31 Jan, 09:14
Andrzej Bialecki (JIRA) [jira] Closed: (NUTCH-169) remove static NutchConf Tue, 31 Jan, 16:10
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-193) move NDFS and MapReduce to a separate project Tue, 31 Jan, 18:19
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-193) move NDFS and MapReduce to a separate project Tue, 31 Jan, 19:02
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-192) meta data support for CrawlDatum Tue, 31 Jan, 20:00
Andy Liu injection infinite loop Wed, 04 Jan, 21:57
Message list1 · 2 · 3 · 4 · 5 · 6 · Next »Thread · Author · Date
Box list
Dec 200933
Nov 2009154
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008130
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008189
Jan 2008151
Dec 200768
Nov 2007186
Oct 2007162
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510