Mailing list archives: December 2007

Site index · List index
Message listThread · Author · Date
lv david Re: some question about development Sat, 01 Dec, 11:20
Trey Spiva Re: Image Search Engine Input Sun, 02 Dec, 02:30
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed Sun, 02 Dec, 17:26
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-442) Integrate Solr/Nutch Sun, 02 Dec, 21:36
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-442) Integrate Solr/Nutch Sun, 02 Dec, 23:00
Otis Gospodnetic (JIRA) [jira] Commented: (NUTCH-442) Integrate Solr/Nutch Mon, 03 Dec, 06:45
Dennis Kubes (JIRA) [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Mon, 03 Dec, 20:20
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Mon, 03 Dec, 20:26
Ned Rockson Task process exit with nonzero status of 65 Mon, 03 Dec, 21:36
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Mon, 03 Dec, 23:29
Dennis Kubes (JIRA) [jira] Created: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release Mon, 03 Dec, 23:35
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release Mon, 03 Dec, 23:39
Dennis Kubes (JIRA) [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Mon, 03 Dec, 23:43
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Tue, 04 Dec, 02:03
Enis Soztutar (JIRA) [jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file Tue, 04 Dec, 09:59
Andrea Spinelli (JIRA) [jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed Tue, 04 Dec, 11:15
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file Tue, 04 Dec, 11:51
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Tue, 04 Dec, 13:19
Enis Soztutar (JIRA) [jira] Updated: (NUTCH-586) Add option to run compiled classes w/o job file Tue, 04 Dec, 13:21
Teccon Ingenieros (JIRA) [jira] Created: (NUTCH-588) Help Need Tue, 04 Dec, 16:40
Dennis Kubes (JIRA) [jira] Resolved: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Tue, 04 Dec, 19:15
Matt Kangas (JIRA) [jira] Commented: (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed Tue, 04 Dec, 21:43
Ryan Levering (JIRA) [jira] Created: (NUTCH-589) Hierarchical Classloaders Wed, 05 Dec, 00:04
quxy Nutch\nutch-0.9\build.xml:61: Specify at least one source--a file or resource collection. Wed, 05 Dec, 04:06
Hudson (JIRA) [jira] Commented: (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Wed, 05 Dec, 05:47
Nathaniel Powell (JIRA) [jira] Created: (NUTCH-590) Index multiple docs per call using IndexingFilter extension point Thu, 06 Dec, 00:59
Ned Rockson Filter spam URLs Fri, 07 Dec, 01:14
Enis Soztutar (JIRA) [jira] Resolved: (NUTCH-588) Help Need Fri, 07 Dec, 10:46
Andrzej Bialecki Re: Filter spam URLs Fri, 07 Dec, 13:51
Dennis Kubes (JIRA) [jira] Commented: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release Mon, 10 Dec, 23:46
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release Tue, 11 Dec, 01:08
patil fnm frq like files are not creating while crwaling some site Wed, 12 Dec, 09:59
Les Cheong (JIRA) [jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server Wed, 12 Dec, 19:34
Neumann, Vladimir cached.jsp for the new dev-version Thu, 13 Dec, 10:24
novikov1 cached.jsp for the new dev-version Thu, 13 Dec, 10:59
frank ling (JIRA) [jira] Created: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document. Fri, 14 Dec, 00:47
patil files are not generated in index folder by indexer for the site http://www.traguiden.se(for other sites its working good) while crwaling Fri, 14 Dec, 06:25
Emmanuel Joke (JIRA) [jira] Created: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED Sun, 16 Dec, 15:23
Emmanuel Joke (JIRA) [jira] Updated: (NUTCH-592) Fetcher2 : NPE for page with status ProtocolStatus.TEMP_MOVED Sun, 16 Dec, 15:23
Andrzej Bialecki (JIRA) [jira] Resolved: (NUTCH-586) Add option to run compiled classes w/o job file Mon, 17 Dec, 18:24
Hudson (JIRA) [jira] Commented: (NUTCH-586) Add option to run compiled classes w/o job file Tue, 18 Dec, 04:20
Joseph Chen (JIRA) [jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest Tue, 18 Dec, 23:34
sudarat (JIRA) [jira] Created: (NUTCH-593) Nutch crawl problem Wed, 19 Dec, 02:49
Nigel Daley Hudson Upgrade Dec 19 Wed, 19 Dec, 06:59
Nigel Daley Re: Hudson Upgrade Dec 19 Thu, 20 Dec, 19:45
Peter Boot errors compiling index-extra Fri, 21 Dec, 04:25
Dennis Kubes (JIRA) [jira] Created: (NUTCH-594) Serve Nutch search results in XML and JSON Fri, 21 Dec, 17:10
Dennis Kubes (JIRA) [jira] Updated: (NUTCH-594) Serve Nutch search results in XML and JSON Fri, 21 Dec, 17:18
Peter Boot (JIRA) [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic Fri, 21 Dec, 21:17
Lirida Kercelli scoring algorithm Sun, 23 Dec, 14:00
Torontoer Enable Nutch to search for local file system Mon, 24 Dec, 03:33
Hudson (JIRA) [jira] Commented: (NUTCH-575) NPE in OpenSearchServlet when summary is null Tue, 25 Dec, 04:19
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server Wed, 26 Dec, 18:08
Emmanuel Joke (JIRA) [jira] Commented: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format Thu, 27 Dec, 10:30
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format Thu, 27 Dec, 11:51
Andrzej Bialecki (JIRA) [jira] Created: (NUTCH-595) "Target file:/.... already exists" Thu, 27 Dec, 13:08
Emmanuel Joke (JIRA) [jira] Commented: (NUTCH-595) "Target file:/.... already exists" Thu, 27 Dec, 13:21
Emmanuel Joke (JIRA) [jira] Commented: (NUTCH-534) SegmentMerger: add -normalize option Thu, 27 Dec, 13:26
Andrzej Bialecki (JIRA) [jira] Commented: (NUTCH-534) SegmentMerger: add -normalize option Thu, 27 Dec, 13:36
Emmanuel Joke (JIRA) [jira] Updated: (NUTCH-528) CrawlDbReader: add some new stats + dump into a csv format Fri, 28 Dec, 02:59
hud...@lucene.zones.apache.org Build failed in Hudson: Nutch-Nightly #307 Fri, 28 Dec, 05:08
hud...@lucene.zones.apache.org Build failed in Hudson: Nutch-Nightly #308 Sat, 29 Dec, 04:10
hud...@lucene.zones.apache.org Hudson build is back to normal: Nutch-Nightly #309 Sat, 29 Dec, 05:33
Emmanuel Joke (JIRA) [jira] Created: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS Sun, 30 Dec, 09:52
Doğacan Güney (JIRA) [jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS Sun, 30 Dec, 11:16
Remco Verhoef (JIRA) [jira] Created: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. Sun, 30 Dec, 16:29
Remco Verhoef (JIRA) [jira] Updated: (NUTCH-597) Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. Sun, 30 Dec, 16:31
hud...@lucene.zones.apache.org Build failed in Hudson: Nutch-Nightly #311 Mon, 31 Dec, 04:34
Message listThread · Author · Date
Box list
Nov 2009106
Oct 200988
Sep 200932
Aug 200982
Jul 200977
Jun 200994
May 2009104
Apr 200985
Mar 2009255
Feb 2009250
Jan 2009197
Dec 2008130
Nov 2008117
Oct 200884
Sep 2008101
Aug 200858
Jul 200832
Jun 200893
May 200857
Apr 200878
Mar 2008152
Feb 2008189
Jan 2008151
Dec 200768
Nov 2007186
Oct 2007162
Sep 2007189
Aug 2007135
Jul 2007283
Jun 2007241
May 2007188
Apr 2007144
Mar 2007282
Feb 2007241
Jan 2007266
Dec 2006103
Nov 2006222
Oct 2006187
Sep 2006166
Aug 2006281
Jul 2006180
Jun 2006262
May 2006282
Apr 2006247
Mar 2006304
Feb 2006349
Jan 2006558
Dec 2005412
Nov 2005288
Oct 2005313
Sep 2005339
Aug 2005426
Jul 2005228
Jun 2005178
May 2005140
Apr 2005497
Mar 2005398
Feb 200510