| Vinci (JIRA) |
[jira] Updated: (NUTCH-624) Better parsed text by default parser |
Tue, 01 Apr, 09:08 |
| Vinci (JIRA) |
[jira] Updated: (NUTCH-625) Non-ascii character broken in dumped content for mixed encoding (utf-8 and multi-byte) |
Tue, 01 Apr, 09:20 |
| Vinci |
Re: [jira] Created: (NUTCH-624) Better parsed text |
Tue, 01 Apr, 09:21 |
| Edward J. Yoon |
Is there any LSI implementation? |
Wed, 02 Apr, 01:55 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #408 |
Wed, 02 Apr, 06:59 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #409 |
Thu, 03 Apr, 07:05 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #410 |
Fri, 04 Apr, 06:55 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #411 |
Sat, 05 Apr, 06:06 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #412 |
Sun, 06 Apr, 04:18 |
| Remco Verhoef (JIRA) |
[jira] Created: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects |
Sun, 06 Apr, 21:18 |
| Remco Verhoef (JIRA) |
[jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects |
Sun, 06 Apr, 21:20 |
| ogjunk-nu...@yahoo.com |
Re: Is there any LSI implementation? |
Mon, 07 Apr, 05:14 |
| Apache Hudson Server |
Build failed in Hudson: Nutch-trunk #413 |
Mon, 07 Apr, 07:07 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #414 |
Tue, 08 Apr, 04:28 |
| cybercouf |
found a bug in plugin/protocol-http |
Tue, 08 Apr, 15:08 |
| minskv |
what is the difference between nutch and some other opensource search engines |
Wed, 09 Apr, 18:44 |
| ogjunk-nu...@yahoo.com |
Re: what is the difference between nutch and some other opensource search engines |
Wed, 09 Apr, 21:18 |
| Otis Gospodnetic (JIRA) |
[jira] Created: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 04:12 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Thu, 10 Apr, 04:12 |
| Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 04:12 |
| Apache Hudson Server |
Hudson build is back to normal: Nutch-trunk #416 |
Thu, 10 Apr, 04:13 |
| Andrzej Bialecki |
Re: [jira] Updated: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 08:45 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-500) Add hadoop masters configuration file into conf folder |
Thu, 10 Apr, 15:24 |
| Dennis Kubes |
Re: [jira] Updated: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 15:25 |
| Chris Mattmann |
Re: [jira] Updated: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 15:28 |
| ogjunk-nu...@yahoo.com |
Re: [jira] Updated: (NUTCH-627) Minimize host address lookup |
Thu, 10 Apr, 15:41 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-570) Improvement of URL Ordering in Generator.java |
Thu, 10 Apr, 21:08 |
| Sandeep Tata |
Fetcher2 Reduce Phase Question |
Fri, 11 Apr, 21:25 |
| Andrzej Bialecki |
Re: Fetcher2 Reduce Phase Question |
Fri, 11 Apr, 22:32 |
| Amit Kumar Verma |
Keywords in documents |
Fri, 11 Apr, 22:35 |
| ogjunk-nu...@yahoo.com |
Re: Keywords in documents |
Sat, 12 Apr, 03:45 |
| Otis Gospodnetic (JIRA) |
[jira] Created: (NUTCH-628) Host database to keep track of host-level information |
Sat, 12 Apr, 04:21 |
| Otis Gospodnetic (JIRA) |
[jira] Created: (NUTCH-629) Detect slow and timeout servers and drop their URLs |
Sat, 12 Apr, 07:07 |
| Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-629) Detect slow and timeout servers and drop their URLs |
Sat, 12 Apr, 07:09 |
| ogjunk-nu...@yahoo.com |
Wiki -> email -> nutch-dev? |
Sun, 13 Apr, 03:55 |
| Dennis Kubes |
Re: Wiki -> email -> nutch-dev? |
Sun, 13 Apr, 14:52 |
| ogjunk-nu...@yahoo.com |
Re: Wiki -> email -> nutch-dev? |
Mon, 14 Apr, 01:41 |
| Dennis Kubes |
Re: Wiki -> email -> nutch-dev? |
Mon, 14 Apr, 05:04 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-629) Detect slow and timeout servers and drop their URLs |
Mon, 14 Apr, 19:47 |
| ogjunk-nu...@yahoo.com |
Re: Wiki -> email -> nutch-dev? |
Mon, 14 Apr, 20:16 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Mon, 14 Apr, 20:23 |
| Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Tue, 15 Apr, 16:17 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Tue, 15 Apr, 16:51 |
| Otis Gospodnetic (JIRA) |
[jira] Updated: (NUTCH-628) Host database to keep track of host-level information |
Thu, 17 Apr, 05:27 |
| Otis Gospodnetic (JIRA) |
[jira] Issue Comment Edited: (NUTCH-628) Host database to keep track of host-level information |
Thu, 17 Apr, 05:41 |
| Otis Gospodnetic (JIRA) |
[jira] Issue Comment Edited: (NUTCH-628) Host database to keep track of host-level information |
Thu, 17 Apr, 05:43 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Thu, 17 Apr, 08:11 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-442) Integrate Solr/Nutch |
Thu, 17 Apr, 14:15 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Thu, 17 Apr, 15:03 |
| Doğacan Güney (JIRA) |
[jira] Assigned: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Thu, 17 Apr, 15:03 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Thu, 17 Apr, 15:03 |
| Otis Gospodnetic (JIRA) |
[jira] Issue Comment Edited: (NUTCH-628) Host database to keep track of host-level information |
Thu, 17 Apr, 20:39 |
| Otis Gospodnetic (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Fri, 18 Apr, 15:32 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Fri, 18 Apr, 15:48 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Fri, 18 Apr, 18:40 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Fri, 18 Apr, 18:50 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Fri, 18 Apr, 18:52 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Fri, 18 Apr, 18:52 |
| ogjunk-nu...@yahoo.com |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Fri, 18 Apr, 22:05 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-596) ParseSegments parse content even if its not CrawlDatum.STATUS_FETCH_SUCCESS |
Sat, 19 Apr, 04:10 |
| ogjunk-nu...@yahoo.com |
Fw: [jira] Closed: (INFRA-1583) Wiki => email not working for Nutch wiki |
Sat, 19 Apr, 19:49 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Sat, 19 Apr, 21:38 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Sat, 19 Apr, 22:07 |
| ogjunk-nu...@yahoo.com |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Sun, 20 Apr, 00:50 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Sun, 20 Apr, 20:56 |
| ogjunk-nu...@yahoo.com |
Re: Fetching inefficiency |
Mon, 21 Apr, 20:40 |
| Ken Krugler |
Re: Fetching inefficiency |
Tue, 22 Apr, 00:06 |
| ogjunk-nu...@yahoo.com |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Tue, 22 Apr, 05:46 |
| Andrzej Bialecki |
Re: [jira] Commented: (NUTCH-628) Host database to keep track of host-level information |
Tue, 22 Apr, 08:48 |
| Apache Wiki |
[Nutch Wiki] Update of "GettingNutchRunningWithDebian" by StevenHayles |
Tue, 22 Apr, 13:12 |
| Apache Wiki |
[Nutch Wiki] Update of "FetchCycleOverlap" by OtisGospodnetic |
Wed, 23 Apr, 05:58 |
| Apache Wiki |
[Nutch Wiki] Update of "FetchCycleOverlap" by OtisGospodnetic |
Thu, 24 Apr, 04:03 |
| Apache Wiki |
[Nutch Wiki] Update of "Nutch2Architecture" by DennisKubes |
Thu, 24 Apr, 19:42 |
| Apache Wiki |
[Nutch Wiki] Update of "Nutch2Architecture" by DennisKubes |
Thu, 24 Apr, 19:49 |
| i...@web2seo.com |
Nutch 2 Architecture |
Fri, 25 Apr, 01:22 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-442) Integrate Solr/Nutch |
Sun, 27 Apr, 09:40 |
| Dennis Kubes |
Re: Nutch 2 Architecture |
Tue, 29 Apr, 05:33 |
| taknev ivrok (JIRA) |
[jira] Created: (NUTCH-630) Error caused by index-more plugin in the latest svn revision - 652259 |
Wed, 30 Apr, 15:21 |