| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-504) NUTCH-443 broke parsing during fetching |
Sun, 24 Jun, 10:05 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-356) Plugin repository cache can lead to memory leak |
Sun, 24 Jun, 19:05 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-505) Outlink urls should be validated |
Mon, 25 Jun, 08:09 |
| Kai_testing Middleton |
Re: [jira] Commented: (NUTCH-505) Outlink urls should be validated |
Tue, 26 Jun, 18:04 |
| Dennis Kubes (JIRA) |
[jira] Closed: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
| Dennis Kubes (JIRA) |
[jira] Resolved: (NUTCH-497) Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap |
Tue, 26 Jun, 03:35 |
|
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
|
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:40 |
| Dennis Kubes |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:45 |
| Chris Mattmann |
Re: svn commit: r550669 - in /lucene/nutch/trunk/src: java/org/apache/nutch/util/ plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/ plugin/parse-html/src/java/org/apache/nutch/parse/html/ test/org/apache/nutch/fetcher/ testresources/fetch-... |
Tue, 26 Jun, 04:49 |
|
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Tue, 26 Jun, 12:48 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 07:01 |
| Hudson (JIRA) |
[jira] Commented: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Thu, 28 Jun, 07:04 |
|
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
|
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 13:26 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 17:25 |
| Luca Rondanini |
Re-crawling Problem |
Tue, 26 Jun, 15:37 |
|
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
|
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 15:51 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 16:44 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Tue, 26 Jun, 16:56 |
| Kai_testing Middleton |
NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 00:49 |
| Doğacan Güney |
Re: NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 05:56 |
| Kai_testing Middleton |
Re: NUTCH-119 :: how hard to fix |
Wed, 27 Jun, 20:00 |
| Doğacan Güney |
Re: NUTCH-119 :: how hard to fix |
Thu, 28 Jun, 06:51 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-289) CrawlDatum should store IP address |
Wed, 27 Jun, 06:39 |
| Doğacan Güney |
JIRA email question |
Wed, 27 Jun, 07:02 |
| Doug Cutting |
Re: JIRA email question |
Wed, 27 Jun, 17:43 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Wed, 27 Jun, 07:07 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable |
Wed, 27 Jun, 07:07 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 08:40 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-499) Refactor LinkDb and LinkDbMerger to reuse code |
Wed, 27 Jun, 08:40 |
| Rob Young (JIRA) |
[jira] Updated: (NUTCH-479) Support for OR queries |
Wed, 27 Jun, 10:58 |
| Doğacan Güney (JIRA) |
[jira] Closed: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 12:47 |
| Doğacan Güney (JIRA) |
[jira] Resolved: (NUTCH-498) Use Combiner in LinkDb to increase speed of linkdb generation |
Wed, 27 Jun, 12:47 |
|
[jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly |
|
| Hudson (JIRA) |
[jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly |
Thu, 28 Jun, 07:03 |
| Doğacan Güney |
Re: [jira] Commented: (NUTCH-474) Fetcher2 sets server-delay and blocking checks incorrectly |
Thu, 28 Jun, 07:35 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 28 Jun, 15:59 |
| Tsengtan A Shuy |
problem with nutch 0.8.1 compile |
Thu, 28 Jun, 16:37 |
| Tsengtan A Shuy |
RE: problem with nutch 0.8.1 compile |
Thu, 28 Jun, 23:18 |
| Tsengtan A Shuy |
RE: problem with nutch 0.8.1 compile |
Thu, 28 Jun, 23:44 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:46 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:48 |
| Doğacan Güney (JIRA) |
[jira] Issue Comment Edited: (NUTCH-506) Nutch should delegate compression to Hadoop |
Fri, 29 Jun, 12:51 |
| Tsengtan A Shuy |
problem running "bin/nutch crawl urls -dir crawl -depth 3 -topN 50" command |
Fri, 29 Jun, 16:47 |
| Tsengtan A Shuy |
RE: problem running "bin/nutch crawl urls -dir crawl -depth 3 -topN 50" command |
Fri, 29 Jun, 17:01 |
| Oscar |
Fwd: failed to subscribe 'nutch-user' maillist |
Sat, 30 Jun, 10:58 |
| Susam Pal |
Re: failed to subscribe 'nutch-user' maillist |
Sat, 30 Jun, 11:06 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #134 |
Sun, 01 Jul, 07:00 |