| Karsten Dello (JIRA) |
[jira] Created: (NUTCH-424) CLONE - Problem persists with Nutch 0.8.1 (Nekohtml 0.9.4) - NekoHTML's DOMFragmentParser hangs on certain URLs |
Mon, 01 Jan, 21:27 |
| Karsten Dello (JIRA) |
[jira] Commented: (NUTCH-424) CLONE - Problem persists with Nutch 0.8.1 (Nekohtml 0.9.4) - NekoHTML's DOMFragmentParser hangs on certain URLs |
Mon, 01 Jan, 21:42 |
| thomasa...@gmx.net |
database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 00:00 |
| Toufeeq Hussain |
Re: [Search-l] database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 01:50 |
| Zaheed Haque |
Re: database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 09:44 |
| thomasa...@gmx.net |
Re: database exchange of 2 nutches (hybridity of nutch with yacy) |
Tue, 02 Jan, 18:07 |
| Alan Tanaman |
New index-extra plugin and patch to IndexFilters |
Tue, 02 Jan, 10:24 |
|
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
|
| nutch.newbie (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 11:03 |
| Alan Tanaman |
RE: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 11:52 |
| Alan Tanaman (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Tue, 02 Jan, 22:57 |
| Thomas Müller |
Re: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Wed, 03 Jan, 06:35 |
| Alan Tanaman |
RE: [jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Wed, 03 Jan, 12:09 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Fri, 12 Jan, 20:39 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Fri, 12 Jan, 20:51 |
| Alan Tanaman (JIRA) |
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Mon, 15 Jan, 10:52 |
| Alan Tanaman |
Creating Lucence Compound Index |
Tue, 02 Jan, 12:57 |
| Andrzej Bialecki |
Re: Creating Lucence Compound Index |
Tue, 02 Jan, 13:07 |
| Alan Tanaman |
RE: Creating Lucence Compound Index |
Tue, 02 Jan, 13:34 |
| Andrzej Bialecki |
Re: Creating Lucence Compound Index |
Tue, 02 Jan, 14:06 |
| Alan Tanaman |
RE: Creating Lucence Compound Index |
Tue, 02 Jan, 14:12 |
|
Nutch Programmer Wanted |
|
| Nutch User |
Nutch Programmer Wanted |
Tue, 02 Jan, 21:24 |
| Chee Wu |
nutch81 pages seems were not kept but no error message found |
Wed, 03 Jan, 12:30 |
| Meghna Kukreja |
Bug in Nutch, possibly due to issues-273 and 322 |
Wed, 03 Jan, 19:03 |
| Andrzej Bialecki |
Re: Bug in Nutch, possibly due to issues-273 and 322 |
Wed, 03 Jan, 19:50 |
|
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
|
| Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 04 Jan, 09:30 |
| Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:28 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:46 |
| Dogacan Güney (JIRA) |
[jira] Commented: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 09 Jan, 08:42 |
|
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
|
| Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 04 Jan, 09:30 |
| Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Mon, 08 Jan, 15:28 |
| Dogacan Güney (JIRA) |
[jira] Updated: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Tue, 09 Jan, 08:42 |
| srinath |
Issues Starting Hadoop Process in Nutch0.9l.1 |
Thu, 04 Jan, 17:00 |
| st...@archive.org (JIRA) |
[jira] Created: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 17:21 |
| st...@archive.org (JIRA) |
[jira] Updated: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 19:05 |
| st...@archive.org (JIRA) |
[jira] Commented: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Thu, 04 Jan, 19:14 |
| st...@archive.org (JIRA) |
[jira] Created: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:12 |
| st...@archive.org (JIRA) |
[jira] Commented: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:14 |
| st...@archive.org (JIRA) |
[jira] Updated: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Thu, 04 Jan, 20:14 |
| Armel Nene (JIRA) |
[jira] Created: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 05 Jan, 14:44 |
| Armel Nene (JIRA) |
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 05 Jan, 15:11 |
| Armel T. Nene |
protocol-smb: a new protocol plugin for Windows Shares |
Fri, 05 Jan, 15:22 |
|
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
|
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 05 Jan, 15:56 |
| Armel Nene (JIRA) |
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 05 Jan, 16:02 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-426) parse-js skips parsing if found URL fails java.net.URL parse |
Fri, 05 Jan, 17:01 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-425) parse-js pollutes anchor text with base URL of source page |
Fri, 05 Jan, 17:01 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-325) UrlFilters.java throws NPE in case urlfilter.order contains Filters that are not in plugin.includes |
Sat, 06 Jan, 09:44 |
| Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-421) Allow predeterminate running order of index filters |
Sat, 06 Jan, 10:36 |
| Sami Siren (JIRA) |
[jira] Assigned: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic |
Sat, 06 Jan, 10:36 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-421) Allow predeterminate running order of index filters |
Sat, 06 Jan, 20:01 |
| J. Delgado |
Job Opportunity (Sunnyvale, CA) |
Wed, 10 Jan, 03:20 |
| Piyush (JIRA) |
[jira] Created: (NUTCH-428) NullPointerException |
Wed, 10 Jan, 14:57 |
| DS jha |
sort result on different set of terms |
Wed, 10 Jan, 15:02 |
| Dennis Kubes |
Re: sort result on different set of terms |
Thu, 11 Jan, 20:54 |
| DS jha |
Re: sort result on different set of terms |
Fri, 12 Jan, 16:40 |
| Dennis Kubes |
Re: sort result on different set of terms |
Fri, 12 Jan, 21:40 |
| Piyush (JIRA) |
[jira] Created: (NUTCH-429) Secured Searches |
Thu, 11 Jan, 20:08 |
| Piotr Kosiorowski (JIRA) |
[jira] Closed: (NUTCH-429) Secured Searches |
Thu, 11 Jan, 20:48 |
| Andrzej Bialecki (JIRA) |
[jira] Closed: (NUTCH-420) DeleteDuplicates.HashPartitioner depends on the order of IndexDocs |
Thu, 11 Jan, 22:02 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-428) NullPointerException |
Fri, 12 Jan, 22:16 |
| Sami Siren (JIRA) |
[jira] Created: (NUTCH-430) integer overflow in HashComparator.compare |
Sat, 13 Jan, 23:07 |
| Sami Siren (JIRA) |
[jira] Updated: (NUTCH-430) integer overflow in HashComparator.compare |
Sat, 13 Jan, 23:09 |
| Scott Green |
How can I get one plugin's root dir |
Mon, 15 Jan, 02:40 |
| Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:14 |
| Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:33 |
| Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:44 |
| Dennis Kubes |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:44 |
| Scott Green |
Re: How can I get one plugin's root dir |
Mon, 15 Jan, 17:51 |
| Sami Siren |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:19 |
| Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:31 |
| Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 15:44 |
| Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 16:33 |
| Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 16:55 |
| Scott Green |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 17:39 |
| Andrzej Bialecki |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 19:27 |
| Doug Cutting |
Re: How can I get one plugin's root dir |
Tue, 16 Jan, 20:16 |
| Scott Green |
Re: How can I get one plugin's root dir |
Wed, 17 Jan, 03:03 |
| Dennis Kubes |
Re: How can I get one plugin's root dir |
Wed, 17 Jan, 17:54 |
|
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
|
| Armel Nene (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Mon, 15 Jan, 10:12 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Wed, 17 Jan, 18:38 |
| Andrzej Bialecki (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Wed, 17 Jan, 19:34 |
| Sami Siren (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Wed, 17 Jan, 20:15 |
| Armel Nene (JIRA) |
[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Thu, 18 Jan, 10:00 |
| Sami Siren (JIRA) |
[jira] Resolved: (NUTCH-430) integer overflow in HashComparator.compare |
Mon, 15 Jan, 15:05 |