|
[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters |
|
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters |
Tue, 22 May, 09:23 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters |
Wed, 23 May, 06:10 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters |
Tue, 29 May, 12:22 |
| Marcin Okraszewski (JIRA) |
[jira] Created: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch) |
Tue, 22 May, 12:18 |
|
[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch) |
|
| Marcin Okraszewski (JIRA) |
[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch) |
Tue, 22 May, 12:18 |
| Marcin Okraszewski (JIRA) |
[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch) |
Tue, 22 May, 12:20 |
| Vadim Bauer (JIRA) |
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Tue, 22 May, 12:37 |
| Otis Gospodnetic |
IntelliJ & Eclipse Lucene code styles available |
Wed, 23 May, 06:20 |
| Yakn |
Get meta name="description" and other meta tags from Content |
Wed, 23 May, 15:02 |
| Andrzej Bialecki |
Re: Get meta name="description" and other meta tags from Content |
Wed, 23 May, 16:54 |
| Nicolás Lichtmaier (JIRA) |
[jira] Created: (NUTCH-491) dedup fails with ArrayIndexOutOfBoundsException |
Wed, 23 May, 16:53 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-491) dedup fails with ArrayIndexOutOfBoundsException |
Thu, 24 May, 11:55 |
| karthik085 |
NUTCH-348 and Nutch-0.7.2 |
Thu, 24 May, 14:01 |
| Doug Cutting |
Re: NUTCH-348 and Nutch-0.7.2 |
Thu, 24 May, 16:27 |
|
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
|
| Vadim Bauer (JIRA) |
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 25 May, 21:05 |
| Vadim Bauer (JIRA) |
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. |
Fri, 25 May, 21:09 |
| Nicolás Lichtmaier (JIRA) |
[jira] Created: (NUTCH-492) java.lang.OutOfMemoryError while indexing. |
Sat, 26 May, 23:42 |
| Andrzej Bialecki (JIRA) |
[jira] Work started: (NUTCH-466) Flexible segment format |
Mon, 28 May, 09:01 |
| Gal Nitzan |
proposal for committer |
Mon, 28 May, 12:32 |
| Enis Soztutar |
Re: proposal for committer |
Tue, 29 May, 12:39 |
| Doug Cutting |
Re: proposal for committer |
Tue, 29 May, 20:45 |
| Nicolás Lichtmaier |
Plugins initialized all the time! |
Mon, 28 May, 20:47 |
| Nicolás Lichtmaier |
Re: Plugins initialized all the time! |
Mon, 28 May, 21:00 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Tue, 29 May, 15:50 |
| Briggs |
Re: Plugins initialized all the time! |
Tue, 29 May, 16:07 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Tue, 29 May, 16:52 |
| Briggs |
Re: Plugins initialized all the time! |
Tue, 29 May, 17:16 |
| Nicolás Lichtmaier |
Re: Plugins initialized all the time! |
Tue, 29 May, 20:39 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Wed, 30 May, 06:07 |
| Andrzej Bialecki |
Re: Plugins initialized all the time! |
Wed, 30 May, 11:01 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Wed, 30 May, 11:47 |
| Doğacan Güney |
Re: Plugins initialized all the time! |
Thu, 31 May, 14:02 |
| Nicolás Lichtmaier |
Re: Plugins initialized all the time! |
Thu, 31 May, 17:54 |
| Nicolás Lichtmaier |
Re: Plugins initialized all the time! |
Tue, 29 May, 21:56 |
| prem kumar |
running nutch without http proxy |
Tue, 29 May, 14:03 |
| Marcin Okraszewski |
Re: running nutch without http proxy |
Wed, 30 May, 06:03 |
| wangxu (JIRA) |
[jira] Created: (NUTCH-493) contentType parse not correctly,,,,got empty content using readseg -get |
Wed, 30 May, 00:05 |
| Chris Mattmann |
Committer |
Wed, 30 May, 13:42 |
| Manoharam Reddy |
OutOfMemoryError - Why should the while(1) loop stop? |
Wed, 30 May, 14:55 |
| Dennis Kubes |
Re: OutOfMemoryError - Why should the while(1) loop stop? |
Wed, 30 May, 15:38 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Wed, 30 May, 18:37 |
| rubdabadub |
Re: [jira] Resolved: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Thu, 31 May, 08:04 |
| Andrzej Bialecki |
Re: [jira] Resolved: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content |
Thu, 31 May, 10:18 |
| hud...@lucene.zones.apache.org |
Build failed in Hudson: Nutch-Nightly #102 |
Thu, 31 May, 07:00 |
| Manoharam Reddy |
What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ? |
Thu, 31 May, 07:07 |
| Manoharam Reddy |
How is lib-http plugin called? It is not there in plugins.include! |
Thu, 31 May, 07:10 |
| Dennis Kubes |
Re: How is lib-http plugin called? It is not there in plugins.include! |
Thu, 31 May, 15:32 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates |
Thu, 31 May, 08:52 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates |
Thu, 31 May, 08:52 |
| Doğacan Güney (JIRA) |
[jira] Created: (NUTCH-495) Unnecessary delays in Fetcher2 |
Thu, 31 May, 15:49 |
| Doğacan Güney (JIRA) |
[jira] Updated: (NUTCH-495) Unnecessary delays in Fetcher2 |
Thu, 31 May, 15:51 |
| hud...@lucene.zones.apache.org |
Hudson build is back to normal: Nutch-Nightly #103 |
Thu, 31 May, 16:56 |
|
[jira] Updated: (NUTCH-466) Flexible segment format |
|
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-466) Flexible segment format |
Thu, 31 May, 18:42 |
| Andrzej Bialecki (JIRA) |
[jira] Updated: (NUTCH-466) Flexible segment format |
Thu, 31 May, 19:55 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-486) Break searcher dependency on commons-cli |
Thu, 31 May, 19:01 |
| Doğacan Güney (JIRA) |
[jira] Commented: (NUTCH-466) Flexible segment format |
Thu, 31 May, 19:28 |
| Nicolás Lichtmaier |
Making "Hits" work as a normal List |
Thu, 31 May, 20:58 |
| Andrzej Bialecki (JIRA) |
[jira] Resolved: (NUTCH-392) OutputFormat implementations should pass on Progressable |
Thu, 31 May, 21:25 |
| Nicolás Lichtmaier |
[PATCH] Moving HitDetails construction to a constructor =) |
Thu, 31 May, 21:57 |
| Manoharam Reddy |
How to create patch? |
Fri, 01 Jun, 06:12 |