nutch-dev mailing list archives: April 2011

Apache Hudson Server Build failed in Jenkins: Nutch-trunk #1443 Fri, 01 Apr, 04:02
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-897) Subcollection requires blacklist element Fri, 01 Apr, 13:05
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-897) Subcollection requires blacklist element Fri, 01 Apr, 13:33
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-897) Subcollection requires blacklist element Fri, 01 Apr, 13:35
Markus Jelsma Clean up open legacy issues in Jira Fri, 01 Apr, 14:03
Markus Jelsma (JIRA) [jira] [Commented] (NUTCH-973) Remove Segment Merger in 1.3 Fri, 01 Apr, 14:05
Markus Jelsma Re: java.sql.BatchUpdateException after fetch and wrong WebPage.protocolStatus in trunk Fri, 01 Apr, 14:11
Mattmann, Chris A (388J) Re: Clean up open legacy issues in Jira Fri, 01 Apr, 14:20
Julien Nioche (JIRA) [jira] [Closed] (NUTCH-973) Remove Segment Merger in 1.3 Fri, 01 Apr, 14:21
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-13) If dns points to 127.0.0.1, the url is also crawled Fri, 01 Apr, 14:27
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-36) Chinese in Nutch Fri, 01 Apr, 14:27
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-18) Windows servers include illegal characters in URLs Fri, 01 Apr, 14:27
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-39) pagination in search result Fri, 01 Apr, 14:27
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-79) Fault tolerant searching. Fri, 01 Apr, 14:29
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-103) Vivisimo like treeview and url redirect Fri, 01 Apr, 14:29
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-83) Release deliverable as zip Fri, 01 Apr, 14:29
David Escuer (JIRA) [jira] [Commented] (NUTCH-18) Windows servers include illegal characters in URLs Fri, 01 Apr, 14:31
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-180) Performance problem with widely used keywords Fri, 01 Apr, 14:31
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-132) Add ability to sort on more than one column Fri, 01 Apr, 14:31
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-104) Nutch query parser does not support CJK bi-gram segmentation. Fri, 01 Apr, 14:31
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-144) corrupt language identifier tri files and bad language recognition for german Fri, 01 Apr, 14:31
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin Fri, 01 Apr, 14:33
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-581) DistributedSearch does not update search servers added to search-servers.txt on the fly Fri, 01 Apr, 14:33
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-775) Enhance Searcher interface Fri, 01 Apr, 14:33
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-377) Add possibility to search for multiple values Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-674) NutchBean doesn't check for searcher.dir existance. Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-423) Add other index-basic fields as query plugins Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-541) Index url field untokenized Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-265) Getting Clustered results in better form. Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-47) Configure host filter to do wildcard prefixes - *.redhat.com Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-542) Null Pointer Exception on getSummary when segment no longer exists Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-943) Search Results default dedup field "site" should be stored in index. Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-470) Adding optional terms to a query Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-480) Searching multiple indexes with a single nutch instance Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-72) Query basic filter with correction feature Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-466) Flexible segment format Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-294) Topic-maps of related searchwords Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-540) some problem about the Nutch cache Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-941) Search returns blank page, when there is more than one SOLR server configured Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-355) The title of query result could like the summary have the highlight?? Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-453) Move stop words to a config file Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-260) Three new plugins that parse, index and query meta tags defined in the configuration Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-820) Infinite loop when hitspersite is set Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-708) NutchBean: OOM due to searcher.max.hits and dedup. Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-92) DistributedSearch incorrectly scores results Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-445) Domain İndexing / Query Filter Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-638) Launching Distributed Searchers with URI indicating filesystem to use rather than relying on hadoop config files. Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-479) Support for OR queries Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-764) Add support for vfsfile:// loading of plugins for JBoss Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-455) dedup on tokenized fields is faulty Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-386) Plugin to index categories by url rules Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Updated] (NUTCH-573) Multiple Domains - Query Search Fri, 01 Apr, 14:35
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-294) Topic-maps of related searchwords Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-72) Query basic filter with correction feature Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-540) some problem about the Nutch cache Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-943) Search Results default dedup field "site" should be stored in index. Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-480) Searching multiple indexes with a single nutch instance Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-47) Configure host filter to do wildcard prefixes - *.redhat.com Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-466) Flexible segment format Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9 Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-573) Multiple Domains - Query Search Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-541) Index url field untokenized Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-820) Infinite loop when hitspersite is set Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-674) NutchBean doesn't check for searcher.dir existance. Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-445) Domain İndexing / Query Filter Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-708) NutchBean: OOM due to searcher.max.hits and dedup. Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-455) dedup on tokenized fields is faulty Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-92) DistributedSearch incorrectly scores results Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-479) Support for OR queries Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-265) Getting Clustered results in better form. Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-470) Adding optional terms to a query Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-423) Add other index-basic fields as query plugins Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-542) Null Pointer Exception on getSummary when segment no longer exists Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-377) Add possibility to search for multiple values Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-941) Search returns blank page, when there is more than one SOLR server configured Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-355) The title of query result could like the summary have the highlight?? Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-638) Launching Distributed Searchers with URI indicating filesystem to use rather than relying on hadoop config files. Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-453) Move stop words to a config file Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-386) Plugin to index categories by url rules Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-260) Three new plugins that parse, index and query meta tags defined in the configuration Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-764) Add support for vfsfile:// loading of plugins for JBoss Fri, 01 Apr, 14:37
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-343) Index MP3 SHA1 hashes Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-316) Confusion about query languages Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-299) Bittorrent Parser Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-300) Clustering API improvements Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-389) a url tokenizer implementation for tokenizing index fields : url and host Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-352) Add jar command to bin/nutch to allow launching hadoop job jars Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-396) mergesegs sorts URLs, making segments useless for subsequent fetch Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-326) WordExtractor throws java.util.NoSuchElementException on some documents Fri, 01 Apr, 14:41
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-358) Language Switching PROBLEM FIXED Fri, 01 Apr, 14:41
Markus Jelsma Re: Clean up open legacy issues in Jira Fri, 01 Apr, 14:49
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-26) New Http Authentication mechanism Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-162) country code "jp" is used instead of language code "ja" for Japanese Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-164) Locale (language) choice by first session has global effect to all sessions Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-259) Problem in IndexSorter after dedup Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-251) Administration GUI Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-283) If the Fetcher times out and abandons Fetcher Threads, severe errors will occur on those Threads Fri, 01 Apr, 14:57
Markus Jelsma (JIRA) [jira] [Closed] (NUTCH-48) "Did you mean" query enhancement/refignment feature request Fri, 01 Apr, 14:57