accumulo-dev mailing list archives

From "Adam Fuchs (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ACCUMULO-374) wikisearch-ingest stop list should be removed
Date Mon, 06 Feb 2012 19:37:59 GMT
wikisearch-ingest stop list should be removed
---------------------------------------------

                 Key: ACCUMULO-374
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-374
             Project: Accumulo
          Issue Type: Bug
            Reporter: Adam Fuchs
            Assignee: Adam Fuchs


The WikipediaMapper in wikisearch-ingest has a list of stop words that are presumably supposed
to be ignored (not indexed). This feature should be removed because:
1. The StopFilter code does not work. Stop words are indexed anyway. Not sure why.
2. Indexing stop words is not a significant performance concern with this type of indexing. It's
better for cross-language search, phrase search, and overall search efficiency not to use a stop
list.
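
As a rough illustration of the phrase-search point, here is a minimal sketch in plain Java. It does not use the StopFilter or WikipediaMapper code; the class name, the tokenize helper, and the stop list itself are hypothetical and chosen only for this example. It shows how a phrase made up entirely of common words leaves nothing to index once a stop list is applied.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopListSketch {
    // Hypothetical stop list for illustration; NOT the list used by WikipediaMapper.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("to", "be", "or", "not", "the", "a"));

    // Lower-cases the text, splits on non-word characters, and optionally
    // drops any term found in STOP_WORDS.
    static List<String> tokenize(String text, boolean useStopList) {
        List<String> terms = new ArrayList<>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield a leading empty string
            }
            if (useStopList && STOP_WORDS.contains(raw)) {
                continue; // term is never indexed, so it can never be matched
            }
            terms.add(raw);
        }
        return terms;
    }

    public static void main(String[] args) {
        String phrase = "To be or not to be";
        System.out.println(tokenize(phrase, false)); // prints [to, be, or, not, to, be]
        System.out.println(tokenize(phrase, true));  // prints []
    }
}
```

With the stop list enabled, a phrase query for this text has no indexed terms to match against, which is the kind of phrase-search loss that item 2 refers to.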

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
