lucene-dev mailing list archives

From "Olivier Favre (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-3071) PathHierarchyTokenizer adaptation for urls: splits reversed
Date Wed, 04 May 2011 16:10:03 GMT
PathHierarchyTokenizer adaptation for urls: splits reversed
-----------------------------------------------------------

                 Key: LUCENE-3071
                 URL: https://issues.apache.org/jira/browse/LUCENE-3071
             Project: Lucene - Java
          Issue Type: New Feature
          Components: contrib/analyzers
            Reporter: Olivier Favre
            Priority: Minor


{{PathHierarchyTokenizer}} should be usable to split URLs in a "reversed" way (useful for
faceted search against URLs):
{{www.site.com}} -> {{www.site.com, site.com, com}}
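A minimal sketch of the requested reversed split, in plain Java with no Lucene dependency. The class name {{ReversedHierarchySketch}} and the method {{reversedSplit}} are hypothetical and only illustrate the expected token sequence; they are not the tokenizer's actual API:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only, not part of Lucene.
public class ReversedHierarchySketch {

    // Emits the whole input, then every suffix that starts right after a delimiter.
    static List<String> reversedSplit(String input, char delimiter) {
        List<String> tokens = new ArrayList<>();
        tokens.add(input);
        int idx = input.indexOf(delimiter);
        while (idx >= 0) {
            tokens.add(input.substring(idx + 1));
            idx = input.indexOf(delimiter, idx + 1);
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints the tokens for the example given in this issue.
        System.out.println(reversedSplit("www.site.com", '.'));
    }
}
```

A real tokenizer would presumably also set offsets and position increments for each token, which this sketch leaves out.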

It should also be able to skip a given number of leading (or trailing, if reversed) tokens.
For example, {{/usr/share/doc/somesoftware/INTERESTING/PART}}
should give, with 4 tokens skipped:
{{INTERESTING}}
{{INTERESTING/PART}}
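The skip behaviour could be sketched as follows, again in plain Java without Lucene. {{SkipHierarchySketch}} and {{splitWithSkip}} are hypothetical names. The sketch assumes that empty components (such as the one before a leading delimiter) are dropped and do not count towards the skipped tokens, which is what the example above implies:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only, not part of Lucene.
public class SkipHierarchySketch {

    // Splits the path into components, skips the first `skip` of them,
    // then emits each growing prefix of the remaining components.
    static List<String> splitWithSkip(String path, char delimiter, int skip) {
        List<String> parts = new ArrayList<>();
        for (String p : path.split(Pattern.quote(String.valueOf(delimiter)))) {
            if (!p.isEmpty()) {
                parts.add(p); // assumption: empty components are ignored
            }
        }
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = Math.max(0, skip); i < parts.size(); i++) {
            if (current.length() > 0) {
                current.append(delimiter);
            }
            current.append(parts.get(i));
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints the tokens for the example given in this issue.
        System.out.println(
            splitWithSkip("/usr/share/doc/somesoftware/INTERESTING/PART", '/', 4));
    }
}
```

Skipping more tokens than the path contains yields an empty list in this sketch; whether the tokenizer should behave the same way is left open here.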

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

