lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ryan McKinley (JIRA)" <j...@apache.org>
Subject [jira] Updated: (SOLR-1690) JSONKeyValueTokenizerFactory -- JSON Tokenizer
Date Wed, 30 Dec 2009 18:47:30 GMT

     [ https://issues.apache.org/jira/browse/SOLR-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ryan McKinley updated SOLR-1690:
--------------------------------

    Attachment: noggit-1.0-A1.jar

This tokenizer uses noggit
http://svn.apache.org/repos/asf/labs/noggit/

> JSONKeyValueTokenizerFactory -- JSON Tokenizer
> ----------------------------------------------
>
>                 Key: SOLR-1690
>                 URL: https://issues.apache.org/jira/browse/SOLR-1690
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: noggit-1.0-A1.jar, SOLR-1690-JSONKeyValueTokenizerFactory.patch
>
>
> Sometimes it is nice to group structured data into a single field.
> This (rough) patch, takes JSON input and indexes tokens based on the key values pairs
in the json.
> For example, the text:
> {code}
>  { "hello": "world", "rank":5 }
> {code}
> gets indexed as two tokens:
> || term position | 	1 |	2 |
> || term text | 	hello:world	| rank:5 |
> || term type | 	word |	word |
> || source start,end | 	12,17	| 27,28 |

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message