lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kenneth Glidden <kenglid...@gmail.com>
Subject Re: How to convert Japanese Numerals to Arabic Numerals
Date Tue, 12 Aug 2014 16:17:53 GMT
Agreed


On Tue, Aug 12, 2014 at 12:13 PM, Itamar Syn-Hershko <itamar@code972.com>
wrote:

> If a regex based approach would work for you, it is super easy to create a
> TokenFilter that executes this regex on tokens (assuming proper
> tokenization was done). However, the real-world use-cases are probably way
> more complex than that, so for any non-trivial use case I'd go with Steve's
> answer
>
> --
>
> Itamar Syn-Hershko
> http://code972.com | @synhershko <https://twitter.com/synhershko>
> Freelance Developer & Consultant
> Author of RavenDB in Action <http://manning.com/synhershko/>
>
>
> On Tue, Aug 12, 2014 at 7:09 PM, Kenneth Glidden <kenglidden@gmail.com>
> wrote:
>
> > Thanks.  I'm familiar with the regex approach, but am wondering if
> someone
> > has cooked up a char or token filter that auto-magically takes care of
> > this.
> >
> >
> > On Tue, Aug 12, 2014 at 11:57 AM, Mark Bennett <
> > mark.bennett@lucidworks.com>
> > wrote:
> >
> > > I'd think a Regex based filter could do this.  In Solr it'd be
> something
> > > you'd put in the analyzer chain; are you using Solr or Lucene?
> > >
> > > --
> > > Mark Bennett / LucidWorks: Search & Big Data /
> > mark.bennett@lucidworks.com
> > > Office: 408-898-4201 / Telecommute: 408-733-0387 / Cell: 408-829-6513
> > >
> > > On Aug 12, 2014, at 8:45 AM, Kenneth Glidden <kenglidden@gmail.com>
> > wrote:
> > >
> > > > Is there a filter (or other option that I'm missing) that can convert
> > > > Japanese numerals to Arabic numerals?
> > > >
> > > > e.g. Convert *二五〇* to *250*.
> > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message