lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: Tokenizer for Brown Corpus?
Date Tue, 24 Feb 2015 13:29:08 GMT
This is the first mention that I have seen for that corpus on this list.

There seem to be more than a few references when I google for ""brown
corpus" lucene", such as:
https://github.com/INL/BlackLab/wiki/Blacklab-query-tool

-- Jack Krupansky

On Tue, Feb 24, 2015 at 1:40 AM, Koji Sekiguchi <koji.sekiguchi@rondhuit.com
> wrote:

> Hello,
>
> Doesn't Lucene have a Tokenizer/Analyzer for Brown Corpus?
> There doesn't seem to be such tokenizers/analyzers in Lucene.
>
> As I didn't want re-inventing the wheel, so I googled, I got
> the list of snippets that include "the quick brown fox..." :)
>
> Koji
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message