lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nicolas Paris <nicolas.pa...@riseup.net>
Subject Re: POS Tagger
Date Fri, 25 Oct 2019 15:56:52 GMT
Also we are using stanford POS tagger for french. The processing time is
mitigated by the spark-corenlp package which distribute the process over
multiple node.

Also I am interesting in the way you use POS information within solr
queries, or solr fields. 

Thanks,
On Fri, Oct 25, 2019 at 10:42:43AM -0400, David Hastings wrote:
> ah, yeah its not the fastest but it proved to be the best for my purposes,
> I use it to pre-process data before indexing, to apply more metadata to the
> documents in a separate field(s)
> 
> On Fri, Oct 25, 2019 at 10:40 AM Audrey Lorberfeld -
> Audrey.Lorberfeld@ibm.com <Audrey.Lorberfeld@ibm.com> wrote:
> 
> > No, I meant for part-of-speech tagging __ But that's interesting that you
> > use StanfordNLP. I've read that it's very slow, so we are concerned that it
> > might not work for us at query-time. Do you use it at query-time, or just
> > index-time?
> >
> > --
> > Audrey Lorberfeld
> > Data Scientist, w3 Search
> > IBM
> > Audrey.Lorberfeld@IBM.com
> >
> >
> > On 10/25/19, 10:30 AM, "David Hastings" <hastings.recursive@gmail.com>
> > wrote:
> >
> >     Do you mean for entity extraction?
> >     I make a LOT of use from the stanford nlp project, and get out the
> > entities
> >     and use them for different purposes in solr
> >     -Dave
> >
> >     On Fri, Oct 25, 2019 at 10:16 AM Audrey Lorberfeld -
> >     Audrey.Lorberfeld@ibm.com <Audrey.Lorberfeld@ibm.com> wrote:
> >
> >     > Hi All,
> >     >
> >     > Does anyone use a POS tagger with their Solr instance other than
> >     > OpenNLP’s? We are considering OpenNLP, SpaCy, and Watson.
> >     >
> >     > Thanks!
> >     >
> >     > --
> >     > Audrey Lorberfeld
> >     > Data Scientist, w3 Search
> >     > IBM
> >     > Audrey.Lorberfeld@IBM.com
> >     >
> >     >
> >
> >
> >

-- 
nicolas

Mime
View raw message