lucene-solr-user mailing list archives

From Frankcis <finalxc...@gmail.com>
Subject Re: Solr does not recognize language
Date Tue, 06 May 2014 10:42:25 GMT
my pleasure!


2014-05-06 16:43 GMT+08:00 Victor Pascual [via Lucene] <
ml-node+s472066n4134885h5@n3.nabble.com>:

> Thank you very much Ahmet for your help.
> It finally worked!
>
> For anyone interested, all your hints were more than useful. I basically
> had two problems:
> - I didn't have my language detection chain in the /update/json
>   requestHandler
> - I didn't create the field where the detected language should be stored
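>
> For the second point, a minimal sketch of such a field definition in
> schema.xml. The name lang matches the langid.langField setting quoted
> further down; the string type and the indexed/stored attributes are
> assumptions, so adjust them to your own schema:
>
> ```xml
> <!-- Assumed definition: receives the language code written by the langid chain -->
> <field name="lang" type="string" indexed="true" stored="true"/>
> ```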
>
> Again, thanks for your help!
>
>
> On Mon, May 5, 2014 at 5:19 PM, Ahmet Arslan <[hidden email]> wrote:
>
> > Hi Victor,
> >
> > I don't know mysolr, but I assume you are using /update/json, so let's add
> > your chain to its defaults section.
> >
> >   <requestHandler name="/update/json" class="solr.UpdateRequestHandler">
> >
> >         <lst name="defaults">
> >          <str name="stream.contentType">application/json</str>
> >          <str name="update.chain">langid</str>
> >        </lst>
> >   </requestHandler>
> >
> >
> >
> >
> > On Monday, May 5, 2014 4:06 PM, Victor Pascual <[hidden email]> wrote:
> > Hi there,
> >
> > I'm indexing my documents using mysolr. I mainly generate a list of JSON
> > objects and then run: solr.update(documents_array,'json')
> >
> >
> >
> > On Mon, May 5, 2014 at 1:08 PM, Ahmet Arslan <[hidden email]> wrote:
> >
> > > Hi Victor,
> > >
> > > How do you index your documents? Your last config looks correct.
> > > However, if you use the data import handler, for example, you need to
> > > add update.chain there too. The same goes for the extraction request
> > > handler if you are using solr-cell.
> > >
> > > <requestHandler name="/dataimport"
> > > class="org.apache.solr.handler.dataimport.DataImportHandler">
> > >     <lst name="defaults">
> > >       <str name="config">/home/username/data-config.xml</str>
> > >       <str name="update.chain">langid</str>
> > >     </lst>
> > >   </requestHandler>
> > >
> > > By the way, the URL
> > > http://localhost:8080/solr/update?commit=true&update.chain=langid was
> > > just an example, meant for feeding XML update messages via the POST
> > > method, not for use in a browser.
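> > >
> > > A minimal sketch of such an XML update message, to be sent as the body
> > > of the POST request. The id field and the sample values are assumptions;
> > > text is the field named in langid.fl:
> > >
> > > ```xml
> > > <add>
> > >   <doc>
> > >     <!-- id is an assumed unique key field -->
> > >     <field name="id">1</field>
> > >     <!-- text is the field the langid chain inspects -->
> > >     <field name="text">Some sample text whose language should be detected</field>
> > >   </doc>
> > > </add>
> > > ```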
> > >
> > > Ahmet
> > >
> > > On Monday, May 5, 2014 11:04 AM, Victor Pascual <[hidden email]> wrote:
> > >
> > > Thank you very much for your help, Ahmet.
> > >
> > > However, the language detection is still not working. :(
> > > My solrconfig.xml didn't contain that lst section inside the update
> > > requestHandler.
> > > That's the content I added:
> > >
> > >   <requestHandler name="/update" class="solr.XmlUpdateRequestHandler">
> > >     <lst name="defaults">
> > >       <str name="update.chain">langid</str>
> > >     </lst>
> > >   </requestHandler>
> > >
> > >   <updateRequestProcessorChain name="langid">
> > >     <processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
> > >       <lst name="defaults">
> > >         <str name="langid.fl">text</str>
> > >         <str name="langid.langField">lang</str>
> > >       </lst>
> > >     </processor>
> > >     <processor class="solr.LogUpdateProcessorFactory" />
> > >     <processor class="solr.RunUpdateProcessorFactory" />
> > >   </updateRequestProcessorChain>
> > >
> > > Now, your suggested query
> > > http://localhost:8080/solr/update?commit=true&update.chain=langid returns
> > >
> > > <response>
> > >   <lst name="responseHeader">
> > >     <int name="status">0</int>
> > >     <int name="QTime">14</int>
> > >   </lst>
> > > </response>
> > >
> > > And there is still no lang field in my documents.
> > > Any idea what I am doing wrong?
> > >
> > >
> > >
> > >
> > > On Tue, Apr 29, 2014 at 5:33 PM, Ahmet Arslan <[hidden email]> wrote:
> > >
> > > >Hi,
> > > >
> > > >/solr/update should be used, not /solr/select
> > > >
> > > >curl 'http://localhost:8983/solr/update?commit=true&update.chain=langid'
> > > >
> > > >By the way, don't you have the following definition in your solrconfig.xml?
> > > >
> > > > <requestHandler name="/update" class="solr.UpdateRequestHandler">
> > > >
> > > >       <lst name="defaults">
> > > >         <str name="update.chain">langid</str>
> > > >       </lst>
> > > >  </requestHandler>
> > > >
> > > >
> > > >
> > > >
> > > >On Tuesday, April 29, 2014 4:50 PM, Victor Pascual <[hidden email]> wrote:
> > > >Hi Ahmet,
> > > >
> > > >thanks for your reply. Adding &update.chain=langid to my query doesn't
> > > >work: IP:8080/solr/select/?q=*%3A*&update.chain=langid
> > > >Regarding defining the chain in an UpdateRequestHandler... sorry for the
> > > >lame question, but shall I paste those three lines into solrconfig.xml,
> > > >or shall I add them somewhere else?
> > > >
> > > >There is no UpdateRequestHandler in my solrconfig.
> > > >
> > > >Thanks!
> > > >
> > > >
> > > >
> > > >On Tue, Apr 29, 2014 at 3:13 PM, Ahmet Arslan <[hidden email]> wrote:
> > > >
> > > >> Hi,
> > > >>
> > > >> Did you attach your chain to an UpdateRequestHandler?
> > > >>
> > > >> You can do it by adding &update.chain=langid to the URL or by defining
> > > >> it in a defaults section as follows:
> > > >>
> > > >> <lst name="defaults">
> > > >>      <str name="update.chain">langid</str>
> > > >>    </lst>
> > > >>
> > > >>
> > > >>
> > > >> On Tuesday, April 29, 2014 3:18 PM, Victor Pascual <[hidden email]> wrote:
> > > >> Dear all,
> > > >>
> > > >> I'm a new user of Solr. I've managed to index a bunch of documents
> > > >> (in fact, they are tweets) and everything works quite smoothly.
> > > >>
> > > >> Nevertheless, it looks like Solr neither detects the language of my
> > > >> documents nor removes stopwords accordingly so that I can extract the
> > > >> most frequent terms.
> > > >>
> > > >> I've added this piece of XML to my solrconfig.xml as well as the Tika
> > > >> lib jars.
> > > >>
> > > >>     <updateRequestProcessorChain name="langid">
> > > >>       <processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
> > > >>         <lst name="defaults">
> > > >>           <str name="langid.fl">text</str>
> > > >>           <str name="langid.langField">lang</str>
> > > >>         </lst>
> > > >>       </processor>
> > > >>       <processor class="solr.LogUpdateProcessorFactory" />
> > > >>       <processor class="solr.RunUpdateProcessorFactory" />
> > > >>     </updateRequestProcessorChain>
> > > >>
> > > >> There is no error in the tomcat log file, so I have no clue why this
> > > >> isn't working.
> > > >> Any hint on how to solve this problem will be much appreciated!
> > > >>
> > > >
> > > >
> > >
> >
> >
>
>




--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-does-not-recognize-language-tp4133711p4134897.html
Sent from the Solr - User mailing list archive at Nabble.com.