manifoldcf-user mailing list archives

From "Jack Krupansky" <jack.krupan...@lucidimagination.com>
Subject Re: Setting up Solr -- commit, event notifications
Date Wed, 02 Jun 2010 14:49:39 GMT
Yes, a sophisticated app with lots of complex jobs will have to be quite smart about how it
decides to commit. The goal for LCF would be simply to supply enough job status so that such
a sophisticated app could decide that the job status warrants a commit. As I suggested, the
simplest case would be to see that all non-continuous jobs (at least those that the app cares
about) have completed.

The app end might or might not be Solr itself. It could indeed be a plug-in for Solr, or just
some other app process that has the specified context handler.

And, yes, the "commit at end of job" option is not terribly useful for complex, overlapping
job arrangements. Its primary use case is for initial evaluation of LCF. But it might be
sufficient for some simpler apps. Not all Solr apps are horribly complicated.

Maybe the option should technically be spec'ed as "commit at end of job, but only if no other
jobs are active with the Solr output connector".
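
As a rough illustration of that rule, here is a minimal sketch in Python. It is not LCF code: the `Job` record, its field names, and the function name are all hypothetical, chosen only to make the condition concrete.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Job:
    """Hypothetical job record; LCF's real job model may look quite different."""
    name: str
    output_connection: str
    active: bool


def should_commit_at_end_of_job(finished: Job, all_jobs: Iterable[Job]) -> bool:
    """Return True only if no *other* job is still active on the same output connection."""
    return not any(
        job.active
        and job.name != finished.name
        and job.output_connection == finished.output_connection
        for job in all_jobs
    )


if __name__ == "__main__":
    web = Job("web-crawl", "solr-main", active=False)
    files = Job("file-share", "solr-main", active=True)
    wiki = Job("wiki", "solr-other", active=True)
    # Another job is still writing to "solr-main", so hold the commit.
    print(should_commit_at_end_of_job(web, [web, files, wiki]))  # False
    # The only other active job uses a different connection, so commit.
    print(should_commit_at_end_of_job(web, [web, wiki]))  # True
```

Jobs on a different output connection are ignored here, which matches the wording of the proposed option but may not be what every deployment wants.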

In some cases you might only want to commit when a specific job completes. For example, maybe
a series of jobs are scheduled to run in sequence and the commit is only desired on completion
of the final job in that sequence. In that case, the option is desired at the job level rather
than for the Solr output connection itself. Is there any provision for job-specific output
connector options?

-- Jack Krupansky


From: karl.wright@nokia.com 
Sent: Wednesday, June 02, 2010 10:19 AM
To: connectors-user@incubator.apache.org 
Subject: RE: Setting up Solr


What about document cleanup on job deletion, etc.?  Overlapping job runs using the same output
connection?  We've had this discussion before; the connector can certainly have hooks added
but unless you intend to construct some kind of data structure on the Solr end that tries
to keep track of all that, you're likely not going to get quite what you are looking for.

 

Karl

 

 

From: ext Jack Krupansky [mailto:jack.krupansky@lucidimagination.com] 
Sent: Wednesday, June 02, 2010 10:15 AM
To: connectors-user@incubator.apache.org
Subject: Re: Setting up Solr

 

It would be nice to have a "commit at end of job" option for the Solr output connector. Granted,
commit policy can be a lot more complicated than that, but it is a simple use case that would
facilitate initial evaluations of LCF with Solr.

 

Thinking further ahead, it would be very useful to have "job status notification" messages
that could be sent to an app (say, a "/update/lcf-job-status" request handler) that would
note start, end, abort, and periodic status of LCF jobs. Then the app could commit as it desires
with respect to individual job completion and larger collections of jobs for different repositories.
For example, an app might wait for all non-continuous jobs to complete before committing.
That would be a more comprehensive longer-term solution for the commit problem, but the simple
end-of-job commit option would be more user-friendly in the near-term.
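
To make the notification idea concrete, here is a minimal sketch in Python of the app side, assuming notifications arrive as a job name plus one of the event names "start", "end", "abort", or "status". Those event names, the `JobStatusTracker` class, and the `continuous` flag are all assumptions for illustration; no such message format exists in LCF today.

```python
from dataclasses import dataclass, field


@dataclass
class JobStatusTracker:
    """Tracks hypothetical LCF job notifications and decides when a commit is warranted."""
    running: set = field(default_factory=set)  # non-continuous jobs currently in progress
    dirty: bool = False                        # a job has completed since the last commit

    def notify(self, job: str, event: str, continuous: bool = False) -> bool:
        """Record one notification; return True if the app should commit now."""
        if continuous:
            return False  # continuous jobs never complete, so they never gate a commit
        if event == "start":
            self.running.add(job)
        elif event == "end":
            self.running.discard(job)
            self.dirty = True
        elif event == "abort":
            self.running.discard(job)
        # Periodic "status" events (and unknown events) change nothing in this sketch.
        if self.dirty and not self.running:
            self.dirty = False
            return True
        return False


if __name__ == "__main__":
    tracker = JobStatusTracker()
    for job, event in [("a", "start"), ("b", "start"), ("a", "end"), ("b", "end")]:
        print(job, event, tracker.notify(job, event))
```

In this sketch an aborted job does not request a commit by itself, but it also does not block a commit that an earlier completed job is waiting for. That is one possible policy, not the only reasonable one.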


-- Jack Krupansky

 

From: karl.wright@nokia.com
Sent: Wednesday, June 02, 2010 9:09 AM
To: connectors-user@incubator.apache.org
Subject: RE: Setting up Solr

 

Solr has autocommit functionality built in.  Google for it and you will find out how to configure
it.

 

Karl

 

From: ext Rohan.GPatil@cognizant.com [mailto:Rohan.GPatil@cognizant.com] 
Sent: Wednesday, June 02, 2010 9:08 AM
To: connectors-user@incubator.apache.org
Subject: RE: Setting up Solr

 

Can we have a job for this? Otherwise, is there any other way? (What about on Windows? On
Linux there are cron jobs.)

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

Rohan.GPatil@cognizant.com

 

From: karl.wright@nokia.com [mailto:karl.wright@nokia.com] 
Sent: Wednesday, June 02, 2010 6:32 PM
To: connectors-user@incubator.apache.org
Subject: RE: Setting up Solr

 

You can send any argument you want by configuring the output connector.  However, the explicit
commit on every post will slow down performance of your crawls.

 

Karl

 

From: ext Rohan.GPatil@cognizant.com [mailto:Rohan.GPatil@cognizant.com] 
Sent: Wednesday, June 02, 2010 9:00 AM
To: connectors-user@incubator.apache.org
Subject: RE: Setting up Solr

 

Hi,

 

Yes, that is where I was stuck: making an explicit commit.

 

Can I send the argument commit=true while configuring the Repo connector?

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

Rohan.GPatil@cognizant.com

 

From: Jack Krupansky [mailto:jack.krupansky@lucidimagination.com] 
Sent: Wednesday, June 02, 2010 4:42 PM
To: connectors-user@incubator.apache.org
Subject: Re: Setting up Solr

 

A short Solr tutorial is here:

 

http://lucene.apache.org/solr/tutorial.html

After running an LCF job that uses a Solr output connection, be sure to manually force a Solr
"commit", for example:

 

    cd .../apache-solr-1.4.0/example/exampledocs
    java -jar post.jar
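
The same commit can also be sent over HTTP by posting Solr's XML `<commit/>` message to the update handler. The following is a minimal sketch in Python, assuming the stock Solr example server at `http://localhost:8983/solr/update`; the URL and the helper names are assumptions, so adjust them for your installation.

```python
import urllib.request

# Assumption: the stock Solr example server running on its default port.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_commit_request(update_url: str = SOLR_UPDATE_URL) -> urllib.request.Request:
    """Build (but do not send) an HTTP POST carrying Solr's XML <commit/> message."""
    return urllib.request.Request(
        update_url,
        data=b"<commit/>",
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


def commit(update_url: str = SOLR_UPDATE_URL) -> int:
    """Send the commit to Solr and return the HTTP status code."""
    with urllib.request.urlopen(build_commit_request(update_url), timeout=30) as resp:
        return resp.status
```

Calling `commit()` once after the LCF job finishes should make the newly indexed documents searchable; it requires the Solr server to be running and reachable.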


-- Jack Krupansky

 

From: Rohan.GPatil@cognizant.com
Sent: Wednesday, June 02, 2010 1:46 AM
To: connectors-user@incubator.apache.org
Subject: Setting up Solr

 

Hi,

 

I am stuck at setting up the Solr server to be used with LCF.

 

I am new to Solr.

 

Thanks & Regards,

Rohan G Patil

Cognizant  Programmer Analyst Trainee,Bangalore || Mob # +91 9535577001 

Rohan.GPatil@cognizant.com

 

      This e-mail and any files transmitted with it are for the sole use of 
      the intended recipient(s) and may contain confidential and privileged 
      information.
      If you are not the intended recipient, please contact the sender by 
      reply e-mail and destroy all copies of the original message.
      Any unauthorized review, use, disclosure, dissemination, forwarding, 
      printing or copying of this email or any action taken in reliance on this 
      e-mail is strictly prohibited and may be unlawful.
     

 


 
