manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: [VOTE] Release Apache ManifoldCF 1.5, RC7
Date Thu, 06 Feb 2014 11:07:25 GMT
Oh - the other obvious reason for no indexing is that there many be no
changes in the document.

Karl



On Thu, Feb 6, 2014 at 5:50 AM, Karl Wright <daddywri@gmail.com> wrote:

> The document *will* be sent to the output connector if "Decided to ingest"
> is logged, so the problem must be in the solr connector itself.  The
> following code on the solr side is executed:
>
>     if (Logging.ingest.isDebugEnabled())
>       Logging.ingest.debug("indexPost(): '" + documentURI + "'");
>
>     // The SOLR connector cannot deal with folder-level security at this
> time.  If they are seen, reject the document.
>     if (document.countDirectoryACLs() != 0)
>       return false;
>
>     // If the document is too long, reject it.
>     if (maxDocumentLength != null && document.getBinaryLength() >
> maxDocumentLength.longValue())
>       return false;
>
> Otherwise, if a problem is seen, there should be a manifoldcf.log entry,
> and also a simple history entry.  Please look there first, and if that
> doesn't work, turn on ingest debugging to be sure we get to this point.
>
> Karl
>
>
>
> On Thu, Feb 6, 2014 at 4:46 AM, Erlend GarĂ¥sen <e.f.garasen@usit.uio.no>wrote:
>
>>
>> We're still having problems with this release on our test server. It runs
>> stable and does not hang anymore, but nothing gets sent to Solr. Since
>> there was a problem with the SSL certificate in previous RCs, maybe there
>> is a similar problem related to the Solr Output Connector? We have
>> configured the same certificate in order to post documents to Solr.
>>
>> I get entries like this in manifoldcf.log which indicates that documents
>> should be indexed, but they aren't:
>> DEBUG 2014-02-06 10:28:06,609 (Worker thread '29') - WEB: Decided to
>> ingest 'http://www.ibsen.uio.no/varia.xhtml'
>>
>> In Simple history, only fetch activities are shown. Any suggestions how
>> to debug what's really going on? I can try to turn on debug logging for
>> Httpclient in case that helps.
>>
>> Erlend
>>
>>
>> On 2/4/14 1:33 PM, Karl Wright wrote:
>>
>>> This is a major release of ManifoldCF that includes the following:
>>>
>>> - Federated authority support
>>> - Multiple authorization domains
>>> - ZooKeeper process coordination
>>> - Multiple agents processes
>>> - Support for SharePoint Claims-based authorization
>>> - An Email connector
>>> - A revamped look-and-feel
>>>
>>> Voting will remain open for 3 days.
>>>
>>> You can download the artifacts from
>>> http://people.apache.org/~kwright/apache-manifoldcf-1.5 .  There is
>>> also a
>>> release tag at
>>> https://svn.apache.org/repos/asf/manifoldcf/tags/release-1.5-RC7 .
>>>
>>> This RC includes changes to the dist directory organization so that jar
>>> files are not duplicated, saving 40MB from each binary download.  It also
>>> fixes an issue with connection limits in the zookeeper example.  Finally,
>>> it fixes a limitation in the CMIS connector (CONNECTORS-864) and a maven
>>> build problem (CONNECTORS-865).  Also fixes CONNECTORS-866 (the lockclean
>>> script), and two more Maven version issues.  Finally, corrects a LiveLink
>>> connector reversion described in CONNECTORS-871.  Missing SolrJ
>>> dependencies in CONNECTORS-873.  Workaround for SolrJ runtime exception
>>> being thrown in CONNECTORS-874.  Throttling lockup dealt with, improved,
>>> and tested in CONNECTORS-872.
>>>
>>> Karl
>>>
>>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message