incubator-connectors-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From conflue...@apache.org
Subject [CONF] Apache Connectors Framework > How to Build and Deploy ManifoldCF
Date Thu, 04 Nov 2010 16:40:00 GMT
Space: Apache Connectors Framework (https://cwiki.apache.org/confluence/display/CONNECTORS)
Page: How to Build and Deploy ManifoldCF (https://cwiki.apache.org/confluence/display/CONNECTORS/How+to+Build+and+Deploy+ManifoldCF)
Comment: https://cwiki.apache.org/confluence/display/CONNECTORS/How+to+Build+and+Deploy+ManifoldCF?focusedCommentId=24185380#comment-24185380

Comment added by Karl Wright:
---------------------------------------------------------------------

My overall count was larger, because my solr and lucene had been compiled and built. 
I counted both folders and files in my docs/second calculations.  So my system was
performing about 2x as fast as yours.

In reply to a comment by Farzad:
The trunk of Solr has 14,752 files and 9,269 folders.  The job completd in 24 mintues and
11 seconds, or 1451 seconds.  I'm getting a rate of 16.6 items / sec. If I use only the files
the rate is 10.2 files / sec.  Did you use the total count or file count?

Is there a tool we can both use to compare the systems. Do you have ManifoldCF, database,
and appserver running off the same disk?  I first had the data on the same disk as Manifold,
then I moved it to a network drive, and the crawl time went up by 35 seconds.

My only goal at this point is to achieve your results on my system.

Change your notification preferences: https://cwiki.apache.org/confluence/users/viewnotifications.action

Mime
View raw message