lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Savory (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-579) Extend SimplePost with RecurseDirectories, threads, document encoding , number of docs per commit
Date Wed, 21 May 2008 08:47:55 GMT

    [ https://issues.apache.org/jira/browse/SOLR-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12598589#action_12598589
] 

Andrew Savory commented on SOLR-579:
------------------------------------

If everyone has to write their own "ComplexPost" tool in order to populate their instances
of Solr, it would be a waste of time and effort. Isn't there a requirement for a general purpose
reusable tool, and wouldn't it's natural home be the Solr project? (Anything that helps people
in adopting solr is a good thing, after all.)

Perhaps this could extend SimplePost and live as a tool alongside the python, ruby and solrj
APIs?

> Extend SimplePost with RecurseDirectories, threads, document encoding , number of docs
per commit
> -------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-579
>                 URL: https://issues.apache.org/jira/browse/SOLR-579
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 1.3
>         Environment: Applies to all platforms
>            Reporter: Patrick Debois
>            Priority: Minor
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> -When specifying a directory, simplepost should read also the contents of a  directory
> New options for the commandline (some only usefull in DATAMODE= files)
> -RECURSEDIRS
>         Recursive read of directories as an option, this is usefull for directories with
a lot of files where the commandline expansion fails and xargs is too slow
> -DOCENCODING (default = system encoding or UTF-8) 
>         For non utf-8 clients , simplepost should include a way to set the encoding of
the documents posted
> -THREADSIZE (default =1 ) 
>         For large volume posts, a threading pool makes sense , using JDK 1.5 Threadpool
model
> -DOCSPERCOMMIT (default = 1)
>         Number of documents after which a commit is done, instead of only at the end
> Note: not to break the existing behaviour of the existing SimplePost tool (post.sh) might
be used in scripts 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message