lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Moenieb Davids <Moenieb.Dav...@gpaa.gov.za>
Subject LineEntityProcessor | Separator --- /update/csv | OnError
Date Thu, 05 Jan 2017 15:51:38 GMT
Hi,

Just wanted to know if anybody can assist with the following scenario:
I have a pipe delimited mainframe file\s that sometimes misses certain fields in a row, which
obviously causes issues when I try the /update/csv handler.

Scenario 1:
The csv handler is quite fast, however, when it picks up a line that does not have all the
fields due to a missing delimiter, then the entire import fails.
So, is there a way to do a OnError skip type of scenario. 
I have check the 6.3 ref guide and web but no luck

Scenario 2:
I try to use a my own DIH and then configure my schema accordingly, however, I am trying to
use the separator parameter, but it seems to not be working.
It looks like the data always just goes to rawline which then means that the separator effectively
means nothing?

I am trying to not go custom too much, so does anybody know of a "standard" way of getting
the data in

Regards
Moenieb










===========================================================================
GPAA e-mail Disclaimers and confidential note 

This e-mail is intended for the exclusive use of the addressee only.
If you are not the intended recipient, you should not use the contents 
or disclose them to any other person. Please notify the sender immediately 
and delete the e-mail. This e-mail is not intended nor 
shall it be taken to create any legal relations, contractual or otherwise. 
Legally binding obligations can only arise for the GPAA by means of 
a written instrument signed by an authorised signatory.
===========================================================================
Mime
View raw message