nifi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andy LoPresto <alopre...@apache.org>
Subject Re: CsvToAttributes processor
Date Fri, 28 Oct 2016 03:34:01 GMT
Hi François,

I hope this is what you were looking for. If you do not get the entire thread via this email,
you can see the thread in a web view here [1].

[1] https://lists.apache.org/thread.html/ffa390534d35056d3ad8ab5116f25665b73687855214afe95fcf6cab@%3Cdev.nifi.apache.org%3E
<https://lists.apache.org/thread.html/ffa390534d35056d3ad8ab5116f25665b73687855214afe95fcf6cab@%3Cdev.nifi.apache.org%3E>

Andy LoPresto
alopresto@apache.org
alopresto.apache@gmail.com
PGP Fingerprint: 70EC B3E5 98A6 5A3F D3C4  BACE 3C6E F65B 2F7D EF69

> On Oct 27, 2016, at 6:31 AM, François Prunier <francois.prunier@hurence.com> wrote:
> 
> --------------7FEEA278B796C52DD32D150C
> Content-Type: text/plain; charset=utf-8; format=flowed
> Content-Transfer-Encoding: 8bit
> 
> Hello again nifi folks,
> 
> I did not get a direct reply to my email below. However, I've since
> noticed in the mailing list archive that some of you have kindly
> replied, although the emails did not make it to my inbox !
> 
> I wasn't part of the mailing list at the time, I am now, I guess that's
> why I did not got the responses, it still seems a bit weird though... (*).
> 
> Anyway, could someone reply to the thread and include my email so I can
> answer each of your comments while keeping the threading 'clean' ?
> 
> Thanks !
> 
> François
> 
> *: Maybe something the admins should look into, as some people might
> fire off an email to the list, see no answers and assume no one replied
> to them !
> 
> On 19/10/2016 11:10, François Prunier wrote:
>> 
>> Hello Nifi folks,
>> 
>> I've built a processor to parse CSV files with headers and turn each
>> line in a flowfile. Each resulting flowfile has as many attributes as
>> the number of columns. Each attributes has the name of a column with
>> the corresponding value for the line.
>> 
>> For example, this CSV file:
>> 
>> |col1,col2,col3 a,b,c d,e,f |
>> 
>> would generate two flowfiles with the following attributes:
>> 
>> |col1 = a col2 = b col3 = c |
>> 
>> and
>> 
>> |col1 = d col2 = e col3 = f |
>> As of now, you can configure the charset plus delimiter, quote and
>> escape character. It's based on the commons-csv parser.
>> 
>> It's very handy if you want to, for example, index a CSV file into
>> elasticsearch.
>> 
>> Would you guys be interested in a pull request to add this processor
>> to the main code base ? It needs a bit more documentation and cleanup
>> that I would need to add in but it's already successfully used in
>> production.
>> 
>> Best regards,
>> --
>> *François Prunier
>> * *Hurence* - /Vos experts Big Data/
>> http://www.hurence.com
>> *mobile:* +33 6 38 68 60 50
>> 
> 
> --
> *François Prunier
> * *Hurence* - /Vos experts Big Data/
> http://www.hurence.com
> *mobile:* +33


Mime
View raw message