nifi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NIFI-1280) Create FilterCSVColumns Processor
Date Wed, 11 May 2016 16:53:12 GMT

    [ https://issues.apache.org/jira/browse/NIFI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15280416#comment-15280416
] 

ASF GitHub Bot commented on NIFI-1280:
--------------------------------------

Github user markap14 commented on the pull request:

    https://github.com/apache/nifi/pull/420#issuecomment-218520423
  
    I don't think we can use the provided test data as-is. Looking at http://www.briandunning.com/sample-data/
I don't see any sort of LICENSE info. Probably best to just mock up a few simple test files
manually instead. This will also be easier for testing purposes, I think, so that we can assert
specific things in the unit test


> Create FilterCSVColumns Processor
> ---------------------------------
>
>                 Key: NIFI-1280
>                 URL: https://issues.apache.org/jira/browse/NIFI-1280
>             Project: Apache NiFi
>          Issue Type: Task
>          Components: Extensions
>            Reporter: Mark Payne
>            Assignee: Toivo Adams
>
> We should have a Processor that allows users to easily filter out specific columns from
CSV data. For instance, a user would configure two different properties: "Columns of Interest"
(a comma-separated list of column indexes) and "Filtering Strategy" (Keep Only These Columns,
Remove Only These Columns).
> We can do this today with ReplaceText, but it is far more difficult than it would be
with this Processor, as the user has to use Regular Expressions, etc. with ReplaceText.
> Eventually a Custom UI could even be built that allows a user to upload a Sample CSV
and choose which columns from there, similar to the way that Excel works when importing CSV
by dragging and selecting the desired columns? That would certainly be a larger undertaking
and would not need to be done for an initial implementation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message