nifi-commits mailing list archives

From "Bryan Bende (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NIFI-1174) Create a Put HBase processor that can put multiple cells
Date Wed, 18 Nov 2015 22:36:10 GMT

     [ https://issues.apache.org/jira/browse/NIFI-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bryan Bende updated NIFI-1174:
------------------------------
    Attachment: NIFI-1174.patch

Attaching a patch that adds a PutHBaseJSON processor, which adds a row to HBase using the
elements of the JSON as the column qualifiers and values. For now it only supports a single
JSON document as input (the processor does operate on a configurable batch of FlowFiles, though)
and only flat elements. If the value of an element is an array, it is skipped and not sent
in the row.
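
As a rough illustration of the mapping described above (not the code in the patch, which is a
NiFi processor), the following Python sketch turns one flat JSON document into column
qualifier/value pairs and skips array values. The function name json_to_columns is hypothetical,
and skipping nested objects and serializing non-string values as JSON text are assumptions made
for the sketch.

```python
import json


def json_to_columns(document: str) -> dict:
    """Map a single flat JSON object to {column qualifier: value as text}."""
    parsed = json.loads(document)
    if not isinstance(parsed, dict):
        raise ValueError("expected a single JSON object")

    columns = {}
    for name, value in parsed.items():
        # Arrays are skipped, as the message describes.
        # Skipping nested objects as well is an assumption of this sketch.
        if isinstance(value, (list, dict)):
            continue
        # Assumption: strings are kept as-is, other scalars become JSON text.
        columns[name] = value if isinstance(value, str) else json.dumps(value)
    return columns


if __name__ == "__main__":
    doc = '{"name": "sensor-1", "reading": 42, "tags": ["a", "b"]}'
    print(json_to_columns(doc))
```

Every surviving pair would become one cell of the same row, so the whole row can be written in
one put.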

Probably pushing my luck, but it would be nice to get this into 0.4.0 for two reasons:
* It refactors the HBaseClientService API a little bit to better support adding a whole row,
and it will be harder to change the API after we release. We can easily add other variations
after this such as CSV, or anything else.
* We can support a lot more HBase use cases by being able to add a whole row at a time

> Create a Put HBase processor that can put multiple cells
> --------------------------------------------------------
>
>                 Key: NIFI-1174
>                 URL: https://issues.apache.org/jira/browse/NIFI-1174
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Bryan Bende
>            Assignee: Bryan Bende
>            Priority: Minor
>         Attachments: NIFI-1174.patch
>
>
> We recently added a PutHBaseCell processor which works great for writing one individual
> cell at a time, but it can require a significant amount of work in a flow to create a row
> with multiple cells.
> We should support a variation of this processor that can accept a flow file with key/value
> pairs in the content of the flow file (possibly JSON). The key/value pairs are then turned
> into the cells for the given row and added in one put operation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
