nifi-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (NIFI-3055) StandardRecordWriter can throw UTFDataFormatException
Date Mon, 06 Feb 2017 17:52:41 GMT


ASF GitHub Bot commented on NIFI-3055:

Github user jskora commented on the issue:
    Closing this PR; committed as 4f72e34 (commit 4f72e3491f2372c8c45afb96a765c1f5cdd2f07d).

> StandardRecordWriter can throw UTFDataFormatException
> -----------------------------------------------------
>                 Key: NIFI-3055
>                 URL:
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.0.0, 0.7.1
>            Reporter: Brandon DeVries
>            Assignee: Joe Skora
>            Priority: Blocker
>             Fix For: 0.8.0, 1.2.0
> StandardRecordWriter.writeRecord()[1] uses DataOutputStream.writeUTF()[2] without checking
> the length of the value to be written. If the encoded length is greater than 65535 bytes
> (2^16 - 1), you get a UTFDataFormatException "encoded string too long..."[3]. Ultimately,
> this can result in an IllegalStateException[4], -bringing a halt to the data flow- causing
> PersistentProvenanceRepository "Unable to merge <prov_journal> with other Journal Files
> due to..." WARNings.
> Several of the field values written this way are pre-defined, and thus not likely
> an issue. However, the "details" field can be populated by a processor and can be of
> arbitrary length. -Additionally, if the details field is indexed (which I didn't investigate,
> but I'm sure is easy enough to determine), then the length might be subject to the Lucene
> limit discussed in NIFI-2787-.
> [1]
> [2]
> [3]
> [4]
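
For illustration, here is a minimal, self-contained sketch of the failure described above and one possible guard against it. It is not the code from the committed fix. The class name WriteUtfLimitDemo and the helpers writeUtfFails() and truncateToUtfLimit() are hypothetical names introduced only for this example. The sketch assumes that truncating an oversized value is acceptable, which may not hold for every field.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UTFDataFormatException;
import java.util.Arrays;

// Hypothetical demo class; not part of NiFi.
final class WriteUtfLimitDemo {

    // writeUTF() stores the encoded length in an unsigned 16-bit prefix.
    static final int MAX_UTF_BYTES = 65535;

    // Bytes that one char occupies in the modified UTF-8 used by writeUTF().
    static int modifiedUtf8Length(char c) {
        if (c >= 0x0001 && c <= 0x007F) {
            return 1;
        }
        if (c <= 0x07FF) {
            return 2; // also covers '\u0000', which is encoded as two bytes
        }
        return 3; // includes each half of a surrogate pair
    }

    // Hypothetical guard: longest prefix whose encoded form fits the limit.
    static String truncateToUtfLimit(String value) {
        int bytes = 0;
        for (int i = 0; i < value.length(); i++) {
            bytes += modifiedUtf8Length(value.charAt(i));
            if (bytes > MAX_UTF_BYTES) {
                int end = i;
                // Do not leave a dangling high surrogate at the cut point.
                if (end > 0 && Character.isHighSurrogate(value.charAt(end - 1))) {
                    end--;
                }
                return value.substring(0, end);
            }
        }
        return value;
    }

    // Returns true if DataOutputStream.writeUTF() rejects the value.
    static boolean writeUtfFails(String value) throws IOException {
        try (DataOutputStream out = new DataOutputStream(new ByteArrayOutputStream())) {
            out.writeUTF(value);
            return false;
        } catch (UTFDataFormatException e) {
            return true;
        }
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for an arbitrarily long "details" value.
        char[] chars = new char[70_000];
        Arrays.fill(chars, 'x');
        String details = new String(chars);

        System.out.println("unchecked write fails: " + writeUtfFails(details));
        String safe = truncateToUtfLimit(details);
        System.out.println("truncated length: " + safe.length());
        System.out.println("truncated write fails: " + writeUtfFails(safe));
    }
}
```

The limit applies to encoded bytes, not to String.length(), so a string with fewer than 65535 characters can still fail if it contains characters that encode to two or three bytes. That is why the sketch counts bytes per character instead of comparing the string length directly.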

This message was sent by Atlassian JIRA
