hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erik Steffl (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HDFS-1448) Create multi-format parser for edits logs file, support binary and XML formats initially
Date Fri, 08 Oct 2010 23:14:31 GMT

    [ https://issues.apache.org/jira/browse/HDFS-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12919412#action_12919412
] 

Erik Steffl commented on HDFS-1448:
-----------------------------------

Patch HDFS-1448-0.22.patch implements binary and XML parser for edits file.

Binary file editsStored should go to src/test/hdfs/org/apache/hadoop/hdfs/tools/offlineEditsViewer/editsStored.

Unit test: ant -Dtestcase=TestOfflineEditsViewer -Dtest.output=yes test

Usage:

  $HADOOP_HOME/bin/hdfs oev -i rrr.xml -o rrr.bin -v

File type: *.xml file is parsed as XML file, all other files are treated as binary.

  -i name specifies input file name (xml or binary)
  -o name specifies output file name (xml or binary)
  -v means verbose, prints the filenames and XML output to screen (if XML output, otherwise
nothing)

> Create multi-format parser for edits logs file, support binary and XML formats initially
> ----------------------------------------------------------------------------------------
>
>                 Key: HDFS-1448
>                 URL: https://issues.apache.org/jira/browse/HDFS-1448
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: tools
>    Affects Versions: 0.22.0
>            Reporter: Erik Steffl
>            Priority: Minor
>             Fix For: 0.22.0
>
>         Attachments: editsStored, HDFS-1448-0.22.patch
>
>
> Create multi-format parser for edits logs file, support binary and XML formats initially.
> Parsing should work from any supported format to any other supported format (e.g. from
binary to XML and from XML to binary).
> The binary format is the format used by FSEditLog class to read/write edits file.
> Primary reason to develop this tool is to help with troubleshooting, the binary format
is hard to read and edit (for human troubleshooters).
> Longer term it could be used to clean up and minimize parsers for fsimage and edits files.
Edits parser OfflineEditsViewer is written in a very similar fashion to OfflineImageViewer.
Next step would be to merge OfflineImageViewer and OfflineEditsViewer and use the result in
both FSImage and FSEditLog. This is subject to change, specifically depending on adoption
of avro (which would completely change how objects are serialized as well as provide ways
to convert files to different formats).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message