orc-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From omalley <...@git.apache.org>
Subject [GitHub] orc pull request: ORC-8. Reimplement file-metadata to use the read...
Date Fri, 15 Jan 2016 21:13:45 GMT
Github user omalley commented on the pull request:

    https://github.com/apache/orc/pull/15#issuecomment-172093974
  
    For comparison, here is a before and after for examples/decimal.orc:
    
    ```
    Structure for ../examples/decimal.orc
    
     === Postscript ===
    footerLength: 89
    compression: NONE
    version: 0
    version: 12
    metadataLength: 44
    magic: "ORC"
    
     === Footer ===
    headerLength: 3
    contentLength: 16186
    stripes {
      offset: 3
      indexLength: 54
      dataLength: 16079
      footerLength: 50
      numberOfRows: 6000
    }
    types {
      kind: STRUCT
      subtypes: 1
      fieldNames: "_col0"
    }
    types {
      kind: DECIMAL
      precision: 10
      scale: 5
    }
    numberOfRows: 6000
    statistics {
      numberOfValues: 6000
    }
    statistics {
      numberOfValues: 4000
      decimalStatistics {
        minimum: "-1000.5"
        maximum: "1999.2"
        sum: "1998301.099"
      }
    }
    rowIndexStride: 10000
    
    === Stripe Statistics ===
    Stripe 1:
    offset: 3
    indexLength: 54
    dataLength: 16079
    footerLength: 50
    numberOfRows: 6000
        Stream: column 0 section 6 start: 3 length 7
        Stream: column 1 section 6 start: 10 length 47
        Stream: column 1 section 0 start: 57 length 12
        Stream: column 1 section 1 start: 69 length 13673
        Stream: column 1 section 5 start: 13742 length 2394
        Encoding column 0: 0
        Encoding column 1: 2
    File length: 16337 bytes
    Padding length: 0 bytes
    Padding ratio: 0.00 %
    ```
    
    and 
    
    ```
    { "name": "../examples/decimal.orc",
      "type": "struct<_col0:decimal(10,5)>",
      "rows": 6000,
      "stripe count": 1,
      "format": "0.12", "writer version": "original",
      "compression": "none",
      "file length": 16337,
      "content": 16186, "stripe stats": 44, "footer": 89, "postscript": 17,
      "row index stride": 10000,
      "user metadata": {
      },
      "stripes": [
        { "stripe": 0, "rows": 6000,
          "offset": 3, "length": 16183,
          "index": 54, "data": 16079, "footer": 50
        }
      ]
    }
    ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

Mime
View raw message