phoenix-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PHOENIX-2417) Compress memory used by row key byte[] of guideposts
Date Wed, 20 Jan 2016 19:57:39 GMT

    [ https://issues.apache.org/jira/browse/PHOENIX-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15109288#comment-15109288 ]

Hudson commented on PHOENIX-2417:
---------------------------------

FAILURE: Integrated in Phoenix-master #1091 (See [https://builds.apache.org/job/Phoenix-master/1091/])
PHOENIX-2417 Compress memory used by row key byte[] of guideposts (Ankit (jtaylor: rev 6a27cecce312002f9b7f76ffdcafcb694f5e3ea8)
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/StatisticsCollector.java
* phoenix-protocol/src/main/PGuidePosts.proto
* phoenix-core/src/main/java/org/apache/phoenix/util/UpgradeUtil.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/StatisticsWriter.java
* phoenix-core/src/main/java/org/apache/phoenix/coprocessor/MetaDataRegionObserver.java
* phoenix-core/src/main/java/org/apache/phoenix/util/PrefixByteCodec.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/StatisticsScanner.java
* phoenix-core/src/main/java/org/apache/phoenix/coprocessor/generated/PGuidePostsProtos.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/GuidePostsInfo.java
* phoenix-core/src/test/java/org/apache/phoenix/util/PrefixByteEncoderDecoderTest.java
* phoenix-core/src/main/java/org/apache/phoenix/util/PrefixByteDecoder.java
* phoenix-core/src/main/java/org/apache/phoenix/util/ByteUtil.java
* phoenix-core/src/main/java/org/apache/phoenix/execute/ScanPlan.java
* phoenix-core/src/main/java/org/apache/phoenix/iterate/BaseResultIterators.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/PTableStatsImpl.java
* phoenix-core/src/test/java/org/apache/phoenix/filter/SkipScanBigFilterTest.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/PTableImpl.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/GuidePostsInfoBuilder.java
* phoenix-core/src/main/java/org/apache/phoenix/util/PrefixByteEncoder.java
* phoenix-core/src/main/java/org/apache/phoenix/coprocessor/generated/StatCollectorProtos.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/stats/StatisticsUtil.java


> Compress memory used by row key byte[] of guideposts
> ----------------------------------------------------
>
>                 Key: PHOENIX-2417
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2417
>             Project: Phoenix
>          Issue Type: Sub-task
>            Reporter: James Taylor
>            Assignee: Ankit Singhal
>             Fix For: 4.7.0
>
>         Attachments: PHOENIX-2417.patch, PHOENIX-2417_encoder.diff, PHOENIX-2417_final.patch,
>                      PHOENIX-2417_rebased.patch, PHOENIX-2417_rebased2.patch, PHOENIX-2417_v2_wip.patch,
>                      StatsUpgrade_wip.patch
>
>
> We've found that smaller guideposts are better in terms of minimizing any increase in
> latency for point scans. However, this significantly increases the amount of memory used when
> caching the guideposts on the client. Guideposts are equidistant row keys in the form of raw
> byte[] which are likely to have a large percentage of their leading bytes in common (as they're
> stored in sorted order). We should use a simple compression technique to mitigate this. I noticed
> that Apache Parquet has a run length encoding - perhaps we can use that.
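
To illustrate the shared-leading-bytes idea in the description above, here is a minimal sketch of prefix compression over sorted row keys. It is not the code from PrefixByteEncoder.java or PrefixByteDecoder.java in this commit; the class name SharedPrefixSketch, the encode/decode methods, and the record layout (4-byte shared-prefix length, 4-byte suffix length, suffix bytes) are all assumptions made for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not Phoenix's actual encoder/decoder.
public class SharedPrefixSketch {

    // Encodes keys (assumed to be in sorted order) as a sequence of records:
    // [int sharedPrefixLength][int suffixLength][suffix bytes].
    public static byte[] encode(List<byte[]> sortedKeys) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] previous = new byte[0];
        for (byte[] key : sortedKeys) {
            // Count the leading bytes this key shares with the previous key.
            int shared = 0;
            int max = Math.min(previous.length, key.length);
            while (shared < max && previous[shared] == key[shared]) {
                shared++;
            }
            int suffixLength = key.length - shared;
            byte[] header = ByteBuffer.allocate(8).putInt(shared).putInt(suffixLength).array();
            out.write(header, 0, header.length);
            // Only the bytes that differ from the previous key are stored.
            out.write(key, shared, suffixLength);
            previous = key;
        }
        return out.toByteArray();
    }

    // Rebuilds each key from the previous key's prefix plus the stored suffix.
    public static List<byte[]> decode(byte[] encoded) {
        ByteBuffer in = ByteBuffer.wrap(encoded);
        List<byte[]> keys = new ArrayList<>();
        byte[] previous = new byte[0];
        while (in.hasRemaining()) {
            int shared = in.getInt();
            int suffixLength = in.getInt();
            // copyOf keeps the first `shared` bytes (shared <= previous.length).
            byte[] key = Arrays.copyOf(previous, shared + suffixLength);
            in.get(key, shared, suffixLength);
            keys.add(key);
            previous = key;
        }
        return keys;
    }
}
```

With fixed 4-byte length fields, a record only saves space when more than 8 leading bytes are shared; a practical encoder would presumably use variable-length integers for the two lengths so that short keys are not penalized.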



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
