incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Update of "PinotProposal" by kishoreg
Date Tue, 13 Feb 2018 07:49:51 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The "PinotProposal" page has been changed by kishoreg:
https://wiki.apache.org/incubator/PinotProposal?action=diff&rev1=1&rev2=2

  
  == Abstract ==
  
- Pinot is a distributed columnar storage engine that can ingest data in real-time and serve
analytical queries at low latency. There are two modes of data ingestion - batch and/or realtime.
Batch mode allows users to generate pinot segments externally using systems such as Hadoop.
These segments can be uploaded into Pinot via simple curl calls. Pinot can ingest data in
near real-time from streaming sources such as Kafka. Data ingested into Pinot is stored in
a columnar format. Pinot provides a SQL like interface (PQL) that supports filters, aggregations
and group by operations. It does not support joins by design, in order to guarantee predictable
latency. It leverages other Apache projects such as Zookeeper, Kafka, and Helix, along with
many libraries from the ASF.
+ Pinot is a distributed columnar storage engine that can ingest data in real-time and serve
analytical queries at low latency. There are two modes of data ingestion - batch and/or realtime.
Batch mode allows users to generate pinot segments externally using systems such as Hadoop.
These segments can be uploaded into Pinot via simple curl calls. Pinot can ingest data in
near real-time from streaming sources such as Kafka. Data ingested into Pinot is stored in
a columnar format. Pinot provides a SQL like interface (PQL) that supports filters, aggregations,
and group by operations. It does not support joins by design, in order to guarantee predictable
latency. It leverages other Apache projects such as Zookeeper, Kafka, and Helix, along with
many libraries from the ASF.
  
  == Proposal ==
  
  Pinot was open sourced by LinkedIn and hosted on GitHub. Majority of the development happens
at LinkedIn with other contributions from Uber and Slack. We believe that being a part of
Apache Software Foundation will improve the diversity and help form a strong community around
the project.
  
- LinkedIn submits this proposal to donate the code base to Apache Software Foundation. The
code is already under Apache License 2.0.  Code and the documentation is hosted on Github.
+ LinkedIn submits this proposal to donate the code base to Apache Software Foundation. The
code is already under Apache License 2.0.  Code and the documentation are hosted on Github.
- Code: http://github.com/linkedin/pinot
+ * Code: http://github.com/linkedin/pinot
- Documentation: https://github.com/linkedin/pinot/wiki
+ * Documentation: https://github.com/linkedin/pinot/wiki
  
  
  == Background ==
  
- LinkedIn, similar to other companies, has many applications that provide rich real-time
insights to members and customers (internal and external). The workload characteristics for
these applications vary a lot. Some internal applications simply need ad hoc query capabilities
with sub second to multiple seconds latency. But external site facing applications require
strong SLA even very high workloads. Prior to Pinot, LinkedIn had multiple solutions depending
on the workload generated by the application and this was inefficient. Pinot was developed
to be the one single platform that addresses all classes of applications. Today at LinkedIn,
Pinot powers more than 50 site facing products with workload ranging from few queries per
second to 1000’s of queries per second while maintaining the 99th percentile latency which
can be as low as few milliseconds. All internal dashboards at LinkedIn are powered by Pinot.
+ LinkedIn, similar to other companies, has many applications that provide rich real-time
insights to members and customers (internal and external). The workload characteristics for
these applications vary a lot. Some internal applications simply need ad-hoc query capabilities
with sub-second to multiple seconds latency. But external site facing applications require
strong SLA even very high workloads. Prior to Pinot, LinkedIn had multiple solutions depending
on the workload generated by the application and this was inefficient. Pinot was developed
to be the one single platform that addresses all classes of applications. Today at LinkedIn,
Pinot powers more than 50 site facing products with workload ranging from few queries per
second to 1000’s of queries per second while maintaining the 99th percentile latency which
can be as low as few milliseconds. All internal dashboards at LinkedIn are powered by Pinot.
  
  == Rationale ==
  
@@ -29, +29 @@

  
  == Meritocracy ==
  
- Following the Apache meritocracy model, we intend to build a open and diverse community
around Pinot. We will encourage the community to contribute in discussion and codebase.
+ Following the Apache meritocracy model, we intend to build an open and diverse community
around Pinot. We will encourage the community to contribute in discussion and codebase.
  
  == Community ==
  
@@ -85, +85 @@

  
  Dependencies from the ASF Category A
  === Apache License 2.0 ===
- com.101tec:zkclient:0.7
+ * com.101tec:zkclient:0.7
- com.alibaba:fastjson:1.1.24
+ * com.alibaba:fastjson:1.1.24
- com.clearspring.analytics:stream:2.7.0
+ * com.clearspring.analytics:stream:2.7.0
- com.fasterxml.jackson.core:jackson-annotations:2.8.0
+ * com.fasterxml.jackson.core:jackson-annotations:2.8.0
- com.fasterxml.jackson.core:jackson-core:2.8.0
+ * com.fasterxml.jackson.core:jackson-core:2.8.0
- com.fasterxml.jackson.core:jackson-databind:2.8.0
+ * com.fasterxml.jackson.core:jackson-databind:2.8.0
- com.google.code.findbugs:jsr305:3.0.0
+ * com.google.code.findbugs:jsr305:3.0.0
- com.google.guava:guava:19
+ * com.google.guava:guava:19
- com.ning:async-http-client:1.9.21
+ * com.ning:async-http-client:1.9.21
- com.yammer.metrics:metrics-core:2.2.0
+ * com.yammer.metrics:metrics-core:2.2.0
- commons-beanutils:commons-beanutils:1.8.3
+ * commons-beanutils:commons-beanutils:1.8.3
- commons-cli:commons-cli:1.2
+ * commons-cli:commons-cli:1.2
- commons-codec:commons-codec:1.6
+ * commons-codec:commons-codec:1.6
- commons-configuration:commons-configuration:1.6
+ * commons-configuration:commons-configuration:1.6
- commons-fileupload:commons-fileupload:1.2.2
+ * commons-fileupload:commons-fileupload:1.2.2
- commons-httpclient:commons-httpclient:3.1
+ * commons-httpclient:commons-httpclient:3.1
- commons-io:commons-io:2.1
+ * commons-io:commons-io:2.1
- commons-validator:commons-validator:1.4.0
+ * commons-validator:commons-validator:1.4.0
- io.netty:netty-all:4.1.4.Final
+ * io.netty:netty-all:4.1.4.Final
- io.swagger:swagger-jaxrs:1.5.10
+ * io.swagger:swagger-jaxrs:1.5.10
- io.swagger:swagger-jersey2-jaxrs:1.5.10
+ * io.swagger:swagger-jersey2-jaxrs:1.5.10
- it.unimi.dsi:fastutil:6.5.16
+ * it.unimi.dsi:fastutil:6.5.16
- joda-time:joda-time:2
+ * joda-time:joda-time:2
- log4j:log4j:1.2.17
+ * log4j:log4j:1.2.17
- me.lemire.integercompression:JavaFastPFOR:0.0.13
+ * me.lemire.integercompression:JavaFastPFOR:0.0.13
- nl.jqno.equalsverifier:equalsverifier:1.7.2
+ * nl.jqno.equalsverifier:equalsverifier:1.7.2
- org.apache.avro:avro:1.7.6
+ * org.apache.avro:avro:1.7.6
- org.apache.commons:commons-compress:1.9
+ * org.apache.commons:commons-compress:1.9
- org.apache.commons:commons-lang3:3.5
+ * org.apache.commons:commons-lang3:3.5
- org.apache.commons:commons-math:2.1
+ * org.apache.commons:commons-math:2.1
- org.apache.hadoop:hadoop-client:2.7.0
+ * org.apache.hadoop:hadoop-client:2.7.0
- org.apache.hadoop:hadoop-common:2.7.0
+ * org.apache.hadoop:hadoop-common:2.7.0
- org.apache.helix:helix-core:0.6.8
+ * org.apache.helix:helix-core:0.6.8
- org.apache.httpcomponents:httpclient:4.1.3
+ * org.apache.httpcomponents:httpclient:4.1.3
- org.apache.httpcomponents:httpclient:4.2.5
+ * org.apache.httpcomponents:httpclient:4.2.5
- org.apache.httpcomponents:httpcore:4.2.5
+ * org.apache.httpcomponents:httpcore:4.2.5
- org.apache.httpcomponents:httpmime:4.2.5
+ * org.apache.httpcomponents:httpmime:4.2.5
- org.apache.kafka:kafka_2.10:0.9.0.1
+ * org.apache.kafka:kafka_2.10:0.9.0.1
- org.apache.thrift:libthrift:0.9.1
+ * org.apache.thrift:libthrift:0.9.1
- org.apache.zookeeper:zookeeper:3.4.9
+ * org.apache.zookeeper:zookeeper:3.4.9
- org.codehaus.jackson:jackson-core-asl:1.9.6
+ * org.codehaus.jackson:jackson-core-asl:1.9.6
- org.codehaus.jackson:jackson-mapper-asl:1.9.6
+ * org.codehaus.jackson:jackson-mapper-asl:1.9.6
- org.json:json:20080701
+ * org.json:json:20080701
- org.roaringbitmap:RoaringBitmap:0.5.10
+ * org.roaringbitmap:RoaringBitmap:0.5.10
- org.testng:testng:6.0.1
+ * org.testng:testng:6.0.1
- org.twitter4j:twitter4j-core:4.0.3
+ * org.twitter4j:twitter4j-core:4.0.3
- org.webjars:swagger-ui:2.2.2
+ * org.webjars:swagger-ui:2.2.2
- org.xerial.larray:larray:0.2.1
+ * org.xerial.larray:larray:0.2.1
- org.yaml:snakeyaml:1.16
+ * org.yaml:snakeyaml:1.16
- xml-apis:xml-apis:1.0.b2
+ * xml-apis:xml-apis:1.0.b2
  === Dual license (Apache License 2.0 + LGPL 2.1), using under the Apache License ===
- org.codehaus.jackson:jackson-jaxrs:1.9.6
+ * org.codehaus.jackson:jackson-jaxrs:1.9.6
- org.codehaus.jackson:jackson-xc:1.9.6
+ * org.codehaus.jackson:jackson-xc:1.9.6
  === BSD ===
- com.jcabi:jcabi-log:0.17.1
+ * com.jcabi:jcabi-log:0.17.1
- org.antlr:antlr4-annotations:4.3
+ * org.antlr:antlr4-annotations:4.3
- org.antlr:antlr4-runtime:4.3
+ * org.antlr:antlr4-runtime:4.3
  === MIT ===
- com.github.nkzawa:socket.io-client:0.5.1
+ * com.github.nkzawa:socket.io-client:0.5.1
- org.mockito:mockito-core:2.10.0
+ * org.mockito:mockito-core:2.10.0
- org.slf4j:slf4j-api:1.7.7
+ * org.slf4j:slf4j-api:1.7.7
- org.slf4j:slf4j-log4j12:1.7.7
+ * org.slf4j:slf4j-log4j12:1.7.7
  
  === Dependencies from the ASF Category B ===
  Dual license (CDDL 1.1 + GPL 2 w/ CPE), using under the CDDL
- com.sun.jersey:jersey-client:1.19.2
+ * com.sun.jersey:jersey-client:1.19.2
- javax.servlet:javax.servlet-api:3.0.1
+ * javax.servlet:javax.servlet-api:3.0.1
- org.glassfish.jersey.containers:jersey-container-grizzly2-http:2.23
+ * org.glassfish.jersey.containers:jersey-container-grizzly2-http:2.23
- org.glassfish.jersey.core:jersey-common:2.23
+ * org.glassfish.jersey.core:jersey-common:2.23
- org.glassfish.jersey.core:jersey-server:2.23
+ * org.glassfish.jersey.core:jersey-server:2.23
- org.glassfish.jersey.media:jersey-media-json-jackson:2.24
+ * org.glassfish.jersey.media:jersey-media-json-jackson:2.24
- org.glassfish.jersey.media:jersey-media-multipart:2.23
+ * org.glassfish.jersey.media:jersey-media-multipart:2.23
  
  === Dependencies from the ASF Category X ===
  JSON License
- org.json:json:20080701 (to be removed before Apache incubation)
+ * org.json:json:20080701 (to be removed before Apache incubation)
  
  
  == Cryptography ==

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org


Mime
View raw message