incubator-cvs mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Trivial Update of "HRdfStoreProposal" by udanax
Date Thu, 06 Mar 2008 02:53:15 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/incubator/HRdfStoreProposal

------------------------------------------------------------------------------
  == Abstract ==
- HrdfStore will develop a Planet-Scale RDF Data Store based on [http://hadoop.apache.org
Hadoop] & [http://hadoop.apache.org/hbase Hbase].
+ !HrdfStore will develop a Planet-Scale RDF Data Store based on [http://hadoop.apache.org
Hadoop] & [http://hadoop.apache.org/hbase Hbase].
  
  == Proposal ==
- HrdfStore will develop a Hadoop subsystem for RDF, called HrdfStore, which uses Hbase +
MapReduce to store RDF data and execute queries (e.g., SPARQL) on them.
+ !HrdfStore will develop a Hadoop subsystem for RDF, called !HrdfStore, which uses Hbase
+ !MapReduce to store RDF data and execute queries (e.g., SPARQL) on them.
  
  == Background ==
  We can store very sparse RDF data in a single table in Hbase, with as many columns as the
data needs. For example, we might make a row for each RDF subject in a table and store all of its
properties and their values as columns in that row. This avoids costly self-joins when a query
asks several questions about the same subject, so such queries are processed efficiently, although
we still need self-joins to answer RDF path queries.
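
  As a rough illustration of the layout described above, the following in-memory sketch groups
triples into one row per subject. It does not use the Hbase API; the sample triples and the function
names (build_subject_table, properties_of, follow_path) are hypothetical and chosen only for this
example.

```python
from collections import defaultdict

# Hypothetical sample triples (subject, property, value); illustrative only.
TRIPLES = [
    ("alice", "name", "Alice"),
    ("alice", "knows", "bob"),
    ("bob", "name", "Bob"),
    ("bob", "worksAt", "acme"),
]


def build_subject_table(triples):
    """Group triples into one row per subject, one column per property."""
    table = defaultdict(lambda: defaultdict(list))
    for subject, prop, value in triples:
        table[subject][prop].append(value)
    return table


def properties_of(table, subject, props):
    """Answer a same-subject query with a single row lookup (no join)."""
    row = table.get(subject, {})
    return {p: list(row.get(p, [])) for p in props}


def follow_path(table, subject, path):
    """Answer a path query; each hop looks up another row (a self-join)."""
    current = [subject]
    for prop in path:
        next_hop = []
        for s in current:
            next_hop.extend(table.get(s, {}).get(prop, []))
        current = next_hop
    return current
```

  In this sketch, asking for several properties of one subject touches a single row, while each
hop of a path query requires another row lookup, which mirrors the self-join cost mentioned above.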
  
- We can further accelerate query performance by using MapReduce for parallel, distributed
query processing. 
+ We can further accelerate query performance by using !MapReduce for parallel, distributed
query processing. 
  
  == Rationale ==
- === HRDF Data Loader ===
+ === HrdfStore Data Loader ===
  
- HRDF Data Loader (HDL) reads RDF data from a file, and organizes the data into a Hbase table
in such a way that efficient query processing is possible. In Hbase, we can store everything
in a single table. The sparsity of RDF data is not a problem, because Hbase, which is a column-based
store and adopts various compression techniques, is very good at dealing with nulls in the
table.
+ !HrdfStore Data Loader (HDL) reads RDF data from a file, and organizes the data into a Hbase
table in such a way that efficient query processing is possible. In Hbase, we can store everything
in a single table. The sparsity of RDF data is not a problem, because Hbase, which is a column-based
store and adopts various compression techniques, is very good at dealing with nulls in the
table.
  
- === HRDF Query Processor ===
+ === HrdfStore Query Processor ===
  
- HRDF Query Processor (HQP) executes RDF queries on RDF data stored in a Hbase table. It
translates RDF queries into API calls to Hbase or into MapReduce jobs, then gathers the results
and returns them to the user.
+ !HrdfStore Query Processor (HQP) executes RDF queries on RDF data stored in a Hbase table.
It translates RDF queries into API calls to Hbase or into MapReduce jobs, then gathers the results
and returns them to the user.
  
  Query processing steps are as follows:
  
@@ -26, +26 @@

  -> Physical operator tree -> Execution}}}
  
  Implementation of each step may proceed as an individual issue.
- === HRDF Data Materializer ===
+ === HrdfStore Data Materializer ===
  
- HRDF Data Materializer (HDM) pre-computes RDF path queries and stores the results into a
Hbase table. Later, HQP uses those materialized data for efficient processing of RDF path
queries. 
+ !HrdfStore Data Materializer (HDM) pre-computes RDF path queries and stores the results
into a Hbase table. Later, HQP uses those materialized data for efficient processing of RDF
path queries. 
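
  The idea of pre-computing path queries can be sketched as follows. This is only an in-memory
illustration under assumed names (materialize_path, the sample table), not the proposal's actual
design and not Hbase API code: it follows a fixed property path from every subject once and keeps
the results so that later lookups need no joins.

```python
def materialize_path(table, path):
    """Precompute, for every subject, the values reached by following `path`.

    `table` maps subject -> {property -> [values]}; `path` is a sequence of
    property names. Subjects with no result for the path are omitted.
    """
    materialized = {}
    for subject in table:
        frontier = [subject]
        for prop in path:
            # One hop: replace each node by the values of `prop` in its row.
            frontier = [v for s in frontier for v in table.get(s, {}).get(prop, [])]
        if frontier:
            materialized[subject] = frontier
    return materialized


# Hypothetical sample data, one row per subject.
sample_table = {
    "alice": {"knows": ["bob"]},
    "bob": {"worksAt": ["acme"]},
}
knows_works_at = materialize_path(sample_table, ("knows", "worksAt"))
```

  A query for the "knows" then "worksAt" path of a subject then becomes a single lookup in the
pre-computed mapping, at the cost of extra storage and of recomputation when the data changes.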
  
  == Current Status ==
  
@@ -40, +40 @@

  
  === Community ===
  
- HrdfStore seeks to develop developer and user communities during incubation. 
+ !HrdfStore seeks to develop developer and user communities during incubation. 
  
  == Core Developers ==
  
@@ -51, +51 @@

  
  == Alignment ==
  
- The developers of HrdfStore want to work with the Apache Software Foundation specifically
because Apache has proven to provide a strong foundation and set of practices for developing
standards-based infrastructure and server components. 
+ The developers of !HrdfStore want to work with the Apache Software Foundation specifically
because Apache has proven to provide a strong foundation and set of practices for developing
standards-based infrastructure and server components. 
  
  == Known Risks ==
  === Orphaned products ===
@@ -64, +64 @@

  === Reliance on Salaried Developers ===
  Edward is a full-time open source developer at NHN, and Inchul is a Ph.D student in computer
science. 
  === Relationships with Other Apache Products ===
- HrdfStore has a strong relationship with Apache [http://hadoop.apache.org Hadoop] &
[http://hadoop.apache.org/hbase Hbase]. Being part of Apache could foster closer collaboration
among the three projects.
+ !HrdfStore has a strong relationship with Apache [http://hadoop.apache.org Hadoop] &
[http://hadoop.apache.org/hbase Hbase]. Being part of Apache could foster closer collaboration
among the three projects.
  
  === An Excessive Fascination with the Apache Brand ===
  

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org

