hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 易剑 <myhad...@gmail.com>
Subject Use DTS instead of DFS for data warehouse
Date Thu, 04 Feb 2010 08:40:52 GMT
*Glossary*
DTS: Distributed Table System, not a bigtable
DFS: Distributed File System


DFS is better for unstructed data, but DTS is better for structed data, data
warehouse is structed, so I think a table is better than a file. DTS is
following:
1. Break a logic big table into a many physical small table
2. The same size blocks is not necessary
3. The order of blocks is not  necessary
4. Only store structed data
5. Support block indexes
6. Support deleting and updating
7. The interfaces are SQL, but only a block
8. Spliting a table horizontally and vertically is supported at the same
time
9. 。。。

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message