incubator-cassandra-user mailing list archives

From java8964 java8964 <>
Subject SSTableReader support files in Distribute File System
Date Thu, 05 Sep 2013 20:30:28 GMT
I don't think Cassandra currently supports this, but if it does, can someone tell me how?
If not, is it reasonable to request this feature, and where should I submit the request?
I am thinking that the SSTableReader class, and its underlying supporting classes, should be able
to handle files in a DFS, such as CFS (Cassandra File System), HDFS (Hadoop Distributed File System),
or Amazon S3. Basically, if I pass a file URI such as "hdfs://xxxxxx/xxx.db_file" or "S3://xxxxx/xxxx.db_file",
then as long as a library supporting that DFS is available at runtime, SSTableReader should
just work and read the Data, Index, etc. files from there. After all, they are all InputStreams.
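To make the idea concrete, here is a minimal sketch of the kind of scheme-based dispatch I have in mind. It is not existing Cassandra code: the class name SchemeStreamOpener, the Opener interface, and the register/open methods are all hypothetical names made up for illustration. Only the local "file" scheme is wired up, using the standard java.nio API; an "hdfs" or "s3" opener would have to be registered by whatever DFS client library is on the classpath.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Locale;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical helper (not part of Cassandra): maps URI schemes to stream openers. */
public class SchemeStreamOpener {

    /** Opens a readable stream for a URI of one particular scheme. */
    @FunctionalInterface
    public interface Opener {
        InputStream open(URI uri) throws IOException;
    }

    private final Map<String, Opener> openers = new ConcurrentHashMap<>();

    public SchemeStreamOpener() {
        // Local files are always available through java.nio.
        register("file", uri -> Files.newInputStream(Paths.get(uri)));
    }

    /** Registers an opener for a scheme, e.g. one backed by a DFS client library. */
    public void register(String scheme, Opener opener) {
        openers.put(scheme.toLowerCase(Locale.ROOT), opener);
    }

    /** Looks up the opener for the URI's scheme and delegates to it. */
    public InputStream open(URI uri) throws IOException {
        String scheme = uri.getScheme();
        if (scheme == null) {
            throw new IOException("URI has no scheme: " + uri);
        }
        Opener opener = openers.get(scheme.toLowerCase(Locale.ROOT));
        if (opener == null) {
            throw new IOException("No opener registered for scheme: " + scheme);
        }
        return opener.open(uri);
    }

    public static void main(String[] args) throws IOException {
        // Demonstrate with a temporary local file standing in for a Data.db file.
        Path tmp = Files.createTempFile("example-Data", ".db");
        Files.write(tmp, new byte[] {1, 2, 3});
        SchemeStreamOpener opener = new SchemeStreamOpener();
        try (InputStream in = opener.open(tmp.toUri())) {
            System.out.println("first byte: " + in.read()); // prints "first byte: 1"
        } finally {
            Files.delete(tmp);
        }
    }
}
```

Note that this only covers sequential reads through an InputStream. Whether that is enough for everything SSTableReader does internally (for example, seeking within a file) is exactly the part I am unsure about.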
The reason I am asking is that in my project I want to parse SSTable files in an MR
job, but reusing SSTableReader is hard because internally it only uses the File class.
Are there any technical reasons that would make it hard to support SSTable files stored in a DFS?