incubator-drill-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tom Kiley <...@kiley.us>
Subject Re: Sequence File Suport
Date Mon, 10 Feb 2014 14:58:32 GMT
Hi Jason,

Thanks for the response.

I'm trying to apple the patch, but having a little trouble.  I'm trying to
apply it to the HEAD of the git repo.

Below is some of the output of applying the patch:

$ git apply --check r17833.patch
>
> error: cannot apply binary patch to 'sample-data/drill_hive_db/db.lck'
> without full index line
> error: sample-data/drill_hive_db/db.lck: patch does not apply
> error: cannot apply binary patch to 'sample-data/drill_hive_db/dbex.lck'
> without full index line
> error: sample-data/drill_hive_db/dbex.lck: patch does not apply
> error: cannot apply binary patch to
> 'sample-data/drill_hive_db/log/log.ctrl' without full index line
> error: sample-data/drill_hive_db/log/log.ctrl: patch does not apply
> error: cannot apply binary patch to
> 'sample-data/drill_hive_db/log/log1.dat' without full index line
> error: sample-data/drill_hive_db/log/log1.dat: patch does not apply
> error: cannot apply binary patch to
> 'sample-data/drill_hive_db/log/logmirror.ctrl' without full index line
> error: sample-data/drill_hive_db/log/logmirror.ctrl: patch does not apply

 [...]

error: cannot apply binary patch to
> 'sample-data/drill_hive_wh/kv_text/.kv1.txt.crc' without full index line
> error: sample-data/drill_hive_wh/kv_text/.kv1.txt.crc: patch does not apply
> error: cannot apply binary patch to
> 'sample-data/drill_hive_wh/testdb1.db/kv_seq/.kv1.seq.crc' without full
> index line
> error: sample-data/drill_hive_wh/testdb1.db/kv_seq/.kv1.seq.crc: patch
> does not apply
> error: cannot apply binary patch to
> 'sample-data/drill_hive_wh/testdb1.db/kv_seq/kv1.seq' without full index
> line
> error: sample-data/drill_hive_wh/testdb1.db/kv_seq/kv1.seq: patch does not
> apply


The lines I stripped are basically the same, but with different file names.


Thanks,
Tom



On Fri, Feb 7, 2014 at 12:44 PM, Jason Altekruse
<altekrusejason@gmail.com>wrote:

> Hello Tom,
>
> Steven just submitted a patch for a Hive Serde storage engine. I believe he
> successfully was able to read sequence file with this technique. We will be
> adding a native reader in the future (for improved performance), but for
> now this should be a decent way to get sequence file data into drill. He
> currently has the patch up for review, so if you are comfortable applying a
> patch, building the project and trying to read some of your data we would
> certainly appreciate feedback. It should be merged with mainline in the
> near future, which would remove the need to apply the patch.
>
> https://reviews.apache.org/r/17833/
>
> -Jason Altekruse
>
>
> On Fri, Feb 7, 2014 at 7:51 AM, Sebastian Schelter <ssc@apache.org> wrote:
>
> > There's no need to excuse for asking questions :)
> >
> >
> > On 02/07/2014 02:49 PM, Tom Kiley wrote:
> >
> >> Hello,
> >>
> >>
> >> Are there plans to support Hadoop's Sequence File (
> >> http://wiki.apache.org/hadoop/SequenceFile.)  Or are they already
> >> supported
> >> and I missed it?  I could see this being useful to use Drill on the
> output
> >> of MapReduce jobs.
> >>
> >> The sequence files I have are currently all NULL keys and JSON objects
> as
> >> the value.  Does anyone have a recommendation on converting to JSON
> files
> >> or Parquet files for Drill?  The JSON objects are generally the same
> >> format, but there may be some outliers with differences.  Some fields
> may
> >> be non-existant in some objects.
> >>
> >>
> >> Thanks,
> >> Tom
> >>
> >> P.S. Apologies for the noob questions.  I've just started looking at
> >> Drill.
> >>
> >>
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message