drill-dev mailing list archives

From Jacques Nadeau <jacq...@dremio.com>
Subject Re: Order of records read in a parquet file
Date Fri, 06 Nov 2015 23:34:48 GMT
It should be the first five records, same as before (although the SQL does
not technically guarantee this). As soon as there is more than one file or
more than one row group, the behavior is indeterminate.
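
As a rough illustration of why a bare LIMIT depends on read order, here is a
small sketch. It is not Drill's actual reader: the row-group contents and the
helper name `first_n` are made up for the example, and the read order is
passed in explicitly to stand in for whatever order the engine happens to use.

```python
# Hypothetical data: two row groups, each holding a few record ids.
row_groups = [[1, 2, 3, 4], [5, 6, 7, 8]]


def first_n(groups, read_order, n, sort=False):
    """Return the first n records after reading groups in read_order.

    read_order is a list of indexes into groups; it models an unspecified
    scan order. With sort=True the records are sorted before the cut,
    which mimics ORDER BY ... LIMIT n.
    """
    records = [rec for i in read_order for rec in groups[i]]
    if sort:
        records = sorted(records)
    return records[:n]


# Without sorting, the result follows the read order.
print(first_n(row_groups, [0, 1], 3))  # [1, 2, 3]
print(first_n(row_groups, [1, 0], 3))  # [5, 6, 7]

# With sorting, the result is the same for either read order.
print(first_n(row_groups, [0, 1], 3, sort=True))  # [1, 2, 3]
print(first_n(row_groups, [1, 0], 3, sort=True))  # [1, 2, 3]
```

In SQL terms, the stable variant corresponds to adding an ORDER BY on some
column before the LIMIT.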

--
Jacques Nadeau
CTO and Co-Founder, Dremio

On Fri, Nov 6, 2015 at 3:31 PM, rahul challapalli <
challapallirahul@gmail.com> wrote:

> The parquet-meta command suggests that there is only one row group.
>
> On Fri, Nov 6, 2015 at 3:23 PM, Jacques Nadeau <jacques@dremio.com> wrote:
>
> > How many row groups?
> >
> > --
> > Jacques Nadeau
> > CTO and Co-Founder, Dremio
> >
> > On Fri, Nov 6, 2015 at 3:14 PM, rahul challapalli <
> > challapallirahul@gmail.com> wrote:
> >
> > > Drillers,
> > >
> > > With the new parquet library update, can someone throw some light on the
> > > order in which the records are read from a single parquet file?
> > >
> > > With the older library, when I run the below query on a single parquet
> > > file, I used to get a set of records. Now after the parquet library update,
> > > I am seeing a different set of records. Just wanted to understand what
> > > specifically has changed.
> > >
> > > select * from `file.parquet` limit 5;
> > >
> > > - Rahul
> > >
> >
>
