drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-5323) Provide test tools to create, populate and compare row sets
Date Thu, 06 Apr 2017 00:17:42 GMT

    [ https://issues.apache.org/jira/browse/DRILL-5323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15958086#comment-15958086
] 

ASF GitHub Bot commented on DRILL-5323:
---------------------------------------

Github user paul-rogers commented on a diff in the pull request:

    https://github.com/apache/drill/pull/785#discussion_r110055928
  
    --- Diff: exec/java-exec/src/test/java/org/apache/drill/test/rowSet/RowSetUtilities.java
---
    @@ -0,0 +1,83 @@
    +/*
    + * Licensed to the Apache Software Foundation (ASF) under one
    + * or more contributor license agreements.  See the NOTICE file
    + * distributed with this work for additional information
    + * regarding copyright ownership.  The ASF licenses this file
    + * to you under the Apache License, Version 2.0 (the
    + * "License"); you may not use this file except in compliance
    + * with the License.  You may obtain a copy of the License at
    + *
    + * http://www.apache.org/licenses/LICENSE-2.0
    + *
    + * Unless required by applicable law or agreed to in writing, software
    + * distributed under the License is distributed on an "AS IS" BASIS,
    + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
    + * See the License for the specific language governing permissions and
    + * limitations under the License.
    + */
    +package org.apache.drill.test.rowSet;
    +
    +import org.apache.drill.common.types.TypeProtos.MinorType;
    +import org.apache.drill.exec.record.selection.SelectionVector2;
    +import org.apache.drill.exec.vector.accessor.AccessorUtilities;
    +import org.apache.drill.exec.vector.accessor.ColumnAccessor.ValueType;
    +import org.apache.drill.exec.vector.accessor.ColumnWriter;
    +import org.apache.drill.test.rowSet.RowSet.RowSetWriter;
    +import org.joda.time.Duration;
    +import org.joda.time.Period;
    +
    +public class RowSetUtilities {
    +
    +  private RowSetUtilities() { }
    +
    +  public static void reverse(SelectionVector2 sv2) {
    +    int count = sv2.getCount();
    +    for (int i = 0; i < count / 2; i++) {
    +      char temp = sv2.getIndex(i);
    +      int dest = count - 1 - i;
    +      sv2.setIndex(i, sv2.getIndex(dest));
    +      sv2.setIndex(dest, temp);
    +    }
    +  }
    +
    +  /**
    +   * Set a test data value from an int. Uses the type information of the
    +   * column to handle interval types. Else, uses the value type of the
    +   * accessor. The value set here is purely for testing; the mapping
    +   * from ints to intervals has no real meaning.
    +   *
    +   * @param rowWriter
    +   * @param index
    +   * @param value
    +   */
    +
    +  public static void setFromInt(RowSetWriter rowWriter, int index, int value) {
    +    ColumnWriter writer = rowWriter.column(index);
    +    if (writer.valueType() == ValueType.PERIOD) {
    +      setPeriodFromInt(writer, rowWriter.schema().column(index).getType().getMinorType(),
value);
    +    } else {
    +      AccessorUtilities.setFromInt(writer, value);
    +    }
    +  }
    +
    +  public static void setPeriodFromInt(ColumnWriter writer, MinorType minorType,
    +      int value) {
    +    switch (minorType) {
    +    case INTERVAL:
    +      writer.setPeriod(Duration.millis(value).toPeriod());
    +      break;
    +    case INTERVALYEAR:
    +      writer.setPeriod(Period.years(value / 12).withMonths(value % 12));
    +      break;
    +    case INTERVALDAY:
    +      int sec = value % 60;
    +      value = value / 60;
    +      int min = value % 60;
    +      value = value / 60;
    +      writer.setPeriod(Period.days(value).withMinutes(min).withSeconds(sec));
    --- End diff --
    
    This is a data generator. The int has no real meaning, it is just a convenient way to
populate a field. So, here we just slice off some values to put into each field. Not pretty,
but convenient for testing.


> Provide test tools to create, populate and compare row sets
> -----------------------------------------------------------
>
>                 Key: DRILL-5323
>                 URL: https://issues.apache.org/jira/browse/DRILL-5323
>             Project: Apache Drill
>          Issue Type: Sub-task
>          Components: Tools, Build & Test
>    Affects Versions: 1.11.0
>            Reporter: Paul Rogers
>            Assignee: Paul Rogers
>             Fix For: 1.11.0
>
>
> Operators work with individual row sets. A row set is a collection of records stored
as column vectors. (Drill uses various terms for this concept. A record batch is a row set
with an operator implementation wrapped around it. A vector container is a row set, but with
much functionality left as an exercise for the developer. And so on.)
> To simplify tests, we need a {{TestRowSet}} concept that wraps a {{VectorContainer}}
and provides easy ways to:
> * Define a schema for the row set.
> * Create a set of vectors that implement the schema.
> * Populate the row set with test data via code.
> * Add an SV2 to the row set.
> * Pass the row set to operator components (such as generated code blocks.)
> * Compare the results of the operation with an expected result set.
> * Dispose of the underling direct memory when work is done.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message