Return-Path: X-Original-To: apmail-metamodel-dev-archive@minotaur.apache.org Delivered-To: apmail-metamodel-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 542D01855E for ; Sat, 23 May 2015 17:27:25 +0000 (UTC) Received: (qmail 18593 invoked by uid 500); 23 May 2015 17:27:25 -0000 Delivered-To: apmail-metamodel-dev-archive@metamodel.apache.org Received: (qmail 18563 invoked by uid 500); 23 May 2015 17:27:25 -0000 Mailing-List: contact dev-help@metamodel.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@metamodel.apache.org Delivered-To: mailing list dev@metamodel.apache.org Received: (qmail 18551 invoked by uid 99); 23 May 2015 17:27:24 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 23 May 2015 17:27:24 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id CDDABE0520; Sat, 23 May 2015 17:27:24 +0000 (UTC) From: kaspersorensen To: dev@metamodel.apache.org Reply-To: dev@metamodel.apache.org References: In-Reply-To: Subject: [GitHub] metamodel pull request: Csv Content-Type: text/plain Message-Id: <20150523172724.CDDABE0520@git1-us-west.apache.org> Date: Sat, 23 May 2015 17:27:24 +0000 (UTC) Github user kaspersorensen commented on the pull request: https://github.com/apache/metamodel/pull/17#issuecomment-104928231 Thinking about this some more, I guess the main principle that I am trying to enforce is: _metadata about a column must be consistent with the values you get when you query that column_ In other words, it is currently consistent that ColumnType is always STRING because we always return a String value. I would be very open to a way of working with CSV files where you could specify/override the column type of certain columns. That would actually be a pretty cool feature - very related, but a different way to see it. If that would be the case then you could first use the untouched CsvDataContext to analyze the content of various columns. If you feel confident enough you might then switch the column type from STRING to something else. And when it is then set to eg. INTEGER then our cast method should automatically apply a Integer cast/conversion operation. Same with other types then. Would that maybe be a model that fits us all? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---