Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 7C5B8200CF9 for ; Thu, 27 Jul 2017 02:23:38 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 7AD09169DB8; Thu, 27 Jul 2017 00:23:38 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C0603169DBC for ; Thu, 27 Jul 2017 02:23:37 +0200 (CEST) Received: (qmail 26240 invoked by uid 500); 27 Jul 2017 00:23:36 -0000 Mailing-List: contact dev-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list dev@drill.apache.org Received: (qmail 26218 invoked by uid 99); 27 Jul 2017 00:23:36 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Jul 2017 00:23:36 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 4C154E967F; Thu, 27 Jul 2017 00:23:36 +0000 (UTC) From: paul-rogers To: dev@drill.apache.org Reply-To: dev@drill.apache.org References: In-Reply-To: Subject: [GitHub] drill pull request #886: Update 010-performance-tuning-introduction.md Content-Type: text/plain Message-Id: <20170727002336.4C154E967F@git1-us-west.apache.org> Date: Thu, 27 Jul 2017 00:23:36 +0000 (UTC) archived-at: Thu, 27 Jul 2017 00:23:38 -0000 Github user paul-rogers commented on a diff in the pull request: https://github.com/apache/drill/pull/886#discussion_r129728610 --- Diff: _docs/performance-tuning/010-performance-tuning-introduction.md --- @@ -3,9 +3,9 @@ title: "Performance Tuning Introduction" date: parent: "Performance Tuning" --- -You can apply performance tuning measures to improve how efficiently Drill queries data. To significantly improve performance in Drill, you must have knowledge about the underlying data and data sources, as well as familiarity with how Drill executes queries. +You can change system options in Drill to improve the query performance. Before you improve performance in Drill, you must choose a layout of the data and the choose an appropriate file format specific to your use case. For example, for an analytic workload operating on historical time series data, then choosing Parquet as the file format and a partitioning scheme that uses time as a partitionining dimension would be a recommended approach. In the case you are directly querying data data sources, you need to have an understanding of the data source itself. Some familiarity with how Drill executes queries can also help. -You can analyze query plans and profiles to identify the source of performance issues in Drill. Once you have isolated the source of an issue, you can apply the following tuning techniques to improve query performance: +You can analyze query plans and profiles to identify performance bottlenecks in Drill. Once you identified issue, here are a couple of best practices to get you started: --- End diff -- "Once you identified issue" --> "Once you have identified an issue" Actually, the original wording flows better... --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---