From users-return-10820-archive-asf-public=cust-asf.ponee.io@pdfbox.apache.org Mon Feb 19 20:00:32 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 1F6F8180607 for ; Mon, 19 Feb 2018 20:00:31 +0100 (CET) Received: (qmail 35639 invoked by uid 500); 19 Feb 2018 19:00:30 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 35627 invoked by uid 99); 19 Feb 2018 19:00:30 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Feb 2018 19:00:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id C92D4C0048 for ; Mon, 19 Feb 2018 19:00:29 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.97 X-Spam-Level: ** X-Spam-Status: No, score=2.97 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_BL_SPAMCOP_NET=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, T_RP_MATCHES_RCVD=-0.01] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id dG6DfT8c9rfH for ; Mon, 19 Feb 2018 19:00:28 +0000 (UTC) Received: from mailout02.t-online.de (mailout02.t-online.de [194.25.134.17]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 402F05F340 for ; Mon, 19 Feb 2018 19:00:28 +0000 (UTC) Received: from fwd32.aul.t-online.de (fwd32.aul.t-online.de [172.20.26.144]) by mailout02.t-online.de (Postfix) with SMTP id D620941B4721 for ; Mon, 19 Feb 2018 20:00:20 +0100 (CET) Received: from [192.168.2.108] (V+UQOGZvQhxb90Hdb8XR6ghJr4M1kqrQZskfi9p908vvzW9bkJSWYnwDOSiLMu5ZLx@[217.231.142.100]) by fwd32.t-online.de with (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384 encrypted) esmtp id 1enqfh-1PkngG0; Mon, 19 Feb 2018 20:00:17 +0100 Subject: Re: background color To: users@pdfbox.apache.org References: From: Tilman Hausherr Message-ID: <11847897-4ea3-4422-5194-d47eeed60990@t-online.de> Date: Mon, 19 Feb 2018 20:00:14 +0100 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US X-ID: V+UQOGZvQhxb90Hdb8XR6ghJr4M1kqrQZskfi9p908vvzW9bkJSWYnwDOSiLMu5ZLx X-TOI-MSGID: 21192a9a-52a5-4819-a8f1-200431c31b18 Am 19.02.2018 um 10:33 schrieb Divya Bharatha: > Hi, > > > I am trying to find the tables and background color in pdf . Can > you please help me ASAP. Unless it is a tagged PDF there is no simple way to find out. PDF is not HTML. It is a graphic format. Open a file with PDFDebugger to see what a PDF is made of. You can try "tabula java" this can extract stuff from tables, but this is heuristics, so it may or may not always work. Tilman --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org For additional commands, e-mail: users-help@pdfbox.apache.org