Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id CC653200C84 for ; Mon, 29 May 2017 09:17:42 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id CAFF1160BCE; Mon, 29 May 2017 07:17:42 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 198A7160BC2 for ; Mon, 29 May 2017 09:17:41 +0200 (CEST) Received: (qmail 89080 invoked by uid 500); 29 May 2017 07:17:36 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Delivered-To: moderator for users@pdfbox.apache.org Received: (qmail 62805 invoked by uid 99); 29 May 2017 06:56:58 -0000 X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.129 X-Spam-Level: ** X-Spam-Status: No, score=2.129 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=tbNu2gUl0iNw9d4+ZmbQQujnLVWHCHc1Zhvu9ZO+usc=; b=T6Lbcpfzn5rcnh1BX+xh3hbPwls/EgzuvUXqSwdBpwSlMEvJCrJ2IySW8luTSZjJxh rQTROd+C+FnufCN+QlaPc0ySgetwKNSLNlNJBUtMg2NSCW5dXLTx7xCnozRvy2yeiFug Z3Nx8zH7n3D5H4fq3ZLfmnj+Y5Ztz4Qs6BAYxtyxdmN9yUdZE2NIss14jvf5fEZaoXmx ulNEnSt5CeYmqtMYQB9DXKpkB2Aegzq80zN+hzlX91zPa5cgvklUgzQow8Ruq7gnxwQQ vnBfvbg2/eNxVaooebSprAxMqA4GwDGU58MsH0B9fDqSUK/3NH8LyZXPBw41zA+acHFw EJ2w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=tbNu2gUl0iNw9d4+ZmbQQujnLVWHCHc1Zhvu9ZO+usc=; b=Dku+fNYoupnbA0RBhHPXja2AO14XhxRWIICuGlfqqqQpXoNg6Bq+Aes4w69v3wj6UO CyfSX6fJ+tinPo+wi+0CQs6S+yErvUJnm5tUT0W2mA6A0YTS2K7JTCH1w0BZsfck152r s3g2bbwaHk0/9hzBABLB4vUVsbbfCR4EsrDp2VjK2mTGpf04Zb0iNV8dZkoxDUsteRiQ WRywVxSH91iKha+UJA2ZuDhAhnT4MkxggnXPaLPLmTPIwIVKDXP05rzY0rRHLjzJQadj Aobsz8TqNfhFaW+RVhhhATvYBXCxBm6BhxXvPwldnAGcyzCb33EZac//t8VOFUE5OuLT /+fQ== X-Gm-Message-State: AODbwcB02O2lX5/KrXUGKxRP23xTrunz6aOItG6dCpWali2COWJYSBf6 L1JgmgyJxGSMJ0hIj4wUDI7QN4XW84Iv X-Received: by 10.25.193.145 with SMTP id r139mr4090191lff.111.1496041008992; Sun, 28 May 2017 23:56:48 -0700 (PDT) MIME-Version: 1.0 From: Kunal Kashyap Date: Mon, 29 May 2017 12:26:48 +0530 Message-ID: Subject: Issues regarding PDFBOX To: users@pdfbox.apache.org, dev@pdfbox.apache.org Content-Type: multipart/mixed; boundary="94eb2c1a0878e96fd80550a43062" archived-at: Mon, 29 May 2017 07:17:43 -0000 --94eb2c1a0878e96fd80550a43062 Content-Type: multipart/alternative; boundary="94eb2c1a0878e96fd30550a43060" --94eb2c1a0878e96fd30550a43060 Content-Type: text/plain; charset="UTF-8" Hi All, I am trying to read text data from a pdf file using PdfBox API. So ,I want to skip all the charts data and images in the output .txt file . Can anyone help me regarding this. Also I want to extract data in proper alignment. PFA is the sample pdf file and sample .txt file(this is my desired output file) --94eb2c1a0878e96fd30550a43060 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi All,
I am trying to read text data from a pdf file = using PdfBox API. So ,I want to skip all the charts data and images in the = output .txt file . Can anyone help me regarding this. Also I want to extrac= t data in proper alignment.
PFA is the sample pdf file and sample= .txt file(this is my desired output file)
--94eb2c1a0878e96fd30550a43060-- --94eb2c1a0878e96fd80550a43062 Content-Type: text/plain; charset="US-ASCII"; name="SamplePDF.txt" Content-Disposition: attachment; filename="SamplePDF.txt" Content-Transfer-Encoding: base64 X-Attachment-Id: f_j39s63d11 S3VuYWwgS2FzaHlhcA0KTXkgbmFtZSBpcyBLdW5hbCBLYXNoeWFwLg0KSSB3b3JrIGluIElUIGlu ZHVzdHJ5Lg0KSSBhbSBmcm9tIEluZGlhLg0K --94eb2c1a0878e96fd80550a43062 Content-Type: text/plain; charset=us-ascii --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org For additional commands, e-mail: users-help@pdfbox.apache.org --94eb2c1a0878e96fd80550a43062--