Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id D6B9E200BAB for ; Fri, 7 Oct 2016 17:51:47 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id D5A3E160AC6; Fri, 7 Oct 2016 15:51:47 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 320C7160AE9 for ; Fri, 7 Oct 2016 17:51:47 +0200 (CEST) Received: (qmail 85495 invoked by uid 500); 7 Oct 2016 15:51:41 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Delivered-To: moderator for users@pdfbox.apache.org Received: (qmail 14871 invoked by uid 99); 7 Oct 2016 15:35:44 -0000 X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=outlook.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=outlook.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=+MPTcZ3h7b8AoXWroosJ3Cqxx0AyBfig8AGGtt0JLDk=; b=Gb104N1ylp/vwNGBEG+PdduIOs/FrOPBB+1+yQtbCk+bpzhBbQu5v8hk9aT/+7sCxh1rvmgIbqeqzNB3kLKFWjbtJNeo1xpgVqI9o3JZ9qNeLsW/PgC7/AlEaKMPX/6/1zEegfBYTnVrPWs0FkhpFQGNpRsLrG+G6hk5Lerrfl4J5WYZxFsEAjr27XeG+ikrjdV7El8OyP6cyXopGXXngyZ/Y7am9A+sYbSTRCzs87hnAJrcKF/DGF4EfpahVRvhj0idD1lGrRUVL6Ex+PKP+YpXDwhSfvhydHRFvkWw2bvSyEpvhOddGyTyPuz8j0BAuBwru3LDRH5GVQoPAstZTg== From: Christopher Begley To: "users@pdfbox.apache.org" Subject: Dump all objects on page with coordinates (images, text, color boxes, lines) Thread-Topic: Dump all objects on page with coordinates (images, text, color boxes, lines) Thread-Index: AQHSIK/CSzrq295oUEm/VJzNWMczWA== Date: Fri, 7 Oct 2016 15:35:33 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: spf=softfail (sender IP is 10.152.92.58) smtp.mailfrom=outlook.com; pdfbox.apache.org; dkim=none (message not signed) header.d=none;pdfbox.apache.org; dmarc=fail action=none header.from=outlook.com; received-spf: SoftFail (protection.outlook.com: domain of transitioning outlook.com discourages use of 10.152.92.58 as permitted sender) x-ms-exchange-messagesentrepresentingtype: 1 x-tmn: [wHc2bIDVO0yuKs6yNj62XZzBLnF5QB+v] x-eopattributedmessage: 0 x-microsoft-exchange-diagnostics: 1;BN3NAM04HT102;6:IvvZK+xZTAdsyFTIWlydwnfclGMoskQ7eI3KCdYFU9Kh1W6WICsYPv7XM6aOSf8Lhnk3dVoDQHog7qzHBU1RGabIOcnVT586AKxQQ69KAphi5hrQydoOMPOOHjiInwKaTdtXqdMu146Z/lL78uSGPtECLXVUBnPsAC1Pk0keBJnvyb05NRJlER32ec2hVQh27DAdL9+epnHNsrMQ5cyhjbRajTGalA4Ot7l5UmhN9tqUPN7pGCvzMBCSXQXLGYG1CtM2KBSWXrEsTEZVxsJxur3SEEcKN1FqQl/6W+XX6MQ=;5:h66brKkyc2qsDLDDiX+vDHMapRbp1hS7X+hIOE+ED4o9u+ihGL8plPVxmBH5zO5ABXvWtOAbzqk0lIDzwnB8pHAGMBBCKdaQ2M7tzHc36sr68+nxqzC7pub+ru5X1KUpNWL3ZkKjD+3iKhYKBkNUTg==;24:w3bEop5f/8vzGdlJRgEhFg1iXswG2acc0P7H31OzjFS8qVSuVvWt6GndfCdFTXJNkozbI0A/1dVV/mOdYs3kqCqA0BjZ3PlNsj/t0T8pQNQ=;7:ZOaaKLZZgKg7Vxqk5RJXqJE3qdRGwT4sHXBgGNprBOLacpGjvrGC25rXwQh1NTByC1PXFXOaWMEaVCnlWJzKyoAg3Vv4DspvL37q/zG/GxF2SjEEeJnBMCxaN1K6bju2LGEWULyP/BJWwq5UJ3+O5+aMwiGtKu4pnzZ7L3KZq3tE/g4KKpOxrrQgIMNHI1ni1oyTdPbg3iU6O9TYLi3HqdC1OI52aqBSvcmb/oXNfnDpgZkZ/V6KyC7YmiBI2xIQRwqucUrZxxsY6AVlyeMJDJvDRx8yf2VfnJAHFFf/EFIG0k+jVE5gv3/XTareLxQ/ x-forefront-antispam-report: EFV:NLI;SFV:NSPM;SFS:(10019020)(98900003);DIR:OUT;SFP:1102;SCL:1;SRVR:BN3NAM04HT102;H:CY4PR08MB2694.namprd08.prod.outlook.com;FPR:;SPF:None;LANG:en; x-ms-office365-filtering-correlation-id: 47bdfd69-ac32-4196-ffd5-08d3eec7941c x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:(1601124038)(1603103081)(1601125047)(1603101256);SRVR:BN3NAM04HT102; x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(432015012)(82015046);SRVR:BN3NAM04HT102;BCL:0;PCL:0;RULEID:;SRVR:BN3NAM04HT102; x-forefront-prvs: 0088C92887 spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: multipart/alternative; boundary="_000_CY4PR08MB2694E08B0F0676A6389F042B95C60CY4PR08MB2694namp_" MIME-Version: 1.0 X-OriginatorOrg: outlook.com X-MS-Exchange-CrossTenant-originalarrivaltime: 07 Oct 2016 15:35:33.8161 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Internet X-MS-Exchange-CrossTenant-id: 84df9e7f-e9f6-40af-b435-aaaaaaaaaaaa X-MS-Exchange-Transport-CrossTenantHeadersStamped: BN3NAM04HT102 X-OriginalArrivalTime: 07 Oct 2016 15:35:35.0777 (UTC) FILETIME=[72B20D10:01D220B0] archived-at: Fri, 07 Oct 2016 15:51:48 -0000 --_000_CY4PR08MB2694E08B0F0676A6389F042B95C60CY4PR08MB2694namp_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hello All! New to PDFBox. My task to to basically map ALL elements on a page of a pdf = document. This includes text, color boxes, highlights, underlines, lines, c= urves, images, etc. Does there exist a way to dump all objects on a page and then retrieve info= rmation about each object? (Specifically, coordinates that can then be mapp= ed to page coordinates in another file format). From my limited perusal of the documentation, I don't see any obvious/intui= tive way to do this. Can someone point me the right direction on how to app= roach this problem? Thanks in advance, --_000_CY4PR08MB2694E08B0F0676A6389F042B95C60CY4PR08MB2694namp_--