Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E5884200CBB for ; Tue, 4 Jul 2017 10:52:01 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E40D9160BEF; Tue, 4 Jul 2017 08:52:01 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 3691E160BE1 for ; Tue, 4 Jul 2017 10:52:01 +0200 (CEST) Received: (qmail 24371 invoked by uid 500); 4 Jul 2017 08:52:00 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 24358 invoked by uid 99); 4 Jul 2017 08:52:00 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Jul 2017 08:52:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 8CB88180354 for ; Tue, 4 Jul 2017 08:51:59 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.388 X-Spam-Level: ** X-Spam-Status: No, score=2.388 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, REPTO_QUOTE_YAHOO=0.49, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=yahoo.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id aDfLWgTfLBNM for ; Tue, 4 Jul 2017 08:51:58 +0000 (UTC) Received: from sonic318-49.consmr.mail.gq1.yahoo.com (sonic318-49.consmr.mail.gq1.yahoo.com [98.137.70.175]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id CDD835FB40 for ; Tue, 4 Jul 2017 08:51:57 +0000 (UTC) X-YMail-OSG: h5dasc0VM1l5CC6.pbmqgmxBA7hCA19S8XzX.ZGMgxmwUV1T9Ee8wWsMOUUD3EJ TVTtRkvkUkyYF49TvQ606Tnjk8rvA4xEG5IaS3ZdnN2PP89DoXszfbPKbnk22rbQZNs8ndPIxq3u O4lsGmHVhO9mzrL9k4jIvDPSFcrKDWn4mNGkSdSzu6H3vHimO7lhh1cEGYIRSY_XJasx863xQQRI 0IVqnsaU8qbIUPYWX6Be3nzwo6vdN62TB.qOEN2074C42DFQqLWsYBgwwdxpTbI12vqmDRljvN00 TpoABL_..bMBgVFfRcOqJ3UaYP_3wJCd2AThFqPc4k6Mf7FhQjV5LmENWerbyh59MuwKnesKqjwN BtsLO8hZuaHmDBAuomGn7d4sOSxXb3C3aiNhbf205yk7dS9466hb_qjYDwB6Mv5iPHO2oLdJP3yR nHZN27qGlr9GxmmDaCmpgogFj0JahsC_bbbAQAfuecwcYQMq3mcRXsVtz59ADYCvjKzFnQz9OKvp xxVKru9aFlD.mzTyw8OY3b20TYHvo0MV4F7NccLKh7yhALOVk Received: from sonic.gate.mail.ne1.yahoo.com by sonic318.consmr.mail.gq1.yahoo.com with HTTP; Tue, 4 Jul 2017 08:51:49 +0000 Date: Tue, 4 Jul 2017 08:47:47 +0000 (UTC) From: lalit gupta Reply-To: "lalitlkg@yahoo.com" To: "gilad.denneboom@gmail.com" , "users@pdfbox.apache.org" Cc: "dev@pdfbox.apache.org" Message-ID: <1340309378.2226934.1499158067407@mail.yahoo.com> In-Reply-To: References: <906217527.3856857.1499146787659.ref@mail.yahoo.com> <906217527.3856857.1499146787659@mail.yahoo.com> Subject: Re: Split PDF help required MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_2226933_193738421.1499158067404" X-Mailer: WebService/1.1.9978 YahooMailAndroidMobile YMobile/1.0 (com.yahoo.mobile.client.android.mail/5.17.2; Android/6.0; MPD24.65-18; lux_uds; motorola; XT1562; 5.16; 1776x1080;) archived-at: Tue, 04 Jul 2017 08:52:02 -0000 ------=_Part_2226933_193738421.1499158067404 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi , Can you please send some demo or code if possible. I tried something around= but couldn't help.Flow will be like=C2=A0Read source PDFFind key or header= info in source PDFSplit PDF=C2=A0And find occurrence of same key word and = split it . Sent from Yahoo Mail on Android=20 =20 On Tue, Jul 4, 2017 at 13:43, Gilad Denneboom = wrote: You can use PDFTextStripper to extract the text of each page, and = if you find the word you're looking for within that text and then use the S= plitter utility to extract the desired pages. On Tue, Jul 4, 2017 at 7:39 AM, lalit gupta wr= ote: Hi Team,=C2=A0I need a help while splitting PDF .=C2=A0Here I want to split= PDF says with 50 pages PDF into multiple PDFs.Logic should be something li= ke I need to find a keywords into a PDF page and need to split PDF from tha= t location.Eg. So 50 pages PDF can be splited into multiple PDFs depends on= key words.So if same key word found on 10 times then out put will be 10 PD= F from 50 PDF.And each PDF will represent one transaction. Thanks . Sent from Yahoo Mail on Android =20 ------=_Part_2226933_193738421.1499158067404--