Return-Path: Delivered-To: apmail-jakarta-poi-user-archive@www.apache.org Received: (qmail 42498 invoked from network); 15 Jan 2006 23:19:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 15 Jan 2006 23:19:15 -0000 Received: (qmail 94268 invoked by uid 500); 15 Jan 2006 23:19:13 -0000 Delivered-To: apmail-jakarta-poi-user-archive@jakarta.apache.org Received: (qmail 94249 invoked by uid 500); 15 Jan 2006 23:19:13 -0000 Mailing-List: contact poi-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Help: List-Post: List-Id: "POI Users List" Reply-To: "POI Users List" Delivered-To: mailing list poi-user@jakarta.apache.org Received: (qmail 94238 invoked by uid 99); 15 Jan 2006 23:19:12 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Jan 2006 15:19:12 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [203.217.22.128] (HELO file1.syd.nuix.com.au) (203.217.22.128) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Jan 2006 15:19:11 -0800 Received: from [192.168.222.102] (demo1.syd.nuix.com.au [192.168.222.102]) by file1.syd.nuix.com.au (Postfix) with ESMTP id 6012EB7BEF for ; Mon, 16 Jan 2006 10:18:50 +1100 (EST) Message-ID: <43CADAA1.8030902@nuix.com.au> Date: Mon, 16 Jan 2006 10:28:33 +1100 From: Daniel Noll Organization: NUIX Pty Limited User-Agent: Thunderbird 1.5 (Windows/20051201) MIME-Version: 1.0 To: POI Users List Subject: Re: Image extraction References: <43C5E9AC.6090905@nuix.com.au> <43C6E42D.5060000@nuix.com.au> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Nick Burch wrote: > On Fri, 13 Jan 2006, Daniel Noll wrote: >> Actually, how long do you think it would take? We only really need >> to get at the binary data for the embedded image files (e.g. source >> PNGs), being able to render the vector drawings would be a bonus, but >> not necessarily a requirement at this point. > > I'm not sure, as I haven't looked that closely at how they're stored. > > If you fancied adding two files to bugzilla: a powerpoing document > containing a PNG (single page, just the PNG and some text), and the > original PNG, then I can look and see if powerpoint actually stores > the data as-is. > > If the PNG is stored in the PPT file as-is, I could probably knock up > something for you in an hour or so over the weekend. If it isn't > stored as-is, it'll take much longer (since I'd have to figure out how > to turn what's stored into something useful) Bug created: http://issues.apache.org/bugzilla/show_bug.cgi?id=38283 I had to set the component to "POI Overall" as there was no component yet for HSLF. In the meantime I'll probably be conducting my own tests to see how the same image is stored in Word and Excel, unless someone responsible for either of those two responds before I finish. ;-) Daniel -- Daniel Noll Nuix Australia Pty Ltd Suite 79, 89 Jones St, Ultimo NSW 2007, Australia Phone: (02) 9280 0699 Fax: (02) 9212 6902 This message is intended only for the named recipient. If you are not the intended recipient you are notified that disclosing, copying, distributing or taking any action in reliance on the contents of this message or attachment is strictly prohibited. --------------------------------------------------------------------- To unsubscribe, e-mail: poi-user-unsubscribe@jakarta.apache.org Mailing List: http://jakarta.apache.org/site/mail2.html#poi The Apache Jakarta Poi Project: http://jakarta.apache.org/poi/