Return-Path: X-Original-To: apmail-poi-user-archive@www.apache.org Delivered-To: apmail-poi-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 44B50106BD for ; Mon, 10 Jun 2013 20:12:53 +0000 (UTC) Received: (qmail 77482 invoked by uid 500); 10 Jun 2013 20:12:52 -0000 Delivered-To: apmail-poi-user-archive@poi.apache.org Received: (qmail 77370 invoked by uid 500); 10 Jun 2013 20:12:52 -0000 Mailing-List: contact user-help@poi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "POI Users List" Delivered-To: mailing list user@poi.apache.org Received: (qmail 77362 invoked by uid 99); 10 Jun 2013 20:12:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 10 Jun 2013 20:12:51 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of cbamford@mimecast.com designates 195.130.217.112 as permitted sender) Received: from [195.130.217.112] (HELO service-alpha-uk.mimecast.com) (195.130.217.112) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 10 Jun 2013 20:12:44 +0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mimecast.com; s=20130419; t=1370895142; bh=O0NwIMxJTxgDYFW3QGF8ZvBTMwsA7j61PNSlY5kuknI=; h=From:To:Subject:Date:Message-ID:References:In-Reply-To:Content-ID:MIME-Version:Content-Type; b=C3Y2RenzVkO8inK/oS/z0jfOCQ7nn/GY0sj56Fc8niJ182TlMrSeM5PT1cmP2Iw4dhnSHOMTfWSYoevFljwHOzcE2HbuzRKVyrcsD3X60+tEkZf5aScKAbBaIsfPbK+iBbHspBSfYlTUl2YRpcy9PKCT6KYJNV59Osxu+0OPZgs= Received: from remote.mimecast.com (146.101.202.133 [146.101.202.133]) (Using TLS) by uk-sl-a.uk.mimecast.lan; Mon, 10 Jun 2013 21:12:21 +0100 Received: from MC-LON-EXCH03.mcsltd.internal ([fe80::3879:e7a7:5e3d:3699]) by MC-LON-EXCH03.mcsltd.internal ([fe80::3879:e7a7:5e3d:3699%15]) with mapi id 14.02.0342.003; Mon, 10 Jun 2013 21:12:20 +0100 From: Chris Bamford To: POI Users List Subject: Re: Extracting embedded files from HWPF docs Thread-Topic: Extracting embedded files from HWPF docs Thread-Index: AQHOY3sDjf13rPt9P06YzSCYX9djBJkqKbqAgAADyICABGU+gIAAP6OAgAANeICAAHYGAA== Date: Mon, 10 Jun 2013 20:12:19 +0000 Message-ID: <5586FE7C-0341-40F5-A632-8A2A612176BF@mimecast.com> References: <1363741413002-5712398.post@n5.nabble.com> <281B2E19-403E-4A2E-AC9B-E8508C8D30F5@mimecast.com> <5099E059-37D9-4220-9007-29C6657D17B5@mimecast.com> In-Reply-To: Accept-Language: en-GB, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [80.47.149.97] Content-ID: MIME-Version: 1.0 X-MC-Unique: e7214afe-bf5a-4824-afb7-a76a4f0660fa-1 Content-Type: multipart/alternative; boundary="MCBoundary=_11306102112210191" X-Virus-Checked: Checked by ClamAV on apache.org --MCBoundary=_11306102112210191 Content-Type: text/plain; charset=WINDOWS-1252 Content-Transfer-Encoding: quoted-printable Hi Nick, On 10 Jun 2013, at 14:09, Nick Burch wrote: >=20 >=20 >> Now POIFSLister shows the ObjectPool and the item in it: >>=20 >> Root Entry - >> SummaryInformation <(0x05)SummaryInformation> [412 / 0x19c] >> DocumentSummaryInformation <(0x05)DocumentSummaryInformation> [280 / 0x1= 18] >> WordDocument [4142 / 0x102e] >> 1Table [2087 / 0x827] >> ObjectPool - >> _1432368106 - >> CompObj <(0x01)CompObj> [76 / 0x4c] >> ObjInfo <(0x03)ObjInfo> [6 / 0x6] >> Ole10Native <(0x01)Ole10Native> [568849 / 0x8ae11] >> EPRINT <(0x03)EPRINT> [5000 / 0x1388] >> CompObj <(0x01)CompObj> [113 / 0x71] >> Data [4096 / 0x1000] >=20 > Try the Ole10Native - POI has code to handle that. My best guess is your = data is in there >=20 I'm not familiar with the POI code at all, but I found a useful utility cal= led POIFSDump which writes out all the objects to file. I found that if I = strip off the first 944 bytes from the Ole10Native file, I get the original= MP3 :-) These 944 bytes appear to be some sort of header... Questions=20 1) Is this header always 944 in size? 2) Is there a POI header object for it? 3) Is there code which skips this block and accesses the 'file data' direct= ly? Thanks again - Chris =20 > Nick >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscribe@poi.apache.org > For additional commands, e-mail: user-help@poi.apache.org >=20 >=20=0A=0A=0AChris Bamford=0ASenior Developer=0A=0ACityPoint,=20=0AOne Rope= maker Street,=20=0ALondon,=20=0AEC2Y 9AW.=0A=0Amobile +44 7860 405292=0Atel= : +44 (0) 207 847 8700=0Aweb www.mimecast.com=0A=0A=0AThe information conta= ined in this communication from cbamford@mimecast.com is confidential and m= ay be legally privileged. It is intended solely for use by user@poi.apache.= org and others authorized to receive it. If you are not user@poi.apache.org= you are hereby notified that any disclosure, copying, distribution or taki= ng action in reliance of the contents of this information is strictly prohi= bited and may be unlawful.=0A=0A=0AMimecast Ltd. is a company registered in= England and Wales with the company number 4698693 VAT No. GB 123 4197 34= =0ARegistered Office: CityPoint, One Ropemaker Street, Moorgate, London, EC= 2Y 9AW Email Address: info@mimecast.com=0A=0AThis email message has been sc= anned for viruses by Mimecast.=0AMimecast delivers a complete managed email= solution from a single web based platform.=0AFor more information please v= isit http://www.mimecast.com=0A --MCBoundary=_11306102112210191--