Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CC12C17ADE for ; Fri, 30 Jan 2015 09:20:11 +0000 (UTC) Received: (qmail 13052 invoked by uid 500); 30 Jan 2015 09:20:12 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 13030 invoked by uid 500); 30 Jan 2015 09:20:12 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 13017 invoked by uid 99); 30 Jan 2015 09:20:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Jan 2015 09:20:11 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of thomasjjg@gmail.com designates 74.125.82.182 as permitted sender) Received: from [74.125.82.182] (HELO mail-we0-f182.google.com) (74.125.82.182) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Jan 2015 09:20:06 +0000 Received: by mail-we0-f182.google.com with SMTP id l61so25912412wev.13 for ; Fri, 30 Jan 2015 01:19:00 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=IOv47AeNtPqvwU6LqoSWxS9/2PMDIawemRBaK+ozQng=; b=NXoarjvKfoIzWvuv791J8RjHYcr2n7zgaMBJPZluorRfYW0B0VoZQZdsDwSWwxHvL2 AcDve/oEohXoFdcHpS2mB29XbQe3FzYpfefhDHbCw/IsyJQ0aMHcnJQbyqtPuXl7+6Bm PYvYqv020YLQVE02YVM37in55wE7lUY6fSSawLVI9X8PMgg8a1XK9uonQlByhnLT6ZpW uRrhWZzcefrRc5rIsaNkfE/i3uCJLM3DdW8SfhGhaWh9iTMwZig9xqlHK441qw2geGS4 PuzJD0v039tqYkPuc1XMNS9umOeqOyEDQESa3K4QZBiKYgAEBzZM4+8ZTaSt8euub9sk l2UQ== MIME-Version: 1.0 X-Received: by 10.194.172.35 with SMTP id az3mr10037112wjc.43.1422609540173; Fri, 30 Jan 2015 01:19:00 -0800 (PST) Received: by 10.194.127.197 with HTTP; Fri, 30 Jan 2015 01:19:00 -0800 (PST) Date: Fri, 30 Jan 2015 14:49:00 +0530 Message-ID: Subject: Tamil PDF issues From: Thamizh Thomas To: users@pdfbox.apache.org Content-Type: multipart/alternative; boundary=089e0122e8b64bd735050ddb18a0 X-Virus-Checked: Checked by ClamAV on apache.org --089e0122e8b64bd735050ddb18a0 Content-Type: text/plain; charset=UTF-8 Hi All, I have a requirement to read tamil pdf content and persist in db. When I read using pdfbox, the characters are junked and unreadable. Please help me on this to resolve the issue. If you have worked on such contexts, please post me sample code snippet. ----- *Thanks* * Thamizh Thomas A* --089e0122e8b64bd735050ddb18a0--