pdfbox-dev mailing list archives

From Andreas Lehmkühler (JIRA) <j...@apache.org>
Subject [jira] [Commented] (PDFBOX-1362) Slovakian characters
Date Mon, 05 Nov 2012 09:15:13 GMT

    [ https://issues.apache.org/jira/browse/PDFBOX-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490524#comment-13490524 ]

Andreas Lehmkühler commented on PDFBOX-1362:
--------------------------------------------

The most recent version is 1.7.1. There are no plans for the next release yet.

Please don't hijack JIRA issues for such questions. Use our mailing lists instead [1].

[1] http://pdfbox.apache.org/mail-lists.html
                
> Slovakian characters
> --------------------
>
>                 Key: PDFBOX-1362
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1362
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.7.0
>         Environment: Windows XP, Java 1.6.0_33
>            Reporter: Christian Czech
>            Assignee: Andreas Lehmkühler
>             Fix For: 1.8.0
>
>         Attachments: PDFBOX-1362.patch, test_7_2_test.pdf
>
>
> Hello,
> I have a PDF document with Slovakian characters:
> Hlavní administrátor
> My code:
> PDDocument document = null;
> document = PDDocument.load(pdfFile, true);
> PDFTextStripper stripper = null;
> stripper = new PDFTextStripper("ISO-8859-2");
> stripper.getText(document);
> I always get this result: Hlavn\? administr\ ?tor 
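
When narrowing down a report like the one quoted above, it can help to first rule out the target encoding itself. The following is only a minimal sketch using standard JDK classes, not PDFBox code; the class name EncodingCheck and its two helper methods are hypothetical names chosen for illustration. It checks that the expected string can be represented in ISO-8859-2 and survives an encode/decode round trip, which suggests that the encoding name passed to the stripper is not, on its own, what produces the "?" characters.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;

// Hypothetical helper class for illustration; not part of PDFBox.
public class EncodingCheck {

    // Returns true if every character of text can be encoded in the named charset.
    static boolean representable(String text, String charsetName) {
        CharsetEncoder encoder = Charset.forName(charsetName).newEncoder();
        return encoder.canEncode(text);
    }

    // Encodes text to bytes in the named charset and decodes it back.
    static String roundTrip(String text, String charsetName) {
        Charset charset = Charset.forName(charsetName);
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        // "Hlavní administrátor", written with Unicode escapes so the
        // source file's own encoding cannot affect the check.
        String expected = "Hlavn\u00ed administr\u00e1tor";

        System.out.println("Representable in ISO-8859-2: "
                + representable(expected, "ISO-8859-2"));
        System.out.println("Round trip preserves text:   "
                + roundTrip(expected, "ISO-8859-2").equals(expected));
        System.out.println("Representable in US-ASCII:   "
                + representable(expected, "US-ASCII"));
    }
}
```

If the first two checks succeed, the remaining suspects are the text extraction step and whatever charset is used to write or display the extracted text.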

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
