pdfbox-dev mailing list archives

From "Andreas Lehmkühler" <andr...@lehmi.de>
Subject Re: [jira] Commented: (PDFBOX-227) ArrayIndexOutOfBoundsException:4
Date Thu, 05 Feb 2009 08:02:01 GMT
Hi Lars,

thanks for your comment. Could you provide us with an example document that causes
this exception?
If so, please attach it to the JIRA issue.

TIA,
Andreas

----- Original Message -----

Subject: [jira] Commented: (PDFBOX-227) ArrayIndexOutOfBoundsException:4
Sent: Thu, 05 Feb 2009
From: Lars Torunski (JIRA)<jira@apache.org>

> 
>     [ https://issues.apache.org/jira/browse/PDFBOX-227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12670654#action_12670654 ]
> 
> Lars Torunski commented on PDFBOX-227:
> --------------------------------------
> 
> I got the same problem:
> 
> java.lang.ArrayIndexOutOfBoundsException: 4
> 	at org.fontbox.cmap.CMapParser.parseNextToken(CMapParser.java:294)
> 	at org.fontbox.cmap.CMapParser.parse(CMapParser.java:103)
> 	at org.pdfbox.pdmodel.font.PDFont.parseCmap(PDFont.java:535)
> 	at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:387)
> 	at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:325)
> 	at org.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:80)
> 	at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
> 	at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
> 	at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
> 	at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
> 	at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
> 	at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
> 
> Also, no special security is applied, and the PDF opens fine.
> 
> > ArrayIndexOutOfBoundsException:4
> > --------------------------------
> >
> >                 Key: PDFBOX-227
> >                 URL: https://issues.apache.org/jira/browse/PDFBOX-227
> >             Project: PDFBox
> >          Issue Type: Bug
> >          Components: Text extraction
> >            Priority: Minor
> >
> > [imported from SourceForge]
> >
> > http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1610268
> > Originally submitted by fotb on 2006-12-06 09:04.
> > Does anyone know if there has been any resolution to the
> > ArrayIndexOutOfBoundsException:4 problem?
> > I have extracted text from 300 PDFs using 0.7.3. All extractions were
> > successful except 3. I received this error message when trying to extract
> > text from these 3 PDFs (java.lang.ArrayIndexOutOfBoundsException:4). I am
> > able to open the PDFs fine and they have no special security applied to
> > them. Any ideas as to why PDFBox 0.7.3 is hiccuping while trying to
> > process these files? I am not able to send the PDF over the internet
> > because it is government property.
> > [comment on SourceForge]
> > Originally sent by fotb.
> > Logged In: YES 
> > user_id=1662347
> > Originator: YES
> > I loaded PDFBox-0.7.2 and the problem went away. The 3 PDFs that were
> > raising errors with PDFBox-0.7.3 are now being successfully processed, with
> > text being extracted from them. Something in PDFBox-0.7.3 is causing the
> > problem. If anyone else is hitting the ArrayIndexOutOfBoundsException, I
> > would suggest loading PDFBox-0.7.2 to see if you still get the error.
> > Thanks, Ben, for your time. Be well.
> 
> -- 
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
> 
> 

--- End of Original Message ---
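
When only a few files out of a large batch fail, as in the report quoted above (3 of 300), it can help to run the batch so that one failure does not stop the rest and the failing paths are collected for a bug report. What follows is only a rough sketch of that idea, not PDFBox code: the TextExtractor interface and the FailingPdfFinder class are hypothetical names, and the simulated extractor in main stands in for whatever call actually performs the extraction (for example, a wrapper around PDFTextStripper).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class FailingPdfFinder {

    /** Hypothetical stand-in for the real extraction call; not part of PDFBox. */
    public interface TextExtractor {
        String extract(String path) throws Exception;
    }

    /** Runs the extractor on every path and returns the paths for which it threw. */
    public static List<String> findFailures(List<String> paths, TextExtractor extractor) {
        List<String> failures = new ArrayList<>();
        for (String path : paths) {
            try {
                extractor.extract(path);
            } catch (Exception e) {
                // ArrayIndexOutOfBoundsException is a RuntimeException, so it lands here too.
                System.err.println(path + " failed: " + e);
                failures.add(path);
            }
        }
        return failures;
    }

    public static void main(String[] args) {
        // Simulated extractor (assumption for illustration only): any path
        // containing "bad" throws the same exception type as in the report.
        TextExtractor fake = path -> {
            if (path.contains("bad")) {
                throw new ArrayIndexOutOfBoundsException(4);
            }
            return "text of " + path;
        };
        List<String> failures = findFailures(Arrays.asList("a.pdf", "bad1.pdf", "b.pdf"), fake);
        System.out.println("Failing documents: " + failures);
    }
}
```

The collected paths identify which documents to attach to the issue, provided they can be shared.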

