pdfbox-users mailing list archives

From Leleu Eric <eric.leleu....@gmail.com>
Subject Re: Bug Report: java.lang.IllegalArgumentException from ExtractText
Date Wed, 24 Oct 2012 19:50:16 GMT
Hi,

I tried your command with the given PDF, and the extraction succeeded in
my environment.
I'm running Fedora with OpenJDK (version 1.6.0_22, 64-bit).

Could you give us some details about the configuration of your environment?

Best regards,
Eric


2012/10/19 Peter Williams <peter.williams.97@gmail.com>

> Hi,
>
> Your web page seemed to say that bugs should be reported by emailing this
> address.
>
> Steps to Reproduce
>
> Download GAM-OptimalScaling2.pdf from
> http://www.math.vu.nl/sto/onderwijs/statlearn/GAM-OptimalScaling2.pdf
>
> java -jar pdfbox-app-1.7.1.jar ExtractText -sort GAM-OptimalScaling2.pdf GAM-OptimalScaling2.sorted.txt
> ExtractText failed with the following exception:
> java.lang.IllegalArgumentException: Comparison method violates its general contract!
>         at java.util.TimSort.mergeHi(Unknown Source)
>         at java.util.TimSort.mergeAt(Unknown Source)
>         at java.util.TimSort.mergeCollapse(Unknown Source)
>         at java.util.TimSort.sort(Unknown Source)
>         at java.util.TimSort.sort(Unknown Source)
>         at java.util.Arrays.sort(Unknown Source)
>         at java.util.Collections.sort(Unknown Source)
>         at org.apache.pdfbox.util.PDFTextStripper.writePage(PDFTextStripper.java:558)
>         at org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:449)
>         at org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:372)
>         at org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:328)
>         at org.apache.pdfbox.ExtractText.startExtraction(ExtractText.java:274)
>         at org.apache.pdfbox.ExtractText.main(ExtractText.java:84)
>         at org.apache.pdfbox.PDFBox.main(PDFBox.java:42)
>
>
> ----------------------------------------------
> Peter Williams
> 0488 783 700 / +61 488 783 700
>
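
A note on the trace above: the exception comes from java.util.TimSort, which
became the default behind Collections.sort in Java 7, while Java 6 (as in
OpenJDK 1.6.0_22) used an older merge sort that does not perform this check.
That may be why the failure does not reproduce on every setup. The sketch
below shows one way a comparator can violate the contract: treating values
within a tolerance as equal, which is not transitive. The comparator, the
tolerance, and the class name are hypothetical and for illustration only;
this is not PDFBox code, and the thread does not confirm that this is the
actual cause.

```java
import java.util.Comparator;

public class ComparatorContractSketch {
    // Hypothetical tolerance: values closer than this are treated as equal.
    static final double TOLERANCE = 1.0;

    // Hypothetical "fuzzy" comparator, NOT taken from PDFBox.
    static final Comparator<Double> FUZZY = new Comparator<Double>() {
        @Override
        public int compare(Double a, Double b) {
            if (Math.abs(a - b) < TOLERANCE) {
                return 0; // "close enough" counts as equal
            }
            return a < b ? -1 : 1;
        }
    };

    // True when a == b and b == c according to cmp, yet a != c.
    // The Comparator contract requires equality to be transitive.
    static <T> boolean violatesTransitivity(Comparator<T> cmp, T a, T b, T c) {
        return cmp.compare(a, b) == 0
                && cmp.compare(b, c) == 0
                && cmp.compare(a, c) != 0;
    }

    public static void main(String[] args) {
        // 0.0 ~ 0.6 and 0.6 ~ 1.2, but 0.0 and 1.2 differ by more than TOLERANCE.
        System.out.println(violatesTransitivity(FUZZY, 0.0, 0.6, 1.2));
    }
}
```

If the report is reproduced on Java 7 or later, one thing to try as a
diagnostic is starting the JVM with -Djava.util.Arrays.useLegacyMergeSort=true,
which restores the older sort. If the extraction then succeeds, that points
at the comparator rather than at the PDF itself.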
