pdfbox-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From le...@apache.org
Subject svn commit: r1481538 - /pdfbox/cmssite/trunk/content/userguide/faq.mdtext
Date Sun, 12 May 2013 12:16:47 GMT
Author: lehmi
Date: Sun May 12 12:16:47 2013
New Revision: 1481538

URL: http://svn.apache.org/r1481538
Log:
rearranged headers

Modified:
    pdfbox/cmssite/trunk/content/userguide/faq.mdtext

Modified: pdfbox/cmssite/trunk/content/userguide/faq.mdtext
URL: http://svn.apache.org/viewvc/pdfbox/cmssite/trunk/content/userguide/faq.mdtext?rev=1481538&r1=1481537&r2=1481538&view=diff
==============================================================================
--- pdfbox/cmssite/trunk/content/userguide/faq.mdtext (original)
+++ pdfbox/cmssite/trunk/content/userguide/faq.mdtext Sun May 12 12:16:47 2013
@@ -16,16 +16,16 @@ Notice:    Licensed to the Apache Softwa
            specific language governing permissions and limitations
            under the License.
 
-# FAQ
+## Frequently asked questions
 
-## General Questions
+### General Questions
 
  - [When will the next version of PDFBox be released?](#releaseplan)
  - [I am getting the below Log4J warning message, how do I remove it?](#log4j)
  - [Is PDFBox thread safe?](#threadsafe)
  - [Why do I get a "Warning: You did not close the PDF Document"?](#notclosed)
 
-## Text Extraction
+### Text Extraction
 
  - [How come I am not getting any text from the PDF document?](#notext)
  - [How come I am getting gibberish(G38G43G36G51G5) when extracting text?](#gibberish)
@@ -33,17 +33,17 @@ Notice:    Licensed to the Apache Softwa
  - [Why do I get "You do not have permission to extract text" on some documents?](#permission)
  - [Can't we just extract the text without parsing the whole document or extract text as
it is parsed?](#partially)
 
-# Answers
+## Answers
 
-## General Questions
+### General Questions
 
-### When will the next version of PDFBox be released ### {#releaseplan}
+#### When will the next version of PDFBox be released #### {#releaseplan}
 
 As fixes are made and integrated into the repository these changes are documented in the
 [release notes](http://pdfbox.apache.org/downloads.html). An estimate will be given of when
the next version will be released.
 Of course, this is only an estimate and could change.
 
-### I am getting the below Log4J warning message, how do I remove it? ### {#log4j}
+#### I am getting the below Log4J warning message, how do I remove it? #### {#log4j}
 
 	:::java
 	log4j:WARN No appenders could be found for logger (org.apache.pdfbox.util.ResourceLoader).
@@ -65,12 +65,12 @@ If this is not working for you then you 
 Please see [this](https://sourceforge.net/forum/forum.php?thread_id=1254229&forum_id=267205)
forum thread 
 for more information.
 
-### Is PDFBox thread safe ### {#threadsafe}
+#### Is PDFBox thread safe #### {#threadsafe}
 
 No! Only one thread may access a single document at a time. You can have multiple threads
 each accessing their own PDDocument object.
 
-### Why do I get a "Warning: You did not close the PDF Document"? ### {#notclosed}
+#### Why do I get a "Warning: You did not close the PDF Document"? #### {#notclosed}
 
 You need to call close() on the PDDocument inside the finally block, if you
 don't then the document will not be closed properly.  Also, you must close all
@@ -91,9 +91,9 @@ PDDocument objects; one from the "new PD
            }
         }
 
-## Text Extraction
+### Text Extraction
 
-### How come I am not getting any text from the PDF document? ### {#notext}
+#### How come I am not getting any text from the PDF document? #### {#notext}
 
 Text extraction from a pdf document is a complicated task and there are many factors
 involved that effect the possibility and accuracy of text extraction.  It would be helpful
@@ -104,7 +104,7 @@ should be able to as well and it is a bu
  - It might really be an image instead of text.  Some PDF documents are just images that
have been scanned in.
 You can tell by using the selection tool in Acrobat, if you can't select any text then it
is probably an image.
 
-### How come I am getting gibberish(G38G43G36G51G5) when extracting text? ### {#gibberish}
+#### How come I am getting gibberish(G38G43G36G51G5) when extracting text? #### {#gibberish}
 
 This is because the characters in a PDF document can use a custom encoding
 instead of unicode or ASCII.  When you see gibberish text then it
@@ -112,20 +112,20 @@ probably means that a meaningless intern
 only way to access the text is to use OCR.  This may be a future
 enhancement.
 
-### What does "java.io.IOException: Can't handle font width" mean? ### {#fontwidth}
+#### What does "java.io.IOException: Can't handle font width" mean? #### {#fontwidth}
 
 This probably means that the "Resources" directory is not in your classpath. The
 Resources directory is included in the PDFBox jar so this is only a problem if you
 are building PDFBox yourself and not using the binary.
 
-### Why do I get "You do not have permission to extract text" on some documents? ### {#permission}
+#### Why do I get "You do not have permission to extract text" on some documents? #### {#permission}
 
 PDF documents have certain security permissions that can be applied to them and two 
 passwords associated with them, a user password and a master password. If the "cannot extract
text"
 permission bit is set then you need to decrypt the document with the master password in order
 to extract the text.
 
-## Can't we just extract the text without parsing the whole document or extract text as it
is parsed. ### {#partially}
+#### Can't we just extract the text without parsing the whole document or extract text as
it is parsed. #### {#partially}
 
 Not really, for a couple reasons.
 



Mime
View raw message