poi-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From n...@apache.org
Subject svn commit: r1486433 - in /poi/site/publish: subversion.html text-extraction.html
Date Sun, 26 May 2013 17:02:45 GMT
Author: nick
Date: Sun May 26 17:02:45 2013
New Revision: 1486433

URL: http://svn.apache.org/r1486433
Log:
Update site publish

Modified:
    poi/site/publish/subversion.html
    poi/site/publish/text-extraction.html

Modified: poi/site/publish/subversion.html
URL: http://svn.apache.org/viewvc/poi/site/publish/subversion.html?rev=1486433&r1=1486432&r2=1486433&view=diff
==============================================================================
--- poi/site/publish/subversion.html (original)
+++ poi/site/publish/subversion.html Sun May 26 17:02:45 2013
@@ -255,11 +255,21 @@ if (VERSION > 3) {
       
 <p>
          Most users of the source code probably don't need to have day to 
-         day access to the source code as it changes. For these users we 
-         provide easy to unpack source code from releases via our
+         day access to the source code as it changes. Most users will want
+         to make use of our <a href="download.html">source release</a>
+         packages, which contain the complete source tree for each binary
+         release, suitable for browsing or debugging. These source releases
+         are available from our
          <a href="download.html">download page.</a>
       
 </p>
+      
+<p>
+         The Apache POI sourcecode is also available as source artifacts
+         in the Maven Central repository, which may be helpful for those
+         users who make use of POI and wish to inspect the source (eg when
+         debugging in an IDE).
+      </p>
     
     
 <a name="Access+the+Version+Controlled+Source+Code"></a>
@@ -268,18 +278,18 @@ if (VERSION > 3) {
 </div>
       
 <p>
-         For information on connecting to the ASF Subversion repositories, 
-         see the 
+         For general information on connecting to the ASF Subversion, 
+         repositories, see the 
          <a href="http://www.apache.org/dev/version-control.html">version control page.</a>
       
 </p>
       
 <p>Subversion is an open-source version control system. It has been contributed to
the Apache Software Foundation and is
-	now available <a href="http://incubator.apache.org/projects/subversion.html">here</a>.
+	now available <a href="http://subversion.apache.org/">here</a>.
       </p>
       
 <p>
-	The root url of the ASF Subversion repository is 
+       The root url of the ASF Subversion repository is 
        <a href="http://svn.apache.org/repos/asf/">http://svn.apache.org/repos/asf/</a>
        for non-committers and 
        <a href="https://svn.apache.org/repos/asf/">https://svn.apache.org/repos/asf/</a>

@@ -324,11 +334,32 @@ if (VERSION > 3) {
 </div>
       
 <p>
-        Git read-only access to POI sources is now available. 
-        Please see the <a href="http://git.apache.org/">Git at Apache</a> page
for details. 
-        Git Clone URL: <a href="git://git.apache.org/poi.git">git://git.apache.org/poi.git</a>

-        and Http Clone URL:  <a href="http://git.apache.org/poi.git">http://git.apache.org/poi.git</a>.
+        The master source repository for Apache POI is the Subversion
+        one listed above. To support those users and developers who prefer
+        to use the Git tooling, read-only access to the POI source tree is
+        also available via Git. The Git mirrors normally track SVN to 
+        within a few minutes.
+      </p>
+      
+<p>
+        The official read-only Git repository for Apache POI is available
+        from <a href="http://git.apache.org/">git.apache.org/</a> .
+        The Git Clone URL is: <a href="git://git.apache.org/poi.git">git://git.apache.org/poi.git</a>

+        and Http Clone URL: <a href="http://git.apache.org/poi.git">http://git.apache.org/poi.git</a>
.
+         Please see the <a href="http://git.apache.org/">Git at 
+         Apache</a> page for more details on the service.
       </p>
+      
+<p>
+        In addition to the <a href="http://git.apache.org/">git.apache.org/</a>
+        repository, changes are also mirrored in near-realtime to GitHub.
+        The GitHub repository is available at
+        <a href="https://github.com/apache/poi">https://github.com/apache/poi</a>
.
+        Please note that the GitHub repository is read-only, and all 
+        contributions should continue to be sent via Bugzilla for tracking.
+        (Git patches are fine though). Please see the
+        <a href="guidelines.html">contribution guidelines</a> for more 
+        information on getting involved in the project.</p>
     
   
   

Modified: poi/site/publish/text-extraction.html
URL: http://svn.apache.org/viewvc/poi/site/publish/text-extraction.html?rev=1486433&r1=1486432&r2=1486433&view=diff
==============================================================================
--- poi/site/publish/text-extraction.html (original)
+++ poi/site/publish/text-extraction.html Sun May 26 17:02:45 2013
@@ -253,15 +253,25 @@ if (VERSION > 3) {
 <h3>Overview</h3>
 </div>
       
-<p>Apache POI provides text extraction for all the supported file
-       formats. In addition, it provides access to the metadata
-       associated with a given file, such as title and author.</p>
+<p>For a number of years now, Apache POI has provided basic 
+       text extraction for all the project supported file formats. In 
+       addition, as well as the (plain) text, these provides access to 
+       the metadata associated with a given file, such as title and 
+       author.</p>
       
-<p>In addition to providing direct text extraction classes,
-       POI works closely with the 
-       <a href="http://incubator.apache.org/tika/">Apache Tika</a>
-       text extraction library. Users may wish to simply utilise 
-       the functionality provided by Tika.</p>
+<p>For more advanced text extraction needs, including Rich Text
+       extraction (such as formatting and styling), along with XML and
+       HTML output, Apache POI works closely with 
+       <a href="http://tika.apache.org/">Apache Tika</a> to deliver 
+       POI-powered Tika Parsers for all the project supported file formats.</p>
+      
+<p>If you are after turn-key text extraction, including the latest
+       support, styles etc, you are strongly advised to make use of 
+       <a href="http://tika.apache.org/">Apache Tika</a>, which builds 
+       on top of POI to provide Text and Metadata extraction. If you wish
+       to have something very simple and stand-alone, or you wish to make
+       heavy modificiations, then the POI provided text extractors documented
+       below might be a better fit for your needs.</p>
     
 
     
@@ -289,16 +299,21 @@ if (VERSION > 3) {
     
 
     
-<a name="Text+Extractor+Factory+-+POI+3.5+or+later"></a>
+<a name="Text+Extractor+Factory"></a>
 <div class="h3">
-<h3>Text Extractor Factory - POI 3.5 or later</h3>
+<h3>Text Extractor Factory</h3>
 </div>
      
-<p>A new class in POI 3.5, 
-      <em>org.apache.poi.extractor.ExtractorFactory</em> provides a
+<p>As part of the addition of OOXML support in Apache POI 3.5, there
+      is a common class to select the appropriate POI text extractor for 
+      you. <em>org.apache.poi.extractor.ExtractorFactory</em> provides a
       similar function to WorkbookFactory. You simply pass it an
-      InputStream, a file, a POIFSFileSystem or a OOXML Package. It
+      InputStream, a File, a POIFSFileSystem or a OOXML Package. It
       figures out the correct text extractor for you, and returns it.</p>
+     
+<p>For complete detection and text extractor auto-selection, users
+      are strongly encouraged to investigate
+      <a href="http://tika.apache.org/">Apache Tika</a>.</p>
     
 
     



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@poi.apache.org
For additional commands, e-mail: commits-help@poi.apache.org


Mime
View raw message