any23-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ANY23-26) Upgrade dependency to Apache Tika 1.2
Date Sun, 20 Jan 2013 07:30:12 GMT

    [ https://issues.apache.org/jira/browse/ANY23-26?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13558193#comment-13558193 ]

Hudson commented on ANY23-26:
-----------------------------

Integrated in Any23-trunk #448 (See [https://builds.apache.org/job/Any23-trunk/448/])
     ANY23-26 part3 - upgrade of Tika dependency (Revision 1435729)
     ANY23-26 part2 - Remove deprecated tests and plugins classes for spi-extractor improvements (Revision 1435724)
     ANY23-26 part1 - Improvement to spi-extractors (Revision 1435720)

     Result = UNSTABLE
lewismc : 
Files : 
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HCardExtractorTest.java
* /any23/trunk/mime/src/main/resources/org/apache/any23/mime/mimetypes.xml
* /any23/trunk/mime/src/main/resources/org/apache/any23/mime/tika-config.xml
* /any23/trunk/plugins/basic-crawler/pom.xml
* /any23/trunk/plugins/office-scraper/pom.xml
* /any23/trunk/pom.xml
* /any23/trunk/test-resources/src/test/resources/microformats/hcard/19-object-data-data-uri.html

lewismc : 
Files : 
* /any23/trunk/plugins/html-scraper/src/main/java/org/apache/any23/plugin/htmlscraper/HTMLScraperPlugin.java
* /any23/trunk/plugins/html-scraper/src/test/java/org/apache/any23/plugin/htmlscraper/HTMLScraperPluginTest.java
* /any23/trunk/plugins/office-scraper/src/main/java/org/apache/any23/plugin/officescraper/ExcelPlugin.java

lewismc : 
Files : 
* /any23/trunk/api/src/main/java/org/apache/any23/extractor/ExtractorDescription.java
* /any23/trunk/api/src/main/java/org/apache/any23/extractor/ExtractorFactory.java
* /any23/trunk/api/src/main/java/org/apache/any23/extractor/ExtractorRegistry.java
* /any23/trunk/api/src/main/java/org/apache/any23/plugin/Any23PluginManager.java
* /any23/trunk/api/src/main/java/org/apache/any23/plugin/ExtractorPlugin.java
* /any23/trunk/core/src/main/assembly/bin.xml
* /any23/trunk/core/src/main/java/org/apache/any23/cli/ExtractorDocumentation.java
* /any23/trunk/core/src/main/java/org/apache/any23/cli/PluginVerifier.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/ExtractorRegistryImpl.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/SimpleExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/csv/CSVExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/csv/CSVExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/AdrExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/AdrExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/GeoExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/GeoExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HCalendarExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HCalendarExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HCardExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HCardExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HListingExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HListingExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HRecipeExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HRecipeExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HResumeExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HResumeExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HReviewExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HReviewExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HTMLMetaExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HTMLMetaExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HeadLinkExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/HeadLinkExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/ICBMExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/ICBMExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/LicenseExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/LicenseExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/SpeciesExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/SpeciesExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/TitleExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/TitleExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/TurtleHTMLExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/TurtleHTMLExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/XFNExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/html/XFNExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/microdata/MicrodataExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/microdata/MicrodataExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/NQuadsExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/NQuadsExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/NTriplesExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/NTriplesExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/RDFXMLExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/RDFXMLExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/TriXExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/TriXExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/TurtleExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdf/TurtleExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdfa/RDFa11Extractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdfa/RDFa11ExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdfa/RDFaExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/rdfa/RDFaExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/xpath/XPathExtractor.java
* /any23/trunk/core/src/main/java/org/apache/any23/extractor/xpath/XPathExtractorFactory.java
* /any23/trunk/core/src/main/java/org/apache/any23/filter/IgnoreTitlesOfEmptyDocuments.java
* /any23/trunk/core/src/test/java/org/apache/any23/Any23Test.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/csv/CSVExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/example/ExampleExtractor.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/example/ExampleExtractorFactory.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/AdrExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HCalendarExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HCardExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HListingExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HRecipeExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HResumeExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HReviewExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HTMLMetaExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/HeadLinkExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/LicenseExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/RDFMergerTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/SpeciesExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/TitleExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/TurtleHTMLExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/html/XFNExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/microdata/MicrodataExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/rdfa/RDFa11ExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/extractor/rdfa/RDFaExtractorTest.java
* /any23/trunk/core/src/test/java/org/apache/any23/plugin/Any23PluginManagerTest.java
* /any23/trunk/plugins/html-scraper/src/main/java/org/apache/any23/plugin/htmlscraper/HTMLScraperExtractor.java
* /any23/trunk/plugins/html-scraper/src/main/java/org/apache/any23/plugin/htmlscraper/HTMLScraperExtractorFactory.java
* /any23/trunk/plugins/html-scraper/src/test/java/org/apache/any23/plugin/htmlscraper/HTMLScraperExtractorTest.java
* /any23/trunk/plugins/integration-test/pom.xml
* /any23/trunk/plugins/integration-test/src/test/java/org/apache/any23/plugin/PluginIT.java
* /any23/trunk/plugins/integration-test/src/test/resources/log4j.properties
* /any23/trunk/plugins/office-scraper/src/main/java/org/apache/any23/plugin/officescraper/ExcelExtractor.java
* /any23/trunk/plugins/office-scraper/src/main/java/org/apache/any23/plugin/officescraper/ExcelExtractorFactory.java
* /any23/trunk/plugins/office-scraper/src/test/java/org/apache/any23/plugin/officescraper/ExcelExtractorTest.java

                
> Upgrade dependency to Apache Tika 1.2
> -------------------------------------
>
>                 Key: ANY23-26
>                 URL: https://issues.apache.org/jira/browse/ANY23-26
>             Project: Apache Any23
>          Issue Type: Improvement
>    Affects Versions: 0.7.0
>            Reporter: Lewis John McGibbney
>             Fix For: 0.7.1
>
>         Attachments: 14-img-src-data-url.html, 19-object-data-data-uri.html, ANY23-26.patch, org.apache.any23.extractor.html.HCardExtractorTest.txt, spi-extractors.diff, tika-1.2-dependency-tree-compare.txt, tika-12.diff
>
>
> Upgrading to Apache Tika will hopefully provide a wealth of benefits for the project.
> This issue should act as an umbrella issue to track these changes. It would be great to delegate
> as much as possible to Tika if deemed suitable to enhance functionality and to reduce our
> dependencies on external projects.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
