| Michael McCandless (Created) (JIRA) |
[jira] [Created] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> |
Sat, 01 Oct, 10:13 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> |
Sat, 01 Oct, 10:15 |
| Michael McCandless (Created) (JIRA) |
[jira] [Created] (TIKA-736) OpenOffice parser: master footer text isn't extracted |
Sat, 01 Oct, 10:33 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-736) OpenOffice parser: master footer text isn't extracted |
Sat, 01 Oct, 10:35 |
| build...@apache.org |
buildbot failure in ASF Buildbot on tika-trunk |
Sat, 01 Oct, 10:57 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-632) Rtf parsing ignores links |
Sat, 01 Oct, 10:57 |
| build...@apache.org |
buildbot success in ASF Buildbot on tika-trunk |
Sat, 01 Oct, 11:05 |
| Apache Jenkins Server |
Jenkins build became unstable: Tika-trunk » Apache Tika parsers #657 |
Sat, 01 Oct, 11:07 |
| Apache Jenkins Server |
Jenkins build became unstable: Tika-trunk #657 |
Sat, 01 Oct, 11:07 |
| Apache Jenkins Server |
Jenkins build is back to stable : Tika-trunk » Apache Tika parsers #658 |
Sat, 01 Oct, 12:06 |
| Apache Jenkins Server |
Jenkins build is back to stable : Tika-trunk #658 |
Sat, 01 Oct, 12:06 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> |
Sat, 01 Oct, 12:45 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted |
Sat, 01 Oct, 12:47 |
| Michael McCandless |
Re: Jenkins build became unstable: Tika-trunk » Apache Tika parsers #657 |
Sat, 01 Oct, 14:25 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted |
Sat, 01 Oct, 14:43 |
| Nick Burch (Created) (JIRA) |
[jira] [Created] (TIKA-737) Use (Incubating) ODFToolkit to improve ODF file format processing |
Sat, 01 Oct, 14:55 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted |
Sat, 01 Oct, 14:59 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> |
Sat, 01 Oct, 15:31 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-737) Use (Incubating) ODFToolkit to improve ODF file format processing |
Sat, 01 Oct, 15:31 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-720) EBCDIC encoding not detected |
Sat, 01 Oct, 16:26 |
| Mattmann, Chris A (388J) |
[RESULT] [VOTE] Add Any23 to the Apache Incubator |
Sat, 01 Oct, 16:38 |
| Jukka Zitting (Commented) (JIRA) |
[jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> |
Sat, 01 Oct, 18:12 |
| Mattmann, Chris A (388J) |
[HEADS UP] Added Tika ApacheCon NA 2011 news item |
Sat, 01 Oct, 18:27 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-711) Word parser doesn't extract optional hyphen correctly |
Sat, 01 Oct, 20:52 |
| Michael McCandless (Assigned) (JIRA) |
[jira] [Assigned] (TIKA-711) Word parser doesn't extract optional hyphen correctly |
Sun, 02 Oct, 11:00 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-711) Word parser doesn't extract optional hyphen correctly |
Sun, 02 Oct, 11:02 |
| Michael McCandless (Assigned) (JIRA) |
[jira] [Assigned] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 13:17 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 15:04 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 16:20 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 16:26 |
| Robert Muir (Commented) (JIRA) |
[jira] [Commented] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 16:38 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-721) UTF16-LE not detected |
Sun, 02 Oct, 17:50 |
| Robert Muir (Commented) (JIRA) |
[jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files |
Sun, 02 Oct, 20:18 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-717) Comment/annotation is sometimes not extracted |
Mon, 03 Oct, 10:53 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-717) Comment/annotation is sometimes not extracted |
Mon, 03 Oct, 10:56 |
| Michael McCandless (Created) (JIRA) |
[jira] [Created] (TIKA-738) Tika fails to extract text from PDF annotations |
Mon, 03 Oct, 10:56 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-717) Comment/annotation is sometimes not extracted |
Mon, 03 Oct, 10:56 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-738) Tika fails to extract text from PDF annotations |
Mon, 03 Oct, 12:38 |
| Albert Law (Logik) |
Newb: IDE + Maven? |
Mon, 03 Oct, 14:42 |
| Nick Burch |
Re: Newb: IDE + Maven? |
Mon, 03 Oct, 14:46 |
| Jukka Zitting |
Re: Newb: IDE + Maven? |
Mon, 03 Oct, 14:57 |
| Albert Law (Logik) |
Re: Newb: IDE + Maven? |
Mon, 03 Oct, 15:28 |
| Robert Muir (Commented) (JIRA) |
[jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files |
Mon, 03 Oct, 15:44 |
| Ken Krugler |
Re: Newb: IDE + Maven? |
Mon, 03 Oct, 16:03 |
| Robert Muir (Commented) (JIRA) |
[jira] [Commented] (TIKA-722) Arabic PDF doesn't extract correctly |
Mon, 03 Oct, 17:07 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-722) Arabic PDF doesn't extract correctly |
Mon, 03 Oct, 17:15 |
| Jeremy Anderson (Commented) (JIRA) |
[jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:01 |
| Jeremy Anderson (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:01 |
| Jeremy Anderson (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:03 |
| Jeremy Anderson (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:03 |
| Jeremy Anderson (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:05 |
| Jeremy Anderson (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:07 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:19 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Mon, 03 Oct, 18:21 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-711) Word parser doesn't extract optional hyphen correctly |
Mon, 03 Oct, 18:27 |
| John Bartak (Created) (JIRA) |
[jira] [Created] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 19:10 |
| John Bartak (Updated) (JIRA) |
[jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 19:12 |
| John Bartak (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 19:12 |
| John Bartak (Updated) (JIRA) |
[jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 19:14 |
| John Bartak (Updated) (JIRA) |
[jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 19:18 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 20:47 |
| John Bartak (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 21:12 |
| Nick Burch (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 21:37 |
| John Bartak (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 22:52 |
| John Bartak (Updated) (JIRA) |
[jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 23:22 |
| John Bartak (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 23:22 |
| John Bartak (Issue Comment Edited) (JIRA) |
[jira] [Issue Comment Edited] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 23:26 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Mon, 03 Oct, 23:36 |
| Jeremy Anderson (Commented) (JIRA) |
[jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Tue, 04 Oct, 00:52 |
| Erik Hetzner (Created) (JIRA) |
[jira] [Created] (TIKA-740) SAX parser used for HTML |
Tue, 04 Oct, 01:12 |
| Erik Hetzner (Created) (JIRA) |
[jira] [Created] (TIKA-741) Make "Zip bomb" (XML nesting) detection level configurable? |
Tue, 04 Oct, 01:16 |
| Michael McCandless (Created) (JIRA) |
[jira] [Created] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker |
Tue, 04 Oct, 10:32 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker |
Tue, 04 Oct, 10:32 |
| Michael McCandless (Updated) (JIRA) |
[jira] [Updated] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker |
Tue, 04 Oct, 10:36 |
| Michael McCandless (Commented) (JIRA) |
[jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException |
Tue, 04 Oct, 10:48 |
| Mark Kerzner (Commented) (JIRA) |
[jira] [Commented] (TIKA-623) Add support for Outlook PST |
Wed, 05 Oct, 02:15 |
| Nick Burch (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-622) Switch from POIFSFileSystem to NPOIFSFileSystem, for speed and memory improvements |
Wed, 05 Oct, 09:05 |
| Bernhard Berger |
Download-Link to tika-app-0.10.jar doesn't work |
Wed, 05 Oct, 09:06 |
| Michael McCandless (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker |
Wed, 05 Oct, 10:44 |
| Apache Jenkins Server |
Build failed in Jenkins: Tika-trunk » Apache Tika parsers #664 |
Wed, 05 Oct, 11:13 |
| Apache Jenkins Server |
Build failed in Jenkins: Tika-trunk #664 |
Wed, 05 Oct, 11:13 |
| Jukka Zitting |
Re: Build failed in Jenkins: Tika-trunk #664 |
Wed, 05 Oct, 12:09 |
| Michael McCandless |
Re: Build failed in Jenkins: Tika-trunk #664 |
Wed, 05 Oct, 12:29 |
| Jukka Zitting (Created) (JIRA) |
[jira] [Created] (TIKA-743) Upgrade to Apache parent POM version 10 |
Wed, 05 Oct, 13:05 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-743) Upgrade to Apache parent POM version 10 |
Wed, 05 Oct, 13:12 |
| Apache Jenkins Server |
Jenkins build is back to normal : Tika-trunk #665 |
Wed, 05 Oct, 13:12 |
| Apache Jenkins Server |
Jenkins build is back to normal : Tika-trunk » Apache Tika parsers #665 |
Wed, 05 Oct, 13:12 |
| Jukka Zitting |
Re: Download-Link to tika-app-0.10.jar doesn't work |
Wed, 05 Oct, 13:21 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage |
Wed, 05 Oct, 13:52 |
| Jukka Zitting (Updated) (JIRA) |
[jira] [Updated] (TIKA-740) SAX parser used for HTML |
Wed, 05 Oct, 14:02 |
| Jukka Zitting (Updated) (JIRA) |
[jira] [Updated] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict |
Wed, 05 Oct, 15:01 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict |
Wed, 05 Oct, 15:14 |
| Jukka Zitting (Commented) (JIRA) |
[jira] [Commented] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB |
Wed, 05 Oct, 15:17 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-730) WriteOutContentHandler concatenates title tag and body text. |
Wed, 05 Oct, 15:21 |
| Erik Hetzner (Commented) (JIRA) |
[jira] [Commented] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict |
Wed, 05 Oct, 15:45 |
| Jukka Zitting (Updated) (JIRA) |
[jira] [Updated] (TIKA-605) Tika GDAL parser |
Wed, 05 Oct, 16:41 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-699) Automatic checks against backwards-incompatible API changes |
Wed, 05 Oct, 16:59 |
| Chris A. Mattmann (Commented) (JIRA) |
[jira] [Commented] (TIKA-605) Tika GDAL parser |
Wed, 05 Oct, 17:03 |
| Jukka Zitting (Created) (JIRA) |
[jira] [Created] (TIKA-744) Drop support for Java 1.4 |
Wed, 05 Oct, 17:07 |
| Jukka Zitting (Resolved) (JIRA) |
[jira] [Resolved] (TIKA-744) Drop support for Java 1.4 |
Wed, 05 Oct, 17:09 |