tika-dev mailing list archives: October 2011

Site index · List index
Message list1 · 2 · 3 · 4 · Next »Thread · Author · Date
Michael McCandless (Created) (JIRA) [jira] [Created] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> Sat, 01 Oct, 10:13
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> Sat, 01 Oct, 10:15
Michael McCandless (Created) (JIRA) [jira] [Created] (TIKA-736) OpenOffice parser: master footer text isn't extracted Sat, 01 Oct, 10:33
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-736) OpenOffice parser: master footer text isn't extracted Sat, 01 Oct, 10:35
build...@apache.org buildbot failure in ASF Buildbot on tika-trunk Sat, 01 Oct, 10:57
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-632) Rtf parsing ignores links Sat, 01 Oct, 10:57
build...@apache.org buildbot success in ASF Buildbot on tika-trunk Sat, 01 Oct, 11:05
Apache Jenkins Server Jenkins build became unstable: Tika-trunk » Apache Tika parsers #657 Sat, 01 Oct, 11:07
Apache Jenkins Server Jenkins build became unstable: Tika-trunk #657 Sat, 01 Oct, 11:07
Apache Jenkins Server Jenkins build is back to stable : Tika-trunk » Apache Tika parsers #658 Sat, 01 Oct, 12:06
Apache Jenkins Server Jenkins build is back to stable : Tika-trunk #658 Sat, 01 Oct, 12:06
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> Sat, 01 Oct, 12:45
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted Sat, 01 Oct, 12:47
Michael McCandless Re: Jenkins build became unstable: Tika-trunk » Apache Tika parsers #657 Sat, 01 Oct, 14:25
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted Sat, 01 Oct, 14:43
Nick Burch (Created) (JIRA) [jira] [Created] (TIKA-737) Use (Incubating) ODFToolkit to improve ODF file format processing Sat, 01 Oct, 14:55
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-736) OpenOffice parser: master footer text isn't extracted Sat, 01 Oct, 14:59
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> Sat, 01 Oct, 15:31
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-737) Use (Incubating) ODFToolkit to improve ODF file format processing Sat, 01 Oct, 15:31
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-720) EBCDIC encoding not detected Sat, 01 Oct, 16:26
Mattmann, Chris A (388J) [RESULT] [VOTE] Add Any23 to the Apache Incubator Sat, 01 Oct, 16:38
Jukka Zitting (Commented) (JIRA) [jira] [Commented] (TIKA-735) OpenOffice parser: embedded OLE docs are extracted at the end, as extra <html>...</html> Sat, 01 Oct, 18:12
Mattmann, Chris A (388J) [HEADS UP] Added Tika ApacheCon NA 2011 news item Sat, 01 Oct, 18:27
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-711) Word parser doesn't extract optional hyphen correctly Sat, 01 Oct, 20:52
Michael McCandless (Assigned) (JIRA) [jira] [Assigned] (TIKA-711) Word parser doesn't extract optional hyphen correctly Sun, 02 Oct, 11:00
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-711) Word parser doesn't extract optional hyphen correctly Sun, 02 Oct, 11:02
Michael McCandless (Assigned) (JIRA) [jira] [Assigned] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 13:17
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 15:04
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 16:20
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 16:26
Robert Muir (Commented) (JIRA) [jira] [Commented] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 16:38
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-721) UTF16-LE not detected Sun, 02 Oct, 17:50
Robert Muir (Commented) (JIRA) [jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files Sun, 02 Oct, 20:18
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-717) Comment/annotation is sometimes not extracted Mon, 03 Oct, 10:53
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-717) Comment/annotation is sometimes not extracted Mon, 03 Oct, 10:56
Michael McCandless (Created) (JIRA) [jira] [Created] (TIKA-738) Tika fails to extract text from PDF annotations Mon, 03 Oct, 10:56
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-717) Comment/annotation is sometimes not extracted Mon, 03 Oct, 10:56
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-738) Tika fails to extract text from PDF annotations Mon, 03 Oct, 12:38
Albert Law (Logik) Newb: IDE + Maven? Mon, 03 Oct, 14:42
Nick Burch Re: Newb: IDE + Maven? Mon, 03 Oct, 14:46
Jukka Zitting Re: Newb: IDE + Maven? Mon, 03 Oct, 14:57
Albert Law (Logik) Re: Newb: IDE + Maven? Mon, 03 Oct, 15:28
Robert Muir (Commented) (JIRA) [jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files Mon, 03 Oct, 15:44
Ken Krugler Re: Newb: IDE + Maven? Mon, 03 Oct, 16:03
Robert Muir (Commented) (JIRA) [jira] [Commented] (TIKA-722) Arabic PDF doesn't extract correctly Mon, 03 Oct, 17:07
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-722) Arabic PDF doesn't extract correctly Mon, 03 Oct, 17:15
Jeremy Anderson (Commented) (JIRA) [jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:01
Jeremy Anderson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:01
Jeremy Anderson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:03
Jeremy Anderson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:03
Jeremy Anderson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:05
Jeremy Anderson (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:07
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:19
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Mon, 03 Oct, 18:21
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-711) Word parser doesn't extract optional hyphen correctly Mon, 03 Oct, 18:27
John Bartak (Created) (JIRA) [jira] [Created] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 19:10
John Bartak (Updated) (JIRA) [jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 19:12
John Bartak (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 19:12
John Bartak (Updated) (JIRA) [jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 19:14
John Bartak (Updated) (JIRA) [jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 19:18
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 20:47
John Bartak (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 21:12
Nick Burch (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 21:37
John Bartak (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 22:52
John Bartak (Updated) (JIRA) [jira] [Updated] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 23:22
John Bartak (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 23:22
John Bartak (Issue Comment Edited) (JIRA) [jira] [Issue Comment Edited] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 23:26
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Mon, 03 Oct, 23:36
Jeremy Anderson (Commented) (JIRA) [jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Tue, 04 Oct, 00:52
Erik Hetzner (Created) (JIRA) [jira] [Created] (TIKA-740) SAX parser used for HTML Tue, 04 Oct, 01:12
Erik Hetzner (Created) (JIRA) [jira] [Created] (TIKA-741) Make "Zip bomb" (XML nesting) detection level configurable? Tue, 04 Oct, 01:16
Michael McCandless (Created) (JIRA) [jira] [Created] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker Tue, 04 Oct, 10:32
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker Tue, 04 Oct, 10:32
Michael McCandless (Updated) (JIRA) [jira] [Updated] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker Tue, 04 Oct, 10:36
Michael McCandless (Commented) (JIRA) [jira] [Commented] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException Tue, 04 Oct, 10:48
Mark Kerzner (Commented) (JIRA) [jira] [Commented] (TIKA-623) Add support for Outlook PST Wed, 05 Oct, 02:15
Nick Burch (Resolved) (JIRA) [jira] [Resolved] (TIKA-622) Switch from POIFSFileSystem to NPOIFSFileSystem, for speed and memory improvements Wed, 05 Oct, 09:05
Bernhard Berger Download-Link to tika-app-0.10.jar doesn't work Wed, 05 Oct, 09:06
Michael McCandless (Resolved) (JIRA) [jira] [Resolved] (TIKA-742) PDF2XHTML fails to insert <p> nor space around page marker Wed, 05 Oct, 10:44
Apache Jenkins Server Build failed in Jenkins: Tika-trunk » Apache Tika parsers #664 Wed, 05 Oct, 11:13
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #664 Wed, 05 Oct, 11:13
Jukka Zitting Re: Build failed in Jenkins: Tika-trunk #664 Wed, 05 Oct, 12:09
Michael McCandless Re: Build failed in Jenkins: Tika-trunk #664 Wed, 05 Oct, 12:29
Jukka Zitting (Created) (JIRA) [jira] [Created] (TIKA-743) Upgrade to Apache parent POM version 10 Wed, 05 Oct, 13:05
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-743) Upgrade to Apache parent POM version 10 Wed, 05 Oct, 13:12
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk » Apache Tika parsers #665 Wed, 05 Oct, 13:12
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #665 Wed, 05 Oct, 13:12
Jukka Zitting Re: Download-Link to tika-app-0.10.jar doesn't work Wed, 05 Oct, 13:21
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage Wed, 05 Oct, 13:52
Jukka Zitting (Updated) (JIRA) [jira] [Updated] (TIKA-740) SAX parser used for HTML Wed, 05 Oct, 14:02
Jukka Zitting (Updated) (JIRA) [jira] [Updated] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict Wed, 05 Oct, 15:01
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict Wed, 05 Oct, 15:14
Jukka Zitting (Commented) (JIRA) [jira] [Commented] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB Wed, 05 Oct, 15:17
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-730) WriteOutContentHandler concatenates title tag and body text. Wed, 05 Oct, 15:21
Erik Hetzner (Commented) (JIRA) [jira] [Commented] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict Wed, 05 Oct, 15:45
Jukka Zitting (Updated) (JIRA) [jira] [Updated] (TIKA-605) Tika GDAL parser Wed, 05 Oct, 16:41
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-699) Automatic checks against backwards-incompatible API changes Wed, 05 Oct, 16:59
Chris A. Mattmann (Commented) (JIRA) [jira] [Commented] (TIKA-605) Tika GDAL parser Wed, 05 Oct, 17:03
Jukka Zitting (Created) (JIRA) [jira] [Created] (TIKA-744) Drop support for Java 1.4 Wed, 05 Oct, 17:07
Jukka Zitting (Resolved) (JIRA) [jira] [Resolved] (TIKA-744) Drop support for Java 1.4 Wed, 05 Oct, 17:09
Message list1 · 2 · 3 · 4 · Next »Thread · Author · Date
Box list
Dec 2014191
Nov 2014389
Oct 2014481
Sep 2014364
Aug 2014393
Jul 2014328
Jun 2014671
May 2014298
Apr 2014161
Mar 2014226
Feb 2014293
Jan 2014150
Dec 2013155
Nov 201384
Oct 2013100
Sep 201386
Aug 2013103
Jul 2013146
Jun 2013138
May 2013126
Apr 201374
Mar 201370
Feb 2013174
Jan 2013205
Dec 2012109
Nov 2012124
Oct 2012118
Sep 201261
Aug 2012173
Jul 2012274
Jun 2012102
May 2012174
Apr 2012180
Mar 2012200
Feb 2012125
Jan 2012189
Dec 2011287
Nov 2011259
Oct 2011336
Sep 2011356
Aug 2011197
Jul 2011120
Jun 2011122
May 2011184
Apr 2011137
Mar 2011161
Feb 2011111
Jan 201185
Dec 201099
Nov 2010252
Oct 2010144
Sep 2010168
Aug 2010253
Jul 2010192
Jun 2010154
May 2010132
Apr 2010115
Mar 201090
Feb 201062
Jan 2010134
Dec 2009125
Nov 2009179
Oct 200989
Sep 2009115
Aug 200946
Jul 200977
Jun 200994
May 200981
Apr 200936
Mar 200996
Feb 200974
Jan 200993
Dec 2008112
Nov 2008147
Oct 200854
Sep 2008108
Aug 200826
Jul 200817
Jun 200820
May 200816
Apr 200844
Mar 200873
Feb 200836
Jan 200888
Dec 200785
Nov 2007100
Oct 2007424
Sep 2007265
Aug 200719
Jul 200730
Jun 200751
May 200721
Apr 200712
Mar 200712