tika-dev mailing list archives: December 2011

Guyot Raphaël Subscribing Mon, 05 Dec, 21:01
Aami arrayindex out of bounds exception Fri, 30 Dec, 06:52
Adei Mandaluniz (Commented) (JIRA) [jira] [Commented] (TIKA-291) Adobe InDesign support Mon, 19 Dec, 12:15
Adei Mandaluniz (Updated) (JIRA) [jira] [Updated] (TIKA-682) Creative Suite formats are not supported Mon, 19 Dec, 12:17
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-817) (PPT/PPTX) Missing date/time in text content. Mon, 19 Dec, 16:09
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. Mon, 19 Dec, 18:23
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. Mon, 19 Dec, 18:23
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-817) (PPT/PPTX) Missing date/time in text content. Mon, 19 Dec, 18:27
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content Tue, 20 Dec, 14:33
Albert L. (Commented) (JIRA) [jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content Wed, 21 Dec, 14:23
Albert L. (Created) (JIRA) [jira] [Created] (TIKA-816) (XLS/XLSX) Missing date/time in text content. Mon, 19 Dec, 15:57
Albert L. (Created) (JIRA) [jira] [Created] (TIKA-817) (PPT/PPTX) Missing date/time in text content. Mon, 19 Dec, 16:07
Albert L. (Created) (JIRA) [jira] [Created] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content Mon, 19 Dec, 18:49
Albert L. (Updated) (JIRA) [jira] [Updated] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. Mon, 19 Dec, 15:59
Alex Ott Re: Tesseract OCR engine Thu, 01 Dec, 08:10
Alex Ott (Commented) (JIRA) [jira] [Commented] (TIKA-806) MS Word Detection magics are a bit overzealous Fri, 09 Dec, 15:52
Alex Ott (Commented) (JIRA) [jira] [Commented] (TIKA-823) Detect StarOffice files Wed, 21 Dec, 08:13
Andrzej Bialecki (Commented) (JIRA) [jira] [Commented] (TIKA-623) Add support for Outlook PST Thu, 01 Dec, 11:48
Andrzej Bialecki (Created) (JIRA) [jira] [Created] (TIKA-800) mark/reset not supported from POIFSContainerDetector Mon, 05 Dec, 11:25
Andrzej Bialecki (Created) (JIRA) [jira] [Created] (TIKA-801) ContentHandlerDecorator outputs invalid element Mon, 05 Dec, 12:37
Andrzej Bialecki (Updated) (JIRA) [jira] [Updated] (TIKA-813) Webarchive detection. Tue, 13 Dec, 18:58
Antoni Mylka Re: [ANNOUNCE] Welcome Antoni Mylka as Tika committer + PMC member Mon, 12 Dec, 18:19
Antoni Mylka JIRA rights. Tue, 13 Dec, 13:05
Antoni Mylka Re: Pushing parsers upstream Tue, 13 Dec, 13:44
Antoni Mylka Re: Pushing parsers upstream Tue, 13 Dec, 17:34
Antoni Mylka Re: Pushing parsers upstream Fri, 16 Dec, 18:45
Antoni Mylka Re: Pushing parsers upstream Fri, 16 Dec, 19:04
Antoni Mylka Re: Pushing parsers upstream Fri, 16 Dec, 20:27
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-798) Distinguish between EMF and WMF Tue, 13 Dec, 16:23
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-791) Fix the detection of protected OOXML files Wed, 14 Dec, 13:23
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files Mon, 19 Dec, 11:19
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-813) Webarchive detection. Mon, 19 Dec, 11:29
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-814) Increase the amount of bytes read by TextDetector Mon, 19 Dec, 11:41
Antoni Mylka (Closed) (JIRA) [jira] [Closed] (TIKA-823) Detect StarOffice files Wed, 21 Dec, 12:03
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-806) MS Word Detection magics are a bit overzealous Mon, 12 Dec, 14:52
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-806) MS Word Detection magics are a bit overzealous Tue, 13 Dec, 12:43
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-810) Upgrade to PDFbox 1.7.0 as available Fri, 16 Dec, 17:50
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-686) Split tika-parsers into separate components Tue, 20 Dec, 12:43
Antoni Mylka (Commented) (JIRA) [jira] [Commented] (TIKA-821) Support detecting old Microsoft Works Word Processor formats Tue, 20 Dec, 15:57
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. Fri, 02 Dec, 12:07
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-798) Distinguish between EMF and WMF Fri, 02 Dec, 13:43
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-806) MS Word Detection magics are a bit overzealous Fri, 09 Dec, 15:06
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files Tue, 13 Dec, 16:21
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-813) Webarchive detection. Tue, 13 Dec, 18:12
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-814) Increase the amount of bytes read by TextDetector Tue, 13 Dec, 20:33
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-821) Support detecting old Microsoft Works Word Processor formats Tue, 20 Dec, 15:51
Antoni Mylka (Created) (JIRA) [jira] [Created] (TIKA-823) Detect StarOffice files Tue, 20 Dec, 23:07
Antoni Mylka (Resolved) (JIRA) [jira] [Resolved] (TIKA-806) MS Word Detection magics are a bit overzealous Tue, 13 Dec, 13:37
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. Fri, 02 Dec, 12:09
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-798) Distinguish between EMF and WMF Fri, 02 Dec, 13:43
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-806) MS Word Detection magics are a bit overzealous Fri, 09 Dec, 15:42
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-806) MS Word Detection magics are a bit overzealous Fri, 09 Dec, 15:44
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-806) MS Word Detection magics are a bit overzealous Fri, 09 Dec, 15:58
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-806) MS Word Detection magics are a bit overzealous Mon, 12 Dec, 15:33
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files Tue, 13 Dec, 16:25
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-813) Webarchive detection. Tue, 13 Dec, 18:12
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-814) Increase the amount of bytes read by TextDetector Tue, 13 Dec, 20:33
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-813) Webarchive detection. Wed, 14 Dec, 11:47
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-813) Webarchive detection. Wed, 14 Dec, 11:47
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files Wed, 14 Dec, 12:11
Antoni Mylka (Updated) (JIRA) [jira] [Updated] (TIKA-823) Detect StarOffice files Tue, 20 Dec, 23:09
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #742 Tue, 06 Dec, 02:02
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #743 Tue, 06 Dec, 16:25
Arthur Meneau (Commented) (JIRA) [jira] [Commented] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 01:17
Arthur Meneau (Commented) (JIRA) [jira] [Commented] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 03:01
Arthur Meneau (Commented) (JIRA) [jira] [Commented] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 03:25
Arthur Meneau (Created) (JIRA) [jira] [Created] (TIKA-799) ForkParser does not populate metadata object after completing a parse Sat, 03 Dec, 02:27
Arthur Meneau (Created) (JIRA) [jira] [Created] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 01:13
Arthur Meneau (Updated) (JIRA) [jira] [Updated] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 01:17
Arthur Meneau (Updated) (JIRA) [jira] [Updated] (TIKA-802) NullPointerException when parsing iWork files Tue, 06 Dec, 01:17
Babu Gajendran (Commented) (JIRA) [jira] [Commented] (TIKA-804) Parsing outlook format template (.oft ) Sat, 10 Dec, 06:23
Babu Gajendran (Commented) (JIRA) [jira] [Commented] (TIKA-804) Parsing outlook format template (.oft ) Mon, 12 Dec, 06:54
Babu Gajendran (Created) (JIRA) [jira] [Created] (TIKA-804) Parsing outlook format template (.oft ) Fri, 09 Dec, 14:28
Babu Gajendran (Updated) (JIRA) [jira] [Updated] (TIKA-804) Parsing outlook format template (.oft ) Sat, 10 Dec, 06:23
Babu Gajendran (Updated) (JIRA) [jira] [Updated] (TIKA-804) Parsing outlook format template (.oft ) Sat, 10 Dec, 06:23
Chris A. Mattmann (Assigned) (JIRA) [jira] [Assigned] (TIKA-824) Extract rel attr with LinkContentHandler Wed, 21 Dec, 15:41
Damon Rand (Updated) (JIRA) [jira] [Updated] (TIKA-682) Creative Suite formats are not supported Fri, 09 Dec, 10:04
Daniel Bonniot de Ruisselet (Commented) (JIRA) [jira] [Commented] (TIKA-820) Locator is unset for HTML parser Tue, 20 Dec, 09:03
Daniel Bonniot de Ruisselet (Created) (JIRA) [jira] [Created] (TIKA-820) Locator is unset for HTML parser Tue, 20 Dec, 09:01
Daniel Bonniot de Ruisselet (Updated) (JIRA) [jira] [Updated] (TIKA-820) Locator is unset for HTML parser Tue, 20 Dec, 09:01
David Tran (Commented) (JIRA) [jira] [Commented] (TIKA-423) Parse docx and output to text file missing words Tue, 20 Dec, 06:27
Devin Han [VOTE] Release Apache ODF Toolkit 0.5-incubating(RC6) Sat, 24 Dec, 09:11
Devin Han [VOTE] Release Apache ODF Toolkit 0.5-incubating(RC7) Tue, 27 Dec, 16:45
Emmanuel Hugonnet (Commented) (JIRA) [jira] [Commented] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support Tue, 13 Dec, 10:31
Emmanuel Hugonnet (Created) (JIRA) [jira] [Created] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support Tue, 13 Dec, 09:53
Emmanuel Hugonnet (Updated) (JIRA) [jira] [Updated] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support Tue, 13 Dec, 09:55
Fabian Lange (Commented) (JIRA) [jira] [Commented] (TIKA-762) EXIF extraction from PNG images Fri, 02 Dec, 15:37
Fabian Lange (Commented) (JIRA) [jira] [Commented] (TIKA-526) OOXMLParser fails to extract text from within smart tags Mon, 05 Dec, 10:07
Fabian Lange (Commented) (JIRA) [jira] [Commented] (TIKA-423) Parse docx and output to text file missing words Mon, 05 Dec, 12:37
Franz Canaval (Created) (JIRA) [jira] [Created] (TIKA-796) Tika breaks words of rotated text in PDF documents Thu, 01 Dec, 11:21
George Kappel (Created) (JIRA) [jira] [Created] (TIKA-834) server problem only 1st (-m -j) result is correct additional runs include data from previous runs Wed, 28 Dec, 22:51
George Kappel (Updated) (JIRA) [jira] [Updated] (TIKA-834) server problem only 1st result is correct additional runs include data from 1st run Thu, 29 Dec, 03:26
Ingo Renner Re: Multilingual Tika Sat, 10 Dec, 12:22
Ingo Renner (Created) (JIRA) [jira] [Created] (TIKA-807) PHP version of Tika Sat, 10 Dec, 12:21
Ingo Renner (Updated) (JIRA) [jira] [Updated] (TIKA-807) PHP version of Tika Sat, 10 Dec, 12:23
Jeremy Anderson (Closed) (JIRA) [jira] [Closed] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() Thu, 29 Dec, 16:17
Jeremy Anderson (Commented) (JIRA) [jira] [Commented] (TIKA-810) Upgrade to PDFbox 1.7.0 as available Fri, 23 Dec, 13:22
Jeremy Anderson (Commented) (JIRA) [jira] [Commented] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() Tue, 27 Dec, 17:10
Jeremy Anderson (Commented) (JIRA) [jira] [Commented] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() Thu, 29 Dec, 13:47
Jeremy Anderson (Created) (JIRA) [jira] [Created] (TIKA-810) Upgrade to PDFbox 1.7.0 as available Mon, 12 Dec, 19:29