tika-dev mailing list archives: October 2009

Jukka Zitting (JIRA) [jira] Resolved: (TIKA-294) TikaCLI always uses System.in for input Fri, 02 Oct, 10:19
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16 Fri, 02 Oct, 10:53
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-256) MSWord parser does not extract footnotes and comments Fri, 02 Oct, 10:59
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-279) XWPFWordExtractorDecorator does not extract some headers/footers Fri, 02 Oct, 11:07
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-293) XWPFWordExtractorDecorator does not extract bookmarks Fri, 02 Oct, 11:11
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-295) Rough cut of mbox parser Fri, 02 Oct, 11:27
Jukka Zitting (JIRA) [jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser Fri, 02 Oct, 11:31
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-277) Tika stand alone CLI --possibility to specify output encoding (--text) Fri, 02 Oct, 11:33
MRIT64 (JIRA) [jira] Commented: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16 Fri, 02 Oct, 18:53
Bart Hanssens (JIRA) [jira] Created: (TIKA-300) rename openoffice.. parser classes to odf.. Mon, 05 Oct, 20:45
Bart Hanssens (JIRA) [jira] Created: (TIKA-301) patch: embedded ODF and office:annotation Mon, 05 Oct, 20:53
Bart Hanssens (JIRA) [jira] Updated: (TIKA-301) patch: embedded ODF and office:annotation Mon, 05 Oct, 20:55
Bart Hanssens (JIRA) [jira] Created: (TIKA-302) patch: initial support for ePUB Wed, 07 Oct, 21:13
Bart Hanssens (JIRA) [jira] Updated: (TIKA-302) patch: initial support for ePUB Wed, 07 Oct, 21:15
Ken Krugler Info from parser on handling partial input Thu, 08 Oct, 16:52
Hanssens Bart RE: [bulk] Info from parser on handling partial input Thu, 08 Oct, 17:34
Luciano Leggieri (JIRA) [jira] Commented: (TIKA-245) Support of CHM Format Thu, 08 Oct, 20:27
Benson Margulies (JIRA) [jira] Created: (TIKA-303) XHTMLContentHandler mishandles headers Thu, 08 Oct, 23:34
Benson Margulies (JIRA) [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 01:56
Jukka Zitting Re: Info from parser on handling partial input Fri, 09 Oct, 09:00
Jukka Zitting Re: [bulk] Info from parser on handling partial input Fri, 09 Oct, 09:06
Jukka Zitting (JIRA) [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 09:36
Jukka Zitting (JIRA) [jira] Commented: (TIKA-245) Support of CHM Format Fri, 09 Oct, 09:42
Jukka Zitting (JIRA) [jira] Commented: (TIKA-302) patch: initial support for ePUB Fri, 09 Oct, 09:46
Hanssens Bart RE: [bulk] [jira] Commented: (TIKA-302) patch: initial support for ePUB Fri, 09 Oct, 09:58
Benson Margulies (JIRA) [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 11:26
Benson Margulies (JIRA) [jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 12:21
Benson Margulies (JIRA) [jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 12:21
Benson Margulies (JIRA) [jira] Created: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:27
Benson Margulies (JIRA) [jira] Updated: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:27
Benson Margulies (JIRA) [jira] Updated: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:29
Benson Margulies (JIRA) [jira] Created: (TIKA-305) XHTML href attributes end up in the wrong namespace Fri, 09 Oct, 12:42
Benson Margulies (JIRA) [jira] Updated: (TIKA-305) XHTML href attributes end up in the wrong namespace Fri, 09 Oct, 12:42
Ken Krugler (JIRA) [jira] Commented: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 16:33
Benson Margulies (JIRA) [jira] Commented: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 16:39
Bart Hanssens (JIRA) [jira] Updated: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser Fri, 09 Oct, 18:27
Bart Hanssens (JIRA) [jira] Created: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser Fri, 09 Oct, 18:27
Ken Krugler Re: Info from parser on handling partial input Sat, 10 Oct, 14:15
Ken Krugler (JIRA) [jira] Created: (TIKA-307) Better handling of partial/truncated input data to parsers Sat, 10 Oct, 14:15
Ken Krugler Re: Super-types for text mime types Sun, 11 Oct, 15:44
Ken Krugler (JIRA) [jira] Created: (TIKA-308) Improve supertype handling in type registry Sun, 11 Oct, 15:44
Ken Krugler (JIRA) [jira] Commented: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back Sun, 11 Oct, 15:48
Ken Krugler Re: Fall-back parser in AutoDetectParser Sun, 11 Oct, 15:50
Ken Krugler (JIRA) [jira] Commented: (TIKA-295) Rough cut of mbox parser Sun, 11 Oct, 16:06
Ken Krugler (JIRA) [jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser Sun, 11 Oct, 16:08
Yuan-Fang Li (JIRA) [jira] Created: (TIKA-309) Mime type application/rdf+xml not correctly detected Tue, 13 Oct, 04:41
Alex Baranov (JIRA) [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 05:51
Alex Baranov (JIRA) [jira] Issue Comment Edited: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 06:13
Thilo Goetz (JIRA) [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 09:36
Ken Krugler (JIRA) [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 14:20
Ken Krugler (JIRA) [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 14:22
Jukka Zitting Re: Fall-back parser in AutoDetectParser Wed, 14 Oct, 18:33
Jukka Zitting Eclipse formatter (Was: [jira] Commented: (TIKA-295) Rough cut of mbox parser) Wed, 14 Oct, 18:47
Jukka Zitting (JIRA) [jira] Created: (TIKA-310) Use TagSoup to parse HTML Wed, 14 Oct, 19:40
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-310) Use TagSoup to parse HTML Wed, 14 Oct, 19:50
Jukka Zitting FYI: NekoHTML/Xerces dependency replaced with TagSoup Wed, 14 Oct, 19:57
Jukka Zitting (JIRA) [jira] Created: (TIKA-311) Broken handling of <a name="..."/> tags Wed, 14 Oct, 20:04
Ken Krugler Re: FYI: NekoHTML/Xerces dependency replaced with TagSoup Wed, 14 Oct, 20:23
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-311) Broken handling of <a name="..."/> tags Wed, 14 Oct, 20:42
Jukka Zitting (JIRA) [jira] Assigned: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements Wed, 14 Oct, 21:06
Jukka Zitting (JIRA) [jira] Commented: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements Wed, 14 Oct, 22:20
Ken Krugler (JIRA) [jira] Updated: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements Wed, 14 Oct, 23:06
Ken Krugler (JIRA) [jira] Commented: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements Wed, 14 Oct, 23:16
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-287) HtmlParser should resolve relative paths in <a href="xxx"> elements Fri, 16 Oct, 12:24
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-309) Mime type application/rdf+xml not correctly detected Fri, 16 Oct, 13:21
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser Fri, 16 Oct, 13:25
Maxim Valyanskiy (JIRA) [jira] Created: (TIKA-312) TikaCLI can't print metadata Fri, 16 Oct, 13:31
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 16 Oct, 13:31
Maxim Valyanskiy (JIRA) [jira] Updated: (TIKA-312) TikaCLI can't print metadata Fri, 16 Oct, 13:33
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-305) XHTML href attributes end up in the wrong namespace Fri, 16 Oct, 14:01
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-304) HtmlParser could be easier to subclass Fri, 16 Oct, 14:53
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-302) patch: initial support for ePUB Fri, 16 Oct, 15:45
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-301) patch: embedded ODF and office:annotation Fri, 16 Oct, 15:49
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-312) TikaCLI can't print metadata Fri, 16 Oct, 15:51
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-300) rename openoffice.. parser classes to odf.. Fri, 16 Oct, 16:07
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-187) Extract the summary.getCategory() from MSOffice documents Fri, 16 Oct, 16:19
mastcheshmi MarkUnsupportedException Sat, 17 Oct, 12:43
Bart Hanssens (JIRA) [jira] Created: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes Sat, 17 Oct, 20:15
Bart Hanssens (JIRA) [jira] Updated: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes Sat, 17 Oct, 20:17
Maxim Valyanskiy (JIRA) [jira] Created: (TIKA-314) Initial support for JPEG EXIF metadata extraction Tue, 20 Oct, 13:21
Maxim Valyanskiy (JIRA) [jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction Tue, 20 Oct, 13:21
Maxim Valyanskiy (JIRA) [jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction Tue, 20 Oct, 13:25
Maxim Valyanskiy (JIRA) [jira] Updated: (TIKA-314) Initial support for JPEG EXIF metadata extraction Tue, 20 Oct, 13:25
Jukka Zitting Re: MarkUnsupportedException Sat, 24 Oct, 19:02
Jukka Zitting (JIRA) [jira] Commented: (TIKA-314) Initial support for JPEG EXIF metadata extraction Sat, 24 Oct, 23:24
Jukka Zitting (JIRA) [jira] Commented: (TIKA-314) Initial support for JPEG EXIF metadata extraction Sat, 24 Oct, 23:42
Sanjeev Rao (JIRA) [jira] Created: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document Mon, 26 Oct, 16:14
Sanjeev Rao (JIRA) [jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document Mon, 26 Oct, 16:16
Sanjeev Rao (JIRA) [jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document Mon, 26 Oct, 16:22