tika-dev mailing list archives: October 2009

Jukka Zitting (JIRA) [jira] Resolved: (TIKA-294) TikaCLI always uses System.in for input Fri, 02 Oct, 10:19
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16 Fri, 02 Oct, 10:53
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-256) MSWord parser does not extract footnotes and comments Fri, 02 Oct, 10:59
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-279) XWPFWordExtractorDecorator does not extract some headers/footers Fri, 02 Oct, 11:07
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-293) XWPFWordExtractorDecorator does not extract bookmarks Fri, 02 Oct, 11:11
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-295) Rough cut of mbox parser Fri, 02 Oct, 11:27
[jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Jukka Zitting (JIRA)   [jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser Fri, 02 Oct, 11:31
Ken Krugler (JIRA)   [jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser Sun, 11 Oct, 16:08
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-277) Tika stand alone CLI --possibility to specify output encoding (--text) Fri, 02 Oct, 11:33
MRIT64 (JIRA) [jira] Commented: (TIKA-290) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16 Fri, 02 Oct, 18:53
Bart Hanssens (JIRA) [jira] Created: (TIKA-300) rename openoffice.. parser classes to odf.. Mon, 05 Oct, 20:45
Bart Hanssens (JIRA) [jira] Created: (TIKA-301) patch: embedded ODF and office:annotation Mon, 05 Oct, 20:53
Bart Hanssens (JIRA) [jira] Updated: (TIKA-301) patch: embedded ODF and office:annotation Mon, 05 Oct, 20:55
Bart Hanssens (JIRA) [jira] Created: (TIKA-302) patch: initial support for ePUB Wed, 07 Oct, 21:13
Hanssens Bart   RE: [bulk] [jira] Commented: (TIKA-302) patch: initial support for ePUB Fri, 09 Oct, 09:58
Bart Hanssens (JIRA) [jira] Updated: (TIKA-302) patch: initial support for ePUB Wed, 07 Oct, 21:15
Ken Krugler Info from parser on handling partial input Thu, 08 Oct, 16:52
Hanssens Bart   RE: [bulk] Info from parser on handling partial input Thu, 08 Oct, 17:34
Jukka Zitting     Re: [bulk] Info from parser on handling partial input Fri, 09 Oct, 09:06
Jukka Zitting   Re: Info from parser on handling partial input Fri, 09 Oct, 09:00
Ken Krugler     Re: Info from parser on handling partial input Sat, 10 Oct, 14:15
[jira] Commented: (TIKA-245) Support of CHM Format
Luciano Leggieri (JIRA)   [jira] Commented: (TIKA-245) Support of CHM Format Thu, 08 Oct, 20:27
Jukka Zitting (JIRA)   [jira] Commented: (TIKA-245) Support of CHM Format Fri, 09 Oct, 09:42
Benson Margulies (JIRA) [jira] Created: (TIKA-303) XHTMLContentHandler mishandles headers Thu, 08 Oct, 23:34
[jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)   [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 01:56
Jukka Zitting (JIRA)   [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 09:36
Benson Margulies (JIRA)   [jira] Commented: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 11:26
Jukka Zitting (JIRA) [jira] Commented: (TIKA-302) patch: initial support for ePUB Fri, 09 Oct, 09:46
[jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers
Benson Margulies (JIRA)   [jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 12:21
Benson Margulies (JIRA)   [jira] Updated: (TIKA-303) XHTMLContentHandler mishandles headers Fri, 09 Oct, 12:21
[jira] Updated: (TIKA-304) HtmlParser could be easier to subclass
Benson Margulies (JIRA)   [jira] Updated: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:27
Benson Margulies (JIRA)   [jira] Updated: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:29
Benson Margulies (JIRA) [jira] Created: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 12:27
Benson Margulies (JIRA) [jira] Updated: (TIKA-305) XHTML href attributes end up in the wrong namespace Fri, 09 Oct, 12:42
Benson Margulies (JIRA) [jira] Created: (TIKA-305) XHTML href attributes end up in the wrong namespace Fri, 09 Oct, 12:42
[jira] Commented: (TIKA-304) HtmlParser could be easier to subclass
Ken Krugler (JIRA)   [jira] Commented: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 16:33
Benson Margulies (JIRA)   [jira] Commented: (TIKA-304) HtmlParser could be easier to subclass Fri, 09 Oct, 16:39
Bart Hanssens (JIRA) [jira] Updated: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser Fri, 09 Oct, 18:27
Bart Hanssens (JIRA) [jira] Created: (TIKA-306) patch: OOXMLParserTest uses OpenOfficeParser Fri, 09 Oct, 18:27
Ken Krugler (JIRA) [jira] Created: (TIKA-307) Better handling of partial/truncated input data to parsers Sat, 10 Oct, 14:15
Re: Super-types for text mime types
Ken Krugler   Re: Super-types for text mime types Sun, 11 Oct, 15:44
Ken Krugler (JIRA) [jira] Created: (TIKA-308) Improve supertype handling in type registry Sun, 11 Oct, 15:44
Ken Krugler (JIRA) [jira] Commented: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back Sun, 11 Oct, 15:48
Re: Fall-back parser in AutoDetectParser
Ken Krugler   Re: Fall-back parser in AutoDetectParser Sun, 11 Oct, 15:50
Jukka Zitting     Re: Fall-back parser in AutoDetectParser Wed, 14 Oct, 18:33
[jira] Commented: (TIKA-295) Rough cut of mbox parser
Ken Krugler (JIRA)   [jira] Commented: (TIKA-295) Rough cut of mbox parser Sun, 11 Oct, 16:06
Alex Baranov (JIRA)   [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 05:51
Thilo Goetz (JIRA)   [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 09:36
Ken Krugler (JIRA)   [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 14:20
Ken Krugler (JIRA)   [jira] Commented: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 14:22
Yuan-Fang Li (JIRA) [jira] Created: (TIKA-309) Mime type application/rdf+xml not correctly detected Tue, 13 Oct, 04:41
Alex Baranov (JIRA) [jira] Issue Comment Edited: (TIKA-295) Rough cut of mbox parser Wed, 14 Oct, 06:13
Jukka Zitting Eclipse formatter (Was: [jira] Commented: (TIKA-295) Rough cut of mbox parser) Wed, 14 Oct, 18:47
Jukka Zitting (JIRA) [jira] Created: (TIKA-310) Use TagSoup to parse HTML Wed, 14 Oct, 19:40
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-310) Use TagSoup to parse HTML Wed, 14 Oct, 19:50
Jukka Zitting FYI: NekoHTML/Xerces dependency replaced with TagSoup Wed, 14 Oct, 19:57
Ken Krugler   Re: FYI: NekoHTML/Xerces dependency replaced with TagSoup Wed, 14 Oct, 20:23
Jukka Zitting (JIRA) [jira] Created: (TIKA-311) Broken handling of <a name="..."/> tags Wed, 14 Oct, 20:04