tika-dev mailing list archives: July 2010

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Thu, 01 Jul, 16:21
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Thu, 01 Jul, 20:33
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Thu, 01 Jul, 20:46
Janno Veldemann (JIRA) [jira] Created: (TIKA-453) Conflicting Estonian language profile code to ISO 639 Fri, 02 Jul, 13:50
Julien Nioche (JIRA) [jira] Created: (TIKA-454) Illegal Charset Name crashes HTMLParser Fri, 02 Jul, 14:54
Julien Nioche (JIRA) [jira] Updated: (TIKA-454) Illegal Charset Name crashes HTMLParser Fri, 02 Jul, 15:02
Julien Nioche (JIRA) [jira] Assigned: (TIKA-454) Illegal Charset Name crashes HTMLParser Fri, 02 Jul, 15:34
Nick Burch (JIRA) [jira] Assigned: (TIKA-408) Word 6.0/7.0 documents support in office parser Fri, 02 Jul, 21:51
Nick Burch (JIRA) [jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser Fri, 02 Jul, 21:54
Nick Burch (JIRA) [jira] Commented: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document Fri, 02 Jul, 21:56
Nick Burch (JIRA) [jira] Resolved: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document Fri, 02 Jul, 21:56
Kevin Miller (JIRA) [jira] Commented: (TIKA-212) Do you have Tika in .NET? Sat, 03 Jul, 22:30
Martijn van Groningen (JIRA) [jira] Updated: (TIKA-402) Support for iWork documents Sun, 04 Jul, 19:02
Julien Nioche (JIRA) [jira] Closed: (TIKA-454) Illegal Charset Name crashes HTMLParser Mon, 05 Jul, 08:51
Andrzej Bialecki (JIRA) [jira] Created: (TIKA-455) Zip parser stuck on truncated zip files. Mon, 05 Jul, 12:08
Andrzej Bialecki (JIRA) [jira] Updated: (TIKA-455) Zip parser stuck on truncated zip files. Mon, 05 Jul, 12:08
Stefan Bodewig (JIRA) [jira] Commented: (TIKA-455) Zip parser stuck on truncated zip files. Mon, 05 Jul, 12:32
Ken Krugler (JIRA) [jira] Created: (TIKA-456) Support timeouts for parsers Mon, 05 Jul, 20:42
Ken Krugler (JIRA) [jira] Commented: (TIKA-456) Support timeouts for parsers Mon, 05 Jul, 20:48
Jukka Zitting (JIRA) [jira] Commented: (TIKA-456) Support timeouts for parsers Tue, 06 Jul, 11:40
Jukka Zitting (JIRA) [jira] Commented: (TIKA-456) Support timeouts for parsers Tue, 06 Jul, 11:48
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-455) Zip parser stuck on truncated zip files. Tue, 06 Jul, 11:50
Jukka Zitting (JIRA) [jira] Reopened: (TIKA-455) Zip parser stuck on truncated zip files. Tue, 06 Jul, 11:50
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-455) Zip parser stuck on truncated zip files. Tue, 06 Jul, 11:50
Andrzej Bialecki (JIRA) [jira] Commented: (TIKA-456) Support timeouts for parsers Tue, 06 Jul, 12:13
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-402) Support for iWork documents Tue, 06 Jul, 12:19
Martijn van Groningen (JIRA) [jira] Commented: (TIKA-402) Support for iWork documents Tue, 06 Jul, 12:39
Jukka Zitting (JIRA) [jira] Commented: (TIKA-402) Support for iWork documents Tue, 06 Jul, 12:45
Jukka Zitting (JIRA) [jira] Commented: (TIKA-292) PDFBox is too verbose Tue, 06 Jul, 12:49
Jukka Zitting (JIRA) [jira] Reopened: (TIKA-446) Upgrade to PDFBox 1.2.0 Tue, 06 Jul, 12:57
Martijn v Groningen Re: [jira] Commented: (TIKA-402) Support for iWork documents Tue, 06 Jul, 12:59
Apache Hudson Server Hudson build became unstable: Tika-trunk » Apache Tika parsers #313 Tue, 06 Jul, 13:01
Apache Hudson Server Hudson build became unstable: Tika-trunk #313 Tue, 06 Jul, 13:01
Jukka Zitting (JIRA) [jira] Reopened: (TIKA-402) Support for iWork documents Tue, 06 Jul, 13:19
Jukka Zitting Re: Hudson build became unstable: Tika-trunk » Apache Tika parsers #313 Tue, 06 Jul, 13:19
Apache Hudson Server Hudson build is back to stable : Tika-trunk » Apache Tika parsers #314 Tue, 06 Jul, 14:26
Apache Hudson Server Hudson build is back to stable : Tika-trunk #314 Tue, 06 Jul, 14:26
Jukka Zitting (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Tue, 06 Jul, 17:07
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Tue, 06 Jul, 17:41
Oleg Tikhonov Re: [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Tue, 06 Jul, 18:54
Nick Burch (JIRA) [jira] Assigned: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Tue, 06 Jul, 20:21
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Tue, 06 Jul, 20:21
Martijn van Groningen (JIRA) [jira] Updated: (TIKA-402) Support for iWork documents Tue, 06 Jul, 20:49
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-402) Support for iWork documents Wed, 07 Jul, 07:11
build...@apache.org buildbot success in ASF Buildbot on tika-trunk Wed, 07 Jul, 07:17
Jukka Zitting (JIRA) [jira] Commented: (TIKA-402) Support for iWork documents Wed, 07 Jul, 07:21
rohanpatil Tika 0.7 And Solr Wed, 07 Jul, 11:01
Arturo Beltran Re: Getting started Wed, 07 Jul, 11:25
Martijn van Groningen (JIRA) [jira] Commented: (TIKA-402) Support for iWork documents Wed, 07 Jul, 13:31
Mattmann, Chris A (388J) Re: Getting started Wed, 07 Jul, 14:04
Julien Nioche Specify HTMLHandler via Context Wed, 07 Jul, 15:08
Mattmann, Chris A (388J) Re: Specify HTMLHandler via Context Wed, 07 Jul, 15:11
Julien Nioche (JIRA) [jira] Created: (TIKA-457) HTMLParser gets an early </body> event Wed, 07 Jul, 15:28
Julien Nioche (JIRA) [jira] Updated: (TIKA-458) Specify HTMLHandler via Context Wed, 07 Jul, 15:30
Julien Nioche (JIRA) [jira] Created: (TIKA-458) Specify HTMLHandler via Context Wed, 07 Jul, 15:30
Ken Krugler Re: Tika 0.7 And Solr Wed, 07 Jul, 16:44
rohanpatil Re: Tika 0.7 And Solr Wed, 07 Jul, 17:15
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Wed, 07 Jul, 17:56
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Wed, 07 Jul, 18:14
Ken Krugler (JIRA) [jira] Commented: (TIKA-457) HTMLParser gets an early </body> event Wed, 07 Jul, 20:30
Jukka Zitting (JIRA) [jira] Commented: (TIKA-458) Specify HTMLHandler via Context Wed, 07 Jul, 22:11
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Wed, 07 Jul, 22:21
Jukka Zitting (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Wed, 07 Jul, 22:25
Nick Burch (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Wed, 07 Jul, 22:28
Ken Krugler (JIRA) [jira] Closed: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names Wed, 07 Jul, 23:36
Ken Krugler (JIRA) [jira] Created: (TIKA-459) Improve handling of incorrect charset names in HTTP response header Thu, 08 Jul, 00:08
Ken Krugler (JIRA) [jira] Updated: (TIKA-459) Improve handling of incorrect charset names in HTTP response header Thu, 08 Jul, 00:16
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-459) Improve handling of incorrect charset names in HTTP response header Thu, 08 Jul, 01:09
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED Thu, 08 Jul, 02:01
Julien Nioche (JIRA) [jira] Created: (TIKA-460) HTMLHandler misses treatment of A elements Thu, 08 Jul, 13:54
Julien Nioche (JIRA) [jira] Updated: (TIKA-460) HTMLHandler misses treatment of A elements Thu, 08 Jul, 13:56
Joshua Turner (JIRA) [jira] Created: (TIKA-461) RFC822 messages not parsed Thu, 08 Jul, 14:45
build...@apache.org buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 17:56
Ken Krugler (JIRA) [jira] Resolved: (TIKA-459) Improve handling of incorrect charset names in HTTP response header Thu, 08 Jul, 17:58
Mattmann, Chris A (388J) Re: buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 18:00
Ken Krugler Re: buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 18:35
Mattmann, Chris A (388J) Re: buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 18:38
Ken Krugler Re: buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 18:44
Mattmann, Chris A (388J) Re: buildbot failure in ASF Buildbot on tika-trunk Thu, 08 Jul, 22:49
Ken Krugler (JIRA) [jira] Updated: (TIKA-453) Conflicting Estonian language profile code to ISO 639 Fri, 09 Jul, 21:20
Ken Krugler (JIRA) [jira] Updated: (TIKA-453) Conflicting Estonian language profile code to ISO 639 Fri, 09 Jul, 21:23
Ken Krugler (JIRA) [jira] Resolved: (TIKA-453) Conflicting Estonian language profile code to ISO 639 Fri, 09 Jul, 21:27
build...@apache.org buildbot success in ASF Buildbot on tika-trunk Fri, 09 Jul, 21:31
Ken Krugler (JIRA) [jira] Created: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Fri, 09 Jul, 21:39
Ken Krugler (JIRA) [jira] Assigned: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Fri, 09 Jul, 21:40
Ken Krugler (JIRA) [jira] Updated: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages Sat, 10 Jul, 00:18
Ken Krugler TIKA-420 patch for boilerplate removal Sat, 10 Jul, 00:23
Jukka Zitting (JIRA) [jira] Updated: (TIKA-446) Upgrade to PDFBox 1.2.1 Sun, 11 Jul, 04:57
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-446) Upgrade to PDFBox 1.2.1 Sun, 11 Jul, 05:03
Ken Krugler (JIRA) [jira] Assigned: (TIKA-394) Missing spaces on html parsing Sun, 11 Jul, 21:34
Ken Krugler (JIRA) [jira] Commented: (TIKA-394) Missing spaces on html parsing Sun, 11 Jul, 21:39
Paul Jakubik Packages and attributes Mon, 12 Jul, 15:03
Nick Burch Re: Packages and attributes Mon, 12 Jul, 15:37
Paul Jakubik Re: Packages and attributes Mon, 12 Jul, 16:26
Nick Burch Re: Packages and attributes Mon, 12 Jul, 17:12
Ken Krugler Boilerpipe integration Mon, 12 Jul, 17:34
Ken Krugler (JIRA) [jira] Resolved: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages Mon, 12 Jul, 17:34
build...@apache.org buildbot failure in ASF Buildbot on tika-trunk Mon, 12 Jul, 17:35
Alex Ott Re: Packages and attributes Mon, 12 Jul, 17:59
Ken Krugler Re: buildbot failure in ASF Buildbot on tika-trunk Mon, 12 Jul, 18:05
Message list1 · 2 · Next »Thread · Author · Date
Box list
Sep 2014211
Aug 2014393
Jul 2014328
Jun 2014671
May 2014298
Apr 2014161
Mar 2014226
Feb 2014293
Jan 2014150
Dec 2013155
Nov 201384
Oct 2013100
Sep 201386
Aug 2013103
Jul 2013146
Jun 2013138
May 2013126
Apr 201374
Mar 201370
Feb 2013174
Jan 2013205
Dec 2012109
Nov 2012124
Oct 2012118
Sep 201261
Aug 2012173
Jul 2012274
Jun 2012102
May 2012174
Apr 2012180
Mar 2012200
Feb 2012125
Jan 2012189
Dec 2011287
Nov 2011259
Oct 2011336
Sep 2011356
Aug 2011197
Jul 2011120
Jun 2011122
May 2011184
Apr 2011137
Mar 2011161
Feb 2011111
Jan 201185
Dec 201099
Nov 2010252
Oct 2010144
Sep 2010168
Aug 2010253
Jul 2010192
Jun 2010154
May 2010132
Apr 2010115
Mar 201090
Feb 201062
Jan 2010134
Dec 2009125
Nov 2009179
Oct 200989
Sep 2009115
Aug 200946
Jul 200977
Jun 200994
May 200981
Apr 200936
Mar 200996
Feb 200974
Jan 200993
Dec 2008112
Nov 2008147
Oct 200854
Sep 2008108
Aug 200826
Jul 200817
Jun 200820
May 200816
Apr 200844
Mar 200873
Feb 200836
Jan 200888
Dec 200785
Nov 2007100
Oct 2007424
Sep 2007265
Aug 200719
Jul 200730
Jun 200751
May 200721
Apr 200712
Mar 200712