tika-dev mailing list archives: August 2012

Site index · List index
Message list1 · 2 · Next »Thread · Author · Date
Luis Filipe Nassif (JIRA) [jira] [Commented] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader Wed, 01 Aug, 01:02
122jxgcn Re: Custom parser error Wed, 01 Aug, 01:14
Uwe Schindler RE: Custom parser error Wed, 01 Aug, 06:55
Jukka Zitting (JIRA) [jira] [Comment Edited] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 10:14
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 10:14
Ray Gauss II (JIRA) [jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 11:13
Jukka Zitting (JIRA) [jira] [Updated] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 11:31
Ray Gauss II (JIRA) [jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 11:54
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 12:15
Ray Gauss II (JIRA) [jira] [Resolved] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files Wed, 01 Aug, 13:47
Apache Jenkins Server Build failed in Jenkins: Tika-trunk #906 Wed, 01 Aug, 14:17
Ray Gauss II Re: Build failed in Jenkins: Tika-trunk #906 Wed, 01 Aug, 14:22
Jukka Zitting Re: Build failed in Jenkins: Tika-trunk #906 Wed, 01 Aug, 14:25
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Wed, 01 Aug, 14:58
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Wed, 01 Aug, 16:40
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Wed, 01 Aug, 16:51
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Wed, 01 Aug, 16:51
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Wed, 01 Aug, 16:55
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader Wed, 01 Aug, 19:30
Andreas Hubold (JIRA) [jira] [Created] (TIKA-967) Tika comes with transitive Maven dependency to a test artifact of vorbis-java-core Thu, 02 Aug, 07:26
122jxgcn Executing file inside Parser Thu, 02 Aug, 07:50
Nick Burch (JIRA) [jira] [Commented] (TIKA-967) Tika comes with transitive Maven dependency to a test artifact of vorbis-java-core Thu, 02 Aug, 08:40
Jukka Zitting (JIRA) [jira] [Resolved] (TIKA-709) Tika network server does not print anything in response to, for example, Word documents Thu, 02 Aug, 09:33
Andreas Hubold (JIRA) [jira] [Commented] (TIKA-967) Tika comes with transitive Maven dependency to a test artifact of vorbis-java-core Thu, 02 Aug, 12:32
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Thu, 02 Aug, 14:17
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Thu, 02 Aug, 14:45
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Thu, 02 Aug, 17:17
Gary Karasiuk (JIRA) [jira] [Created] (TIKA-968) tika-bundle missing org.apache.commons.logging.LogFactory Thu, 02 Aug, 17:42
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-968) tika-bundle missing org.apache.commons.logging.LogFactory Thu, 02 Aug, 17:46
Nick Burch Re: Executing file inside Parser Thu, 02 Aug, 21:42
Dave Meikle Re: Executing file inside Parser Thu, 02 Aug, 22:08
Jukka Zitting Tika at ApacheCon Fri, 03 Aug, 08:54
Nick Burch Re: Tika at ApacheCon Fri, 03 Aug, 10:45
Jukka Zitting Re: Tika at ApacheCon Fri, 03 Aug, 10:47
Gary Karasiuk (JIRA) [jira] [Commented] (TIKA-968) tika-bundle missing org.apache.commons.logging.LogFactory Fri, 03 Aug, 11:19
Richard Eccles (JIRA) [jira] [Created] (TIKA-969) Exception "org.apache.tika.exception.TikaException: Can't read JPEG metada" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I" when indexing some items Fri, 03 Aug, 12:05
Nick Burch (JIRA) [jira] [Commented] (TIKA-969) Exception "org.apache.tika.exception.TikaException: Can't read JPEG metada" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I" when indexing some items Fri, 03 Aug, 12:07
Richard Eccles (JIRA) [jira] [Updated] (TIKA-969) Exception "org.apache.tika.exception.TikaException: Can't read JPEG metada" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I" when indexing some items Fri, 03 Aug, 12:09
Ray Gauss II (JIRA) [jira] [Updated] (TIKA-969) TikaException Thrown When Handling Unknown Fields for Some JPEGs Fri, 03 Aug, 12:25
Richard Eccles (JIRA) [jira] [Commented] (TIKA-969) TikaException Thrown When Handling Unknown Fields for Some JPEGs Fri, 03 Aug, 12:27
Ray Gauss II (JIRA) [jira] [Resolved] (TIKA-969) TikaException Thrown When Handling Unknown Fields for Some JPEGs Fri, 03 Aug, 12:35
Richard Eccles (JIRA) [jira] [Commented] (TIKA-969) TikaException Thrown When Handling Unknown Fields for Some JPEGs Fri, 03 Aug, 12:53
Apache Jenkins Server Jenkins build is back to normal : Tika-trunk #907 Fri, 03 Aug, 13:12
Andrew Jackson (JIRA) [jira] [Created] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 13:21
Andrew Jackson (JIRA) [jira] [Updated] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 13:21
Andrew Jackson (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 13:35
Ray Gauss II (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 13:37
Andrew Jackson (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 14:02
Andrew Jackson (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 14:14
Andrew Jackson (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 14:26
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Fri, 03 Aug, 15:27
Mattmann, Chris A (388J) [VOTE] Graduate Apache Any23 from the Apache Incubator Fri, 03 Aug, 17:50
Michael McCandless (JIRA) [jira] [Assigned] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Sat, 04 Aug, 12:54
Michael McCandless (JIRA) [jira] [Updated] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Sat, 04 Aug, 13:01
Lewis John Mcgibbney Re: [VOTE] Graduate Apache Any23 from the Apache Incubator Sat, 04 Aug, 13:56
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Sun, 05 Aug, 13:50
Jukka Zitting (JIRA) [jira] [Resolved] (TIKA-970) Full identification of the JPEG 2000 family of formats Sun, 05 Aug, 14:46
Jukka Zitting (JIRA) [jira] [Resolved] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar Sun, 05 Aug, 17:15
Jukka Zitting (JIRA) [jira] [Resolved] (TIKA-968) tika-bundle missing org.apache.commons.logging.LogFactory Sun, 05 Aug, 17:58
Michael McCandless (JIRA) [jira] [Commented] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Sun, 05 Aug, 22:36
Michael McCandless (JIRA) [jira] [Updated] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Sun, 05 Aug, 22:40
122jxgcn AutoDetectParser not picking up custom parser Mon, 06 Aug, 11:48
Jukka Zitting Re: [VOTE] Graduate Apache Any23 from the Apache Incubator Mon, 06 Aug, 13:52
Dave Meikle Re: [VOTE] Graduate Apache Any23 from the Apache Incubator Mon, 06 Aug, 20:44
Andrew Jackson (JIRA) [jira] [Commented] (TIKA-970) Full identification of the JPEG 2000 family of formats Mon, 06 Aug, 21:03
Nick Burch Re: AutoDetectParser not picking up custom parser Mon, 06 Aug, 22:08
122jxgcn Re: AutoDetectParser not picking up custom parser Tue, 07 Aug, 01:14
Oleg Tikhonov Re: [VOTE] Graduate Apache Any23 from the Apache Incubator Tue, 07 Aug, 04:02
Nick Burch Re: AutoDetectParser not picking up custom parser Tue, 07 Aug, 08:47
122jxgcn Detecting content type with file extension Tue, 07 Aug, 09:02
Michael McCandless (JIRA) [jira] [Resolved] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end) Tue, 07 Aug, 21:43
François Ouellette (JIRA) [jira] [Created] (TIKA-971) The ToXMLContentHandler handler creates extra <?xml > entry when reading ODT files Wed, 08 Aug, 05:57
Michael McCandless (JIRA) [jira] [Resolved] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc Thu, 09 Aug, 17:42
Priya Kujur (JIRA) [jira] [Created] (TIKA-972) Unexpected RuntimeException from org.apache.tika.parser.pdf.PDFParser . Thu, 09 Aug, 19:42
Michael Graessle (JIRA) [jira] [Created] (TIKA-973) PDF form data isn't included in extracted content. Thu, 09 Aug, 19:54
Ken Krugler TIKA-431 and CONTENT_ENCODING Thu, 09 Aug, 20:56
Ken Krugler TIKA-431 and CONTENT_ENCODING (updated) Thu, 09 Aug, 21:24
Ken Krugler (JIRA) [jira] [Commented] (TIKA-889) XHTMLContentHandler wont emit newline when html element matches ENDLINE set Thu, 09 Aug, 21:47
Ken Krugler (JIRA) [jira] [Resolved] (TIKA-869) IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name Thu, 09 Aug, 21:55
Ken Krugler (JIRA) [jira] [Resolved] (TIKA-889) XHTMLContentHandler wont emit newline when html element matches ENDLINE set Thu, 09 Aug, 21:59
Ken Krugler Re: [ANNOUNCE] Welcome Jörg Ehrlich as new Tika PMC member and committer Thu, 09 Aug, 22:04
Ken Krugler (JIRA) [jira] [Assigned] (TIKA-728) Return RDFa meta tags via Metadata Thu, 09 Aug, 22:05
Ken Krugler InputStream reset issue Thu, 09 Aug, 22:11
Ken Krugler (JIRA) [jira] [Commented] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding Thu, 09 Aug, 22:13
Ken Krugler (JIRA) [jira] [Assigned] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding Thu, 09 Aug, 22:13
Ken Krugler (JIRA) [jira] [Commented] (TIKA-820) Locator is unset for HTML parser Thu, 09 Aug, 22:37
Ken Krugler (JIRA) [jira] [Assigned] (TIKA-820) Locator is unset for HTML parser Thu, 09 Aug, 22:39
Jukka Zitting Re: TIKA-431 and CONTENT_ENCODING Fri, 10 Aug, 00:44
122jxgcn How can I let Tika know the resource name? Mon, 13 Aug, 11:31
Eric Pascal (JIRA) [jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document Mon, 13 Aug, 14:00
Ken Krugler (JIRA) [jira] [Commented] (TIKA-868) TXT parser does not honour the specified encoding Mon, 13 Aug, 16:54
Ken Krugler (JIRA) [jira] [Closed] (TIKA-868) TXT parser does not honour the specified encoding Mon, 13 Aug, 16:56
Ken Krugler (JIRA) [jira] [Commented] (TIKA-771) "Hello, World!" in UTF-8/ASCII gets detected as IBM500 Mon, 13 Aug, 16:58
Ken Krugler (JIRA) [jira] [Assigned] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true) Mon, 13 Aug, 17:16
Ken Krugler (JIRA) [jira] [Commented] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true) Mon, 13 Aug, 17:16
Markus Jelsma (JIRA) [jira] [Commented] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true) Mon, 13 Aug, 17:28
Ken Krugler (JIRA) [jira] [Created] (TIKA-974) No longer return charset info in Metadata's CONTENT_ENCODING Mon, 13 Aug, 17:36
Ken Krugler Re: TIKA-431 and CONTENT_ENCODING Mon, 13 Aug, 17:36
Ken Krugler (JIRA) [jira] [Resolved] (TIKA-771) "Hello, World!" in UTF-8/ASCII gets detected as IBM500 Mon, 13 Aug, 17:55
Ken Krugler (JIRA) [jira] [Commented] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true) Mon, 13 Aug, 17:57
Message list1 · 2 · Next »Thread · Author · Date
Box list
Oct 2014481
Sep 2014364
Aug 2014393
Jul 2014328
Jun 2014671
May 2014298
Apr 2014161
Mar 2014226
Feb 2014293
Jan 2014150
Dec 2013155
Nov 201384
Oct 2013100
Sep 201386
Aug 2013103
Jul 2013146
Jun 2013138
May 2013126
Apr 201374
Mar 201370
Feb 2013174
Jan 2013205
Dec 2012109
Nov 2012124
Oct 2012118
Sep 201261
Aug 2012173
Jul 2012274
Jun 2012102
May 2012174
Apr 2012180
Mar 2012200
Feb 2012125
Jan 2012189
Dec 2011287
Nov 2011259
Oct 2011336
Sep 2011356
Aug 2011197
Jul 2011120
Jun 2011122
May 2011184
Apr 2011137
Mar 2011161
Feb 2011111
Jan 201185
Dec 201099
Nov 2010252
Oct 2010144
Sep 2010168
Aug 2010253
Jul 2010192
Jun 2010154
May 2010132
Apr 2010115
Mar 201090
Feb 201062
Jan 2010134
Dec 2009125
Nov 2009179
Oct 200989
Sep 2009115
Aug 200946
Jul 200977
Jun 200994
May 200981
Apr 200936
Mar 200996
Feb 200974
Jan 200993
Dec 2008112
Nov 2008147
Oct 200854
Sep 2008108
Aug 200826
Jul 200817
Jun 200820
May 200816
Apr 200844
Mar 200873
Feb 200836
Jan 200888
Dec 200785
Nov 2007100
Oct 2007424
Sep 2007265
Aug 200719
Jul 200730
Jun 200751
May 200721
Apr 200712
Mar 200712