tika-dev mailing list archives: August 2011

Heitor Peralles sub Mon, 01 Aug, 19:21
Maxim Valyanskiy (JIRA) [jira] [Commented] (TIKA-593) Tika network server Tue, 02 Aug, 11:49
Re: svn commit: r1153097 - in /tika/trunk/tika-server: ./ src/main/java/org/apache/tika/server/ src/main/resources/ src/test/java/org/apache/tika/server/
Mattmann, Chris A (388J)   Re: svn commit: r1153097 - in /tika/trunk/tika-server: ./ src/main/java/org/apache/tika/server/ src/main/resources/ src/test/java/org/apache/tika/server/ Tue, 02 Aug, 12:00
Maxim Valyanskiy     Re: svn commit: r1153097 - in /tika/trunk/tika-server: ./ src/main/java/org/apache/tika/server/ src/main/resources/ src/test/java/org/apache/tika/server/ Tue, 02 Aug, 13:34
Mattmann, Chris A (388J)       Re: svn commit: r1153097 - in /tika/trunk/tika-server: ./ src/main/java/org/apache/tika/server/ src/main/resources/ src/test/java/org/apache/tika/server/ Tue, 02 Aug, 15:32
Alexander Sherbakov [Tika Parser 0.9] Errors in parsing of mp3 files Tue, 02 Aug, 12:45
Nick Burch   Re: [Tika Parser 0.9] Errors in parsing of mp3 files Tue, 02 Aug, 13:05
[jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3
Jan Høydahl (JIRA)   [jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Tue, 02 Aug, 14:53
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Wed, 03 Aug, 11:56
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Wed, 17 Aug, 00:33
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-638) Language recognition - Failed trying to load language profile for language lt . Error: java.lang.IllegalArgumentException: Unable to add an ngram of incorrect length: 5 != 3 Wed, 17 Aug, 00:49
Mattmann, Chris A (388J) 1.0 RC in next 2 weeks Tue, 02 Aug, 23:32
Jukka Zitting   Re: 1.0 RC in next 2 weeks Wed, 03 Aug, 11:31
Georger Araújo (JIRA) [jira] [Commented] (TIKA-369) Improve accuracy of language detection Wed, 03 Aug, 04:55
Re: WMA Parser
Jukka Zitting   Re: WMA Parser Wed, 03 Aug, 13:14
Georger Araújo (JIRA) [jira] [Issue Comment Edited] (TIKA-369) Improve accuracy of language detection Wed, 03 Aug, 17:31
Cristian Vat (JIRA) [jira] [Commented] (TIKA-632) Rtf parsing ignores links Sat, 06 Aug, 20:15
Cristian Vat (JIRA) [jira] [Commented] (TIKA-642) Few of RTF files not extracting properly Sat, 06 Aug, 20:51
Cristian Vat (JIRA) [jira] [Commented] (TIKA-666) Unable to extract content from RTF files Sat, 06 Aug, 22:13
[jira] [Updated] (TIKA-683) RTF Parser issues with non european characters
Cristian Vat (JIRA)   [jira] [Updated] (TIKA-683) RTF Parser issues with non european characters Sat, 06 Aug, 23:11
Cristian Vat (JIRA)   [jira] [Updated] (TIKA-683) RTF Parser issues with non european characters Sun, 07 Aug, 11:50
Michael McCandless (JIRA)   [jira] [Updated] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 12:30
Michael McCandless (JIRA)   [jira] [Updated] (TIKA-683) RTF Parser issues with non european characters Mon, 22 Aug, 22:14
[jira] [Commented] (TIKA-683) RTF Parser issues with non european characters
Cristian Vat (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Sat, 06 Aug, 23:27
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Mon, 15 Aug, 10:55
Chris A. Mattmann (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 15:22
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 15:28
Chris A. Mattmann (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 15:54
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 16:00
Cristian Vat (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Thu, 18 Aug, 22:54
Jukka Zitting (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Fri, 19 Aug, 08:38
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Fri, 19 Aug, 13:08
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Mon, 22 Aug, 22:14
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-683) RTF Parser issues with non european characters Sat, 27 Aug, 16:20
[jira] [Commented] (TIKA-636) Taking very high heap space while parsing docx - Resulting in OOM in tha app
Nicholas Dodd (JIRA)   [jira] [Commented] (TIKA-636) Taking very high heap space while parsing docx - Resulting in OOM in tha app Mon, 08 Aug, 13:46
Nick Burch (JIRA)   [jira] [Commented] (TIKA-636) Taking very high heap space while parsing docx - Resulting in OOM in tha app Mon, 08 Aug, 14:06
Jukka Zitting (JIRA)   [jira] [Commented] (TIKA-636) Taking very high heap space while parsing docx - Resulting in OOM in tha app Mon, 08 Aug, 14:58
Chris Lott (JIRA) [jira] [Created] (TIKA-688) Enhance content-type detector to recognize almost plain text Tue, 09 Aug, 18:56
Chris Lott (JIRA) [jira] [Commented] (TIKA-688) Enhance content-type detector to recognize almost plain text Tue, 09 Aug, 19:08
Yegor Kozlov Supporting quotes for Apache POI Fri, 12 Aug, 11:02
Joseph Vychtrle (JIRA) [jira] [Created] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 09:05
[jira] [Commented] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file
Nick Burch (JIRA)   [jira] [Commented] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 09:12
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 09:34
Nick Burch (JIRA)   [jira] [Commented] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 09:47
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 11:43
Joseph Vychtrle (JIRA) [jira] [Closed] (TIKA-689) MimeTypes detector detects text/plain content type of a PPT file Sun, 14 Aug, 11:45
Joseph Vychtrle (JIRA) [jira] [Created] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 13:19
Eddie Verkhoturov (JIRA) [jira] [Created] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Sun, 14 Aug, 14:57
Eddie Verkhoturov (JIRA) [jira] [Updated] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Sun, 14 Aug, 14:59
[jira] [Commented] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument
Nick Burch (JIRA)   [jira] [Commented] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 15:51
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 17:33
Nick Burch (JIRA)   [jira] [Commented] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 18:11
Joseph Vychtrle (JIRA)   [jira] [Commented] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 18:30
[jira] [Commented] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document
Nick Burch (JIRA)   [jira] [Commented] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Sun, 14 Aug, 15:51
Eddie Verkhoturov (JIRA)   [jira] [Commented] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Mon, 15 Aug, 06:46
Nick Burch (JIRA)   [jira] [Commented] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Mon, 15 Aug, 09:09
Eddie Verkhoturov (JIRA)   [jira] [Commented] (TIKA-691) java.lang.ArrayIndexOutOfBoundsException by MS Word CDF V2 Document Wed, 17 Aug, 06:44
Re: [Tika Wiki] Trivial Update of "ReleaseProcess" by MikeMcCandless
Mattmann, Chris A (388J)   Re: [Tika Wiki] Trivial Update of "ReleaseProcess" by MikeMcCandless Sun, 14 Aug, 16:24
Michael McCandless     Re: [Tika Wiki] Trivial Update of "ReleaseProcess" by MikeMcCandless Sun, 14 Aug, 21:06
Joseph Vychtrle (JIRA) [jira] [Closed] (TIKA-690) WordExtractor doesn't extract text from HWPFDocument Sun, 14 Aug, 18:30
Michael McCandless (JIRA) [jira] [Updated] (TIKA-422) Wrong charset conversion in some RTF documents. Mon, 15 Aug, 10:15
Markus Jelsma (JIRA) [jira] [Updated] (TIKA-648) Parsing HTML anchors with embedded div faulty Mon, 15 Aug, 14:26
Jukka Zitting (JIRA) [jira] [Commented] (TIKA-565) Improved OSGi bundling Mon, 15 Aug, 21:25
Steve Aulenbach Failed test: testBMP(org.apache.tika.parser.image.ImageParserTest) Tue, 16 Aug, 21:38
Mattmann, Chris A (388J)   Re: Failed test: testBMP(org.apache.tika.parser.image.ImageParserTest) Wed, 17 Aug, 01:02
Steve Aulenbach     Re: Failed test: testBMP(org.apache.tika.parser.image.ImageParserTest) Fri, 19 Aug, 15:50
Mattmann, Chris A (388J)       Re: Failed test: testBMP(org.apache.tika.parser.image.ImageParserTest) Fri, 19 Aug, 15:55
[jira] [Assigned] (TIKA-683) RTF Parser issues with non european characters
Chris A. Mattmann (JIRA)   [jira] [Assigned] (TIKA-683) RTF Parser issues with non european characters Wed, 17 Aug, 14:58
Michael McCandless (JIRA)   [jira] [Assigned] (TIKA-683) RTF Parser issues with non european characters Tue, 30 Aug, 17:44
[jira] [Commented] (TIKA-676) Boilerpipe fails
Markus Jelsma (JIRA)   [jira] [Commented] (TIKA-676) Boilerpipe fails Wed, 17 Aug, 15:10
Jukka Zitting (JIRA)   [jira] [Commented] (TIKA-676) Boilerpipe fails Sun, 21 Aug, 14:35
Markus Jelsma (JIRA)   [jira] [Commented] (TIKA-676) Boilerpipe fails Tue, 23 Aug, 11:08
Jukka Zitting (JIRA)   [jira] [Commented] (TIKA-676) Boilerpipe fails Tue, 23 Aug, 11:42
Markus Jelsma (JIRA)   [jira] [Commented] (TIKA-676) Boilerpipe fails Tue, 23 Aug, 11:56
[jira] [Commented] (TIKA-422) Wrong charset conversion in some RTF documents.
Chris A. Mattmann (JIRA)   [jira] [Commented] (TIKA-422) Wrong charset conversion in some RTF documents. Wed, 17 Aug, 15:48
Michael McCandless (JIRA)   [jira] [Commented] (TIKA-422) Wrong charset conversion in some RTF documents. Wed, 17 Aug, 16:26
Tom Grant Appending Mime Types Thu, 18 Aug, 22:04
Antoni Mylka   Re: Appending Mime Types Fri, 19 Aug, 08:59
Nick Burch   Re: Appending Mime Types Mon, 22 Aug, 17:00
Tom Grant     Re: Appending Mime Types Mon, 22 Aug, 18:37
Nick Burch       Re: Appending Mime Types Tue, 23 Aug, 11:20
Tom Grant         Re: Appending Mime Types Tue, 23 Aug, 19:58
Tom Grant           Re: Appending Mime Types Wed, 24 Aug, 01:26
Antoni Mylka       Re: Appending Mime Types Tue, 23 Aug, 13:40
nirnaydewan Tika 0.9 integration in Solr 3.3.0 Fri, 19 Aug, 11:44
Tom Gross   Re: Tika 0.9 integration in Solr 3.3.0 Fri, 19 Aug, 12:27
nirnaydewan     Re: Tika 0.9 integration in Solr 3.3.0 Fri, 19 Aug, 13:20
Tom Gross       Re: Tika 0.9 integration in Solr 3.3.0 Fri, 19 Aug, 13:43
nirnaydewan         Re: Tika 0.9 integration in Solr 3.3.0 Fri, 19 Aug, 19:51
nirnaydewan           Re: Tika 0.9 integration in Solr 3.3.0 Mon, 22 Aug, 09:08
Jukka Zitting             Re: Tika 0.9 integration in Solr 3.3.0 Mon, 22 Aug, 09:33
Tom Gross               Re: Tika 0.9 integration in Solr 3.3.0 Thu, 25 Aug, 08:16
nirnaydewan Issue in text extraction in Solr / Tika Fri, 19 Aug, 11:49
Michael McCandless   Re: Issue in text extraction in Solr / Tika Fri, 19 Aug, 15:21
nirnaydewan     Re: Issue in text extraction in Solr / Tika Fri, 19 Aug, 19:32
Michael McCandless       Re: Issue in text extraction in Solr / Tika Fri, 19 Aug, 23:44
nirnaydewan         Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 05:07
Michael McCandless           Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 10:40
Michael McCandless             Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 12:35
Uwe Schindler               Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 12:39
Michael McCandless                 Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 13:25
Uwe Schindler                   RE: Issue in text extraction in Solr / Tika Sat, 20 Aug, 14:16
Uwe Schindler                   RE: Issue in text extraction in Solr / Tika Sat, 20 Aug, 14:19
Michael McCandless                     Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 15:32
Uwe Schindler                       RE: Issue in text extraction in Solr / Tika Sat, 20 Aug, 16:11
Michael McCandless                         Re: Issue in text extraction in Solr / Tika Sat, 20 Aug, 16:25
Michael McCandless (JIRA) [jira] [Commented] (TIKA-392) RTF parser smashes words together in subsequent table cells Fri, 19 Aug, 12:34
Michael McCandless (JIRA) [jira] [Updated] (TIKA-392) RTF parser smashes words together in subsequent table cells Fri, 19 Aug, 12:40