tika-dev mailing list archives: January 2009

Site index · List index
Message listThread · Author · Date
Dave Meikle Re: [TIKA-147] Flash Files Mon, 05 Jan, 19:17
Marek Sikl Metadata Tue, 06 Jan, 08:56
Marek Sikl Metadata Tue, 06 Jan, 09:12
Michael Wechner Re: Metadata Tue, 06 Jan, 09:26
iapilgrim AutodetectParser fail with text file Tue, 06 Jan, 09:42
Karl Heinz Marbaise Re: AutodetectParser fail with text file Tue, 06 Jan, 10:05
Jukka Zitting Re: AutodetectParser fail with text file Tue, 06 Jan, 10:32
iapilgrim Re: AutodetectParser fail with text file Tue, 06 Jan, 10:58
Jukka Zitting Re: AutodetectParser fail with text file Tue, 06 Jan, 11:06
iapilgrim Re: AutodetectParser fail with text file Tue, 06 Jan, 11:12
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-182) Allow clients to listen to the raw SAX events if available Wed, 07 Jan, 01:47
Neil Benn OOXML Wed, 07 Jan, 14:25
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-180) XHTMLContentHandler unable to extract text from MSWord file Wed, 07 Jan, 16:28
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-154) Better detection of plain text versus binary formats with a text header Thu, 08 Jan, 15:20
Andrzej Rusin (JIRA) [jira] Created: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Thu, 08 Jan, 15:30
Andrzej Rusin (JIRA) [jira] Updated: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Thu, 08 Jan, 15:42
Jukka Zitting (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Fri, 09 Jan, 00:20
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Fri, 09 Jan, 00:40
Uwe Schindler (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Fri, 09 Jan, 09:04
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Fri, 09 Jan, 09:24
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be indexed Fri, 09 Jan, 09:28
Andrzej Rusin (JIRA) [jira] Updated: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 10:20
Peter Becker (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 10:52
Andrzej Rusin (JIRA) [jira] Updated: (TIKA-186) Refactor the MS Office property names to MSOffice.java Fri, 09 Jan, 11:18
Andrzej Rusin (JIRA) [jira] Created: (TIKA-186) Refactor the MS Office property names to MSOffice.java Fri, 09 Jan, 11:18
Andrzej Rusin (JIRA) [jira] Updated: (TIKA-187) Extract the summary.getCategory() from MSOffice documents Fri, 09 Jan, 11:26
Andrzej Rusin (JIRA) [jira] Created: (TIKA-187) Extract the summary.getCategory() from MSOffice documents Fri, 09 Jan, 11:26
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-186) Refactor the MS Office property names to MSOffice.java Fri, 09 Jan, 11:28
Uwe Schindler (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 11:34
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 11:40
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 12:02
Jukka Zitting Content type sniffing Fri, 09 Jan, 13:03
Jukka Zitting (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 13:38
Dave Meikle (JIRA) [jira] Commented: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Fri, 09 Jan, 18:01
Dave Meikle Re: Content type sniffing Fri, 09 Jan, 18:06
Jukka Zitting Re: Metadata Fri, 09 Jan, 22:40
Babak Farhang (JIRA) [jira] Commented: (TIKA-153) Allow passing of files or memory buffers to parsers Tue, 13 Jan, 23:06
Jukka Zitting (JIRA) [jira] Created: (TIKA-188) Automatic whitespace for block elements in XHTMLContentHandler Thu, 15 Jan, 22:31
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-188) Automatic whitespace for block elements in XHTMLContentHandler Thu, 15 Jan, 22:45
Uwe Schindler (JIRA) [jira] Commented: (TIKA-188) Automatic whitespace for block elements in XHTMLContentHandler Thu, 15 Jan, 22:51
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-185) XML files with (unsatisfied) SYSTEM entities can not be extracted Thu, 15 Jan, 23:56
Jukka Zitting Re: Dropping or repurposing the CHANGES file Fri, 16 Jan, 00:53
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-154) Better detection of plain text versus binary formats with a text header Sat, 17 Jan, 01:17
Jukka Zitting (JIRA) [jira] Commented: (TIKA-154) Better detection of plain text versus binary formats with a text header Sat, 17 Jan, 01:21
Georger Rommel Ferreira de Araújo (JIRA) [jira] Created: (TIKA-189) Text extraction from Excel files juxtaposes cells Sat, 17 Jan, 18:52
Georger Rommel Ferreira de Araújo (JIRA) [jira] Updated: (TIKA-189) Text extraction from Excel files juxtaposes cells Sat, 17 Jan, 18:54
Jukka Zitting Extensible content type detection Sat, 17 Jan, 22:57
Sami Siren Re: Extensible content type detection Mon, 19 Jan, 06:25
Jukka Zitting Re: Extensible content type detection Mon, 19 Jan, 10:29
Niall Pemberton Re: Extensible content type detection Mon, 19 Jan, 20:45
Jukka Zitting Re: Extensible content type detection Mon, 19 Jan, 20:57
Niall Pemberton Re: Extensible content type detection Mon, 19 Jan, 21:24
Jukka Zitting Re: Extensible content type detection Mon, 19 Jan, 22:17
Sami Siren Re: Extensible content type detection Tue, 20 Jan, 10:07
kumar raja jana (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 14:21
Georger Rommel Ferreira de Araújo (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 16:25
Uwe Schindler (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 17:13
Uwe Schindler (JIRA) [jira] Issue Comment Edited: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 17:15
Georger Rommel Ferreira de Araújo (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 17:28
Uwe Schindler (JIRA) [jira] Updated: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 20:53
Georger Araújo (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Thu, 22 Jan, 23:06
Uwe Schindler (JIRA) [jira] Created: (TIKA-190) wrong handling of ignorableWhitespace/characters in SafeContentHandler and WriteoutContentHandler Fri, 23 Jan, 08:19
Uwe Schindler (JIRA) [jira] Updated: (TIKA-190) wrong handling of ignorableWhitespace/characters in SafeContentHandler and WriteoutContentHandler Fri, 23 Jan, 08:20
Uwe Schindler (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Fri, 23 Jan, 08:26
Georger Araújo (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Fri, 23 Jan, 11:57
Jukka Zitting (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Sat, 24 Jan, 08:33
Uwe Schindler (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Sat, 24 Jan, 09:49
Uwe Schindler (JIRA) [jira] Updated: (TIKA-189) Text extraction from Excel files juxtaposes cells Sat, 24 Jan, 09:53
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-190) wrong handling of ignorableWhitespace/characters in SafeContentHandler and WriteoutContentHandler Sun, 25 Jan, 20:32
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-189) Text extraction from Excel files juxtaposes cells Sun, 25 Jan, 21:16
Uwe Schindler (JIRA) [jira] Commented: (TIKA-189) Text extraction from Excel files juxtaposes cells Sun, 25 Jan, 22:02
Karl Heinz Marbaise (JIRA) [jira] Created: (TIKA-191) Using of maven-changes-plugin instead of hand made changes.txt Sun, 25 Jan, 23:08
Jonathan Koren failing to detecting mime types from custom mimetype.xml Mon, 26 Jan, 05:49
Jukka Zitting (JIRA) [jira] Created: (TIKA-192) Add GIF type information Mon, 26 Jan, 19:53
Jukka Zitting Re: failing to detecting mime types from custom mimetype.xml Mon, 26 Jan, 20:05
Jukka Zitting (JIRA) [jira] Updated: (TIKA-192) Add glob and magic patterns for image types Mon, 26 Jan, 20:09
Jonathan Koren Re: failing to detecting mime types from custom mimetype.xml Mon, 26 Jan, 22:03
Jukka Zitting Re: failing to detecting mime types from custom mimetype.xml Mon, 26 Jan, 22:15
Jonathan Koren Re: failing to detecting mime types from custom mimetype.xml Mon, 26 Jan, 23:39
Jonathan Koren (JIRA) [jira] Updated: (TIKA-192) Add glob and magic patterns for image types Tue, 27 Jan, 06:28
Jonathan Koren Re: failing to detecting mime types from custom mimetype.xml Tue, 27 Jan, 06:30
Jonathan Koren (JIRA) [jira] Updated: (TIKA-193) PDFParser adds mime-type twice Tue, 27 Jan, 06:54
Jonathan Koren (JIRA) [jira] Created: (TIKA-193) PDFParser adds mime-type twice Tue, 27 Jan, 06:54
Sami Siren (JIRA) [jira] Commented: (TIKA-193) PDFParser adds mime-type twice Tue, 27 Jan, 07:42
Jukka Zitting Re: failing to detecting mime types from custom mimetype.xml Tue, 27 Jan, 11:15
Andrzej Rusin (JIRA) [jira] Commented: (TIKA-86) Support magic(5) files Tue, 27 Jan, 11:56
Jukka Zitting Re: Extensible content type detection Tue, 27 Jan, 15:00
Chris A. Mattmann (JIRA) [jira] Created: (TIKA-194) Support java regular expressions in glob pattern spec for mime repo Tue, 27 Jan, 15:31
Jana, Kumar Raja FW: Customizing Tika to parse MSProject Files Wed, 28 Jan, 11:33
Jukka Zitting (JIRA) [jira] Updated: (TIKA-192) Add glob and magic patterns for image types Thu, 29 Jan, 08:43
Jukka Zitting (JIRA) [jira] Commented: (TIKA-192) Add glob and magic patterns for image types Thu, 29 Jan, 08:43
Jukka Zitting MIME registry use cases Thu, 29 Jan, 19:16
Dmitry Kudryavtsev TikaConfig and java 1.4 Sat, 31 Jan, 11:23
Message listThread · Author · Date
Box list
Mar 2015278
Feb 2015445
Jan 2015601
Dec 2014253
Nov 2014389
Oct 2014481
Sep 2014364
Aug 2014393
Jul 2014328
Jun 2014671
May 2014298
Apr 2014161
Mar 2014226
Feb 2014293
Jan 2014150
Dec 2013155
Nov 201384
Oct 2013100
Sep 201386
Aug 2013103
Jul 2013146
Jun 2013138
May 2013126
Apr 201374
Mar 201370
Feb 2013174
Jan 2013205
Dec 2012109
Nov 2012124
Oct 2012118
Sep 201261
Aug 2012173
Jul 2012274
Jun 2012102
May 2012174
Apr 2012180
Mar 2012200
Feb 2012125
Jan 2012189
Dec 2011287
Nov 2011259
Oct 2011336
Sep 2011356
Aug 2011197
Jul 2011120
Jun 2011122
May 2011184
Apr 2011137
Mar 2011161
Feb 2011111
Jan 201185
Dec 201099
Nov 2010252
Oct 2010144
Sep 2010168
Aug 2010253
Jul 2010192
Jun 2010154
May 2010132
Apr 2010115
Mar 201090
Feb 201062
Jan 2010134
Dec 2009125
Nov 2009179
Oct 200989
Sep 2009115
Aug 200946
Jul 200977
Jun 200994
May 200981
Apr 200936
Mar 200996
Feb 200974
Jan 200993
Dec 2008112
Nov 2008147
Oct 200854
Sep 2008108
Aug 200826
Jul 200817
Jun 200820
May 200816
Apr 200844
Mar 200873
Feb 200836
Jan 200888
Dec 200785
Nov 2007100
Oct 2007424
Sep 2007265
Aug 200719
Jul 200730
Jun 200751
May 200721
Apr 200712
Mar 200712