tika-dev mailing list archives: November 2010

Jukka Zitting (JIRA) [jira] Commented: (TIKA-531) xmpTPg:NPages creates invalid XML Mon, 01 Nov, 00:17
Apache Hudson Server Hudson build is still unstable: Tika-trunk » Apache Tika parsers #395 Mon, 01 Nov, 01:03
Apache Hudson Server Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 01:03
Jukka Zitting Re: Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 01:16
Mattmann, Chris A (388J) Re: Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 02:38
Mattmann, Chris A (388J) Re: Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 03:38
Mattmann, Chris A (388J) Re: Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 04:08
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-490) Support for adding language profiles dynamically Mon, 01 Nov, 05:21
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-524) Unification of HTML output from Office, OOXML and Open Document parsers Mon, 01 Nov, 05:23
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-508) HtmlParser link processing should skip usemap and codebase attributes Mon, 01 Nov, 05:25
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-497) HtmlHandler should fix up incorrect capitalization of names in <meta http-equiv="xxx"> attributes before putting into metadata Mon, 01 Nov, 05:25
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-538) Add method get file extension from MimeTypes Mon, 01 Nov, 05:29
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-526) OOXMLParser fails to extract text from within smart tags Mon, 01 Nov, 05:29
Apache Hudson Server Hudson build is still unstable: Tika-trunk » Apache Tika parsers #396 Mon, 01 Nov, 05:30
Apache Hudson Server Hudson build is still unstable: Tika-trunk #396 Mon, 01 Nov, 05:30
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-390) Missing Header/Footer text for ODT documents Mon, 01 Nov, 05:31
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-525) Mismatched start and end elements in HtmlParser Mon, 01 Nov, 05:31
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-533) Mis-detection of zip files as application/vnd.apple.iwork Mon, 01 Nov, 05:31
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-539) Encoding detection is too biased by encoding in meta tag Mon, 01 Nov, 06:13
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-503) Add a ContentHandler for collecting links from parser output Mon, 01 Nov, 06:15
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-531) xmpTPg:NPages creates invalid XML Mon, 01 Nov, 06:17
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-503) Add a ContentHandler for collecting links from parser output Mon, 01 Nov, 06:19
Mattmann, Chris A (388J) 0.8 release: latest status Mon, 01 Nov, 06:22
Apache Hudson Server Hudson build is still unstable: Tika-trunk » Apache Tika parsers #397 Mon, 01 Nov, 07:01
Apache Hudson Server Hudson build is still unstable: Tika-trunk #397 Mon, 01 Nov, 07:01
Attila Király (JIRA) [jira] Commented: (TIKA-373) Upgrade to POI 3.7 Mon, 01 Nov, 09:13
Paul Jakubik Re: Hudson build is still unstable: Tika-trunk #395 Mon, 01 Nov, 13:10
Jukka Zitting Java 6 (Was: Hudson build is still unstable: Tika-trunk #395) Mon, 01 Nov, 13:39
Apache Hudson Server Hudson build is back to stable : Tika-trunk » Apache Tika parsers #398 Mon, 01 Nov, 14:49
Apache Hudson Server Hudson build is back to stable : Tika-trunk #398 Mon, 01 Nov, 14:49
Apache Hudson Server Hudson build is back to normal : Tika-trunk » Apache Tika application #400 Mon, 01 Nov, 21:07
Apache Hudson Server Hudson build is back to normal : Tika-trunk #400 Mon, 01 Nov, 21:07
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-373) Upgrade to POI 3.7 Mon, 01 Nov, 22:52
Hasan Diwan (JIRA) [jira] Created: (TIKA-541) Use commons-cli in lieu of writing our own option parser Tue, 02 Nov, 06:58
Hasan Diwan (JIRA) [jira] Updated: (TIKA-541) Use commons-cli in lieu of writing our own option parser Tue, 02 Nov, 07:00
Dominique Béjean (JIRA) [jira] Commented: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document Tue, 02 Nov, 12:00
Dominique Béjean (JIRA) [jira] Issue Comment Edited: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document Tue, 02 Nov, 12:02
Dominique Béjean (JIRA) [jira] Issue Comment Edited: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document Tue, 02 Nov, 12:04
Ken Krugler (JIRA) [jira] Closed: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document Tue, 02 Nov, 13:12
Jan Høydahl (JIRA) [jira] Updated: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers Wed, 03 Nov, 01:06
Jan Høydahl (JIRA) [jira] Commented: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers Wed, 03 Nov, 01:18
Jan Høydahl (JIRA) [jira] Updated: (TIKA-527) Allow override mapping mime<-->parsers through config Wed, 03 Nov, 01:34
Jan Høydahl / Cominvent Re: 0.8 release: latest status Wed, 03 Nov, 01:44
Mattmann, Chris A (388J) Re: 0.8 release: latest status Wed, 03 Nov, 01:50
Benson Margulies Build problem with trunk? Thu, 04 Nov, 12:21
Benson Margulies (JIRA) [jira] Created: (TIKA-542) Publish Javadoc on tika.apache.org Thu, 04 Nov, 12:33
Benson Margulies Boilerpipe is nice, but what about readability? Thu, 04 Nov, 13:02
Ken Krugler (JIRA) [jira] Created: (TIKA-543) Remove rome 1.0 dependency on java.net repository Thu, 04 Nov, 13:47
Ken Krugler (JIRA) [jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository Thu, 04 Nov, 13:59
Ken Krugler (JIRA) [jira] Commented: (TIKA-466) Feed Parser Thu, 04 Nov, 13:59
Ken Krugler Re: Build problem with trunk? Thu, 04 Nov, 14:01
Benson Margulies Re: Build problem with trunk? Thu, 04 Nov, 14:02
Benson Margulies Charset SPI Thu, 04 Nov, 14:08
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-542) Publish Javadoc on tika.apache.org Thu, 04 Nov, 14:41
Sjoerd Smeets (JIRA) [jira] Commented: (TIKA-531) xmpTPg:NPages creates invalid XML Thu, 04 Nov, 15:32
Sjoerd Smeets (JIRA) [jira] Closed: (TIKA-531) xmpTPg:NPages creates invalid XML Thu, 04 Nov, 15:32
Ken Krugler (JIRA) [jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Thu, 04 Nov, 16:09
Ken Krugler (JIRA) [jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Thu, 04 Nov, 16:09
Maxim Valyanskiy (JIRA) [jira] Commented: (TIKA-540) extract text from .docx footnotes Fri, 05 Nov, 13:03
Maxim Valyanskiy (JIRA) [jira] Resolved: (TIKA-540) extract text from .docx footnotes Fri, 05 Nov, 13:05
Ken Krugler (JIRA) [jira] Updated: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Fri, 05 Nov, 13:34
Ken Krugler (JIRA) [jira] Resolved: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom Fri, 05 Nov, 13:54
Ken Krugler (JIRA) [jira] Created: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata Fri, 05 Nov, 20:14
Ken Krugler (JIRA) [jira] Closed: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata Fri, 05 Nov, 21:30
Ken Krugler (JIRA) [jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag Fri, 05 Nov, 21:36
Ken Krugler (JIRA) [jira] Issue Comment Edited: (TIKA-539) Encoding detection is too biased by encoding in meta tag Fri, 05 Nov, 21:36
Ken Krugler (JIRA) [jira] Issue Comment Edited: (TIKA-539) Encoding detection is too biased by encoding in meta tag Fri, 05 Nov, 21:40
Benson Margulies (JIRA) [jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag Sat, 06 Nov, 14:27
Ken Krugler (JIRA) [jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag Sat, 06 Nov, 18:58
Ken Krugler Re: Charset SPI Sat, 06 Nov, 19:19
Benson Margulies Re: Charset SPI Sat, 06 Nov, 19:30
Mattmann, Chris A (388J) My ApacheConNA 2010 slides Sat, 06 Nov, 19:52
Jukka Zitting (JIRA) [jira] Updated: (TIKA-543) Remove rome 1.0 dependency on java.net repository Sat, 06 Nov, 23:48
Jukka Zitting (JIRA) [jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository Sat, 06 Nov, 23:48
Jukka Zitting (JIRA) [jira] Resolved: (TIKA-543) Remove rome 1.0 dependency on java.net repository Sat, 06 Nov, 23:48
Chris A. Mattmann (JIRA) [jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository Sat, 06 Nov, 23:52
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers Sat, 06 Nov, 23:54
Chris A. Mattmann (JIRA) [jira] Assigned: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers Sat, 06 Nov, 23:54
Mattmann, Chris A (388J) Re: 0.8 release: latest status Sun, 07 Nov, 00:00
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers Sun, 07 Nov, 00:00
Mattmann, Chris A (388J) Re: 0.8 release: latest status Sun, 07 Nov, 00:01
Jan Høydahl (JIRA) [jira] Updated: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef Sun, 07 Nov, 13:29
Chris A. Mattmann (JIRA) [jira] Assigned: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef Sun, 07 Nov, 17:20
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef Sun, 07 Nov, 17:32
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-487) ContainerAwareDetector doesn't support truncated Open XML files Mon, 08 Nov, 00:38
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-518) Attribute values are not indexed Mon, 08 Nov, 00:40
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-521) OutOfMemoryError Parsing XSLX File Mon, 08 Nov, 00:40
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-530) InvalidFormatException on a PackagePart in OOXML Mon, 08 Nov, 00:40
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-539) Encoding detection is too biased by encoding in meta tag Mon, 08 Nov, 00:42
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-471) Avoid Charset name bottleneck when multiple threads are using HtmlParser Mon, 08 Nov, 00:42
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-497) HtmlHandler should fix up incorrect capitalization of names in <meta http-equiv="xxx"> attributes before putting into metadata Mon, 08 Nov, 00:42
Mattmann, Chris A (388J) [ANNOUNCE] Welcome Maxim Valyanskiy as Tika PMC/Committer Mon, 08 Nov, 07:20
samraj (JIRA) [jira] Created: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Mon, 08 Nov, 07:47
Jan Høydahl (JIRA) [jira] Created: (TIKA-546) Add ability to create language profiles to tika-app Mon, 08 Nov, 08:39
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Mon, 08 Nov, 13:57
Chris A. Mattmann (JIRA) [jira] Resolved: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Mon, 08 Nov, 13:57
Mattmann, Chris A (388J) Re: 0.8 release: latest status Tue, 09 Nov, 01:07
samraj (JIRA) [jira] Reopened: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Tue, 09 Nov, 03:38
samraj (JIRA) [jira] Updated: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Tue, 09 Nov, 03:40
Chris A. Mattmann (JIRA) [jira] Updated: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. Tue, 09 Nov, 03:44