tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ken Krugler <kkrugler_li...@transpac.com>
Subject Re: TIKA-431 and CONTENT_ENCODING
Date Mon, 13 Aug 2012 17:36:55 GMT

On Aug 9, 2012, at 5:44pm, Jukka Zitting wrote:

> Hi,
> 
> On Thu, Aug 9, 2012 at 10:56 PM, Ken Krugler
> <kkrugler_lists@transpac.com> wrote:
>> You made a note in Changes.txt that this was deprecated, so I'm assuming that you
>> think we should hold off on fixing the abuse of CONTENT_ENCODING until after the
>> 1.2 release, right?
> 
> Right, there might still be clients out there that expect this
> information to be present as CONTENT_ENCODING.
> 
> In fact, unless the abuse of that field is actively harmful (i.e.
> clients need to add extra workarounds to clean up the metadata), I'd
> keep the field in place all the way until Tika 2.0.

Agreed - filed https://issues.apache.org/jira/browse/TIKA-974 to track this.

-- Ken

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr





Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message