camel-dev mailing list archives

From Chris Mattmann
Subject Re: Apache Tika Component
Date Mon, 23 Jan 2017 04:49:09 GMT
Great job, Bob! ☺

On 1/22/17, 8:17 PM, "Bob Paulin" wrote:

    I'd like to propose an Apache Tika[1] connector for Apache Camel.  I see
    Camel uses a number of Tika components like PDFBox, but it could be
    interesting to have a full assortment of file parsers to convert files
    to text.
    The basic configuration would allow MIME type detection and parsing
    files to text.
    File/InputStream -> camel-tika -> MIME Type
    File/InputStream -> camel-tika -> OutputStream in text
    I have a basic implementation that I'd be happy to send in a PR but I
    wanted to see if this was something the community was interested in.  I
    think it might be interesting to combine a project that integrates
    everything with the project that parses everything.  I also think having
    a camel-tika component might help achieve some of Tika's 2.0 goals.
    - Bob Paulin
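
For readers skimming the archive, the two flows in the proposal (stream in, MIME type out; stream in, text out) can be sketched roughly as below. This is only an illustration of the shape of the two operations, not Bob's implementation and not the camel-tika API. It deliberately uses JDK-only stand-ins: URLConnection.guessContentTypeFromStream for detection and a plain UTF-8 read for text extraction. The class name FlowSketch and both method names are hypothetical. In an actual component, Tika's own facade (Tika.detect and Tika.parseToString) would presumably do this work and cover far more formats.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch of the two proposed flows, using only the JDK.
final class FlowSketch {

    // Flow 1: InputStream -> MIME type.
    // JDK stand-in for real detection; it only knows a handful of magic numbers.
    static String detectMimeType(InputStream in) throws IOException {
        // guessContentTypeFromStream needs mark/reset so it can peek at the header.
        InputStream peekable = in.markSupported() ? in : new BufferedInputStream(in);
        String guess = URLConnection.guessContentTypeFromStream(peekable);
        // Fall back to the generic binary type when the header is not recognised.
        return guess != null ? guess : "application/octet-stream";
    }

    // Flow 2: InputStream -> text.
    // Assumption: the input is already UTF-8 text; a real parser would
    // extract text from PDFs, office documents, and so on.
    static String toText(InputStream in) throws IOException {
        return new String(in.readAllBytes(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        byte[] gifHeader = "GIF89a".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detectMimeType(new ByteArrayInputStream(gifHeader)));

        byte[] plain = "hello from camel".getBytes(StandardCharsets.UTF_8);
        System.out.println(toText(new ByteArrayInputStream(plain)));
    }
}
```

Keeping detection and extraction as separate operations mirrors the two arrows in the proposal, so a route could ask for just the MIME type without paying for a full parse.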
