I agree with Jorn, I think that's the faster way.
Tommaso
2011/2/14 Jörn Kottmann <kottmann@gmail.com>
> On 2/14/11 4:49 AM, Radhouane Aniba wrote:
>
>> Hello everyone,
>>
>> Quite unusual request to this list, I am wondering if there is any
>> analysis
>> engine that allow to mine MBOX like formats such as the famous mailman
>> mailing list archives in a way that it allow to structure these kind of
>> data
>> into messages-replies ?
>>
>> If anyone have already treated this topic I will be very interested in
>> discussing it further.
>>
>
> We have a tika integration, and tika has support for mbox.
> Maybe that is good enough to do the extraction.
>
> Jörn
>
|