Not sure where you want to go with this, but one
approach might be to preprocess your data into some
better structured format, and only start your UIMA
analysis after you've done that.
I have used a project called mstor on sourceforge to
process mbox files, and then you can use javamail
to get at the thread IDs and whatever else you need.
--Thilo
On 2/14/2011 04:49, Radhouane Aniba wrote:
> Hello everyone,
>
> Quite unusual request to this list, I am wondering if there is any analysis
> engine that allow to mine MBOX like formats such as the famous mailman
> mailing list archives in a way that it allow to structure these kind of data
> into messages-replies ?
>
> If anyone have already treated this topic I will be very interested in
> discussing it further.
>
> Regards
>
> Radhouane
>
> --
>
|