uima-user mailing list archives

From Richard Eckart de Castilho <...@apache.org>
Subject Re: Serialization NonXML
Date Tue, 05 Jul 2016 17:58:33 GMT
On 05.07.2016, at 12:02, Henrik Matzen <hurich07@googlemail.com> wrote:
> 
> because of the known problem that you cannot serialize the CAS if it
> contains non-XML characters, I tried this:
> 
> I know it's not working because of this line (cas =
> doReplaceNonXml(cas.toString()).toCas;):
> there is no .toCas method.
> 
> Does anyone of you know how I can solve this?

Are you required to use XMI? If not, consider serializing
your CASes in a binary format. [1]

DKPro Core has reader/writer components that support all different kinds of
UIMA binary serialization including some custom variants [2].
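
If XMI does turn out to be required, the invalid characters need to be
replaced in the text itself, before it is set as the document text of the
CAS, rather than on the output of cas.toString(). Below is a minimal sketch
of such a filter using only the Java standard library. The class name
XmlCharFilter and the method replaceNonXml are hypothetical names chosen for
illustration, not part of UIMA or DKPro Core. The check follows the Char
production of XML 1.0. Each invalid character is replaced by exactly one
replacement character, so the text keeps its length and annotation offsets
stay valid.

```java
public class XmlCharFilter {

    // True if the code point is allowed by the XML 1.0 "Char" production.
    static boolean isValidXml10(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Hypothetical helper: returns a copy of the text in which every
    // character that is not valid in XML 1.0 is replaced by 'replacement'.
    static String replaceNonXml(String text, char replacement) {
        StringBuilder sb = new StringBuilder(text.length());
        int i = 0;
        while (i < text.length()) {
            int cp = text.codePointAt(i);
            if (isValidXml10(cp)) {
                // Valid code point: copy it (one or two UTF-16 units).
                sb.appendCodePoint(cp);
            } else {
                // Invalid code points (control characters, unpaired
                // surrogates, U+FFFE, U+FFFF) always occupy a single
                // UTF-16 unit, so one replacement char preserves length.
                sb.append(replacement);
            }
            i += Character.charCount(cp);
        }
        return sb.toString();
    }
}
```

The filtered string could then be passed to setDocumentText() when the CAS is
first populated. With a binary format this step is not needed.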

Cheers,

-- Richard

[1] https://uima.apache.org/d/uimaj-current/references.html#ugr.ref.compress 
[2] https://dkpro.github.io/dkpro-core/releases/1.8.0/docs/format-reference.html#format-BinaryCas


