ctakes-dev mailing list archives

From "Finan, Sean" <Sean.Fi...@childrens.harvard.edu>
Subject RE: how to run i2b2 data
Date Thu, 06 Aug 2015 00:24:01 GMT
Hi Justin,

A shot in the dark:
You could create a collection reader that works similarly to org.apache.ctakes.core.cr.FilesInDirectoryCollectionReader,
but instead of grabbing all of the files in a directory, it grabs all of the records parsed
from a single .xml file and runs the pipeline once per record.  Basically, swap the directory for an .xml file
and each text file for an XML element containing a record.
Someone out there might already have something that does this.
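
To make the idea a bit more concrete, here is a rough sketch of just the record-splitting half, using only the JDK's DOM parser. It is not an actual cTAKES or UIMA collection reader. The class name RecordSplitter, the PatientRecord holder, and the element/attribute names RECORD, ID and TEXT are all assumptions for illustration, so adjust them to whatever your i2b2 file really uses.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: splits one multi-record XML document into per-patient records.
class RecordSplitter {

    // Simple holder for one record: the patient id and the note text.
    static final class PatientRecord {
        final String id;
        final String text;

        PatientRecord(String id, String text) {
            this.id = id;
            this.text = text;
        }
    }

    // Assumes records look like <RECORD ID="..."><TEXT>...</TEXT></RECORD>;
    // these names are guesses, not taken from the original message.
    static List<PatientRecord> parse(String xml) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(xml)));
            NodeList nodes = doc.getElementsByTagName("RECORD");
            List<PatientRecord> records = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element record = (Element) nodes.item(i);
                NodeList texts = record.getElementsByTagName("TEXT");
                // Use the first TEXT child if present, otherwise an empty note.
                String text = texts.getLength() > 0
                        ? texts.item(0).getTextContent().trim()
                        : "";
                records.add(new PatientRecord(record.getAttribute("ID"), text));
            }
            return records;
        } catch (Exception e) {
            // Wrap checked parser exceptions so callers get one unchecked failure.
            throw new IllegalArgumentException("Could not parse records", e);
        }
    }
}
```

In a real collection reader, the list returned by parse would presumably be built once at initialization, and each call for the next document would hand one record's text to the pipeline while keeping its id around to label the output.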


-----Original Message-----
From: Justin Zhang [mailto:justinzhang.xl@gmail.com] 
Sent: Wednesday, August 05, 2015 6:40 PM
To: user@ctakes.apache.org; dev@ctakes.apache.org
Subject: how to run i2b2 data

Hello everyone,

I am running cTAKES on i2b2 data.

Each XML file contains multiple patient records. I am able to split each patient into
a separate file and process those files with "runCPE.sh".

Is there a way to convert this single XML file into a format that cTAKES
accepts, process it as a single input file, and generate a single output file (with results labelled
by patient id)? For example, each patient id has a "smoking status".

