ctakes-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From james-mas...@apache.org
Subject svn commit: r1517713 - in /ctakes/trunk: ctakes-chunker-res/src/main/resources/org/apache/ctakes/chunker/models/ ctakes-chunker-res/src/main/resources/org/apache/ctakes/chunker/models/small/ ctakes-chunker/desc/ ctakes-chunker/src/test/resources/data/ ...
Date Mon, 26 Aug 2013 22:10:20 GMT
Author: james-masanz
Date: Mon Aug 26 22:10:19 2013
New Revision: 1517713

URL: http://svn.apache.org/r1517713
Log:
merge from 3.1.0 branch changes for CTAKES-211

Removed:
    ctakes/trunk/ctakes-chunker-res/src/main/resources/org/apache/ctakes/chunker/models/chunk-model.claims-1.5.zip
    ctakes/trunk/ctakes-chunker-res/src/main/resources/org/apache/ctakes/chunker/models/small/
    ctakes/trunk/ctakes-chunker/desc/ChunkerCPE-using-unit-test-models_and_tag-dictionary.xml
    ctakes/trunk/ctakes-chunker/src/test/resources/data/unit-test-model.bin.gz
    ctakes/trunk/ctakes-chunker/src/test/resources/data/unit-test-tag-dictionary.txt
    ctakes/trunk/ctakes-chunker/src/test/resources/data/unit-test.chunker.model.bin.gz
    ctakes/trunk/ctakes-chunker/src/test/resources/data/unit-test.postagger.model.bin.gz
    ctakes/trunk/ctakes-pos-tagger-res/src/main/resources/org/apache/ctakes/postagger/models/postagger.model.bin.gz
    ctakes/trunk/ctakes-pos-tagger-res/src/main/resources/org/apache/ctakes/postagger/models/small/
    ctakes/trunk/ctakes-pos-tagger/data/pos/training/sample/sample-tagdict.txt
    ctakes/trunk/ctakes-pos-tagger/resources/launch/PosTagDictionaryCreator--Sample.launch
    ctakes/trunk/ctakes-pos-tagger/resources/launch/PosTagDictionaryCreator.launch
    ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/unit-test-model.bin.gz
Modified:
    ctakes/trunk/ctakes-chunker/src/test/resources/data/README
    ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/ci/HyphenTextModifierImpl.java
    ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/knowtator/KnowtatorXMLParser.java
    ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/README

Modified: ctakes/trunk/ctakes-chunker/src/test/resources/data/README
URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-chunker/src/test/resources/data/README?rev=1517713&r1=1517712&r2=1517713&view=diff
==============================================================================
--- ctakes/trunk/ctakes-chunker/src/test/resources/data/README (original)
+++ ctakes/trunk/ctakes-chunker/src/test/resources/data/README Mon Aug 26 22:10:19 2013
@@ -2,12 +2,9 @@ The files in this directory are here to 
 
 contents:
 
-unit-test.chunker.model.bin.gz - chunker model generated by OpenNLP on unit-test.opennlp.chunks
 unit-test.opennlp.chunks - training data for test model.  Contains ~1000 words from GENIA.
-unit-test-model.bin.gz - pos tagging model generated by OpenNLP (copied from POS tagger project)
-unit-test-tag-dictionary.txt - simple tag model for unit tests, with all words constrained
to spit out "IN" it is a good contrast with the predicted values not using the tag dictionary.
 text-files - directory containing sample input files for testing the chunker 
 
-The chunker model was generated with the following command:
+A chunker model can be generated with the following command:
 
 java opennlp.tools.chunker.ChunkerME target/test-classes/data/unit-test.opennlp.chunks target/test-classes/data/unit-test.chunker.model.bin.gz

Modified: ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/ci/HyphenTextModifierImpl.java
URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/ci/HyphenTextModifierImpl.java?rev=1517713&r1=1517712&r2=1517713&view=diff
==============================================================================
--- ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/ci/HyphenTextModifierImpl.java
(original)
+++ ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/ci/HyphenTextModifierImpl.java
Mon Aug 26 22:10:19 2013
@@ -52,7 +52,7 @@ public class HyphenTextModifierImpl impl
 	private Tokenizer iv_tokenizer = null;
 
 	/*
-	 * DECPRECATED: Uses InputSteam instead
+	 * DECPRECATED: Use InputSteam instead of filename
 	 */
 	public HyphenTextModifierImpl(String hyphenfilename, int windowSize) {
 		iv_windowSize = windowSize;

Modified: ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/knowtator/KnowtatorXMLParser.java
URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/knowtator/KnowtatorXMLParser.java?rev=1517713&r1=1517712&r2=1517713&view=diff
==============================================================================
--- ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/knowtator/KnowtatorXMLParser.java
(original)
+++ ctakes/trunk/ctakes-core/src/main/java/org/apache/ctakes/core/knowtator/KnowtatorXMLParser.java
Mon Aug 26 22:10:19 2013
@@ -147,7 +147,7 @@ public class KnowtatorXMLParser {
                 if (mentionSlot != null) {
                   annotation.annotationSlots.put(mentionSlot.name, mentionSlot.value);
                 } else {
-                  throw new RuntimeException("no slot for " + slotId);
+                  LOGGER.warn("no simple slot for " + slotId);
                 }
               }
             }

Modified: ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/README
URL: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/README?rev=1517713&r1=1517712&r2=1517713&view=diff
==============================================================================
--- ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/README (original)
+++ ctakes/trunk/ctakes-pos-tagger/src/test/resources/data/README Mon Aug 26 22:10:19 2013
@@ -3,11 +3,10 @@ The files in this directory are here to 
 contents:
 GENIAcorpus3.02.pos.test.xml - contains two GENIA abstracts used for unit testing scripts/java/data.pos.training.GeniaPosTrainingDataExtractor.java
 unit-test-2lines-training-data.txt - 2 sentences from GENIA corpus in OpenNLP format with
some (obvious) modifications.  used for unit testing.
-unit-test-model.bin.gz - pos tagging model generated by OpenNLP
 unit-test-tag-dictionary.txt - simple tag model for unit tests, with all words constrained
to spit out "IN" it is a good contrast with the predicted values not using the tag dictionary.
 unit-test-tags.opennlp.format - supports unit tests for collection reader OpenNLPPOSCollectionReader.java
 unit-test-training-data.txt - 500 sentences from GENIA corpus in OpenNLP format
 
-The model was generated with the following command:
+A model can be generated with the following command:
 
 java opennlp.tools.postag.POSTaggerME data/test/unit-test-training-data.txt data/test/unit-test-model.bin.gz
\ No newline at end of file



Mime
View raw message