uima-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Greg Holmberg <holmberg2...@comcast.net>
Subject Bug in TikaAnnotator/desc/MarkupAnnotatorTypeSystem.xml ?
Date Fri, 16 Sep 2011 17:00:41 GMT


In TikaAnnotator/desc/MarkupAnnotatorTypeSystem.xml, the MarkupAnnotation type is defined
to have a feature named "attributes" of range-type FSArray and element-type FSArray. 

In a small sample of XMI output, I see MarkupAnnotations with "attributes" values referencing
objects of type AttributeFS, not FSArray.  For example: 

  <tika:MarkupAnnotation xmi:id="97" sofa="61" begin="33" end="52" attributes="110" name="a"
qualifiedName="a" uri="http://www.w3.org/1999/xhtml" /> 

  <tika:AttributeFS xmi:id="110" localName="href" qualifiedName="href" uri="" value="/Title?0091209"
/> 

  
Shouldn't the element-type of the "attributes" feature of the MarkupAnnotation type be AttributeFS,
not FSArray? 

    <typeDescription>
      <name>org.apache.uima.tika.MarkupAnnotation</name>
      <description/>
      <supertypeName>uima.tcas.Annotation</supertypeName>
      <features>
        <featureDescription>
          <name>attributes</name>
          <description/>
          <rangeTypeName>uima.cas.FSArray</rangeTypeName>
          <elementType>org.apache.uima.tika.AttributeFS</elementType>
        </featureDescription>


Thanks,

Greg

Mime
View raw message