ws-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Ws Wiki] Update of "SummerOfCode/2005/guththila/28" by DineshPremalal
Date Wed, 24 Aug 2005 03:28:03 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Ws Wiki" for change notification.

The following page has been changed by DineshPremalal:
http://wiki.apache.org/ws/SummerOfCode/2005/guththila/28

New page:
=== current status ===
------------

   * I was able to implement namespace support for Guththila C parser.
  
   * Now Guththila C parser has following API

             XML_PullParser_createPullParser ()

             XML_PullParser_freePullParser ()
   
             XML_PullParser_next ()

             XML_PullParser_getAttribute ()

             XML_PullParser_getAttributeName ()

             XML_PullParser_getAttributePrefix ()

             XML_PullParser_getAttributeValue ()

             XML_PullParser_getAttributeCount ()

             XML_PullParser_getNamespaceCount ()

             XML_PullParser_getNamespace ()

             XML_PullParser_getNamespacePrefix ()

             XML_PullParser_getNamespaceUri ()
 
             XML_PullParser_getName ()
 
             XML_PullParser_getPrefix ()

The use who createPullParser is responsible for calling freePullParser method after using
all the stuff. 

   * Here we don't support DTD's , PI's and CDATA's , if parser meet DTD's or PI's middle
of the documents it will try to ignore them.

   * Parser Ignore comments 


=== Special Notes ===

==== To Do ====

    * Implement Encoding support ( Give parser ability to work with both UTF-8 and UTF-16
documents)
            
===== General Plan on Implmenting Encoding support =====
          1. parser could auto detect the encoding type base on the BOM (Byte order mark )
(whether it UTF-8 or UTF-16)
          2. It will keep pointers in the buffer as necessary
          3. Encoding conversion will be done at Token_toString () method, which use to convert
token to a string
          4. Guththila C parser Give out put in UTF-8 format
    * Test for XML 1.0 Compliance 

Mime
View raw message