commons-dev mailing list archives

From Tim O'Brien <tobr...@transolutions.net>
Subject Re: Handling text encodings
Date Mon, 18 Nov 2002 15:52:29 GMT
The codec package is very simple.  Right now it contains three encoders
geared specifically towards language (Soundex, RefinedSoundex, and
Metaphone).  It also contains a Base64 encoder and decoder.

There is only one interface, "Encoder", with one method, "public
String encode(String pString)".  I think we need another interface,
"Decoder", with a similarly simple method, "public String decode(String
pString)".

Soundex, RefinedSoundex, and Metaphone are all one-way Encoders; there is
no way to "decode" the output of these algorithms.  Punycode, Bootstring,
and Base64 are both encoding and decoding algorithms, so I see the need
for a separate Decoder interface.
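
As a rough sketch of how that split might look (not the actual codec
source): the two interfaces below use the method signatures quoted above,
while VowelStripEncoder, Base64Codec, and CodecSketch are hypothetical
names made up for illustration.  The Base64 class leans on java.util.Base64
from the modern JDK and assumes UTF-8 for the String/byte conversion.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.Locale;

// The existing single-method interface, as described in the message.
interface Encoder {
    String encode(String pString);
}

// The proposed companion interface for algorithms that can be reversed.
interface Decoder {
    String decode(String pString);
}

// Hypothetical one-way encoder (a toy stand-in, NOT Soundex or Metaphone):
// it upper-cases the input and drops vowels, so the input cannot be recovered.
final class VowelStripEncoder implements Encoder {
    @Override
    public String encode(String pString) {
        return pString.toUpperCase(Locale.ROOT).replaceAll("[AEIOU]", "");
    }
}

// Hypothetical two-way codec: implements both interfaces.
// Assumption: text is converted to bytes as UTF-8 before Base64 encoding.
final class Base64Codec implements Encoder, Decoder {
    @Override
    public String encode(String pString) {
        byte[] raw = pString.getBytes(StandardCharsets.UTF_8);
        return Base64.getEncoder().encodeToString(raw);
    }

    @Override
    public String decode(String pString) {
        byte[] raw = Base64.getDecoder().decode(pString);
        return new String(raw, StandardCharsets.UTF_8);
    }
}

class CodecSketch {
    public static void main(String[] args) {
        Encoder oneWay = new VowelStripEncoder();
        // Two different inputs collapse to the same key, so no decode exists.
        System.out.println(oneWay.encode("Robert"));
        System.out.println(oneWay.encode("Rabert"));

        Base64Codec twoWay = new Base64Codec();
        String encoded = twoWay.encode("codec");
        System.out.println(encoded);
        System.out.println(twoWay.decode(encoded));
    }
}
```

With this shape, a Bootstring or Punycode codec would implement both
interfaces the way Base64Codec does, while the phonetic encoders would
keep implementing Encoder alone.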

Tim O'Brien
tobrien@discursive.com

On Mon, 18 Nov 2002, Ola Berg wrote:

> > Sounds like a logical addition to the codec project.  Also, the IETF
> > draft contains a sample implementation in C; it should be easy to
> > convert this over to Java.
> >
> > http://www.ietf.org/internet-drafts/draft-ietf-idn-punycode-03.txt
> 
> Can you give me any starting points for how to implement a codec
> according to your framework? I plan to implement a generic Bootstring
> codec, and a specialized Punycode codec on top of that.
> 
> /O
> 
> 
> > -----Original Message-----
> > From: Ola Berg [mailto:ola.berg@ports.se] 
> > Sent: Friday, November 15, 2002 3:06 AM
> > To: Jakarta Commons Developers List
> > Subject: Handling text encodings
> > 
> > 
> > I am about to develop Punycode and RACE converters at work
> > (both are mechanisms for converting Unicode to ASCII and
> > back, used e.g. in multilingual domain names such as
> > "bärs-och-öl.org"), which gave me this thought:
> > 
> > Text encoding and decoding is a pretty common task (Commons Net,
> > Tomcat, etc.). Isn't there room for a common mechanism?
> > 
> > And BTW, if anyone has done (or knows of) Punycode and/or RACE
> > codecs, maybe there is room for cooperation? I am
> > investigating, and just found some example code from a
> > Unicode.org meeting.
> > 
> > /Ó
> >
> > --
> > To unsubscribe, e-mail:   <mailto:commons-dev-unsubscribe@jakarta.apache.org>
> > For additional commands, e-mail: <mailto:commons-dev-help@jakarta.apache.org>
> >
> 

-- 
----------------------
Tim O'Brien
Evanston, IL
(847) 863-7045
tobrien@discursive.com


