commons-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Davey <Michael.Da...@coderage.org>
Subject Re: validator: email-validation not accepting german "umlaute"
Date Tue, 20 Apr 2004 11:11:28 GMT
Sven Schliesing wrote:

> As for version 1.1.1 the Jakarta Commons-Validator is not accepting 
> german "umlaute" as parts of valid domain names like müller.de or 
> münchen.de
>
> Is this a known issue in validator or might this be a setting in Struts?

Valid domain names must contain only the characters a-z, A-Z, 0-9, "." 
and "-".  They must start with a letter and end with
a letter or digit.  The "." symbol is used exclusively to seperate 
subdomains (see RFC 1035 section 2.3.1 
<http://www.ietf.org/rfc/rfc1035.txt>).

To support internationalised domain names (IDN), both the client and the 
server must be punycode aware.  Punycode is a fairly new standards 
proposal (rfc3492) that encodes non-ascii characters into an ascii 
string, prefixed with "xn--".  For instance, müller.de is encoded as 
xn--mller-kva.de.

<http://www.faqs.org/rfcs/rfc3492.html>
<http://www.afilias.info/cgi-bin/convert_punycode.cgi>

Commons-Validator would need to be made Punycode-aware to achieve what 
you need, or alternatively, you could do the punycode translation in 
your own code, before passing the string to validator.

-- 
Michael


---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


Mime
View raw message