commons-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sebb (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (VALIDATOR-235) UrlValidator rejects url with german umlaut
Date Sun, 04 Jan 2015 22:18:35 GMT

    [ https://issues.apache.org/jira/browse/VALIDATOR-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264062#comment-14264062
] 

Sebb commented on VALIDATOR-235:
--------------------------------

There are two parts to this.

1) Syntax validation.
At present the Regex does not allow Unicode characters, because they are not permitted by
RFC3986

2) Domain validation
At present the Unicode versions of TLDs are not included in the DomainValidator tables, and
the code does not convert Unicode domains to punycode in order to check against the punycode
entries.
There is no point doing either of these until the Regex issues are sorted.

Presumably the intention is to extend RFC3986 so that Alpha characters can now include Unicode
characters.
Similarly for Alphanumerics. But there may be some exceptions. Need the relevant RFCs.

> UrlValidator rejects url with german umlaut
> -------------------------------------------
>
>                 Key: VALIDATOR-235
>                 URL: https://issues.apache.org/jira/browse/VALIDATOR-235
>             Project: Commons Validator
>          Issue Type: Bug
>    Affects Versions: 1.3.1 Release
>            Reporter: Brian Preuß
>
> e.g. http://www.düsseldorf.de/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message