Mailing-List: contact derby-dev-help@db.apache.org; run by ezmlm
Precedence: bulk
Reply-To: <derby-dev@db.apache.org>
Received-SPF: pass (herse.apache.org: domain of msatoor@gmail.com designates
 209.85.132.246 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=beta;
        h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references;
        b=NUK9044VQyAGXqXdIE1IVJRzTS4hFsVbFQ3PfVsM9p7x7gzV6k9+5O0TXyq4FrGdiyfPHPkAQ2FtHni81lPYqaPrjQBtp0+hzQfE+xzmV5JmJGKtJowyTg2kOKUsJOTpaiKsjDhwusRDatIBXDJarQPHNd0V62wTjDrKBYf58GQ=
Message-ID: <d9619e4a0704030925h7163d47fk1054a253a08c1ed3@mail.gmail.com>
Date: Tue, 3 Apr 2007 09:25:04 -0700
From: "Mamta Satoor" <msatoor@gmail.com>
To: derby-dev@db.apache.org
Subject: Re: Feedback on wiki page
 http://wiki.apache.org/db-derby/BuiltInLanguageBasedOrderingDERBY-1478
In-Reply-To: <46127B2B.7080204@sun.com>
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_Part_12993_11777023.1175617504855"
References: <d9619e4a0704022305l66bfc330n7c9377cbf3da2fa7@mail.gmail.com>
	 <46127B2B.7080204@sun.com>

------=_Part_12993_11777023.1175617504855
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

Rick, Dan had also brought up this point. Dan's comment from *
http://www.nabble.com/Collation-feature-discussion-p9526316.html*
"This approach means that CHAR(varchar_col, 20) behaves differently to CAST
(varchar_col AS CHAR(20)). Not sure if that's good or bad, but they might be
implemented today using the same code path. "

I think what you are proposing will be easier to implement and easier to
explain to the users and fits in the SQL spec model. I wasn't trying to
solve any paritcular scenario but was just trying to make CHAR work like
TRIM when a character string type was it's first parameter. If no objections
by the end of the day, then I will go ahead and change the wiki page for
CHAR/VARCHAR functions to have the same collation as current schema's
character set no matter what kind of parameter is passed to it.


On 4/3/07, Rick Hillegas <Richard.Hillegas@sun.com> wrote:
>
> Hi Mamta,
>
> Thanks for describing this behavior on a tidy wiki page. Having all of
> this material collected in one place is great. I have a comment:
>
> 6)CHAR, VARCHAR functions do not look like they are defined in the SQL
> spec. But based on 5) above, the result character string type's
> collation can be considered same as the first argument's collation type
> if the first argument to CHAR/VARCHAR function is a character string
> type. If the first argument is not character string type, then the
> result character string of CHAR/VARCHAR will have the same collation as
> current schema's character set. The collation derivation will be implicit.
>
> I think the behavior would be easier to understand if it were uniform,
> that is, if the CHAR and VARCHAR operators always returned strings which
> had the collation of the current schema. I suspect you will find that
> this is easier to implement. I also think that this is the intention of
> the SQL Standard. Here is my reasoning:
>
> It seems to me that there is a default (implementation-defined)
> character set and collation for the whole database. That default can be
> overridden at the session, schema, and client-module levels. That is,
> once you know what database, session, schema, and client-module you are
> in, you know the default character set and collation for string
> datatypes mentioned by your SQL statements. This default can be
> explicitly overridden with a CAST or COLLATE clause. There are also
> explicit exceptions to this behavior for certain operators ( e.g., TRIM,
> UPPER, LOWER, SUBSTR). The default character set and collation apply
> unless the SQL Standard explicitly defines an exception or your
> statement explicitly overrides the default. The default character set
> and collation apply to the return types of the CHAR and VARCHAR
> operators because the SQL Standard does not carve out an explicit
> exception for these operators.
>
> Is there some problem that would be solved by adopting the non-uniform
> behavior proposed on the wiki page?
>
> Thanks,
> -Rick
>
> Mamta Satoor wrote:
> > Hi,
> >
> > I have created a wiki page for DERBY-1478 : Add built in language
> > based ordering and like processing to Derby
> >
> > The wiki page is located at
> > http://wiki.apache.org/db-derby/BuiltInLanguageBasedOrderingDERBY-1478and
> > it includes the current design proposal along with line items. If
> > anyone has any comments, please let me know.
> >
> > thanks,
> > Mamta
>
>

------=_Part_12993_11777023.1175617504855
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

<div>Rick, Dan had also brought up this point. Dan&#39;s comment from <u><font color="#800080"><a href="http://www.nabble.com/Collation-feature-discussion-p9526316.html">http://www.nabble.com/Collation-feature-discussion-p9526316.html
</a></font></u> </div>
<div>&quot;This approach means that CHAR(varchar_col, 20) behaves differently to CAST (varchar_col AS CHAR(20)). Not sure if that&#39;s good or bad, but they might be implemented today using the same code path. &quot;</div>

<div>&nbsp;</div>
<div>I think what you are proposing will&nbsp;be easier to implement and easier to explain to the users and fits in the SQL spec model. I wasn&#39;t trying to solve any paritcular scenario but was just trying to make CHAR work like TRIM when a character string type was it&#39;s first parameter. If no objections by the end of the day, then I will go ahead and change the wiki page for CHAR/VARCHAR functions to have&nbsp;the&nbsp;same collation as current schema&#39;s character set no matter what kind of parameter is passed to it.
<br><br>&nbsp;</div>
<div><span class="gmail_quote">On 4/3/07, <b class="gmail_sendername">Rick Hillegas</b> &lt;<a onclick="return top.js.OpenExtLink(window,event,this)" href="mailto:Richard.Hillegas@sun.com" target="_blank">Richard.Hillegas@sun.com
</a>&gt; wrote:</span> 
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">Hi Mamta,<br><br>Thanks for describing this behavior on a tidy wiki page. Having all of<br>this material collected in one place is great. I have a comment: 
<br><br>6)CHAR, VARCHAR functions do not look like they are defined in the SQL<br>spec. But based on 5) above, the result character string type&#39;s<br>collation can be considered same as the first argument&#39;s collation type 
<br>if the first argument to CHAR/VARCHAR function is a character string<br>type. If the first argument is not character string type, then the<br>result character string of CHAR/VARCHAR will have the same collation as<br>
current schema&#39;s character set. The collation derivation will be implicit.<br><br>I think the behavior would be easier to understand if it were uniform,<br>that is, if the CHAR and VARCHAR operators always returned strings which 
<br>had the collation of the current schema. I suspect you will find that<br>this is easier to implement. I also think that this is the intention of<br>the SQL Standard. Here is my reasoning:<br><br>It seems to me that there is a default (implementation-defined) 
<br>character set and collation for the whole database. That default can be<br>overridden at the session, schema, and client-module levels. That is,<br>once you know what database, session, schema, and client-module you are 
<br>in, you know the default character set and collation for string<br>datatypes mentioned by your SQL statements. This default can be<br>explicitly overridden with a CAST or COLLATE clause. There are also<br>explicit exceptions to this behavior for certain operators ( 
e.g., TRIM,<br>UPPER, LOWER, SUBSTR). The default character set and collation apply<br>unless the SQL Standard explicitly defines an exception or your<br>statement explicitly overrides the default. The default character set 
<br>and collation apply to the return types of the CHAR and VARCHAR<br>operators because the SQL Standard does not carve out an explicit<br>exception for these operators.<br><br>Is there some problem that would be solved by adopting the non-uniform 
<br>behavior proposed on the wiki page?<br><br>Thanks,<br>-Rick<br><br>Mamta Satoor wrote:<br>&gt; Hi,<br>&gt;<br>&gt; I have created a wiki page for DERBY-1478 : Add built in language<br>&gt; based ordering and like processing to Derby 
<br>&gt;<br>&gt; The wiki page is located at<br>&gt; <a onclick="return top.js.OpenExtLink(window,event,this)" href="http://wiki.apache.org/db-derby/BuiltInLanguageBasedOrderingDERBY-1478" target="_blank">http://wiki.apache.org/db-derby/BuiltInLanguageBasedOrderingDERBY-1478
</a> and<br>&gt; it includes the current design proposal along with line items. If <br>&gt; anyone has any comments, please let me know.<br>&gt;<br>&gt; thanks,<br>&gt; Mamta<br><br></blockquote></div><br>

------=_Part_12993_11777023.1175617504855--