hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "John Sichi (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1747) Make case-insensitive substring search as fast as case-sensitive variant
Date Thu, 11 Nov 2010 22:04:14 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931227#action_12931227
] 

John Sichi commented on HIVE-1747:
----------------------------------

Also:  while at ApacheCon, I heard that the Pig team was using this as a faster replacement
for Java regexp:

http://www.brics.dk/automaton/

We might want to do the same.  See PIG-965 for more details.


> Make case-insensitive substring search as fast as case-sensitive variant
> ------------------------------------------------------------------------
>
>                 Key: HIVE-1747
>                 URL: https://issues.apache.org/jira/browse/HIVE-1747
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.7.0
>            Reporter: John Sichi
>
> {noformat}
> WHERE str_col like '%XYZ%'
> {noformat}
> is very fast because the pattern matching code is special-cased.  However,
> {noformat}
> WHERE str_col rlike '(?i).*XYZ.*'
> {noformat}
> is very slow because it goes through the generic Java pattern matching.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message