hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Earl Cahill (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-488) extractor search terms from referer
Date Fri, 10 Oct 2008 09:28:44 GMT

     [ https://issues.apache.org/jira/browse/PIG-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Earl Cahill updated PIG-488:
----------------------------

    Attachment: SearchTermExtractor-PIG-488

I based my regexes, code and tests on Spiros Denaxas' code here

http://search.cpan.org/~sden/URI-ParseSearchString-2.6/

> extractor search terms from referer
> -----------------------------------
>
>                 Key: PIG-488
>                 URL: https://issues.apache.org/jira/browse/PIG-488
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Earl Cahill
>         Attachments: SearchTermExtractor-PIG-488
>
>
> Want to be able to extract search terms from a url. For example,
> http://www.google.com/search?hl=en&safe=active&rls=GGLG,GGLG:2005-24,GGLG:en&q=purpose+of+life&btnG=Search
>  
> then
> purpose of life
> would be extracted.
> Pig latin usage looks like
> searchTerms = FOREACH row GENERATE org.apache.pig.piggybank.evaluation.util.apachelogparser.SearchTermExtractor(url);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message