lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steven Rowe (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-2745) ArabicAnalyzer - the ability to recognise email addresses host names and so on
Date Thu, 11 Nov 2010 15:47:13 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931046#action_12931046
] 

Steven Rowe commented on LUCENE-2745:
-------------------------------------

{quote}
bq. Likely LUCENE-2302 is the biggest issue (it will block compilation), but if I remember
correctly, the change is fairly simple.

Simply use only the now-deprecated TermAttribute instead of backporting this issue. For StandardTokenizer
this should be simple, just replace some methods like copyBuffer() to setTermBuffer() and
replace the addAttributes and remove Generics, but add casts.
{quote}

Yes, sorry, I meant what Uwe is saying here - there is no (good) reason to backport LUCENE-2302
to 2.9.X.

> ArabicAnalyzer - the ability to recognise email addresses host names and so on
> ------------------------------------------------------------------------------
>
>                 Key: LUCENE-2745
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2745
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>    Affects Versions: 2.9.2, 2.9.3, 3.0, 3.0.1, 3.0.2
>         Environment: All
>            Reporter: M Alexander
>
> The ArabicAnalyzer does not recognise email addresses, hostnames and so on. For example,
> adam@hotmail.com
> will be tokenised to [adam] [hotmail] [com]
> It would be great if the ArabicAnalyzer can tokenises this to [adam@hotmail.com]. The
same applies to hostnames and so on.
> Can this be resolved? I hope so
> Thanks
> MAA

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message