lucene-java-user mailing list archives

From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: EMAIL ADDRESS: Tokenize (i.e. an EmailAnalyzer)
Date Sun, 30 Jul 2006 03:43:54 GMT
No, you're not missing anything. :)
That JavaMail API is good for getting the whole email, but you then need to chop it up with
your EmailAnalyzer, so you're doing the right thing.

Otis

----- Original Message ----
From: Michael J. Prichard <michael_prichard@mac.com>
To: java-user@lucene.apache.org
Sent: Saturday, July 29, 2006 2:51:59 PM
Subject: Re: EMAIL ADDRESS: Tokenize (i.e. an EmailAnalyzer)

Hasan Diwan wrote:

> Michael:
>
> On 7/28/06, Michael J. Prichard <michael_prichard@mac.com> wrote:
>
>> Howdy....not sure if anyone else wants this but here is my first attempt
>> at writing an analyzer for an email address...modifications, updates,
>> fixes welcome.
>
>
> Why reinvent the wheel? See
> http://java.sun.com/products/javamail/javadocs/javax/mail/internet/InternetAddress.html#parse(java.lang.String)
>
> and use as:
>
> InternetAddress valid = InternetAddress.parse(string)[0]; // far simpler than rewriting it
>
I don't see how that lets me break an email address into simpler pieces
for tokens.  I already use JavaMail when parsing the message, and then
pull out the address using InternetAddress.  What I don't see is how to
turn an address like john@foo.com into "john@foo.com", "john", "foo.com",
"foo" and "com" without splitting it myself.  Am I missing something?

Thanks!
Michael
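
For reference, the split described above can be sketched in plain Java.
This is only an illustration under assumptions: the class name EmailTokens
and the method split are made up for this example and are not part of
Lucene or JavaMail, and the code is not the EmailAnalyzer posted earlier
in the thread. It does no address validation; in an actual analyzer these
strings would be emitted as tokens from a TokenStream.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not a Lucene or JavaMail class): breaks an address
// into the whole address, the local part, the domain, and the domain labels.
final class EmailTokens {

    static List<String> split(String address) {
        List<String> tokens = new ArrayList<>();
        String trimmed = address.trim();
        tokens.add(trimmed);                      // e.g. "john@foo.com"

        int at = trimmed.lastIndexOf('@');
        if (at <= 0 || at == trimmed.length() - 1) {
            return tokens;                        // no usable local part or domain
        }

        tokens.add(trimmed.substring(0, at));     // e.g. "john"
        String domain = trimmed.substring(at + 1);
        tokens.add(domain);                       // e.g. "foo.com"

        // Add each dot-separated label, skipping empties and a
        // single-label domain that would just repeat the domain token.
        for (String label : domain.split("\\.")) {
            if (!label.isEmpty() && !label.equals(domain)) {
                tokens.add(label);                // e.g. "foo", "com"
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [john@foo.com, john, foo.com, foo, com]
        System.out.println(split("john@foo.com"));
    }
}
```

InternetAddress.parse can still be used first to pull a clean address out
of a header; the splitting step then runs on the string it returns.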

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org




