lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Will Johnson <willjohnsonsea...@gmail.com>
Subject RE: Why indexing database is necessary? (RE: indexing database)
Date Tue, 04 Mar 2008 20:00:15 GMT
Not necessarily, many of the high traffic search sites on the market today
for everything from yellow pages to job boards to ecommerce sites use search
engines to exclusively search *and* retrieve/serve content.  The key is that
they don't have to return all matching rows only the 'best' which are
probably the ones you would want anyways.

- will

-----Original Message-----
From: Duan, Nick [mailto:NDuan@mcdonaldbradley.com] 
Sent: Tuesday, March 04, 2008 2:29 PM
To: java-user@lucene.apache.org
Subject: RE: Why indexing database is necessary? (RE: indexing database)

Hmm, I guess that's because a database query returns a list of records,
whereas search engine returns only the links, not the actual content.
So a search engine works only in the index space, whereas a database
query engine would have to work in both index and content space...


ND

-----Original Message-----
From: Will Johnson [mailto:willjohnsonsearch@gmail.com] 
Sent: Tuesday, March 04, 2008 2:18 PM
To: java-user@lucene.apache.org
Subject: RE: Why indexing database is necessary? (RE: indexing database)

Don't forget the number 1 reason: speed.  For certain types of queries a
search engine can return results orders of magnitude faster than a
database.
I've seen search engines return hits in hundreds of milliseconds when
the
same database query took hours or even days.  That's not to say that a
search engine is always better, just the it often times is for when the
inputs and outputs are carefully defined.

- will

-----Original Message-----
From: Darren Hartford [mailto:dhartford@ghsinc.com] 
Sent: Tuesday, March 04, 2008 1:52 PM
To: java-user@lucene.apache.org
Subject: RE: Why indexing database is necessary? (RE: indexing database)

Indexing with lucene/nutch on top of/instead of DB indexing for:

1) relativity scoring
2) alias searching (i.e. a large amount of aliases, like first names)
3) highlighting
4) cross-datasource searching (multi DB, DB + XML files, etc).

As for best approach to externally index, I do not have any direct
pointers.  I would recommend looking at an ETL tool that can be extended
for this purpose (I've started writing a plugin for Pentaho, but got
pulled off and haven't finished it -- and that was for Solr, not
lucene/nutch).

-D

> -----Original Message-----
> From: Duan, Nick [mailto:NDuan@mcdonaldbradley.com]
> Sent: Tuesday, March 04, 2008 1:33 PM
> To: java-user@lucene.apache.org
> Subject: Why indexing database is necessary? (RE: indexing database)
> 
> Could anyone provide any insight on why someone would use nutch/lucene
> or any other search engines to index relational databases? With use
> cases if possible?  Shouldn't the database's own indexing mechanism be
> used since it is more efficient?
> 
> If there is such a need of indexing the database content using search
> engines, what would be the best approach other than de-normalizing the
> database?
> 
> Thanks a lot in advance!
> 
> ND
> -----Original Message-----
> From: payo [mailto:payo22@yahoo.com]
> Sent: Tuesday, March 04, 2008 12:36 PM
> To: nutch-user@lucene.apache.org
> Subject: indexing database
> 
> 
> hi to all
> 
> i can index a database with nutch?
> 
> i am use nutch 0.8.1
> 
> thanks
> --
> View this message in context:
> http://www.nabble.com/indexing-database-tp15832696p15832696.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message