lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From mik07 <>
Subject Re: Online Question Answering demo using Lucene
Date Wed, 14 May 2008 17:02:13 GMT

Thanks! And you are right, it's roughly the same as Powerset.

It's slower because:
* The demo runs on a single machine (not on a cluster).
* We need to query search engines through their API, which have a 1 second
build-in delay per query.
* We parse sentences once we retrieve them from the search engines and
parsers are still rather slow. Powerset on the other hand, parses Wikipedia
before indexing and indexes the semantic structures. So no parsing needs to
be performed when a user asks a query (beside the parsing of that query, I
* The Lucene index of the complete English Wikipedia we built is 8.3 GB big.
On our machine it takes 2 seconds per query to get a result.

You could address these issues with enough money and man power. But it's
just a research project, developed by one person. We don't have the
resources. (Please drop me an email if you have some ;-)

View this message in context:
Sent from the Lucene - General mailing list archive at

View raw message