lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From mark harwood <>
Subject Re: SpanQueries in Luke
Date Fri, 05 Mar 2010 10:22:36 GMT
>>I'll commit the current mostly-working state today, you can take a look

OK. However I think this XMLQueryParser addition will only resurface a long-standing issue
with Luke and Lucene in general.
This query parser works best on multiple fields (e.g. free-text <UserQuery> tags and
<TermsFilter> on structured fields). Each field typically requires different analyzers
and there is currently no way of recording this information as metadata alongside an index.
Without this metadata each user's Luke session starts with a game of "guess-which-analyzer-to-use?"

I use my own proprietary system for storing such index metadata and this is through an XML
file that contains a BeanEncoder-serialized PerFieldAnalyserWrapper among other things.
It would be nice to see some standardisation in how this information can be made available
in *any* Lucene index but I guess this overlaps with things like Solr's config.


----- Original Message ----
From: Andrzej Bialecki <>
Sent: Fri, 5 March, 2010 10:03:23
Subject: Re: SpanQueries in Luke

On 2010-03-05 10:47, mark harwood wrote:
>>> No, this simply means that you will be able to use the xml-query-parser instead
of the regular one
> Not sure exactly what you have in mind for an editor, Andrzej but there is an opportunity
to do something smart here for little effort.
> The XMLQueryParser comes with a DTD which means you can have an editor that auto-completes/suggests
legal query syntax. Any schema-driven editing widget for XML could be used e.g.
> If none of these are any good another startpoint may be to use DTDdoc to parse the schema.
The DTD includes a lot of help-text comments which are used by the DTDdoc to generate HTML
documentation so using this tool gives you schema *and* help text.

I didn't have any editor in mind, I used a lo-tek solution of copy&paste 
to the query textarea.

I'll commit the current mostly-working state today, you can take a look 
- you've written some cool Luke plugins before .. ;)

Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration  Contact: info at sigram dot com

To unsubscribe, e-mail:
For additional commands, e-mail:


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message