lucene-java-user mailing list archives

From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: How to escape URL at indexing time
Date Sun, 27 Dec 2015 21:16:19 GMT
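It looks like you tried to quote the URL in your query with apostrophes
(single quotes), but the classic query parser only treats quotation marks
(double quotes) as quoting characters. With apostrophes, the colon after
"http" is still parsed as a field separator, which is what triggers the
ParseException at column 8.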

Change:
id:'http://www.yahoo.com'
to:
id:"http://www.yahoo.com"
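
If the query string is built in code, the quoting can be done by a small
helper. The following is only a sketch: the class and method names
(QuotedTermQuery, quote) are made up for illustration and are not part of
Lucene. It wraps the value in double quotes and backslash-escapes any
backslash or double quote inside it, which is how the classic query syntax
expects those characters to appear within a quoted term.

```java
// Hypothetical helper, not a Lucene class: builds field:"value" strings
// suitable for the classic query parser.
final class QuotedTermQuery {

    private QuotedTermQuery() {
        // static helper only
    }

    // Returns field:"value", escaping backslashes first and then double
    // quotes so that the escapes themselves are not escaped twice.
    static String quote(String field, String value) {
        String escaped = value
                .replace("\\", "\\\\")
                .replace("\"", "\\\"");
        return field + ":\"" + escaped + "\"";
    }

    public static void main(String[] args) {
        // Builds the corrected query from the example above.
        System.out.println(quote("id", "http://www.yahoo.com"));
    }
}
```

The resulting string can then be handed to QueryParser.parse(). Lucene also
provides the static QueryParser.escape(String) method, which backslash-escapes
special characters such as the colon, and for an untokenized StringField like
"id" a TermQuery built directly from the URL avoids the parser altogether.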


-- Jack Krupansky

On Sun, Dec 27, 2015 at 3:53 PM, Daniel Valdivia <hola@danielvaldivia.com>
wrote:

> Hi
>
> I'm trying to index documents that have a URL in some field. However, as
> soon as I try to index a URL like "http://yahoo.com" I get this error:
>
> org.apache.lucene.queryparser.classic.ParseException: Cannot parse 'id:'
> http://www.yahoo.com'': Encountered " ":" ": "" at line 1, column 8.
>
> I assume I need to escape the URL, but I'm not sure if encoding the URL is
> the right way to go.
>
> my indexing code:
>
> Document doc = new Document();
>
> doc.add(new StringField("id", url, Field.Store.YES));
> doc.add(new StringField("domain", domain, Field.Store.NO));
> doc.add(new StringField("title", pageTitle, Field.Store.NO));
> doc.add(new TextField("body", pageBody, Field.Store.NO));
> w.addDocument(doc);
>
> Any ideas on how I can avoid the parsing issue?
>
> I’m using Lucene 5.4.0
