lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bill Fowler" <wwfow...@gmail.com>
Subject Indexing a URL
Date Wed, 05 Sep 2007 20:59:49 GMT
Hello,

I am trying to post the following to my index:

<field name="url">http://www.nytimes.com/2007/08/25/business/worldbusiness/25yuan.html?ex=1345694400&en=499af384a9ebd18f&ei=5088&partner=rssnyt&emc=rss
</field>

The url field is defined as:

   <field name="url" type="string" indexed="false" stored="true" />

However, I get the following error:

Posting file docstor/ffc110ee5c9a2ed28c8f35aa243bb53b.xml to
http://localhost:8983/news_feed/update
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>
<title>Error 500 </title>
</head>
<body><h2>HTTP ERROR: 500</h2><pre>ParseError at [row,col]:[3,104]
Message: The reference to entity "en" must end with the ';' delimiter.

It is apparently attempting to parse &en=499af384a9ebd18f in the URL.  I am
not clear why it would do this as I specified indexed="false."  I need to
store this because that is how the user gets to the original article.

Is there any data type that simply ignores the characters in the field?  I
don't care that it can't be a search field.  I've tried the "ignored" field
type and it still gives me the same error.

Thanks,

Bill

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message