Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1705910898 for ; Mon, 30 Sep 2013 22:16:42 +0000 (UTC) Received: (qmail 91668 invoked by uid 500); 30 Sep 2013 22:16:37 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 91301 invoked by uid 500); 30 Sep 2013 22:16:32 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 90858 invoked by uid 99); 30 Sep 2013 22:16:29 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Sep 2013 22:16:29 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of kcunningham@telligent.com designates 64.78.56.72 as permitted sender) Received: from [64.78.56.72] (HELO hub021-ca-7.exch021.serverdata.net) (64.78.56.72) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Sep 2013 22:16:21 +0000 Received: from MBX021-W3-CA-4.exch021.domain.local ([10.254.4.80]) by HUB021-CA-7.exch021.domain.local ([10.254.4.109]) with mapi id 14.03.0123.003; Mon, 30 Sep 2013 15:15:59 -0700 From: Kevin Cunningham To: "solr-user@lucene.apache.org" Subject: No longer allowed to store html in a 'string' type Thread-Topic: No longer allowed to store html in a 'string' type Thread-Index: Ac6+Kp2HrhkJ+CzKRoSvUM3tu2xiYQ== Date: Mon, 30 Sep 2013 22:15:59 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [162.213.112.162] Content-Type: multipart/alternative; boundary="_000_C03EB30312E7D846B68DF9E7D034F6D312D2214AMBX021W3CA4exch_" MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org --_000_C03EB30312E7D846B68DF9E7D034F6D312D2214AMBX021W3CA4exch_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable We have been using Solr for a while now, went from 1.4 -> 3.6. While runni= ng some tests in 4.4 we are no longer allowed to store raw html in a docume= nts field with a type of 'string', which we used to be able to do. Has some= thing changed here? Now we get the following error: Undeclared general ent= ity \"nbsp\"\r\n at [row,col {unknown-source}]: [11,53] I understand what its saying and can change the way we store and extract it= if it's a must but would like to understand what changed. Sounds like som= ething just became more strict to adhering to rules.

Testing #bananas = tag

document document document document document document

blog
--_000_C03EB30312E7D846B68DF9E7D034F6D312D2214AMBX021W3CA4exch_--