Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5F5B39983 for ; Fri, 16 Mar 2012 06:45:42 +0000 (UTC) Received: (qmail 81071 invoked by uid 500); 16 Mar 2012 06:45:39 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 80845 invoked by uid 500); 16 Mar 2012 06:45:33 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 80813 invoked by uid 99); 16 Mar 2012 06:45:32 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 06:45:32 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of fancyerii@gmail.com designates 209.85.220.176 as permitted sender) Received: from [209.85.220.176] (HELO mail-vx0-f176.google.com) (209.85.220.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 16 Mar 2012 06:45:25 +0000 Received: by vcbfl17 with SMTP id fl17so6156565vcb.35 for ; Thu, 15 Mar 2012 23:45:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=fhAF6f3tC4Vwo8YXTvvASyCXr0kmMpmtVOjXmSAIqFA=; b=GQ9dSnF657gNEDFqoOMZpe2zJ3OhHZgzQKb5pIYK2/IRCUUF41VN1SDG8HDXPYBXJx LATwO29cGFsp7EgE0F6Ub11Qb3UELqRLhCyid3ehQUaRhW6NLtu3jlFkZqNmj+QM09ro CmXLoTJgXprFWmSCj/Pz3W9MrJG+TEBd5OPxEAZ9NTZrUIoYo249nzmYbBvx6uqYksoy 2Z1Ho8pq9aHRv82efgoNyqWuSbXi10rtF0VJHv51PewM9ddtNLFonVSoM17P7Mz8aUbU IMdUR9/SLvFxl4p+d4JPPleaLTyoxqb0wEs6XPE2vAuCXa5uBdSZgzOAqcNcb6wHnd9u 757w== MIME-Version: 1.0 Received: by 10.52.92.140 with SMTP id cm12mr964707vdb.115.1331880304637; Thu, 15 Mar 2012 23:45:04 -0700 (PDT) Received: by 10.220.193.140 with HTTP; Thu, 15 Mar 2012 23:45:04 -0700 (PDT) In-Reply-To: <1331877807324-3831064.post@n3.nabble.com> References: <1331714047311-3824726.post@n3.nabble.com> <1331742806877-3825959.post@n3.nabble.com> <1331877807324-3831064.post@n3.nabble.com> Date: Fri, 16 Mar 2012 14:45:04 +0800 Message-ID: Subject: Re: How to avoid the unexpected character error? From: Li Li To: solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=20cf307f339c70f83b04bb568c72 X-Virus-Checked: Checked by ClamAV on apache.org --20cf307f339c70f83b04bb568c72 Content-Type: text/plain; charset=ISO-8859-1 it's not the right place. when you use java -Durl=http://... -jar post.jar data.xml the data.xml file must be a valid xml file. you shoud escape special chars in this file. I don't know how you generate this file. if you use java program(or other scripts) to generate this file, you should use xml tools to generate this file. but if you generate like this: StringBuilder buf=new StringBuilder(); buf.append(""); buf.append(""); buf.append("text content"); you should escape special chars. if you use java, you can make use of org.apache.solr.common.util.XML class On Fri, Mar 16, 2012 at 2:03 PM, neosky wrote: > I am sorry, but I can't get what you mean. > I tried the HTMLStripCharFilter and PatternReplaceCharFilter. It doesn't > work. > Could you give me an example? Thanks! > > positionIncrementGap="100"> > > > > > > > I also tried: > > replacement="" > maxBlockChars="10000" blockDelimiters="|"/> > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/How-to-avoid-the-unexpected-character-error-tp3824726p3831064.html > Sent from the Solr - User mailing list archive at Nabble.com. > --20cf307f339c70f83b04bb568c72--