Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B5C0C200B41 for ; Thu, 7 Jul 2016 10:10:36 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B433E160A68; Thu, 7 Jul 2016 08:10:36 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0972D160A59 for ; Thu, 7 Jul 2016 10:10:35 +0200 (CEST) Received: (qmail 48751 invoked by uid 500); 7 Jul 2016 08:10:34 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 48740 invoked by uid 99); 7 Jul 2016 08:10:34 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2016 08:10:34 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id A2633C0403 for ; Thu, 7 Jul 2016 08:10:33 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2 X-Spam-Level: ** X-Spam-Status: No, score=2 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id t29L4Q0KK5jS for ; Thu, 7 Jul 2016 08:10:31 +0000 (UTC) Received: from step-net.it (host23-209-static.6-79-b.business.telecomitalia.it [79.6.209.23]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTP id C24C75F2F2 for ; Thu, 7 Jul 2016 08:10:30 +0000 (UTC) Received: from [127.0.0.1] by step-net.it (MDaemon PRO v10.1.1) with ESMTP id md50000778818.msg for ; Thu, 07 Jul 2016 10:06:11 +0200 X-Spam-Processed: step-net.it, Thu, 07 Jul 2016 10:06:11 +0200 (not processed: spam filter heuristic analysis disabled) X-Authenticated-Sender: valentina@step-net.it X-Return-Path: valentina@step-net.it X-Envelope-From: valentina@step-net.it X-MDaemon-Deliver-To: solr-user@lucene.apache.org To: solr-user@lucene.apache.org From: Valentina Cavazza Subject: search custom tags and attributes and get contents in solr Organization: Step srl Message-ID: <577E0E72.90507@step-net.it> Date: Thu, 7 Jul 2016 10:10:26 +0200 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.5.1 MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="------------000604090302070201070101" X-Antivirus: avast! (VPS 160706-1, 06/07/2016), Outbound message X-Antivirus-Status: Clean archived-at: Thu, 07 Jul 2016 08:10:36 -0000 --------------000604090302070201070101 Content-Type: text/plain; charset=iso-8859-15; format=flowed Content-Transfer-Encoding: 7bit I have a different problem so I created a new thead: I have a custom field type: in this field i have to seach custom tags and their attributes (i mean tag like html tag lile
) i would be able to search: a tag with an attribute equal to something, like:
*
a tag with an attribute that contain a certain word, like: word or like
*word*
a tag with an attribute that contain another tag that contain a certain word:
*word*
: in this case is important to find the final
match In the highlighter if I search a div I want to get the contents inside the div. I think i have to change the tokenizer but do not know which tokenizer to use. The tokenizer must be compatible with ICUFoldingFilterFactory because I need to make accents insensitive searches. --------------000604090302070201070101--