Date: Wed, 26 May 2010 21:01:44 -0700
Subject: Re: Dynamic analyzers
From: Lance Norskog
To: solr-user@lucene.apache.org

If you want to OR a search across many language inputs, you can copy all of the text into an all-languages field. A pan-language search would then hit just that one field.

On Mon, May 24, 2010 at 9:28 AM, dan sutton wrote:
> Hi,
>
> I have a requirement to dynamically choose a fieldType to analyze text in
> multiple languages. I will know the language (in a separate field) at index
> and query time.
>
> I've tried implementing this with a custom UpdateRequestProcessorFactory and
> a custom DocumentBuilder.toDocument to change the FieldType, but this doesn't
> work.
>
> I realize I can have e.g. text_en, text_de, ... and dynamically populate these
> with a custom UpdateRequestProcessorFactory, but we are worried that with all
> the languages (let's say 50+), effectively doing an OR across 50 fields will
> be a performance issue. Is this true?
>
> Many thanks in advance,
> Dan
>

--
Lance Norskog
goksron@gmail.com
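
The all-languages suggestion above can be sketched roughly as follows. This is a minimal illustration, not a Solr API: the field name text_all, the text_<lang> naming, and both helper functions are hypothetical. In Solr itself the copy would normally be declared with copyField in the schema instead of being done by the client. Note also that a single catch-all field gets one analyzer, so language-specific analysis does not apply to it.

```python
# Hypothetical field names: "text_<lang>" per language, "text_all" as the catch-all.
ALL_LANGUAGES_FIELD = "text_all"


def build_document(doc_id, text, lang):
    """Return a document dict with the text in its per-language field
    and a copy of the same text in the shared all-languages field."""
    return {
        "id": doc_id,
        "lang": lang,
        f"text_{lang}": text,
        ALL_LANGUAGES_FIELD: text,
    }


def build_query(terms, lang=None):
    """Build a query string against a single field: the per-language field
    when the language is known, otherwise the all-languages field.
    Assumes `terms` is already escaped for the query parser."""
    field = f"text_{lang}" if lang else ALL_LANGUAGES_FIELD
    return f"{field}:({terms})"


doc = build_document("1", "Guten Morgen", "de")
print(doc)
print(build_query("morgen", "de"))  # language known: one per-language field
print(build_query("morgen"))        # pan-language: one catch-all field
```

Either way the query touches one field, so the 50-way OR across text_en, text_de, ... is avoided.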