Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 131511756D for ; Fri, 24 Oct 2014 20:43:10 +0000 (UTC) Received: (qmail 30476 invoked by uid 500); 24 Oct 2014 20:43:08 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 30415 invoked by uid 500); 24 Oct 2014 20:43:08 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 30402 invoked by uid 99); 24 Oct 2014 20:43:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Oct 2014 20:43:07 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of bimargulies@gmail.com designates 209.85.214.177 as permitted sender) Received: from [209.85.214.177] (HELO mail-ob0-f177.google.com) (209.85.214.177) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Oct 2014 20:42:41 +0000 Received: by mail-ob0-f177.google.com with SMTP id m8so117900obr.22 for ; Fri, 24 Oct 2014 13:42:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=nbkZmZwjunXAygNx11zhjh8MGEJP9Dxkc884T06qhkg=; b=axB1T3S22OU3gYzNjwvI39TCkXniFN7SWe4mIgmnjBAPb0HE01ehLGxkZ/V/S2Bxjb 8gx3nV4nXMHkY0PfwKAbGuIZPWQGBH6fVQ7rwelBMXv9CRb2r4RF6xv2cKofN40lC2/Z A1A5pGS8oepKbJwWCwmmXljCS2l8EFLRBBi8lYplMCm0xqGVh3XNcRmfqG5reo6jYOOX w7biUoZVXUQpdzmSjE7dC5wfseaEqF/WQq+eUZ+TChTynfaj5QCXqtsiB2YF7Is7kMEX f2TdSX92Vy9WMAwPimuf5p+0P1qUsy4QZI1R9NOg0rWfgqNlNIBR6QzP6gMkHzUemWls Ex/w== MIME-Version: 1.0 X-Received: by 10.182.95.9 with SMTP id dg9mr3704508obb.44.1414183360151; Fri, 24 Oct 2014 13:42:40 -0700 (PDT) Received: by 10.202.1.84 with HTTP; Fri, 24 Oct 2014 13:42:40 -0700 (PDT) Date: Fri, 24 Oct 2014 16:42:40 -0400 Message-ID: Subject: A really hairy token graph case From: Benson Margulies To: "java-user@lucene.apache.org" Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org Consider a case where we have a token which can be subdivided in several ways. This can happen in German. We'd like to represent this with positionIncrement/positionLength, but it does not seem possible. Once the position has moved out from one set of 'subtokens', we see no way to move it back for the second set of alternatives. Is this something that was considered? --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org