Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 57D6AE751 for ; Thu, 13 Dec 2012 21:46:04 +0000 (UTC) Received: (qmail 60078 invoked by uid 500); 13 Dec 2012 21:46:02 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 60029 invoked by uid 500); 13 Dec 2012 21:46:02 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 59942 invoked by uid 99); 13 Dec 2012 21:45:59 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Dec 2012 21:45:59 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of goksron@gmail.com designates 209.85.220.48 as permitted sender) Received: from [209.85.220.48] (HELO mail-pa0-f48.google.com) (209.85.220.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Dec 2012 21:45:50 +0000 Received: by mail-pa0-f48.google.com with SMTP id fa1so1785660pad.35 for ; Thu, 13 Dec 2012 13:45:29 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type; bh=oR3wGBRGVR4/Vdm6Kc3WpsUHmDm7FoFAJfaxnxRcfJs=; b=WNWBjygLvhk5LDegQdHMF2dE3LOTZde2fC5mS3HoiWQZ7j6Pk+g/9rNHgu6zyeMRtz phBHREeXQheUBeIP5jXpWQ49QQuvelvkS6GmuHZeyMY9wIVVT+E/NjejW7ZTGmL9iye/ 7Lo6mZaNTKkY2ToLXuNldtKmq8TDnDGPw8tlLXD5N81q47wc+DPvQWtF5fLRrmAKgyZ1 3jWoHoAXHpZbOhOv4VfHHeFMCOEg3G/tgyt/DEMC7eNQHSq1wYhSui3M9KBG9+KxqeLP gAC0KPHj8dFECaeFBkQvCK15O76cUFAvPSbwq+BfNMdJTP2e6JjJUYvrbtHdp3kOhaB1 ynxg== Received: by 10.68.240.36 with SMTP id vx4mr9326802pbc.90.1355435128879; Thu, 13 Dec 2012 13:45:28 -0800 (PST) Received: from [10.19.220.76] (64-71-21-34.static.wiline.com. [64.71.21.34]) by mx.google.com with ESMTPS id l5sm1889737paz.14.2012.12.13.13.45.26 (version=SSLv3 cipher=OTHER); Thu, 13 Dec 2012 13:45:27 -0800 (PST) Message-ID: <50CA4C73.7010607@gmail.com> Date: Thu, 13 Dec 2012 13:45:23 -0800 From: Lance Norskog User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: Re: What is "flexible indexing" in Lucene 4.0 if it's not the ability to make new postings codecs? References: <5E12F7FD7D58D54DA5542ECC8B1451980205B3@MSGPEXCHA28B.mfad.mfroot.org> In-Reply-To: <5E12F7FD7D58D54DA5542ECC8B1451980205B3@MSGPEXCHA28B.mfad.mfroot.org> Content-Type: multipart/alternative; boundary="------------010304090401090609000603" X-Virus-Checked: Checked by ClamAV on apache.org --------------010304090401090609000603 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Parts-of-speech is available now, in the indexer. LUCENE-2899 adds OpenNLP to the Lucene&Solr codebase. It does parts-of-speech, chunking and Named Entity Recognition. OpenNLP is an Apache project for natural-language processing. Some parts are in Solr that could be in Lucene. https://issues.apache.org/jira/browse/lucene-2899 On 12/12/2012 12:02 PM, Wu, Stephen T., Ph.D. wrote: >>> Is there any (preliminary) code checked in somewhere that I can look at, >>> that would help me understand the practical issues that would need to be >>> addressed? >> Maybe we can make this more concrete: what new attribute are you >> needing to record in the postings and access at search time? > For example: > - part of speech of a token. > - syntactic parse subtree (over a span). > - semantically normalized phrase (to canonical text or ontological code). > - semantic group (of a span). > - coreference link. > > stephen > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > --------------010304090401090609000603--