Subject: Re: Triplet Extraction with OpenNLP
From: Carlos Scheidecker
To: dev@opennlp.apache.org
Date: Thu, 26 Sep 2013 14:14:37 -0600

Thanks Svetoslav,

Would you have an example of that?

Cheers,

Carlos.

On Thu, Sep 26, 2013 at 5:09 AM, Svetoslav Marinov <svetoslav.marinov@findwise.com> wrote:

> Hi Carlos,
>
> This is not exactly an answer to your question, but I am not really convinced
> that a phrase structure tree is the best way to extract triplets. A
> dependency graph is a much better option.
>
> There would be a number of NPs and PPs that are neither the subject nor
> the object, and I am not sure at all whether an adjective can be an object.
>
> However, if you want to use OpenNLP and the parse tree, maybe you can
> consider mapping the tree to FrameNet, so that you can see what kind of
> arguments a verb takes and which of these can potentially be the
> subject and the object.
>
> Best,
>
> Svetoslav
> ________________________________________
> From: Carlos Scheidecker
> Sent: 26 September 2013 11:37
> To: dev@opennlp.apache.org
> Subject: Triplet Extraction with OpenNLP
>
> Hello all,
>
> I am interested in performing triplet extraction.
>
> For that, I need to traverse the parse tree.
>
> I know how to use the ChunkerME; however, I am not sure how to use the Parser
> so that I can create a tree to traverse.
>
> Ideally, I want to obtain the subject, predicate and object.
>
> To find the subject, I need to search the NP subtree, selecting the first
> descendant of NP that is a noun via breadth-first search.
>
> To find the predicate, I will search the VP subtree; the deepest verb
> descendant in that tree will give the predicate.
>
> Now for the object(s): they can be in three different subtrees: PP, NP and ADJ.
> In NP and PP they will be the first noun, while in the ADJ we need to locate
> the first adjective.
>
> Therefore, what I need to learn is how to create the parser and the main
> tree so that I can navigate the subtrees.
>
> Thanks for the help,
>
> Carlos.
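
For reference, the traversal described in the quoted message could be sketched roughly as follows. This is only an illustration under stated assumptions, not OpenNLP code: Node, phrase, leaf and sampleTree are hypothetical stand-ins for a parse tree (with OpenNLP, the Parse class and its getChildren() and getType() methods could presumably fill the same role). The sketch assumes Penn Treebank style labels, so nouns are tags starting with NN, verbs with VB, adjectives with JJ, and the "ADJ" subtree is taken to be ADJP. It returns only the first object it finds.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

class TripletSketch {

    /** Hypothetical parse-tree node: leaves carry a POS tag and a word, phrases carry children. */
    static final class Node {
        final String label;        // assumed Penn Treebank style label, e.g. "NP", "VBD", "NN"
        final String word;         // non-null only for leaves
        final List<Node> children;

        Node(String label, String word, Node... children) {
            this.label = label;
            this.word = word;
            this.children = Arrays.asList(children);
        }
    }

    static Node phrase(String label, Node... children) { return new Node(label, null, children); }
    static Node leaf(String tag, String word) { return new Node(tag, word); }

    /** Breadth-first search for the first leaf whose tag starts with tagPrefix; null if none. */
    static Node firstLeafBfs(Node root, String tagPrefix) {
        Deque<Node> queue = new ArrayDeque<>();
        queue.add(root);
        while (!queue.isEmpty()) {
            Node n = queue.poll();
            if (n.word != null && n.label.startsWith(tagPrefix)) return n;
            queue.addAll(n.children);
        }
        return null;
    }

    /** Walks the VP level by level and keeps the first verb seen on the deepest level that has one. */
    static Node deepestVerb(Node vp) {
        Node deepest = null;
        List<Node> level = new ArrayList<>();
        level.add(vp);
        while (!level.isEmpty()) {
            List<Node> next = new ArrayList<>();
            Node firstVerbHere = null;
            for (Node n : level) {
                if (firstVerbHere == null && n.word != null && n.label.startsWith("VB")) firstVerbHere = n;
                next.addAll(n.children);
            }
            if (firstVerbHere != null) deepest = firstVerbHere;
            level = next;
        }
        return deepest;
    }

    /** First noun inside an NP or PP under the VP, or first adjective inside an ADJP (assumed label). */
    static Node firstObject(Node vp) {
        Deque<Node> queue = new ArrayDeque<>(vp.children);
        while (!queue.isEmpty()) {
            Node n = queue.poll();
            Node hit = null;
            if (n.label.equals("NP") || n.label.equals("PP")) hit = firstLeafBfs(n, "NN");
            else if (n.label.equals("ADJP")) hit = firstLeafBfs(n, "JJ");
            if (hit != null) return hit;
            queue.addAll(n.children);
        }
        return null;
    }

    /** Returns {subject, predicate, object}; an entry is null when nothing was found. */
    static String[] extract(Node sentence) {
        Node np = null, vp = null;
        for (Node c : sentence.children) {
            if (np == null && c.label.equals("NP")) np = c;
            if (vp == null && c.label.equals("VP")) vp = c;
        }
        Node subj = np == null ? null : firstLeafBfs(np, "NN");
        Node pred = vp == null ? null : deepestVerb(vp);
        Node obj = vp == null ? null : firstObject(vp);
        return new String[] { wordOf(subj), wordOf(pred), wordOf(obj) };
    }

    static String wordOf(Node n) { return n == null ? null : n.word; }

    /** Hand-built tree for "The dog chased a cat in the garden" (made-up example sentence). */
    static Node sampleTree() {
        return phrase("S",
            phrase("NP", leaf("DT", "The"), leaf("NN", "dog")),
            phrase("VP", leaf("VBD", "chased"),
                phrase("NP", leaf("DT", "a"), leaf("NN", "cat")),
                phrase("PP", leaf("IN", "in"),
                    phrase("NP", leaf("DT", "the"), leaf("NN", "garden")))));
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(extract(sampleTree()))); // prints [dog, chased, cat]
    }
}
```

As Svetoslav points out, a sentence can contain NPs and PPs that are neither subject nor object, so a rule-based walk like this one will pick the wrong phrase on many real sentences. It is meant only to show the shape of the tree navigation.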