Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 64C9F10136 for ; Thu, 20 Nov 2014 15:25:36 +0000 (UTC) Received: (qmail 22197 invoked by uid 500); 20 Nov 2014 15:25:25 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 22048 invoked by uid 500); 20 Nov 2014 15:25:25 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 22014 invoked by uid 99); 20 Nov 2014 15:25:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Nov 2014 15:25:24 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of koji@r.email.ne.jp designates 202.224.39.197 as permitted sender) Received: from [202.224.39.197] (HELO mail1.asahi-net.or.jp) (202.224.39.197) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 Nov 2014 15:24:59 +0000 Received: from Koji3.local (ad004171.dynamic.ppp.asahi-net.or.jp [180.235.4.171]) by mail1.asahi-net.or.jp (Postfix) with ESMTP id D9CA7EB7C; Fri, 21 Nov 2014 00:24:52 +0900 (JST) Message-ID: <546E07C9.2080309@r.email.ne.jp> Date: Fri, 21 Nov 2014 00:24:57 +0900 From: Koji Sekiguchi User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:24.0) Gecko/20100101 Thunderbird/24.6.0 MIME-Version: 1.0 To: java-user@lucene.apache.org, "solr-user@lucene.apache.org" Subject: Re: [ANN] word2vec for Lucene References: <546DB022.6090107@r.email.ne.jp> <9E43DC60-7E74-441C-9D56-D7CC17273642@hoplahup.net> In-Reply-To: <9E43DC60-7E74-441C-9D56-D7CC17273642@hoplahup.net> Content-Type: text/plain; charset=ISO-2022-JP Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi Paul, I cannot compare it to SemanticVectors as I don't know SemanticVectors. But word vectors that are produced by word2vec have interesting properties. Here is the description of the original word2vec web site: https://code.google.com/p/word2vec/#Interesting_properties_of_the_word_vectors Interesting properties of the word vectors It was recently shown that the word vectors capture many linguistic regularities, for example vector operations vector('Paris') - vector('France') + vector('Italy') results in a vector that is very close to vector('Rome'), and vector('king') - vector('man') + vector('woman') is close to vector('queen') Thanks, Koji (2014/11/20 20:01), Paul Libbrecht wrote: > Hello Koji, > > how would you compare that to SemanticVectors? > > paul > > On 20 nov. 2014, at 10:10, Koji Sekiguchi wrote: > >> Hello, >> >> It's my pleasure to share that I have an interesting tool "word2vec for Lucene" >> available at https://github.com/kojisekig/word2vec-lucene . >> >> As you can imagine, you can use "word2vec for Lucene" to extract word vectors from Lucene index. >> >> Thank you, >> >> Koji >> -- >> http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > -- http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org