Return-Path: X-Original-To: apmail-flink-dev-archive@www.apache.org Delivered-To: apmail-flink-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A2F0B186BB for ; Mon, 18 Jan 2016 12:40:51 +0000 (UTC) Received: (qmail 79326 invoked by uid 500); 18 Jan 2016 12:40:51 -0000 Delivered-To: apmail-flink-dev-archive@flink.apache.org Received: (qmail 79267 invoked by uid 500); 18 Jan 2016 12:40:51 -0000 Mailing-List: contact dev-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list dev@flink.apache.org Received: (qmail 79256 invoked by uid 99); 18 Jan 2016 12:40:51 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 18 Jan 2016 12:40:51 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 9B098C3DB3 for ; Mon, 18 Jan 2016 12:40:50 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.8 X-Spam-Level: * X-Spam-Status: No, score=1.8 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, KAM_LAZY_DOMAIN_SECURITY=1, RP_MATCHES_RCVD=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id oJHFWEh3iHVy for ; Mon, 18 Jan 2016 12:40:41 +0000 (UTC) Received: from sea-mail.dfki.de (sea-mail.dfki.de [134.96.191.185]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 4EB0123130 for ; Mon, 18 Jan 2016 12:40:41 +0000 (UTC) Received: from sea-mail.dfki.de (localhost.localdomain [127.0.0.1]) by localhost (Email Security Appliance) with SMTP id DDFE6A0CB6_69CDD48B for ; Mon, 18 Jan 2016 12:40:40 +0000 (GMT) Received: from mail.dfki.de (lnv-104.sb.dfki.de [134.96.191.146]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by sea-mail.dfki.de (Sophos Email Appliance) with ESMTPS id BBE049E4AF_69CDD48F for ; Mon, 18 Jan 2016 12:40:40 +0000 (GMT) Received: from [172.16.148.163] (b-employee-dynamic-148-163.b.dfki.de [172.16.148.163]) (using TLSv1 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by mail.dfki.de (Postfix) with ESMTPSA id 9053E311A0 for ; Mon, 18 Jan 2016 13:40:40 +0100 (CET) Subject: Re: Flink ML Vector and DenseVector To: dev@flink.apache.org References: <569CD85A.6050309@dfki.de> From: Hilmi Yildirim Message-ID: <569CDD48.4030704@dfki.de> Date: Mon, 18 Jan 2016 13:40:40 +0100 User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.5.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Hi, how I explained it in a previous E-Mail, I need a LabeledVector where the label is also a vector. After we discussed this issue, I created a new class named LabeledSequenceVector with the labels as a Vector. In my use case, I want to train a POS-Tagger system, so the "vector" is a vector of strings and the "labels" is also a vector of strings. If I use the Flink Vector/DenseVector implementation then the vector does only have double values but I need String values. Best Regards, Hilmi Am 18.01.2016 um 13:33 schrieb Chiwan Park: > Hi Hilmi, > > In NLP, which types are used for vector values? I think we can cover typical case using double values. > >> On Jan 18, 2016, at 9:19 PM, Hilmi Yildirim wrote: >> >> Hi, >> the Vector and DenseVector implementations of Flink ML only allow Double values. But there are cases where the values are not Doubles, e.g. in NLP. Does it make sense to make the implementations generic, i.e. Vector[T] and DenseVector[T]? >> >> Best Regards, >> Hilmi >> >> -- >> ================================================================== >> Hilmi Yildirim, M.Sc. >> Researcher >> >> DFKI GmbH >> Intelligente Analytik für Massendaten >> DFKI Projektbüro Berlin >> Alt-Moabit 91c >> D-10559 Berlin >> Phone: +49 30 23895 1814 >> >> E-Mail: Hilmi.Yildirim@dfki.de >> >> ------------------------------------------------------------- >> Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH >> Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern >> >> Geschaeftsfuehrung: >> Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender) >> Dr. Walter Olthoff >> >> Vorsitzender des Aufsichtsrats: >> Prof. Dr. h.c. Hans A. Aukes >> >> Amtsgericht Kaiserslautern, HRB 2313 >> ------------------------------------------------------------- >> > Regards, > Chiwan Park >