asterixdb-dev mailing list archives

From Mike Carey <dtab...@gmail.com>
Subject Re: GSoC 2016: OpenNLP Sentiment Analysis: Status Update
Date Fri, 20 May 2016 06:08:57 GMT
And here I was hoping you had a summer project planned to add NLP
processing to the AsterixDB Twitter feed...!
On May 19, 2016 4:02 PM, "Mattmann, Chris A (3980)" <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hi Chen,
>
> Sorry, this should have gone to the Tika lists, my bad!
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattmann@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Director, Information Retrieval and Data Science Group (IRDS)
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> WWW: http://irds.usc.edu/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
> On 5/18/16, 11:33 PM, "Chen Li" <chenli@gmail.com> wrote:
>
> >Just curious, how is this task related to AsterixDB?
> >
> >
> >
> >On Wed, May 18, 2016 at 8:57 AM, Mattmann, Chris A (3980) <
> >chris.a.mattmann@jpl.nasa.gov> wrote:
> >
> >> Hi Everyone,
> >>
> >> Anastasija and I met this morning. Here are her next steps:
> >>
> >>
> >> 0. Completed learning, installing and using GeoTopicParser in Apache Tika
> >> 1. Learning about Movie Review Dataset (labeled data, yay!)
> >> 2. Try and build OpenNLP model for that
> >>
> >> She and I will meet again next week and report progress.
> >>
> >> Cheers,
> >> Chris
> >>
> >> On 4/26/16, 12:23 PM, "Rodrigo Agerri" <rodrigo.agerri@ehu.eus> wrote:
> >>
> >> >Hello,
> >> >
> >> >Everything looks very interesting.  Other options are the Aspect Based
> >> >Sentiment Analysis tasks as described in
> >> >
> >> >http://alt.qcri.org/semeval2014/task4/
> >> >http://alt.qcri.org/semeval2015/task12/
> >> >http://alt.qcri.org/semeval2016/task5/
> >> >
> >> >The task is well circumscribed plus data is publicly available, which
> >> >is good to try and make manageable objectives for a GSOC.
> >> >
> >> >Best,
> >> >
> >> >Rodrigo
> >> >
> >> >
> >> >
> >> >On Tue, Apr 26, 2016 at 6:10 PM, Anthony Beylerian
> >> ><anthony.beylerian@gmail.com> wrote:
> >> >> Please check this approach [1]; it could be useful to combine
> >> >> a labeled seed set with unlabeled Fisher CallHome.
> >> >> Since it may be a long read, there's a shorter ppt as well [2]
> >> >>
> >> >> [1] link.springer.com/article/10.1023%2FA%3A1007692713085
> >> >> [2] cseweb.ucsd.edu/~atsmith/presentation_final.ppt
> >> >>
> >> >>
> >> >> On Tue, Apr 26, 2016 at 11:36 PM, Joern Kottmann <kottmann@gmail.com> wrote:
> >> >>
> >> >>> The Large Movie Review Dataset might be interesting for this as well:
> >> >>> http://ai.stanford.edu/~amaas/data/sentiment/
> >> >>>
> >> >>> Jörn
> >> >>>
> >> >>> On Tue, Apr 26, 2016 at 4:26 PM, Anthony Beylerian <
> >> >>> anthony.beylerian@gmail.com> wrote:
> >> >>>
> >> >>> > sentiment analysis discussion doc:
> >> >>> >
> >> >>> > https://docs.google.com/document/d/1Gi59YqtisY4NLaVY3B7CNLMTgCRZm9JEk17kmBmWXqQ/edit?usp=sharing
> >> >>> >
> >> >>> > On Tue, Apr 26, 2016 at 10:56 PM, Mattmann, Chris A (3980)
> >> >>> > <chris.a.mattmann@jpl.nasa.gov> wrote:
> >> >>> >
> >> >>> > > Hi,
> >> >>> > >
> >> >>> > > Sure here is the link:
> >> >>> > >
> >> >>> > > https://hangouts.google.com/call/a2w5cgdtirf6jgfb4ww5l2l64ee
> >> >>> > >
> >> >>> > > Sorry for the delay.
> >> >>> > >
> >> >>> > > Cheers,
> >> >>> > > Chris
> >> >>> > >
> >> >>> > > On 4/26/16, 6:48 AM, "Anastasija Mensikova" <
> >> >>> > > mensikova.anastasija@gmail.com> wrote:
> >> >>> > >
> >> >>> > > >Hi everyone,
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >Is the 9:40 ET hangout still happening? I just have to leave soon to go
> >> >>> > > >to class.
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >Thank you,
> >> >>> > > >Anastasija
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >On 25 April 2016 at 23:39, Anastasija Mensikova
> >> >>> > > ><mensikova.anastasija@gmail.com> wrote:
> >> >>> > > >
> >> >>> > > >Hi Chris,
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >Yes, that's perfect. I'll be ready by 9:40am.
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >Thank you,
> >> >>> > > >Anastasija
> >> >>> > > >
> >> >>> > > >
> >> >>> > > >On 25 April 2016 at 23:28, Mattmann, Chris A (3980)
> >> >>> > > ><chris.a.mattmann@jpl.nasa.gov> wrote:
> >> >>> > > >
> >> >>> > > >Hey Anastasija,
> >> >>> > > >
> >> >>> > > >To be honest, 9am EST is a little aggressive. I will likely be able
> >> >>> > > >to do 6:40am PT (am traveling back from DC as I type this), which
> >> >>> > > >is 9:40am ET.
> >> >>> > > >
> >> >>> > > >My GChat handle is chris.mattmann@gmail.com. I will create a hangout
> >> >>> > > >and send it to the list; please contact me at 6:40am PT.
> >> >>> > > >
> >> >>> > > >Cheers,
> >> >>> > > >Chris
> >> >>> > > >
> >> >>> > > >On 4/25/16, 11:07 PM, "Anastasija Mensikova" <
> >> >>> > > mensikova.anastasija@gmail.com> wrote:
> >> >>> > > >
> >> >>> > > >>Hi everyone,
> >> >>> > > >>
> >> >>> > > >>
> >> >>> > > >>So is the hangout session tomorrow (Tuesday) at 6:30pm IST (9am EST)
> >> >>> > > >>confirmed or not?
> >> >>> > > >>
> >> >>> > > >>
> >> >>> > > >>Thank you,
> >> >>> > > >>Anastasija
> >> >>> > > >>
> >> >>> > > >>
> >> >>> > > >>On 25 April 2016 at 15:23, Madhawa Kasun Gunasekara
> >> >>> > > >><madhawa30@gmail.com> wrote:
> >> >>> > > >>
> >> >>> > > >>Hi all,
> >> >>> > > >>
> >> >>> > > >>
> >> >>> > > >>Shall we have the hangout session tomorrow (Tuesday) about 18:30 IST?
> >> >>> > > >>
> >> >>> > > >>
> >> >>> > > >>Thanks,
> >> >>> > > >>
> >> >>> > > >>Madhawa
> >> >>> > > >>
> >> >>> > > >>On Sun, Apr 24, 2016 at 10:33 PM, Mondher Bouazizi
> >> >>> > > >><mondher.bouazizi@gmail.com> wrote:
> >> >>> > > >>
> >> >>> > > >>Hi,
> >> >>> > > >>
> >> >>> > > >>I am sorry for my late reply.
> >> >>> > > >>
> >> >>> > > >>Given the time difference between Japan and USA, I think I won't be
> >> >>> > > >>available on weekdays. I will be available only on Friday/Saturday
> >> >>> > > >>morning (9-10am EST).
> >> >>> > > >>
> >> >>> > > >>I am not sure if Chris is OK with that, we had our previous meetings
> >> >>> > > >>on Saturday mornings.
> >> >>> > > >>
> >> >>> > > >>Otherwise, please go ahead. I will join as soon as I can.
> >> >>> > > >>
> >> >>> > > >>Thanks.
> >> >>> > > >>
> >> >>> > > >>@Chris: my github ID is mondher-bouazizi
> >> >>> > > >>
> >> >>> > > >>Best regards,
> >> >>> > > >>
> >> >>> > > >>Mondher
> >> >>> > > >>
> >> >>> > > >>On Mon, Apr 25, 2016 at 1:44 AM, Anastasija Mensikova
> >> >>> > > >><mensikova.anastasija@gmail.com> wrote:
> >> >>> > > >>
> >> >>> > > >>> Hi Anthony,
> >> >>> > > >>>
> >> >>> > > >>> I can make it by Madhawa's proposal too, after 6pm IST on Tuesday
> >> >>> > > >>> (after 8:30am EST). Let me know when exactly!
> >> >>> > > >>>
> >> >>> > > >>> Thank you,
> >> >>> > > >>> Anastasija
> >> >>> > > >>>
> >> >>> > > >>> On 24 April 2016 at 03:02, Anthony Beylerian
> >> >>> > > >>> <anthony.beylerian@gmail.com> wrote:
> >> >>> > > >>>
> >> >>> > > >>>> Hi Anastasija,
> >> >>> > > >>>>
> >> >>> > > >>>> I'm not available by those times (00-07 JST).  I could make it by
> >> >>> > > >>>> Madhawa's proposal, but otherwise please go ahead, we may discuss
> >> >>> > > >>>> some other time.
> >> >>> > > >>>>
> >> >>> > > >>>> @Chris: github ID : beylerian
> >> >>> > > >>>>
> >> >>> > > >>>> Best,
> >> >>> > > >>>>
> >> >>> > > >>>> Anthony
> >> >>> > > >>>>
> >> >>> > > >>>>
> >> >>> > > >>>> Please find my github profile:
> >> >>> > > >>>> https://github.com/madhawa-gunasekara
> >> >>> > > >>>>
> >> >>> > > >>>> Madhawa
> >> >>> > > >>>>
> >> >>> > > >>>> On Sun, Apr 24, 2016 at 12:13 AM, Madhawa Kasun Gunasekara
> >> >>> > > >>>> <madhawa30@gmail.com> wrote:
> >> >>> > > >>>>
> >> >>> > > >>>> > Hi Chris,
> >> >>> > > >>>> >
> >> >>> > > >>>> > I'm available on Tuesday & Wednesday after 6.00 pm IST.
> >> >>> > > >>>> >
> >> >>> > > >>>> > Thanks,
> >> >>> > > >>>> > Madhawa
> >> >>> > > >>>> >
> >> >>> > > >>>> > On Sat, Apr 23, 2016 at 11:38 PM, Anastasija Mensikova
> >> >>> > > >>>> > <mensikova.anastasija@gmail.com> wrote:
> >> >>> > > >>>> >
> >> >>> > > >>>> >> Hi Chris,
> >> >>> > > >>>> >>
> >> >>> > > >>>> >> Thank you very much for your email. I'm so excited to work with you!
> >> >>> > > >>>> >>
> >> >>> > > >>>> >> My Github name is amensiko.
> >> >>> > > >>>> >>
> >> >>> > > >>>> >> And yes, next week sounds good! I'm available on: Tuesday at 4:20pm EST,
> >> >>> > > >>>> >> Thursday 11am - 2:30pm and 4:20 - 6pm EST, Friday 11am - 3pm EST.
> >> >>> > > >>>> >>
> >> >>> > > >>>> >> Thank you,
> >> >>> > > >>>> >> Anastasija
> >> >>> > > >>>> >>
> >> >>> > > >>>> >> On 23 April 2016 at 10:21, Mattmann, Chris A (3980)
> >> >>> > > >>>> >> <chris.a.mattmann@jpl.nasa.gov> wrote:
> >> >>> > > >>>> >>
> >> >>> > > >>>> >>> Hi Anastasija,
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> Hope you are well. It’s now time to get started on the project.
> >> >>> > > >>>> >>> Mondher, Anthony, Madhawa and I have been discussing ideas about
> >> >>> > > >>>> >>> how to proceed with the project and even developing a task list.
> >> >>> > > >>>> >>> Let’s get your tasks input into that list, and also coordinate.
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> I also have an action to share some Spanish/English data to try
> >> >>> > > >>>> >>> and do cross lingual sentiment analysis.
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> Are you available to chat this week?
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> Cheers,
> >> >>> > > >>>> >>> Chris
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> On 4/23/16, 4:49 AM, "Anthony Beylerian"
> >> >>> > > >>>> >>> <anthony.beylerian@gmail.com> wrote:
> >> >>> > > >>>> >>>
> >> >>> > > >>>> >>> >Hello,
> >> >>> > > >>>> >>> >
> >> >>> > > >>>> >>> >Congratulations on being accepted for this year's GSoC.
> >> >>> > > >>>> >>> >Although Mondher and I will not participate this year as students,
> >> >>> > > >>>> >>> >we will do our best to help.
> >> >>> > > >>>> >>> >We are currently busy with academic research, but will join the
> >> >>> > > >>>> >>> >efforts when possible.
> >> >>> > > >>>> >>> >Otherwise, for any discussion concerning the proposed approaches,
> >> >>> > > >>>> >>> >please let us know.
> >> >>> > > >>>> >>> >
> >> >>> > > >>>> >>> >Best,
> >> >>> > > >>>> >>> >
> >> >>> > > >>>> >>> >On Sat, Apr 23, 2016 at 6:02 PM, Madhawa Kasun Gunasekara
> >> >>> > > >>>> >>> ><madhawa30@gmail.com> wrote:
> >> >>> > > >>>> >>> >
> >> >>> > > >>>> >>> >> Sure we will start working on this.
> >> >>> > > >>>> >>> >>
> >> >>> > > >>>> >>> >> Thanks,
> >> >>> > > >>>> >>> >> Madhawa
> >> >>> > > >>>> >>> >>
> >> >>> > > >>>> >>> >> On Sat, Apr 23, 2016 at 1:38 AM, Chris Mattmann <mattmann@apache.org>
> >> >>> > > >>>> >>> >> wrote:
> >> >>> > > >>>> >>> >>
> >> >>> > > >>>> >>> >>> Congrats!
> >> >>> > > >>>> >>> >>>
> >> >>> > > >>>> >>> >>> time to get started team.
> >> >>> > > >>>> >>> >>>
