asterixdb-dev mailing list archives

From "Mattmann, Chris A (3980)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: GSoC 2016: OpenNLP Sentiment Analysis: Status Update
Date Thu, 19 May 2016 14:01:47 GMT
Hi Chen,

Sorry, this should have gone to the Tika lists, my bad!

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

On 5/18/16, 11:33 PM, "Chen Li" <chenli@gmail.com> wrote:

>Just curious, how is this task related to AsterixDB?
>
>
>
>On Wed, May 18, 2016 at 8:57 AM, Mattmann, Chris A (3980) <
>chris.a.mattmann@jpl.nasa.gov> wrote:
>
>> Hi Everyone,
>>
>> Anastasija and I met this morning. Here are her next steps:
>>
>>
>> 0. Completed learning, installing and using GeoTopicParser in Apache Tika
>> 1. Learning about Movie Review Dataset (labeled data, yay!)
>> 2. Try to build an OpenNLP model for that
>>
>> She and I will meet again next week and report progress.
>>
>> Cheers,
>> Chris
>>
>> On 4/26/16, 12:23 PM, "Rodrigo Agerri" <rodrigo.agerri@ehu.eus> wrote:
>>
>> >Hello,
>> >
>> >Everything looks very interesting.  Other options are the Aspect Based
>> >Sentiment Analysis tasks as described in
>> >
>> >http://alt.qcri.org/semeval2014/task4/
>> >http://alt.qcri.org/semeval2015/task12/
>> >http://alt.qcri.org/semeval2016/task5/
>> >
>> >The task is well circumscribed and the data is publicly available, which
>> >helps in setting manageable objectives for a GSoC.
>> >
>> >Best,
>> >
>> >Rodrigo
>> >
>> >
>> >
>> >On Tue, Apr 26, 2016 at 6:10 PM, Anthony Beylerian
>> ><anthony.beylerian@gmail.com> wrote:
>> >> Please check this approach [1]; it could be useful to combine
>> >> a labeled seed set with unlabeled Fisher CallHome.
>> >> Since it may be a long read, there's a shorter ppt as well [2].
>> >>
>> >> [1] link.springer.com/article/10.1023%2FA%3A1007692713085
>> >> [2] cseweb.ucsd.edu/~atsmith/presentation_final.ppt
>> >>
>> >>
>> >> On Tue, Apr 26, 2016 at 11:36 PM, Joern Kottmann <kottmann@gmail.com> wrote:
>> >>
>> >>> The Large Movie Review Dataset might be interesting for this as well:
>> >>> http://ai.stanford.edu/~amaas/data/sentiment/
>> >>>
>> >>> Jörn
>> >>>
>> >>> On Tue, Apr 26, 2016 at 4:26 PM, Anthony Beylerian <
>> >>> anthony.beylerian@gmail.com> wrote:
>> >>>
>> >>> > sentiment analysis discussion doc:
>> >>> >
>> >>> > https://docs.google.com/document/d/1Gi59YqtisY4NLaVY3B7CNLMTgCRZm9JEk17kmBmWXqQ/edit?usp=sharing
>> >>> >
>> >>> > On Tue, Apr 26, 2016 at 10:56 PM, Mattmann, Chris A (3980) <
>> >>> > chris.a.mattmann@jpl.nasa.gov> wrote:
>> >>> >
>> >>> > > Hi,
>> >>> > >
>> >>> > > Sure here is the link:
>> >>> > >
>> >>> > > https://hangouts.google.com/call/a2w5cgdtirf6jgfb4ww5l2l64ee
>> >>> > >
>> >>> > > Sorry for the delay.
>> >>> > >
>> >>> > > Cheers,
>> >>> > > Chris
>> >>> > >
>> >>> > > On 4/26/16, 6:48 AM, "Anastasija Mensikova" <
>> >>> > > mensikova.anastasija@gmail.com> wrote:
>> >>> > >
>> >>> > > >Hi everyone,
>> >>> > > >
>> >>> > > >
>> >>> > > >Is the 9:40 ET hangout still happening? I just have to leave soon to go
>> >>> > > >to class.
>> >>> > > >
>> >>> > > >
>> >>> > > >Thank you,
>> >>> > > >Anastasija
>> >>> > > >
>> >>> > > >
>> >>> > > >On 25 April 2016 at 23:39, Anastasija Mensikova
>> >>> > > ><mensikova.anastasija@gmail.com> wrote:
>> >>> > > >
>> >>> > > >Hi Chris,
>> >>> > > >
>> >>> > > >
>> >>> > > >Yes, that's perfect. I'll be ready by 9:40am.
>> >>> > > >
>> >>> > > >
>> >>> > > >Thank you,
>> >>> > > >Anastasija
>> >>> > > >
>> >>> > > >
>> >>> > > >On 25 April 2016 at 23:28, Mattmann, Chris A (3980)
>> >>> > > ><chris.a.mattmann@jpl.nasa.gov> wrote:
>> >>> > > >
>> >>> > > >Hey Anastasija,
>> >>> > > >
>> >>> > > >To be honest 9am EST is a little aggressive, I will likely be able
>> >>> > > >to do 6:40 am PT (am traveling back from DC as I type this) which
>> >>> > > >is 9:40am ET.
>> >>> > > >
>> >>> > > >My GChat handle is chris.mattmann@gmail.com. I will create a hangout
>> >>> > > >and send it to the list; please contact me at 6:40am PT.
>> >>> > > >
>> >>> > > >Cheers,
>> >>> > > >Chris
>> >>> > > >
>> >>> > > >On 4/25/16, 11:07 PM, "Anastasija Mensikova" <mensikova.anastasija@gmail.com> wrote:
>> >>> > > >
>> >>> > > >>Hi everyone,
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>So is the hangout session tomorrow (Tuesday) at 6:30pm IST (9am EST)
>> >>> > > >>confirmed or not?
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>Thank you,
>> >>> > > >>Anastasija
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>On 25 April 2016 at 15:23, Madhawa Kasun Gunasekara
>> >>> > > >><madhawa30@gmail.com> wrote:
>> >>> > > >>
>> >>> > > >>Hi all,
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>Shall we have the hangout session tomorrow (Tuesday) about 18:30 IST?
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>Thanks,
>> >>> > > >>
>> >>> > > >>Madhawa
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>
>> >>> > > >>On Sun, Apr 24, 2016 at 10:33 PM, Mondher Bouazizi
>> >>> > > >><mondher.bouazizi@gmail.com> wrote:
>> >>> > > >>
>> >>> > > >>Hi,
>> >>> > > >>
>> >>> > > >>I am sorry for my late reply.
>> >>> > > >>
>> >>> > > >>Given the time difference between Japan and USA, I think I won't be
>> >>> > > >>available on weekdays. I will be available only on Friday/Saturday
>> >>> > > >>morning (9-10am EST).
>> >>> > > >>
>> >>> > > >>I am not sure if Chris is OK with that, we had our previous meetings
>> >>> > > >>on Saturday mornings.
>> >>> > > >>
>> >>> > > >>Otherwise, please go ahead. I will join as soon as I can.
>> >>> > > >>
>> >>> > > >>Thanks.
>> >>> > > >>
>> >>> > > >>@Chris: my github ID is mondher-bouazizi
>> >>> > > >>
>> >>> > > >>Best regards,
>> >>> > > >>
>> >>> > > >>Mondher
>> >>> > > >>
>> >>> > > >>On Mon, Apr 25, 2016 at 1:44 AM, Anastasija Mensikova <mensikova.anastasija@gmail.com> wrote:
>> >>> > > >>
>> >>> > > >>> Hi Anthony,
>> >>> > > >>>
>> >>> > > >>> I can make it by Madhawa's proposal too, after 6pm IST on Tuesday
>> >>> > > >>> (after 8:30am EST). Let me know when exactly!
>> >>> > > >>>
>> >>> > > >>> Thank you,
>> >>> > > >>> Anastasija
>> >>> > > >>>
>> >>> > > >>> On 24 April 2016 at 03:02, Anthony Beylerian <anthony.beylerian@gmail.com> wrote:
>> >>> > > >>>
>> >>> > > >>>> Hi Anastasija,
>> >>> > > >>>>
>> >>> > > >>>> I'm not available by those times (00-07 JST). I could make it by
>> >>> > > >>>> Madhawa's proposal, but otherwise please go ahead, we may discuss
>> >>> > > >>>> some other time.
>> >>> > > >>>>
>> >>> > > >>>> @Chris: github ID : beylerian
>> >>> > > >>>>
>> >>> > > >>>> Best,
>> >>> > > >>>>
>> >>> > > >>>> Anthony
>> >>> > > >>>>
>> >>> > > >>>>
>> >>> > > >>>> Please find my github profile:
>> >>> > > >>>> https://github.com/madhawa-gunasekara
>> >>> > > >>>>
>> >>> > > >>>> Madhawa
>> >>> > > >>>>
>> >>> > > >>>> On Sun, Apr 24, 2016 at 12:13 AM, Madhawa Kasun Gunasekara <madhawa30@gmail.com> wrote:
>> >>> > > >>>>
>> >>> > > >>>> > Hi Chris,
>> >>> > > >>>> >
>> >>> > > >>>> > I'm available on Tuesday & Wednesday after 6.00 pm IST.
>> >>> > > >>>> >
>> >>> > > >>>> > Thanks,
>> >>> > > >>>> > Madhawa
>> >>> > > >>>> >
>> >>> > > >>>> > On Sat, Apr 23, 2016 at 11:38 PM, Anastasija Mensikova <mensikova.anastasija@gmail.com> wrote:
>> >>> > > >>>> >
>> >>> > > >>>> >> Hi Chris,
>> >>> > > >>>> >>
>> >>> > > >>>> >> Thank you very much for your email. I'm so excited to work with you!
>> >>> > > >>>> >>
>> >>> > > >>>> >> My Github name is amensiko.
>> >>> > > >>>> >>
>> >>> > > >>>> >> And yes, next week sounds good! I'm available on: Tuesday at
>> >>> > > >>>> >> 4:20pm EST, Thursday 11am - 2:30pm and 4:20 - 6pm EST, Friday
>> >>> > > >>>> >> 11am - 3pm EST.
>> >>> > > >>>> >>
>> >>> > > >>>> >> Thank you,
>> >>> > > >>>> >> Anastasija
>> >>> > > >>>> >>
>> >>> > > >>>> >> On 23 April 2016 at 10:21, Mattmann, Chris A (3980) <chris.a.mattmann@jpl.nasa.gov> wrote:
>> >>> > > >>>> >>
>> >>> > > >>>> >>> Hi Anastasija,
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> Hope you are well. It’s now time to get started on the project.
>> >>> > > >>>> >>> Mondher, Anthony, Madhawa and I have been discussing ideas about
>> >>> > > >>>> >>> how to proceed with the project and even developing a task list.
>> >>> > > >>>> >>> Let’s get your tasks input into that list, and also coordinate.
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> I also have an action to share some Spanish/English data to try
>> >>> > > >>>> >>> and do cross lingual sentiment analysis.
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> Are you available to chat this week?
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> Cheers,
>> >>> > > >>>> >>> Chris
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> On 4/23/16, 4:49 AM, "Anthony Beylerian" <anthony.beylerian@gmail.com> wrote:
>> >>> > > >>>> >>>
>> >>> > > >>>> >>> >Hello,
>> >>> > > >>>> >>> >
>> >>> > > >>>> >>> >Congratulations on being accepted for this year's GSoC.
>> >>> > > >>>> >>> >Although Mondher and myself will not participate this year as
>> >>> > > >>>> >>> >students, we will do our best to help.
>> >>> > > >>>> >>> >We are currently busy with academic research, but will join the
>> >>> > > >>>> >>> >efforts when possible.
>> >>> > > >>>> >>> >Otherwise, for any discussion concerning the proposed approaches,
>> >>> > > >>>> >>> >please let us know.
>> >>> > > >>>> >>> >
>> >>> > > >>>> >>> >Best,
>> >>> > > >>>> >>> >
>> >>> > > >>>> >>> >On Sat, Apr 23, 2016 at 6:02 PM, Madhawa Kasun Gunasekara <madhawa30@gmail.com> wrote:
>> >>> > > >>>> >>> >
>> >>> > > >>>> >>> >> Sure we will start working on this.
>> >>> > > >>>> >>> >>
>> >>> > > >>>> >>> >> Thanks,
>> >>> > > >>>> >>> >> Madhawa
>> >>> > > >>>> >>> >>
>> >>> > > >>>> >>> >> On Sat, Apr 23, 2016 at 1:38 AM, Chris Mattmann <mattmann@apache.org> wrote:
>> >>> > > >>>> >>> >>
>> >>> > > >>>> >>> >>> Congrats!
>> >>> > > >>>> >>> >>>
>> >>> > > >>>> >>> >>> time to get started team.
>> >>> > > >>>> >>> >>>