hadoop-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Josh Patterson <j...@cloudera.com>
Subject Re: Digital Signal Processing Library + Hadoop
Date Tue, 08 Mar 2011 14:24:12 GMT
A basic time series construct is the "sliding" window in conjunction
with sorted time/value data; A sample implementation is at my github:


There are two jobs in there, one that uses the shuffle and one that
does not --- to illustrate the difference. I have a blog draft coming
that accompanies this code, I'll follow up and send you a copy draft
of it.

>From that code you should be able to build out a more complex time
series / DSP process (using it as base code), something along the
lines of a 1NN classifier:


I'm in the process of updating that older openPDC code to be more
modern and modular for general data sources.


On Sat, Mar 5, 2011 at 12:05 AM, Roger Smith <rogersmith1711@gmail.com> wrote:
> All -
> I wonder if any of you have integrated a DSP library with Hadoop.
> We are considering using Hadoop to processing time series data, but don't
> want to write standard DSP functions.
> Roger.

Twitter: @jpatanooga
Solution Architect @ Cloudera
hadoop: http://www.cloudera.com
blog: http://jpatterson.floe.tv

View raw message