predictionio-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christian Yonathan <christiany...@gmail.com>
Subject Re: Setup PredictionIO for large events
Date Tue, 30 Aug 2016 09:48:27 GMT
hello Digambar,

im newbie too, but according from http://predictionio.incubator.
apache.org/deploy/monitoring/
how this help,
but all my storage data are about 2.5 million which include
events+items+user.

it's so incredible project, what kind a project you working on it? hehehe
good luck!

so many thanks,
Yohan

-Christian Yonathan S-

On Tue, Aug 30, 2016 at 2:21 PM, Digambar Bhat <digambarbhat14@gmail.com>
wrote:

> Hello,
>
> I am using PredictionIO since last one  year. It's working fine for me.
>
> Earlier importing, training was working flawlessly. But now training is
> very slow as events are increased. Training almost taking 9-10 hours.
>
> Currently, events are about 15 million and items are about 10 million.
>
> Architecture is like below:
> Spark and elastic search is on two machines. Hadoop and hbase is on
> another two separate machines.
>
> Each machine has following configuration:
> 160GB ram, CPUs 40, Cores per socket 10, cpu MHz 3000
>
> So please let me know what is right configuration for such large events.
> Also let me know what possibility should I consider as my events are going
> to increase to billion. Will it work for such large data set?
>
> Thanks in advance.
>
> Thanks,
> Digambar
>

Mime
View raw message