lucene-general mailing list archives

From "James" <>
Subject RE: Open Relevance Infrastucture Request
Date Tue, 26 May 2009 15:35:49 GMT
Hey Ted,

What are the bandwidth and access requirements for the files?  We'd consider donating
this, but I'm not sure exactly how it would work.  We can't run a completely open FTP server,
since abusive people would find out about it and start storing movies and such.  Would it be
read-only, or would we set up accounts for interested parties?

BTW, a mention on the project page would be great, but not necessary -- our site is free so
any visibility is always appreciated.

James Ryley, Ph.D.

This communication is to be treated as confidential and the information in it may not be used
or disclosed except for the purpose for which it has been sent. Nothing contained herein nor
on a related web site should be construed as legal or patenting advice.

> -----Original Message-----
> From: Ted Dunning []
> Sent: Tuesday, May 26, 2009 11:21 AM
> To:
> Subject: Re: Open Relevance Infrastucture Request
> The cost for storing a few hundred GB of data would be < $100/month.
> The cost for transfer would be $17/100GB which could add up fairly quickly
> if more than dozens of downloads happen.  My guess is that would be
> unlikely.
>
> Another option is to request Amazon host the dataset as a public dataset:
>
> On Tue, May 26, 2009 at 7:14 AM, Grant Ingersoll <> wrote:
> >> You can cap the cost by limiting how much data you store right? You can use
> >> RequesterPayBuckets
> >>
> >> ets.html to
> >> move the cost onto the users who want the data. Per user, it would still be
> >> fairly cheap. You get the added bonus of other S3 services, like being able
> >> to send a device back and forth to import/export on site. You would just pay
> >> for storage and transferring the data in - both cap-able by limiting the
> >> amount of data you put in it.
> >>
> >>
> > One of the goals is to make the data available for free, so I don't think
> > this would work.  Currently, one can get the TREC data for a nominal fee as
> > well.
> --
> Ted Dunning, CTO
> DeepDyve
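
For a rough sense of how the transfer figure quoted above scales, here is a minimal sketch of the arithmetic. The $17/100GB rate comes from the quoted message; the 300 GB dataset size and the download counts are assumptions chosen only for illustration, and the function name is hypothetical.

```python
# Rate taken from the quoted message ("$17/100GB"); everything else is assumed.
TRANSFER_USD_PER_GB = 17 / 100


def transfer_cost_usd(dataset_gb, downloads):
    """Total outbound transfer cost for `downloads` full copies of the dataset."""
    return dataset_gb * downloads * TRANSFER_USD_PER_GB


if __name__ == "__main__":
    dataset_gb = 300  # assumed stand-in for "a few hundred GB"
    for downloads in (1, 12, 24, 100):  # assumed download counts
        cost = transfer_cost_usd(dataset_gb, downloads)
        print(f"{downloads:>3} downloads: ${cost:,.2f}")
```

Under these assumptions the transfer bill grows linearly with the number of full downloads, which is why the download count matters more than the monthly storage charge.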
