atlas-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sandeep Nayak <datacacoph...@gmail.com>
Subject Re: Interest in Apache Atlas
Date Sat, 03 Dec 2016 18:30:57 GMT
Hi all,

Sending a reminder, I am looking for answers to the questions below. Can
someone help?

Thanks in advance for your attention.

- Sandeep

On Thu, Dec 1, 2016 at 12:13 AM, Sandeep Nayak <datacacophony@gmail.com>
wrote:

> Hi all,
>
> I had asked a couple questions to Venkatesh earlier please see email
> below. He recommended that I move the questions to the dev mailing list and
> thus this mail.
>
> To follow up on the questions asked below to my queries
>
> (a) Multi-tenancy: If I were to bring in data-sets from different
> customers then I need to record, annotate or tag and provide access to
> data-sets only to the relevant owners. Is it possible for me to record and
> manage data-sets for different customers in a single Atlas instance? Does
> Atlas provide me with the necessary constructs to separate recording of
> data-sets by tenant and tracking metadata etc by tenant?
>
> (c) Performance Numbers: I understand it is built to scale given the use
> of HBase but any performance numbers that can be shared will be helpful.
> E.g. Is there a limit to the number of data-sets I can record on Atlas? Are
> there performance numbers on the number of queries?
>
> (d) Are there companies using Atlas in production at this stage?
>
> Thanks in advance for your responses.
>
> - Sandeep
>
>
>
>
> On Fri, Nov 18, 2016 at 9:10 AM, Venkatesh Seetharam <venkatesh@apache.org
> > wrote:
>
>> Sandeep - please use the dev mailing list for atlas for a prompt response.
>>
>> (a) How can one achieve multi-tenancy on Apache Atlas?
>> Can you pls elaborate? You can always have a package structure for your
>> data sets.
>>
>> (b) Is Atlas ready for production usage?
>> It depends, I think it is but needs some scripting around BCP, etc.
>>
>> (c) Are there published numbers on the volume of data-sets Atlas can
>> manage?
>> Its built to scale, uses Titan & Hbase as a backend store which is known
>> to scale.
>>
>> On Fri, Nov 4, 2016 at 12:02 PM Sandeep Nayak <datacacophony@gmail.com>
>> wrote:
>>
>>> Hi Venkatesh,
>>>
>>> I apologize for the direct email, if there is a better channel to
>>> surface my questions I will be happy to go there. I am subscribed to
>>> dev@atlas but thought that may not be the right forum for questions
>>> potential Atlas users may have.
>>>
>>> I am looking for Data Catalog solutions and in early evaluation and from
>>> what I read so far it appears Apache Atlas provides most of the
>>> capabilities I am looking for. Namely data-set registration, lineage
>>> tracking, access control (via Ranger), auditing to name a few.
>>>
>>> I do have a couple questions which will help me in my evaluation
>>>
>>> (a) How can one achieve multi-tenancy on Apache Atlas?
>>> (b) Is Atlas ready for production usage?
>>> (c) Are there published numbers on the volume of data-sets Atlas can
>>> manage? One of the requirements I pointed out above is data lineage and if
>>> I am ingesting streaming and batch data sets the typical volumes could be
>>> very high.
>>>
>>> Hoping you will point me in the right direction to get answers.
>>>
>>> Thanks for your time and help.
>>>
>>> Regards,
>>>
>>> Sandeep
>>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message