atlas-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shwetha G S (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (ATLAS-342) Atlas is sending an ENTITY_CREATE event to the ATLAS_ENTITIES topic even if the entity exists already.
Date Mon, 30 Nov 2015 05:47:10 GMT

     [ https://issues.apache.org/jira/browse/ATLAS-342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Shwetha G S reassigned ATLAS-342:
---------------------------------

    Assignee: Shwetha G S

> Atlas is sending an ENTITY_CREATE event to the ATLAS_ENTITIES topic even if the entity
exists already.
> ------------------------------------------------------------------------------------------------------
>
>                 Key: ATLAS-342
>                 URL: https://issues.apache.org/jira/browse/ATLAS-342
>             Project: Atlas
>          Issue Type: Bug
>            Reporter: Hemanth Yamijala
>            Assignee: Shwetha G S
>
> * Configure Hive hook in Hive.
> * Configure Atlas to talk to Kafka topics on a separately set Kafka instance.
> * Run a script to create tables in Hive multiple times, for e.g.
> {code}
> for i in `seq 1 10`; do ./bin/hive -e "create table tbl$i (column${i}1 string, column${i}2
int)"; done
> {code}
> * After the script completes, check the number of entities & list the actual entities
in the ATLAS_ENTITIES topic.
> We can see one ENTITY_CREATE event for the Hive database for every table created in Hive.
For e.g.: 
> {code}
> {"entity":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Reference","id":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Id","id":"e7ed4ad9-1fba-47ad-a089-1ff7e715c1ad","version":0,"typeName":"hive_db"},"typeName":"hive_db","values":{"name":"default","description":"Default
Hive database","ownerType":{"value":"ROLE","ordinal":2},"qualifiedName":"primary.default","locationUri":"hdfs://localhost:9000/user/hive/warehouse","ownerName":"public","clusterName":"primary"},"traitNames":[],"traits":{}},"operationType":"ENTITY_CREATE","traits":[]}
> {"entity":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Reference","id":{"jsonClass":"org.apache.atlas.typesystem.json.InstanceSerialization$_Id","id":"e7ed4ad9-1fba-47ad-a089-1ff7e715c1ad","version":0,"typeName":"hive_db"},"typeName":"hive_db","values":{"name":"default","description":"Default
Hive database","ownerType":{"value":"ROLE","ordinal":2},"qualifiedName":"primary.default","locationUri":"hdfs://localhost:9000/user/hive/warehouse","ownerName":"public","clusterName":"primary"},"traitNames":[],"traits":{}},"operationType":"ENTITY_CREATE","traits":[]}
> {code}
> The expectation is that we don't have these extraneous events as there is no change to
the entity.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message