kylin-dev mailing list archives

From Li Yang <liy...@apache.org>
Subject Re: ERROR WHEN Saving Cuboid Statistics
Date Sun, 10 Jun 2018 00:48:05 GMT
The missing file

hdfs:/kylin/kylin_metadata/kylin-6faa47ea-3a5b-4020-976f-c9fcf9d93bd2/kylin_sales_cube/fact_distinct_columns/statistics

should be the output of the "Fact distinct values" step.

I would check 1) whether the file is really there, and 2) whether kylin.log
shows anything suspicious around the "Fact distinct values" step.

If you find nothing, discarding and re-submitting the job is a temporary
workaround.
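
For the first check, listing the path with "hdfs dfs -ls" is enough. For the second, a rough sketch like the one below may help narrow down kylin.log. It assumes the log lines look like the ones quoted in this thread (timestamp, level, thread name in square brackets, then the message) and that stack-trace lines follow the entry they belong to. The function name suspicious_entries is made up for illustration and is not part of Kylin.

```python
import re

# Assumed layout, taken from the excerpt quoted in this thread:
#   2018-06-07 16:08:18,771 ERROR [Job <job-id>-424] steps.SaveStatisticsStep:82 : message
LOG_LINE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) (\w+)\s+\[([^\]]+)\]\s*(.*)$"
)


def suspicious_entries(lines, job_id, levels=("WARN", "ERROR")):
    """Return (timestamp, level, message) tuples for one job.

    Only entries whose thread name contains job_id and whose level is in
    `levels` are kept. Lines that do not start with a timestamp (for
    example stack-trace frames) are appended to the entry they follow.
    """
    entries = []
    current = None  # entry that continuation lines should be attached to
    for raw in lines:
        line = raw.rstrip("\n")
        match = LOG_LINE.match(line)
        if match:
            timestamp, level, thread, message = match.groups()
            current = None
            if job_id in thread and level in levels:
                current = [timestamp, level, message]
                entries.append(current)
        elif current is not None:
            current[2] += "\n" + line
    return [tuple(entry) for entry in entries]
```

To use it, open kylin.log, pass the file object and the job id (here 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2) to the function, and read through the returned entries that are timestamped during the "Fact distinct values" step.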

On Fri, Jun 8, 2018 at 11:08 AM, op <520075694@qq.com> wrote:

>
> Hi everyone, I am running the example kylin_sales_model.
>
> At the "Save Cuboid Statistics" step, the log reports an error;
> details from kylin.log are below.
>
> There isn't any other error information, and I can't find out what's
> wrong. Can you help me, please?
> My versions:
> kylin 2.0.0
> hbase 1.2.6
> hadoop 2.9.0
> hive 2.3.2 (beeline)
>
> 2018-06-07 16:08:18,252 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> conf.HiveConf:4116 : HiveConf of name hive.mapred.supports.subdirectories
> does not exist
> 2018-06-07 16:08:18,253 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:407 : Trying to connect to metastore with URI
> thrift://hadoop1001:9083
> 2018-06-07 16:08:18,254 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:481 : Opened a connection to metastore, current connections:
> 51
> 2018-06-07 16:08:18,254 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:534 : Connected to metastore.
> 2018-06-07 16:08:18,341 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> lookup.SnapshotManager:214 : Loading snapshotTable from
> /table_snapshot/KYLIN.KYLIN_CAL_DT/c611bd70-b196-45ef-9edc-18d6a1126277.snapshot,
> with loadData: false
> 2018-06-07 16:08:18,384 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> lookup.SnapshotManager:130 : Identical input FileSignature
> [path=hdfs://beh/user/hive/warehouse/kylin.db/kylin_cal_dt, size=512997,
> lastModifiedTime=1528356298671], reuse existing snapshot at
> /table_snapshot/KYLIN.KYLIN_CAL_DT/c611bd70-b196-45ef-9edc-18d6a1126277
> .snapshot
> 2018-06-07 16:08:18,384 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeManager:366 : Updating cube instance 'kylin_sales_cube'
> 2018-06-07 16:08:18,384 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeValidator:102 : NEW segment start does not fit/connect with other
> segments: kylin_sales_cube[20120101000000_20121231000000]
> 2018-06-07 16:08:18,384 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeValidator:104 : NEW segment end does not fit/connect with other
> segments: kylin_sales_cube[20120101000000_20121231000000]
> 2018-06-07 16:08:18,387 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:91 : Building snapshot of KYLIN.KYLIN_COUNTRY
> 2018-06-07 16:08:18,387 INFO  [pool-11-thread-1]
> cachesync.Broadcaster:123 : Announcing new broadcast event:
> BroadcastEvent{entity=cube, event=update, cacheKey=kylin_sales_cube}
> 2018-06-07 16:08:18,391 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:189 : Broadcasting metadata change: entity=cube,
> event=UPDATE, cacheKey=kylin_sales_cube, listeners=[org.apache.kylin.
> cube.CubeManager$CubeSyncListener@11f9b854, org.apache.kylin.storage.
> hybrid.HybridManager$HybridSyncListener@581a3b84,
> org.apache.kylin.rest.service.CacheService$1@6c4e4197]
> 2018-06-07 16:08:18,393 INFO  [http-bio-7070-exec-3] cube.CubeManager:785
> : Reloaded cube kylin_sales_cube being CUBE[name=kylin_sales_cube] having 1
> segments
> 2018-06-07 16:08:18,393 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:189 : Broadcasting metadata change:
> entity=project_data, event=UPDATE, cacheKey=learn_kylin,
> listeners=[org.apache.kylin.metadata.project.ProjectManager$
> ProjectSyncListener@51853c91, org.apache.kylin.metadata.MetadataManager$
> SrcTableSyncListener@67cc135, org.apache.kylin.metadata.MetadataManager$
> SrcTableExtSyncListener@33eb1107, org.apache.kylin.metadata.
> MetadataManager$DataModelSyncListener@6d749bd4, org.apache.kylin.metadata.
> MetadataManager$ExtFilterSyncListener@9f57e2e, org.apache.kylin.cube.
> CubeDescManager$CubeDescSyncListener@71ecba35, org.apache.kylin.cube.
> CubeManager$CubeSyncListener@11f9b854, org.apache.kylin.storage.
> hybrid.HybridManager$HybridSyncListener@581a3b84,
> org.apache.kylin.metadata.realization.RealizationRegistry$1@7a3c2aa7,
> org.apache.kylin.rest.service.CacheService$1@6c4e4197]
> 2018-06-07 16:08:18,394 INFO  [http-bio-7070-exec-3]
> service.CacheService:135 : removeOLAPDataSource is called for project
> learn_kylin
> 2018-06-07 16:08:18,394 INFO  [http-bio-7070-exec-3]
> service.CacheService:117 : cleaning cache for project learn_kylin
> (currently remove all entries)
> 2018-06-07 16:08:18,394 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:219 : Done broadcasting metadata change:
> entity=project_data, event=UPDATE, cacheKey=learn_kylin
> 2018-06-07 16:08:18,395 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:219 : Done broadcasting metadata change: entity=cube,
> event=UPDATE, cacheKey=kylin_sales_cube
> 2018-06-07 16:08:18,511 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> conf.HiveConf:4116 : HiveConf of name hive.server2.custom.authentication.file
> does not exist
> 2018-06-07 16:08:18,512 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> conf.HiveConf:4116 : HiveConf of name hive.mapred.supports.subdirectories
> does not exist
> 2018-06-07 16:08:18,513 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:407 : Trying to connect to metastore with URI
> thrift://hadoop1001:9083
> 2018-06-07 16:08:18,513 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:481 : Opened a connection to metastore, current connections:
> 52
> 2018-06-07 16:08:18,514 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> hive.metastore:534 : Connected to metastore.
> 2018-06-07 16:08:18,595 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> lookup.SnapshotManager:214 : Loading snapshotTable from
> /table_snapshot/KYLIN.KYLIN_COUNTRY/02c57ee0-b8a3-47eb-bedc-ebd7d9caeb60.snapshot,
> with loadData: false
> 2018-06-07 16:08:18,639 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> lookup.SnapshotManager:130 : Identical input FileSignature
> [path=hdfs://beh/user/hive/warehouse/kylin.db/kylin_country, size=8537,
> lastModifiedTime=1528356298714], reuse existing snapshot at
> /table_snapshot/KYLIN.KYLIN_COUNTRY/02c57ee0-b8a3-47eb-
> bedc-ebd7d9caeb60.snapshot
> 2018-06-07 16:08:18,640 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeManager:366 : Updating cube instance 'kylin_sales_cube'
> 2018-06-07 16:08:18,640 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeValidator:102 : NEW segment start does not fit/connect with other
> segments: kylin_sales_cube[20120101000000_20121231000000]
> 2018-06-07 16:08:18,640 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cube.CubeValidator:104 : NEW segment end does not fit/connect with other
> segments: kylin_sales_cube[20120101000000_20121231000000]
> 2018-06-07 16:08:18,643 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of
> TableRef[BUYER_COUNTRY:KYLIN_COUNTRY]
> 2018-06-07 16:08:18,644 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of
> TableRef[SELLER_ACCOUNT:KYLIN_ACCOUNT]
> 2018-06-07 16:08:18,646 INFO  [pool-11-thread-1]
> cachesync.Broadcaster:123 : Announcing new broadcast event:
> BroadcastEvent{entity=cube, event=update, cacheKey=kylin_sales_cube}
> 2018-06-07 16:08:18,650 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of
> TableRef[KYLIN_CATEGORY_GROUPINGS]
> 2018-06-07 16:08:18,650 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of
> TableRef[SELLER_COUNTRY:KYLIN_COUNTRY]
> 2018-06-07 16:08:18,651 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of
> TableRef[BUYER_ACCOUNT:KYLIN_ACCOUNT]
> 2018-06-07 16:08:18,652 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:189 : Broadcasting metadata change: entity=cube,
> event=UPDATE, cacheKey=kylin_sales_cube, listeners=[org.apache.kylin.
> cube.CubeManager$CubeSyncListener@11f9b854, org.apache.kylin.storage.
> hybrid.HybridManager$HybridSyncListener@581a3b84,
> org.apache.kylin.rest.service.CacheService$1@6c4e4197]
> 2018-06-07 16:08:18,654 INFO  [http-bio-7070-exec-3] cube.CubeManager:785
> : Reloaded cube kylin_sales_cube being CUBE[name=kylin_sales_cube] having 1
> segments
> 2018-06-07 16:08:18,654 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:189 : Broadcasting metadata change:
> entity=project_data, event=UPDATE, cacheKey=learn_kylin,
> listeners=[org.apache.kylin.metadata.project.ProjectManager$
> ProjectSyncListener@51853c91, org.apache.kylin.metadata.MetadataManager$
> SrcTableSyncListener@67cc135, org.apache.kylin.metadata.MetadataManager$
> SrcTableExtSyncListener@33eb1107, org.apache.kylin.metadata.
> MetadataManager$DataModelSyncListener@6d749bd4, org.apache.kylin.metadata.
> MetadataManager$ExtFilterSyncListener@9f57e2e, org.apache.kylin.cube.
> CubeDescManager$CubeDescSyncListener@71ecba35, org.apache.kylin.cube.
> CubeManager$CubeSyncListener@11f9b854, org.apache.kylin.storage.
> hybrid.HybridManager$HybridSyncListener@581a3b84,
> org.apache.kylin.metadata.realization.RealizationRegistry$1@7a3c2aa7,
> org.apache.kylin.rest.service.CacheService$1@6c4e4197]
> 2018-06-07 16:08:18,654 INFO  [http-bio-7070-exec-3]
> service.CacheService:135 : removeOLAPDataSource is called for project
> learn_kylin
> 2018-06-07 16:08:18,655 INFO  [http-bio-7070-exec-3]
> service.CacheService:117 : cleaning cache for project learn_kylin
> (currently remove all entries)
> 2018-06-07 16:08:18,655 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:219 : Done broadcasting metadata change:
> entity=project_data, event=UPDATE, cacheKey=learn_kylin
> 2018-06-07 16:08:18,655 DEBUG [http-bio-7070-exec-3]
> cachesync.Broadcaster:219 : Done broadcasting metadata change: entity=cube,
> event=UPDATE, cacheKey=kylin_sales_cube
> 2018-06-07 16:08:18,657 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> cli.DictionaryGeneratorCLI:96 : Checking snapshot of TableRef[KYLIN_CAL_DT]
> 2018-06-07 16:08:18,676 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-03
> 2018-06-07 16:08:18,681 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-03
> 2018-06-07 16:08:18,683 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-03 from RUNNING to SUCCEED
> 2018-06-07 16:08:18,687 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,705 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,707 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-c9fcf9d93bd2
> from RUNNING to READY
> 2018-06-07 16:08:18,716 INFO  [pool-8-thread-1]
> threadpool.DefaultScheduler:114 : CubingJob{id=6faa47ea-3a5b-4020-976f-c9fcf9d93bd2,
> name=kylin_sales_cube - 20120101000000_20121231000000 - BUILD - GMT+08:00
> 2018-06-07 16:05:02, state=READY} prepare to schedule
> 2018-06-07 16:08:18,716 INFO  [pool-8-thread-1]
> threadpool.DefaultScheduler:117 : CubingJob{id=6faa47ea-3a5b-4020-976f-c9fcf9d93bd2,
> name=kylin_sales_cube - 20120101000000_20121231000000 - BUILD - GMT+08:00
> 2018-06-07 16:05:02, state=READY} scheduled
> 2018-06-07 16:08:18,716 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.AbstractExecutable:110 : Executing AbstractExecutable
> (kylin_sales_cube - 20120101000000_20121231000000 - BUILD - GMT+08:00
> 2018-06-07 16:05:02)
> 2018-06-07 16:08:18,717 INFO  [pool-8-thread-1]
> threadpool.DefaultScheduler:124 : Job Fetcher: 0 should running, 1 actual
> running, 0 stopped, 1 ready, 0 already succeed, 0 error, 2 discarded, 0
> others
> 2018-06-07 16:08:18,717 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,720 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-c9fcf9d93bd2
> from READY to RUNNING
> 2018-06-07 16:08:18,727 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.AbstractExecutable:110 : Executing AbstractExecutable (Save
> Cuboid Statistics)
> 2018-06-07 16:08:18,728 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-04
> 2018-06-07 16:08:18,730 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-04 from READY to RUNNING
> 2018-06-07 16:08:18,771 ERROR [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> steps.SaveStatisticsStep:82 : fail to save cuboid statistics
> java.io.IOException: fail to find the statistics file in base dir:
> hdfs:/kylin/kylin_metadata/kylin-6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2/kylin_sales_cube/fact_distinct_columns/statistics
> at org.apache.kylin.engine.mr.steps.SaveStatisticsStep.
> doWork(SaveStatisticsStep.java:66)
> at org.apache.kylin.job.execution.AbstractExecutable.
> execute(AbstractExecutable.java:124)
> at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(
> DefaultChainedExecutable.java:64)
> at org.apache.kylin.job.execution.AbstractExecutable.
> execute(AbstractExecutable.java:124)
> at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(
> DefaultScheduler.java:142)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(
> ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(
> ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> 2018-06-07 16:08:18,772 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-04
> 2018-06-07 16:08:18,777 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-04
> 2018-06-07 16:08:18,779 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2-04 from RUNNING to ERROR
> 2018-06-07 16:08:18,784 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,788 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,790 DEBUG [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> dao.ExecutableDao:217 : updating job output, id: 6faa47ea-3a5b-4020-976f-
> c9fcf9d93bd2
> 2018-06-07 16:08:18,792 INFO  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.ExecutableManager:389 : job id:6faa47ea-3a5b-4020-976f-c9fcf9d93bd2
> from RUNNING to ERROR
> 2018-06-07 16:08:18,792 WARN  [Job 6faa47ea-3a5b-4020-976f-c9fcf9d93bd2-424]
> execution.AbstractExecutable:258 : no need to send email, user list is
> empty
> 2018-06-07 16:08:18,798 INFO  [pool-8-thread-1]
> threadpool.DefaultScheduler:124 : Job Fetcher: 0 should running, 0 actual
> running, 0 stopped, 0 ready, 0 already succeed, 1 error, 2 discarded, 0
> others
> 2018-06-07 16:08:56,561 INFO  [pool-8-thread-1]
> threadpool.DefaultScheduler:124 : Job Fetcher: 0 should running, 0 actual
> running, 0 stopped, 0 ready, 0 already succeed, 1 error, 2 discarded, 0
> others
