phoenix-issues mailing list archives

From GitBox <...@apache.org>
Subject [GitHub] karanmehta93 commented on issue #419: PHOENIX-4009 Run UPDATE STATISTICS command by using MR integration on…
Date Wed, 09 Jan 2019 05:19:25 GMT
karanmehta93 commented on issue #419: PHOENIX-4009 Run UPDATE STATISTICS command by using MR
integration on…
URL: https://github.com/apache/phoenix/pull/419#issuecomment-452575444
 
 
   > I didn't see my concern "if time since last update is less than certain threshold,
the job doesn't need to update the regions' stats again. Suppose one MR job failed but some
regions still get updated, the rerun job only needs to update the regions that the first job
didn't update." described in the PHOENIX-4009 being addressed in this change, and I still
think it should be addressed.
   
   The first aspect should be fairly simple to add. We can address it as part of this Jira
or in PHOENIX-5091.
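   To make the first aspect concrete, here is a rough sketch of the kind of freshness check
being discussed. It is not code from this PR: the class name, the method name, and the way the
last-update time and threshold are passed in are all hypothetical, and where the real job would
read them from is left open.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical helper for illustration only; not part of Phoenix or this PR.
public class StatsFreshnessCheck {

    /**
     * Returns true when a region's stats should be recomputed.
     * lastUpdated may be null, meaning no stats have been recorded yet.
     */
    static boolean needsUpdate(Instant lastUpdated, Instant now, Duration threshold) {
        if (lastUpdated == null) {
            return true; // never updated, so always collect
        }
        // Skip the region if it was updated more recently than the threshold.
        return Duration.between(lastUpdated, now).compareTo(threshold) >= 0;
    }

    public static void main(String[] args) {
        Instant now = Instant.parse("2019-01-09T05:00:00Z");
        Duration threshold = Duration.ofMinutes(15); // assumed value for the example

        // Updated 5 minutes ago: below the threshold, so the region is skipped.
        System.out.println(needsUpdate(now.minus(Duration.ofMinutes(5)), now, threshold));
        // Updated 30 minutes ago: past the threshold, so the region is updated.
        System.out.println(needsUpdate(now.minus(Duration.ofMinutes(30)), now, threshold));
        // No previous update recorded.
        System.out.println(needsUpdate(null, now, threshold));
    }
}
```

   With a check like this, a rerun after a partial failure would also skip the regions the
first run already updated, as long as the rerun happens within the threshold.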
   
   The second aspect, re-running only the necessary tasks, is a bit more complicated. The
mappers already retry a particular region when they fail (up to the maximum number of attempts).
However, I don't really feel the need for that optimization as of now. Let me know what your
thoughts are.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services
