spark-issues mailing list archives

From "Attila Zsolt Piros (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-23394) Storage info's Cached Partitions doesn't consider the replications (but sc.getRDDStorageInfo does)
Date Mon, 12 Feb 2018 12:53:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-23394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Attila Zsolt Piros updated SPARK-23394:
---------------------------------------
    Description: 
Start Spark as:
{code:bash}
$ bin/spark-shell --master local-cluster[2,1,1024]
{code}


{code:scala}
scala> import org.apache.spark.storage.StorageLevel._
import org.apache.spark.storage.StorageLevel._

scala> sc.parallelize((1 to 100), 10).persist(MEMORY_AND_DISK_2).count
res0: Long = 100                                                                

scala> sc.getRDDStorageInfo(0).numCachedPartitions
res1: Int = 20
{code}

But in the UI, the Storage tab shows Cached Partitions as 10. See the attached screenshot: !Storage_Tab.png!

  was:
Start spark as:
{code:bash}
$ bin/spark-shell --master local-cluster[2,1,1024]
{code}


{code:scala}
scala> import org.apache.spark.storage.StorageLevel._
import org.apache.spark.storage.StorageLevel._

scala> sc.parallelize((1 to 100), 10).persist(MEMORY_AND_DISK_2).count
res0: Long = 100                                                                

scala> sc.getRDDStorageInfo(0).numCachedPartitions
res1: Int = 20
{code}

But on the UI at the Storage tab Cached Partitions is 10. See attached screenshot.


> Storage info's Cached Partitions doesn't consider the replications (but sc.getRDDStorageInfo does)
> --------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-23394
>                 URL: https://issues.apache.org/jira/browse/SPARK-23394
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 2.3.0
>            Reporter: Attila Zsolt Piros
>            Priority: Major
>         Attachments: Spark_2.2.1.png, Spark_2.4.0-SNAPSHOT.png, Storage_Tab.png
>
>
> Start Spark as:
> {code:bash}
> $ bin/spark-shell --master local-cluster[2,1,1024]
> {code}
> {code:scala}
> scala> import org.apache.spark.storage.StorageLevel._
> import org.apache.spark.storage.StorageLevel._
> scala> sc.parallelize((1 to 100), 10).persist(MEMORY_AND_DISK_2).count
> res0: Long = 100                                                                
> scala> sc.getRDDStorageInfo(0).numCachedPartitions
> res1: Int = 20
> {code}
> But in the UI, the Storage tab shows Cached Partitions as 10. See the attached screenshot: !Storage_Tab.png!
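
The two numbers in the report (20 from sc.getRDDStorageInfo, 10 in the Storage tab) are consistent with one count including replicas and the other not, as the issue title suggests. The following is a minimal sketch of that arithmetic in plain Scala. It does not use Spark, and CachedBlock, ReplicaCount, and the executor names are hypothetical names made up for this illustration, not part of Spark's API.

```scala
// Hypothetical model, not Spark's API: a cached block is identified by the
// partition it holds and the executor that stores that copy.
final case class CachedBlock(partition: Int, executor: String)

object ReplicaCount {
  // Every partition is stored once per replica, each copy on a different
  // (made-up) executor.
  def cachedBlocks(numPartitions: Int, replication: Int): Seq[CachedBlock] =
    for {
      p <- 0 until numPartitions
      r <- 0 until replication
    } yield CachedBlock(p, s"executor-$r")

  // Counts every stored copy, replicas included.
  def countWithReplicas(blocks: Seq[CachedBlock]): Int = blocks.size

  // Counts each partition once, no matter how many copies exist.
  def countDistinctPartitions(blocks: Seq[CachedBlock]): Int =
    blocks.map(_.partition).distinct.size

  def main(args: Array[String]): Unit = {
    // Mirrors the report: 10 partitions persisted with MEMORY_AND_DISK_2,
    // that is, a replication factor of 2.
    val blocks = cachedBlocks(numPartitions = 10, replication = 2)
    println(countWithReplicas(blocks))       // 20
    println(countDistinctPartitions(blocks)) // 10
  }
}
```

Under this assumption, the replica-inclusive count gives the value reported by sc.getRDDStorageInfo and the distinct-partition count gives the value shown in the Storage tab. The sketch only illustrates the discrepancy. It says nothing about how either number is actually computed inside Spark.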



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


