spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Krishna Shekhram (JIRA)" <>
Subject [jira] [Created] (SPARK-14108) calling count() on empty dataframe throws java.util.NoSuchElementException
Date Wed, 23 Mar 2016 22:30:25 GMT
Krishna Shekhram created SPARK-14108:

             Summary: calling count() on empty dataframe throws java.util.NoSuchElementException
                 Key: SPARK-14108
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 1.6.1
         Environment: Tested in Hadoop 2.7.2 EMR 4.x
            Reporter: Krishna Shekhram
            Priority: Minor

When calling count() on empty dataframe, then spark code still tries to iterate through the
empty iterator and throws java.util.NoSuchElementException.

Stacktrace :
java.util.NoSuchElementException: next on empty iterator
	at scala.collection.Iterator$$anon$
	at scala.collection.Iterator$$anon$
	at scala.collection.IndexedSeqLike$
	at scala.collection.IterableLike$class.head(IterableLike.scala:91)
	at scala.collection.mutable.ArrayOps$ofRef.scala$collection$IndexedSeqOptimized$$super$head(ArrayOps.scala:108)
	at scala.collection.IndexedSeqOptimized$class.head(IndexedSeqOptimized.scala:120)
	at scala.collection.mutable.ArrayOps$ofRef.head(ArrayOps.scala:108)
	at org.apache.spark.sql.DataFrame$$anonfun$count$1.apply(DataFrame.scala:1515)
	at org.apache.spark.sql.DataFrame$$anonfun$count$1.apply(DataFrame.scala:1514)
	at org.apache.spark.sql.DataFrame.withCallback(DataFrame.scala:2099)
	at org.apache.spark.sql.DataFrame.count(DataFrame.scala:1514)

Code Snippet:
This code fails
if(this.df !=null){
			long countOfRows = this.df.count();

If I do this then it works
if(this.df !=null && ! this.df.rdd().isEmpty()){
			long countOfRows = this.df.count();

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message