Empty schema not supported
Writing a dataframe with an empty or nested empty schema using any file format, such as parquet, orc, json, text, or csv is not allowed.
Type of change: Syntactic/Spark core
Spark 1.6 - 2.3
Writing a dataframe with an empty or nested empty schema using any file format is allowed and will not throw an exception.
Spark 2.4
An exception is thrown when you attempt to write dataframes with empty schema.
For example, if there are statements such as
df.write.format("parquet").mode("overwrite").save(somePath)
, the following
error occurs:
org.apache.spark.sql.AnalysisException: Parquet data source does not support null data type.
Action Required
Make sure that DataFrame is not empty. Check whether DataFrame is empty or not as follows:
if (!df.isEmpty) df.write.format("parquet").mode("overwrite").save("somePath")