Description
After looking through the pull requests and issue below for the Spark CSV data source,
https://github.com/databricks/spark-csv/pull/256
https://github.com/databricks/spark-csv/issues/141
https://github.com/databricks/spark-csv/pull/186
it looks like Spark might need to be able to set minPartitions.
repartition() or coalesce() can be alternatives, but it looks like they need to shuffle the data in most cases.
Although I am still not sure whether this is needed, I am opening this ticket for discussion.
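
For reference, the difference between a read-time partition hint and the post-read alternatives can be sketched at the RDD level, where sc.textFile already accepts minPartitions. This is only an illustration under assumptions: the path, the partition counts, and the object name are hypothetical, and it does not show how a data source option would be implemented. Note that minPartitions is a hint to the input format, so the resulting partition count may differ from the requested value.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical sketch: compares a read-time partition hint with
// repartition() and coalesce() applied after the read.
object MinPartitionsSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("min-partitions-sketch")
      .setMaster("local[4]") // assumption: local run for illustration
    val sc = new SparkContext(conf)
    try {
      val path = "data/example.csv" // hypothetical input file

      // Read-time hint: the input is split into more pieces up front,
      // so no extra shuffle stage is added.
      val hinted = sc.textFile(path, minPartitions = 8)

      // Default read, then change the partition count afterwards.
      val plain = sc.textFile(path)

      // repartition() always shuffles the data.
      val reshuffled = plain.repartition(8)

      // coalesce() without shuffle avoids a shuffle, but it can only
      // reduce the number of partitions, not increase it.
      val narrowed = plain.coalesce(2)

      println(s"hinted:     ${hinted.partitions.length}")
      println(s"plain:      ${plain.partitions.length}")
      println(s"reshuffled: ${reshuffled.partitions.length}")
      println(s"narrowed:   ${narrowed.partitions.length}")
    } finally {
      sc.stop()
    }
  }
}
```

Increasing the partition count after the read requires either repartition() or coalesce(n, shuffle = true), both of which shuffle, which is the cost a read-time minPartitions setting would avoid.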