Description
FileFormat data sources such as Parquet and Avro (provided by spark-avro) each implement their own file-filtering logic. For example, Parquet needs to filter out summary files, while Avro provides a Hadoop configuration option to filter out all files whose names don't end with ".avro".
It would be nice to have a general file filtering interface in FileFormat to handle similar requirements.
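As a rough illustration of what such an interface might look like, here is a minimal, self-contained Java sketch. Everything in it is hypothetical: `FileFilterSketch`, `DataFileFilter`, `PARQUET`, `avro`, and `select` are names invented for this example and are not part of Spark's FileFormat API or of spark-avro. The Parquet filter assumes the summary files are the ones named `_metadata` and `_common_metadata`, and the Avro filter models the configuration option as a plain boolean flag.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical sketch only: none of these names exist in Spark or spark-avro.
final class FileFilterSketch {

    /** Hypothetical per-format hook: returns true if the file should be read as data. */
    interface DataFileFilter {
        boolean accept(String fileName);
    }

    /** Parquet-style filter: skips summary files (assumed to be _metadata and _common_metadata). */
    static final DataFileFilter PARQUET = name ->
        !name.equals("_metadata") && !name.equals("_common_metadata");

    /**
     * Avro-style filter. The boolean stands in for the Hadoop configuration option;
     * when it is true, only files whose names end with ".avro" are accepted.
     */
    static DataFileFilter avro(boolean ignoreFilesWithoutExtension) {
        return name -> !ignoreFilesWithoutExtension || name.endsWith(".avro");
    }

    /** Generic listing step: applies whichever filter the format supplies. */
    static List<String> select(List<String> fileNames, DataFileFilter filter) {
        return fileNames.stream().filter(filter::accept).collect(Collectors.toList());
    }
}
```

With a hook like this, the shared file-listing code could call `select` once and each format would only supply its own predicate, instead of every data source re-implementing the filtering step.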