Support Parquet v2 Spark vectorized read

### Feature Request / Improvement

As it stands today, if you want to employ both Spark and AWS Athena for your iceberg tables in v1.1.0, you must disable the vectorized reader. The reason is because Athena writes fields in a delta encoded manner, which is unsupported by the vectorized reader.

If you have ever hit the following error for a primitive type (complex types can be solved by https://github.com/apache/iceberg/issues/521), you have probably been impacted by this issue:
```
java.lang.UnsupportedOperationException: Cannot support vectorized reads for column [email] optional binary email (STRING) = 1 with encoding DELTA_BYTE_ARRAY. Disable vectorized reads to read this table/file
	at org.apache.iceberg.arrow.vectorized.parquet.VectorizedPageIterator.initDataReader(VectorizedPageIterator.java:96)
```

Spark has implemented this support in 2022: https://github.com/apache/spark/pull/35262. However, Iceberg uses its own vectorized reader.

Is it possible to implement support for these encodings? It would solve a significant interoperability problem between Athena, Spark, and possibly other query engines using them.

### Query engine

Athena + Spark

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support Parquet v2 Spark vectorized read #7162

Feature Request / Improvement

Query engine

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support Parquet v2 Spark vectorized read #7162

Description

Feature Request / Improvement

Query engine

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions