✅ How do Avro, ORC and Parquet work?
I have been learning about big data and these came across as a few of the popular "optimized file formats". I want to understand how they work.
7 Replies
Not very familiar with the others, but at least for Avro the unique selling point is probably the potential to differ in reader and writer schema. The consumer and producer do not always have to stick to the exact same encoding, they just need to be compatible
I also dug up a small example of how the Avro variable length binary encoding works
For the schema
👍
Unknown User•4d ago
Message Not Public
Sign In & Join Server To View
okay
Unknown User•4d ago
Message Not Public
Sign In & Join Server To View