C
C#2mo ago
tommy

✅ How do Avro, ORC and Parquet work?

I have been learning about big data and these came across as a few of the popular "optimized file formats". I want to understand how they work.
7 Replies
Sossenbinder
Sossenbinder2mo ago
Not very familiar with the others, but at least for Avro the unique selling point is probably the potential to differ in reader and writer schema. The consumer and producer do not always have to stick to the exact same encoding, they just need to be compatible I also dug up a small example of how the Avro variable length binary encoding works
Sossenbinder
Sossenbinder2mo ago
No description
Sossenbinder
Sossenbinder2mo ago
For the schema
record Person {
string userName;
union { null, long } favoriteNumber = null;
array<string> interests;
}
record Person {
string userName;
union { null, long } favoriteNumber = null;
array<string> interests;
}
tommy
tommyOP2mo ago
👍
Unknown User
Unknown User2mo ago
Message Not Public
Sign In & Join Server To View
tommy
tommyOP2mo ago
okay
Unknown User
Unknown User2mo ago
Message Not Public
Sign In & Join Server To View
Want results from more Discord servers?
Add your server