I won’t say that one is better than the other, since it depends entirely on where they are going to be used. Apache Avro is a remote procedure call and data serialization framework developed within Apache’s Hadoop project. It uses JSON for defining data types and protocols, and serializes data in a compact binary format (source: Wikipedia). Since it’s a row-based format, it’s better to use when all fields