PinnedVu TrinhinThe Deep HubHow do we run Kafka 100% on the object storage?Let’s see how AutoMQ makes this dream come true.Aug 275Aug 275
PinnedVu TrinhinData Engineer ThingsI spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.Aug 2416Aug 2416
PinnedVu TrinhinData Engineer ThingsHow does Uber build real-time infrastructure to handle petabytes of data every day?All insights from the paper: Real-time data infrastructure at UberMar 2319Mar 2319
Vu TrinhI spent 8 hours researching WarpStreamRewriting Kafka protocol in Go and running 100% on object storageOct 51Oct 51
Vu TrinhinData Engineer ThingsI spent 8 hours diving deep into Snowflake (again)Virtual Warehouse, Intermediate Storage, Cache, and Remote StorageSep 281Sep 281
Vu TrinhinGoogle Cloud - CommunityI spent 5 hours learning how Google lets us build a Lakehouse.The Google Cloud BigLakeSep 24Sep 24
Vu TrinhinData Engineer ThingsI spent 5 hours learning how ClickHouse built their internal data warehouse.19 data sources and a total of 470 TB of compressed data.Sep 211Sep 211
Vu TrinhinData Engineer ThingsI spent 5 hours learning how Google manages terabytes of metadata for BigQuery.How Google manages metadata at a large scale.Sep 17Sep 17
Vu TrinhinData Engineer ThingsUber’s Big Data Revolution: From MySQL to Hadoop and BeyondVolume: 100+ PB Data, Latency: MinutesSep 141Sep 141
Vu TrinhinData Engineer ThingsI spent 6 hours learning how Apache Spark plans the execution for us.Catalyst, Adaptive Query Execution, and how Airbnb leverages Spark 3.Sep 111Sep 111