Posts
All the articles I've posted.
- 8 MIN READ•Apr 29, 2026
B-Trees, LSM Trees, and the Indexing Tradeoff Spectrum
B-trees balance reads and writes for OLTP. LSM trees maximize write throughput. Bitmap indexes accelerate OLAP filtering. Here is when to use each.
database indexing strategiesB-tree vs LSM treebitmap index - 8 MIN READ•Apr 29, 2026
Performance and Apache Iceberg's Metadata
Iceberg's three-layer metadata tree eliminates directory listing and enables multi-level data skipping. Here is how scan planning actually works.
Apache Iceberg metadata performanceIceberg scan planningmanifest pruning - 8 MIN READ•Apr 29, 2026
How Databases Organize Data on Disk: Pages, Blocks, and File Formats
Databases structure data on disk as heap files, sorted files, or LSM trees, then wrap it in formats like Parquet with metadata that lets engines skip irrelevant blocks.
data file formatsParquet file formatdatabase storage internals