Programming
Notes from my day job as a data engineer. Things I learned, things I broke, things I wish someone had told me earlier. Pick a category to dig in.
Categories
-
SQL Server
Indexes, partitions, query plans, and the day-to-day reality of running T-SQL at scale.
40 posts -
Python
Idioms, the standard library tricks I keep forgetting, and what's new in the language.
60 posts -
PySpark
Distributed dataframes, joins that don't blow up the cluster, and the parts of Spark that bite.
60 posts -
Architecture
Single-server apps to global distributed systems. Storage choices, replication, streaming, orchestration, cost, and the case studies that show how it's done at scale.
81 posts -
Career notes
What this job actually looks like, and how I think about it.
24 posts