-

Why learn 8 scripts when you can learn 256 bytes?
12 min read -

Most slow Pandas code “works”, until it doesn’t. Learn how to spot hidden bottlenecks, avoid…
18 min read
Latest
-

How does decision-gravity dictate this gap?
12 min read -

Learn about function approximation and the different choices for approximation functions
9 min read -

A local, zero-cost project that cleans, structures, and summarizes your reading automatically
12 min read -

Learn how to get the most out of Claude Code
10 min read -

More variables don’t make a better scoring model. Stable variables do. Here’s how to find them.
7 min read -

A practical pipeline for classifying messy free-text data into meaningful categories using a locally hosted…
8 min read -

Mario asked me why 18% of his shipments were late when every team hit their…
9 min read -

The silent gaps in synthetic data that only show up when your model is already…
11 min read
Editor’s Picks
-

Using Causal Inference to Estimate the Impact of Tube Strikes on Cycling Usage in London
Data ScienceTurning free-to-use data into a hypothesis-ready dataset
19 min read -

A short intro to scientific methodology to combat “prompt in, slop out”
6 min read -

-

Why it tickles your brain to use an LLM, and what that means for the…
8 min read -

Git worktrees, parallel agentic coding sessions, and the setup tax you should be aware of
20 min read -

How I turned my eight-year weekly visualization habit into a reusable AI workflow
7 min read -

Architectures, pitfalls, and patterns that work
14 min read -

Inside MareNostrum V: SLURM schedulers, fat-tree topologies, and scaling pipelines across 8,000 nodes in a…
11 min read -

The upstream decision no model, or LLM can fix once you get it wrong
22 min read
The Variable Newsletter
-

Sorting through the good, bad, and ambiguous aspects of vibe coding
4 min read
Deep Dives
-

It’s simpler than you think.
24 min read -

Learn how Propensity Score Matching uncovers true causality in observational data. By finding “statistical twins,”…
12 min read -

How you can build your own Thompson Sampling Algorithm object in Python and apply it…
17 min read -

For any data scientist who works in a team, being able to undo Git actions…
24 min read -

The hidden cost of probabilistic outputs in systems that demand reliability
13 min read -

Conceptual overview and practical guidance
16 min read

