Insights
Long-form articles on the craft of modern data work — machine learning models, data visualisation and statistical analysis. Practical, opinionated, and free to read.
From linear baselines to gradient boosting and deep neural networks — how to pick, train and trust the model that actually fits your problem.
Charts are arguments, not decoration. A short guide to picking the right visual, removing noise and making numbers persuasive without distorting them.
Hypothesis tests, effect sizes, confidence intervals and the assumptions everyone forgets. A practical refresher for analysts who want defensible conclusions.
Algorithms get most of the credit, but features decide the ceiling. A field guide to encoding, scaling, interactions, leakage and the features AutoML still misses.
Most dashboards die from neglect within three months. A guide to building dashboards that drive decisions — hierarchy, defaults, alerts and the one chart you really need.
Priors, posteriors, credible intervals and why probabilistic answers communicate better than p-values in most business conversations.
SHAP, LIME, permutation importance and partial dependence — a practical guide to extracting trustworthy explanations from complex models.
Stacking, blending, boosting and bagging — how to combine models so the ensemble outperforms every member, and when it is worth the complexity.
The best visualisations do not just show data — they tell a story. A framework for structuring analytical narratives that persuade and endure.
Animation can guide attention, reveal change over time, or waste it. Principles for using motion responsibly in charts, dashboards and presentations.
When randomised experiments are impossible, how do you still estimate causal effects? A tour of difference-in-differences, regression discontinuity and instrumental variables.
When the event has not happened yet for everyone, ordinary regression fails. A guide to Kaplan-Meier curves, Cox models and handling censoring with confidence.
A tour of the eleven dashboard layouts inside ML Studio — what each one is optimised for, where it fails, and how to pick the grid that matches your audience.