The Crucial Role of NUMA Awareness in High-Performance Deep Learning

In the world of deep learning training, the role of the ML developer can be likened to that of the conductor of ...

Work Data Is the Next Frontier for GenAI

Work data, the work output of knowledge workers, is the single most valuable data source for LLM training, uniquely capable of ...

How to Fine-Tune Small Language Models to Think with Reinforcement Learning

in fashion. DeepSeek-R1, Gemini-2.5-Pro, OpenAI’s O-series models, Anthropic’s Claude, Magistral, and Qwen3 — there is a new one every month. ...

Run Your Python Code up to 80x Faster Using the Cython Library

Python is an excellent language for rapid prototyping and code development, but one thing I often hear people say about using it is ...

The Five-Second Fingerprint: Inside Shazam’s Instant Song ID

This post continues Behind the Tap, a series exploring the hidden mechanics of everyday tech — from Uber to Spotify to search ...

GraphRAG in Action: A Simple Agent for Know-Your-Customer Investigations

In the world of financial services, Know-Your-Customer (KYC) and Anti-Money Laundering (AML) are critical defense lines against illicit activities. KYC is ...

Explainable Anomaly Detection with RuleFit: An Intuitive Guide

your anomaly detection results to your stakeholders, the immediate next question is always “why?”. In practice, simply flagging an anomaly ...

Change-Aware Data Validation with Column-Level Lineage

tools like dbt make constructing SQL data pipelines easy and systematic. But even with the added structure and clearly defined ...

My Honest Advice for Aspiring Machine Learning Engineers

want to be machine learning engineers. I get it. It’s a great job, with interesting work, great pay, and overall, ...

Rethinking Data Science Interviews in the Age of AI

AI is rewriting the day-to-day of data scientists. , data scientists must learn how to improve productivity and unlock new ...