Hitchhiker’s Guide to RAG: From Tiny Files to Tolstoy with OpenAI’s API and LangChain

, I walked you through setting up a very simple RAG pipeline in Python, using OpenAI’s API, LangChain, and your ...
Read more
The Crucial Role of NUMA Awareness in High-Performance Deep Learning

world of deep learning training, the role of the ML developer can be likened to that of the conductor of ...
Read more
Run Your Python Code up to 80x Faster Using the Cython Library

excellent language for rapid prototyping and code development, but one thing I often hear people say about using it is ...
Read more
Explainable Anomaly Detection with RuleFit: An Intuitive Guide

your anomaly detection results to your stakeholders, the immediate next question is always “why?”. In practice, simply flagging an anomaly ...
Read more
My Honest Advice for Aspiring Machine Learning Engineers

want to be machine learning engineers. I get it. It’s a great job, with interesting work, great pay, and overall, ...
Read more
Taking ResNet to the Next Level

If you read the title of this article, you might probably think that ResNeXt is directly derived from ResNet. Well, ...
Read more
How to Access NASA’s Climate Data — And How It’s Powering the Fight Against Climate Change Pt. 1

can’t think of a more important dataset. Just today, I saw a headline like this: ‘Heat Waves Are Getting More ...
Read more
STOP Building Useless ML Projects – What Actually Works

all the time: “What projects should I do to get a job in data science or machine learning?” This question ...
Read more
Prescriptive Modeling Makes Causal Bets – Whether You Know it or Not!

modeling is the pinnacle of analytics value. It doesn’t focus on what happened, or even what will happen – it ...
Read more
The Mythical Pivot Point from Buy to Build for Data Platforms

TL;DR: with data-intensive architectures, there often comes a pivotal point where building in-house data platforms makes more sense than buying ...
Read more