How to Use LLMs for Powerful Automatic Evaluations

I discuss how you can perform automatic evaluations using an LLM as a judge. LLMs are widely used today for a variety ...
GPT-5 Doesn’t Dislike You—It Might Just Need a Benchmark for Emotional Intelligence

Since the all-new ChatGPT launched on Thursday, some users have mourned the disappearance of a peppy and encouraging personality in ...
Coconut: A Framework for Latent Reasoning in LLMs

Paper link: https://arxiv.org/abs/2412.06769 Released: 9th of December 2024 Figure 1. The two reasoning modes of Coconut. In Language Mode (left), ...
Estimating from No Data: Deriving a Continuous Score from Categories

has collected data on the outcomes of patients who have acquired “Pathogen A” responsible for an infectious respiratory illness. Available ...
Demystifying Cosine Similarity

Cosine similarity is a commonly used metric for operationalizing tasks such as semantic search and document comparison in the field of natural ...
How to Write Insightful Technical Articles

I discuss how you can write technical articles. I have been writing such articles for around 2.5 years, and ...
How to Design Machine Learning Experiments — the Right Way

Never miss a new edition of The Variable, our weekly newsletter featuring a top-notch selection of editors’ picks, deep dives, ...
Time Series Forecasting Made Simple (Part 3.2): A Deep Dive into LOESS-Based Smoothing

In Part 3.1 we started discussing how STL decomposes the time series data into trend, seasonality, and residual components, and as ...
Finding Golden Examples: A Smarter Approach to In-Context Learning

When using Large Language Models (LLMs), In-Context Learning (ICL), where input and output are provided to LLMs to learn from ...
Context Engineering — A Comprehensive Hands-On Tutorial with DSPy

Context Engineering by now. This article will cover the key ideas behind creating LLM applications using Context Engineering principles, visually ...