Docling: The Document Alchemist | Towards Data Science

Why do we still wrestle with documents in 2025? in any data-driven organisation, and you’ll encounter a host of PDFs, Word ...
Is Your Training Data Representative? A Guide to Checking with PSI in Python

To get the most out of this tutorial, you should have a solid understanding of how to compare two distributions. ...
How to Context Engineer to Optimize Question Answering Pipelines

engineering is one of the most relevant topics in machine learning today, which is why I’m writing my third article ...
Anthropic Agrees to Pay Authors at Least $1.5 Billion in AI Copyright Settlement

Anthropic has agreed to pay at least $1.5 billion to settle a lawsuit brought by a group of book authors ...
Should We Use LLMs As If They Were Swiss Knives?

or so, it has been impossible to deny that there has been an increase in the hype level towards AI, ...
The Generalist: The New All-Around Type of Data Professional?

(or 2010s to be more precise) big-data boom brought the emergence of specialization in data roles. What used to be ...
How to Develop Powerful Internal LLM Benchmarks

LLMs being released almost weekly. Some recent releases we’ve had are Qwen3 coding models, GPT 5, Grok 4, all of ...
The Hidden Ingredients Behind AI’s Creativity

The original version of this story appeared in Quanta Magazine. We were once promised self-driving cars and robot maids. Instead, ...
What If I Had AI in 2020: Rent The Runway Dynamic Pricing Model

of Shopify, recently told his employees in an internal memo: “Before asking for more headcount and resources, teams must demonstrate ...