Effortless Spreadsheet Normalisation With LLM

This article is part of a series of articles on automating Data Cleaning for any tabular dataset. You can test ...
Forget About Cloud Computing. On-Premises Is All the Rage Again

Ten years ago, everybody was fascinated by the cloud. It was the new thing, and companies that adopted it rapidly ...
Mastering Prompt Engineering with Functional Testing: A Systematic Guide to Reliable LLM Outputs

Creating efficient prompts for large language models often starts as a simple task… but it doesn’t always stay that way. ...
The Impact of GenAI and Its Implications for Data Scientists

GenAI systems affect how we work. This general notion is well known. However, we are still unaware of the exact ...
Mastering Hadoop, Part 3: Hadoop Ecosystem: Get the most out of your cluster

As we have already seen with the basic components (Part 1, Part 2), the Hadoop ecosystem is constantly evolving and ...
Nine Pico PIO Wats with Rust (Part 2)

This is Part 2 of an exploration into the unexpected quirks of programming the Raspberry Pi Pico PIO with MicroPython. ...
Advanced Time Intelligence in DAX with Performance in Mind

We all know the usual Time Intelligence functions based on years, quarters, months, and days. But sometimes, we need to perform ...
How to Fine-Tune DistilBERT for Emotion Classification

At every company I’ve worked at, the customer support teams were drowning in an overwhelming volume of customer inquiries. Have ...
Learnings from a Machine Learning Engineer — Part 3: The Evaluation

In this third part of my series, I will explore the evaluation process, which is a critical piece that will ...
Tutorial: Semantic Clustering of User Messages with LLM Prompts

As a Developer Advocate, it’s challenging to keep up with user forum messages and understand the big picture of what ...