Hitchhiker’s Guide to RAG with ChatGPT API and LangChain

generate tons of words and responses based on general knowledge, but what happens when we need answers requiring accurate and ...
A Developer’s Guide to Building Scalable AI: Workflows vs Agents

I had just started experimenting with CrewAI and LangGraph, and it felt like I’d unlocked a whole new dimension of ...
Pipelining AI/ML Training Workloads with CUDA Streams

ninth in our series on performance profiling and optimization in PyTorch aimed at emphasizing the critical role of performance analysis and optimization ...
A Caching Strategy for Identifying Bottlenecks on the Data Input Pipeline

in the data input pipeline of a machine learning model running on a GPU can be particularly frustrating. In most ...
Use OpenAI Whisper for Automated Transcriptions

development lately with large language models (LLMs). A lot of the focus is on the question-answering you can do with ...
Agentic AI: Implementing Long-Term Memory

, you know they are stateless. If you haven’t, think of them as having no short-term memory. An example of ...
Data Has No Moat!

of AI and data-driven projects, the importance of data and its quality have been recognized as critical to a project’s ...
Reinforcement Learning from Human Feedback, Explained Simply

The appearance of ChatGPT in 2022 completely changed how the world started perceiving artificial intelligence. The incredible performance of ChatGPT ...
What PyTorch Really Means by a Leaf Tensor and Its Grad

isn’t yet another explanation of the chain rule. It’s a tour through the bizarre side of autograd — where gradients ...
From Configuration to Orchestration: Building an ETL Workflow with AWS Is No Longer a Struggle

to lead the cloud industry with a whopping 32% share due to its early market entry, robust technology and comprehensive ...