Cookbook

This cookbook, inspired by OpenAI's cookbook, is a collection of recipes for common Braintrust use cases. Each recipe is an open-source, self-contained example hosted on GitHub. We welcome community contributions and hope the cookbook will grow into a collaborative, living collection of best practices for building high-quality AI products.

python

Evaluating SimpleQA

Ankur Goyal, Ornella Altunyan
Dec 6, 2024 | datasets, evals
typescript

Using Python functions to extract text from images

Ornella Altunyan
Nov 22, 2024 | python, tools, ocr, functions
typescript

Using OpenTelemetry for LLM observability

Ornella Altunyan
Oct 31, 2024 | evals, tools
typescript

Using functions to build a RAG agent

Ornella Altunyan, Ankur Goyal
Oct 8, 2024 | functions, rag, tools
python

Evaluating multimodal receipt extraction

Ankur Goyal
Sep 30, 2024 | evals, multimodal, receipts
typescript

Unreleased AI: A full stack Next.js app for generating changelogs

Ornella Altunyan
Aug 28, 2024 | evals, logging, next.js
python

An agent that runs OpenAPI commands

Ankur Goyal
Aug 12, 2024 | agent, rag, evals
typescript

Benchmarking inference providers

Ankur Goyal
Jul 29, 2024 | evals, llama-3.1, providers
typescript

Tool calls in LLaMa 3.1

Ankur Goyal
Jul 26, 2024 | evals, llama-3.1, tools
typescript

Evaluating a chat assistant

Tara Nagar
Jul 16, 2024 | evals, chat
python

LLM Eval For Text2SQL

Ankur Goyal
May 29, 2024 | evals, datasets, text2sql
python

Optimizing Ragas to evaluate a RAG pipeline

Ankur Goyal, Nelson Auner
May 27, 2024 | evals, rag
typescript

Comparing evals across multiple AI models

John Huang
May 22, 2024 | evals, charts
python

Detecting Prompt Injections

Nelson Auner
May 20, 2024 | evals, classification
python

AI Search Bar

Austin Moehle
Mar 4, 2024 | evals, sql
typescript

How Zapier uses assertions to evaluate tool usage in chatbots

Vítor Balocco
Feb 13, 2024 | evals, assertions, tools
typescript

Generating release notes and hill-climbing to improve them

Ankur Goyal
Feb 2, 2024 | evals, hill-climbing
typescript

Generating beautiful HTML components

Ankur Goyal
Jan 29, 2024 | logging, datasets, evals
python

Coda's Help Desk with and without RAG

Austin Moehle, Kenny Wong
Dec 21, 2023 | evals, rag
typescript

Improving Github issue titles using their contents

Ankur Goyal
Oct 29, 2023 | evals, summarization
python

Classifying news articles

David Song
Sep 1, 2023 | evals, classification
python

Text-to-SQL

Ankur Goyal
Aug 12, 2023 | evals, sql