This repository includes the slides and code for the workshop on Inspect given at Mastering LLMs: A Conference for Developers and Data Scientists.
Intro to Inspect: Open Source Framework for LLM Evals
Directory | Description |
---|---|
honeycomb | Evals for for Honeycomb Natural Langaguge Query generator |
langchain | Demonstrates evaluating a LangChain Agent using Inspect |
benchmarks | Implemetnations of some popular benchmarks using Inspect |
slides | Source code and PDF version of workshop slides |
To ensure that you can run the example code in this repo, please install the dependencies using the requirements.txt file in the root directory.