Fast Start: Building Agent workflows with small language models with llmware

Welcome to llmware!

Set up

Install with pip3 install llmware or pip3 install 'llmware[full]', or, if you prefer, clone the GitHub repo locally, e.g., git clone git@github.com:llmware-ai/llmware.git. If you clone the repo, we recommend running the welcome_to_llmware.sh or welcome_to_llmware_windows.sh script to install all of the dependencies.
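
Once installed, a quick sanity check is to import the package and print its version; this minimal sketch uses only the standard library plus llmware itself:

```python
# verify the llmware install before starting the examples
from importlib.metadata import version

print("llmware version:", version("llmware"))

# confirm that the core model catalog imports cleanly
from llmware.models import ModelCatalog  # noqa: F401
```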

Platforms:

  • Mac M1/M2/M3, Windows, Linux (Ubuntu 20 or Ubuntu 22 preferred)
  • RAM: 16 GB minimum (32 GB recommended)
  • Python 3.9, 3.10, 3.11, 3.12

What is an Agent in llmware?

There are a lot of different industry definitions of an Agent or an agent-based process. Our implementation focuses specifically on building multi-step, multi-model workflows that can be instantiated and run entirely locally or in a self-hosted manner. We use small, specialized models as "tools" that can be easily stacked together to build more complex pipelines consisting of multiple LLM calls along with other processing logic.
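
As a rough illustration of this "stacking tools" idea (example 5 below covers it properly), here is a minimal sketch using the LLMfx agent class from llmware.agents; the tool names and report method are assumptions drawn from the SLIM model family and the library's published examples, so treat it as a sketch rather than a definitive recipe:

```python
from llmware.agents import LLMfx

text = ("The stock dropped 12% after the earnings call, and management "
        "warned that supply-chain issues would persist into next quarter.")

# create an agent and load a passage of text as its current 'work'
agent = LLMfx()
agent.load_work(text)

# stack two small specialized models as tools
agent.load_tool("sentiment")
agent.load_tool("topics")

# each call runs one specialized model against the loaded work
agent.sentiment()
agent.topics()

# gather the structured outputs from the multi-model workflow
report = agent.show_report()
print(report)
```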

In short, we see agents as the way to evolve beyond simple chatbots, to start using LLMs to unlock enterprise process automation, and to integrate LLMs safely, securely and cost-effectively into private enterprise workflows.

The examples below walk you through the basics of using models in llmware, and then show how to compose more complex applications by combining different models and related tools.

There are 15 examples, designed to be followed step-by-step, but each is self-contained, so feel free to jump into any example, in any order you prefer.

Each example has been designed to be "copy-paste" and RUN with lots of helpful comments and explanations embedded in the code samples.

Examples:

  1. Start here - start downloading and running question-answering and function-calling models in minutes (see the sketch after this list for a preview of the pattern).

  2. llmware_sampler_bling_dragon - get started with BLING and DRAGON models for high-quality, fact-based inferencing.

  3. using-slim-extract - start using function-calling small specialized models for extracting information from documents.

  4. using-slim-summary - start using function-calling small specialized models for summarizing information.

  5. agent-llmfx - build your first agent process and run it all locally.

  6. agent-multistep-process - a second example of a multi-step agent process to analyze, classify and extract information from a complex document.

  7. using-whisper - voice transcription to text in minutes, running locally with whisper-cpp.

  8. using-phi-3-function-calls - using phi3-mini for various function call processes.

  9. summarize_document - summarizing a larger document by processing it in chunks and then assembling the results.

  10. semantic similarity ranking - using a semantic reranker to filter and build relevant text chunks from larger documents.

  11. gguf_streaming - use the stream interface to stream text for larger generations.

  12. web_services - integrate web services with three distinct function-calling models to build a complex research report.

  13. text-2-sql - convert natural language queries into SQL and extract information from structured databases.

  14. rag-instruct-benchmark-tester - script for building rag benchmark performance tests.

  15. using-rag-benchmark-scores - how to access and filter models by ranking accuracy on the benchmark test.
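
As a preview of the pattern in example 1, here is a minimal sketch of loading a local model, asking a question grounded in a context passage, and making a structured function call with a SLIM model; the model names come from the llmware catalog, and the response keys are assumptions based on the library's usual dictionary output:

```python
from llmware.models import ModelCatalog

# load a small question-answering model (downloads on first use)
model = ModelCatalog().load_model("bling-answer-tool")

context = ("The services agreement was signed on March 3, 2023 "
           "between ABC Corp and XYZ Inc.")
response = model.inference("What is the date of the agreement?",
                           add_context=context)
print(response["llm_response"])

# function-calling SLIM models return structured output instead of free text
slim = ModelCatalog().load_model("slim-sentiment-tool")
fc = slim.function_call("The quarterly results were a big disappointment.")
print(fc["llm_response"])  # e.g., {"sentiment": ["negative"]}
```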

After completing these 15 examples, you should have a good foundation and set of recipes to start exploring the other 100+ examples in the /examples folder, and build more sophisticated LLM-based applications.

Models

  • All of these examples are optimized for using local CPU-based models, primarily BLING, DRAGON and SLIM models.
  • If you want to substitute any other model from the catalog, it is generally as easy as switching the model_name. If the model requires an API key, the examples show how to pass it as an environment variable (see the sketch below).
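
A hedged sketch of that swap, assuming a catalog model name and an illustrative environment-variable name (check each example for the exact key it expects):

```python
import os
from llmware.models import ModelCatalog

# local CPU model from the catalog -- no API key required
model = ModelCatalog().load_model("bling-phi-3-gguf")

# to substitute an API-based model, change the name and pass a key;
# the model and env-variable names below are illustrative only
# model = ModelCatalog().load_model("gpt-4",
#                                   api_key=os.environ.get("OPENAI_API_KEY"))
```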

Local & Private - All of the processing will take place locally on your laptop.

This is an ongoing initiative to provide easy-to-get-started tutorials - we welcome and encourage feedback, as well as contributions of examples and other tips to help others on their LLM application journey!

Let's get started!