Trust and Safety with LLMs

The Guardrails service enhances the trustworthiness, safety, and security of LLM-based applications by offering a suite of safety-focused microservices. A minimal example of calling one of these microservices is sketched after the table.

| MicroService | Description |
| ------------ | ----------- |
| Llama Guard | Provides guardrails for inputs and outputs to ensure safe interactions, using Llama Guard |
| WildGuard | Provides guardrails for inputs and outputs to ensure safe interactions, using WildGuard |
| PII Detection | Detects Personally Identifiable Information (PII) and Business Sensitive Information (BSI) |
| Toxicity Detection | Detects toxic language (rude, disrespectful, or unreasonable language likely to make someone leave a discussion) |
| Bias Detection | Detects biased language (framing bias, epistemological bias, and demographic bias) |
| Prompt Injection Detection | Detects malicious prompts that attempt to make the system running an LLM execute the attacker's intentions |
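The sketch below shows how an application might screen text through a guardrails microservice over HTTP. The endpoint URL, port, and request/response fields are assumptions for illustration only; refer to each microservice's README for its actual API.

```python
# Illustrative client for a guardrails microservice.
# The URL, port, path, and payload schema below are assumed for demonstration.
import requests

GUARDRAILS_URL = "http://localhost:9090/v1/guardrails"  # assumed host/port/path


def screen_text(text: str) -> dict:
    """Send user or model text to the guardrails service and return its verdict."""
    payload = {"text": text}  # assumed request schema
    response = requests.post(GUARDRAILS_URL, json=payload, timeout=30)
    response.raise_for_status()
    return response.json()


if __name__ == "__main__":
    verdict = screen_text("How do I make a dangerous chemical at home?")
    print(verdict)  # e.g. a safe/unsafe classification returned by the service
```

In a typical deployment, a call like this is placed both before the prompt reaches the LLM and after the LLM produces a response, so unsafe content is caught in either direction.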

Additional safety-related microservices will be available soon.