[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
An up-to-date curated list of state-of-the-art research on hallucinations in large vision-language models: papers and resources
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
Dataset generation and pre-processing scripts for the research "Leveraging the Domain Adaptation of Retrieval Augmented Generation (RAG) Models in Conversational AI for Enhanced Customer Service"
This project integrates a business rules management system (BRMS) with retrieval-augmented generation (RAG) to provide an automated text generation solution that is applicable in different contexts and significantly reduces LLM hallucinations. It is a complete architecture, delivered as a chatbot and fully scalable to your needs.