This project focuses on fine-tuning large language models (LLMs) like GPT-3.5-turbo and Llama-3-8B on Google's Synthetic Persona Chat dataset to enhance persona consistency in chatbots.
The main objective of this project is to create chatbots that maintain a consistent persona throughout conversations. The models were fine-tuned using Google's Synthetic Persona Chat dataset and evaluated using various metrics such as BLEU, ROUGE, BERTScore, and perplexity. The fine-tuned models demonstrated significant improvements in persona coherence and overall performance compared to baseline models.
- Fine-tuning of LLMs: Utilized GPT-3.5-turbo and Llama-3-8B for fine-tuning on persona chat datasets.
- Evaluation Metrics: Models were assessed using BLEU, ROUGE, BERTScore, and perplexity metrics.
- Enhanced Persona Consistency: Achieved improved persona coherence in chatbot responses.
- Quantitative and Qualitative Analysis: Both automated metrics and human evaluations were used to assess model performance.
We used Google's Synthetic Persona Chat dataset, which provides high-quality conversational data with predefined personas. This dataset ensures that the generated conversations are faithful to the user personas.
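For illustration, here is a minimal sketch of loading the dataset, assuming it is published on the Hugging Face Hub under the identifier `google/Synthetic-Persona-Chat`; the split and column names shown are assumptions and may differ from the actual release.

```python
from datasets import load_dataset

# Assumed Hugging Face Hub identifier for Google's Synthetic Persona Chat dataset.
dataset = load_dataset("google/Synthetic-Persona-Chat", split="train")

# Each record pairs two user personas with a synthetic multi-turn conversation
# (column names below are assumptions, not confirmed by the project).
example = dataset[0]
print(example["user 1 personas"])
print(example["Best Generated Conversation"][:300])
```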
- Data Preparation: Converted the training data into JSONL format suitable for model fine-tuning (a sketch covering this step and the persona-injection prompt follows this list).
- Prompt Engineering: Developed effective prompts for persona injection.
- Model Training: Fine-tuned GPT-3.5-turbo and Llama-3-8B on the prepared dataset (a sketch of launching the GPT-3.5-turbo job also follows below).
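As a concrete illustration of the data-preparation and prompt-engineering steps, the sketch below writes OpenAI-style chat fine-tuning records to JSONL with the persona injected into the system message. The helper names, prompt wording, and field layout are assumptions for illustration, not the project's exact code.

```python
import json

def build_record(personas, user_turn, assistant_turn):
    """Wrap one exchange in the chat fine-tuning format, injecting the persona
    into the system message (prompt wording here is an assumption)."""
    system_prompt = (
        "You are a chatbot with the following persona. "
        "Stay consistent with it in every reply.\n" + "\n".join(personas)
    )
    return {
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_turn},
            {"role": "assistant", "content": assistant_turn},
        ]
    }

def write_jsonl(examples, path="train.jsonl"):
    """examples: iterable of (personas, user_turn, assistant_turn) tuples."""
    with open(path, "w", encoding="utf-8") as f:
        for personas, user_turn, assistant_turn in examples:
            f.write(json.dumps(build_record(personas, user_turn, assistant_turn)) + "\n")

# Example usage with a toy exchange (placeholder data):
write_jsonl([
    (["I have a golden retriever.", "I live in Los Angeles."],
     "Do you have any pets?",
     "Yes! I have a golden retriever; we go hiking around Los Angeles."),
])
```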
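And a sketch of launching the GPT-3.5-turbo fine-tuning job from that JSONL file with the OpenAI Python client (v1 API); the file name and default hyperparameters are placeholders.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the prepared JSONL training file.
training_file = client.files.create(
    file=open("train.jsonl", "rb"),
    purpose="fine-tune",
)

# Start the fine-tuning job on gpt-3.5-turbo.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id, job.status)
```

For Llama-3-8B, the analogous step would be parameter-efficient fine-tuning (for example, LoRA with Hugging Face peft/trl) on the same prepared data; that training configuration is not reproduced here.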
The models were evaluated using the following metrics:
- BLEU: Measures n-gram overlap between generated responses and reference responses (originally developed for machine translation).
- ROUGE: Assesses n-gram and longest-common-subsequence overlap between generated and reference texts.
- BERTScore: Evaluates semantic similarity using contextual embeddings.
- Perplexity: Measures how well the model predicts the reference text; lower values indicate better fluency and fit.
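To make the metrics above concrete, here is a minimal evaluation sketch using common open-source implementations (sacrebleu, rouge_score, and bert_score); the candidate/reference strings are placeholders, and this is an illustration rather than the project's exact evaluation harness.

```python
import sacrebleu
from rouge_score import rouge_scorer
from bert_score import score as bert_score

candidates = ["I love hiking with my golden retriever on weekends."]
references = ["I enjoy hiking with my dog every weekend."]

# BLEU: corpus-level n-gram overlap against the references.
bleu = sacrebleu.corpus_bleu(candidates, [references])

# ROUGE-L: longest-common-subsequence overlap per candidate/reference pair.
scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
rouge_l = scorer.score(references[0], candidates[0])["rougeL"].fmeasure

# BERTScore: semantic similarity from contextual embeddings.
precision, recall, f1 = bert_score(candidates, references, lang="en")

print(f"BLEU: {bleu.score:.2f}  ROUGE-L: {rouge_l:.3f}  "
      f"BERTScore-F1: {f1.mean().item():.3f}")
```

Perplexity is reported separately as the exponential of the average negative log-likelihood the model assigns to the reference responses; it requires access to the model's token probabilities and is not shown here.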
The fine-tuned models outperformed the baseline models on both automated metrics and human evaluations. The results highlighted the effectiveness of our fine-tuning approach in enhancing persona coherence in chatbots.
- Cost of Fine-tuning: Fine-tuning on OpenAI's GPT-3.5-turbo was expensive, suggesting a need for more budget-friendly solutions.
- Prompt Engineering: Further research is needed to optimize prompt engineering techniques.
- Evaluation Metrics: Current metrics do not fully align with human judgment, indicating a need for better automated evaluation methods.
- Personalizing Dialogue Agents: I have a dog, do you have pets too?
- Faithful Persona-based Conversational Dataset Generation with Large Language Models
- ParlAI: A Framework for Training and Evaluating AI Models on a Variety of Conversational Tasks
- Harshil Bhullar - hbhullar18@g.ucla.edu
- Siddarth Chalasani - siddarth@cs.ucla.edu
- Krish Patel - knpatel@g.ucla.edu