Robo Reviews - Automated Product Review Generation

This project was part of a 9-week training course I attended in 2024.

Bootcamp: Ironhack AI Engineering bootcamp
Date: September to November 2024
Project topics: Data processing, Clustering, Sentiment Analysis, Generative AI, Prompt FineTuning, Sklearn, LLMs, Mistral7B

Final Grade and teacher's feedback:

- Presentation a bit rushed, you spent 3 minutes just with the intro, and the interesting parts were at the end.
- Great choices, very smart decisions and combination of techniques
- Excellent visual evaluation of the clustering model, which was probably the hardest one as there were no real labels
- Preprocessing and modeling was very good, with lots of experimentation
- Good README
- The code is well structured in notebooks, with clear sections, and very readable

Grade: 11.40 / 12

Robo Reviews - Automated Product Review Generation

Robo Reviews is a project designed to process, analyze, and summarize product reviews. It implements 3 different transformer models:

A clustering model to group products into categories
A sentiment analysis model to classify positive, neutral and negative reviews
A text generation model to produce insightful product reviews based on recurring user feedback.

The tool provides insights into the most frequently mentioned pros and cons, helping users quickly understand the strengths and weaknesses of various products.

Project Results

I successfully implemented the 3 models mentioned above.

At the end of this project, I was able to input a dataset of amazon product reviews, clusterize them into 6 main categories and classify the main review sentiment (positive, neutral, negative):

Then I leveraged prompt-engineering using Mistral7B (4bits quant.) to generate an HTML review of the best rated product in a given category based on a subset of data.

Project Setup

Follow these steps to clone the repository and create a virtual environment:

git clone https://github.com/alexdjulin/ik_robo_reviews.git
cd ik_robo_reviews
python3 -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate
pip install -r requirements.txt

Files Description

Notebooks

1_dataset_review.ipynb: Jupyter notebook for initial dataset exploration and review analysis. Includes preprocessing of the review data.
2a_category_clustering_SimilarityClustering.ipynb: Notebook implementing similarity-based clustering to categorize products based on review embeddings.
2b_category_clustering_UnsupervisedLearning.ipynb: Notebook for unsupervised clustering methods, including KMeans, to categorize products into specific groups.
3_sentiment_analysis_model.ipynb: Notebook exploring models fine-tuned on review sentiments. Then fine-tuning a DistilBERT model for sentiment analysis and evaluates the model’s performance on the dataset.
4_products_score.ipynb: Notebook scoring products based on the number of positive, neutral, and negative reviews, and to identify the best products per category.
5_generator.ipynb: Notebook exploring prompt fine-tuning on Mistral 7B to construct a structured product description from user reviews.
demo1_gradio.ipynb: Gradio demo for clustering and sentiment analysis models.
demo2_review.ipynb: Gradio demo for summarizing product reviews based on input data, showcasing the model’s review summarization capabilities.

Custom Modules

helpers.py: A helper module that contains various utility functions to load data, preprocess text, and interact with models.
prompting.py: Contains code for generating prompts for the models, including text generation and summarization prompts.

Documents

robo_reviews_presentation.pdf: Final presentation of the project

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
notebooks		notebooks
readme		readme
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Robo_Reviews_presentation.pdf		Robo_Reviews_presentation.pdf
helpers.py		helpers.py
prompting.py		prompting.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robo Reviews - Automated Product Review Generation

Project Results

Project Setup

Files Description

Notebooks

Custom Modules

Documents

About

Releases

Packages

Languages

License

alexdjulin/ik-robo-reviews-generator

Folders and files

Latest commit

History

Repository files navigation

Robo Reviews - Automated Product Review Generation

Project Results

Project Setup

Files Description

Notebooks

Custom Modules

Documents

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages