GitHub - NauelSerraino/BERT-Gender-Bias: Tool to quantify gender bias of words.

How to quantify gender bias in word embeddings?

This Machine Learning tool helps measure and visualize gender bias in word embeddings.

The tool is based on a pipeline that uses BERT embeddings as a starting point. The pipeline is composed by the following modules:

Logistic Regression - l1 regularization
PCA
Support Vector Classifier

The tool is able provide a visualization instrument to analyze how much word is biased towards male or female gender, here is an example of the visualization:

Performance Metrics:

Metric	Value
C (Logistic Regression)	0.175
C (SVM)	0.375
Accuracy	0.7786
Number of Selected Features (pre-PCA)	38

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
output		output
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to quantify gender bias in word embeddings?

About

Releases

Packages

Languages

License

NauelSerraino/BERT-Gender-Bias

Folders and files

Latest commit

History

Repository files navigation

How to quantify gender bias in word embeddings?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages