DeTexD: A Benchmark Dataset for Delicate Text Detection

This is the official repository for the DeTexD paper. It contains the scripts used in the paper to evaluate models.

See also: DeTexD dataset, detexd-roberta-base model.

Install

pip install -r requirements.txt

Usage

Run evaluate_detexd_roberta.py to reproduce the results of the published model (grammarly/detexd-roberta-base) on the published dataset (grammarly/detexd-benchmark).
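
For orientation, here is a minimal sketch of the kind of binary metric computation an evaluation like this typically reports. It is not taken from evaluate_detexd_roberta.py. It assumes the model yields one probability per class and that some subset of classes counts as "delicate". The helper names (binary_score, precision_recall_f1) and the label names in the usage note are hypothetical placeholders, so check the script and the model card for the actual mapping and threshold.

```python
from typing import Dict, Iterable, Sequence, Tuple


def binary_score(class_probs: Dict[str, float], positive_labels: Iterable[str]) -> float:
    """Collapse per-class probabilities into a single score.

    The score is the summed probability of the classes treated as positive.
    Which classes are positive is an assumption supplied by the caller.
    """
    positive = set(positive_labels)
    return sum(p for label, p in class_probs.items() if label in positive)


def precision_recall_f1(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Guard against empty denominators so the function never divides by zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return precision, recall, f1
```

A typical use would be to threshold binary_score for each text (for example, predicting 1 when the score exceeds 0.5, with positive_labels set to placeholder names such as "LABEL_3") and pass the resulting predictions, together with the gold labels, to precision_recall_f1.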

Run founta_basile_comparison.ipynb to reproduce the model comparison results from the paper. Note that you need to acquire the datasets yourself because they are distributed under separate licenses.

Run country_bias.ipynb to reproduce the country bias analysis.

Run compare_hatebert.ipynb to reproduce the HateBERT model comparison.
