t5s

title	emoji	colorFrom	colorTo	sdk	app_file	pinned
t5s	💯	yellow	red	streamlit	app.py	false

t5s

T5 Summarisation Using Pytorch Lightning, DVC, DagsHub and HuggingFace Spaces

Here you will find the code for the project, but also the data, models, pipelines and experiments. This means that the project is easily reproducible on any machine, but also that you can contribute data, models, and code to it.

Have a great idea for how to improve the model? Want to add data and metrics to make it more explainable/fair? We'd love to get your help.

Blog: https://dagshub.com/blog/machine-summarization-an-open-data-science-project/

Installation

To use and run the DVC pipeline install the t5s package

pip install t5s

Usage

Firstly we need to clone the repo containing the code so we can do that using:

t5s clone

We would then have to create the required directories to run the pipeline

t5s dirs

Now to define the parameters for the run we have to run:

t5s start [-h] [-d DATASET] [-s SPLIT] [-n NAME] [-mt MODEL_TYPE]
                 [-m MODEL_NAME] [-e EPOCHS] [-lr LEARNING_RATE]
                 [-b BATCH_SIZE]

Then we need to pull the models from DVC

t5s pull

Now to run the training pipeline we can run:

t5s run

Before pushing make sure that the DVC remote is setup correctly:


dvc remote modify origin url https://dagshub.com/{user_name}/summarization.dvc
dvc remote modify origin --local auth basic
dvc remote modify origin --local user {user_name}
dvc remote modify origin --local password {your_token}

Finally to push the model to DVC

t5s push

To push this model to HuggingFace Hub for inference you can run:

t5s upload

Next if we would like to test the model and visualise the results we can run:

t5s visualize

And this would create a streamlit app for testing

Name		Name	Last commit message	Last commit date
Latest commit History 1,012 Commits
.github		.github
docs		docs
notebooks		notebooks
references		references
reports		reports
src		src
t5s		t5s
.dvcignore		.dvcignore
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
app.py		app.py
data.dvc		data.dvc
data_params.yml		data_params.yml
dvc.lock		dvc.lock
dvc.yaml		dvc.yaml
model_params.yml		model_params.yml
params.yml		params.yml
requirements.txt		requirements.txt
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

t5s

Installation

Usage

About

Releases

Sponsor this project

Packages

Languages

License

gagan3012/T5-Summarization

Folders and files

Latest commit

History

Repository files navigation

t5s

Installation

Usage

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages