Tags: mozilla/translations
Update the training guide (#239)

* Update training guide
* Fix docs
* Add index file
* Remove header
* Fix docs link
* Remove tensorboard section
* Add theme
* Update navigation
* Add logo
* Use absolute links
* Fix code links
* Fix code links
* Fix link
* Clarify what config is
* Fix note for bicleaner
  Co-authored-by: Marco Castelluccio <mcastelluccio@mozilla.com>
* Fix typo
  Co-authored-by: Greg Tatum <gregtatum@users.noreply.github.com>
* Fix link
* Fix mentioning of Marian
  Co-authored-by: Greg Tatum <gregtatum@users.noreply.github.com>
* Remove "my"
* Make note about snakemake more visible
* Fix phrasing
* Add link to bicleaner paper
* Add clarifications
* Add links to default training configs
* Add reference to bicleaner section
* Small fixes

---------

Co-authored-by: Marco Castelluccio <mcastelluccio@mozilla.com>
Co-authored-by: Greg Tatum <gregtatum@users.noreply.github.com>

Minor improvements (#20)

- Flores dataset importer
- custom dataset importer
- ability to use a pre-trained backward model
- save experiment config on start
- stubs for dataset caching (decided to sync implementation with workflow manager integration)
- use best bleu models instead of best ce-mean-words
- fix linting warnings

Bicleaner support + fixes (#13)

- SacreBLEU is a regular importer now and evaluation is not limited to sacrebleu datasets. fixes
- Added bicleaner-ai and bicleaner filtering (one or another based on available pretrained language packs). fixes
- Added script to find all datasets based on language pair and importer type, ready to use in config fixes
- Fixed conda environment activation to be reproducible on GCP
- Other minor reproducibility fixes