Apertium

Free/open-source platform for developing rule-based machine translation systems and language technology

Apertium is a rule-based machine translation toolchain and ecosystem, with many of our tools based on finite-state transducers.

Our language-agnostic tools are native and written in C++. The various development helpers are mostly in Python.

Our language data is in various formats, including XML and other human-editable texts. Language data is split into single-language packages that can analyse and generate a given language, and translation pairs that perform transfer and transformation between two languages. The single-language packages are shared amongst many pairs.
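The split between single-language packages and translation pairs can be pictured with a toy sketch. This is only an illustration under loud assumptions: real Apertium data is compiled into finite-state transducers rather than Python dictionaries, and every name below (`ENG_ANALYSER`, `SPA_GENERATOR`, `ENG_SPA_PAIR`, `translate`) is hypothetical and not part of any Apertium tool.

```python
# Toy illustration only: plain dicts stand in for compiled transducers,
# and all names are hypothetical, not Apertium APIs.

# Single-language "package" for the source language:
# surface form -> (lemma, grammatical tags)
ENG_ANALYSER = {
    "cat": ("cat", ("n", "sg")),
    "cats": ("cat", ("n", "pl")),
}

# Single-language "package" for the target language:
# (lemma, grammatical tags) -> surface form
SPA_GENERATOR = {
    ("gato", ("n", "sg")): "gato",
    ("gato", ("n", "pl")): "gatos",
}

# Translation "pair": maps source lemmas to target lemmas.
ENG_SPA_PAIR = {"cat": "gato"}


def translate(word):
    """Analyse a word, transfer its lemma, then generate the target form."""
    lemma, tags = ENG_ANALYSER[word]           # analysis (source package)
    target_lemma = ENG_SPA_PAIR[lemma]         # transfer (translation pair)
    return SPA_GENERATOR[(target_lemma, tags)] # generation (target package)


print(translate("cats"))
```

Because the analyser and generator are separate from the pair, the same single-language data could in principle be reused by another pair, which is the point of sharing packages. Every lookup is a fixed table access, so the same input always gives the same output.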

If you wish to contribute to the language-agnostic native tools, you'll need to know C++.

If you wish to contribute language data to Apertium, your contributions should fit into our existing pipeline: they should be rule-based and deterministic. We will happily help you learn our formats and methods, and we know from experience that it is possible to learn and use Apertium in a short time.

We do not currently include any statistical or neural machine translation tools or methods. We are often asked if contributions can be made with statistical or neural systems, but for now they cannot.

For more information about how to contribute, see Contributing.

Pinned

  1. organisation Public

    Second point of contact for all things Apertium

    TeX 19 5

  2. apertium-trunk Public

    Released Apertium translation pairs

    29 5

  3. apertium-languages Public

    Released Apertium language modules

    39 9

  4. apertium-staging Public

    Unreleased Apertium language modules and pairs with extensive development

    4 1

  5. apertium-nursery Public

    Unreleased Apertium language modules and pairs with some development

    4 2

  6. apertium-tools Public

    Apertium tools

    19 1

Repositories

Showing 10 of 631 repositories
  • lexd Public

    A lexicon compiler for non-suffixational morphologies

    C++ 11 GPL-3.0 3 1 2 Updated Dec 26, 2024
  • apertium-recursive Public

    Recursive structural transfer module for Apertium

    C++ 6 GPL-3.0 4 38 2 Updated Dec 26, 2024
  • apertium-lex-tools Public

    Module for compiling lexical selection rules and processing them in the pipeline.

    JavaScript 5 GPL-3.0 12 4 2 Updated Dec 26, 2024
  • apertium Public

    Core tools (driver script, transfer, tagger, formatters) for the FOSS RBMT system Apertium

    C++ 94 GPL-2.0 27 50 (2 issues need help) 3 Updated Dec 26, 2024
  • apertium-sme-sma Public

    Apertium translation pair for Northern Sami and Southern Sami

    XML 1 GPL-3.0 0 0 0 Updated Dec 23, 2024
  • apertium-nno-nob Public

    Apertium translation pair for Norwegian Nynorsk and Norwegian Bokmål

    XML 22 GPL-2.0 9 1 0 Updated Dec 22, 2024
  • lttoolbox Public

    Finite state compiler, processor and helper tools used by apertium

    C++ 18 GPL-2.0 23 35 (1 issue needs help) 5 Updated Dec 20, 2024
  • apertium-nob Public

    Apertium linguistic data for Norwegian Bokmål

    XML 8 GPL-2.0 3 0 0 Updated Dec 20, 2024
  • apertium-nno Public

    Apertium linguistic data for Norwegian Nynorsk

    XML 4 GPL-2.0 2 0 0 Updated Dec 20, 2024
  • apertium-cat Public

    Apertium linguistic data for Catalan

    Shell 9 GPL-2.0 6 6 0 Updated Dec 19, 2024