CLDF (Cross-Linguistic Data Formats) is a specification describing how to store cross-linguistic data, that is, data about languages, typically many of them, in a way that maximizes reusability. pycldf implements this specification and provides tools to manipulate and validate CLDF datasets. It builds on csvw, which implements the underlying CSVW (CSV on the Web) specification.
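To make the "CSV files plus CSVW metadata" idea concrete, here is a minimal sketch using only the Python standard library. It is not pycldf's API and it performs no validation. The dataset contents (`lang1`, `word_order`, `SOV`) and the helper names `write_toy_dataset` and `read_table` are made up for illustration.

```python
import csv
import json
import tempfile
from pathlib import Path

# Namespace of the CLDF ontology terms referenced from the metadata.
CLDF_TERMS = "http://cldf.clld.org/v1.0/terms.rdf#"


def write_toy_dataset(directory):
    """Write a tiny, made-up CLDF-style dataset; return the metadata path."""
    metadata = {
        "@context": ["http://www.w3.org/ns/csvw", {"@language": "en"}],
        "dc:conformsTo": CLDF_TERMS + "StructureDataset",
        "tables": [
            {
                "url": "values.csv",
                "dc:conformsTo": CLDF_TERMS + "ValueTable",
                "tableSchema": {
                    "columns": [
                        {"name": "ID", "propertyUrl": CLDF_TERMS + "id"},
                        {"name": "Language_ID",
                         "propertyUrl": CLDF_TERMS + "languageReference"},
                        {"name": "Parameter_ID",
                         "propertyUrl": CLDF_TERMS + "parameterReference"},
                        {"name": "Value", "propertyUrl": CLDF_TERMS + "value"},
                    ]
                },
            }
        ],
    }
    metadata_path = directory / "StructureDataset-metadata.json"
    metadata_path.write_text(json.dumps(metadata, indent=2), encoding="utf-8")

    # The data itself lives in a plain CSV file next to the metadata.
    with (directory / "values.csv").open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["ID", "Language_ID", "Parameter_ID", "Value"])
        writer.writerow(["1", "lang1", "word_order", "SOV"])  # hypothetical rows
        writer.writerow(["2", "lang2", "word_order", "SVO"])
    return metadata_path


def read_table(metadata_path, component):
    """Find the table declared as the given CLDF component and read its rows."""
    metadata = json.loads(metadata_path.read_text(encoding="utf-8"))
    for table in metadata["tables"]:
        if table.get("dc:conformsTo") == CLDF_TERMS + component:
            csv_path = metadata_path.parent / table["url"]
            with csv_path.open(newline="", encoding="utf-8") as f:
                return list(csv.DictReader(f))
    raise KeyError(component)


with tempfile.TemporaryDirectory() as tmp:
    metadata_path = write_toy_dataset(Path(tmp))
    rows = read_table(metadata_path, "ValueTable")

for row in rows:
    print(row["Language_ID"], row["Parameter_ID"], row["Value"])
```

The sketch looks a table up by the component it declares rather than by its file name, which is what lets tools treat differently named files uniformly. For real datasets, pycldf is the better choice, since it also handles validation, column datatypes and references between tables.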
This GitHub organization collects repositories containing:
- documentation about CLDF or
- tools to work with CLDF
For collections of repositories holding CLDF data, see cldf-datasets, lexibank, or dictionaria, or search Zenodo.
Many CLDF datasets are deposited on Zenodo for long-term access. Such datasets can be retrieved programmatically using cldfzenodo.