Skip to content

Transforms the Project Gutenberg metadata set into Linked Data and OBDA-ready

License

Notifications You must be signed in to change notification settings

alexdma/gutenberg-ld

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gutenberg-LD

This is a suite of scripts and models that refactor the metadata set of the Project Gutenberg digital library, with the aim of turning it into a proper Linked Data set.

Features

  • Reconciliation of blank nodes, resulting in a much smaller dataset (~29% smaller as of March 2020)
  • Linking with Library of Congress subject headings and classification systems
  • Structuring of Table Of Contents data
  • Ontology alignment of undocumented Gutenberg terms

Requirements

You need:

Usage

  1. Download the metadata set from Gutenberg and load it onto your RDF store.
  2. cd gutenberg-fixes
  3. In settings.py set the SPARQL service and RDF graph name
  4. python refactor.py bookshelves formats toc (or a subset of the three arguments)
  5. in gutenberg-fixes/queries you can find other SPARQL queries to run by yourselves.

Licensing

Gutenberg-LD is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.

About

Transforms the Project Gutenberg metadata set into Linked Data and OBDA-ready

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages