Final Project for the "Data Science" course ("Computational Management of Data"), Silvio Peroni @ University of Bologna, MA "DHDK".
The goal of the project is to develop a software that enables one to process data stored in different formats and to upload them into two distinct databases to query these databases simultaneously according to predefined operations. The software must be accompanied by a document (i.e., a Jupyter notebook) describing the data to process (their main characteristics and possible issues) and how the software has been organised (name of the files, where have been defined the various Python classes, etc.).
Please use the impl.py
file to use the software in the best possible manner. It includes all the classes and functions required for the optimal functioning of the software.
While this repository still contains some parts of the previous code developed for the project, issues and mistakes were corrected by the two contributors.
- Manu Srivastava - solved issues regarding: TriplestoreDataProcessor, TriplestoreProcessor, TriplestoreQueryProcessor, and GenericQueryProcessor (main attributes and half of the methods)
- Orsola Maria Borrini - solved issues regarding: RelationalQueryProcessor, GenericQueryProcessor, and general debugging