Python-Spark implementation of the GMQL system
- A python environment
- Apache Spark
- The following GMQL modules exported as a jar file
- GMQL-Core
- GMQL-Server
- GMQL-Spark
- The following python libraries:
- pandas
- tqdm
- spylon
The following procedure is very stupid and not user friendly and it will be changed ASAP:
- Download this repository
- In the file init.py set the
c.jars
attribute with the paths of the GMQL jar files listed above - In the file init.py set the
c._spark_home
attribute with the path of your spark installation - Install the code with
pip install .
Your spark engine is instantiated when you call the following
import gmql as gl
In the folder tests you can find some examples of using the library