Releases: hipagesgroup/data-ml-utils
Add Databricks SQL connector functionality
This release covers the following:
- add Databricks SQL connector functionality so that queries can be run from a local machine
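
For orientation, here is a minimal sketch of querying Databricks from a local machine with the open-source `databricks-sql-connector` package. It is not necessarily how data-ml-utils wraps the connector: the environment variable names and the `connection_kwargs` and `run_query` helpers are hypothetical and only illustrate the three settings the connector needs.

```python
import os


def connection_kwargs(env=None):
    """Collect the connector settings from environment variables.

    The variable names are assumptions for this sketch, not names
    defined by data-ml-utils.
    """
    env = os.environ if env is None else env
    return {
        "server_hostname": env["DATABRICKS_SERVER_HOSTNAME"],
        "http_path": env["DATABRICKS_HTTP_PATH"],
        "access_token": env["DATABRICKS_TOKEN"],
    }


def run_query(query, env=None):
    """Run one SQL statement and return all rows."""
    # Imported lazily so this module loads even without the package.
    # Install with: pip install databricks-sql-connector
    from databricks import sql

    with sql.connect(**connection_kwargs(env)) as connection:
        with connection.cursor() as cursor:
            cursor.execute(query)
            return cursor.fetchall()
```

With the three variables exported in the shell, `run_query("SELECT 1")` would return the result rows.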
v0.3.8
This release covers the following:
- remove all AWS EMR and SageMaker related utils
- add Databricks utils
- add MLflow utils
- add common requests functions for our endpoint integration tests
v0.2.9
This release includes the following patch fix:
- change dependency versions
v0.2.8
This release includes the following patch fix:
- change autoscaling `ondemand` limit to 0 so that only spot ec2 instances are spun up
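
As an illustration of what an on-demand limit of 0 looks like, the sketch below builds a policy in the shape accepted by boto3's EMR `put_managed_scaling_policy`. It assumes the cluster uses EMR managed scaling, which these notes do not confirm. The function name is hypothetical, and the 60/120 defaults are taken from the v0.2.7 notes.

```python
def spot_only_scaling_policy(min_units=60, max_units=120):
    """Build a managed scaling policy that allows no on-demand capacity.

    Assumption: the cluster scales via EMR managed scaling and counts
    capacity in instances.
    """
    return {
        "ComputeLimits": {
            "UnitType": "Instances",
            "MinimumCapacityUnits": min_units,
            "MaximumCapacityUnits": max_units,
            # 0 on-demand units leaves the scaling range to spot instances.
            "MaximumOnDemandCapacityUnits": 0,
        }
    }


# Usage with a boto3 EMR client (not executed here):
# emr.put_managed_scaling_policy(
#     ClusterId=cluster_id,
#     ManagedScalingPolicy=spot_only_scaling_policy(),
# )
```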
v0.2.7
This release includes the following patch fix:
- add EMR version as a function parameter to prevent hardcoding of versions
- add auto scaling for the number of ec2 instances in the cluster; `min` 60 and `max` 120
- change the logic that checks for the master DNS address to send a polling request every 15 minutes
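
The polling check can be sketched roughly as follows. This is an assumption-laden illustration, not the library's code: `wait_for_master_dns` and its parameters are hypothetical, and `describe_cluster` is expected to behave like the boto3 EMR client method of the same name, whose response carries `Cluster.MasterPublicDnsName` once the address is assigned.

```python
import time


def wait_for_master_dns(describe_cluster, cluster_id,
                        interval_s=15 * 60, max_attempts=4,
                        sleep=time.sleep):
    """Poll until the cluster reports a master DNS address.

    describe_cluster: callable shaped like boto3's emr.describe_cluster.
    interval_s: seconds between polls (15 minutes, per the note above).
    max_attempts: assumed cap so the loop cannot run forever.
    """
    for attempt in range(max_attempts):
        cluster = describe_cluster(ClusterId=cluster_id)["Cluster"]
        dns = cluster.get("MasterPublicDnsName")
        if dns:
            return dns
        # Sleep only between attempts, not after the last one.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(
        f"cluster {cluster_id} has no master DNS after {max_attempts} polls"
    )
```

Passing the client method directly, e.g. `wait_for_master_dns(emr.describe_cluster, cluster_id)`, keeps the loop easy to test with a fake callable.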
v0.2.6
This release includes the following patch fix:
- shortening spot instances timeout duration to `5` minutes
- increasing the time to check if the EMR cluster has a master DNS address to `10.5` minutes
v0.2.4
This release includes the following:
- Fix the bug to correctly store `int` and `float` data types as `numpy int` and `numpy float` when querying and storing data as a pandas dataframe.
v0.2.3
This release includes the following:
- loosening dependency requirements to be compatible with other ml repos
v0.2.2
This release includes the following:
- fixes EMR pytests to use the botocore stubber; `moto` throws a `malformatted inputs` error, and the cause is not yet known
v0.2.1
This release includes the following:
- pyathena client query engine
- boto3 client for EMR
- boto3 client for AWS Sagemaker Model Registry
- minor bug fixes and code optimisation for `pyathena`
- fix up requirements