forked from FedML-AI/FedML
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[CoreEngine] add on-premise deployment files.
- Loading branch information
1 parent
2aedf33
commit 17d1452
Showing
10 changed files
with
126 additions
and
13 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,78 @@ | ||
# Local directory where your source code resides. | ||
# It should be the relative path to this job yaml file or the absolute path. | ||
# If your job doesn't contain any source code, it can be empty. | ||
workspace: hello_world | ||
|
||
# Running entry commands which will be executed as the job entry point. | ||
# If an error occurs, you should exit with a non-zero code, e.g. exit 1. | ||
# Otherwise, you should exit with a zero code, e.g. exit 0. | ||
# Support multiple lines, which can not be empty. | ||
job: | | ||
echo "current job id: $FEDML_CURRENT_RUN_ID" | ||
echo "current edge id: $FEDML_CURRENT_EDGE_ID" | ||
echo "Hello, Here is the launch platform." | ||
echo "Current directory is as follows." | ||
pwd | ||
python3 hello_world.py | ||
#sleep 20 | ||
#exit 1 | ||
#echo "Current GPU information is as follows." | ||
#nvidia-smi # Print GPU information | ||
#gpustat | ||
#echo "Download the file from http://212.183.159.230/200MB.zip ..." | ||
#wget http://212.183.159.230/200MB.zip | ||
#rm ./200MB.zip* | ||
#echo "The downloading task has finished." | ||
# echo "Training the vision transformer model using PyTorch..." | ||
# python vision_transformer.py --epochs 1 | ||
# If you want to use the job created by the MLOps platform, | ||
# just uncomment the following three, then set job_id and config_id to your desired job id and related config. | ||
#job_args: | ||
# job_id: 2070 | ||
# config_id: 111 | ||
|
||
# If you want to create the job with specific name, just uncomment the following line and set job_name to your desired job name | ||
#job_name: cv_job | ||
|
||
job_type: train # options: train, deploy, federate | ||
|
||
# train subtype: general_training, single_machine_training, cluster_distributed_training, cross_cloud_training | ||
# federate subtype: cross_silo, simulation, web, smart_phone | ||
# deploy subtype: none | ||
job_subtype: generate_training | ||
|
||
docker: | ||
image: fedml/fedml-default-launch:cu12.1-u22.04 | ||
#registry: docker.io | ||
#username: my_hub_user | ||
#password: my_hub_password | ||
#ports: [30001,3002,3003] | ||
|
||
|
||
# Bootstrap shell commands which will be executed before running entry commands. | ||
# Support multiple lines, which can be empty. | ||
bootstrap: | | ||
# pip install -r requirements.txt | ||
echo "Bootstrap finished." | ||
computing: | ||
minimum_num_gpus: 1 # minimum # of GPUs to provision | ||
maximum_cost_per_hour: $3000 # max cost per hour for your job per gpu card | ||
#allow_cross_cloud_resources: true # true, false | ||
#device_type: CPU # options: GPU, CPU, hybrid | ||
resource_type: A100-80G # e.g., A100-80G, please check the resource type list by "fedml show-resource-type" or visiting URL: https://open.fedml.ai/accelerator_resource_type | ||
|
||
data_args: | ||
dataset_name: mnist | ||
dataset_path: ./dataset | ||
dataset_type: csv | ||
|
||
model_args: | ||
input_dim: '784' | ||
model_cache_path: /Users/alexliang/fedml_models | ||
model_name: lr | ||
output_dim: '10' | ||
|
||
training_params: | ||
learning_rate: 0.004 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters