Skip to content

APPFL/APPFL

Repository files navigation

APPFL logo

APPFL - Advanced Privacy-Preserving Federated Learning Framework.

discord DOI Doc Build pre-commit APPFL APPFL-Advance

APPFL, Advanced Privacy-Preserving Federated Learning, is an open-source and highly extensible software framework that allows research communities to implement, test, and validate various ideas related to privacy-preserving federated learning (FL), and deploy real FL experiments easily and safely among distributed clients to train more robust ML models.With this framework, developers and users can easily

  • Train any user-defined machine learning model on decentralized data with optional differential privacy and client authentication.
  • Simulate various synchronous and asynchronous PPFL algorithms on high-performance computing (HPC) architecture with MPI.
  • Implement customizations in a plug-and-play manner for all aspects of FL, including aggregation algorithms, server scheduling strategies, and client local trainers.

Documentation: please check out our documentation for tutorials, users guide, and developers guide.

Table of Contents

🛠️ Installation

We highly recommend creating a new Conda virtual environment and install the required packages for APPFL.

conda create -n appfl python=3.8
conda activate appfl

User installation

For most users such as data scientists, this simple installation must be sufficient for running the package.

pip install pip --upgrade
pip install "appfl[examples,mpi]"

💡 Note: If you do not need to use MPI for simulations, then you can install the package without the mpi option: pip install "appfl[examples]".

If we want to even minimize the installation of package dependencies, we can skip the installation of a few packages (e.g., matplotlib and jupyter):

pip install "appfl"

Developer installation

Code developers and contributors may want to work on the local repositofy. To set up the development environment,

git clone --single-branch --branch main https://github.com/APPFL/APPFL.git
cd APPFL
pip install -e ".[mpi,dev,examples]"

💡 Note: If you do not need to use MPI for simulations, then you can install the package without the mpi option: pip install -e ".[dev,examples]".

On Ubuntu: If the install process failed, you can try:

sudo apt install libopenmpi-dev,libopenmpi-bin,libopenmpi-doc

🧱 Technical Components

APPFL is primarily composed of the following six technical components

  • Aggregator: APPFL supports several popular algorithms to aggregate one or several client local models.
  • Scheduler: APPFL supports several synchronous and asynchronous scheduling algorithms at the server-side to deal with different arrival times of client local models.
  • Trianer: APPFL supports several client local trainers for various training tasks.
  • Privacy: APPFL supports several global/local differential privacy schemes.
  • Communicator: APPFL supports MPI for single-machine/cluster simulation, and gRPC and Globus Compute with authenticator for secure distributed training.
  • Compressor: APPFL supports several lossy compressors for model parameters, including SZ2, SZ3, ZFP, and SZx.

💡 Framework Overview

In the design of the APPFL framework, we essentially create the server agent and client agent, using the six technical components above as building blocks, to act on behalf of the FL server and clients to conduct FL experiments. For more details, please refer to our documentation.

📄 Citation

If you find APPFL useful for your research or development, please consider citing the following papers:

@article{li2024advances,
  title={Advances in APPFL: A Comprehensive and Extensible Federated Learning Framework},
  author={Li, Zilinghan and He, Shilan and Yang, Ze and Ryu, Minseok and Kim, Kibaek and Madduri, Ravi},
  journal={arXiv preprint arXiv:2409.11585},
  year={2024}
}

@inproceedings{ryu2022appfl,
  title={APPFL: open-source software framework for privacy-preserving federated learning},
  author={Ryu, Minseok and Kim, Youngdae and Kim, Kibaek and Madduri, Ravi K},
  booktitle={2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)},
  pages={1074--1083},
  year={2022},
  organization={IEEE}
}

🏆 Acknowledgements

This material is based upon work supported by the U.S. Department of Energy, Office of Science, under contract number DE-AC02-06CH11357.