Estimate and track carbon emissions from your compute, quantify and analyze their impact.
- About CodeCarbon π‘
- Installation π - Install from PyPI repository - Install from Conda repository
- Quickstart π
- Examples π€
- Built-in Visualization Tool π
- Comet Integration π₯
- Report your emissions: LateX template π»
- Infrastructure Support π₯οΈ
- Data Sources ποΈ
- Contributing π€
- Build Documentation π¨οΈ
- Contact π
While computing currently represents roughly 0.5% of the worldβs energy consumption, that percentage is projected to grow beyond 2% in the coming years, which will entail a significant rise in global CO2 emissions if not done properly. Given this increase, it is important to quantify and track the extent and origin of this energy usage, and to minimize the emissions incurred as much as possible.
For this purpose, we created CodeCarbon, a Python package for tracking the carbon emissions produced by various kinds of computer programs, from straightforward algorithms to deep neural networks.
By taking into account your computing infrastructure, location, usage and running time, CodeCarbon can provide an estimate of how much CO2 you produced, and give you some comparisons with common modes of transportation to give you an order of magnitude.
Our hope is that this package will be used widely for estimating the carbon footprint of computing, and for establishing best practices with regards to the disclosure and reduction of this footprint.
Follow the steps below to set up the package and don't hesitate to open an issue if you need help!
Create a virtual environment using conda
for easier management of dependencies and packages.
For installing conda, follow the instructions on the official conda website
conda create --name codecarbon
conda activate codecarbon
pip install codecarbon
or
pip install codecarbon[viz]
if you wish to use the built-in visualization tool described below.
conda install -c conda-forge codecarbon
codecarbon
is now installed in your the local environment
This is the most straightforward usage of the package, which is possible if you have access to the Internet, which is necessary to gather information regarding your geographical location.
from codecarbon import EmissionsTracker
tracker = EmissionsTracker()
tracker.start()
# GPU Intensive code goes here
tracker.stop()
You can also use the decorator:
from codecarbon import track_emissions
@track_emissions
def training_loop():
pass
The EmissionsTracker
also works as a context manager:
from codecarbon import EmissionsTracker
with EmissionsTracker() as tracker:
# GPU Intensive code goes here
This will write a csv file of the CO2 emissions in the current directory.
(This feature is currently in ALPHA stage, meaning it's only available for developers installing CodeCarbon from this repository. The documentation and implementation are is still under development.)
This mode use the Code Carbon API to upload the timeseries of your emissions.
Before using it, you need an experiment_id, to get one, run:
codecarbon init
It will ask the API for an experiment_id on the default project and save it to .codecarbon.config
for you.
Then, you could tell Code Carbon to monitor your machine:
codecarbon monitor
Or use the API in your code:
from codecarbon import track_emissions
@track_emissions(save_to_api=True)
def train_model():
# Your code here
if __name__ == "__main__":
train_model()
This is the simplest way. You could specify others options in @track_emissions
or in .codecarbon.config
.
This mode can be used in setups without internet access, but requires a manual specification of your country code. A complete list of country ISO codes can be found on Wikipedia.
The offline tracker can be used as follows:
from codecarbon import OfflineEmissionsTracker
tracker = OfflineEmissionsTracker(country_iso_code="CAN")
tracker.start()
# GPU Intensive code goes here
tracker.stop()
or by using the decorator:
from codecarbon import track_emissions
@track_emissions(offline=True, country_iso_code="CAN")
def training_loop():
pass
or by using the context manager:
from codecarbon import EmissionsTracker
with EmissionsTracker() as tracker:
# GPU Intensive code goes here
By default, Code Carbon only writes the CVS output file when it stops. But for a long run you may want to save intermediate data. It is possible to call the flush() method to do so.
Have a look to ./examples/mnist_callback.py for an example.
Note that if you use the API it will also call it when you call flush(). You could set api_call_interval to -1, so that it will not be called automatically and then force a call at the end of each epoch to get the emission of each epoch.
Nothing to do here π― ! Comet automatically starts a tracker and logs your emissions if you have CodeCarbon installed. More about comet and adding the CodeCarbon panel to your project in Comet Integration.
CodeCarbon is developed with flexibility in mind. This means you don't have to keep passing the same arguments over and over to EmissionTracker
objects in your scripts. Any and all arguments can be set from configuration files. CodeCarbon will look sequentially for arguments in:
~/.codecarbon.config
(global user config)./.codecarbon.config
(local directory config)CODECARBON_ARG
(environment variables)EmissionsTracker(arg=value)
Config files are INI text files which should look like:
[codecarbon]
arg=value
You can specify many parameters, for example :
[codecarbon]
experiment_id = 3f53714b-c1ed-4bc7-b791-0b3f076b81a2
api_endpoint = http://localhost:8008
#api_endpoint = https://api.codecarbon.io
log_level = DEBUG
save_to_api = True
api_call_interval = 600
logger_preamble = MySuperServer
misfire_grace_time = 15
More details in the documentation.
As an illustration of how to use CodeCarbon, we created a simple example using TensorFlow for digit classification on the MNIST dataset:
First, install Tensorflow 2.0:
pip install tensorflow
Then, run the examples in the examples/
folder, which will call the online version of the Emissions tracker by default:
python examples/mnist.py
python examples/mnist_decorator.py
This will create a .csv
file with information about the energy that you used to carry out the classification task, and an estimate of the CO2 that you generated, complete with comparisons to common modes of transportation to give you a better idea of the order of magnitude of your emissions.
To track the evolution of your CO2 emissions or just to see some nice graphs and charts, you can use the visualization tool that we created. As input, it takes a .csv file in the format generated by the Emissions Tracker, and it generates a visualization of the emissions incurred.
You can run try it out on a sample data file such as the one in examples/emissions.csv
, and run it with the following command from the code base:
python codecarbon/viz/carbonboard.py --filepath="examples/emissions.csv"
If you have the package installed, you can run the CLI command:
carbonboard --filepath="examples/emissions.csv" --port=xxxx
Once you have generated your own .csv file based on your computations, you can feed that into the visualization tool to see a more visual representation of your own emissions.
You can also see the carbon intensity of different regions and countries:
As well as the relative carbon intensity of different compute regions of cloud providers:
CodeCarbon automatically integrates with Comet for experiment tracking and visualization. Comet provides data scientists with powerful tools to track, compare, explain and reproduce their experiments, and now with CodeCarbon you can easily track the carbon footprint of your jobs along with your training metrics, hyperparameters, dataset samples, artifacts and more.
To get started with the Comet-CodeCarbon integration, make sure you have comet-ml
installed:
pip install comet_ml>=3.2.2
The minimum Comet version
Go to Comet's website and create a free account. From your account settings page, copy your personal API key.
In the mnist-comet.py
example file, replace the placeholder code with your API key:
experiment = Experiment(api_key="YOUR API KEY")
Run your experiment and click on the link in stdout to be taken back to the Comet UI. Automatically you'll see your metrics, hyperparameters, graph definition, system and environment details and more.
Comet will automatically create an EmissionsTracker
object from the codecarbon
package when your code runs. To visualize the carbon footprint of your experiment, go to the Panels
tab in the left sidebar and click Add Panel
.
From the Panel Gallery click the Public
tab and search for CodeCarbon Footprint
. Once you've found it, add it to your Experiment.
Now back in the Panels
tab you'll see your CodeCarbon Footprint visualization in the Comet UI. To render the visualization by default, save your View
. And voilΓ ! Every time you run your experiments, you'll be able to visualize your CodeCarbon emissions data alongside everything else you need to track for your research.
We believe that an important step towards reducing carbon emissions is the generalization of emissions reporting in papers, blog posts and publications in general. Here's an example LateX snippet you might want to use:
\usepackage{hyperref}
\subsection{CO2 Emission Related to Experiments}
Experiments were conducted using {cloud provider} in {region}, which has a carbon efficiency of {carbon efficiency} kgCO$_2$eq/kWh. A cumulative of {hours used} hours of computation was performed on hardware of type {hardware type} (TDP of W).
Total emissions are estimated to be {emission} kgCO$_2$eq of which {percentage offset} percents were directly offset by the cloud provider.
%Uncomment if you bought additional offsets:
%XX kg CO2eq were manually offset through \href{link}{Offset Provider}.
Estimations were conducted using the \href{https://github.com/mlco2/codecarbon}{CodeCarbon emissions tracker}, a joint effort from authors of \cite{lacoste2019quantifying} and \cite{lottick2019nergy}.
@article{lacoste2019quantifying,
title={Quantifying the Carbon Emissions of Machine Learning},
author={Lacoste, Alexandre and Luccioni, Alexandra and Schmidt, Victor and Dandres, Thomas},
journal={Workshop on Tackling Climate Change with Machine Learning at NeurIPS 2019},
year={2019}
}
@article{lottick2019nergy,
title={Energy Usage Reports: Environmental awareness as part of algorithmic
accountability},
author={Kadan Lottick and Silvia Susai and Sorelle A. Friedler and Jonathan P. Wilson},
year={2019},
journal={Workshop on Tackling Climate Change with Machine Learning at NeurIPS 2019}
}
@article{codecarbon,
author={Victor Schmidt and Kamal Goyal and Aditya Joshi and Boris Feld and Liam Conell and Nikolas Laskaris and Doug Blank and Jonathan Wilson and Sorelle Friedler and Sasha Luccioni},
title={{CodeCarbon: Estimate and Track Carbon Emissions from Machine Learning Computing}},
year={2021},
howpublished={\url{https://github.com/mlco2/codecarbon}},
DOI={10.5281/zenodo.4658424},
publisher={Zenodo},
}
Currently the package supports following hardware infrastructure.
- Tracks Nvidia GPUs power consumption using
pynvml
library, (which is installed with the package).
- Tracks Intel processors power consumption using the
Intel Power Gadget
- You need to install it independently for CodeCarbon to function.
Note: Please ensure that the Intel Power Gadget has the required security permissions on MacOS.
- Tracks Intel Processors power consumption from Intel RAPL files at
/sys/class/powercap/intel-rapl
(reference) - All CPUs listed in this βοΈ directory will be tracked. Help us improve this and make it configurable.
Note: The Power Consumption will be tracked only if the RAPL files exist at the above mentioned path.
If CodeCarbon cannot find the appropriate software to track the CPUs' energy consumption, it will use a fallback strategy:
- Find TDP:
- If it can match your cpu type to a list of 2000+ Intel and AMD CPUs it will retrieve its thermal design power (TDPs)
- If it really can't tell anything about your CPU it will use a constant TDP of 85W
- CodeCarbon assumes that the (average) actual power consumption of CPUs will be 50% of their TDP.
To find the carbon efficiency of your cloud region, you can look into CodeCarbon's cloud data. If you are using a private infrastructure you can look into the CodeCarbon's private infrastructure. A number of resources can help you find the carbon efficiency of you local grid if you cannot find it in the previous links.
We are hoping that the open-source community will help us edit the code and make it better!
You are welcome to open issues, even suggest solutions and better still contribute the fix/improvement! We can guide you if you're not sure where to start but want to help us out π₯
In order to contribute a change to our code base, please submit a pull request (PR) via GitHub and someone from our team will go over it and accept it.
Check out our contribution guidelines
Contact @vict0rsch to be added to our slack workspace if you want to contribute regularly!
No software is complete without great documentation!
To make generating documentation easier, install the sphinx
package and use it to edit and improve the existing documentation:
pip install -U sphinx sphinx_rtd_theme
cd docs/edit
make docs
In order to make changes, edit the .rst
files that are in the /docs/edit
folder, and then run:
make docs
to regenerate the html files.
Maintainers are @vict0rsch @benoit-cty and @SaboniAmine. Codecarbon is developed by volunteers from Mila and the DataForGoodFR community alongside donated professional time of engineers at Comet.ml and BCG GAMMA.