This repository features the document data and API of the Sustainability Reporting Navigator (SRN). The bulk of the initial reports in our database comes from the generous contribution by Arianna Pisciella, Gaia Melloni, Bianca Minuth, and Paul Pronobis and is supplemented by the ongoing data collection of the SRN team. Our objective is to develop this repository into a collaborative data platform that provides extensive coverage of sustainability-related documents published by European publicly listed firms.
Currently, our data covers 11,885 documents from 888 firms in 20 countries for the period 2010 to 2023. The table below reports the covered firm-years by country.
Country | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | 2021 | 2022 | 2023 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Austria | 7 | 7 | 7 | 7 | 7 | 7 | 8 | 8 | 8 | 8 | 8 | 5 | 8 | 7 |
Belgium | 10 | 10 | 11 | 11 | 12 | 12 | 16 | 14 | 15 | 16 | 16 | 7 | 18 | 17 |
Czech Republic | 1 | 1 | ||||||||||||
Denmark | 13 | 14 | 14 | 14 | 14 | 17 | 19 | 19 | 20 | 20 | 22 | 21 | 28 | 28 |
Finland | 10 | 9 | 10 | 11 | 11 | 12 | 14 | 15 | 15 | 15 | 16 | 7 | 18 | 18 |
France | 45 | 48 | 53 | 52 | 56 | 59 | 68 | 68 | 70 | 73 | 74 | 62 | 79 | 72 |
Germany | 45 | 44 | 44 | 43 | 48 | 49 | 63 | 65 | 68 | 68 | 88 | 167 | 180 | 140 |
Ireland | 5 | 5 | 5 | 4 | 5 | 6 | 7 | 7 | 7 | 7 | 7 | 5 | 8 | 9 |
Italy | 14 | 18 | 20 | 22 | 22 | 23 | 27 | 27 | 27 | 27 | 30 | 25 | 32 | 29 |
Mexico | 1 | 1 | ||||||||||||
Netherlands | 14 | 16 | 17 | 18 | 23 | 22 | 27 | 26 | 28 | 30 | 33 | 26 | 34 | 32 |
Norway | 12 | 13 | 13 | 15 | 15 | 15 | 16 | 16 | 16 | 17 | 18 | 11 | 20 | 18 |
Poland | 1 | 2 | 2 | 3 | 4 | 4 | 6 | 6 | 6 | 6 | 8 | 6 | 10 | 9 |
Portugal | 2 | 2 | 3 | 3 | 3 | 3 | 4 | 4 | 4 | 4 | 4 | 1 | 4 | 4 |
Russia | 1 | |||||||||||||
Spain | 13 | 17 | 18 | 21 | 22 | 22 | 25 | 25 | 25 | 25 | 27 | 24 | 28 | 26 |
Sweden | 45 | 44 | 47 | 50 | 52 | 55 | 58 | 62 | 62 | 62 | 64 | 51 | 71 | 63 |
Switzerland | 33 | 35 | 36 | 37 | 39 | 40 | 49 | 50 | 50 | 52 | 53 | 41 | 56 | 53 |
United Kingdom | 98 | 101 | 100 | 103 | 106 | 114 | 127 | 128 | 134 | 136 | 149 | 84 | 156 | 139 |
United States | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 41 | 53 | 93 | 7 |
We try to collect all documents that contain relevant sustainability information. This includes, but is not limited to, annual reports (AR) and sustainability reports (SR). For some firms it also includes additional reports such as integrated reports (IR), Carbon Disclosure Project data (CDP), and other reporting formats. All materials are referenced by URL and are provided as is. Neither the SRN team nor the maintainers of this data repository claim any ownership of or legal rights to the provided data.
Country | # Firms | AR | SR | IR | CDP | Other |
---|---|---|---|---|---|---|
Austria | 9 | 91 | 50 | 1 | 0 | 17 |
Belgium | 20 | 171 | 57 | 2 | 0 | 5 |
Czech Republic | 1 | 2 | 2 | 0 | 0 | 0 |
Denmark | 30 | 220 | 201 | 7 | 0 | 68 |
Finland | 20 | 176 | 86 | 3 | 0 | 67 |
France | 81 | 728 | 427 | 45 | 1 | 58 |
Germany | 183 | 1058 | 799 | 41 | 3 | 72 |
Ireland | 9 | 86 | 42 | 1 | 1 | 3 |
Italy | 32 | 303 | 247 | 10 | 0 | 62 |
Mexico | 1 | 3 | 0 | 0 | 0 | 0 |
Netherlands | 36 | 266 | 179 | 15 | 0 | 13 |
Norway | 22 | 172 | 118 | 5 | 0 | 3 |
Poland | 11 | 49 | 35 | 1 | 0 | 11 |
Portugal | 4 | 42 | 19 | 1 | 0 | 7 |
Russia | 1 | 1 | 0 | 0 | 0 | 0 |
Spain | 29 | 239 | 232 | 12 | 0 | 102 |
Sweden | 75 | 701 | 348 | 27 | 13 | 185 |
Switzerland | 59 | 547 | 324 | 16 | 0 | 56 |
United Kingdom | 168 | 1486 | 917 | 63 | 1 | 56 |
United States | 97 | 117 | 252 | 13 | 21 | 4 |
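As a quick consistency check, the per-country counts in the table above can be summed and compared with the headline figures. The following is a minimal sketch and not part of the repository: the rows are copied by hand from the table, and the names `TABLE`, `DOC_TYPES`, and `parse_rows` are hypothetical.

```python
# Rows copied from the table above: Country | # Firms | AR | SR | IR | CDP | Other
TABLE = """\
Austria | 9 | 91 | 50 | 1 | 0 | 17
Belgium | 20 | 171 | 57 | 2 | 0 | 5
Czech Republic | 1 | 2 | 2 | 0 | 0 | 0
Denmark | 30 | 220 | 201 | 7 | 0 | 68
Finland | 20 | 176 | 86 | 3 | 0 | 67
France | 81 | 728 | 427 | 45 | 1 | 58
Germany | 183 | 1058 | 799 | 41 | 3 | 72
Ireland | 9 | 86 | 42 | 1 | 1 | 3
Italy | 32 | 303 | 247 | 10 | 0 | 62
Mexico | 1 | 3 | 0 | 0 | 0 | 0
Netherlands | 36 | 266 | 179 | 15 | 0 | 13
Norway | 22 | 172 | 118 | 5 | 0 | 3
Poland | 11 | 49 | 35 | 1 | 0 | 11
Portugal | 4 | 42 | 19 | 1 | 0 | 7
Russia | 1 | 1 | 0 | 0 | 0 | 0
Spain | 29 | 239 | 232 | 12 | 0 | 102
Sweden | 75 | 701 | 348 | 27 | 13 | 185
Switzerland | 59 | 547 | 324 | 16 | 0 | 56
United Kingdom | 168 | 1486 | 917 | 63 | 1 | 56
United States | 97 | 117 | 252 | 13 | 21 | 4
"""

DOC_TYPES = ["AR", "SR", "IR", "CDP", "Other"]


def parse_rows(table):
    """Parse pipe-delimited rows into {country: {"firms": n, "AR": n, ...}}."""
    rows = {}
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        country, numbers = cells[0], [int(cell) for cell in cells[1:]]
        rows[country] = dict(zip(["firms"] + DOC_TYPES, numbers))
    return rows


rows = parse_rows(TABLE)
total_firms = sum(row["firms"] for row in rows.values())
total_docs = sum(row[doc_type] for row in rows.values() for doc_type in DOC_TYPES)
docs_by_type = {t: sum(row[t] for row in rows.values()) for t in DOC_TYPES}

print(f"{len(rows)} countries, {total_firms} firms, {total_docs} documents")
print(docs_by_type)
```

The summed counts agree with the figures stated above: 20 countries, 888 firms, and 11,885 documents.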
We provide an easy-to-use API for data download. See the file `srn_docs_api.py` in this repo for pointers. We are working on making this process even easier over the coming weeks. Stay tuned!
Thank you for asking! We very much appreciate you giving credit to our data collection efforts. We will be using a concept DOI and versioned DOIs from Zenodo in the future. Currently, please cite the data as follows (in alphabetical order):
Donau, Charlotte-Louise, Fikir Worku Edossa, Joachim Gassen, Gaia Melloni, Inga Meringdal, Bianca Minuth, Arianna Piscella, Paul Pronobis and Victor Wagner (2023): SRN Document Database, https://github.com/trr266/srn_docs.
What a great question! We really hope that this repo will be used to discuss data quality issues as well as methods to use the data in research projects. Again, as a first teaser, take a look at `extraxt_text_from_docs.py`, which features a method to extract page-wise text data from our documents.
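To illustrate what page-wise text extraction can look like, here is a minimal sketch. It does not reproduce the script in this repo. It assumes that the third-party `pypdf` package is installed and that a report has already been downloaded as a local PDF. The function names `extract_pages` and `pages_to_records` and the file name `example_report.pdf` are hypothetical.

```python
from pathlib import Path


def extract_pages(pdf_path):
    """Return a list with one text string per page of the PDF at pdf_path."""
    # Assumption: pypdf is installed (pip install pypdf). It is imported here
    # so that pages_to_records() below can be used without it.
    from pypdf import PdfReader

    reader = PdfReader(str(pdf_path))
    # Scanned or image-only pages may yield no text, so fall back to "".
    return [page.extract_text() or "" for page in reader.pages]


def pages_to_records(doc_id, page_texts):
    """Turn a list of page texts into records with 1-based page numbers."""
    return [
        {"doc_id": doc_id, "page": number, "text": text.strip()}
        for number, text in enumerate(page_texts, start=1)
    ]


if __name__ == "__main__":
    # Hypothetical local file; replace it with a document from the database.
    path = Path("example_report.pdf")
    if path.exists():
        for record in pages_to_records(path.stem, extract_pages(path)):
            print(record["page"], len(record["text"]))
```

Keeping one record per page, together with the document id, makes it straightforward to refer to a specific page when discussing data quality issues.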
Thank you! Please open an issue on GitHub, mentioning the document id and describing the issue that you encountered.
Thank you and yes, of course! We would be very happy to hear from people who can provide historical data and/or are willing to maintain our data going forward on a country or index basis. So, if you would like to help, please reach out to Victor Wagner.