This repository contains the code necessary to reproduce the results presented in Immorlano et al. 2023, “Transferring climate change knowledge”, submitted to Nature on September 21, 2023.
- Related Publication
- Installation
- Run a demo version
- Run the full version
- Reproduce the results present in the paper
- Contributors
- Acknowledgements and References
- License
Immorlano, F., Eyring, V., le Monnier de Gouville, T., Accarino, G., Elia, D., Aloisio, G. & Gentine, P. Transferring climate change knowledge. arXiv preprint arXiv:2309.14780 (2023). (in review)
Python version 3.11.0 or higher is needed.
A conda env containing all the packages and versions required to run the workflow and/or reproduce the figures present in the paper can be created by running the following command:
conda env create --file transferring_env.yml
This makes the installation easy and fast. The conda env was created and tested on a MacBook M2 Pro running macOS Ventura 13.4.1 to enable the user to run a demo version of the workflow and reproduce the results present in the paper.
In contrast, TensorFlow version 2.12.0 was used to run the entire workflow on the Ginsburg (Columbia University) and Juno (CMCC Foundation) supercomputers.
The repository contains the following files and directories:
- `architectures.py` → Definition and building of the Deep Neural Network (DNN) used in the Transfer Learning approach
- `area_cella.csv` → Area weighting factors for each grid point of the `CanESM5_CanOE_grid` grid used in this work (see the sketch after this list for one possible use)
- `CanESM5_CanOE_grid` → Description of the grid used in this work (grid from the CanESM5-CanOE simulation)
- `transferring_env.yml` → YAML file needed to create the conda environment to reproduce the experiments and the figures
- `erf_estimates_with_aerosols_Zebedee_Nichols.csv` → Dataset of Effective Radiative Forcing values for each year
- `lib.py` → Routines called in several scripts
- `variables.py` → Definition of variables used in the scripts
- `BEST_regridded_annual_1979-2022.nc` → BEST observational maps resulting from the preprocessing (i.e., regridding to the CanESM5-CanOE grid, computation of annual average maps, removal of years outside 1979-2022)
- `gaussian_noise_5` → Directory containing noisy BEST observational maps to be used for Transfer Learning on observations
- `Land_and_Ocean_global_average_annual.txt` → Average annual uncertainties associated with BEST observational data. The column headers were manually removed for ease of use
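As an illustration of how the area weighting factors could be used, the minimal sketch below computes an area-weighted global mean of the regridded BEST maps. It assumes `area_cella.csv` stores a 2D lat × lon array of weights and that the NetCDF file exposes a `tas` variable with `lat`/`lon` dimensions; these names and layouts are assumptions, not taken from the repository code.

```python
import numpy as np
import xarray as xr

# Hypothetical example: area-weighted global mean of the annual BEST maps.
# CSV delimiter/shape, variable name ('tas') and dimension names ('lat', 'lon')
# are assumptions and may differ from the actual repository conventions.
weights = np.loadtxt('area_cella.csv', delimiter=',')     # assumed 2D (lat, lon) weights
w = xr.DataArray(weights, dims=('lat', 'lon'))

ds = xr.open_dataset('BEST_regridded_annual_1979-2022.nc')
global_mean = (ds['tas'] * w).sum(dim=('lat', 'lon')) / w.sum()
print(global_mean.values)                                 # one value per year
```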
The following scripts should be used to download and process CMIP6 and Berkeley Earth Surface Temperatures (BEST) data:
- `CMIP6_download_process.py` → Download of CMIP6 simulations from the Climate Data Store and their processing
- `BEST_data_processing.py` → Processing of BEST observational maps
- `BEST_data_add_gaussian_noise.py` → Generation of noisy BEST observational maps to be used for Transfer Learning on observations (see the sketch after this list)
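The noise addition performed by `BEST_data_add_gaussian_noise.py` can be pictured with the following minimal sketch. It is not the repository implementation: the variable name (`tas`), the dimension order (year, lat, lon) and the uncertainty column index are assumptions.

```python
import numpy as np
import xarray as xr

# Hypothetical sketch: perturb each annual BEST map with Gaussian noise whose
# standard deviation equals that year's reported BEST uncertainty.
ds = xr.open_dataset('BEST_regridded_annual_1979-2022.nc')
tas = ds['tas'].values                                  # assumed shape: (year, lat, lon)

# Assumed column layout: year, annual anomaly, annual uncertainty, ...
# (and that the series has already been subset to 1979-2022).
unc = np.loadtxt('Land_and_Ocean_global_average_annual.txt')[:, 2]

rng = np.random.default_rng(42)
noisy = tas + rng.normal(loc=0.0, scale=unc[:, None, None], size=tas.shape)

# Keep the original coordinates and write one noisy realization to disk.
ds['tas'].copy(data=noisy).to_netcdf('BEST_regridded_annual_1979-2022_noisy_1.nc')
```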
The following scripts should be used to perform the Training, the Transfer Learning on Simulations (leave-one-out cross validation) and the Transfer Learning on Observations (a generic transfer-learning sketch follows the list):
- `First_Training.py` → Training of each DNN on the simulation of one of the 22 Earth System Models (ESMs) for one of the 3 SSP scenarios
- `Transfer_learning_simulations.py` → Transfer Learning on ESM simulations according to the Leave-One-Out cross validation approach
- `Transfer_learning_observations.py` → Transfer Learning on BEST observational data
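For readers unfamiliar with the pattern these scripts follow, the snippet below shows a generic Keras pre-training / fine-tuning loop on synthetic data. It is only an illustration of the transfer-learning idea, not the authors' architecture or training procedure; layer sizes, the freezing choice and all hyperparameters are assumptions.

```python
import numpy as np
import tensorflow as tf

# Generic pre-train / fine-tune pattern on random data (illustration only).
rng = np.random.default_rng(0)
x_src, y_src = rng.normal(size=(256, 4)), rng.normal(size=(256, 1))   # "source" task (one ESM)
x_tgt, y_tgt = rng.normal(size=(64, 4)), rng.normal(size=(64, 1))     # "target" task (another ESM or observations)

model = tf.keras.Sequential([
    tf.keras.layers.Dense(32, activation='relu', input_shape=(4,)),
    tf.keras.layers.Dense(32, activation='relu'),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer='adam', loss='mse')
model.fit(x_src, y_src, epochs=5, verbose=0)        # "first training" on the source task

for layer in model.layers[:-1]:                     # freeze all but the last layer (an assumption)
    layer.trainable = False
model.compile(optimizer=tf.keras.optimizers.Adam(1e-4), loss='mse')
model.fit(x_tgt, y_tgt, epochs=5, verbose=0)        # transfer learning on the target task
```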
The following scripts must be used to reproduce the figures and compute the values for Supplementary Table 1 present in the paper:
Fig_1.py
Fig_2.py
Fig_3.py
Fig_4.py
Ext_Fig_1.py
Ext_Fig_2.py
Supp_Table_1.py
A demo version of the entire workflow can be run. After downloading the GitHub repository, all the files needed to run the demo should be organized according to the following hierarchy:
├── root
│ ├── architectures.py
│ ├── area_cella.csv
│ ├── BEST_data_add_gaussian_noise.py
│ ├── BEST_data_processing.py
│ ├── CanESM5-CanOE_grid
│ ├── CMIP6_download_process.py
│ ├── Demo_download
│ │ ├── Data
│ │ │ ├── BEST_data
│ │ │ │ ├── Land_and_Ocean_global_average_annual.txt
│ ├── Demo_no_download
│ │ ├── Data
│ │ │ ├── BEST_data
│ │ │ │ ├── BEST_regridded_annual_1979-2022.nc
│ │ │ │ ├── gaussian_noise_5
│ │ │ │ ├── Land_and_Ocean_global_average_annual.txt
│ │ │ ├── CMIP6_data
│ ├── erf_estimates_with_aerosols_Zebedee_Nichols.csv
│ ├── Figures
│ ├── First_Training.py
│ ├── lib.py
│ ├── Transfer_learning_observations.py
│ ├── Transfer_learning_simulations.py
│ ├── variables.py
The demo version can be run with two options:
- download and build a small dataset and then train the DNNs on those data
- start directly with training the DNNs on the same small dataset already downloaded and processed
In both cases, the workflow will be executed for the CNRM-ESM2-1, FGOALS-f3-L and MIROC6 model simulations and for the SSP2-4.5 scenario.
To this aim, the BEST observational maps used in this study (i.e., Global Monthly Land + Ocean — Average Temperature with Air Temperatures at Sea Ice (Recommended; 1850 – Recent) — 1º x 1º Latitude-Longitude Grid) should be gathered from the BEST archive (direct download: 1º x 1º Latitude-Longitude Grid (~400 MB)). The file must be named `BEST_2022.nc` and placed in `root/Demo_download/Data/BEST_data`.
Since CMIP6 data were gathered from the Copernicus Climate Data Store, the CDS API key should be configured on your laptop according to the CDS API documentation before downloading them.
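For reference, a CDS retrieval issued through the `cdsapi` Python package looks roughly like the sketch below (the client reads the key from `~/.cdsapirc`). The dataset name and request fields are illustrative and may differ from what `CMIP6_download_process.py` actually requests.

```python
import cdsapi

# Hypothetical request: monthly near-surface air temperature from one CMIP6 model
# under SSP2-4.5. The request fields shown here are examples, not the script's exact query.
client = cdsapi.Client()   # credentials are taken from ~/.cdsapirc
client.retrieve(
    'projections-cmip6',
    {
        'format': 'zip',
        'temporal_resolution': 'monthly',
        'experiment': 'ssp2_4_5',
        'variable': 'near_surface_air_temperature',
        'model': 'miroc6',
    },
    'tas_MIROC6_ssp245.zip',
)
```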
The following variables in `variables.py` should be set to:
demo_download = True
demo_no_download = False
Now, the scripts should be executed in the following order:
1. `CMIP6_download_process.py` to download and process (i.e., regridding to the CanESM5-CanOE grid, computing annual average maps) CMIP6 simulation data for the three models and for SSP2-4.5. The resulting nc files will be saved in `root/Demo_download/Data/CMIP6_data/near_surface_air_temperature/Annual_uniform_remapped`.
2. `BEST_data_processing.py` to process BEST observational maps (i.e., regridding to the CanESM5-CanOE grid, computing annual average maps, removing years outside 1979-2022). The resulting file will be saved as `root/Demo_download/Data/BEST_data/BEST_regridded_annual_1979-2022.nc`.
3. `BEST_data_add_gaussian_noise.py` to add noise to the BEST observational maps. Specifically, the noise is sampled from a Gaussian distribution with zero mean and a standard deviation equal to the BEST observational data uncertainty. The addition of noise is repeated 5 times for each model and each scenario. The resulting files will be saved in `root/Demo_download/Data/BEST_data/gaussian_noise_5`.
4. `First_Training.py` to train an individual DNN on the simulation of one of the three ESMs for the SSP2-4.5 scenario. A total of three DNNs are trained, each with the same architecture. The results will be saved in `root/Demo_download/Experiments/First_Training/First_Training_[date-time]`.
5. `Transfer_learning_simulations.py` to transfer learn the trained DNNs on the ESM simulations according to the Leave-One-Out cross validation (LOO-CV) approach. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_simulations.py` should be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Demo_download/Experiments/Transfer_Learning_on_Simulations/Transfer_learning_[date-time]/Shuffle_[number]`, where `Shuffle_[number]` corresponds to an iteration of the LOO-CV approach.
6. `Transfer_learning_observations.py` to transfer learn the trained DNNs on the BEST observational data. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_observations.py` must be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Demo_download/Experiments/Transfer_Learning_on_Observations/Transfer_learning_obs_[date-time]`.
A demo version can be run starting directly with training the DNNs on the CNRM-ESM2-1, FGOALS-f3-L and MIROC6 model simulations.
In this case, the following variables in `variables.py` should be set to:
demo_download = False
demo_no_download = True
Now, the scripts should be executed in the following order:
1. `First_Training.py` to train an individual DNN on the simulation of one of the three ESMs for the SSP2-4.5 scenario. A total of three DNNs are trained, each with the same architecture. The results will be saved in `root/Demo_no_download/Experiments/First_Training/First_Training_[date-time]`.
2. `Transfer_learning_simulations.py` to transfer learn the trained DNNs on the ESM simulations according to the Leave-One-Out cross validation (LOO-CV) approach. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_simulations.py` must be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Demo_no_download/Experiments/Transfer_Learning_on_Simulations/Transfer_learning_[date-time]/Shuffle_[number]`, where `Shuffle_[number]` corresponds to an iteration of the LOO-CV approach.
3. `Transfer_learning_observations.py` to transfer learn the trained DNNs on the BEST observational data. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_observations.py` must be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Demo_no_download/Experiments/Transfer_Learning_on_Observations/Transfer_learning_obs_[date-time]`.
The Demo software was tested on a MacBook M2 Pro equipped with MacOS Ventura 13.4.1. The expected run times are the following:
- `CMIP6_download_process.py`: about 12 seconds (after the CMIP6 data download)
- `BEST_data_processing.py`: about 122 seconds
- `BEST_data_add_gaussian_noise.py`: about 7 seconds
- `First_Training.py`, `Transfer_learning_simulations.py`, `Transfer_learning_observations.py`: about 4.5 seconds per epoch
The full version of the entire workflow can be run. After downloading `Source_data.zip` from Zenodo, the files needed to run the entire workflow should be organized according to the following hierarchy:
├── root
│ ├── architectures.py
│ ├── area_cella.csv
│ ├── BEST_data_add_gaussian_noise.py
│ ├── BEST_data_processing.py
│ ├── CanESM5-CanOE_grid
│ ├── CMIP6_download_process.py
│ ├── erf_estimates_with_aerosols_Zebedee_Nichols.csv
│ ├── First_Training.py
│ ├── lib.py
│ ├── Source_data
│ │ ├── BEST_data
│ │ │ ├── BEST_regridded_annual_1979-2022.nc
│ │ │ ├── gaussian_noise_5
│ │ │ ├── Land_and_Ocean_global_average_annual.txt
│ │ ├── CMIP6_data
│ │ ├── Transfer_Learning_on_Observations
│ │ ├── Transfer_Learning_on_Simulations
│ ├── Transfer_learning_observations.py
│ ├── Transfer_learning_simulations.py
│ ├── variables.py
The full version will be run by training the DNNs on the 22 ESM simulations for the SSP2-4.5, SSP3-7.0 and SSP5-8.5 scenarios.
In this case, the following variables in `variables.py` must be set to:
demo_download = False
demo_no_download = False
Now, the scripts should be executed in the following order:
1. `First_Training.py` to train an individual DNN on the simulation of one of the 22 ESMs for the SSP2-4.5, SSP3-7.0 and SSP5-8.5 scenarios. A total of 66 DNNs are trained, each with the same architecture. The results will be saved in `root/Experiments/First_Training/First_Training_[date-time]`.
2. `Transfer_learning_simulations.py` to transfer learn the trained DNNs on the ESM simulations according to the Leave-One-Out cross validation (LOO-CV) approach. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_simulations.py` should be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Experiments/Transfer_Learning_on_Simulations/Transfer_learning_[date-time]/Shuffle_[number]`, where `Shuffle_[number]` corresponds to an iteration of the LOO-CV approach.
3. `Transfer_learning_observations.py` to transfer learn the trained DNNs on the BEST observational data. The variable `FIRST_TRAINING_DIRECTORY` in `Transfer_learning_observations.py` should be set to the directory name related to the first training (i.e. `FIRST_TRAINING_DIRECTORY = First_Training_[date-time]`). This is necessary to load the pre-trained models. The results will be saved in `root/Experiments/Transfer_Learning_on_Observations/Transfer_learning_obs_[date-time]`.
The models resulting from transfer learning on BEST observational data and used to predict temperature values up to 2098 can be downloaded from Hugging Face.
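A possible way to fetch and open them with the `huggingface_hub` and Keras APIs is sketched below; the repository id and model file name are placeholders, not the actual identifiers used for this work.

```python
from huggingface_hub import snapshot_download
import tensorflow as tf

# Placeholder identifiers: replace with the actual Hugging Face repository and file names.
local_dir = snapshot_download(repo_id='<user>/<transfer-learned-models>')
model = tf.keras.models.load_model(f'{local_dir}/<model_file>')
```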
The figures and the results presented in the paper can be reproduced. After downloading `Source_data.zip` from Zenodo, the files needed to reproduce the results should be organized according to the following hierarchy:
├── root
│ ├── area_cella.csv
│ ├── Figures
│ │ ├── Ext_Fig_1.py
│ │ ├── Ext_Fig_2.py
│ │ ├── Fig_1.py
│ │ ├── Fig_2.py
│ │ ├── Fig_3.py
│ │ ├── Fig_4.py
│ │ ├── Supp_Table_1.py
│ ├── Source_data
│ │ ├── BEST_data
│ │ ├── CMIP6_data
│ │ ├── Transfer_Learning_on_Observations
│ │ ├── Transfer_Learning_on_Simulations
The following scripts should be used to reproduce the figures and compute the values for Supplementary Table 1 present in the paper:
Fig_1.py
Fig_2.py
Fig_3.py
Fig_4.py
Ext_Fig_1.py
Ext_Fig_2.py
Supp_Table_1.py
- Francesco Immorlano (francesco.immorlano@cmcc.it)
- Veronika Eyring (veronika.eyring@dlr.de)
- Thomas le Monnier de Gouville (thomas.le-monnier-de-gouville@polytechnique.edu)
- Gabriele Accarino (gabriele.accarino@cmcc.it)
- Donatello Elia (donatello.elia@cmcc.it)
- Giovanni Aloisio (giovanni.aloisio@cmcc.it)
- Pierre Gentine (pg2328@columbia.edu)
If you use this resource in your research, please cite our paper. At the moment, we provide the BibTeX entry of the arXiv preprint version.
@misc{immorlano2023transferring,
title={Transferring climate change knowledge},
author={Francesco Immorlano and Veronika Eyring and Thomas le Monnier de Gouville and Gabriele Accarino and Donatello Elia and Giovanni Aloisio and Pierre Gentine},
year={2023},
eprint={2309.14780},
archivePrefix={arXiv},
primaryClass={physics.ao-ph}
}
MIT License
Copyright (c) 2023 Francesco Immorlano
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.