Code release for the paper "GPTCast: a weather language model for precipitation nowcasting"
```bibtex
@Article{gmd-18-5351-2025,
AUTHOR = {Franch, G. and Tomasi, E. and Wanjari, R. and Poli, V. and Cardinali, C. and Alberoni, P. P. and Cristoforetti, M.},
TITLE = {GPTCast: a weather language model for precipitation nowcasting},
JOURNAL = {Geoscientific Model Development},
VOLUME = {18},
YEAR = {2025},
NUMBER = {16},
PAGES = {5351--5371},
URL = {https://gmd.copernicus.org/articles/18/5351/2025/},
DOI = {10.5194/gmd-18-5351-2025}
}
```
- paper: https://gmd.copernicus.org/articles/18/5351/2025/
- data: https://doi.org/10.5281/zenodo.13692016
- models: https://doi.org/10.5281/zenodo.13594332
Install the dependencies:

```bash
# install Python 3.12 on Ubuntu
bash install_python_ubuntu.sh
# create the environment with Poetry
bash create_environment.sh
# activate the environment
source .venv/bin/activate
```

Check the notebooks in the `notebooks` folder for examples of how to use the pretrained models.
- See `notebooks/example_gptcast_forecast.ipynb` for running the models on a test batch and generating a forecast.
- See `notebooks/example_autoencoder_reconstruction.ipynb` for a test of the VAE reconstruction.
To train the model on the original dataset, first download the dataset with the script in the `data` folder:

```bash
# download the dataset
python data/download_data.py
```

Train the first stage (the VAE) with one of the following configurations from the folder `configs/experiment/`:
- vaeganvq_mae - Mean Absolute Error loss
- vaeganvq_mwae - Magnitude Weighted Absolute Error loss
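The difference between the two losses is the weighting of reconstruction errors. As a rough, illustrative sketch (the exact weighting used by GPTCast is defined in the paper and the configs, not here), a magnitude-weighted absolute error upweights errors on heavier precipitation:

```python
import numpy as np

def mae(pred, target):
    """Plain mean absolute error."""
    return np.mean(np.abs(pred - target))

def mwae(pred, target, eps=1.0):
    """Illustrative magnitude-weighted absolute error: pixels with
    higher precipitation contribute more to the loss. The weight
    (target + eps) is a hypothetical choice for this sketch only."""
    weights = target + eps
    return np.sum(weights * np.abs(pred - target)) / np.sum(weights)

# toy example: one light-rain pixel and one heavy-rain pixel
target = np.array([1.0, 10.0])
pred = np.array([0.5, 8.0])
print(mae(pred, target))   # unweighted mean of the two errors
print(mwae(pred, target))  # pulled toward the heavy-rain error
```

With this weighting, underestimating an intense cell costs more than the same absolute error on drizzle, which is the motivation for the MWAE configuration.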
```bash
# train a VAE with the MWAE reconstruction loss on GPU
# results (including model checkpoints) are saved in the folder `logs/train/`
python gptcast/train.py trainer=gpu experiment=vaeganvq_mwae.yaml
```

After training the VAE, train the GPTCast model with one of the following configurations from the folder `configs/experiment/`:
- gptcast_8x8 - 8x8 token spatial context (128x128 pixels)
- gptcast_16x16 - 16x16 token spatial context (256x256 pixels)
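Both configurations imply the same VAE spatial compression: 128 px / 8 tokens = 256 px / 16 tokens = 16 pixels per token. A quick sanity check of that arithmetic:

```python
# each VAE token covers a 16x16-pixel patch (128/8 = 256/16 = 16)
DOWNSAMPLE = 16

def pixel_context(tokens_per_side):
    """Side length in pixels of a square token-grid context."""
    return tokens_per_side * DOWNSAMPLE

print(pixel_context(8))   # 128 (gptcast_8x8)
print(pixel_context(16))  # 256 (gptcast_16x16)
```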
```bash
# train GPTCast with a 16x16 token spatial context on GPU
# results (including model checkpoints) are saved in the folder `logs/train/`
# the path to the trained VAE checkpoint must be provided
python gptcast/train.py trainer=gpu experiment=gptcast_16x16.yaml model.first_stage.ckpt_path=<path_to_vae_checkpoint>
```
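At inference time, GPTCast produces a forecast by autoregressively predicting future radar tokens, which the VAE decoder then maps back to precipitation fields (see the forecast notebook for the real API). A minimal toy sketch of that autoregressive loop, with a random stand-in for the transformer and a hypothetical 16-entry codebook:

```python
import numpy as np

rng = np.random.default_rng(0)
CODEBOOK_SIZE = 16  # hypothetical; the real codebook is defined by the VAE

def toy_next_token_logits(context):
    """Stand-in for the transformer: random logits over the codebook.
    The real model conditions on the past radar tokens in `context`."""
    return rng.normal(size=CODEBOOK_SIZE)

def generate(context, n_new):
    """Greedy autoregressive generation: append one token at a time,
    each predicted from everything generated so far."""
    tokens = list(context)
    for _ in range(n_new):
        logits = toy_next_token_logits(tokens)
        tokens.append(int(np.argmax(logits)))
    return tokens

seq = generate([3, 7, 1], 5)  # 3 context tokens + 5 forecast tokens
print(len(seq))  # 8
```

In the real pipeline, the generated token grid is passed through the VAE decoder to obtain the forecast radar frames.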