Tutorial: Accessing GEOS-Chem Input Data using AWS CLI

This tutorial will guide you through the process of accessing and using the GEOS-Chem Input Data with AWS CLI. Alternatively, you can access the data via AWS S3 Explorer.

The workflow is

  1. Install and configure AWS CLI

    • This step only has to be done once.

  2. Download data from the GEOS-Chem Input Data portal.

  3. Run a GEOS-Chem Classic, GCHP, or HEMCO standalone simulation.

Install and configure AWS CLI

If you have already installed and configured the AWS CLI previously, continue to Data access.

Step 1: Install AWS CLI

Follow the instructions to install the AWS CLI from the AWS CLI User Guide.

Step 2: Configure AWS CLI

Run the following command to configure AWS CLI with your credentials:

$ aws configure

For instructions on aws configure, refer to the Configure the AWS CLI .

Access and download data

Step 1: List available data

To view the available data in the GEOS-Chem Input Data S3 bucket, use the following command:

$ aws s3 ls s3://geos-chem/

or without AWS account required

$ aws s3 ls --no-sign-request s3://geos-chem/

Step 2: Navigate through the directories

You can navigate through the directories to find the specific data you need. For example,

$ aws s3 ls s3://geos-chem/GEOS_0.5x0.625/MERRA2/2024/05

Step 3: Download the data

Tip

If you are using GEOS-Chem Classic or the HEMCO standalone model, you can download data with a dry-run simulation, while still using the AWS CLI data transfer protocol.

Once you have located the data you need, you can download it to your local cluster or an EC2 instance. For example,

$ aws s3 cp s3://geos-chem/GEOS_0.5x0.625/MERRA2/2024/05 ./ --recursive

This command will copy the data to your current path.

Run simulations using downloaded data

Once you have downloaded the data from the GEOS-Chem Input Data portal to your computer system or EC2 instance, you may run a GEOS-Chem Classic, GCHP, or HEMCO standalone simulation. Please refer to the relevant user guide listed below.

Running GCHP on AWS

If you wish to use the computing resources on AWS to run GCHP and are seeking for an AMI, feel free to check our Set up AWS ParallelCluster guide.