.. |br| raw:: html
.. _gcid-data: ############################### The GEOS-Chem Input Data portal ############################### The main `GEOS-Chem Input Data `_ portal is hosted at the AWS S3 bucket `s3://geos-chem `_. From here you may download the data required to run :program:`GEOS-Chem Classic`, :program:`GCHP`, or :program:`HEMCO standalone` simulations. .. note:: We are pleased to announce that the GEOS-Chem Input Data portal is part of the `AWS Open Data Sponsorship Program `_. As a result, **the data is completely free to use**. You will NOT incur any data egress fees when downloading data from the `s3://geos-chem `_ bucket. This is now the recommended repository for GEOS-Chem data download. We also maintain :ref:`additional portals for special meteorology products `. .. _gcid-data-org: ================= Data organization ================= The GEOS-Chem Input Data portal is structured into the following categories: #. :ref:`gcid-data-org-init-cond` (aka `Restart files `_) #. :ref:`gcid-data-org-chem-inputs` #. :ref:`gcid-data-org-emis-inputs` #. :ref:`gcid-data-org-met` .. _gcid-data-org-init-cond: Initial conditions input data ----------------------------- Initial conditions include initial species concentrations (aka `Restart files `_) used to start a GEOS-Chem simulation. .. _gcid-data-org-chem-inputs: Chemistry input data -------------------- Chemistry input data includes: - Tables of aerosol optical properties - Quantum yields and cross sections for photolysis using either :program:`Cloud-J` or legacy :program:`FAST-JX` - Climatology data for :program:`Linoz` stratospheric ozone chemistry - Boundary conditions for :program:`UCX` stratospheric chemistry routines .. _gcid-data-org-emis-inputs: Emissions input data -------------------- Emissions input data includes the following data: - Emissions inventories - Input data for :program:`HEMCO` Extensions - Input data for :program:`GEOS-Chem` specialty simulations - Scale factors - Mask definitions - Surface boundary conditions - Leaf area indices - Land cover map .. _gcid-data-org-met: Meteorology input data ---------------------- GEOS-Chem Classic be driven by the following meteorology products: #. `MERRA-2 `_ #. `GEOS-FP `_ #. `GEOS-IT `_ #. GCAP 2.0 (available at the `atmos.earth.rochester.edu data portal `_) .. _gcid-data-access: =========== Data access =========== You may access the GEOS-Chem Input Data portal in several ways, as described below. .. _gcid-data-access-we: AWS S3 Explorer --------------- You can browse the contents of the GEOS-Chem Input Data portal with the :program:`AWS S3 Explorer` interface. Simply point your web browser to the following link: - https://geos-chem.s3.amazonaws.com/index.html. This is an easy way for you to familiarize yourself with the directory structure. Before downloading large amounts of data, we recommend that you use the AWS S3 Explorer to find the path to the relevant data directories. .. _gcid-data-access-cli: AWS CLI (command-line interface) -------------------------------- You can also use the AWS command-line interface (aka :program:`AWS CLI`) to browse and download data from the GEOS-Chem Input Data portal. For example, if you have an AWS account and have installed AWS CLI on your system, you may use this command to get a data listing: .. code-block:: console $ aws s3 ls s3://geos-chem/ # Get a directory listing If you do not have an AWS account (or do not wish to open one), you may still use AWS CLI to access or download data via anonymous login, which is completely free. Simply add the :literal:`--no-sign-request` flag after each AWS CLI command, such as: .. code-block:: console $ aws s3 ls --no-sign-request s3://geos-chem/ # Get a directory listing via anonymous login For detailed instructions about using AWS CLI, please see: :ref:`gcid-tut`. .. _gcid-data-access-http: HTTP or wget download --------------------- You can also access the GEOS-Chem Input Data portal via the alternate web link http://geoschemdata.wustl.edu. As with the AWS S3 Explorer, you can navigate through the web interface to find the data sets that you wish to download. You can then use the :program:`wget` command to download the data. .. _gcid-data-access-dryrun: Dry-run simulation (GEOS-Chem Classic and HEMCO standalone only) ---------------------------------------------------------------- If you plan to run a `GEOS-Chem Classic `_ or `HEMCO standalone `_ simulation, we recommend first performing a :program:`dry-run simulation`. The dry-run simulation workflow is as follows: #. Configure your GEOS-Chem Classic or HEMCO standalone simulation. |br| |br| #. Run GEOS-Chem Classic or HEMCO standalone with the :code:`--dryrun` flag. This will generate a list of required data files. |br| |br| #. Pass this list to a Python script, which will download the data to your computer system or AWS EC2 instance. For more information, please see the following links: - `GEOS-Chem Classic dry-run instructions `_ - `HEMCO standalone dry-run instructions `_ .. _gcid-data-access-globus: Globus ------ Many institutions use the :program:`Globus` file transfer utility, which has much higher data download speeds than normal SSH or HTTP connections. If your institution uses Globus, you can download data from the :program:`GEOS-Chem Data (WashU)` endpoint to your computer system. .. _gcid-data-access-bashdatacatalog: Bashdatacatalog --------------- We have created the :program:`bashdatacatalog` tool to facilitate downloading large amounts of data from the GEOS-Chem Input Data portal. Please see our :ref:`bashdatacatalog` guide for usage instructions. .. _gcid-data-dir-structure: =========================== Example directory structure =========================== The directory structure of the GEOS-Chem Input Data portal adheres to the format listed below. You can see easily browse through the portal using one of the following web links: - https://geos-chem.s3.amazonaws.com/index.html (Recommended) - http://geoschemdata.wustl.edu .. code-block:: text ExtData/ │ ├── GEOSCHEM-RESTARTS/ │ ├── GC_14.2.0/ │ ├── GC_14.3.0/ │ └── ... │ ├── CHEM_INPUTS/ │ ├── CLOUD-J/ │ ├── FAST-JX/ │ └── ... │ ├── HEMCO/ │ ├── UVALBEDO/ │ └── ... │ ├── GEOS_0.5x0.625/ │ ├── MERRA2/ │ │ ├── 2023/ │ │ ├── 2024/ │ │ └── ... │ └── ... │ ├── GEOS_0.25x0.3125/ │ ├── GEOS_FP/ │ │ ├── 2023/ │ │ ├── 2024/ │ │ └── ... │ ├── GEOS_FP_Raw/ │ └── ... │ └── ...