The cdm_reader_mapper toolbox is a Python tool designed to:
- read data files compliant with a user-specified data model
- map observed variables and their associated metadata from a data model, or a combination of models, to the C3S CDS Common Data Model (CDM) format
- detect and flag or remove duplicated observations
It was developed to read the IMMA (International Maritime Meteorological Archive) data format, but it has been extended to handle other meteorological data formats in the following cases:
- Data that is stored in a human-readable manner: "ASCII" format.
- Data that is stored in the Network Common Data Form: "NetCDF" format.
- Data that is organized in single-line reports.
- Reports that have a coherent internal structure and can be modelled.
- Reports that are of fixed-width or field-delimited type.
- Reports that can be organized in sections, in which case each section can be of a different type (fixed width or delimited).
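To make the idea of a sectioned, single-line report concrete, here is a minimal sketch in plain Python. The schema layout, section names, field names, and sample report are all hypothetical and far simpler than a real data model such as IMMA; the sketch does not use the cdm_reader_mapper API.

```python
# Hypothetical schema: one fixed-width section followed by one delimited section.
SCHEMA = {
    "core": {
        "type": "fixed_width",
        "length": 12,  # total number of characters this section occupies
        "fields": [("year", 4), ("month", 2), ("day", 2), ("hour", 4)],
    },
    "obs": {
        "type": "delimited",
        "delimiter": ",",
        "fields": ["air_temp", "sea_temp"],
    },
}


def parse_report(line, schema):
    """Split one single-line report into sections, then into named fields."""
    parsed = {}
    pos = 0
    for name, spec in schema.items():
        if spec["type"] == "fixed_width":
            chunk = line[pos : pos + spec["length"]]
            pos += spec["length"]
            values = {}
            offset = 0
            for field, width in spec["fields"]:
                values[field] = chunk[offset : offset + width].strip()
                offset += width
        else:
            # In this sketch a delimited section consumes the rest of the line.
            chunk = line[pos:]
            pos = len(line)
            values = dict(zip(spec["fields"], chunk.split(spec["delimiter"])))
        parsed[name] = values
    return parsed


# Invented sample report: 12 fixed-width characters, then two comma-separated values.
report = "1878011206009.5,11.2"
result = parse_report(report, SCHEMA)
print(result)
```

Each section is decoded according to its own type, which is what allows a single report to mix fixed-width and delimited parts.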
You can install the package directly from pip:
pip install cdm_reader_mapper
If you want to contribute, we recommend cloning the repository and installing the package in development mode, e.g.
git clone https://github.com/glamod/cdm_reader_mapper
cd cdm_reader_mapper
pip install -e .
This installs the package in development (editable) mode: you can keep editing the source, and you do not need to add the package to your PYTHONPATH.
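One way to check which copy of a package Python actually imports is to ask the standard library where it was found. The helper below is a hypothetical convenience function, not part of cdm_reader_mapper; for a development install, the reported location would normally point into your cloned repository.

```python
import importlib.util
from importlib import metadata


def describe_install(package):
    """Return the import location and version of a package, or None if absent."""
    spec = importlib.util.find_spec(package)
    if spec is None:
        return None
    try:
        version = metadata.version(package)
    except metadata.PackageNotFoundError:
        # Importable, but no distribution metadata under this name.
        version = "unknown"
    return {"location": spec.origin, "version": version}


print(describe_install("cdm_reader_mapper"))
```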
The official documentation is at https://cdm-reader-mapper.readthedocs.io/
How to make the most of cdm_reader_mapper:
- How to read an IMMA file
- How to build your own data model schema
- How to map to the Common Data Model (CDM)
By default, cdm_reader_mapper outputs logging information to stdout. To tell cdm_reader_mapper to output logs to a file, set the CDM_LOG_FILE environment variable before loading cdm_reader_mapper.
import os
os.environ["CDM_LOG_FILE"] = "log_file.log"
import cdm_reader_mapper as cdm
This will set the file log_file.log as the output for all logging information from cdm_reader_mapper, including the initial logging on loading of the package.
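The ordering matters because a package that configures its logging at import time reads the environment only once. The sketch below illustrates that general pattern with the standard library; it is an assumption about how such a setup could work, not the actual cdm_reader_mapper code, and the function name and logger name are hypothetical.

```python
import logging
import os
import sys


def configure_logging(logger_name="example_package"):
    """Log to the file named in CDM_LOG_FILE if it is set, else to stdout."""
    logger = logging.getLogger(logger_name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()  # avoid stacking handlers on repeated calls

    log_file = os.environ.get("CDM_LOG_FILE")
    if log_file:
        handler = logging.FileHandler(log_file)
    else:
        handler = logging.StreamHandler(sys.stdout)
    handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
    logger.addHandler(handler)
    return logger


# The environment is read here, once; changing it afterwards has no effect
# on the handler that was already attached.
logger = configure_logging()
logger.info("package loaded")
```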
Read IMMA data with the cdm_reader_mapper.read_mdf function:
from cdm_reader_mapper import read_mdf
from cdm_reader_mapper.data import test_data
imodel = "icoads_r300_d701"
# Path to the bundled ICOADS test file
filepath = test_data.test_icoads_r300_d701.get("source")
# Read only the listed sections of each report
imma_bundle = read_mdf(filepath, imodel=imodel, sections=["core", "c1", "c98"])
Map this data to a CDM build for the same deck (in this case deck 704: US Marine Meteorological Journal collection of data):
# The bundle already holds the data read above
imma_bundle.map_model(
    imodel=imodel,
    log_level="DEBUG",
)
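Conceptually, mapping to a common data model means translating each source field into a target column, often with a unit or type conversion along the way. The following is a minimal sketch of that idea in plain Python; the mapping table, column names, and sample record are hypothetical and do not reproduce the mappings shipped with cdm_reader_mapper.

```python
def celsius_to_kelvin(value):
    """Convert a Celsius string or number to Kelvin, rounded to 2 decimals."""
    return round(float(value) + 273.15, 2)


# Hypothetical mapping: target column -> (source section, source field, transform)
MAPPING = {
    "station_id": ("core", "id", str.strip),
    "latitude": ("core", "lat", float),
    "longitude": ("core", "lon", float),
    "observation_value": ("obs", "air_temp", celsius_to_kelvin),
}


def map_record(record, mapping):
    """Build one target row from a sectioned source record."""
    row = {}
    for target, (section, field, transform) in mapping.items():
        raw = record.get(section, {}).get(field)
        # Missing source values stay missing instead of raising an error.
        row[target] = None if raw is None else transform(raw)
    return row


# Invented source record with two sections.
source = {
    "core": {"id": " SHIP1 ", "lat": "50.9", "lon": "-1.4"},
    "obs": {"air_temp": "9.5"},
}
mapped = map_record(source, MAPPING)
print(mapped)
```

Keeping the mapping as data rather than code is one possible design choice: supporting a new source model then means writing a new table, not a new function.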
Detect and flag duplicated observations:
imma_bundle.duplicate_check()
imma_bundle.flag_duplicates(overwrite=False)
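As a rough illustration of what duplicate flagging involves, the sketch below marks a record as a duplicate when an earlier record has identical values in a chosen set of key fields. This exact-match rule is a simplifying assumption made for the example, and the function name, key fields, and records are hypothetical; the package's own duplicate check may use different criteria.

```python
def flag_exact_duplicates(records, keys):
    """Return one flag per record: False for a first occurrence, True for a repeat."""
    seen = set()
    flags = []
    for record in records:
        signature = tuple(record[key] for key in keys)
        flags.append(signature in seen)
        seen.add(signature)
    return flags


# Invented records: the third repeats the first on all key fields.
records = [
    {"station_id": "SHIP1", "time": "1878-01-12T06:00", "lat": 50.9, "lon": -1.4},
    {"station_id": "SHIP2", "time": "1878-01-12T06:00", "lat": 50.9, "lon": -1.4},
    {"station_id": "SHIP1", "time": "1878-01-12T06:00", "lat": 50.9, "lon": -1.4},
]
flags = flag_exact_duplicates(records, keys=["station_id", "time", "lat", "lon"])
print(flags)  # [False, False, True]
```

Flagging instead of deleting keeps the original records available, so the decision to remove them can be made later.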
For more details on how to use the cdm_reader_mapper toolbox, see the Jupyter example notebooks.
If you're interested in participating in the development of cdm_reader_mapper by suggesting new features or new indices, or by reporting bugs, please leave us a message on the issue tracker.
If you would like to contribute code or documentation (which is greatly appreciated!), check out the Contributing Guidelines before you begin!
If you wish to cite glamod-marine-processing in a research publication, we kindly ask that you refer to Zenodo: https://zenodo.org/records/14135493.
This is free software: you can redistribute it and/or modify it under the terms of the Apache License 2.0. A copy of this license is provided in the code repository (LICENSE).
cdm_reader_mapper development is funded through the Copernicus Climate Change Service (C3S).
Furthermore, acknowledgments go to the National Oceanography Centre (NOC) and the UK Marine and Climate Advisory Service (UKMCAS).
This package was created with Cookiecutter and the audreyfeldroy/cookiecutter-pypackage project template.