Workflow to aggregate and ingest NetCDF files for one year

Source code

Workflow

Description

Sub-workflow to aggregate a NetCDF file for one year over a given geography (zip codes or counties). Before aggregation, downloads shape files fo this year from US Census website

Inputs

Name

Type

Default

Description

depends_on

Any?

proxy

string?

HTTP/HTTPS Proxy if required

downloads

Directory

geography

string

variable

string

component

string[]

year

int

strategy

string

Rasterization strategy

ram

string

2GB

Runtime memory, available to the process

shape_file_collection

string

tiger

Collection of shapefiles, either GENZ or TIGER

table

string?

Optional name ot the table where the aggregated data will be eventually stored

Outputs

Name

Type

Description

shapes

File[]

consolidated_data

File

aggregate_data

File

data_dictionary

File

aggregate_log

File

aggregate_err

File

Steps

Name

Runs

Description

get_shapes

get_shapes.cwl

find_pm25_file

wustl_file_pattern.cwl

Given input directory, variable (band), year and month, evaluates the expected file name for the main variable input data

find_components_files

wustl_file_pattern.cwl

Given input directory, variable (band), year and month, evaluates the expected file name for the main variable input data

consolidate

wustl_consolidate_components.cwl

consolidate components into one file

aggregate

aggregate_wustl.cwl

Aggregate data over geographies