Workflow to aggregate and ingest NetCDF files for one year
Workflow
Description
Sub-workflow to aggregate a NetCDF file for one year over a given geography (zip codes or counties). Before aggregation, it downloads the shapefiles for this year from the US Census website.
Inputs
Name | Type | Default | Description |
---|---|---|---|
depends_on | Any? | | |
proxy | string? | | HTTP/HTTPS proxy, if required |
downloads | Directory | | |
geography | string | | |
variable | string | | |
component | string[] | | |
year | int | | |
strategy | string | | Rasterization strategy |
ram | string | | Runtime memory available to the process |
shape_file_collection | string | | Collection of shapefiles, either GENZ or TIGER |
table | string? | | Optional name of the table where the aggregated data will eventually be stored |
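
As an illustration of how the inputs above fit together, the following minimal Python sketch assembles a job object and prints it as JSON, which CWL runners accept as a job file. The `build_job` helper and every concrete value (paths, `zcta`, `pm25`, the component names, `auto`, `4GB`) are assumptions made for the example and are not taken from the workflow itself. Only the input names, their types, and the two `shape_file_collection` values come from the table.

```python
import json

# Allowed values for shape_file_collection, as listed in the inputs table.
SHAPE_COLLECTIONS = {"GENZ", "TIGER"}


def build_job(downloads, geography, variable, component, year,
              strategy, ram, shape_file_collection, table=None, proxy=None):
    """Assemble a CWL job object for the inputs listed in the table (hypothetical helper)."""
    if shape_file_collection not in SHAPE_COLLECTIONS:
        raise ValueError(
            f"shape_file_collection must be one of {sorted(SHAPE_COLLECTIONS)}"
        )
    job = {
        # CWL represents Directory inputs as objects with a class and a path.
        "downloads": {"class": "Directory", "path": downloads},
        "geography": geography,
        "variable": variable,
        "component": list(component),
        "year": int(year),
        "strategy": strategy,
        "ram": ram,
        "shape_file_collection": shape_file_collection,
    }
    # Optional inputs (type ends in "?") are included only when provided.
    if table is not None:
        job["table"] = table
    if proxy is not None:
        job["proxy"] = proxy
    return job


# All values below are placeholders chosen for illustration only.
job = build_job(
    downloads="/data/downloads",
    geography="zcta",
    variable="pm25",
    component=["so4", "no3"],
    year=2015,
    strategy="auto",
    ram="4GB",
    shape_file_collection="GENZ",
)
print(json.dumps(job, indent=2))
```

Leaving `table` and `proxy` out of the object when they are unset mirrors their optional types (`string?`) in the table.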
Outputs
Name | Type | Description |
---|---|---|
shapes | File[] | |
consolidated_data | File | |
aggregate_data | File | |
data_dictionary | File | |
aggregate_log | File | |
aggregate_err | File | |
Steps
Name | Runs | Description |
---|---|---|
get_shapes | | |
find_pm25_file | | Given the input directory, variable (band), year and month, evaluates the expected file name for the main variable input data |
find_components_files | | Given the input directory, variable (band), year and month, evaluates the expected file name for the main variable input data |
consolidate | | Consolidate components into one file |
aggregate | | Aggregate data over geographies |
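
The step descriptions suggest an execution order, sketched below in Python with the standard-library `graphlib` module. The dependency map is an assumption inferred from the descriptions (files are located before consolidation, and shapes are downloaded before aggregation); the workflow definition itself may wire the steps differently.

```python
from graphlib import TopologicalSorter

# Map of step -> steps it is ASSUMED to depend on (inferred, not taken
# from the workflow source).
ASSUMED_DEPENDENCIES = {
    "get_shapes": set(),
    "find_pm25_file": set(),
    "find_components_files": set(),
    "consolidate": {"find_pm25_file", "find_components_files"},
    "aggregate": {"get_shapes", "consolidate"},
}


def execution_order(dependencies):
    """Return one valid order in which the steps could run."""
    return list(TopologicalSorter(dependencies).static_order())


order = execution_order(ASSUMED_DEPENDENCIES)
print(order)
```

Under these assumptions `aggregate` always runs last, while `get_shapes` and the two file-finding steps have no mutual dependencies and could run in any order or in parallel.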