The aggregator Module

API to aggregate data over shapes

The Aggregator class expects a netCDF dataset, containing 3 variables: value, latitude and longitude

class Aggregator(infile: str, variable: str, outfile: str, strategy: RasterizationStrategy, shapefile: str, geography: Geography, extra_columns: Optional[Tuple[List[str], List[str]]] = None, ram=0)[source]
Parameters:
  • infile – Path to file with raster data to be aggregated. Can be either NetCDF or GeoTiff file

  • variable – Name of variable or variables that need to be aggregated

  • outfile – Path to the output “csv.gz” file

  • strategy – Rasterization strategy

  • shapefile – Path to shapefile with polygons

  • geography – What kind of geography: US Counties or ZIP/ZCTA codes

  • extra_columns – if we need to add any extra columns to the CSV

  • ram – Runtime memory available to the process

set_strategy(strategy: RasterizationStrategy)[source]
on_set_strategy(ram: int, set_factor: bool)[source]
prepare()[source]
abstract open()[source]
abstract get_dataset_variables() Set[str][source]
abstract get_layer(var)[source]
get_header() List[str][source]
write_header()[source]
get_registry(domain_name: str, table_name: str, description: Optional[str] = None)[source]
execute(mode: str = 'wt')[source]

Executes computational task

Parameters:

mode (str) – mode to use opening result file

Returns:

collect_data(collector: Collector)[source]
downscale(layer)[source]
compute(writer: Collector, layers)[source]
class NetCDFAggregator(infile: str, variable: str, outfile: str, strategy: RasterizationStrategy, shapefile: str, geography: Geography, extra_columns: Optional[Tuple[List[str], List[str]]] = None, ram=0)[source]
Parameters:
  • infile – Path to file with raster data to be aggregated. Can be either NetCDF or GeoTiff file

  • variable – Name of variable or variables that need to be aggregated

  • outfile – Path to the output “csv.gz” file

  • strategy – Rasterization strategy

  • shapefile – Path to shapefile with polygons

  • geography – What kind of geography: US Counties or ZIP/ZCTA codes

  • extra_columns – if we need to add any extra columns to the CSV

  • ram – Runtime memory available to the process

open()[source]
get_dataset_variables() Set[str][source]
get_layer(var)[source]
on_set_strategy(ram: int, set_factor)[source]
class GeoTiffAggregator(infile: str, variable: str, outfile: str, strategy: RasterizationStrategy, shapefile: str, geography: Geography, extra_columns: Optional[Tuple[List[str], List[str]]] = None, ram: int = 0)[source]
Parameters:
  • infile – Path to file with raster data to be aggregated. Can be either NetCDF or GeoTiff file

  • variable – Name of variable or variables that need to be aggregated

  • outfile – Path to the output “csv.gz” file

  • strategy – Rasterization strategy

  • shapefile – Path to shapefile with polygons

  • geography – What kind of geography: US Counties or ZIP/ZCTA codes

  • extra_columns – if we need to add any extra columns to the CSV

  • ram – Runtime memory available to the process

open()[source]
get_dataset_variables() Set[str][source]
get_layer(var)[source]
on_set_strategy(ram: int, set_factor)[source]