Summarize and aggregate impacts from run_fredi()
(calculate national totals, average across models, sum impact types, and interpolate between impact year estimates)
Source: R/aggregate_impacts.R
aggregate_impacts.Rd
Summarize and aggregate impacts from run_fredi()
(calculate national totals, average across models, sum impact types, and interpolate between impact estimate years).
Usage
aggregate_impacts(
data,
aggLevels = c("national", "modelaverage", "impacttype", "impactyear"),
columns = c("physical_impacts", "annual_impacts"),
groupByCols = c(c(c(c("sector", "variant", "impactType", "impactYear"), "region",
"state", "postal"), "model_type", "model"), "includeaggregate", "sectorprimary"),
silent = TRUE
)
Arguments
- data
Data frame of results FrEDI (outputs from
run_fredi()
)- aggLevels
Levels of aggregation at which to summarize data: one or more of
c("national", "modelAverage", "impactYear", "impactType", "all" )
. Defaults to all levels (i.e.,aggLevels = "all"
). Note that, if"impacttype"
is inaggLevels
(e.g.,aggLevels = "all"
), column"physical_measure"
is dropped from thegroupByCols
and column"physical_impacts"
is dropped fromcolumns
. This is because aggregating over impact types for some sectors requires summing costs over different types of physical impacts, so reporting the physical impacts would be nonsensical.- columns
Character vector of columns for which to aggregate results (defaults to
columns = c( "physical_impacts", "annual_impacts")
).- groupByCols
Character vector indicating which columns to use for grouping. Defaults to
groupByCols = c("sector", "variant", "impactYear", "impactType", "model_type", "model", "sectorprimary", "includeaggregate", "physicalmeasure", "region", "state", "postal")
. Note that the"variant"
column referred to below contains information about the variant or adaptation name (or“N/A”
), as applicable.
Details
This function can be used to aggregate and summarize the FrEDI results to levels of aggregation specified by the user (passed to aggLevels
). Users can specify all aggregation levels at once by specifying aggLevels = "all"
(default) or no aggregation levels (aggLevels = "none"
). Users can specify a single aggregation level or multiple aggregation levels by passing a single character string or character vector to aggLevels
. Options for aggregation include calculating national totals (aggLevels= "national"
), averaging across model types and models (aggLevels = "modelAverage"
), summing over all impact types (aggLevels = "impactType"
), and interpolating between impact year estimates (aggLevels = "impactYear"
).
Before aggregating impacts for national totals and/or model averages, aggregate_impacts()
will drop any pre-summarized results (i.e., values for which region = "National Total"
and/or for which model = "Average"
, respectively) that are already present in the data and then re-summarize results at those respective levels.
If users specify aggLevels = "none"
, aggregate_impacts()
returns the data frame passed to the data
argument.
If users specify aggLevels = "all"
or other combinations of aggregation levels, the aggregate_impacts()
function uses performs the following calculations using the grouping columns specified by the groupByCols
argument: "sector"
, "variant"
, "impactType"
, "impactYear"
, "region"
, "state"
, "postal"
, "model_type"
, "model"
, "sectorprimary"
, "includeaggregate"
, "physicalmeasure"
, and "year"
.
Aggregation Level | Description |
impactyear | To aggregate over impact years, aggregate_impacts() first separates results for sectors with only one impact year estimate (i.e., impactYear = "N/A" ) from from observations with multiple impact year estimates (i.e., sectors with results for both impactYear = "2010" and impactYear = "2090" ). For these sectors with multiple impact years, physical impacts and annual costs (columns "physical_impacts" and "annual_impacts" ) are linearly interpolated between impact year estimates. For any model run years above 2090, annual results for sectors with multiple impact years return the 2090 estimate. The interpolated values are then row-bound to the results for sectors with a single impact year estimate, and column impactYear set to impactYear = "Interpolation" for all values. If "impactyear" is included in aggLevels (e.g., aggLevels = "all" ), aggregate_impacts() aggregates over impact years before performing other types of aggregation. |
modelaverage | To aggregate over models for temperature-driven sectors, aggregate_impacts() averages physical impacts and annual costs (columns "physical_impacts" and "annual_impacts" , respectively) across all GCM models present in the data. aggregate_impacts() drops the column "model" from the grouping columns when averaging over models. Averages exclude observations with missing values. However, If all values within a grouping are missing, the model average is set to NA . The values in column "model" are set to "Average" for model averages and the model averages data frame is then row-bound to the main results data frame. For SLR-driven sectors, there is no need for additional model aggregation; these values already have model = "Interpolation" . If "modelaverage" is included in aggLevels (e.g., aggLevels = "all" ), aggregate_impacts() first aggregates over impact years (if "impactyear" present in aggLevels or if aggLevels = "all" ) before aggregating over models. |
national | To aggregate values to the national level, aggregate_impacts() sums physical impacts and annual costs (columns "physical_impacts" and "annual_impacts" , respectively) across all states present in the data. aggregate_impacts() drops the columns "region" , "state" , and "postal" when summing over states and regions. Years which have missing column data for all states return as NA . Values for columns "region" , "state" , and "postal" are set to "National Total" , All , and US , respectively. The data frame with national totals is then row-bound to the main results data frame. If "national" is included in aggLevels (e.g., aggLevels = "all" ), aggregate_impacts() first aggregates over impact years and/or models (if "impactyear" and/or "modelaverage" are present in aggLevels or if aggLevels = "all" ) before aggregating over models. |
impacttype | To aggregate values over impact types, aggregate_impacts() sums annual impacts (column "annual_impacts" ) across all impact types for each sector. aggregate_impacts() drops the column "impactType" and "physicalmeasure" from the grouping columns when summing over impact types. Years which have missing column data for all impact types return as NA . All values in column "impactType" are set to "all" . Aggregating over impact types, drops columns related to physical impacts (i.e., columns "physicalmeasure" and "physical_impacts" ). These columns are dropped since aggregating over impact types for some sectors requires summing costs over different types of physical impacts, so reporting the physical impacts would be nonsensical. |
After aggregating values, aggregate_impacts()
joins the data frame of impacts with information about "driverType"
, "driverUnit"
, "driverValue"
, "gdp_usd"
, "national_pop"
, "gdp_percap"
, and "state_pop"
.
References
Environmental Protection Agency (EPA). 2021. Technical Documentation on The Framework for Evaluating Damages and Impacts (FrEDI). Technical Report EPA 430-R-21-004, EPA, Washington, DC. Available at https://epa.gov/cira/FrEDI/.
Examples
### Create temperature binning scenario
df_results1 <- run_fredi(aggLevels="none", silent=TRUE)
#> Checking input values...
#> No temperature scenario provided...using default temperature scenario...
#> Creating SLR scenario from temperature scenario...
#> No GDP scenario provided...Using default GDP scenario...
#> Creating population scenario from defaults...
#> Updating scalars...
#> Calculating impacts...
#> Formatting results...
#>
#> Finished.
### Aggregate temperature binning summary across multiple columns
df_results2 <- df_results1 |> aggregate_impacts(columns=c("annual_impacts"), aggLevels="all")