Bias correction of satellite-based precipitation estimates is crucial in climate studies to enhance their accuracy and reliability. This exercise explores a method that combines Linear Scaling (LS) and Empirical Quantile Mapping (EQM) with tail adjustment using the Generalized Pareto Distribution (GPD). This hybrid approach aims to correct both the mean bias and the distributional discrepancies between satellite estimates and ground-based observations.
The following sections will describe the methodology, the role of different functions, their strengths, and weaknesses in achieving accurate bias correction.
The Linear Scaling (LS) method is applied first to correct the mean bias in the IMERG precipitation data relative to the CPC precipitation data. The underlying theory for LS involves the calculation of a scale factor as the ratio of the observed mean to the modeled mean:
where
The strength of this approach lies in its simplicity and effectiveness in correcting systematic biases. However, it does not address distributional differences, particularly in the tails of the distribution where extreme values occur.
Empirical Quantile Mapping (EQM) is a comprehensive method used to align the distributions of satellite-based precipitation estimates with ground-based observations. By matching the empirical cumulative distribution functions (CDFs) of the two datasets, EQM corrects distributional biases across the entire range of precipitation values. This section explores the various components of EQM, including the application of gamma distribution-based quantile mapping and tail adjustment using the Generalized Pareto Distribution (GPD), to enhance the accuracy of bias correction.
Empirical Quantile Mapping (EQM) works by transforming the precipitation values such that their quantiles match:
where
To further refine the bias correction, gamma distribution-based quantile mapping is applied. This involves fitting gamma distributions to both IMERG and CPC data using the method of moments, ensuring accurate representation of the distribution shapes. The gamma distribution is defined by its shape
Fitting these parameters involves solving moment equations that relate the moments of the data to the parameters of the gamma distribution. The strength of this method is its mathematical rigor in fitting the entire distribution. However, it requires careful handling of parameter bounds to avoid infeasibility issues during optimization
One critical enhancement in the approach is the adjustment for extreme values using GPD. This involves fitting a GPD to the excesses above a high threshold, defined as the 95th percentile in this study. The GPD is defined by its shape
This step is crucial for accurately capturing extreme precipitation events, which are often underrepresented in observational datasets. The main strength of this approach is its focus on tail behavior, improving the representation of extremes. However, the requirement for sufficient data points above the threshold can be a limitation in sparse datasets.
The moving window approach is used to balance the advantages of capturing both spatial and temporal variations in precipitation patterns. By considering both spatial and temporal windows, the bias correction method can address local variations and seasonal changes more effectively.
-
Seasonal and Short-term Variability:
- Captures seasonal variations and short-term changes in precipitation patterns, which is essential for accurate bias correction over time.
- Can adapt to the changing nature of weather patterns, providing more responsive adjustments.
-
Stability:
- Temporal aggregation can help in smoothing out short-term fluctuations and noise in the data.
- This can be particularly useful if there are periods with sparse station data or missing values.
-
Trend and Cycle Detection:
- Helps in identifying trends and cycles in precipitation patterns that can be important for long-term studies and threshold definitions.
- Spatial Variability Ignored:
- May not adequately account for spatial variations in precipitation, especially in regions with complex terrain or diverse weather patterns.
-
Spatial Consistency:
- Ensures spatial coherence and consistency in the data, accounting for local variations in precipitation patterns.
- Provides localized corrections that adapt to spatial variability, leading to more accurate adjustments at each grid cell.
-
Alignment with Resolution:
- Aligns well with the spatial resolution differences between IMERG and CPC-UNI, ensuring appropriate scaling to match the higher resolution data.
-
Improved Spatial Representations:
- Helps maintain the spatial structure and patterns of precipitation, which can be critical for defining accurate thresholds for extreme rainfall events.
- Short-term Variability Ignored:
- May not capture short-term temporal variations as effectively as a temporal moving window.
- Can miss transient weather phenomena that occur over short periods.
The method incorporates both Empirical Quantile Mapping (EQM) and Generalized Pareto Distribution (GPD) fitting to address extreme values in precipitation data. Empirical Quantile Mapping is adept at matching the distribution of satellite-derived IMERG precipitation values to ground-based CPC observations by mapping the quantiles of the IMERG data to those of the CPC data. However, EQM can sometimes fail to capture extreme values accurately, especially if these extremes are not well-represented in the CPC data. To mitigate this, the method applies Generalized Pareto Distribution (GPD) fitting to the tails of the distribution. This involves identifying a threshold (typically set at the 95th percentile) and fitting a GPD to the excesses above this threshold. This approach enhances the ability to model and correct for extreme precipitation events, ensuring that the corrected dataset retains the significant variability and magnitude of these extremes.
- Improved Extreme Value Representation: By using GPD fitting for the tails of the distribution, the method ensures that extreme precipitation values are better captured and represented.
- Distribution Matching: EQM effectively matches the overall distribution of satellite and ground-based data, improving the reliability of the corrected dataset.
- Dynamic Adjustment: The method dynamically adjusts the tails of the distribution, which is particularly useful for regions with sparse observational data.
- Complexity: The combined approach of EQM and GPD fitting adds complexity to the bias correction process, requiring careful implementation and validation.
- Dependence on Threshold Selection: The accuracy of the GPD fitting is sensitive to the chosen threshold, which may require empirical tuning for different datasets or regions.
- Computational Overhead: The process of fitting GPD and performing quantile mapping can be computationally intensive, especially for large datasets or high-resolution models.
Given the trade-offs and specific goals, a hybrid approach combining both spatial and temporal windows can leverage the advantages of both methods. This hybrid approach balances spatial consistency with temporal responsiveness, addressing both local variations and short-term changes.
- Spatial Window: Use a 5x5 spatial window to ensure spatial coherence and consistency, adjusting for local variations.
- Temporal Window: Apply a shorter temporal window (e.g., 5 days) within the spatial window to capture short-term temporal variations and seasonal patterns.
-
Spatial Window Calculation:
- For each grid cell, consider a 5x5 spatial window around the target cell.
- Aggregate the precipitation values within this window.
-
Temporal Window Calculation:
- Within each spatial window, consider a temporal window (e.g., 5 days).
- Aggregate the precipitation values over this period.
-
Bias Correction:
- Calculate the scale factor and apply linear scaling based on the combined spatial and temporal window values.
- Perform empirical quantile mapping using the combined values.
- Comprehensive Correction: By combining different methods, the approach corrects both mean and distributional biases.
- Extreme Value Preservation: Tail adjustment with GPD ensures that extreme values, critical for understanding climate impacts, are preserved.
- Mathematical Rigor: The use of gamma distribution fitting and GPD adds mathematical robustness to the correction process.
- **Improved Extreme Value
Representation**: By using GPD fitting for the tails of the distribution, the method ensures that extreme precipitation values are better captured and represented.
- Distribution Matching: EQM effectively matches the overall distribution of satellite and ground-based data, improving the reliability of the corrected dataset.
- Dynamic Adjustment: The method dynamically adjusts the tails of the distribution, which is particularly useful for regions with sparse observational data.
- Data Requirements: The need for sufficient data points above the threshold for GPD fitting can be challenging in regions with sparse observations.
- Computational Complexity: The combined approach is computationally intensive, requiring significant processing time and resources.
- Parameter Sensitivity: The fitting processes for gamma and GPD distributions are sensitive to parameter bounds and initial guesses, requiring careful tuning to avoid convergence issues.
This hybrid approach leverages the strengths of both spatial and temporal windows to provide a more robust and accurate bias correction, ensuring that both moderate and extreme precipitation values are accurately corrected, preserving the physical characteristics of the precipitation data and providing a robust basis for defining precise thresholds for extreme rainfall events.
This exercise demonstrates a robust method for bias correction of satellite-based precipitation estimates, combining Linear Scaling, Empirical Quantile Mapping, and Generalized Pareto Distribution. While the approach is computationally intensive and data-sensitive, it offers a comprehensive solution for correcting both moderate and extreme precipitation values. Future work can focus on optimizing computational efficiency and exploring alternative methods for regions with limited observational data.
Work in progress
Contact: