Resources
Data sources commonly used in environmental economics research (to be updated).
I. Air Pollution
1.1. Ground Monitoring Data
- Source: China Ministry of Ecology and Environment
- Resolution: monitor x hourly
- Coverage: China (national, ~1,600 monitors across all 330+ prefecture-level cities), 2013 – today
- Pollutants: PM2.5, PM10, O3, NO2, SO2, CO
- Real-time monitoring data are posted here. Historical data going back to 2013 are available for purchase from a private nonenterprise. Another version of the historical data (May 2014 onward) is provided by Xiaolei Wang free of charge here.
1.2. U.S. Embassy & Consulates Monitoring Data
- Source: U.S. Department of State Air Quality Monitoring Program
- Resolution: monitor x hourly
- Coverage: Beijing (2008 - 2017), Chengdu (2012 - 2017), Guangdong (2011 - 2017), Shanghai (2011 - 2017), Shenyang (2013 - 2017)
- Pollutants: PM2.5
- These data can be downloaded from stateair.net. Click the city of interest -> View Data -> Agree to the data use agreement.
1.3. Estimated PM2.5 from Combining Satellite Data and Chemical Transport Modeling
- Source: Atmospheric Composition Analysis Group V5.GL.03.
- Resolution: 0.01° x 0.01° x yearly or 0.01° x 0.01° x monthly
- Coverage: global, 2000–2022
- Pollutants: PM2.5
- PM2.5 are estimated using information from satellite-, simulation- and monitor-based sources. Aerosol optical depth from multiple satellites (MODIS, VIIRS, MISR, and SeaWiFS) and their respective retrievals (Dark Target, Deep Blue, MAIAC) are combined with simulation (GEOS-Chem) based upon their relative uncertainties as determined using ground-based sun photometer (AERONET) observations to produce geophysical estimates that explain most of the variance in ground-based PM2.5 measurements.
1.4. SO2 Satellite Data
- Source: OMI/Aura Sulphur Dioxide (SO2) Global Gridded V3
- Resolution: 0.125 ° x 0.125 ° x daily
- Coverage: global, 2004.10–present
- Pollutants: SO2 (molecule per square centimeter)
- OMSO2e is collected by OMI, a spectrometer aboard NASA’s Aura satellite. OMSO2e measures Total Column Density of SO2 in the Planetary Boundary Layer.
- Note: OMSO2e contains many negative values with a mean value close to zero. Negative values are a result of OMSO2e’s retrieval algorithm, which is based on principal component analysis. The algorithm may give slightly negative values in regions where SO2 emissions are low. However, when SO2 emissions are high, we will see a stronger signal and higher SO2 value.
1.5. Global High Air Pollutants (GHAP)
- The GlobalHighAirPollutants (GHAP) dataset refers to the long-term, full-coverage, high-resolution, and high-quality datasets of ground-level air pollutants. It is generated from big data (e.g., ground-based measurements, satellite remote sensing products, atmospheric reanalysis, and model simulations) using artificial intelligence by considering the spatiotemporal heterogeneity of air pollution.
- Resolution and Coverage: 1 km x Daily/Monthly/Yearly
- Coverage: global, 2000-Present
- Pollutants: 7 major air pollutants (i.e., PM1, PM2.5, PM10, O3, NO2, SO2, and CO), PM2.5 chemical composition (i.e., SO42-, NO3-, NH4+, Cl-, BC, and OM), and ambient polycyclic aromatic hydrocarbons (PAHs), including 7 carcinogenic PAHs (i.e., BaA, Chr, BbF, BkF, BaP, DahA, IcdP).
- Released by: Wei, Jing, Zhanqing Li, Alexei Lyapustin, Jun Wang, Oleg Dubovik, Joel Schwartz, Lin Sun, Chi Li, Song Liu, and Tong Zhu. 2023. “First Close Insight into Global Daily Gapless 1 Km PM2.5 Pollution, Variability, and Health Impact.” Nature Communications 14 (1): 8349.
- The data can be downloaded from Zenodo. More air quality datasets of different air pollutants can be found at their github page.
II. Instrumental Variable for Air Pollution
2.1. Thermal Inversion
- Source: MERRA-2 product of NASA
- Resolution: 0.5 ° x 0.625 ° x 3 hours x 42 pressure levels
- Coverage: global, 1980–present
- These data can be downloaded from NASA Earth Data.
III. Water Pollution
3.1. Monitoring Data
- Source: National Environmental Quality Monitoring Network–Surface Water Monitoring System
- Resolution: monitoring section x 4-hour (note: 2014-04-2020-11: ~100 monitoring sections, after 2020.11: 1600+ monitoring sections)
- Coverage: 2014.04-present
- Pollutants
- Before 2020.05: pH, Dissolved Oxygen (DO), Permanganate Index (CODMn), Total Organic Carbon (TOC), Ammonia Nitrogen (NH3-N)
- After 2020-05: Water Temperature (°C), pH, Dissolved Oxygen (mg/L), Electrical Conductivity (μs/cm), Turbidity (NTU), Permanganate Index (mg/L), Ammonia Nitrogen (mg/L), Total Phosphorus (mg/L), Total Nitrogen (mg/L), Chlorophyll-a (mg/L), Algal Density (cells/L)
IV. Firm-level Emission Data
4.1. Environmental Survey and Reporting (ESR)
- Source: administrated by MEP, first self-reported by each polluter, and then randomly verified by government auditors
- Resolution: firm x yearly
- Coverage: since 2001, top firms/plants contributing 85% of total emissions of the major pollutants in a county are entered in the ERS (Wang et al. 2018)
- Pollutants: COD, ammonia nitrogen, sulfur dioxide, industrial smoke and dust, and solid waste
- The most comprehensive environmental data in China and monitors polluting activities of all major polluting sources, including heavily polluting industrial firms, hospitals, residential pollution discharging units, hazardous waste treatment plants, and urban sewage treatment plants.
- Papers that use this data
- He, G., Wang, S., & Zhang, B. (2020). Watering Down Environmental Regulation in China*. The Quarterly Journal of Economics.
- Liu, Z., Shen, H., Welker, M., Zhang, N., & Zhao, Y. (2021). Gone with the wind: An externality of earnings pressure. Journal of Accounting and Economics, 72(1), 101403.
- Wang, C., Wu, J., & Zhang, B. (2018). Environmental regulation, emissions and productivity: Evidence from Chinese COD-emitting manufacturers. Journal of Environmental Economics and Management.
- Wu, H., Guo, H., Zhang, B., & Bu, M. (2017). Westward movement of new polluting firms in China: Pollution reduction mandates and location choice. Journal of Comparative Economics, 45(1), 119–138.
- Zhang, B., Chen, X., & Guo, H. (2018). Does central supervision enhance local environmental enforcement? Quasi-experimental evidence from China. Journal of Public Economics, 164, 70–90.
4.2. Continuous Emissions Monitoring Systems (CEMS) Data
- Source: each province’s publicly accessible data platform
- Resolution: firm x unit x hourly
- Coverage: e.g., Guangdong, Hubei, Shandong, Hebei, Zhejiang, and Shanghai
- Pollutants: SO2, NOx, smoke and dust, CO2, etc.
V. Weather / Climate
5.1. Ground Monitoring Data
- Source: National Meteorological Information Center of China
- Resolution: monitor x daily
- Coverage: national (700+ monitors across all 330+ prefecture-level cities), 2008–2021
- Weather Conditions: temperature, air pressure, water pressure, precipitation, relative humidity, sunshine duration, wind speed, wind direction
- Note: Downloading the data needs access as an education user.
5.2. NOAA Monitoring Data
- Source: NOAA
- Resolution: monitor x hourly
- Coverage: global; for China: national (400+ monitors across all 200+ prefecture-level cities), 1901-present
- Include elevation, temperature, dew temperature, cloud height, air pressure, visual distance, rain, snow depth, wind speed, wind direction
5.3. ERA5 Reanalysis Data
- Source: ECMWF
- Resolution: 0.25° x 0.25° x hourly (atmosphere), 0.5° x 0.5° x hourly (ocean waves)
- Coverage: global, 1940-present
- Include all the common-used variables; in addition, some other useful variables such as boundary layer height are also included.
5.4. Weather Forecast Data
- 5.4.1. ECMWF Integrated Forecasting System (IFS)
- Resolution: Different models and products have different resolutions: e.g., for the Atmospheric model Ensemble 15-day forecast (Set III - ENS): 0.1° x 0.1°, 4 forecast runs per day (00/06/12/18); different forecast time stamp for different runs (00/06/12/18): hourly, 3-hourly, 6-hourly. Detailed descriptions for different models are available here for detailed descriptions for different models.
- Access: need a subscription (€3000 (EUR) per annum). Real-time forecast data are available to the public free of charge.
- 5.4.2 NOAA Global Forecast System (GFS)
- Resolution: From 0.25°, 0.5° to 1°, depending on the product; 4 runs per day: 00, 06, 12, 18UTC; time stamp varies from hourly, 3-hourly, 12-hourly, etc.
- Coverage: 2004–present, depending on the product
- 5.4.3 National Meteorological Information Center of China
- Resolution: district x daily; including hourly forecast, next day half-day forecast, 15-day forecast, 40-day forecast.
- Coverage: 2017–present
- Note: CMDSC only has real-time data available to the public; historical data going back to 2017 are available for purchase from a private nonenterprise.
5.5. Climate Projections
- Source: CMIP5
- Resolution: From 0.125° x 0.125° x daily to 5° x 5° x daily depending on the model
- Coverage: global, 1850–2300
- Historical experiments which cover the period where modern climate observations exist. These experiments show how the GCMs performs for the past climate and can be used as a reference period for comparison with scenario runs for the future. The period covered is typically 1850-2005.
- Ensemble of experiments from the Atmospheric Model Intercomparison Project (AMIP), which prescribes the oceanic variables for all models and during all period of the experiment. This configuration removes the added complexity of ocean-atmosphere feedbacks in the climate system. The period covered is typically 1950-2005.
- Ensemble of climate projection experiments following the Representative Concentration Pathways (RCP) 2.6, 4.5, 6.0 and 8.5. The RCP scenarios provide different pathways of the future climate forcing. The period covered is typically, 2006-2100 some extended RCP experimental data is available from 2100-2300.
VI. Flood
6.1. Flood Events Data #1
- Source: Global Active Archive of Large Flood Events or Global Flood Database (GFD) generated by the Dartmouth Flood Observatory (DFO).
- Coverage: Global, 1985-2021
- Including location (shp file), start and end date, severity (quantified by damage, displaced/deaths), and main causes of large flood events across the globe.
- Note: A map of the flood event with greater geographical accuracy (a pixel resolution of 250 meters) is available at The Global Flood Database.
6.2. Flood Events Data #2
- Source: FloodList. (Access to the data has been closed but one can request the data on the waiting list.)
- Coverage: Global, early 2016–present
- For each flood event, the database includes (where possible) dates, locations (with geo-location), magnitude (rainfall, rivers, etc, also with geo-location) and damages (including fatalities, injuries, affected population, evacuations, and other material damages, also with geo-location).
6.3. MODIS/Aqua+Terra Global Flood Product
- Source: LANCE MODIS NRT global flood product (MCDWD) by NASA
- Resolution: ~250m x daily
- Coverage: global, 2012-present
- The Flood Product shows flood and surface water detected from the twice-daily overpass of the MODIS optical sensors (onboard Terra and Aqua satellites). Several flood composites are generated: 1-Day, 2-Day, and 3-Day. These require 1, 2, and 3 water detections, respectively, to mark a pixel as water. Detailed description can be found at the description page.
VII. Others
- Satellite Data in Agricultural and Environmental Economics: Theory and Practice by David Wüpper, Wyclife Agumba Oluoch and Hadi