<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:contributor>Marjolein H. J. Van Huijgevoort</dc:contributor>
  <dc:contributor>Elena Shevliakova</dc:contributor>
  <dc:contributor>Sergey Malyshev</dc:contributor>
  <dc:contributor>Paul C. D. Milly</dc:contributor>
  <dc:contributor>Paul P. G. Gauthier</dc:contributor>
  <dc:contributor>Benjamin N. Sulman</dc:contributor>
  <dc:creator>Nathaniel W. Chaney</dc:creator>
  <dc:date>2018</dc:date>
  <dc:description>&lt;p&gt;&lt;span&gt;The continual growth in the availability, detail, and wealth of environmental data provides an invaluable asset to improve the characterization of land heterogeneity in Earth system models – a persistent challenge in macroscale models. However, due to the nature of these data (volume and complexity) and computational constraints, these data are underused for global applications. As a proof of concept, this study explores how to effectively and efficiently harness these data in Earth system models over a 1/4° ( ∼ &lt;/span&gt;&lt;span&gt;25&lt;/span&gt;&lt;span&gt;km) grid cell in the western foothills of the Sierra Nevada in central California. First, a novel hierarchical multivariate clustering approach (HMC) is introduced that summarizes the high-dimensional environmental data space into hydrologically interconnected representative clusters (i.e., tiles). These tiles and their associated properties are then used to parameterize the sub-grid heterogeneity of the Geophysical Fluid Dynamics Laboratory (GFDL) LM4-HB land model. To assess how this clustering approach impacts the simulated water, energy, and carbon cycles, model experiments are run using a series of different tile configurations assembled using HMC. The results over the test domain show that (1)&amp;nbsp;the observed similarity over the landscape makes it possible to converge on the macroscale response of the fully distributed model with around 300 sub-grid land model tiles; (2)&amp;nbsp;assembling the sub-grid tile configuration from available environmental data can have a large impact on the macroscale states and fluxes of the water, energy, and carbon cycles; for example, the defined subsurface connections between the tiles lead to a dampening of macroscale extremes; (3)&amp;nbsp;connecting the fine-scale grid to the model tiles via HMC enables circumvention of the classic scale discrepancies between the macroscale and field-scale estimates; this has potentially significant implications for the evaluation and application of Earth system models.&lt;/span&gt;&lt;/p&gt;</dc:description>
  <dc:format>application/pdf</dc:format>
  <dc:identifier>10.5194/hess-22-3311-2018</dc:identifier>
  <dc:language>en</dc:language>
  <dc:publisher>European Geosciences Union</dc:publisher>
  <dc:title>Harnessing big data to rethink land heterogeneity in Earth system models</dc:title>
  <dc:type>article</dc:type>
</oai_dc:dc>