Forward & Inverse Problems#

This section explains:

  • The hydrological modeling problem (forward/direct problem), that consists in modeling the spatio-temporal evolution of water states-fluxes within a spatio-temporal domain given atmospheric forcings and basin physical descriptors.

  • The parameter estimation problem (inverse problem), that aims to estimating uncertain or unknows model parameters from the available spatio-temporal observations of hydrological state-fluxes and from basin physical descriptors.

Forward problem statement#

The forward/direct hydrological modeling problem statement is formulated here.

The 2D spatial domain is denoted Ω with x the vector of spatial coordinates, and t is the time in the simulation window ]0,T].

Hydrological model definition#

The spatially distributed hydrological model is a dynamic operator M projecting fields of atmospheric forcings I, catchment physical descriptors D onto surface discharge Q, model states h, and internal fluxes q such that:

(1)#U(x,t)=(Q,h,q)(x,t)=M([I,D](x,t);[θ,h0](x))

with U(x,t) the modeled state-flux variables, θ the parameters and h0=h(x,t=0) the initial states.

../_images/forward_simple_flowchart.png

Flowchart of the forward modeling problem: input data, forward hydrological model M, simulated quantites.#

Detail on model variables

The sizes of the variables in the forward/direct problem are detailed here. We denote by N=Nx×Nt with Nx the number of cells in Ω and Nt the number of simulation time steps in ]0,T].

  • Surface discharge Q(x,t)RN

  • States h=(h1(x,t),...,hNh(x,t))RN×Nh with Nh the number of distinct state variables

  • Internal fluxes q=(q1(x,t),...,qNq(x,t))RN×Nq with Nq the number of distinct internal fluxes

  • Atmospheric forcings I=(I1(x,t),...,INI(x,t))RN×NI with NI the number of atmospheric forcings types

  • Physiographic descriptors D=(D1(x,t),...,DND(x,t))RN×ND with ND the number of physical descriptors

  • Parameters θ=(θ1(x),...,θNθ(x))RN×Nθ with Nθ the number of distinct parameters

  • Initial states h0=h(x,t=0)

Operators Chaining Principle#

The forward hydrological model M is obtained by chaining through fluxes at least two operators: the hydrological operator Mrr to simulate runoff from atmospheric forcings and use this runoff to feed a routing operator Mhy for cell to cell flow routing.

A snow module Msnw can also be added.

A learnable mapping ϕ, composed of neural networks, can also be included into the forward model to predict parameters and/or fluxes corrections from various input data.

Several differentiable model structures are proposed in smash and detailed in model strucures section.

../_images/forward_composition_flowchart.png

Schematic view of operators composition into the forward model M.#

Hydrological Model Operators#

The forward hydrological model is obtained by partial composition (each operator taking various other inputs data and paramters) of the flow operators writes:

(2)#M=Mhy(.,Mrr(.,Msnw(.)))

with the snow module Msnw producing a melt flux qsnwrr(x,t) feeding the production module Mrr that produces runoff flux qrrhy(x,t) feeding the routing module Mhy.

Models structures are detailed in model strucures section.

Learnable Mapping#

The spatio-temporal fields of model parameters and initial states can be constrained with spatialization rules (e.g. spatial patches for control reduction), or even explained by physiographic descriptors D. This can be achieved via an operator ϕ projecting physical descriptors D onto model conceptual parameters such that

(3)#(θ(x),h0(x))=ϕ(D(x,t),ρ)

with ρ the control vector that can be optimized.

Consequently, replacing in Eq. 1 the parameters and initial states predicted by ϕ operator, the forward model writes as:

(4)#U(x,t)=(Q,h,q)(x,t)=M([I,D](x,t);ϕ(D(x,t),ρ))

The descriptors-to-parameters mappings are described in mapping section.

Parameter Estimation problem statement#

A general formulation of the model parameter estimation problem is given here. The aim is to fit modeled quantities U(x,t)=(Q,h,q)(x,t) onto the available observations Y of hydrological responses. This is for example the classical calibration problem on discharge time series at measurement gages over a river network, or more advanced data assimilation processes using multi source observations (ex. discharge, moisture, etc) and complex data-to-parameters mappings and other constrains and regularization.

A general description of the cost function, of the optimization problem and process is given here.

../_images/Inversion_process_flowchart.png

Schematic view of the optimization process of the parameters of the forward model M (adapted from data assimilation course of [Monnier, 2024]). The parameters control vector ρ that is optimized can simply be the hydrological model control ρ:=θ in case where the learnable mapping ϕ is not used. This parameters control vector ρ can also contain initial states h0 (for example in short range data assimilation for states correction).#

Cost function#

Consider the following generic differentiable cost function composed of an observation term Jobs and a regularization term Jreg weighted by α0:

(5)#J=Jobs+αJreg

Observation term#

The modeled states variables U(x,t)=(Q,h,q)(x,t) are observed in a vector Y=H[M(ρ)]Y with H:XY the observation operator from state space X to observation space Y.

Given observations Y(x,t)Y of hydrological responses over the domain Ω×]0..T], the model misfit to observations is measured through the observation cost function:

Jobs=12YYO2
(6)#Jobs(ρ)=12H[M(ρ)]YO2

with O the observation error covariance matrix and the euclidian norm XO2=XTOX

Regularization term#

The regularization term is for example a Thikhonov regularization that only involves the control ρ and its background value ρ from which optimization is started.

Optimization#

The optimization problem minimizing the misfit J to observations writes as:

(7)#ρ^=argminρJ

This problem can be tackled with optimization algorithms adapted to high dimensional problems (L-BFGS-B [Zhu et al., 1994] or machine learning optimizers (e.g., Adam [Kingma and Ba, 2014])) that require the gradient ρJ of the cost function to the sought parameters ρ. The computation of the cost gradient ρJ relies on the composed adjoint model DρM that is derived by automatic differenciation of the forward model, using the Tapenade software [Hascoet and Pascual, 2013]. The optimization is started from a first guess ρ on the sought parameters ρ.

Note

Following this general definition of the inverse problem, multiple definitions of observation cost function, regularization as well as mappings included into the forward model are possible with smash and detailled after along with several optimization algorithms taylored adapted to solve the different parameter optimization problems.