Development of a water resources distribution and management tool (SPEHR); applied
Authors: X. Lemaitre. G.A. Corzo P. G.R. Santos G, H.A. Angarita.
Abstract
Water management is a complex problem that relates to human and physical variables that are very hard to predict, there is a cultural and social and human driven variables (Agricultural practices, Cities growth and others) that are hard to capture into the complex interactions of the water allocation and the hydrological system. This research presents a new novel yet simple system that integrates information from river basin social and hydrological variables in an online system for decision support. The concept developed so far represents the Magdalena river system in Colombia, and allows to simulate and to share ideas. A communication bar includes a score of goodness provided by decision makers such that a level of agreement between actors can be achieved. Aside of this, the system is made open source such that other river basin can be set up and even other hydrological modelling systems can be plugged in.
Keywords
Coello basin, Open Sources, Water Allocation models. Water Demands, WEAP.
Introduction
Water management is a complex problem that relates to human and physical variables that are very hard to predict. Models to represent this complex problem have been developed ranging from decision support systems [1] to discussion support systems like the models based on serious gaming [2]. However, there is still cultural and social and human driven variables (Agricultural practices, Cities growth and others) that are hard to capture into the complex interactions of the water allocation and the hydrological system. Current models like WEAP [3], allow for a great variety of interactions, however, spatio-temporal information from agricultural practices, an interactive space for online decision making and quasi(social) oriented solutions are not available.
In Colombia, IDEAM’s estimations of water demand from socio-economical activities is mainly represented by irrigation, domestic consumption, industrial, mining, livestock and services [4]. Also, Colombia have five macro-basins, one of them, is The Magdalena basin which gives water to the manly Engineering EPiC Series in Engineering rate of demands in the country with a water efficiency of twenty seven liters per second per square kilometer. This system has interventions from multiple agencies. In 2010, the National Government, through the Ministry of Environment, Housing and Territorial Development (in accordance with article 12 of Law 1444 of May 4, 2011) adopted the National Policy for the Comprehensive Management of Water Resources (PNGIRH), with the support of the Netherland’s embassy. In which the objectives, strategies, goals, indicators and strategic lines of action for the management of the water resource in the country were established over a 12-year timeline and six major components were drawn up: Supply, Demand, Quality of Irrigation, Institutional Strengthening and Governability[5]. Thus Colombia is in a process of change and awareness, in which, the new generations must actively participate and supporting this change. Currently in the world, learning techniques and intersectoral cooperation are being implemented, however, implementing this transition is challenging [3]. For the transition to be implemented, a social learning process needs to take place in which stakeholders became aware of the relationship between their own frames of reference and those of other [1].
This research presents a new novel yet simple system that integrates information from river basin social and hydrological variables in an online system for decision support. The concept developed so far represents the Coello basin system in Colombia, and allows to simulate and to share ideas. A communication bar includes a score of goodness provided by decision makers such that a level of agreement between actors can be achieved. Aside of this, the system is made open source such that other river basin can be set up and even other hydrological modelling systems can be plugged in. This paper also is a continuity of the investigation developed in WEAP for Coello Basin made before by Meiline Siahaan [6].
Methods
Python’s model description.
The developed approach for the water allocation in Coello watershed in Colombia were based on 6 stages: (1) the lecture of the hydrology modeling catchment data; (2) Simulation of the irrigation demand; (3) simulation of the population demand; (4) Calculation of water supply delivered inside the catchment; (5) simulation of water allocation inside the Coello basin in Colombia. The five stages of the modeling approach and their relationships are illustrated in Figure 1. The simulation for irrigation demand was simulated through the amount of irrigation area inside the crop yields in Coello, each area unit was assigned a rate of water use as is shown in the Equation1. Also, the simulation for population demand was simulate through a linear projection of the growth population for 10 years where for each habitant was assigned a rate use of water as is shown in Equation 2.
Figure 1 Stages of the developed modeling approach and their relationship in the water allocation in Coello watershed in Colombia
Water allocation and mass balance equations
To know the amount of water that the branch actually delivered, before calculate the mass balance equation on the branch, the program calculates the proportional effective supply demand per each demand site, this process will be described following:
Once the program has simulated the irrigation and population demand, and it also have the head flow per branch, the program evaluated if the amount of available water is enough to supply all the demands if not, the program calculates a proportional amount of water that will have the opportunity to delivered as follow:
WEAP model description
WEAP (Water evaluation and Planning System) is a software tool for a water allocation planning which is developed by Stockholm Environmental Institute (SEI) to aid the water planner by simulation different scenarios of water resources management. The scenarios are created by proposing different water demands: hydrology, infrastructure, supply and resource, or water quality. This tool implement the water balance in a basin [6]. WEAP creates a comprehensive and integrated picture of a municipal, industrial and agricultural water use and respective supply sources. The model is useful to systematically identify all users and supply sources by amount and locations. All surface water supplies, ground water supplies, and interbasin supplies transfers may be included in the model. Major reservoir as well as local supply reservoir are modeled. Reporting of water supply includes: total supply resources; river, groundwater and local supply sources; evaporation losses from reservoir, rivers and tributaries retour flow; and surface and groundwater interaction[1].
Comparing supply and demand. Comparisons are made at a site level such as water treatment or wastewater treatment plant, or at an aggregate level such as city or country. Forecast of future demands may be made in several ways as compared with estimates supplies under drought or other hydrological conditions [7]. Mass balance reporting: the model can display a mass balance of withdrawals and uses at any river/tributary node, demand site, wastewater treatment facility and supply source[1]. The figure 3 shows the Coello basin model in WEAP, it also shows the interaction between nodes, rivers and tributaries.
Figure 2 Coello basin model Source: [6]
Validation method
The validation process was done through a comparison of the water volumes available in a period of time simulated by each model (WEAP & Python). This comparison was verified through the method of the Root Mean Square Error (RMS). Therefore, for the validation of the simulation, it was selected a reference scenario, this reference scenario does not involve restrictions of reservoir storage or climate change, and it works with the actual data on the Coello basin.
In the second part of the simulation there was selected one more scenario, worked by Meiline on her WEAP model, which will be describe bellow:
• Current situation (Scenario Reference 1980-2013): this scenario uses the actual data.
• Impact of climate change and socio-economics changes (Scenario All change 1981- 2013): this scenario is adapted from current situation scenario with several changes as listed in the following:
1. Increasing of potential crop evapotranspiration
2. Increasing water demand
3. Changing head flow by using the most extreme projection
4. Increasing domestic and industrial factor as the changing projection of socio
economic.
The input data used for the python model was proportionated in CVS files, which contains the information below:
• A daily series of flow in cubic meters per second for the reference scenario.
• A daily series of flow in cubic meter per second for the All change scenario.
• A monthly series of irrigation demand in cubic meters per second for references and all change scenario.
Case of Study
Coello River is located in the southwest of Bogota City, flows through Tolima Department and ends in the Magdalena River. The Coello Basin has several sub basin, they are Rio Toche, Rio Coello, Rio Combeina, Rios Barmelon, Rio Cocora and Quebrada Gallego. Water in Coello basin is mainly used for industrial, agricultural, livestock and public water demand [8]. Ibague as the capital of Tolima Department relies its water demand for domestic and industrial on the Coello basin.
For development of the model were used two CVS files provided by the WEAP model of Meiline Siahaan. One of those CVS files had a daily time series of flow in cubic meter per second and the second one had a monthly requirement of water for irrigation, both file belongs to the Coello basin in Colombia and were obtained from a meteorological IDEAM’s stations.
Figure 3. Coello Basin Source: [6]
Results
The Python model was compared with the WEAP model in terms of water volume available below the final demand point. Similarly, supply and unmet demands for the all the branches were compared. As show in the figure 4, the behavior of the output hydrographs, over the Reference Scenario, is similar, both have the peak at the same time of the year with an error of 0.3% in the amount of available water in the year. Similarly, both hydrographs present a bimodal behavior through the year. At the same time, as you can see in the figure 5, over the All Change Scenario, the behavior of the hydrographs is similar too, but in this case, they present just one peak through the year with an error of 0.4% in the amount of water. Similarly, when the comparison within the river 2, one of the branches of Coello basin, was made it was obtained, for the Reference Scenario and All Change Scenario, an error of 0.17% and 2% for the supply delivered and unmet demand for domestics demands respectively and an error of 1% and 0.5% for the supply delivered and unmet demand for irrigation demands respectively.
Figure 4. Water available at the exit of Coello basin. Scenario Reference a) WEAP model b) Python model
Figure 5. Water available at the exit of Coello basin. Scenario All change a) WEAP model b) Python model
Conclusions
The model developed in python managed to recreate the results obtained with WEAP models in a satisfactory way since the errors calculated are below 2%. This means that the program could obtained similar water allocation process than WEAP simulating the same behavior of the water demand, supply delivered, unmet demand and above all the behavior of the hydrograms at the end of each branch and at the end of the chosen basin.
This model does not have the GIS functionality, therefore the user must enter the data manually and check the mass balance equations for each branch. After that the user will be able to check the same results that was show before.
Additionally, within the process of water distribution in WEAP, it was observed that once this program makes the mass balance, if at some point it fails to supply the demand completely, it assumes that the precipitation was taken in a timely manner. Therefore, this program has continued its development, improving the spatial distribution of rainfall through current rainwater runoff transformation models, and also improving functionalities such as the simulation of the irrigation demand and the GIS functionalities.
References
[1] W. Johnson, Q. Williams, and P. Kirshen, “WEAP: A Comprehensive and Integrated Model of Supply and Demand,” Georg. Water Resour. Conf., pp. 291–293, 1995.
[2] J. Craven, H. Angarita, G. A. Corzo Perez, and D. Vasquez, “Development and testing of a river basin management simulation game for integrated management of the Magdalena-Cauca river basin,” Environ. Model. Softw., vol. 90, no. January, pp. 78–88, 2017.
[3] C. Figueroa and M. Escobar, “Modelación hidrológica del recurso hídrico en la subcuenca del Río Aipe en Colombia ‘Ríos del páramo al valle, por urbes y campiñas,’” Sei,
no. Figura 1, pp. 1–4, 2015.
[4] IDEAM, “Estudio Nacional del Agua 2010,” Estud. Nac. del Agua 2010, p. 69, 2010.
[5] IDEAM, INVEMAR, SINCHI, IIAP, and IAvH, Informe del Estado del Medio Ambiente y de los Recursos Naturales 2015. Documento Síntesis. 2016.
[6] M. H. Siahaan, “Development of a Web-based Water Allocation Model: Case Study of Coello Basin, Colombia,” no. April, 2016.
[7] D. Yates, D. Purkey, J. Sieber, A. Huber-Lee, and H. Galbraith, “WEAP21—A Demand-, Priority-, and Preference-Driven Water Planning Model,” Water Int., vol. 30, no. 4, pp. 501–512, 2005.
[8] Ideam, “Zonificación y Codificación de Cuencas Hidrográficas,” p. 46, 2013.
Chaotic Statistical Downscaling (CSD):
Application and Comparison in the Bogotá River Basin
Authors: Freddy Duarte, Gerald Corzo, Germán Santos and Oscar Hernández.
Abstract
This study presents a new statistical downscaling method called Chaotic Statistical Downscaling (CSD). The method is based on three main steps: Phase space reconstruction for different time steps, identification of deterministic chaos and a general synchronization predictive model. The Bogotá river basin was used to test the methodology. Two sources of climatic information are downscaled: the first corresponds to 47 rainfall gauges stations (1970-2016, daily) and the second is derived from the information of the global climate model MPI-ESM-MR with a resolution of 1,875° x 1,875° daily resolution. These time series were used to reconstruct the phase space using the Method of Time-Delay. The Time-Delay method allows us to find the appropriate values of the time delay and the embedding dimension to capture the dynamics of the attractor. This information was used to calculate the exponents of Lyapunov, which shows the existence of deterministic chaos. Subsequently, a predictive model is created based on the general synchronization of two dynamical systems. Finally, the results obtained are compared with other statistical downscaling models for the Bogota River basin using different measures of error which show that the proposed method is able to reproduce reliable rainfall values (RMSE=73.37).
Introduction
Global models for the management and planning of water resources in different parts of the world are being developed and used to simulate climate future scenarios. Global Climate Models (GCM) of different organizations in the world provide meteorological information for multiple future scenarios at large scale. In this sense, new challenges have been presented to be able to use this information at Water management is a complex the basin level. GCM generate climate information of fundamental variables at a cell scale that may need to be transform to a smaller basin scale. Different techniques exist in this area, from physical models to stochastic models, as observed in (Corzo, Jonoski, Yimer, Xuan, & Solomatine, 2009), among the most used techniques are highlighted: ANN (Artificial Neural Network), SDSM (Statistical Downscaling Model), ADC (Advanced Delta Change Method) and WRF (The Weather Research and Forecast model).
However, considering that the climate systems and its associated dynamical processes are essentially non-linear, and possibly chaotic, the effectiveness of these techniques may be limited and the deterministic estimation of the precipitation obtained from GCMs for hydrological modeling is difficult, as described in (Sivakumar & Ronny, 2010). In view of this, it is necessary the creation of a downscaling model that considers explicitly the properties of the chaotical systems that have the potential to improve downscaling of rainfall. The analysis of deterministic chaos in time series has been developed in different study areas, being widely used in forecasts of time series and estimation of missing data as presented in (Gallego, 2010) and (Siek, 2011), also allowing the creation of predictive models, such as those presented in (Hernández, 2009). The application of the concepts of nonlinear dynamics and chaos theory in hydrological studies in recent decades has been considerable, especially in the study of the presence of deterministic chaos in precipitation time series. One of the pioneers was Hense, who in 1987 analyzed a series of 1008 monthly rainfall records in Nauru, observing deterministic chaos in a low dimension of the phase plane. Rodríguez-Iturbe (1989); Shafiri (1990); Tsonis (1993); Jayawardena and Lai (1994); Waelbroeck (1998); Berntsson (1994); Georkakos (1995); Sivakumar (2000) and Gaume (2002) have also reported the presence of deterministic chaos in precipitation time series.
Figure 1. Location of the Bogotá River Basin in the country (left) and in the department (right)
Figure 2. Cells of the Global Climate Model: MPI-ESM-MR, in the Bogotá river basin. Coordinate reference system: WGS-84
For the local data, 47 ground rainfall gauges stations selected from IDEAM (Institute of Hydrology, Meteorology and Environmental Studies, Colombia) were used for the period 1958-2016, as shown in the Figure 3, observing that there is an average of 12239 daily precipitation data per station, for use as predictors, with an average value of 2.76 mm, an average standard deviation of 6.69 mm and an average maximum value of 105.54 mm.
Figure 3. Ground rainfall gauge stations selected for the downscaling in the Bogotá River Basin
The Chaotic Statistical Downscaling model (CSD) evaluate the presence of deterministic chaos for different rainfall accumulation intervals for both of the time series (rainfall gauge stations and the global climate model). Then a chaotic predictive model is constructed with the results of the time delay, the embedding dimension and the Lyapunov exponents found for the "optimal" precipitation accumulation interval of each dynamic system. The predictive model is based on the synchronization between two dynamical systems, given by the parameter μ of the mutual false nearest neighbor’s method (MFNN).
According to (Sivakumar & Ronny, 2010): "The phase space is essentially a graph, whose coordinates represent the variables necessary to fully describe the state of the system at any time, the trajectory of the phase space diagram describes the evolution of the system for an initial state. The "region of attraction" of these trajectories in the phase space provides important qualitative information when determining the degree of complexity of the system ", as shown in Figure 4 where the phase space is observed in three dimensions for different accumulation intervals of precipitation. The limiting set that brings together asymptotic trajectories close to equilibrium is known as 'attractor'. The attractors of deterministic chaotic systems can exhibit an unusual type of self-similarity and present structures at all their scales and it is therefore necessary to find an appropriate dimension of the phase plane, such that the structure of the attractor remains invariant.
Figure 4. Comparison of the phase space of the precipitation time series for Cell 1 of the Bogotá river basin for rainfall accumulations of 1 day (left), 5 days (center) and 30 days (right).
For the evaluation of the presence of deterministic chaos in the dynamical systems Lyapunov exponents were used, for which it was necessary to reconstruct the phase space with the Method of Time-Delay, which finds the appropriate values of the time delay (τ) and embedding dimension (m) to capture the attractor dynamics. The autocorrelation function and the mutual information were used for the selection of the time delay, and the embedding dimension was selected using the correlation dimension method, the False Nearest Neighbors method (FNN) and Cao’s method, which were successfully used in (Gallego, 2010), (Hernández, 2009) and (Siek, 2011).
Within the different phenomena of dynamical non-linear systems, the synchronization of dynamical systems has had a great importance in the last decade, being used in a wide variety of practical and experimental applications, such as electrical and electronic circuits, lasers, telecommunications systems and chemical reactions. The synchronization of two or more dynamical systems is a fundamental phenomenon that is obtained when at least one of the systems changes its trajectory due to a coupling (unidirectional or bidirectional) with another system, allowing a coherent behavior in coupled systems. In chaotic systems, given the dependence of the initial conditions and their evolution in different attractors and dimensions, the synchronization process allows the systems to follow a common and defined trajectory.
The mutual false nearest neighbor’s method created by (Rulkov, Sushchik, Tsmiring, & Abarbanel, 1995), is a statistical technique based on the calculation of the parameter μ, which evaluates the local neighborhoods between two time series, so that μ takes values of the order of 1 if exist complete general synchronization, otherwise, μ has to be a number whose magnitude is comparable with the product of the attractor size divided by the product of the distance between the nearest neighbors in the time series x and y. The formula for the calculation of μ is given in Equation 1 and the relation between points of the attractors is shown in Figure 5,
Equation 1. Mutual false Nearest Neighbors
Where,
Xn and Yn , are points of the drive and response system at an instant n
XnNND and YnNNR are the closest neighbors of and in their respective systems.
XnNNR and YnNND are points of the opposite system at time n for ynnR and xnnD.
Figure 5. Relation between Xn ,Yn ,XnNND , YnNNR, XnNNR and YnNND for drive and response systems
Results and Discussion
With the purpose of obtain a wider range of analysis, the evaluation of the presence of deterministic chaos was performed in intervals of 1,3,5, 7, 10,15 and 30 days. Analyzing the results, it is observed that in the cells of the GCM and in the rainfall gauge stations time series, starting from a rainfall accumulation of 5 days the type of movement of the dynamic system is no longer random (noise) and becomes mainly deterministic chaos, ensuring the short-term predictability for climate projections. It was also observed that in both time series the type of movement of the dynamic system changes from deterministic chaotic to stable for a rainfall accumulation interval of 15 to 30 days, ensuring the existence of long-term predictability for monthly climatic projections.
The phase space reconstruction for local gauge stations and GCM cells shows that the embedding dimensions are 6 and 8, respectively. This indicates that the GCM is more complex than the dynamic system shown by the local gauge stations. The complexity is also reflected when calculating the average attractor size of 3.5 for the local stations and 4.86 for the GCM. These results are of great importance, since they allow to identify the degree of difficulty to synchronize the two chaotic systems.
The values obtained for the parameter μ of the mutual false nearest neighbor’s method are greater than the theoretical value of μ = 1 ideal for general synchronization. However, it is possible to observe that using the CSD model it was possible to decrease the difference with the ideal theoretical value by finding the minimum possible distances for - and - , which represents a slight increase in the synchronization of the two systems, as presented in Figure 6 for the station pr_21185040
Figure 6. Results of the parameter μ of mutual false nearest neighbor’s method for station pr_21185040
In the process of compare the downscaling model, a wide variety of statistical downscaling methods were used in the Bogotá river basin, for a total of 16 techniques, including: CSD, k-NN Bootstrapping, Delta methods, Analog methods, Quantile Mapping method, Weather Type methods and Generalized Linear Methods, most of the methods were include in the Meteolab toolbox (Cofiño, et al., 2013). These methods were evaluated under three different measures of error (Mean value difference, Max value difference and Root-Mean Square Error (RMSE)), in the Figure 7, the RMSE comparison is shown.
Finally, it was possible to compare the monthly multiannual precipitation, the annual precipitation, and the return period for a wide variety of downcaling techniques with the reference period, and use these results to estimate the future river flows using a hydrological model. In the Figure 8 is shown the spatial variability for the percentage difference between the historical reference and the projected time series using the CSD model.
Figure 7. Comparison of the RMSE error obtained for different downscaling techniques
Figure 8. Spatial variability of the differences in average precipitation between the historical reference records 1970-2000 and the series projection using the MPI-ESM-MR RCP8.5 realization for 2070-2100 using the Chaotic Statistical Downscaling Model
Conclusions
It was found the presence of deterministic chaos in all the local stations time series for 5-day accumulation intervals (46 stations) and 7 days (1 station) for the Bogotá river basin. Likewise, deterministic chaos was found in the cells of the GCM for accumulation intervals greater than 5 days. This result is very important since it shows that it is possible to formulate models to improve the transformation of precipitation data between different scales.
In the validation process of the CSD model, it was found that it shows an optimum performance in the simulation of the mean values of the time series with the RMSE. However, it is necessary to improve the modeling of extreme events (droughts and floods) with a more efficient calibration of the model parameters, that consider better the bias of the mean and extreme values.
In the constructing process of the predictive model, it was not found complete general synchronization between the two dynamic systems, despite having found a slight degree of synchrony between them. However, given the results obtained by the scale reduction technique, the degree of synchrony found is sufficient, and it is recommended to continue exploring to find a better synchronization between the dynamic systems that allow better results.
References
Angarita, H. (2014). Metodología para incluir variabilidad climática y escenarios de cambio climático en el modelo WEAP de la macro Cuenca del Rio Magdalena y resultados de las simulaciones. Bogotá.
Cofiño, A., Ancell, R., San-Martín, D., Herrera, S., Gutiérrez, J., & Manzanas, R. (2013). MeteoLab: An open-source Matlab toolbox for Meteorology & Climate. Cantabria .
Corzo, G., Jonoski, A., Yimer, G., Xuan, Y., & Solomatine, D. (2009). Downscaling Global Climate Models Using Modular Models and Fuzzy Committees. Gallego, J. (2010). Aplicación de la teoría de caos para el análisis y pronóstico de series de tiempo financieras en Colombia. Bogotá.
Hernández, O. (2009). Analysis and Optimization of Chaotic Models for Storm Surge Prediction. Delft. Rulkov, N., Sushchik, M., Tsmiring, L., & Abarbanel, H. (1995). Genralized Synchronization of chaos in directionally coupled chaotic systems. Physical Review E Volume 51 Number 2, 980-994.
Siek, M. (2011). Predicting Storm Surges: Chaos, Computational Intelligence, Data Assimilation, Ensembles. Delft. Sivakumar, B., & Ronny, B. (2010). Advances in Data-Based Approaches for Hydrologic Modeling and Forecasting. Davis, USA: World Scientific Publishing.
DEVELOPMENT OF A DISTRIBUTED HYDROLOGICAL MODEL FOR THE COELLO RIVER BASIN IN COLOMBIA
Authors: Nicolás A. López, Germán Santos And Gerald Corzo.
Abstract
The Coello river basin has been studied in the past with conceptual models that show the high importance of the spatial representation for water availability and contamination in the central region of Colombia. The state-of-the-art modelling software comprise a large gamma of open source and commercial software. However, still spatial distributed systems are limited in their application and their flexibility to represent regional interactions. For this purpose and to be able to analyze the Coello river basin, this work uses the concept of the distributed Tracer-Aided Catchment model (TACD). The concept of Tracer aid (TAC) uses a gridded system that comprises upper zones that can be homogenous and groundwater systems that have interactions. This was an important component due to the future research in the mining industry in the region. For now, the adaptation from an old format in a PCRASTER library has been updated to the latest Python. This updated version will be referred as TACD2. To evaluate the distributed hydrological modelling tool a model for the Coello was build. The model parameters were calibrated to minimize the Nash-Sutcliffe error between observed and estimated flow data. However, water balance and the RMSE were evaluated. The distributed model allowed us to identify the hydrological sectorization of processes. The basin shows that the high region represents almost half of the flow that arrives to the Payandé station. Moreover, topographic variability of the basin might increase its vulnerability to extreme events.
Keywords
Hydrology; hydrological model; TACD; Python.
Introduction
Hydrological models are generally useful to represent hydrological behavior of a watershed under average or extreme conditions. These results are usually analyzed through flow or levels at the discharge point. One common way to analyze hydrological process in a basin is through tracers. However, it is not common to find this in hydrological modelling systems. The Tracer-Aided Catchment model (TAC) was developed to consider hydrological processes on the basin as well as natural tracer information obtained at some hydrometeorological stations (Uhlenbrook 2000). TAC model is a conceptual lumped tank model that classifies subbasins into 6 different dominant hydrological processes and for each process, making it quite useful on the sectorization and on the possibilities of representing groundwater interaction. The modelling system originally was developed for the Brugga basin, located in the Dark Forest (Germany). For this implementation, it was included the capacity and permeability parameters for its calibration process.
The TAC model was later improved as a semi-distributed model using the GIS software PCRaster version 2.0, TACD (Roser 2001). From this update on, TACD model takes into consideration spatial variability of soil stratification, land use, geomorphologic and meteorological distributed information. Rainfall-runoff transformation is done according to some defined hydrological processes, as listed below (Menkveld 2003):
i. Flat hilltops: Areas with a tertiary weathering layer on hard rock. Water in this zone has long residence times due to low infiltration capacity.
ii. Periglacial drift layered cover: hydraulic permeability decreases with depth, so that it can be represented with a 2-tank schema with the upper tank for macropore flow and the lower tank for flow through a less permeable base layer. Percolation is small.
iii. Non-layered drift cover on the main layer: like the periglacial drift cover, but the lateral flow will fill the base layer first, and then water will be routed to the upper layer.
iv. Boulder field: coarse granular material and therefore hydraulic conductivity is very high. This can be modeled with one tank whose outflow constant is high.
v. Accumulation zone at foothills: areas near the stream where sediment layers alternate. Stratified sediments have good permeability, except those layers with where fine-grained materials allow a moderate permeability. Like the periglacial drift cover, but the base layer has a lower permeability
vi. Moraine area: areas with moderate hydraulic conductivity and slow runoff routing due to differences in composition. Modeled with a single tank
vii. Riparian area: area close to the stream where soil saturation is almost permanent during the whole year. Modeled with a single tank where maximum capacity is defined such that if exceeded, then water is directed directly into the stream. Evapotranspiration is assumed to be potential as it is saturated during the whole year
Figure 1. Conceptualization for TACD tank modeling.
Each cell in the model domain is classified in one of the previous categories, according to some criteria defined by the researcher. In the Brugga basin, criteria for classification is shown on Figure 1. As mentioned before, for each dominant process, capacity, conductivity and permeability parameters are calibrated to appropriately represent hydrological behavior of sub basins where water flow and water level is available for a convenient period of time.
Input of climatic data depends on the represented variable. While precipitation and temperature are spatial and temporally distributed data, land elevation and average time of solar radiation per day are spatially distributed data that remain constant through the model evaluation. Other data such as observed water flow and water level are scalar time series, that is, they are considered a boundary condition for a single cell on the watershed.
Study Area: Coello River Basin
Coello river basin is located at the center of the Tolima Department, in Colombia, with an extent of around 1800 km2, representing 7,8% of the total area of Tolima. Coello river starts at the Don Simón moor (3850 m.a.s.l.), flowing with an average slope of 3.2% through approx. 121 km from west to east until it reaches the left bank of the Magdalena River (300 m.a.s.l.).
Coello river and its tributaries play an important role in the regional development, because several nearby cities get water from it, such as Cajamarca, Ibagué, El Espinal and Flandes, among others. Its main tributary rivers are Toche, Bermellón, Cocora, Andes, Gallego and Combeima. Coello river has an estimated water supply for the region of 30,9 m3/s and an estimated water demand of 23,8 m3/s. Main water demand sources are drinking water supply, crop cultivation (coffee, grains, citric fruits, sugarcane, among others), animal husbandry (meat, eggs, eggs, etc), and mineral extraction.
Coello river basin has a mean annual precipitation of 1517,8 mm, on average being April the rainiest month (188,9 mm) and January the driest (66,7 mm). Mean temperature for the basin is 19.8°C, but varies from 28,6°C in the lowest points to 3°C at the moor. In modern times, flood events have been observed at the study region, such as those that occurred in Perales stream (1989), Anaime River (1993) and El Oso stream (1998), causing several material losses.
Figure 2. Coello basin location in Colombia, with main rivers and elevation model.
Methodology
Methodological Description
In order to use the TACD model to represent hydrological behavior of the Coello river basin, the model was updated to a recent version of its GIS software PCRaster version 4.1. Some capabilities of previous versions were no longer supported, and some new features were added. Therefore, a complete code rewriting in Python was necessary. In the current version of PCRaster, a Python framework interface is available for use.
Having in mind that rewriting the model scripts could lead to numerical differences between the two versions of the TACD model, validation of the new results with the original model results were necessary to ensure correctness of updated model logic, and for that reason, the Nash-Sutcliffe Efficiency was used between the original and the updated model in the Brugga basin.
Being sure that the updated model accurately calculates hydrological behavior for a basin, then the model and the input data for Coello river basin should be adapted to one another, such as maps and time series formats, folder organization, initial parameters, etc.
After setting up the Coello river basin model with TACD2, a calibration process followed in order to obtain a subset of model parameters that minimizes the error between observed and modeled runoff at the Payandé flow station. Results and comments on this process are described below.
TACD Model Main Features
As mentioned before, TACD is a distributed hydrological model that runs several routines for the hydrological processes in a watershed on a daily basis and outputs spatial as well as scalar information regarding soil moisture, snow content, stream flow and evapotranspiration, among others (Roser 2001). It is important to remember that as a distributed model, TACD calculates each of these routines averaging for each cell (pixel) in the study area by using several kinds of input such as spatio-temporal data (precipitation, temperature, vegetal coverage maps, Leaf-Area Index), spatial stationary data (channel width, digital elevation model, forest areas, water flow direction, manning’s n coefficient, streams, sun hours per day, urban areas) and scalar data (observed waterflow, tank parameters), as required in each routine:
• For the potential evapotranspiration, the Turc-Wendling approximation is used, which takes as input the daily solar radiation and the average daily temperature for each cell in the study area
• For the snow content balance, a 2-tank submodel was developed. This submodel separates the solid and the liquid part of the precipitation and calculates freezing and melting processes
• For the interception of precipitation (mostly due to plant canopies), a tank submodel is used. In this submodel, a fraction of the flux going towards the ground is intercepted in a small tank with limited capacity, so that if the water volume plus the current content of the tank exceed the tank capacity, then surplus will continue its way to the ground, as if not intercepted. Otherwise, evaporation processes occur on the tank and eventually might empty the tank. For this submodel, Leaf-Area Index (LAI) and Normalized Difference Vegetation Indices (COV) maps are required
• For the runoff in urban zones, the precipitation has a short concentration time compared to natural fields and therefore the model concentrates directly that precipitation into the most upstream part of the closest river where this urban subbasin drains to
• In saturated zones (e.g. lakes), infiltration rates are low compared to runoff and therefore, most precipitation actually flows directly with little retention effect. This is achieved with a two tank submodel whose upper tank capacity is low and whose lower tank still allows subsuperficial flow. In a similar way, the part of the precipitation that falls on the stream surface, is directly added to the waterflow in the stream
• For the soil infiltration, routines are very similar to the HBV model and represent infiltration as an exponential function of soil moisture and field capacity. In this routine, actual evapotranspiration is also calculated as a function of soil moisture, the field capacity and the potential evapotranspiration
• For the runoff routine, the tank parameters are used in order to calculate for each cell how the water flows through each of the tanks in each cell, according to the runoff-rainfall transformation defined for each cell
• For the channel flow routine, water routing is calculated along the river. This is done through the kinematic wave implementation in PCRaster Python Framework, taking as input the manning’s n coefficients, channels width and average length of channel per cell, as well as the spatial and temporal discretization dx and dt
TACD2: Model Update to Python
When writing the Python version of the model, some features of the original PCRaster version were adapted with built-in functions of the Python language. Besides, as the new framework is object-oriented, some major changes in model structure were made for readability and completeness.
A class called TACD2 is instatiated by setting up the path to the clonemap (study area) in .map format. After that, a DynamicFramework must be created for the TACD2 object with the initial and final timestep (with Python arguments firstTimeStep and lastTimeStep), and then the dynamic model is run by calling its run() procedure. All other procedures and attributes inherited from the PCRaster DynamicModel class are also available, such as setQuiet() for enabling or disabling output for debugging.
Figure 3. TACD2 script structure (object-oriented).
Model Validation
For testing goodness of fit from the updated model to the previous model, Nash-Sutcliffe model efficiency coefficient (NSE) was used as described in Nash et al. (1970). NSE, calculated as in Eq. [1], can take values from –∞ to +1, where an efficiency of 1 (NSE = +1) corresponds to a perfect match between observed and modeled data. Moreover, an efficiency of 0 corresponds to data where the modeled data is as precise as the mean of the observed series in forecasting the observed results.
Where:
𝑄𝑡𝑚: Modeled variable through time t (e.g. flow)
𝑄𝑡𝑜: Observed variable through time t (e.g. flow)
𝑇: Total time, i.e., number of timesteps
𝑄𝑜̅̅̅̅: Mean for observed variable
Model Setup for Coello River Basin
Taking into consideration difficulties by updating TACD2 model, a new folder structure for the model was defined, as shown in Figure 4. Once correctness of the TACD2 model was validated, spatial and temporal information for the Coello basin was to be converted to PCRaster required formats.
Other spatial information required for Coello basin was urban settlements location, soil classification, Leaf-Area Indices (LAI) and Normalized Difference Vegetation Indices (COV). Spatio-temporal data such as precipitation and average temperature was gathered from the European Center for Medium-Range Weather forecasts (ECMWF), ERA-Interim Database in NetCDF format. Time series such as water flow and water level in water level stations of study area were gathered from the Colombian Institute of Hydrology, Meteorology and Environmental Studies IDEAM, through its information service Surface Waters Observatory.
Figure 4. Folder structure of TACD2 model.
Parameter Calibration
Calibration scheme of TACD2 to Coello basin took into consideration water flow and water level at the Payandé Station. During calibration process of parameters, CPU time needed for this task was a bottleneck. A total of 67 parameters have to be adjusted. Most of these parameters are related to runoff generation, that is, tank model constants (Tank capacity, lateral flow between tanks, infiltration and percolation). As an example of the calculation complexity, the chosen time series take approximately 1 hour for each model run.
In order to optimize the calibration process, the standard python library scipy was used, more specifically the standard minimize procedure was applied to the Root-Mean Squared Error (RMSE) between modeled and observed waterflow at the Payandé Station.
Where:
𝑄𝑡𝑚: Modeled variable through time t (e.g. flow)
𝑄𝑡𝑜: Observed variable through time t (e.g. flow)
𝑇: Total time, i.e., number of timesteps
𝑄𝑜̅̅̅̅: Mean for observed variable
Results
Model Update and Validation
From the 74 data series obtained as a result of the model execution, 12 were not used in the original TACD model. From the remaining 62 data series, 9 had an exact match according to NSE goodness of fit, that it NSE=1. The fact that some bug corrections were made as the TACD code was rewritten in TACD2 is something to consider when comparing results of each model execution.
The data series whose comparison lead to the lowest values of NSE goodness of fit were Intercepted Precipitation (IntPrec), Intercepted Evapotranspiration (InterceptionET) and Cumulative Intercepted Evapotranspiration (EndInterceptET), with NSE values of 0.9789, 0.9798 and 0.9947, respectively, which validates correctness of the TACD2 logic. A sample of each of the mentioned result data series can be observed in Figures 5, 6 and 7.
Figure 5. Comparison of intercepted precipitation series.
Figure 6. Comparison of intercepted evapotranspiration series.
Figure 7. Comparison of cumulative intercepted evapotranspiration series.
Model Setup and Calibration
The first step for validating the calibration, the comparison of observed and modeled flow at the Payandé station is verified. For these series, the NSE coefficient is -0.57296 (RMSE=22.4937). However, Figure 8 shows that the mean trend of the modeled results corresponds to the mean trend of the observed data, but variability in both series is very different. A longer calibration process is encouraged in order to achieve a better fit.
Figure 8. Comparison between modeled and observed discharge at Payandé Station.
Figure 9 shows intercepted evapotranspiration (blue) as well as intercepted precipitation (orange). The intercepted precipitation is the sum of the precipitation that enters into the interception model tank and the interception surplus of the previous evaporation step. From the figure, it is observed that every drop of water that is intercepted evaporates and goes back to the atmosphere. Further calibration of the parameters involved in this process is necessary.
Figure 9. Intercepted evapotranspiration and intercepted precipitation.
Figure 10 shows water content in soil for the final timestep of the TACD2 model (left) and the soil classification for the basin (right). Problems due to spatial resolution of temperature can be seen in this figure, because the grid resolution for this map dataset was 11km x 11 km, compared to the 100mx100m resolution of the elevation model. Besides, a big contrast in the upper part of the basin can be observed. This water content difference is caused mainly by the temperature difference between adjacent cells. Hydrological classifications also play an important role in the model, as it can be observed that some zones become “highlighted” in the water content due to change in the soil classification.
Figure 11 displays final results for the modeled water flow at each station in the study area. Although only Payandé Station was used for calibration, it can be seen that most discharges from other subbasins have similar shapes at different scales with some small variability to each other as an expected result of local processes at each subbasin. From the current modeled flow at each station, it can be shown that the hydrological water supply potential could be estimated for each subbasin as follows: At the Payandé Station, 67.4 m3/s; at El Carmen, 42.8 m3/s; at Puente Carretera, 33.0 m3/s; at Yuldaima, 15,7 m3/s; at Bocatoma (intake), 0.7 m3/s; at Puente la Bolivar, 3.8 m3/s; at Puente Luisa, 2.1 m3/s.
Figure 10. Soil water content for last timestep (up) and hydrological classifications for Coello basin (down).
Figure 11. Modeled water flow at each station information Coello river basin.
Figure 12 shows water balance at the Payandé Station through time. One can see that there is an almost steady decrease in water balance as time advances, up to a water balance difference of -2121,38 mm at the end of the year. When comparing it to other stations, such as Puente La Bolívar or Puente Carretera, all of the water balances show the same decreasing trend, probably caused by the initial values of storage, which were calibrated for the starting part of the simulation runtime but may also be calibrated to ensure a zero sum in water balance.
Figure 12. Modeled water flow at each station information Coello river basin.
Conclusions
By validating the results of TACD2 with the former TACD model, the correctness in representing superficial and sub superficial flow processes observed and characterized previously on several basins is verified. With this result in mind, model adaption and calibration for the Coello river basin was done. Although time window for the calibration was short, the overall trend for the modeled discharge follows the observed data behavior, but with a very different variability. Results for the Payandé station shows that there is an almost constant constant decrease in the water balance in the basin.
Finally, through this work a novel distributed process-based hydrological model is made available for the scientific community. With TACD2 model, a different alternative for representing hydrological behavior of a watershed can be done for special situations where spatial and temporal variability issues might be present. TACD2 is released in its first version and although it is susceptible to improvement (e.g. User-friendly interface, GIS improvement and parallel computation capabilities), it offers a tunable yet robust possibility for hydrological distributed modeling.
References
Corporación Autónoma Regional Del Tolima. (2016). II Fase Diagnostico - Río Coello | CORTOLIMA. Retrieved on :22/12/2018 from Corporación Autónoma Regional del Tolima: https://www.cortolima.gov.co/contenido/ii-fase-diagnostico-r%C3%ADo-coello-0
Menkveld, A.J. (2003). Runoff generation in the riparian area of the Haldenbach micro-catchment (Germany), MSc Thesis. Institut für Hydrologie der Albert-Ludwigs-Universität, Freiburg i. Br., Germany.
Nash, J.E. and Sutcliffe, J.V. (1970). River flow forecasting through conceptual models part I — A discussion of principles. Journal of Hydrology. 10(3): 282–290. doi:10.1016/0022-1694(70)90255-6
Roser, S. (2001). Flächendetaillierte Weiterentwicklung des prozessorientierten Einzugsgebietsmodells TAC und Visualisierung der Modellergebnisse in einem dynamischen GIS), MSc Thesis. Institut für Hydrologie der Albert-Ludwigs-Universität, Freiburg i. Br., Germany.
Uhlenbrook, S. and Leibundgut, C. (2000). Natural tracers for Investigating residence times, runoff components and validation of a rainfall-runoff model. Proceedings of the TraM'2000 Conference, Liege, Belgium.