更全的杂志信息网

A regional suspended load yield estimation model for ungauged watersheds

更新时间:2016-07-05

1.Introduction

The prediction of suspended load(SL)is useful in the management of rivers(Sadeghi and Kheirfam,2015).Suspended load yield(SLY)in a watershed occurs through physical processes such as soil detachment,transportation,and deposition(Yang et al.,2017).The concentration of SL depends on climatic and hydrological conditions,topography,and land cover and management strategies(Melesse et al.,2011;Cho et al.,2016;Swarnkar et al.,2018).The destruction of the Hyrcanian forests in Iran and the SL flowing from watersheds into the Caspian Sea have caused a lot of problems.Therefore,measuring and/or estimating the SLY at the watershed outlets aids in management decisions(Kis‚i,2010).Despite the need to obtain accurate estimates of SLY at the watershed outlets,due to their large values,establishing SLY measurement stations in all Hyrcanian forest watersheds is not possible or economical.For this reason,it is necessary to estimate and predict SLY.Although the estimation of SLY is often conducted with rating curves(Tramblay et al.,2010),these models have a high estimation error for SLY regional analyses(Asselman,2000).Tramblay et al.(2008)indicated that the correlation between the annual maximum SLY and discharge was significant at only 92 stations out of 208 stations in North America.In this regard,Swarnkar et al.(2018)found that SLY varied with morphoclimatic regimes of watersheds,and Choubin et al.(2018)also reported the high variability of SLY induced by hydro-meteorological regimes for the watersheds of the north of Iran.Accordingly,in addition to discharge,various variables can be used to estimate the magnitude of extreme SLY,including topography,land cover,and climatic properties(Jarvie et al.,2002;Restrepo et al.,2006;Tramblay et al.,2008;Nadal-Romero et al.,2014;Swarnkar et al.,2018).It is necessary to investigate the efficiency of using climatic,hydrologic,and physical characteristics of watersheds for SLY modeling in ungauged watersheds with SLY measurement stations(Caratti et al.,2004).The extension of those models to watersheds lacking SLY measurement stations could be a convenient and practical approach(Tramblay et al.,2010).

Nevertheless,a large number of input variables is one of the most common limitations of this modeling process(Cruz-C´ardenas et al.,2016).Therefore,it is recommended to reduce the number of input variables through data reduction techniques(DRTs)(Lin and Wang,2006;Noori et al.,2011).Although various techniques are used to reduce the number of input variables,they have different mechanisms,processes,and capabilities.Therefore,it is necessary to select an appropriate DRT to obtain an optimum mode with consideration of the intended use case.For hydrological modeling,principal component analysis(PCA)(Ouarda et al.,2006;Zhang,2007),Gamma test(GT)(Corcoran et al.,2003;Moghaddamnia et al.,2009),and stepwise regression(SR)(Noori et al.,2010;Vanmaercke et al.,2011)are widely used.The above-mentioned DRTs have been used to predict discharge, flooding, evaporation, and water quality(Ramachandra Rao and Srinivas,2006;Robertson et al.,2006;Khan et al.,2007;Rashidi et al.,2016;Mohammadi et al.,2018;Sharifi Garmdareh et al.,2018),individually.Although these techniques have also been widely used to estimate river f l ows and/or SLY,there is not yet a reference technique for developing accurate models,though in one case,Kakaei-Lafdani et al.(2013)estimated the daily SLY using DRTs(GT and genetic algorithm)and predictor softwares(artificial neural network(ANN)and support vector machine).

It is obvious that the selected and introduced input data from various techniques may differ from one another,leading to different results of SLY estimation,and thus,leading to miss-managements of watersheds.Therefore,it is important for us to use the most appropriate and accurate technique(s)to select the most effective variables and then use them in SLY estimation models.In addition to using suitable DRTs,it is essential to apply accurate and user-friendly models to SLY estimation.To this end,many models have been used to estimate SLY.Of the most widely used techniques,the ANN and multiple linear regression(MLR)techniques are most often recommended due to their high accuracy and user-friendliness(Kis‚i,2008;Rajaee et al.,2011;Zare Abyaneh,2014;Nourani and Andalib,2015;Samantaray and Ghose,2018).However,for accurate SLYestimation,the ANN models are preferred to the MLR models(Yilmaz et al.,2018).In addition,Khan et al.(2016)successfully used the ANN to model hydrodynamic components of the Ramganga River Catchment of the Ganga Basin,India,an ungauged region with unreachable terrain.By and large,we need a comprehensive assessment to introduce accurate,applicable,and user-friendly model(s)for SLY estimation in ungauged watersheds.To this end,in this study we attempted to provide an appropriate regional model for SLY estimation in ungauged watersheds using input variable reduction techniques such as PCA,GT,and SR,as well as using ANN and MLR tools for modeling.

2.Materials and methods

2.1.Watershed characteristics and data collection

In this study,we selected the Hyrcanian forest watersheds,located south of the Caspian Sea,as the study area due to the importance of its regional environment.The watersheds that are located upstream of the Lahijan-Nour,Haraz-Neka,and Gorganroud rivers were selected for this research(Fig.1).The Lahijan-Nour and Haraz-Neka rivers are located at longitude 49°48E to 54°41E and latitude 35°36N to 37°19N,covering an area of approximately 28463 km2.The Gorganroud Watershed is located southeast of the Caspian Sea at longitude 54°02E to 56°16E and latitude 36°34N to 37°47N,covering an area of 13170 km2.The climate of this watershed is semiarid to semi-humid(Kheirfam and Vafakhah,2015).Additionally,the mean annual temperature of all the watersheds varies from 16.7°C to 19.1°C across the study area from the west to the east.The study area is also a part of the South Caspian Depression,which is bordered to the south by the Elburs mountainous system and to the north by the Caspian Sea.The Gilan,Mazandaran,and Golestan plains are located in the western,central,and eastern parts,respectively.This region represents the Elburs foredeep, filled with a thick sequence of Neogene-Quaternary sediments and separated from the Elburs fold system by a deep-seated regional fault(Svitoch et al.,2013).Other properties of the study area are presented in Table 1.

Our raw data were obtained from Kheirfam and Vafakhah(2015).Fourty-two watersheds were selected for the SLY modeling process using 14 climatic,hydrologic,and physiographic variables(Table 1).Furthermore,we used the SLY data obtained from the Iran Water Resources Management Company(IWRMC),which were calculated with the sediment-rating curve(SRC)function.

Fig.1.Overview of study region and location of selected watersheds.

Table 1 Definitions and characteristics of variables for study watersheds.

No. Variable Definition Unit Mean Minimum Maximum 1 A Area km2 3333.98 21.50 92238.00 2 Q2 Daily maximum discharge with two-year return period m3·s-1 33.46 1.20 177.79 3 Q p Peak discharge with two-year return period m3·s-1 46.28 262.71 1.62 4 P r Mean annual rainfall mm 636.99 446.36 1018.24 5 C c Compactness coefficient 33.16 5.23 95.25 6 T c Time of concentration h 363.69 29.70 1313.98 7 P Perimeter km 151.86 16.20 449.65 8 H w Weighted average height m 1445.93 666.90 2777.00 9 S w Watershed slope % 22.81 9.30 45.80 10 D d Drainage density km·km-2 0.28 0.10 1.13 11 S s Average stream slope % 6.21 0.10 16.97 12 S l Main stream length m 57586.9 5500.0 165000.0 13 L w Watershed length km 45.85 6.56 140.00 14 F f Form factor 1.49 0.22 2.82

2.2.Input variable reduction

(1)可以采用圆弧边分析法。所谓3点决定一个圆,1994年7月的325点、2005年6月的998点及2013年6月的1849点可作一圆弧边,大致上支撑了2018年10月的2449点。但如今又再度接近此圆弧线,恐怕跌破的机会大一些。

As mentioned above,in order to achieve the best SLY model,we used three DRTs to select the variables with the most significant influence on SLY in the Hyrcanian forest watersheds,including SR(Faraway,2002;Bethea,2018),PCA(Abrahams,1972;White,1975;Hess and Hess,2018),and GT(Koncar,1997;Adoalbj¨orn et al.,1997;Liu et al.,2003;Mohammadi et al.,2018).SR is a general statistical method and a combination of forward and backward methods that use the step-by-step algorithm to select the most effective variables based on the linear regression model.After adding all independent variables one by one,they are sorted according to their correlation coefficients(R)with the output variable.The variables with optimum R values are selected as the model input variables(Sharma and Yu,2015).PCA is widely used to identify the most effective variables from among the large number of variables.Due to this,the linear correlation between two or more variables can be established.Then the variables are summarized in a scatter plot,which is called the factor load.After this,the most effective variable influencing SLY is identified,and additional lines are drawn to maximize the remaining variability in a consecutive step-by-step extraction of factors.The variables with the highest factor loadings within each separate factor will probably share a common characteristic or combination of characteristics.The efficiency of the PCA method has been validated using Kaiser-Meyer-Olkin(KMO)and Bartlett tests(Shrestha and Kazama,2007;Peres and Fogliatto,2018).GT is another DRT that can be achieved through the use of the winGamma TM software.To identify the most effective variable(s),all the input variables are uploaded to the software first.Then,one of the variables is eliminated from the main set and the Gamma value of this run is recorded.In the second run,the eliminated variable in the first run is added to the main set,and the second variable will be eliminated.These runs are done for all the variables respectively and the Gamma values are recorded each time.Finally,the variables with the highest Gamma values,which are recorded through a procedure of elimination,are selected as input variables.Unlike SR and PCA,GT is a nonlinear technique(Moghaddamnia et al.,2009).More detailed methodological and functional descriptions of these techniques were expressed in our previous research(Kheirfam and Vafakhah,2015).

2.3.SLY modeling techniques

The ANN,designed by McCulloch and Pitts(1943),has been widely used as a forecasting tool in natural sciences.It might be one of the most successful technologies in the last two decades.The ANN has an input layer,a hidden layer,and an output layer.Each layer is made up of several nodes,and layers are interconnected by sets of correlation weights.In order to conduct the sediment prediction,the multi-layer perceptron(MLP)network has been recommended for different types of ANN(Kis‚i,2010;Heng and Suetsugi,2013).Also,in natural sciences modeling with the ANN,the feed forward(FF)and the back-propagation(BP)algorithms are common training process methods.However,it has been proven that the BP algorithms have the best performance in hydrological modeling(Maier and Dandy,1999;Nourani and Andalib,2015;Khan et al.,2016).Each node in an ANN layer receives and processes a weighted input from a preceding layer and transmits its output to nodes in the following layer through links(Vafakhah et al.,2014).The link equations and their description can be found in Khosrobeigi-Bozchaloei and Vafakhah(2015).We used four data sets,which were obtained from PCA,GT,and SR techniques,as well as raw data sets(all the 14 variables),to estimate SLY in the studied watersheds.Then,the SLY data were normalized to prevent attributes or features in greater numerical ranges from dominating those in smaller numerical ranges,as well as numerical difficulties during the calculation.In the modeling process,the data sets of SLY were scaled to a range between-1 and 1.Also,the Levenberg-Marquardt(LM)algorithm was applied to the training stage in the ANN model(Hagan and Menhaj,1994).Sigmoid tangent and linear activations were used for the hidden layer and output layer nodes,respectively(Vafakhah et al.,2014).The number of the hidden layer nodes of each model was determined after trying various network structures.The ANN training was stopped after 1000 iterations.

2.3.1.ANN

2.3.2.MLR

MLR,as a statistical method,is one of the suitable tools for modeling the linear relationship between a dependent and one or more independent variables of small sample size(Razi and Athappilly,2005).MLR attempts to model the relationship between two or more explanatory variables and a response variable by fitting a linear equation to the observed data(Bethea,2018).Each value of the independent variable(X)is associated with a value of the dependent variable(Y).As in the ANN modeling process,four data sets that were obtained from PCA,SR,and GT variable reduction results as well as the original data set were used as separate input data sets for the SLY modeling by MLR.

2.4.Model performance evaluation

In order to evaluate the performance of the regional SLY estimation models provided,80%of the watersheds were used randomly for the calibration(training)period and the remaining 20%of the watersheds were used for the validation(testing)period.Then,the models were evaluated using the relative error(RE),root mean square error(RMSE),determination of coefficient(R2),and bias(BIAS)as follows:

where O s i and M s i are the i th observed and estimated mean annual SLY,respectively;O m and M m are the average values of O s i and M s i,respectively;and N is the number of observed data.

第二,至40年代,英美陌生化文学特点初步形成。在这一阶段,英美作家受到一战的影响,战前的繁华变为废墟,面对无边的废墟,作家们再也无法将对美好生活的憧憬和向往寄托于田园生活,只能通过更为虚幻的语言形式来表现内心的空虚和迷乱。因此,这个阶段的陌生化文学作品重点描述人物的主观感受,而不是外部的客观现实。

3.Results and discussion

3.1.Most effective variables

In this study,the KMO value and the significance level of Bartlett's test for the PCA technique were 0.775 and 520.48,respectively.Therefore,PCA was considered a suitable technique for selecting the most effective variables for SLY modeling in the Hyrcanian forest watersheds according to Shrestha and Kazama(2007).Through the PCA process,we found that of the 14 variables the following five were the most effective,which explained 82.63%of variability:S l(45.74%of variability),Q2(12.02%of variability),H w(9.89%of variability),A(7.66%of variability),and P r(7.32%of variability).

Along with PCA,we determined the most important variables using the GT technique,a nonlinear method,according to the observed minimum RE results(Tsui et al.,2002).Noori et al.(2010),also using the GT technique,distinguished the influence values of variables with a one-to-one removing replacing strategy.The results of this process,in which all variables were repeated individually,indicated that Q p(with a Gamma value of G v=0.61746),F f(with G v=0.13562),S s(with G v=0.13546),L w(with G v=0.13328),and C c(with G v=0.13271)were the most effective variables in SLY modeling.

任务前:教师通过文本或影音资料引入主题,唤起学生针对这一主题的相关知识和经验。由于学生对于汽车行业的知识还不够了解,但这些知识对于任务实施是必须的,所以教师先向学生介绍一些必要和实用的汽车行业知识,比如汽车营销、汽车结构、汽车品牌、汽车保险、汽车维护的相关知识。

(2)Of all the variables,Q2 and Q p had the most significant effect on the SLY in the Hyrcanian forest watersheds.

Our results indicated that Q p and Q2 were the most effective watershed variables that had the most significant influence on the SLY in the Hyrcanian forest regions.The SLY was supplied by wash loads(with diameters of less than 63 μm,yielded from the watershed area)and bed material loads(with diameters of greater than 63 μm,yielded from the river bed and bank)(Sadeghi and Zakeri,2015).In the forest watersheds,the contribution of wash loads to SLY was negligible(Sadeghi and Kheirfam,2015;Kheirfam and Sadeghi,2017).Therefore,the major portion of SLY was supplied from bed material loads(Sadeghi and Zakeri,2015).Bed material loads mostly varied with the river flow variation(e.g.,Q p and Q2).In this regard,previous studies(Lobera et al.,2016;Rainato et al.,2017;Ebabu et al.,2018)indicated that the major volume of annual and daily suspended sediment transported by Q p and Q2,respectively,confirmed the decisive role of these variables.However,human intervention (eg., deforestation, land use change, overexploitation,and land degradation)has led to the decrease of the Hyrcanian watershed cover.During rain events,the wash load share in SLY increases(Sadeghi and Kheirfam,2015).In this study,P r was selected(by GT)as one of the important factors.Additionally,some physical variables of watersheds(H w,A,F f,S s,L w,and C c)were also distinguished through the PCA and GT techniques as effective variables influencing SLY.However,indirectly,these variables also influenced the river's hydrological behavior and the SLY(Bywater-Reyes et al.,2017).

3.2.Regional SLY estimation modeling

After selecting the most important variable using PCA,SR,and GT,the ANN and MRL were used to provide regional SLY models separately for each data set.Then,the statistical criteria were used to evaluate the performance of provided models and also to select and introduce the most appropriate regional SLY model.The structures of ANN and MLR models are given in Table 2,and the results of the evaluation performance by statistical criteria are provided in Tables 3 and 4.

In the meantime,for low values of SLY,the SR-ANN and PCA-ANN models had the best estimation of SLY in the calibration period,and for the original data set(ORG),the ANN model had the best SLY estimation during the validation period by 113,113,and 408 t/year,respectively(Tables 3 and 4).For high values for the calibration period,the MLR models underestimated the SLY.However,they had acceptable results.The analyses of the other statistical criteria indicated that the values of bias criteria for the ORG-MLR,SR-MLR,PCA-MLR,and GT-MLR models were 7.46,0.17,8.40,and 0.03 t/year,respectively,during the calibration period(Table 3).The variables selected by PCA(S l,Q2,H w,A,and P r)increased the bias of MLR results,which might be related to the P r effects on the SLY results.As described earlier,the SLY behavior became significantly more complicated with precipitation(or storm events). Therefore,it did not follow a systematic trend(Gao et al.,2013).This caused confusion in the PCA-MLR model.Regarding the SR-MLR model,selecting Q2 and Q p as the most effective variables led to low-bias estimation results due to the high dependence of SLY on the water flow,unlike the storm events.In addition to flow components(Q2 and Q p),using the physiographic parameters of watersheds(F f,S s,L w,and C c)as non-variable properties can also improve the ability of the models in SLY estimation(Chen,2018).We selected these parameters with the GT technique for GT-MLR modeling.

(1)Of the DRTs,GT had the best performance in selecting the most effective input variables for both the ANN and MLR models in the SLY estimation.

秦川所说的现实,是指戴菲儿身患绝症。他们的私人医生手捧诊断书,说,我很遗憾。戴菲儿冲他笑笑,说,谢谢您……留下来吃饭吧!她知道自己即将死去,却绝不是因为绝症。使用说明上写着充气娃娃永远不会身患绝症。使用说明上还写着总会有一个恰当的理由让她们顺利地死去,比如绝症,比如意外,比如自杀,等等。自杀的充气娃娃不多,但有。厂家在她们诞生以前就在她们的脑子里植入了自杀的理由、决心、方式和时间,她们要做的,只是等到那一天将自己顺利地结束。她们永远不会知道这其实属于“他杀”而非“自杀”——她们悲痛欲绝,却连悲痛欲绝都是虚假的;她们万念俱灰,却连万念俱灰都是虚假的。总之她们必须死去,因为她们不再有活着的价值。

建设生态农业,推广生态种植,建立有机食品基地。使用微生物技术和轮作制,即豆类、粮食、苜蓿、根茎植物不断轮种,以增加土地的氮肥和氯肥,待农作物收获之后,再把其根茎和麦秆捣碎,喷洒益生菌原液后埋入地下,使地表下形成一层肥沃的天然腐蚀质,同时又能促使有机物的转化,保持水土不致流失。通过“以菌治菌”、“以菌治虫”来减少病虫害。最终既提高农作物的产量与质量,同时田地园林免受污染,生态环境得到大大的改善。定期进行农产品基地的水质、环境、空气、土壤质量等的监测检测,推广使用有机肥料和微生物农药,清除和化解各种化学毒物,使土壤得到净化。有效控制农业面源污染。

混凝土配合比对外加剂相容性的影响较大,如砂率、石子的形状和级配、水灰比等参数,对混凝土的保水性、流动性、粘聚性、密实性等均有重要影响。另外,有些预拌混凝土厂家为了追求最大利益,用较少的胶凝材料配制较高强度等级的混凝土;用较小的水灰比配制较密实的混凝土,这些都易引起外加剂与水泥的不相容。

Table 2 Structures of ANN and equations of MLR regional SLY estimation models.

Note:1 means original data set as input for MLR,2 means SR output data set as input for MLR,3 means PCA output data set as input for MLR,4 means GToutput data set as input for MLR,5 means original data set as input for ANN,6 means SR output data set as input for ANN,7 means PCA output data set as input for ANN,8 means GT output data set as input for ANN,and S means SLY(t/year).

Model ANN structure or/and MLR equation ORG-MLR1 S=319.29Q2+80.84Q P-0.06P r+9.67T c+0.003A-57.22P+4.84H w-358.25S w+512.09D d+558.11S s-0.058S L-152.33L w+16189.86C c-13.784F f-20856.92 SR-MLR2 S=33.87Q2+201.009Q P-4858.32 PCA-MLR3 S=380S l-1.604Q2+8696.76H w+6149.12A-391.63P r-5701.52 GT-MLR4 S=243.11Q P-103.962S s-79.908L w+700.014C c-174.639 F f+1256.418 ORG-ANN5 14 input layer nodes,9 hidden layer nodes,and 1 output layer node SR-ANN6 2 input layer nodes,7 hidden layer nodes,and 1 output layer node PCA-ANN7 5 input layer nodes,8 hidden layer nodes,and 1 output layer node GT-ANN8 4 input layer nodes,12 hidden layer nodes,and 1 output layer node

Table 3 Summarized statistics of provided regional SLY estimation models in calibration period.

Model Statistics Minimum(t/year)Bias(t/year)Input data 112 89440 3900 14793 379 ORG-MLR -8391 60349 3893 12269 315 68.8 495.8 8148.0 7.46 ORG-ANN 169 89361 3926 14563 376 99.9 41.2 385.4 25.90 SR-MLR -4426 41926 10369 10176 98 33.4 1016.0 10586.0 0.17 SR-ANN 113 89259 3932 14756 375 99.9 44.4 379.2 31.60 PCA-MLR -11277 3892 53283 11447 294 60.0 849.0 9229.0 8.40 PCA-ANN 113 89119 3777 14754 390 99.9 30.0 448.0 123.00 GT-MLR -7556 53684 3900 11566 296 61.0 523.0 9093.0 0.03 GT-ANN 137 89424 3894 14788 379 99.9 26.0 323.0 6.00 Maximum(t/year)Average(t/year)Standard deviation(t/year)Coefficient of variation(%)R2(%) RE(%) RMSE(t/year)

Table 4 Summarized statistics of provided regional SLY estimation models in validation period.

Model Statistics Minimum(t/year)Bias(t/year)Input data 695 1334 1032 260 25 ORG-MLR -9481 24256 4955 12012 242 4.1 959.7 11603 3923.5 ORG-ANN 408 1406 998 346 34 50.1 46.2 514 33.3 SR-MLR -4573 6880 -217 4263 1956 3.2 393.9 4268 1250.0 SR-ANN 119 1091 598 323 53 58.1 44.7 474 433.6 PCA-MLR -10938 -389 10997 7524 1933 0.1 543.0 6930 1421.0 PCA-ANN 106 740 1314 517 69 42.0 44.0 468 291.0 GT-MLR -6159 7685 746 5039 675 3.4 476.0 4682 285.0 GT-ANN 124 1124 625 416 66 70.0 43.0 456 407.0 Maximum(t/year)Average(t/year)Standard deviation(t/year)Coefficient of variation(%)R2(%) RE(%) RMSE(t/year)

According to the evaluation of regional SLY estimation models shown in Tables 3 and 4,the performance of the ANN models was better than that of the MLR models in both the calibration and validation periods.However,compared to the MRL models,the superiority of the ANN models was proven when estimating daily SLY(Melesse et al.,2011;Pektas and Cigizoglu,2017).Of the ANN models,the GT-ANN model was the best for regional SLY estimation.The values of R2,RE,RMSE,and BIAS for the calibration and validation periods were 99.9%,26%,323 t/year,and 6 t/year;and 70%,43%,456 t/year,and 407 t/year,respectively.However,the PCAANN,SR-ANN,and ORG-ANN models also had acceptable estimation results.However,the large number of input data requirements was a major limitation for the ORG-ANN model.As various studies(Noori et al.,2011;Kakaei-Lafdani et al.,2013;Kheirfam and Vafakhah,2015;Mohammadi et al.,2018;Sharifi Garmdareh et al.,2018)have confirmed,the pre-processing of the most effective input variables improves the accuracy of black and/or gray box models,which is consistent with the present study.In contrast to PCA and SR,GT was a nonlinear valuator for rating the variables according to their effectiveness on the dependent variable(Chang and Heinemann,2018).Thus,it has a high capability of evaluating a high-volatility data set(e.g.,SLY).The fitting of the estimated and observed SLY values by all of the models is presented in Figs.2-5.According to these figures,the best fitting results were observed in the ANN models,particularly with the combination of the GT and PCA techniques.

Fig.2.Fitting of estimated and observed SLY values by ORG-MLR,ORG-ANN,SR-MLR,and SR-ANN models in calibration period.

Fig.3.Fitting of estimated and observed SLY values by ORG-MLR,ORG-ANN,SR-MLR,and SR-ANN models in validation period.

Fig.4.Fitting of estimated and observed SLY values by PCA-MLR,PCA-ANN,GT-MLR,and GT-ANN models in calibration period.

4.Conclusions

According to Table 3 and statistical analyses,the minimum,maximum,average,standard deviation,and coefficient of variation of input data in the calibration period were 112 t/year,89440 t/year,3900 t/year,and 14793 t/year,and 379%,respectively.These statistical criteria can provide useful information about the accuracy or errors of the provided models.In the calibration period,the MLR models could not estimate low values of SLY,as they underestimated the low values of observed SLY.In the validation period,we also observed the underestimated results from the MLR models for low values of SLY.In this regard,Kutner et al.(2004)reported that the MLR models describe a linear relationship between variables,and therefore,imbalance in the higher and lower values of the SLY data leads to overestimated and underestimated results from MLR models(McNeish and Stapleton,2016),similar to our data set.Meanwhile,in the calibration period for the ANN models,the estimated low values of SLY were very close to the observed values.However,the estimated low values of SLY in the validation period were acceptable.Unlike the storm events,the low values of SLY only depended on the low flow events(Sadeghi and Zakeri,2015;Sadeghi et al.,2018).Therefore,the ANN models easily simulated the low values of SLY that have a good fitting with water flow(Talebizadeh et al.,2010).In contrast to this,in the storm events,different nonlinear relationships governing the process of soil detachment and final SLY(Sadeghi and Zakeri,2015;Swarnkar et al.,2018)might lead to the inability of the ANN models to estimate high SLY values.The ability of the ANN models to precisely estimate low SLY values has been reported(Talebizadeh et al.,2010).

In addition to PCA and GT,the SR technique was also used as a linear method(Amendola et al.,2017)to select the important variables for modeling the SLY in the studied watersheds based on their R2 results.Accordingly,we used the stepwise method from regression models,in which the variables were input individually,and this operation continued until the error reached the desired significance level(Durrant,2001).The results showed that Q p and Q2 were distinguished as the most effective factors.

(3)Overall,the ANN models performed better than the MRL models in the SLY estimation.

(4)We found that the GT-ANN model had the best performance in estimating the regional SLY.Therefore,it can be considered an economic,appropriate,and reliable approach for estimating the SLY in ungauged watersheds.

悔/相道/之不察兮,延伫乎/吾将反。回/朕车/以复路兮,及/行迷/之未远。步/余马/于兰皋兮,驰/椒丘/且焉止息。进/不入/以离尤兮,退/将/复修/吾初服。制/芰荷/以为衣兮,集/芙蓉/以为裳。不吾/知其/亦已兮,苟/余情/其信芳。/高/余冠/之岌岌兮,长/余佩/之陆离。芳与泽/其杂糅兮,唯/昭质/其犹未亏。忽/反顾/以游目兮,将/往观/乎四荒。佩/缤纷/其繁饰兮,芳菲菲/其弥章。民生/各有/所乐兮,余/独/好修/以为常。虽/体解/吾犹未变兮,岂/余心/之可惩?”

Fig.5.Fitting of estimated and observed SLY values by PCA-MLR,PCA-ANN,GT-MLR,and GT-ANN models in validation period.

(5)The MLR models considered the deviation of the final estimated and observed results in order to achieve the best estimation for the overall data set(an average),which caused the poor estimation for the one-to-one data.In ANN,on the other hand,using the iteration method to train the model led to accurate estimation for both the one-to-one and the overall data sets.

(6)For future research,we recommend using a large number of data sets to develop reliable models.Using other data reduction techniques and modeling methods(e.g.,adaptive neuro-fuzzy inference,support vector machines(SVM),and neuro-fuzzy systems)could lead to a more complete awareness of the capabilities of the models.

Acknowledgements

1.4 疗效判定标准 采用美国足踝骨科协会(American orthopedic foot & ankle society,AOFAS)评分系统对术后足功能进行评估。AOFAS评分90~100分为优,75~89分为良,50~74分为可,50分以下为差。

We would like to thank Dr.Mehdi Vafakhah,Dr.Akbar Najafi,Eng.Alireza Motevalli,Eng.Saeid Janizadeh,and Eng.Saeid Khosrobeigi Bozchaloei for their valuable assistance during different stages of this study.

References

Abrahams,A.D.,1972.Factor analysis of drainage basin properties:Evidence for stream abstraction accompanying the degradation of relief.Water Resour.Res.8(3),624-633.https://doi.org/10.1029/WR008i003p00624.

Adoalbj¨orn,S.,Konˇcar,N.,Jones,A.J.,1997.A note on the Gamma test.Neural Comput. Appl. 5(3), 131-133. https://doi.org/10.1007/BF01413858.

Amendola,A.,Giordano,F.,Parrella,M.L.,Restaino,M.,2017.Variable selection in high-dimensional regression:A nonparametric procedure for business failure prediction.Appl.Stoch Model Bus.Ind.33(4),355-368.https://doi.org/10.1002/asmb.2240.

Asselman,N.E.M.,2000.Fitting and interpretation of sediment rating curves.J.Hydrol.234(3-4),228-248.https://doi.org/10.1016/S0022-1694(00)00253-5.

Bethea,R.M.,2018.Statistical Methods for Engineers and Scientists.Routledge,Abingdon-on-Thames.

Bywater-Reyes,S.,Segura,C.,Bladon,K.D.,2017.Geology and geomorphology control suspended sediment yield and modulate increases following timber harvest in temperate headwater streams.J.Hydrol.548,754-769.https://doi.org/10.1016/j.jhydrol.2017.03.048.

Caratti,J.F.,Nesser,J.A.,Lee Maynard,C.,2004.Watershed classification using canonical correspondence analysis and clustering techniques:A cautionary note1.J.Am.Water Resour.Assoc.40(5),1257-1268.https://doi.org/10.1111/j.1752-1688.2004.tb01584.x.

Chang,F.,Heinemann,P.H.,2018.Optimizing prediction of human assessments of dairy odors using input variable selection.Comput.Electron.Agric.150,402-410.https://doi.org/10.1016/j.compag.2018.05.017.

Chen,C.N.,2018.Application of physiographic soil erosion-deposition model in estimating sediment flushing efficiency of empty storage.J.Earth Syst.Sci.127(6),86.https://doi.org/10.1007/s12040-018-0989-1.

Cho,J.,Her,Y.,Bosch,D.,2016.Sensitivity of simulated conservation practice effectiveness to representation of field and in-stream processes in the Little River Watershed.Environ.Model.Assess.22(2),159-173.https://doi.org/10.1007/s10666-016-9530-6.

Choubin,B.,Darabi,H.,Rahmati,O.,Sajedi-Hosseini,F.,Kløve,B.,2018.River suspended sediment modelling using the CART model:A comparative study of machine learning techniques.Sci.Total Environ.615,272-281.https://doi.org/10.1016/j.scitotenv.2017.09.293.

Corcoran,J.J.,Wilson,I.D.,Ware,J.A.,2003.Predicting the geo-temporal variations of crime and disorder.Int.J.Forecast.19(4),623-634.https://doi.org/10.1016/S0169-2070(03)00095-5.

Cruz-C´ardenas,G.,Silva,J.T.,Ochoa-Estrada,S.,Estrada-Godoy,F.,Nava-Vel´azquez,J.,2016.Delineation of environmental units by multivariate techniques in the Duero river watershed,Michoac´an,Mexico.Environ.Model.Assess.22(3),257-266.https://doi.org/10.1007/s10666-016-9534-2.

Durrant,P.J.,2001.Wingamma:A Non-linear Data Analysis and Modeling Tool with Applications to Flood Prediction.Ph.D.Dissertation.Cardiff University,Wales.

Ebabu,K.,Tsunekawa,A.,Haregeweyn,N.,Adgo,E.,Meshesha,D.T.,Aklog,D.,Masunaga,T.,Tsubo,M.,Sultan,D.,Fenta,A.A.,et al.,2018.Analyzing the variability of sediment yield:A case study from paired watersheds in the Upper Blue Nile Basin,Ethiopia.Geomorphology 303,446-455.https://doi.org/10.1016/j.geomorph.2017.12.020.

Faraway,J.,2002.Practical Regression and ANOVA in R.University of Bath,Bath.

Gao,P.,Nearing,M.A.,Commons,M.,2013.Suspended sediment transport at the instantaneous and event time scales in semiarid watersheds of southeastern Arizona,USA.Water Resour.Res.49(10),6857-6870.https://doi.org/10.1002/wrcr.20549.

Hagan,M.T.,Menhaj,M.B.,1994.Training feedforward networks with the Marquardt algorithm.IEEE Trans.Neural Network.5(6),989-993.https://doi.org/10.1109/72.329697.

Heng,S.,Suetsugi,T.,2013.Using artificial neural network to estimate sediment load in ungauged catchments of the Tonle Sap river basin,Cambodia.J.Water Resour.Protect.5(2),111-123.https://doi.org/10.4236/jwarp.2013.52013.

Hess,A.S.,Hess,J.R.,2018.Principal component analysis.Transfusion 58(7),1580-1582.https://doi.org/10.1111/trf.14639.

Jarvie,H.,Oguchi,T.,Neal,C.,2002.Exploring the linkages between river water chemistry and watershed characteristics using GIS-based catchment and locality analyses.Reg.Environ.Change 3(1),36-50.https://doi.org/10.1007/s10113-001-0036-6.

Kakaei-Lafdani,E.,Moghaddamnia,A.,Ahmadi,A.,2013.Daily suspended sediment load prediction using artificial neural networks and support vector machines.J.Hydrol.478,50-62.https://doi.org/10.1016/j.jhydrol.2012.11.048.

Khan,J.A.,van Aelst,S.,Zamar,R.H.,2007.Building a robust linear model with forward selection and stepwise procedures.Comput.Stat.Data Anal.59,239-248.https://doi.org/10.1016/j.csda.2007.01.007.

Khan,M.Y.A.,Hasan,F.,Panwar,S.,Chakrapani,G.J.,2016.Neural network model for discharge and water-level prediction for Ramganga River catchment of Ganga Basin,India.Hydrol.Sci.J.61(11),2084-2095.https://doi.org/10.1080/02626667.2015.1083650.

Kheirfam,H.,Vafakhah,M.,2015.Assessment of some homogeneous methods for the regional analysis of suspended sediment yield in the South and Southeast of the Caspian Sea.J.Earth Syst.Sci.124(6),1247-1263.https://doi.org/10.1007/s12040-015-0604-7.

Kheirfam,H.,Sadeghi,S.H.R.,2017.Variability of bed load components in different hydrological conditions.J.Hydrol.:Reg.Stud.10,145-156.https://doi.org/10.1016/j.ejrh.2017.03.002.

Khosrobeigi-Bozchaloei,S.,Vafakhah,M.,2015.Regional analysis of flow duration curves using adaptive neuro-fuzzy inference system.J.Hydrol.Eng.20(12),06015008.https://doi.org/10.1061/(ASCE)HE.1943-5584.0001243.

Kis‚i,¨O.,2008.Constructing neural network sediment estimation models using a data-driven algorithm.Math.Comput.Model.79,94-103.https://doi.org/10.1016/j.matcom.2007.10.005.

Kis‚i,¨O.,2010.River suspended sediment concentration modeling using a neural differential evolution approach.J.Hydrol.389(1-2),227-235.https://doi.org/10.1016/j.jhydrol.2010.06.003.

Koncar,N.,1997.Optimisation Methodologies for Direct Inverse Neurocontrol.Ph.D.Dissertation.University of London,London.

Kutner,M.H.,Nachtsheim,C.,Neter,J.,2004.Applied Linear Regression Models.McGraw-Hill/Irwin,New York.

Lin,G.F.,Wang,C.M.,2006.Performing cluster analysis and discrimination analysis of hydrological factors in one step.Adv.Water Resour.29(11),1573-1585.https://doi.org/10.1016/j.advwatres.2005.11.008.

Liu,C.W.,Lin,K.H.,Kuo,Y.M.,2003.Application of factor analysis in the assessment of groundwater quality in a Blackfoot disease area in Taiwan.Sci.Total Environ.313(1-3),77-89.https://doi.org/10.1016/S0048-9697(02)00683-6.

Lobera,G.,Batalla,R.J.,Vericat,D.,L´opez-Taraz´on,J.A.,Tena,A.,2016.Sediment transport in two Mediterranean regulated rivers.Sci.Total Environ.540,101-113.https://doi.org/10.1016/j.scitotenv.2015.08.018.

Maier,H.R.,Dandy,G.C.,1999.Neural Networks for the prediction and forecasting of water resources variables:A review of modelling issues and applications.Environ.Model.Softw.15(1),101-124.https://doi.org/10.1016/S1364-8152(99)00007-9.

McCulloch,W.S.,Pitts,W.,1943.A logical calculus of the ideas immanent in nervous activity.Bull.Math.Biol.5(4),115-133.https://doi.org/10.1007/BF02478259.

McNeish,D.M.,Stapleton,L.M.,2016.The effect of small sample size on two-level model estimates:A review and illustration.Educ.Psychol.Rev.28(2),295-314.https://doi.org/10.1007/s10648-014-9287-x.

Melesse,A.M.,Ahmad,S.,McClain,M.E.,Wang,X.,Lim,Y.H.,2011.Suspended sediment load prediction of river systems:An artificial neural network approach.Agric.Water Manag.98(5),855-866.https://doi.org/10.1016/j.agwat.2010.12.012.

Moghaddamnia,A.,Ghafari Gousheh,M.,Piri,J.,Amin,S.,Han,D.,2009.Evaporation estimation using artificial neural networks and adaptive neurofuzzy inference system techniques.Adv.Water Resour.32(1),88-97.https://doi.org/10.1016/j.advwatres.2008.10.005.

Mohammadi,A.A.,Yousefi,M.,Soltani,J.,Ahangar,A.G.,Javan,S.,2018.Using the combined model of gamma test and neuro-fuzzy system for modeling and estimating lead bonds in reservoir sediments.Environ.Sci.Pollut.Control Ser.25(30),30315-30324.https://doi.org/10.1007/s11356-018-3026-7.

Nadal-Romero,E.,Martínez-Murillo,J.F.,Vanmaercke,M.,Poesen,J.,2014.Scale-dependency of sediment yield from badland areas in Mediterranean environments.Prog.Phys.Geogr.38(3),381-386.https://doi.org/10.1177/0309133311400330.

Noori,R.,Hoshyaripour,G.,Ashrafi,K.,Araabi,B.N.,2010.Uncertainty analysis of developed ANN and ANFIS models in prediction of carbon monoxide daily concentration.Atmos.Environ.44(4),476-482.https://doi.org/10.1016/j.atmosenv.2009.11.005.

Noori,R.,Karbassi,A.R.,Moghaddamnia,A.,Han,D.,Zokaei-Ashtiani,M.H.,Farokhnia,A.,Gousheh,M.G.,2011.Assessment of input variables determination on the SVM model performance using PCA,Gamma test,and forward selection techniques for monthly stream flow prediction.J.Hydrol.401(3),177-189.https://doi.org/10.1016/j.jhydrol.2011.02.021.

Nourani,V.,Andalib,G.H.,2015.Daily and monthly suspended sediment load predictions using wavelet based artificial intelligence approaches.J.Mt.Sci.12(1),85-100.https://doi.org/10.1007/s11629-014-3121-2.

Ouarda,T.B.M.J.,Cunderlik,J.M.,St-Hilaire,A.,Barbet,M.,Bruneau,P.,Bob´ee,B.,2006.Data-based comparison of seasonality-based regional f l ood frequency methods.J.Hydrol.330(1-2),329-339.https://doi.org/10.1016/j.jhydrol.2006.03.023.

Pektas,A.O.,Cigizoglu,H.K.,2017.Investigating the extrapolation performance of neural network models in suspended sediment data.Hydrol.Sci.J.62(10),1694-1703.https://doi.org/10.1080/02626667.2017.1349316.

Peres,F.A.P.,Fogliatto,F.S.,2018.Variable selection methods in multivariate statistical process control:A systematic literature review.Comput.Ind.Eng.115,603-619.https://doi.org/10.1016/j.cie.2017.12.006.

Rainato,R.,Mao,L.,García-Rama,A.,Picco,L.,Cesca,M.,Vianello,A.,Preciso,E.,Scussel,G.R.,Lenzi,M.A.,2017.Three decades of monitoring in the Rio Cordon instrumented basin:Sediment budget and temporal trend of sediment yield.Geomorphology 291,45-56.https://doi.org/10.1016/j.geomorph.2016.03.012.

Rajaee,T.,Nourani,V.,Zounemat-Kermani,M.,Kis‚i,¨O.,2011.River suspended sediment load prediction:Application of ANN and Wavelet conjunction model.J.Hydrol.Eng.16(8),613-627.https://doi.org/10.1061/(ASCE)HE.1943-5584.0000347.

Ramachandra Rao,A.,Srinivas,V.V.,2006.Regionalization of watersheds by hybrid-cluster analysis.J.Hydrol.318(1-4),37-56.https://doi.org/10.1016/j.jhydrol.2005.06.003.

Rashidi,S.,Vafakhah,M.,Lafdani,E.K.,Javadi,M.R.,2016.Evaluating the support vector machine for suspended sediment load forecasting based on gamma test.Arabian J.Geosci.9(11),583.https://doi.org/10.1007/s12517-016-2601-9.

Razi,M.A.,Athappilly,K.,2005.A comparative predictive analysis of neural networks(NNs),nonlinear regression and classification and regression tree(CART)models.Expert Syst.Appl.29,65-74.https://doi.org/10.1016/j.eswa.2005.01.006.

Restrepo,J.D.,Kjerfve,B.,Hermelin,M.,Restrepo,J.C.,2006.Factors controlling sediment yield in a major South American drainage basin:The Magdalena River,Colombia.J.Hydrol.316(1-4),213-232.https://doi.org/10.1016/j.jhydrol.2005.05.002.

Robertson,D.,Saad,D.,Heisey,D.,2006.A regional classification scheme for estimating reference water quality in streams using land-use-adjusted spatial regression-tree analysis.Environ.Manag.37(2),209-229.https://doi.org/10.1007/s00267-005-0022-8.

Sadeghi,S.H.R.,Kheirfam,H.,2015.Temporal variation of bed load to suspended load ratio in Kojour River,Iran.Clean.:Soil,Air,Water 43(10),1366-1374.https://doi.org/10.1007/s00267-005-0022-8.

Sadeghi,S.H.R.,Zakeri,M.A.,2015.Partitioning and analyzing temporal variability of wash and bed material loads in a forest watershed in Iran.J.Earth Syst.Sci.124(7),1503-1515.https://doi.org/10.1007/s12040-015-0614-5.

Sadeghi,S.H.R.,Gharemahmudli,S.,Kheirfam,H.,Khaledi Darvishan,A.,Kiani-Harchegani,M.,Saeidi,P.,Gholami,L.,Vafakhah,M.,2018.Effects of type,level and time of sand and gravel mining on particle size distributions of suspended sediment.Int.Soil Water Conserv.Res.6(2),184-193.https://doi.org/10.1016/j.iswcr.2018.01.005.

Samantaray,S.,Ghose,D.K.,2018.Evaluation of suspended sediment concentration using descent neural networks.Procedia Comput.Sci.132,1824-1831.https://doi.org/10.1016/j.procs.2018.05.138.

Sharifi Garmdareh,E.,Vafakhah,M.,Eslamian,S.S.,2018.Regional flood frequency analysis using support vector regression in arid and semi-arid regions of Iran.Hydrol.Sci.J.63(3),426-440.https://doi.org/10.1080/02626667.2018.1432056.

Sharma,M.J.,Yu,S.J.,2015.Stepwise regression data envelopment analysis for variable reduction.Appl.Math.Comput.253,126-134.https://doi.org/10.1016/j.amc.2014.12.050.

Shrestha,S.,Kazama,F.,2007.Assessment of surface water quality using multivariate statistical techniques:A case study of the Fuji river basin,Japan.Environ.Model.Softw.22(4),464-475.https://doi.org/10.1016/j.envsoft.2006.02.001.

Svitoch,A.A.,Badyukova,E.N.,Sheikhi,B.,Yanina,T.A.,2013.Geologicalgeomorphological structure and recent history of the Iranian coast of the Caspian Sea.Dokl.Earth Sci.451(2),843-848.https://doi.org/10.1134/S1028334X13080060.

Swarnkar,S.,Malini,A.,Tripathi,S.,Sinha,R.,2018.Assessment of uncertainties in soil erosion and sediment yield estimates at ungauged basins:An application to the Garra River basin,India.Hydrol.Earth Syst.Sci.22(4),2471-2485.https://doi.org/10.5194/hess-22-2471-2018.

Talebizadeh,M.,Morid,S.,Ayyoubzadeh,S.A.,Ghasemzadeh,M.,2010.Uncertainty analysis in sediment load modeling using ANN and SWAT model.Water Resour.Manag.24(9),1747-1761.https://doi.org/10.1007/s11269-009-9522-2.

Tramblay,Y.,St-Hilaire,A.,Ouarda,T.B.M.J.,2008.Frequency analysis of maximum annual suspended sediment concentrations in North America.Hydrol.Sci.J.53(1),236-252.https://doi.org/10.1623/hysj.53.1.236.

Tramblay,Y.,Ouarda,T.B.M.J.,St-Hilaire,A.,Poulin,J.,2010.Regional estimation of extreme suspended sediment concentrations using watershed characteristics.J.Hydrol.380(3-4),305-317.https://doi.org/10.1016/j.jhydrol.2009.11.006.

Tsui,A.P.M.,Jones,A.J.,Guedes de Oliveira,A.,2002.The construction of smooth models using irregular embeddings determined by a Gamma test analysis.Neural Comput.Appl.10(4),318.https://doi.org/10.1007/s005210200004.

Vafakhah,M.,Janizadeh,S.,Khosrobeigi Bozchaloei,S.,2014.Application of several data-driven techniques for rainfall-runoff modeling.Ecopersia 2(1),455-469.

Vanmaercke,M.,Poesen,J.,Verstraeten,G.,de Vente,J.,Ocakoglu,F.,2011.Sediment yield in Europe:Spatial patterns and scale dependency.Geomorphology 130(3-4),142-161.https://doi.org/10.1016/j.geomorph.2011.03.010.

White,E.L.,1975.Factor analysis of drainage basin properties:Classification of flood behavior in terms of basin geomorphology.J.Am.Water Works Assoc. 11(4), 676-687. https://doi.org/10.1111/j.1752-1688.1975.tb00722.x.

Yang,H.B.,Li,E.C.,Zhao,Y.,Liang,Q.H.,2017.Effect of water-sediment regulation and its impact on coastline and suspended sediment concentration in Yellow River Estuary.Water Sci.Eng.10(4),287-294.https://doi.org/10.1016/j.wse.2017.12.009.

Yilmaz,B.,Aras,E.,Nacar,S.,Kankal,M.,2018.Estimating suspended sediment load with multivariate adaptive regression spline,teachinglearning based optimization,and artificial bee colony models.Sci.Total Environ.639,826-840.https://doi.org/10.1016/j.scitotenv.2018.05.153.

Zare Abyaneh,H.,2014.Evaluation of multivariate linear regression and artificial neural networks in prediction of water quality parameters.J.Environ.Health Sci.Eng.12(1),1.https://doi.org/10.1186/2052-336X-12-40.

Zhang,Y.X.,2007.Artificial neural networks based on principal component analysis input selection for clinical pattern recognition analysis.Talanta 73(1),68-75.https://doi.org/10.1016/j.talanta.2007.02.030.

Hossein Kheirfam,Sahar Mokarram-Kashtiban
《Water Science and Engineering》2018年第4期文献

服务严谨可靠 7×14小时在线支持 支持宝特邀商家 不满意退款

本站非杂志社官网,上千家国家级期刊、省级期刊、北大核心、南大核心、专业的职称论文发表网站。
职称论文发表、杂志论文发表、期刊征稿、期刊投稿,论文发表指导正规机构。是您首选最可靠,最快速的期刊论文发表网站。
免责声明:本网站部分资源、信息来源于网络,完全免费共享,仅供学习和研究使用,版权和著作权归原作者所有
如有不愿意被转载的情况,请通知我们删除已转载的信息 粤ICP备2023046998号