Study on a BMA Ensemble Forecasting Model Based on k-Nearest Neighbor Selection

Liu Kailei; Li Zhijia; Yao Cheng; Han Tong; Zhong Li; Sun Rufei

Article abstract

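Liu Kailei, Li Zhijia, Yao Cheng, Han Tong, Zhong Li, Sun Rufei. Study on a BMA ensemble forecasting model based on k-nearest neighbor selection [J]. Journal of Hydraulic Engineering (水利学报), 2017, 48(4): 390-397, 407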

Study on the Bayesian model averaging coupling with the k-nearest neighbor selection

Submitted: 2015-09-15

DOI: 10.13243/j.cnki.slxb.20150978

Keywords: ensemble forecast; sample selection method; k-nearest neighbor; Bayesian model averaging; Gaussian mixture model

Funding: National Key Research and Development Program of China (2016YFC0400909); National Natural Science Foundation of China (41130639, 51179045, 41101017, 41201028)

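Author	Affiliation
Liu Kailei	Hydrology Bureau (Information Center), Huaihe River Commission, Bengbu, Anhui 233000
Li Zhijia	College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu 210098
Yao Cheng	College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu 210098
Han Tong	College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu 210098
Zhong Li	College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu 210098
Sun Rufei	College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu 210098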

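Chinese abstract (translated):

Redundant training samples reduce the efficiency and accuracy of BMA parameter estimation. To address this, this paper proposes an improved model in which the k-nearest neighbor algorithm selects valuable training samples before the BMA computation, and only the selected samples are used to estimate the BMA parameters. Simulation experiments were carried out at Wangjiaba station on the Huai River. Training samples were supplied to BMA under two schemes, with k-nearest neighbor selection and without selection, and the simulated discharge at Wangjiaba station under the two schemes was analyzed statistically to evaluate the performance of the improved BMA method. The results show that, with k-nearest neighbor sample selection, the forecast accuracy of the BMA model improves markedly for both the flood process and the flood peak; the dispersion of the probabilistic forecasts decreases while their reliability increases. Introducing k-nearest neighbor sample selection effectively removes redundant data from the BMA training samples, yields more reliable model parameters from a small number of samples, and improves ensemble forecast performance.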
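The sample selection step can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the feature vector, the distance metric, or the value of k, so the Euclidean distance, the `knn_select` helper, and the toy discharge values below are all assumptions.

```python
import numpy as np

def knn_select(history, current, k):
    """Return indices of the k historical samples closest to `current`.

    history : (n_samples, n_features) array, one row per historical time step
    current : (n_features,) array describing the most recent flow state
    The Euclidean metric is an assumption; the source does not name one.
    """
    history = np.asarray(history, dtype=float)
    current = np.asarray(current, dtype=float)
    dist = np.linalg.norm(history - current, axis=1)
    # Stable sort so that ties keep their original (chronological) order.
    return np.argsort(dist, kind="stable")[:k]

# Hypothetical data: each row holds the forecasts of three ensemble
# members (m^3/s) at one historical time step.
history = np.array([
    [100.0, 110.0,  95.0],
    [500.0, 520.0, 480.0],
    [120.0, 115.0, 105.0],
    [900.0, 950.0, 870.0],
    [480.0, 510.0, 495.0],
])
current = np.array([490.0, 515.0, 485.0])

selected = knn_select(history, current, k=2)
training_subset = history[selected]  # rows passed on to BMA parameter estimation
```

Only the rows in `training_subset` would then be used to fit the BMA weights and variances, in place of the full historical record.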

English abstract:

Bayesian model averaging (BMA) is a multi-model ensemble forecasting algorithm that uses Bayes' formula to estimate the posterior probability distribution of the forecast variables. The performance of BMA depends largely on the quality of its training datasets. However, these datasets often contain many redundant samples that are inconsistent with the current flow state and that degrade the accuracy and reliability of BMA forecasts. In this study, the k-nearest neighbor (KNN) method is used to measure the similarity between historical samples and the most recent flood process, reducing the influence of redundant samples on BMA parameter estimation. Two cases of BMA, one with KNN sample selection (KBMA) and the original one, are investigated and compared at the Wangjiaba catchment in the upper region of the Huai River basin. The ensemble means of the two cases were examined against the observations and against the forecasts of their ensemble members to test the efficiency of their deterministic forecasts. In addition, the probabilistic forecasts of the two cases were compared using two assessment criteria, the Coverage Rate and the Ranked Probability Score. The results indicate that KBMA produces better deterministic and probabilistic forecasts than the original BMA. With KNN sample selection, KBMA adjusts its parameters according to the real-time state of the flood process and the ensemble members, rather than fitting them to all samples. Our analysis demonstrates that the KNN sample selection method has the potential to substantially improve BMA ensemble forecasts.
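As a rough illustration of the Coverage Rate criterion mentioned above, the sketch below uses a common definition: the fraction of observations that fall inside the forecast interval. The abstract does not give the paper's exact formula or confidence level, so the `coverage_rate` helper and the interval bounds here are assumptions.

```python
import numpy as np

def coverage_rate(obs, lower, upper):
    """Fraction of observations lying within [lower, upper] (bounds inclusive)."""
    obs, lower, upper = (np.asarray(a, dtype=float) for a in (obs, lower, upper))
    inside = (obs >= lower) & (obs <= upper)
    return float(inside.mean())

# Hypothetical observed discharges and forecast interval bounds (m^3/s).
obs   = [100, 200, 300, 400]
lower = [ 90, 210, 250, 380]
upper = [120, 260, 320, 390]

cr = coverage_rate(obs, lower, upper)  # 2 of the 4 observations are covered
```

A coverage rate close to the nominal level of the interval, together with a narrow interval, would correspond to the reliable and less dispersed probabilistic forecasts the abstract reports.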
