A robust missing value imputation method for noisy data

Bing Zhu, Changzheng He, Panos Liatsis

Research output: Contribution to journalArticlepeer-review

47 Scopus citations

Abstract

Missing data imputation is an important research topic in data mining. The impact of noise is seldom considered in previous works while real-world data often contain much noise. In this paper, we systematically investigate the impact of noise on imputation methods and propose a new imputation approach by introducing the mechanism of Group Method of Data Handling (GMDH) to deal with incomplete data with noise. The performance of four commonly used imputation methods is compared with ours, called RIBG (robust imputation based on GMDH), on nine benchmark datasets. The experimental result demonstrates that noise has a great impact on the effectiveness of imputation techniques and our method RIBG is more robust to noise than the other four imputation methods used as benchmark.

Original languageBritish English
Pages (from-to)61-74
Number of pages14
JournalApplied Intelligence
Volume36
Issue number1
DOIs
StatePublished - Jan 2012

Keywords

  • Group method of data handling (GMDH)
  • Missing data imputation
  • Noise

Fingerprint

Dive into the research topics of 'A robust missing value imputation method for noisy data'. Together they form a unique fingerprint.

Cite this