Missing Value Imputation in Time Series Using Top-k Case Matching

Kevin Wellenzohn; Hannes Mitterer; Johann Gamper; Michael Böhlen; Mourad Khayati

Conference proceeding

Open access

Peer reviewed

Proceedings of the 26th GI-Workshop Grundlagen von Datenbanken, Bozen-Bolzano, Italy, October 21st to 24th, 2014, Vol.1313, pp.77-82

CEUR Workshop Proceedings, 1313

26th GI-Workshop Grundlagen von Datenbanken (GvDB 2014) (Bozen-Bolzano, 21/10/2014 - 24/10/2014)

2014

Handle:

https://hdl.handle.net/10863/3311

Keywords: Imputation of missing values; Threshold algorithm; Time series

Abstract

In this paper, we present a simple yet effective algorithm, called the Top-k Case Matching algorithm, for the imputation of missing values in streams of time series data that are similar to each other. The key idea of the algorithm is to look for the k situations in the historical data that are most similar to the current situation and to derive the missing value from the measured values at these k time points. To efficiently identify the top-k most similar historical situations, we adopt Fagin's Threshold Algorithm, yielding an algorithm with sub-linear runtime complexity with high probability and linear complexity in the worst case (excluding the initial sorting of the data, which is done only once). We provide the results of a first experimental evaluation using real-world meteorological data. Our algorithm achieves high accuracy and is both more accurate and more efficient than two more complex state-of-the-art solutions.
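The key idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it uses a plain linear scan rather than Fagin's Threshold Algorithm, and it assumes that similarity is the sum of absolute differences over a set of reference series and that the imputed value is the mean of the base series at the k selected time points. The function name `top_k_case_matching` and the sample data are hypothetical.

```python
import heapq
from statistics import mean


def top_k_case_matching(base, references, t, k):
    """Impute base[t] from the k past time points most similar to t.

    base:       list of floats, with None marking a missing value
    references: list of reference series, each aligned with base
    Assumptions (not confirmed by the record): L1 dissimilarity over
    the reference series, and the mean as the aggregation function.
    """
    def dissimilarity(past):
        # Compare the reference values at a past time point with those at t.
        return sum(abs(ref[past] - ref[t]) for ref in references)

    # Only past time points with a measured base value are usable.
    candidates = [p for p in range(t) if base[p] is not None]
    nearest = heapq.nsmallest(k, candidates, key=dissimilarity)
    if not nearest:
        raise ValueError("no historical time point with a measured value")
    return mean(base[p] for p in nearest)


# Made-up example data: the last value of the base series is missing.
base = [10.0, 12.0, 11.0, 15.0, None]
references = [
    [9.0, 12.5, 10.5, 14.0, 12.0],
    [11.0, 13.0, 11.5, 16.0, 13.5],
]
imputed = top_k_case_matching(base, references, t=4, k=2)
print(imputed)  # 11.5
```

The linear scan inspects every historical time point. The abstract attributes the sub-linear runtime (with high probability) to the adoption of Fagin's Threshold Algorithm over data that is sorted once in advance, which this sketch does not reproduce.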

Files and links (2)

pdf

Missingvalueimputationintimeseriesusingtop-kcasematching (288.53 kB)

Open Access

url

http://ceur-ws.org/Vol-1313/

Details

Title: Missing Value Imputation in Time Series Using Top-k Case Matching
Creators: Kevin Wellenzohn; Hannes Mitterer; Johann Gamper; Michael Böhlen; Mourad Khayati
Publication Details: Proceedings of the 26th GI-Workshop Grundlagen von Datenbanken, Bozen-Bolzano, Italy, October 21st to 24th, 2014, Vol.1313, pp.77-82
Editor(s): Klan F, Specht G, Gamper J
ISSN: 1613-0073
EISSN: 1613-0073
Conference: 26th GI-Workshop Grundlagen von Datenbanken (GvDB 2014) (Bozen-Bolzano, 21/10/2014 - 24/10/2014)
Series / Volume: CEUR Workshop Proceedings, 1313
Publisher: CEUR-WS
Format: Online
Number of pages: 6
Identifiers: (UNIBZ)18729108; 991005772667801241
Scopus ID: 2-s2.0-84919968775
Academic Unit: Faculty of Computer Science
Language: English
Resource Type: Conference proceeding
Author Names String: Wellenzohn K, Mitterer H, Gamper J, Böhlen MH, Khayati M
Additional Description: srcEditors: Klan F, Specht G, Gamper J
Projected: 3982
