MIDAS: Open-source framework for distributed online analysis of data streams

Andreas Henelius*, Jari Torniainen

*Corresponding author for this work

Research output: Contribution to journal, Article, Scientific, peer-review


Abstract

Data streams are pervasive, but implementing online analysis of streaming data is often nontrivial, as data streams can have different, domain-specific formats. Regardless of the stream, the analysis task is essentially the same: features are extracted from the stream, e.g., for use with machine learning and data mining methods. We present the Modular Integrated Distributed Analysis System (MIDAS) for constructing distributed online stream processing systems for heterogeneous data. The MIDAS framework makes it possible to process raw data streams, extract features, perform machine learning, and make the results available through an HTTP API for easy integration with various applications. MIDAS is agnostic with regard to the type of data stream and is suitable for multiple domains.
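The pipeline outlined in the abstract (raw stream, feature extraction, results exposed to other applications) can be illustrated with a minimal sketch. This is not the MIDAS API: the function names `windowed_features` and `to_json_response`, the window size, and the choice of features are all assumptions made for illustration, using only the Python standard library. The JSON string stands in for the kind of payload an HTTP endpoint might return.

```python
import json
import statistics
from collections import deque
from typing import Deque, Dict, Iterable, Iterator


def windowed_features(samples: Iterable[float], window_size: int) -> Iterator[Dict[str, float]]:
    """Yield summary features for every full sliding window over a stream.

    Hypothetical helper for illustration; not part of MIDAS.
    """
    window: Deque[float] = deque(maxlen=window_size)
    for sample in samples:
        window.append(sample)  # the oldest sample is dropped automatically
        if len(window) == window_size:
            yield {
                "mean": statistics.fmean(window),
                "stdev": statistics.pstdev(window),  # population standard deviation
                "latest": window[-1],
            }


def to_json_response(features: Dict[str, float]) -> str:
    """Serialize one feature set as JSON, as an HTTP handler might return it."""
    return json.dumps(features, sort_keys=True)


# Simulated raw stream; a real deployment would read from a sensor or socket.
stream = [1.0, 2.0, 3.0, 4.0, 5.0]
for features in windowed_features(stream, window_size=3):
    print(to_json_response(features))
```

Because the generator only keeps the current window in memory, the same loop works unchanged on an unbounded stream, which is the property that makes the analysis "online".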

Original language: English
Pages (from-to): 156-161
Number of pages: 6
Journal: SoftwareX
Volume: 7
DOIs
Publication status: Published - 1 Jan 2018
MoE publication type: A1 Journal article-refereed

Keywords

  • Data streams
  • Distributed systems
  • Machine learning
  • Online analysis
