Evaluating New Approaches of Big Data Analytics Frameworks

Norman Spangenberg, Martin Roth, Bogdan Franczyk


Big data will be one of the leading growth markets in information technology in the coming years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discusses new processing frameworks for big and smart data in distributed environments and presents a benchmark of two frameworks, Apache Flink and Apache Spark, based on a mixed workload of algorithms from different analytic areas applied to different real-world datasets.
Authors: Norman Spangenberg; Martin Roth; Bogdan Franczyk (MISaF / IBI / DISD, Department of Information Systems Design) - [other]
Publication size in sheets: 0.5
Book: Abramowicz Witold (ed.): Business Information Systems. 18th International Conference, BIS 2015, Poznań, Poland, June 24-26, 2015, Proceedings, Lecture Notes in Business Information Processing, vol. 208, 2015, Springer, ISBN 978-3-319-19026-6, [978-3-319-19027-3], 352 p., DOI:10.1007/978-3-319-19027-3
Keywords in English: Apache Flink, Apache Spark, Big data processing frameworks, Big data analytics, MapReduce
Language: en (English)
Score (nominal): 15
Score source: conferenceIndex