Evaluating New Approaches of Big Data Analytics Frameworks
Norman Spangenberg, Martin Roth, Bogdan Franczyk
Abstract: Big data will be one of the leading growth markets in information technology in the coming years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discusses new processing frameworks for big and smart data in distributed environments and presents a benchmark of two frameworks, Apache Flink and Apache Spark, based on a mixed workload of algorithms from different analytic areas applied to different real-world datasets.
|Publication size in sheets||0.5|
|Book||Abramowicz Witold (eds.): Business Information Systems. 18th International Conference, BIS 2015, Poznań, Poland, June 24-26, 2015, Proceedings, Lecture Notes in Business Information Processing, vol. 208, 2015, Springer, ISBN 978-3-319-19026-6, [978-3-319-19027-3], 352 p., DOI:10.1007/978-3-319-19027-3|
|Keywords in English||Apache Flink, Apache Spark, Big data processing frameworks, Big data analytics, MapReduce|