TY - JOUR
T1 - Model-Based Big Data Analytics-as-a-Service
T2 - Take Big Data to the Next Level
AU - Ardagna, Claudio Agostino
AU - Bellandi, Valerio
AU - Bezzi, Michele
AU - Ceravolo, Paolo
AU - Damiani, Ernesto
AU - Hebert, Cedric
N1 - Funding Information:
This work was partly supported by the European Union’s Horizon 2020 research and innovation programme under the TOREADOR project, grant agreement No 688797, and by the programme “Piano sostegno alla ricerca 2015-17” funded by Università degli Studi di Milano. We would like to also thank AS De Oliveira for his help in the final preparation of the paper.
Funding Information:
This work was partly supported by the European Union's Horizon 2020 research and innovation programme under the TOREADOR project, grant agreement No 688797, and by the programme "Piano sostegno alla ricerca 2015-17" funded by Universita degli Studi di Milano. We would like to also thank AS De Oliveira for his help in the final preparation of the paper.
Publisher Copyright:
© 2008-2012 IEEE.
PY - 2021/3/1
Y1 - 2021/3/1
N2 - The Big Data revolution promises to build a data-driven ecosystem where better decisions are supported by enhanced analytics and data management. However, major hurdles still need to be overcome on the road that leads to commoditization and wide adoption of Big Data Analytics (BDA). Big Data complexity is the first factor hampering the full potential of BDA. The opacity and variety of Big Data technologies and computations, in fact, make BDA a failure prone and resource-intensive process, which requires a trial-and-error approach. This problem is even exacerbated by the fact that current solutions to Big Data application development take a bottom-up approach, where the last technology release drives application development. Selection of the best Big Data platform, as well as of the best pipeline to execute analytics, represents then a deal breaker. In this paper, we propose a return to roots by defining a Model-Driven Engineering (MDE) methodology that supports automation of BDA based on model specification. Our approach lets customers declare requirements to be achieved by an abstract Big Data platform and smart engines deploy the Big Data pipeline carrying out the analytics on a specific instance of such platform. Driven by customers' requirements, our methodology is based on an OWL-S ontology of Big Data services and on a compiler transforming OWL-S service compositions in workflows that can be directly executed on the selected platform. The proposal is experimentally evaluated in a real-world scenario focusing on the threat detection system of SAP.
AB - The Big Data revolution promises to build a data-driven ecosystem where better decisions are supported by enhanced analytics and data management. However, major hurdles still need to be overcome on the road that leads to commoditization and wide adoption of Big Data Analytics (BDA). Big Data complexity is the first factor hampering the full potential of BDA. The opacity and variety of Big Data technologies and computations, in fact, make BDA a failure prone and resource-intensive process, which requires a trial-and-error approach. This problem is even exacerbated by the fact that current solutions to Big Data application development take a bottom-up approach, where the last technology release drives application development. Selection of the best Big Data platform, as well as of the best pipeline to execute analytics, represents then a deal breaker. In this paper, we propose a return to roots by defining a Model-Driven Engineering (MDE) methodology that supports automation of BDA based on model specification. Our approach lets customers declare requirements to be achieved by an abstract Big Data platform and smart engines deploy the Big Data pipeline carrying out the analytics on a specific instance of such platform. Driven by customers' requirements, our methodology is based on an OWL-S ontology of Big Data services and on a compiler transforming OWL-S service compositions in workflows that can be directly executed on the selected platform. The proposal is experimentally evaluated in a real-world scenario focusing on the threat detection system of SAP.
KW - Big data
KW - model-driven architecture
KW - OWL-S
UR - http://www.scopus.com/inward/record.url?scp=85044063547&partnerID=8YFLogxK
U2 - 10.1109/TSC.2018.2816941
DO - 10.1109/TSC.2018.2816941
M3 - Article
AN - SCOPUS:85044063547
SN - 1939-1374
VL - 14
SP - 516
EP - 529
JO - IEEE Transactions on Services Computing
JF - IEEE Transactions on Services Computing
IS - 2
M1 - 8319508
ER -