TY - JOUR
T1 - The rise of "big data" on cloud computing
T2 - Review and open research issues
AU - Hashem, Ibrahim Abaker Targio
AU - Yaqoob, Ibrar
AU - Anuar, Nor Badrul
AU - Mokhtar, Salimah
AU - Gani, Abdullah
AU - Ullah Khan, Samee
N1 - Funding Information:
This paper is financially supported by the Malaysian Ministry of Education under the University of Malaya High Impact Research Grant UM.C/625/1/HIR/MoE/FCSIT/03 .
PY - 2015/1
Y1 - 2015/1
N2 - Cloud computing is a powerful technology to perform massive-scale and complex computing. It eliminates the need to maintain expensive computing hardware, dedicated space, and software. Massive growth in the scale of data or big data generated through cloud computing has been observed. Addressing big data is a challenging and time-demanding task that requires a large computational infrastructure to ensure successful data processing and analysis. The rise of big data in cloud computing is reviewed in this study. The definition, characteristics, and classification of big data along with some discussions on cloud computing are introduced. The relationship between big data and cloud computing, big data storage systems, and Hadoop technology are also discussed. Furthermore, research challenges are investigated, with focus on scalability, availability, data integrity, data transformation, data quality, data heterogeneity, privacy, legal and regulatory issues, and governance. Lastly, open research issues that require substantial research efforts are summarized.
AB - Cloud computing is a powerful technology to perform massive-scale and complex computing. It eliminates the need to maintain expensive computing hardware, dedicated space, and software. Massive growth in the scale of data or big data generated through cloud computing has been observed. Addressing big data is a challenging and time-demanding task that requires a large computational infrastructure to ensure successful data processing and analysis. The rise of big data in cloud computing is reviewed in this study. The definition, characteristics, and classification of big data along with some discussions on cloud computing are introduced. The relationship between big data and cloud computing, big data storage systems, and Hadoop technology are also discussed. Furthermore, research challenges are investigated, with focus on scalability, availability, data integrity, data transformation, data quality, data heterogeneity, privacy, legal and regulatory issues, and governance. Lastly, open research issues that require substantial research efforts are summarized.
KW - Big data
KW - Cloud computing
KW - Hadoop
UR - http://www.scopus.com/inward/record.url?scp=84907325157&partnerID=8YFLogxK
U2 - 10.1016/j.is.2014.07.006
DO - 10.1016/j.is.2014.07.006
M3 - Review article
AN - SCOPUS:84907325157
SN - 0306-4379
VL - 47
SP - 98
EP - 115
JO - Information Systems
JF - Information Systems
ER -