The rise of "big data" on cloud computing: Review and open research issues

Ibrahim Abaker Targio Hashem, Ibrar Yaqoob, Nor Badrul Anuar, Salimah Mokhtar, Abdullah Gani, Samee Ullah Khan

Research output: Contribution to journal › Review article › peer-review

2119 Scopus citations

Abstract

Cloud computing is a powerful technology for performing massive-scale and complex computing. It eliminates the need to maintain expensive computing hardware, dedicated space, and software. Massive growth in the scale of data, or big data, generated through cloud computing has been observed. Addressing big data is a challenging and time-demanding task that requires a large computational infrastructure to ensure successful data processing and analysis. The rise of big data in cloud computing is reviewed in this study. The definition, characteristics, and classification of big data, along with some discussions on cloud computing, are introduced. The relationship between big data and cloud computing, big data storage systems, and Hadoop technology are also discussed. Furthermore, research challenges are investigated, with a focus on scalability, availability, data integrity, data transformation, data quality, data heterogeneity, privacy, legal and regulatory issues, and governance. Lastly, open research issues that require substantial research efforts are summarized.

Original language: British English
Pages (from-to): 98-115
Number of pages: 18
Journal: Information Systems
Volume: 47
State: Published - Jan 2015

Keywords

  • Big data
  • Cloud computing
  • Hadoop
