Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming

Document Type : Int. Conf. on Communication Management and Information Technology- ICCMIT'20

Authors

1 PhD Candidate, Department of Management Information System, Cyprus International University, Cyprus/Nicosia.

2 Assistant Professor, Department of Management Information System, Cyprus International University, Cyprus/Nicosia.

Abstract

The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hosted by various Companies on their servers or so-called cloud computing that have become an excellent opportunity to provide services efficiently and at low cost, but managing big data presents a definite challenge in the cloud space beginning with the processes of extracting, processing data, storing data and analyze it. Through this study, we dealt with the concept of cloud computing and its capabilities in business organizations. We also interpreted the notion of big data and its distinct characteristics and sources. Finally, the relationship between cloud computing with big data was also explained (extraction, storage, analysis).

Keywords


Alvaro, P., Condie, T., Conway, N., Elmeleegy, K., Hellerstein, J. M., & Sears, R. (2010). Boom analytics: exploring data-centric, declarative programming for the cloud. In Proceedings of the 5th European conference on Computer systems (pp. 223–236).
Alyass, A., Turcotte, M., & Meyre, D. (2015). From big data analysis to personalized medicine for all: challenges and opportunities. BMC Medical Genomics, 8(1), 33.
Bagheri, H., & Shaltooki, A. A. (2015). Big Data: challenges, opportunities and Cloud based solutions. International Journal of Electrical and Computer Engineering, 5(2), 340.
Bell, M. W. (2008). Toward a definition of “virtual worlds.” Journal For Virtual Worlds Research, 1(1).
BOOTH, C. (2019a). Number of people using social media platforms. Retrieved from https://www.statista.com/topics/1164/social-networks/
BOOTH, C. (2019b). The most popular social media networks each year, gloriously animated. Retrieved from https://thenextweb.com/tech/2019/06/11/most-popular-social-media-networks-year-animated
Dasgupta, A. (2013). Big data: The future is in analytics. Geospatial World, 3(9), 28–36.
Dobre, C., & Xhafa, F. (2014). Parallel programming paradigms and frameworks in big data era. International Journal of Parallel Programming, 42(5), 710–738.
Erl, T., & Khattak, W. (n.d.). i Buhler, P.(2016). Big Data Fundamentals: Concepts, Drivers & Techniques. Prentice Hall.
Experiment, T. Dz. (2011). The DØ Experiment . Retrieved from https://www-d0.fnal.gov
Gewirtz, D. (2018). Volume, velocity, and variety: Understanding the three V’s of big data | ZDNet. Retrieved from https://www.zdnet.com/article/volume-velocity-and-variety-understanding-the-three-vs-of-big-data/
Gomes, P. (2016). Log analysis | Loggly. Retrieved from https://www.loggly.com/product/log-analysis/
Gu, R., Yang, X., Yan, J., Sun, Y., Wang, B., Yuan, C., & Huang, Y. (2014). SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters. Journal of Parallel and Distributed Computing, 74(3), 2166–2179.
Hadoopa, A. (2019). Apache Hadoop. Retrieved from http://hadoop.apache.org//
Hammer, C., Kostroch, M. D. C., & Quiros, M. G. (2017). Big Data: Potential, Challenges and Statistical Implications. International Monetary Fund.
IMMERMAN, G. (2017). What is big data velocity? Retrieved from https://www.machinemetrics.com/blog/what-is-big-data-velocity
Jain, A. (2016). The 5 V’s of big data - Watson Health Perspectives. Retrieved from https://www.ibm.com/blogs/watson-health/the-5-vs-of-big-data/
Ji, C., Li, Y., Qiu, W., Awada, U., & Li, K. (2012). Big data processing in cloud computing environments. Proceedings of the 2012 International Symposium on Pervasive Systems, Algorithms, and Networks, I-SPAN 2012, 17–23. https://doi.org/10.1109/I-SPAN.2012.9
Kalil, T. (2012). Big Data is a Big Deal | whitehouse.gov. Retrieved January 30, 2020, from https://obamawhitehouse.archives.gov/blog/2012/03/29/big-data-big-deal
Khan, S., Shakil, K. A., & Alam, M. (2018). Cloud-based big data analytics—a survey of current research and future directions. In Big Data Analytics (pp. 595–604). Springer.
Kobielus, J. G. (2012). The Forrester WaveTM: Enterprise Hadoop Solutions, Q1 2012. Forrester Research.
Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., & Byers, A. H. (2011). Big data: The next frontier for innovation, competition.
Mao, R., Xu, H., Wu, W., Li, J., Li, Y., & Lu, M. (2015). Overcoming the challenge of variety: big data abstraction, the next evolution of data management for AAL communication systems. IEEE Communications Magazine, 53(1), 42–47.
Marr, B. (2015). Big Data: Using SMART big data, analytics and metrics to make better decisions and improve performance. John Wiley & Sons.
Matthews, K. (2018). Here’s How Much Big Data Companies Make On The Internet - Big Data Showcase. Retrieved from https://bigdatashowcase.com/how-much-big-data-companies-make-on-internet/
Mayer-Schönberger, V., & Cukier, K. (2013). Big data: A revolution that will transform how we live, work, and think. Houghton Mifflin Harcourt.
Mell, P., & Grance, T. (2011). The NIST definition of cloud computing. Special Publication 800-145. Gaithersburg: National Institute of Standards and Technology.
Naimi, A. I., & Westreich, D. J. (2014). Big data: A revolution that will transform how we live, work, and think. Oxford University Press.
Noraziah, A., Fakherldin, M. A. I., Adam, K., & Majid, M. A. (2017). Big Data Processing in Cloud Computing Environments. Advanced Science Letters, 23(11), 11092–11095.
Pettey, C., & van der Meulen, R. (2012). Gartner’s 2012 Hype cycle for emerging technologies identifies" Tipping point" technologies that will unlock long-awaited technology scenarios. Hype Cycle Special Report. P1-4.
Pickell, D. (2018). Structured vs Unstructured Data – What’s the Difference? Retrieved from https://learn.g2.com/structured-vs-unstructured-data
Purcell, B. M. (2014). Big data using cloud computing. Journal of Technology Research, 5, 1.
Ramapriyan, H. K. (2015). The Role and Evolution of NASA’s Earth Science Data Systems.
Slagter, K., Hsu, C.-H., & Chung, Y.-C. (2015). An adaptive and memory efficient sampling mechanism for partitioning in MapReduce. International Journal of Parallel Programming, 43(3), 489–507.
Sykuta, M. E. (2016). Big data in agriculture: property rights, privacy and competition in ag data services. International Food and Agribusiness Management Review, 19(1030-2016–83141), 57–74.
Voruganti, S. (2014). Map Reduce a Programming Model for Cloud Computing Based On Hadoop Ecosystem. International Journal of Computer Science and Information Technologies, 5(3).
Yang, C., Huang, Q., Li, Z., Liu, K., & Hu, F. (2017). Big Data and cloud computing: innovation opportunities and challenges. International Journal of Digital Earth, 10(1), 13–53. https://doi.org/10.1080/17538947.2016.1239771
Yangjun, C. (2014). Semistructured-Data Model Sept. 2014Yangjun Chen ACS Semistructured-Data Model Semistructured data XML Document type definitions XML schema. - ppt download. Retrieved from https://slideplayer.com/slide/4950204/
Zhou, X., Lu, J., Li, C., & Du, X. (2012). Big data challenge in the management perspective. Communications of the CCF, 8, 16–20.
Zikopoulos, P., Eaton, C., & others. (2011). Understanding big data: Analytics for enterprise class hadoop and streaming data. McGraw-Hill Osborne Media.