Enhancing Big Data Value Using Knowledge Discovery Techniques

Full Text (PDF, 774KB), PP.1-12

Views: 0 Downloads: 0

Author(s)

Mai Abdrabo 1,* Mohammed Elmogy 2 Ghada Eltaweel 3 Sherif Barakat 4

1. Information Systems Department, Faculty of Computers and Information, Suez Canal University, Ismailia, 41611, Egypt

2. Information Technology Dept., Faculty of Computers and Information, Mansoura University, Mansoura, 35511, Egypt

3. Computer Science Department, Faculty of Computers and Information, Suez Canal University, Ismailia, 41611, Egypt

4. Information Systems Dept., Faculty of Computers and Information, Mansoura University, Mansoura, 35511, Egypt

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2016.08.01

Received: 20 Aug. 2015 / Revised: 11 Feb. 2016 / Accepted: 15 Apr. 2016 / Published: 8 Aug. 2016

Index Terms

Knowledge Discovery (KDD), Big Data, Hadoop, MapReduce, NoSQL

Abstract

The world has been drowned by floods of data due to technological development. Consequently, the Big Data term has gotten the expression to portray the gigantic sum. Different sorts of quick data are doubling every second. We have to profit from this enormous surge of data to convert it to knowledge. Knowledge Discovery (KDD) can enhance detecting the value of Big Data based on some techniques and technologies like Hadoop, MapReduce, and NoSQL. The use of Big Data value is critical in different fields.
This survey discusses the expansion of data that led the world to Big Data expression. Big Data has distinctive characteristics as volume, variety, velocity, value, veracity, variability, viscosity, virality, ambiguity, and complexity. We will describe the connection between Big Data and KDD techniques to reach data value. Big Data applications that are applied by big organizations will be discussed. Characteristics of big data will be introduced, which represent a significant challenge for Big Data management. Finally, some of the important future directions in Big Data field will be presented.

Cite This Paper

Mai Abdrabo, Mohammed Elmogy, Ghada Eltaweel, Sherif Barakat, "Enhancing Big Data Value Using Knowledge Discovery Techniques", International Journal of Information Technology and Computer Science(IJITCS), Vol.8, No.8, pp.1-12, 2016. DOI:10.5815/ijitcs.2016.08.01

Reference

[1]Gupta, Richa. "Journey From Data Mining To Web Mining To Big Data", International Journal of Computer Trends and Technology 10.1, 18-20. Web, (2014).

[2]SAGIROGLU, and SINANC, "Big Data A Review”, Collaboration Technologies And Systems (CTS), 2013 IEEE International Conference on San Diego, CA: IEEE, (2013).

[3]Shuliang, Gangyi, and Ming,"Big Spatial Data Mining", 2013 IEEE International Conference on Silicon Valley, CA: IEEE, (2013). 

[4]Holzinger, Dehmer, and Jurisica. "Knowledge Discovery and Interactive Data Mining in Bioinformatics, Future Challenges and Research Directions", BMC Bioinformatics, (Last accessed on 2016). 

[5]Katal, Wazid, and Goudar, "Big Data: Issues, Challenges, Tools and Good Practices, Contemporary Computing (IC3), Sixth 2013 IEEE International Conference on Noida: IEEE, (2013). 

[6]Bandyopadhyay, Sanghamitra, "Advanced Methods for Knowledge Discovery from Complex Data", New York, Springer, (2005).

[7]Krishnan, "Data Warehousing in the Age of Big Data". Print., ISBN 978-0-12-405891-0.

[8]Begoli, and Horey," Design Principles for Effective Knowledge Discovery from Big Data", Software Architecture (WICSA) and European Conference on Software Architecture (ECSA), Joint Working IEEE/IFIP Conference on Helsinki: IEEE, (2012).

[9]Data, Information, Knowledge & Wisdom", http://www.systems-thinking.org/dikw/dikw.htm,Systems-thinking.org,2005, (Last accessed on 2016). 

[10]https://datajobs.com/what-is-big-data, Frank, Lo. "What Is Big Data: The Complete Picture, Beyond The 4 V's." Datajobs.com", 2015. Web.(Last accessed on 2015).

[11]https://www.ida.gov.sg/~/media/Files/Infocomm%20Landscape/Technology/TechnologyRoad  map/BigData.pdf, (Last accessed on 2015).

[12]Lomotey, and Deters, "Towards Knowledge Discovery in Big Data", Service-Oriented System Engineering (SOSE), 8Th, 2014 IEEE International Symposium on Oxford: IEEE, (2014).

[13]https://datajobs.com/what-is-hadoop-and-nosql, Frank, Lo, "What Is Hadoop and NoSQL?", "Datajobs.com", 2015. Web (Last accessed on 2015).

[14]https://thinkbiganalytics.com/leading_big_data_technologies/nosql/, Thinkbiganalytics.com, "NoSQL | Think Big Analytics", 2014, Web. (Last accessed on 2015).

[15]https://thinkbiganalytics.com/leading_big_data_technologies/hadoop/, Thinkbiganalytics.com, "Hadoop Ecosystem: Think Big Analytics, 2014, (Last accessed on 2015).

[16]https://thinkbiganalytics.com/leading_big_data_technologies/machine-learning-in-hadoop-with-mahout/, Thinkbiganalytics.com, "Machine Learning in Hadoop: Think Big Analytics", 2014, (Last accessed on 2015).

[17]http://hpccsystems.com/Why-HPCC/How-it-works#ecl, Hpccsystems.com, "How It Works | HPCC Systems", 2015, (Last accessed on 2015). 

[18]Gosain, and Chugh. "New Design Principles For Effective Knowledge Discovery From Big Data", International Journal of Computer Applications 96.17 (2014).

[19]Bansal, Srividya, and Kagemann, "Integrating Big Data: A Semantic Extract-Transform-Load Framework", Computer 48.3 (2015).

[20]Assuncao, et al," “Distributed Stochastic Aware Random Forests - Efficient Data Mining For Big Data". Big Data Congress, 2013 IEEE International Congress on Santa Clara, CA: IEEE, (2013).

[21]http://statisticsviews.com/details/feature/4911381/statistical-truisms-in-the-age-of-big-data.html, Borne, Kirk, "Bias in Randomised Factorial Trials", (Last accessed on 2015).

[22]Prabha, Sujatha, “Reduction of big data sets using fuzzy clustering”, International Journal of Advanced Research in Computer Engineering & Technology, ( 2014).

[23]Fania, Miller, “Mining big data in the enterprise for better business intelligence”, Intel white paper, www.intel.com/it, (2014).

[24]Liao, Long, “MRPrePost-A parallel algorithm adapted for mining big data”, Electronics, Computer, and Applications, 2014 IEEE Workshop on  Ottawa, (2014).

[25]Chen, Mao, Y. Liu, “Big Data: A Survey,” Springer Science+Business Media New York, (2014).

[26]Aghdam, Kamalpour, Chen and Sim, "Identifying Places of Interest for Tourists using Knowledge Discovery Techniques", Industrial Automation, Information and Communications Technology (IAICT), 2014 International Conference on  Bali, (2014).

[27]Mayer, Cukier,"Big Data: A Revolution That Will Transform How We   Live, Work and Think", ISBN: 1848547927, UK, (2013).

[28]Chandarana, Vijayalakshmi, "Big Data Analytics Frameworks ", Circuits, Systems, Communication and Information Technology Applications (CSCITA), 2014 International Conference on  Mumbai, (2014).

[29]Azad, Jha," Data Mining in Intrusion Detection: A Comparative Study of Methods, Types and Data Sets", I.J. Information Technology and Computer Science, (2013).

[30]Yu, Jiang, Zhu, "RTIC: a big data system for massive traffic information mining ", Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on  Fuzhou, (2013).

[31]http://futurememes.blogspot.com/2014/03/big-data- becomes-personal-knowledge.html, Blogga, _La. "Broader Perspective: Big Data Becomes Personal: Knowledge_into_Meaning", Futurememes.blogspot.com, (2014), (Last accessed on 2015).