Nosheen Nazir

Work place: National University of Computer and Emerging Sciences, Faisalabad, 38000, Pakistan

E-mail: f159020@nu.edu.pk

Website:

Research Interests: Data Mining, Data Compression, Data Structures and Algorithms

Biography

Nosheen Nazir was born in Chiniot, Punjab, Pakistan in 1992. She has completed her Graduation Degree in Information Technology from University of Sargodha, Punjab, Pakistan in 2015. She is currently a student of MS degree in Computer Science from the National University of Computer and Emerging Sciences (FAST-NU), Faisalabad, Pakistan.

In 2011, she received Prime Minister’s based scholarship at Punjab College Chiniot, Punjab, Pakistan. In 2013, she secured 3rd Position all over the district in her under-graduation program. In 2014, she was selected for Prime Minister’s Laptop Scheme for Youth of Pakistan and received a Laptop. In 2015 she completed her Graduation and enrolled in MS program in National University of Computer and Emerging Sciences (FAST-NU). In 2016, she passed a test from National Testing Service and stood 2nd in (Female) all over Chiniot, Punjab, Pakistan and she has been selected for Government Job in Education Department.

Her major areas of interest are Ecommerce, Marketing, Databases, Data Mining, and Data Warehousing.

Author Articles
Data Cleaning In Data Warehouse: A Survey of Data Pre-processing Techniques and Tools

By Anosh Fatima Nosheen Nazir Muhammad Gufran Khan

DOI: https://doi.org/10.5815/ijitcs.2017.03.06, Pub. Date: 8 Mar. 2017

A Data Warehouse is a computer system designed for storing and analyzing an organization's historical data from day-to-day operations in Online Transaction Processing System (OLTP). Usually, an organization summarizes and copies information from its operational systems to the data warehouse on a regular schedule and management performs complex queries and analysis on the information without slowing down the operational systems. Data need to be pre-processed to improve quality of data, before storing into data warehouse. This survey paper presents data cleaning problems and the approaches in use currently for pre-processing. To determine which technique of pre-processing is best in what scenario to improve the performance of Data Warehouse is main goal of this paper. Many techniques have been analyzed for data cleansing, using certain evaluation attributes and tested on different kind of data sets. Data quality tools such as YALE, ALTERYX, and WEKA have been used for conclusive results to ready the data in data warehouse and ensure that only cleaned data populates the warehouse, thus enhancing usability of the warehouse. Results of paper can be useful in many future activities like cleansing, standardizing, correction, matching and transformation. This research can help in data auditing and pattern detection in the data.

[...] Read more.
Other Articles