A Review of Contemporary Data Quality Issues in Data Warehouse ETL Environment
Rupali Gill1 and Jaiteg Singh2 1Assistant Professor, School of Computer Sciences, 2 Associate Professor, School of Computer Applications, Chitkara University, Punjab, India Email: rupali.gill@chitkara.edu.in Abstract: In today’s scenario, Extraction–transformation– loading (ETL) tools have become important pieces of software responsible for integrating heterogeneous information from several sources. The task of carrying out the ETL process is potentially a complex, hard and time consuming. Organisations now –a-days are concerned about vast qualities of data. The data quality is concerned with technical issues in data warehouse environment. Research in last few decades has laid more stress on data quality issues in a data warehouse ETL process. The data quality can be ensured cleaning the data prior to loading the data into a warehouse. Since the data is collected from various sources, it comes in various formats. The standardization of formats and cleaning such data becomes...