Data warehousing and data mining work together: the first builds a warehouse of data, and the second extracts valuable insight from it.

Data mining refers to extracting knowledge from large amounts of data, for example looking for patterns that may lead to higher sales and profits, and more generally finding useful information for business intelligence. Knowledge discovery is an iterative sequence of steps whose first step is data cleaning, which removes inconsistent data. Data that is incomplete, noisy, or uncertain can distort the results. Data mining models help identify relationships between input columns and predictable columns. A clustering algorithm, for instance, groups data with similar characteristics into sets called clusters.

The term data warehouse was coined by Bill Inmon in 1990. According to Inmon, a data warehouse is a subject-oriented, integrated, time-variant, and non-volatile collection of data. More broadly, data warehousing is the electronic storage of a large amount of information by a business in a manner that is secure, reliable, easy to retrieve, and easy to manage. Companies with a dedicated data warehousing team are often better placed than their competitors in product development, marketing, pricing strategy, production time, historical analysis, and forecasting. A data warehouse works as a central repository where information arrives from one or more data sources. In practice, data warehousing means extracting data from various sources, transforming it into the required form, and then loading it into the data warehouse.
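The extract, transform, and load sequence described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the two source feeds (crm_rows, web_rows), the sales table, and its column names are hypothetical, and an in-memory SQLite database stands in for a real warehouse.

```python
import sqlite3

# Hypothetical source systems with different shapes and formats.
crm_rows = [
    {"customer": " Alice ", "amount": "120.50", "date": "2023-01-05"},
    {"customer": "Bob", "amount": "80", "date": "2023-01-06"},
]
web_rows = [
    ("alice", 35.0, "2023-01-07"),
    ("carol", None, "2023-01-07"),  # incomplete record: amount is missing
]


def extract():
    """Extract: pull raw records from every source into one common shape."""
    for row in crm_rows:
        yield row["customer"], row["amount"], row["date"]
    for name, amount, date in web_rows:
        yield name, amount, date


def transform(records):
    """Transform: normalize names, convert amounts to float, skip incomplete rows."""
    for name, amount, date in records:
        if amount is None:
            continue
        yield name.strip().lower(), float(amount), date


def load(conn, records):
    """Load: append the cleaned rows to the (hypothetical) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(customer TEXT, amount REAL, sale_date TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for a real warehouse
load(conn, transform(extract()))

# Once the data is integrated, it can be queried as a single collection.
totals = conn.execute(
    "SELECT customer, SUM(amount) FROM sales "
    "GROUP BY customer ORDER BY customer"
).fetchall()
print(totals)
```

Keeping the three stages as separate functions mirrors the description in the text: each source only needs its own extract logic, while the transform and load steps are shared.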
Enterprise data is the lifeblood of a corporation, but it is useless if it is left to languish in data silos. Three main types of data warehouse are commonly distinguished: the enterprise data warehouse, the operational data store, and the data mart. Data cleansing, metadata management, data distribution, storage management, recovery, and backup planning are processes carried out in a data warehouse, whereas business intelligence (BI) relies on tools that focus on statistics, visualization, and data mining, including self-service business intelligence. Data cleaning is the procedure of identifying and removing corrupt or inaccurate data from a record set, table, or database.

Data warehousing is closely related to data mining, which means looking for meaningful patterns in huge volumes of data in order to devise new strategies for higher sales and profits. The data compiled in the data warehouse, whether collected as analytics, historical, or customer data, is mined to detect meaningful patterns and draw inferences from them. One common mining model is the decision tree: a tree in which every node is either a leaf node or a decision node.
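The decision-tree definition above (every node is either a leaf node or a decision node) can be made concrete with a small data structure. The sketch below is illustrative only: the attribute names, thresholds, and labels are invented for the example, and the tree is written by hand rather than learned from data.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    """Leaf node: holds the final class label."""
    label: str


@dataclass
class Decision:
    """Decision node: tests one attribute and routes to a child node."""
    attribute: str
    threshold: float
    below: "Node"        # followed when record[attribute] < threshold
    at_or_above: "Node"  # followed otherwise


Node = Union[Leaf, Decision]


def classify(node, record):
    """Walk from the root through decision nodes until a leaf is reached."""
    while isinstance(node, Decision):
        if record[node.attribute] < node.threshold:
            node = node.below
        else:
            node = node.at_or_above
    return node.label


# Hypothetical hand-built tree over made-up customer attributes.
tree = Decision(
    attribute="annual_spend",
    threshold=500.0,
    below=Leaf("low value"),
    at_or_above=Decision(
        attribute="orders_per_year",
        threshold=12,
        below=Leaf("occasional big spender"),
        at_or_above=Leaf("loyal customer"),
    ),
)

print(classify(tree, {"annual_spend": 900.0, "orders_per_year": 20}))
```

In a real mining workflow the tree would be induced from warehouse data by a learning algorithm; only the node structure and the classification walk are shown here.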