Google
 

Wednesday, December 19, 2007

Integration & transformation Prgorams


Functions of integration & transformation programs.


_ Integration & transformation programs perform functions such as…


_ Reformatting, recalculating or modifying key structures


_ Adding time elements


_ Identifying default values


_ Supplying logic to choose between multiple data sources


_ Summarizing, tallying & merging data from multiple sources


_ Archives


_ Contain old data – 2 yrs old


_ Achieves data – used at – forecasting & trend analysis


_ Metadata


_ Data about data


_ Data warehousing architecture


_ Card catalogue



Structure of Data Warehouse


Physical Data warehouse All data of data warehouse is stored


Logical Data Warehouse Contains information necessary to access the


data


Data Mart Subset of an enterprise


_ Obstacles to Successful data warehouse projects


_ Focus on the most important information in compani’s data


Warehouse


_ Evolution of Data mining


_ 3 technologies that are now sufficiently mature


_ Massive data collection


_ Powerful multiprocessor computers


_ Data mining algorithms


_ Four Revolutionary steps


1 Data Collection 1960


2 Data Access 1980


RDBMS – Relational databases management system


SQL – Structured Query language


ODBS


3 Data


Warehousing &


Decision


support


1960


Capable of answering business questions


Used – OLAP


- Multidimensional databases


- Data warehouses


4 Data mining Future


Uses advanced algorithms, multiprocessors, computers ,


massive databases


_ Advantages Data Mining : Tasks solved by data mining


Predicting Explicit modeling


Classification Clustering


Detection of relations Deviation Detection


_ Data mining can generate new business opportunities by providing


these capabilities


_ Automated prediction of trends & behavior


_ Automated discovery of previously unknown pattern_ Database can be larger in both depth & breadth


_ Technologies used in Data Mining


1 Neural networks Non linear predictive models


2 Role induction If / then rules


3 Evolutionary


programming


Youngest & evidently


Most praising branch of data mining


System automatically formulates hypotheses


4 Case based


reasoning [CBR]


Find closest part analogs of the present situation


Nearest neighbor system


5 Decision tree Represents set of decisions


Generate rules for the classification of a data set


6 Generic algoithms Use processes such as…


Generic combination - Mutation


Natural selection


7 Non linear


regression method


Based on searching for a depend any of the


targeted variable on other variable


No comments: