<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-1786747298065401647</id><updated>2012-02-16T05:18:59.509-08:00</updated><category term='Tips'/><title type='text'>Data Warehousing Tips</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>39</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4032646128543003505</id><published>2011-09-29T07:59:00.001-07:00</published><updated>2011-09-29T08:00:43.698-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Interview Questions for Data Warehousing</title><content type='html'>What is Data warehousing?&lt;br /&gt;&lt;br /&gt;A data warehouse can be considered as a storage area where interest specific or relevant data is stored irrespective of the source. What actually is required to create a data warehouse can be considered as Data Warehousing. Data warehousing merges data from multiple sources into an easy and complete form.&lt;br /&gt;&lt;br /&gt;What are fact tables and dimension tables?&lt;br /&gt;&lt;br /&gt;As mentioned, data in a warehouse comes from the transactions. Fact table in a data warehouse consists of facts and/or measures. The nature of data in a fact table is usually numerical. On the other hand, dimension table in a data warehouse contains fields used to describe the data in fact tables. A dimension table can provide additional and descriptive information (dimension) of the field of a fact table.&lt;br /&gt;&lt;br /&gt;e.g. If I want to know the number of resources used for a task, my fact table will store the actual measure (of resources) while my Dimension table will store the task and resource details. Hence, the relation between a fact and dimension table is one to many.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is ETL process in data warehousing?&lt;br /&gt;&lt;br /&gt;ETL stands for Extraction, transformation and loading. That means extracting data from different sources such as flat files, databases or XML data, transforming this data depending on the application’s need and loads this data into data warehouse.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Explain the difference between data mining and data warehousing?&lt;br /&gt;&lt;br /&gt;Data mining is a method for comparing large amounts of data for the purpose of finding patterns. Data mining is normally used for models and forecasting. Data mining is the process of correlations, patterns by shifting through large data repositories using pattern recognition techniques.&lt;br /&gt;&lt;br /&gt;Data warehousing is the central repository for the data of several business systems in an enterprise. Data from various resources extracted and organized in the data warehouse selectively for analysis and accessibility.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is an OLTP system and OLAP system?&lt;br /&gt;&lt;br /&gt;OLTP stands for OnLine Transaction Processing. Applications that supports and manges transactions which involve high volumes of data are supported by OLTP system. OLTP is based on client-server architecture and supports transactions across networks.&lt;br /&gt;&lt;br /&gt;OLAP stands for OnLine Analytical Processing. Business data analysis and complex calculations on low volumes of data are performed by OLAP. An insight of data coming from various resources can be gained by a user with the support of OLAP.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What are cubes?&lt;br /&gt;&lt;br /&gt;Multi dimensional data is logically represented by Cubes in data warehousing. The dimension and the data are represented by the edge and the body of the cube respectively. OLAP environments view the data in the form of hierarchical cube. A cube typically includes the aggregations that are needed for business intelligence queries.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is snow flake scheme design in database?&lt;br /&gt;&lt;br /&gt;Snow flake schema is one of the designs that are present in database design. Snow flake schema serves the purpose of dimensional modeling in data warehousing. If the dimensional table is split into many tables, where the schema is inclined slightly towards normalization, then the snow flake design is utilized. It contains joins in depth. The reason is that, the tables split further.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Explain the difference between star and snowflake schemas?&lt;br /&gt;&lt;br /&gt;Star schema: A highly de-normalized technique. A star schema has one fact table and is associated with numerous dimensions table and depicts a star.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Snow flake schema: The normalized principles applied star schema is known as Snow flake schema. Every dimension table is associated with sub dimension table.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Differences:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;* A dimension table will not have parent table in star schema, whereas snow flake schemas have one or more parent tables.&lt;br /&gt;&lt;br /&gt;* The dimensional table itself consists of hierarchies of dimensions in star schema, where as hierarchies are split into different tables in snow flake schema. The drilling down data from top most hierarchies to the lowermost hierarchies can be done.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the difference between view and materialized view?&lt;br /&gt;&lt;br /&gt;A view is created by combining data from different tables. Hence, a view does not have data of itself.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;On the other hand, Materialized view usually used in data warehousing has data. This data helps in decision making, performing calculations etc. The data stored by calculating it before hand using queries. When a view is created, the data is not stored in the database. The data is created when a query is fired on the view. Whereas, data of a materialized view is stored.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is junk dimension?&lt;br /&gt;&lt;br /&gt;A single dimension is formed by lumping a number of small dimensions. This dimension is called a junk dimension. Junk dimension has unrelated attributes. The process of grouping random flags and text attributes in dimension by transmitting them to a distinguished sub dimension is related to junk dimension.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is degenerate dimension table?&lt;br /&gt;&lt;br /&gt;A degenerate table does not have its own dimension table. It is derived from a fact table. The column (dimension) which is a part of fact table but does not map to any dimension.&lt;br /&gt;&lt;br /&gt;E.g. employee_id&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is conformed fact and conformed dimensions use for?&lt;br /&gt;&lt;br /&gt;Conformed fact in a warehouse allows itself to have same name in separate tables. They can be compared and combined mathematically. Conformed dimensions can be used across multiple data marts. These conformed dimensions have a static structure. Any dimension table that is used by multiple fact tables can be conformed dimensions.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is Virtual Data Warehousing?&lt;br /&gt;&lt;br /&gt;A virtual data warehouse provides a compact view of the data inventory. It contains Meta data. It uses middleware to build connections to different data sources. They can be fast as they allow users to filter the most important pieces of data from different legacy applications.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is active data warehousing?&lt;br /&gt;&lt;br /&gt;An Active data warehouse aims to capture data continuously and deliver real time data. They provide a single integrated view of a customer across multiple business lines. It is associated with Business Intelligence Systems&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the difference between dependent and independent data warehouse?&lt;br /&gt;&lt;br /&gt;A dependent data warehouse stored the data in a central data warehouse. On the other hand independent data warehouse does not make use of a central data warehouse.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Difference between data modeling and data mining?&lt;br /&gt;&lt;br /&gt;Data modeling aims to identify all entities that have data. It then defines a relationship between these entities. Data models can be conceptual, logical or Physical data models. Conceptual models are typically used to explore high level business concepts in case of stakeholders. Logical models are used to explore domain concepts. While Physical models are used to explore database design.&lt;br /&gt;&lt;br /&gt;Data mining is used to examine or explore the data using queries. These queries can be fired on the data warehouse. Data mining helps in reporting, planning strategies, finding meaningful patterns etc. it can be used to convert a large amount of data into a sensible form.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the difference between ER Modeling and Dimensional Modeling?&lt;br /&gt;&lt;br /&gt;ER modeling, that models an ER diagram represents the entire businesses or applications processes. This diagram can be segregated into multiple Dimensional models. This is to say, an ER model will have both logical and physical model. The Dimensional model will only have physical model.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is Data Mart?&lt;br /&gt;&lt;br /&gt;Data mart stores particular data that is gathered from different sources. Particular data may belong to some specific community (group of people) or genre. Data marts can be used to focus on specific business needs.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What are various methods of loading Dimension tables?&lt;br /&gt;&lt;br /&gt;Conventional load: Here the data is checked for any table constraints before loading.&lt;br /&gt;&lt;br /&gt;Direct or Faster load: The data is directly loaded without checking for any constraints.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the difference between OLAP and data warehouse?&lt;br /&gt;&lt;br /&gt;A data warehouse serves as a repository to store historical data that can be used for analysis. OLAP is Online Analytical processing that can be used to analyze and evaluate data in a warehouse. The warehouse has data coming from varied sources. OLAP tool helps to organize data in the warehouse using multidimensional models.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Describe the foreign key columns in fact table and dimension table?&lt;br /&gt;&lt;br /&gt;The primary keys of entity tables are the foreign keys of dimension tables.&lt;br /&gt;&lt;br /&gt;The Primary keys of fact dimensional table are the foreign keys of fact tables.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Define the term slowly changing dimensions (SCD)?&lt;br /&gt;&lt;br /&gt;SCD are dimensions whose data changes very slowly. An example of this can be city of an employee. This dimension will change very slowly. The row of this data in the dimension can be either replaced completely without any track of old record OR a new row can be inserted, OR the change can be tracked&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is a Star Schema?&lt;br /&gt;&lt;br /&gt;A star schema comprises of fact and dimension tables. Fact table contains the fact or the actual data. Usually numerical data is stored with multiple columns and many rows. Dimension tables contain attributes or smaller granular data. The fact table in start schema will have foreign key references of dimension tables.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the difference between star and snowflake schema?&lt;br /&gt;&lt;br /&gt;Star Schema: A de-normalized technique in which one fact table is associated with several dimension tables. It resembles a star.&lt;br /&gt;&lt;br /&gt;Snow Flake Schema: A star schema that is applied with normalized principles is known as Snow flake schema. Every dimension table is associated with sub dimension table.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Explain the use lookup tables and Aggregate tables?&lt;br /&gt;&lt;br /&gt;An aggregate table contains summarized view of data. Lookup tables, using the primary key of the target, allow updating of records based on the lookup condition.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is real time data-warehousing?&lt;br /&gt;&lt;br /&gt;In real time data-warehousing, the warehouse is updated every time the system performs a transaction. It reflects the businesses real time information. This means that when the query is fired in the warehouse, the state of the business at that time will be returned.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Define non-additive facts?&lt;br /&gt;&lt;br /&gt;The facts that can not be summed up for the dimensions present in the fact table are called non-additive facts. The facts can be useful if there are changes in dimensions. For example, profit margin is a non-additive fact for it has no meaning to add them up for the account level or the day level.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Define BUS Schema?&lt;br /&gt;&lt;br /&gt;A BUS schema is to identify the common dimensions across business processes, like identifying conforming dimensions. BUS schema has conformed dimension and standardized definition of facts.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is data cleaning? How can we do that?&lt;br /&gt;&lt;br /&gt;Data cleaning is the process of identifying erroneous data. The data is checked for accuracy, consistency, typos etc.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Data cleaning Methods:&lt;br /&gt;&lt;br /&gt;Parsing – Used to detect syntax errors.&lt;br /&gt;&lt;br /&gt;Data Transformation – Confirms that the input data matches in format with expected data.&lt;br /&gt;&lt;br /&gt;Duplicate elimination – This process gets rid of duplicate entries.&lt;br /&gt;&lt;br /&gt;Statistical Methods- values of mean, standard deviation, range, or clustering algorithms etc are used to find erroneous data.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is the purpose of Fact less Fact Table?&lt;br /&gt;&lt;br /&gt;Fact less tables are so called because they simply contain keys which refer to the dimension tables. Hence, they don’t really have facts or any information but are more commonly used for tracking some information of an event.&lt;br /&gt;&lt;br /&gt;Eg. To find the number of leaves taken by an employee in a month.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is a level of Granularity of a fact table?&lt;br /&gt;&lt;br /&gt;A fact table is usually designed at a low level of Granularity. This means that we need to find the lowest level of information that can store in a fact table.&lt;br /&gt;&lt;br /&gt;E.g. Employee performance is a very high level of granularity. Employee_performance_daily, employee_perfomance_weekly can be considered lower levels of granularity.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is Bit Mapped Index?&lt;br /&gt;&lt;br /&gt;Bitmap indexes make use of bit arrays (bitmaps) to answer queries by performing bitwise logical operations.&lt;br /&gt;&lt;br /&gt;They work well with data that has a lower cardinality which means the data that take fewer distinct values.&lt;br /&gt;&lt;br /&gt;Bitmap indexes are useful in the data warehousing applications.&lt;br /&gt;&lt;br /&gt;Bitmap indexes have a significant space and performance advantage over other structures for such data.&lt;br /&gt;&lt;br /&gt;Tables that have less number of insert or update operations can be good candidates.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The advantages of Bitmap indexes are:&lt;br /&gt;&lt;br /&gt;They have a highly compressed structure, making them fast to read.&lt;br /&gt;&lt;br /&gt;Their structure makes it possible for the system to combine multiple indexes together so that they can access the underlying table faster.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The Disadvantage of Bitmap indexes is:&lt;br /&gt;&lt;br /&gt;The overhead on maintaining them is enormous.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;What is Data Cardinality?&lt;br /&gt;&lt;br /&gt;Cardinality is the term used in database relations to denote the occurrences of data on either side of the relation.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;There are 3 basic types of cardinality:&lt;br /&gt;&lt;br /&gt;High data cardinality:Values of a data column are very uncommon.&lt;br /&gt;&lt;br /&gt;e.g.: email ids and the user names&lt;br /&gt;&lt;br /&gt;Normal data cardinality:Values of a data column are somewhat uncommon but never unique.&lt;br /&gt;&lt;br /&gt;e.g.: A data column containing LAST_NAME (there may be several entries of the same last name)&lt;br /&gt;&lt;br /&gt;Low data cardinality:Values of a data column are very usual.&lt;br /&gt;&lt;br /&gt;e.g.: flag statuses: 0/1&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Determining data cardinality is a substantial aspect used in data modeling. This is used to determine the relationships&lt;br /&gt;&lt;br /&gt;Types of cardinalities:&lt;br /&gt;&lt;br /&gt;The Link Cardinality – 0:0 relationships&lt;br /&gt;&lt;br /&gt;The Sub-type Cardinality – 1:0 relationships&lt;br /&gt;&lt;br /&gt;The Physical Segment Cardinality – 1:1 relationship&lt;br /&gt;&lt;br /&gt;The Possession Cardinality – 0: M relation&lt;br /&gt;&lt;br /&gt;The Child Cardinality – 1: M mandatory relationship&lt;br /&gt;&lt;br /&gt;The Characteristic Cardinality – 0: M relationship&lt;br /&gt;&lt;br /&gt;The Paradox Cardinality – 1: M relationship.&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4032646128543003505?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4032646128543003505/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4032646128543003505' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4032646128543003505'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4032646128543003505'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2011/09/interview-questions-for-data.html' title='Interview Questions for Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-7347285218272441872</id><published>2009-01-22T07:16:00.001-08:00</published><updated>2009-01-22T07:16:47.170-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>CHAID in Data Mining</title><content type='html'>Chi Square Automatic Interaction Detection. A decision tree technique used for classification of a dataset. &lt;br /&gt;&lt;br /&gt;Provides a set of rules that you can apply to a new (unclassified) dataset to predict which records will have a given outcome. &lt;br /&gt;&lt;br /&gt;Segments a dataset by using chi square tests to create multi-way splits. Preceded, and requires more data preparation than, CART.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-7347285218272441872?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/7347285218272441872/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=7347285218272441872' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7347285218272441872'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7347285218272441872'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/chaid-in-data-mining.html' title='CHAID in Data Mining'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-382048044885148343</id><published>2009-01-22T07:15:00.000-08:00</published><updated>2009-01-22T07:16:19.693-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>CART in Data Warehousing</title><content type='html'>Classification and Regression Trees. A decision tree technique used for classification of a dataset. &lt;br /&gt;&lt;br /&gt;Provides a set of rules that you can apply to a new (unclassified) dataset to predict which records will have a given outcome. &lt;br /&gt;&lt;br /&gt;Segments a dataset by creating 2-way splits. Requires less data preparation than CHAID.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-382048044885148343?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/382048044885148343/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=382048044885148343' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/382048044885148343'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/382048044885148343'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/cart-in-data-warehousing.html' title='CART in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-6347563146316736434</id><published>2009-01-22T07:12:00.000-08:00</published><updated>2009-01-22T07:14:35.722-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Data Mining - Intro</title><content type='html'>Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential to help companies focus on the most important information in their data warehouses. &lt;br /&gt;&lt;br /&gt;Data mining tools predict future trends and behaviors, allowing businesses to make proactive, knowledge-driven decisions. &lt;br /&gt;&lt;br /&gt;The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by retrospective tools typical of decision support systems. &lt;br /&gt;&lt;br /&gt;Data mining tools can answer business questions that traditionally were too time consuming to resolve. &lt;br /&gt;&lt;br /&gt;They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-6347563146316736434?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/6347563146316736434/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=6347563146316736434' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6347563146316736434'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6347563146316736434'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/data-mining-intro.html' title='Data Mining - Intro'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-5573972385059335832</id><published>2009-01-18T04:19:00.001-08:00</published><updated>2009-01-18T04:20:26.278-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Advantages and Disadvantages of ROLAP</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Advantages:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-Can handle large amounts of data: The data size limitation of ROLAP technology is the limitation on data size of the underlying relational database. In other words, ROLAP itself places no limitation on data amount.&lt;br /&gt;&lt;br /&gt;-Can leverage functionalities inherent in the relational database: Often, relational database already comes with a host of functionalities. ROLAP technologies, since they sit on top of the relational database, can therefore leverage these functionalities.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Disadvantages:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-Performance can be slow: Because each ROLAP report is essentially a SQL query (or multiple SQL queries) in the relational database, the query time can be long if the underlying data size is large.&lt;br /&gt;&lt;br /&gt;-Limited by SQL functionalities: Because ROLAP technology mainly relies on generating SQL statements to query the relational database, and SQL statements do not fit all needs (for example, it is difficult to perform complex calculations using SQL), ROLAP technologies are therefore traditionally limited by what SQL can do. ROLAP vendors have mitigated this risk by building into the tool out-of-the-box complex functions as well as the ability to allow users to define their own functions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-5573972385059335832?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/5573972385059335832/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=5573972385059335832' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5573972385059335832'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5573972385059335832'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/advantages-and-disadvantages-of-rolap.html' title='Advantages and Disadvantages of ROLAP'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-7516969628429345369</id><published>2009-01-18T04:17:00.000-08:00</published><updated>2009-01-18T04:18:57.364-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Advantages and Disadvantages of MOLAP</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Advantages: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-Excellent performance: MOLAP cubes are built for fast data retrieval, and is optimal for slicing and dicing operations.&lt;br /&gt;&lt;br /&gt;-Can perform complex calculations: All calculations have been pre-generated when the cube is created. Hence, complex calculations are not only doable, but they return quickly.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Disadvantages:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-Limited in the amount of data it can handle: Because all calculations are performed when the cube is built, it is not possible to include a large amount of data in the cube itself. This is not to say that the data in the cube cannot be derived from a large amount of data. Indeed, this is possible. But in this case, only summary-level information will be included in the cube itself.&lt;br /&gt;&lt;br /&gt;-Requires additional investment: Cube technology are often proprietary and do not already exist in the organization. Therefore, to adopt MOLAP technology, chances are additional investments in human and capital resources are needed.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-7516969628429345369?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/7516969628429345369/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=7516969628429345369' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7516969628429345369'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7516969628429345369'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/advantages-and-disadvantages-of-molap.html' title='Advantages and Disadvantages of MOLAP'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4896055943652087933</id><published>2009-01-18T04:16:00.000-08:00</published><updated>2009-01-18T04:17:13.542-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>MOLAP Vs ROLAP Vs HOLAP</title><content type='html'>&lt;span style="font-weight: bold;"&gt;MOLAP&lt;/span&gt; : This is the more traditional way of OLAP analysis. In MOLAP, data is stored in a multidimensional cube. The storage is not in the relational database, but in proprietary formats.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;ROLAP : &lt;/span&gt;This methodology relies on manipulating the data stored in the relational database to give the appearance of traditional OLAP's slicing and dicing functionality. In essence, each action of slicing and dicing is equivalent to adding a "WHERE" clause in the SQL statement.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;HOLAP : &lt;/span&gt;HOLAP technologies attempt to combine the advantages of MOLAP and ROLAP. For summary-type information, HOLAP leverages cube technology for faster performance. When detail information is needed, HOLAP can "drill through" from the cube into the underlying relational data.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4896055943652087933?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4896055943652087933/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4896055943652087933' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4896055943652087933'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4896055943652087933'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/molap-vs-rolap-vs-holap.html' title='MOLAP Vs ROLAP Vs HOLAP'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-475545554308224206</id><published>2009-01-18T04:13:00.000-08:00</published><updated>2009-01-18T04:15:18.634-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Data Warehousing Terms A to Z</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Aggregation:&lt;/span&gt; One way of speeding up query performance. Facts are summed up for selected dimensions from the original fact table. The resulting aggregate table will have fewer rows, thus making queries that can use them go faster.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Attribute:&lt;/span&gt; Attributes represent a single type of information in a dimension. For example, year is an attribute in the Time dimension.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Conformed Dimension: &lt;/span&gt;A dimension that has exactly the same meaning and content when being referred from different fact tables.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Data Mart:&lt;/span&gt; Data marts have the same definition as the data warehouse (see below), but data marts have a more limited audience and/or data content.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Data Warehouse:&lt;/span&gt; A warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of management's decision making process (as defined by Bill Inmon).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Data Warehousing:&lt;/span&gt; The process of designing, building, and maintaining a data warehouse system.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Dimension:&lt;/span&gt; The same category of information. For example, year, month, day, and week are all part of the Time Dimension.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Dimensional Model:&lt;/span&gt; A type of data modeling suited for data warehousing. In a dimensional model, there are two types of tables: dimensional tables and fact tables. Dimensional table records information on each dimension, and fact table records all the "fact", or measures.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Dimensional Table:&lt;/span&gt; Dimension tables store records related to this particular dimension. No facts are stored in a dimensional table.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Drill Across:&lt;/span&gt; Data analysis across dimensions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Drill Down:&lt;/span&gt; Data analysis to a child attribute.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Drill Through:&lt;/span&gt; Data analysis that goes from an OLAP cube into the relational database.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Drill Up:&lt;/span&gt; Data analysis to a parent attribute.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;ETL:&lt;/span&gt; Stands for Extraction, Transformation, and Loading. The movement of data from one area to another.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Fact Table:&lt;/span&gt; A type of table in the dimensional model. A fact table typically includes two types of columns: fact columns and foreign keys to the dimensions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Hierarchy:&lt;/span&gt; A hierarchy defines the navigating path for drilling up and drilling down. All attributes in a hierarchy belong to the same dimension.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Metadata:&lt;/span&gt; Data about data. For example, the number of tables in the database is a type of metadata.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Metric:&lt;/span&gt; A measured value. For example, total sales is a metric.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;MOLAP:&lt;/span&gt; Multidimensional OLAP. MOLAP systems store data in the multidimensional cubes.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;OLAP:&lt;/span&gt; On-Line Analytical Processing. OLAP should be designed to provide end users a quick way of slicing and dicing the data.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;ROLAP:&lt;/span&gt; Relational OLAP. ROLAP systems store data in the relational database.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Snowflake Schema:&lt;/span&gt; A common form of dimensional model. In a snowflake schema, different hierarchies in a dimension can be extended into their own dimensional tables. Therefore, a dimension can have more than a single dimension table.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Star Schema:&lt;/span&gt; A common form of dimensional model. In a star schema, each dimension is represented by a single dimension table.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-475545554308224206?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/475545554308224206/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=475545554308224206' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/475545554308224206'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/475545554308224206'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/data-warehousing-terms-to-z.html' title='Data Warehousing Terms A to Z'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-2526161330215605327</id><published>2009-01-18T04:11:00.000-08:00</published><updated>2009-01-18T04:12:36.095-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Business Intelligence</title><content type='html'>Business intelligence is a term commonly associated with data warehousing. &lt;br /&gt;&lt;br /&gt;In fact, many of the tool vendors position their products as business intelligence software rather than data warehousing software. &lt;br /&gt;&lt;br /&gt;There are other occasions where the two terms are used interchangeably. &lt;br /&gt;&lt;br /&gt;Business intelligence usually refers to the information that is available for the enterprise to make decisions on. &lt;br /&gt;&lt;br /&gt;A data warehousing (or data mart) system is the backend, or the infrastructural, component for achieving business intelligence.&lt;br /&gt;&lt;br /&gt;Business intelligence also includes the insight gained from doing data mining analysis, as well as unstrctured data (thus the need fo content management systems).&lt;br /&gt;&lt;br /&gt;For our purposes here, we will discuss business intelligence in the context of using a data warehouse infrastructure.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-2526161330215605327?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/2526161330215605327/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=2526161330215605327' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2526161330215605327'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2526161330215605327'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/business-intelligence.html' title='Business Intelligence'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-3941976215824150148</id><published>2009-01-18T01:30:00.002-08:00</published><updated>2009-01-18T01:31:22.259-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Data Mart in Data Warehousing</title><content type='html'>A data mart (DM) is a specialized version of a data warehouse (DW). &lt;br /&gt;&lt;br /&gt;Like data warehouses, data marts contain a snapshot of operational data that helps business people to strategize based on analyses of past trends and experiences. &lt;br /&gt;&lt;br /&gt;The key difference is that the creation of a data mart is predicated on a specific, predefined need for a certain grouping and configuration of select data. &lt;br /&gt;&lt;br /&gt;A data mart configuration emphasizes easy access to relevant information (Reference : Wiki). &lt;br /&gt;&lt;br /&gt;Data Marts are designed to help manager make strategic decisions about their business.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-3941976215824150148?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/3941976215824150148/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=3941976215824150148' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3941976215824150148'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3941976215824150148'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/data-mart-in-data-warehousing.html' title='Data Mart in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-1222061955866069423</id><published>2009-01-18T01:30:00.001-08:00</published><updated>2009-01-18T01:48:56.006-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Surrogate Key</title><content type='html'>Surrogate key is a substitution for the natural primary key in Data Warehousing. &lt;br /&gt;&lt;br /&gt;It is just a unique identifier or number for each row that can be used for the primary key to the table. &lt;br /&gt;&lt;br /&gt;The only requirement for a surrogate primary key is that it is unique for each row in the table. &lt;br /&gt;&lt;br /&gt;It is useful because the natural primary key can change and this makes updates more difficult.&lt;br /&gt;&lt;br /&gt;Surrogated keys are always integer or numeric.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-1222061955866069423?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/1222061955866069423/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=1222061955866069423' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1222061955866069423'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1222061955866069423'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-surrogate-key.html' title='Define : Surrogate Key'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-3161808273636379725</id><published>2009-01-18T01:29:00.001-08:00</published><updated>2009-01-18T01:49:34.760-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Bus Schema Vs Str Schema Vs Snow Flake Schema</title><content type='html'>&lt;span style="font-weight:bold;"&gt;Bus, Star, Snow flake Schemas in Data Warehousing&lt;span style="font-style:italic;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;BUS Schema&lt;/span&gt; is composed of a master suite of confirmed dimension and standardized definition if facts. &lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Star schema&lt;/span&gt; is a type of organizing the tables such that we can retrieve the result from the database quickly in the warehouse environment. &lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Snowflake Schema,&lt;/span&gt; each dimension has a primary dimension table, to which one or more additional dimensions can join. The primary dimension table is the only table that can join to the fact table.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-3161808273636379725?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/3161808273636379725/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=3161808273636379725' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3161808273636379725'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3161808273636379725'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/bus-schema-vs-str-schema-vs-snow-flake.html' title='Bus Schema Vs Str Schema Vs Snow Flake Schema'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-24065384455339753</id><published>2009-01-18T01:28:00.001-08:00</published><updated>2009-01-18T01:50:14.524-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Hybrid SCD</title><content type='html'>&lt;span style="font-weight:bold;"&gt;Hybrid Slowly Changing Dimension in Data Warehousing:&lt;span style="font-style:italic;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Hybrid SCDs are combination of both SCD 1 and SCD 2. &lt;br /&gt;&lt;br /&gt;It may happen that in a table, some columns are important and we need to track changes for them i.e capture the historical data for them whereas in some columns even if the data changes, we don't care.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-24065384455339753?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/24065384455339753/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=24065384455339753' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/24065384455339753'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/24065384455339753'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-hybrid-scd.html' title='Define : Hybrid SCD'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-1918176637057137589</id><published>2009-01-18T01:27:00.001-08:00</published><updated>2009-01-18T01:50:40.800-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : SCD (Slowly Changing Dimensions)</title><content type='html'>&lt;span style="font-weight:bold;"&gt;SCD is abbreviation of Slowly changing dimensions in Data Warehousing. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;SCD applies to cases where the attribute for a record varies over time. There are three different types of SCD.&lt;br /&gt;&lt;br /&gt;a) SCD1 : The new record replaces the original record. &lt;br /&gt;Only one record exist in database - current data. &lt;br /&gt;&lt;br /&gt;b) SCD2 : A new record is added into the customer dimension table. &lt;br /&gt;Two records exist in database - current data and previous history data.&lt;br /&gt;&lt;br /&gt;c) SCD3 : The original data is modified to include new data. &lt;br /&gt;One record exist in database - new information are attached with old information in same row.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-1918176637057137589?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/1918176637057137589/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=1918176637057137589' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1918176637057137589'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1918176637057137589'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-scd-slowly-changing-dimensions.html' title='Define : SCD (Slowly Changing Dimensions)'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-2130225263980700489</id><published>2009-01-18T01:26:00.001-08:00</published><updated>2009-01-18T01:26:37.569-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Non-Additive Facts in Data Warehousing</title><content type='html'>Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table. &lt;br /&gt;&lt;br /&gt;However they are not considered as useless. If there is changes in dimensions the same facts can be useful.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-2130225263980700489?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/2130225263980700489/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=2130225263980700489' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2130225263980700489'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2130225263980700489'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/non-additive-facts-in-data-warehousing.html' title='Non-Additive Facts in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-6996617866448148641</id><published>2009-01-18T01:25:00.002-08:00</published><updated>2009-01-18T01:26:10.748-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Level of Granularity of a Fact Table</title><content type='html'>It means level of detail that you put into the fact table in a data warehouse. &lt;br /&gt;&lt;br /&gt;Level of granularity would mean what detail are you willing to put for each transactional fact.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-6996617866448148641?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/6996617866448148641/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=6996617866448148641' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6996617866448148641'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6996617866448148641'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/level-of-granularity-of-fact-table.html' title='Level of Granularity of a Fact Table'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4519865540641984473</id><published>2009-01-18T01:25:00.001-08:00</published><updated>2009-01-18T01:25:33.590-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Confirmed Fact</title><content type='html'>Conformed dimensions are the dimensions which can be used across multiple Data Marts in combination with multiple facts tables accordingly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4519865540641984473?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4519865540641984473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4519865540641984473' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4519865540641984473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4519865540641984473'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-confirmed-fact.html' title='Define : Confirmed Fact'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-648091023143705206</id><published>2009-01-18T01:24:00.000-08:00</published><updated>2009-01-18T01:25:04.141-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Conformed Dimensions in Data Warehousing</title><content type='html'>Conformed dimensions mean the exact same thing with every possible fact table to which they are joined. &lt;br /&gt;&lt;br /&gt;They are common to the cubes.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-648091023143705206?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/648091023143705206/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=648091023143705206' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/648091023143705206'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/648091023143705206'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/conformed-dimensions-in-data.html' title='Conformed Dimensions in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-8748316629179350821</id><published>2009-01-18T01:23:00.002-08:00</published><updated>2009-01-18T01:24:29.745-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Real Time Data warehousing</title><content type='html'>Data warehousing captures business activity data. &lt;br /&gt;&lt;br /&gt;Real-time data warehousing captures business activity data as it occurs. &lt;br /&gt;&lt;br /&gt;As soon as the business activity is complete and there is data about it, the completed activity data flows into the data warehouse and becomes available instantly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-8748316629179350821?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/8748316629179350821/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=8748316629179350821' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8748316629179350821'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8748316629179350821'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-real-time-data-warehousing.html' title='Define : Real Time Data warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-5523167198074366971</id><published>2009-01-18T01:23:00.001-08:00</published><updated>2009-01-18T01:23:38.481-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Full Load Vs Incremental (Refresh) Load in DW</title><content type='html'>Full Load: completely erasing the contents of one or more tables and reloading with fresh data.&lt;br /&gt;&lt;br /&gt;Incremental Load: applying ongoing changes to one or more tables based on a predefined schedule&lt;br /&gt;&lt;br /&gt;Refresh Load: the table will be truncated and data will be loaded again. Here we use to load static dimension table or type tables using this method.&lt;br /&gt;&lt;br /&gt;Incremental Load: It is a method to capture on the newly created or updated record. Based upon the falg or Date this load will be performed.&lt;br /&gt;&lt;br /&gt;Full Load: when we are loading the data for first time, either it may be a base load or history all the set of records will be loaded at a strech depends upon the volume.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-5523167198074366971?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/5523167198074366971/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=5523167198074366971' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5523167198074366971'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5523167198074366971'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/full-load-vs-incremental-refresh-load.html' title='Full Load Vs Incremental (Refresh) Load in DW'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-7561698249790746244</id><published>2009-01-18T01:22:00.001-08:00</published><updated>2009-01-18T01:22:50.228-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : ODS (Operation Data Source)</title><content type='html'>ODS - Operational Data Store.&lt;br /&gt;&lt;br /&gt;ODS Comes between staging area &amp; Data Warehouse. &lt;br /&gt;&lt;br /&gt;The data is ODS will be at the low level of granularity.&lt;br /&gt;&lt;br /&gt;Once data was poopulated in ODS aggregated data will be loaded into into EDW through ODS.&lt;br /&gt;&lt;br /&gt;It is also a simalar small DWH which will help analyst to analysis the bussiness.&lt;br /&gt;&lt;br /&gt;It will have data for less number of days. generally it will be around 30-45 days. &lt;br /&gt;&lt;br /&gt;Like DWH here also we will primary keys will be genrated, error and reject handling will be done.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-7561698249790746244?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/7561698249790746244/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=7561698249790746244' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7561698249790746244'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7561698249790746244'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-ods-operation-data-source.html' title='Define : ODS (Operation Data Source)'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4058679964726765443</id><published>2009-01-18T01:21:00.001-08:00</published><updated>2009-01-18T01:21:35.891-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Aggregate Tables in Data Warehousing</title><content type='html'>Aggregate table contains the summary of existing warehouse data which is grouped to certain levels of dimensions. &lt;br /&gt;&lt;br /&gt;It is always easy to retrieve data from aggregated tables than visiting original table which has million records. &lt;br /&gt;&lt;br /&gt;Aggregate tables reduces the load in the database server and increases the performance of the query and can retrieve the result quickly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4058679964726765443?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4058679964726765443/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4058679964726765443' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4058679964726765443'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4058679964726765443'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-aggregate-tables-in-data.html' title='Define : Aggregate Tables in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-3564605981561536961</id><published>2009-01-18T01:18:00.002-08:00</published><updated>2009-01-18T01:21:07.944-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Lookup Tables</title><content type='html'>A lookup table is the table placed on the target table based upon the primary key of the target, it just updates the table by allowing only modified (new or updated) records based on thelookup condition&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-3564605981561536961?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/3564605981561536961/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=3564605981561536961' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3564605981561536961'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3564605981561536961'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-lookup-tables.html' title='Define : Lookup Tables'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-683304464421475916</id><published>2009-01-18T01:18:00.001-08:00</published><updated>2009-01-18T01:18:25.043-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : VLDB</title><content type='html'>VLDB is abbreviation of Very Large DataBase. &lt;br /&gt;&lt;br /&gt;A one terabyte database would normally be considered to be a VLDB. &lt;br /&gt;&lt;br /&gt;Typically, these are decision support systems or transaction processing applications serving large numbers of users.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-683304464421475916?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/683304464421475916/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=683304464421475916' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/683304464421475916'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/683304464421475916'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-vldb.html' title='Define : VLDB'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-7650814162350507093</id><published>2009-01-18T01:14:00.002-08:00</published><updated>2009-01-18T01:17:57.813-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : ETL</title><content type='html'>ETL is abbreviation of extract, transform, and load. &lt;br /&gt;&lt;br /&gt;ETL is software that enables businesses to consolidate their disparate data while moving it from place to place, and it doesn't really matter that that data is in different forms or formats. &lt;br /&gt;&lt;br /&gt;The data can come from any source.&lt;br /&gt;&lt;br /&gt;ETL is powerful enough to handle such data disparities. &lt;br /&gt;&lt;br /&gt;First, the extract function reads data from a specified source database and extracts a desired subset of data. &lt;br /&gt;&lt;br /&gt;Next, the transform function works with the acquired data - using rules orlookup tables, or creating combinations with other data - to convert it to the desired state.&lt;br /&gt;&lt;br /&gt;Finally, the load function is used to write the resulting data to a target database.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-7650814162350507093?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/7650814162350507093/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=7650814162350507093' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7650814162350507093'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7650814162350507093'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-etl.html' title='Define : ETL'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4876077342849841805</id><published>2009-01-18T01:14:00.001-08:00</published><updated>2009-01-18T01:14:49.863-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : ODS</title><content type='html'>ODS is abbreviation of Operational Data Store. &lt;br /&gt;&lt;br /&gt;A database structure that is a repository for near real-time operational data rather than long term trend data. &lt;br /&gt;&lt;br /&gt;The ODS may further become the enterprise shared operational database, allowing operational systems that are being reengineered to use the ODS as there operation databases.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4876077342849841805?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4876077342849841805/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4876077342849841805' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4876077342849841805'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4876077342849841805'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-ods.html' title='Define : ODS'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-6770781951423418735</id><published>2009-01-18T01:13:00.002-08:00</published><updated>2009-01-18T01:14:21.159-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : ER Diagram in Data Warehousing</title><content type='html'>Entity Relationship Diagrams are a major data modelling tool and will help organize the data in your project into entities and define the relationships between the entities. &lt;br /&gt;&lt;br /&gt;This process has proved to enable the analyst to produce a good database structure so that the data can be stored and retrieved in a most efficient manner. &lt;br /&gt;&lt;br /&gt;An entity-relationship (ER) diagram is a specialized graphic that illustrates the interrelationships between entities in a database. &lt;br /&gt;&lt;br /&gt;A type of diagram used in data modeling for relational data bases. These diagrams show the structure of each table and the links between tables.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-6770781951423418735?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/6770781951423418735/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=6770781951423418735' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6770781951423418735'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/6770781951423418735'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-er-diagram-in-data-warehousing.html' title='Define : ER Diagram in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-7331470038689357050</id><published>2009-01-18T01:13:00.001-08:00</published><updated>2009-01-18T01:13:45.126-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>view Vs Materialized View</title><content type='html'>A view takes the output of a query and makes it appear like a virtual table and it can be used in place of tables. &lt;br /&gt;&lt;br /&gt;A materialized view provides indirect access to table data by storing the results of a query in a separate schema object.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-7331470038689357050?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/7331470038689357050/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=7331470038689357050' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7331470038689357050'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/7331470038689357050'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/view-vs-materialized-view.html' title='view Vs Materialized View'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4430849923369383529</id><published>2009-01-18T01:12:00.000-08:00</published><updated>2009-01-18T01:13:18.453-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Data Mining</title><content type='html'>Data Mining is the process of analyzing data from different perspectives and summarizing it into useful information.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4430849923369383529?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4430849923369383529/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4430849923369383529' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4430849923369383529'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4430849923369383529'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-data-mining.html' title='Define : Data Mining'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-1922697258649506692</id><published>2009-01-18T01:11:00.000-08:00</published><updated>2009-01-18T01:12:35.561-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Foreign Key Columns in Fact and Dimension Table</title><content type='html'>Foreign keys of dimension tables are primary keys of entity tables. &lt;br /&gt;&lt;br /&gt;Foreign keys of facts tables are primary keys of Dimension tables.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-1922697258649506692?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/1922697258649506692/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=1922697258649506692' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1922697258649506692'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/1922697258649506692'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/foreign-key-columns-in-fact-and.html' title='Foreign Key Columns in Fact and Dimension Table'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-8208729297375347227</id><published>2009-01-18T01:10:00.002-08:00</published><updated>2009-01-18T01:11:51.638-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>OLTP vs OLAP : Difference</title><content type='html'>Data Source OLTP: Operational data is from original data source of the data &lt;br /&gt;&lt;br /&gt;OLAP: Consolidation data is from various source. &lt;br /&gt;&lt;br /&gt;Process Goal OLTP: Snapshot of business processes which does fundamental business tasks &lt;br /&gt;&lt;br /&gt;OLAP: Multi-dimensional views of business activities of planning and decision making Queries and Process Scripts OLTP: Simple quick running queries ran by users.&lt;br /&gt;&lt;br /&gt;OLAP: Complex long running queries by system to update the aggregated data. Database &lt;br /&gt;&lt;br /&gt;Design OLTP: Normalized small database. Speed will be not an issue due to smaller database and normalization will not degrade performance. This adopts entity relationship(ER) model and an application-oriented database design. &lt;br /&gt;&lt;br /&gt;OLAP: De-normalized large database. Speed is issue due to larger database and de-normalizing will improve performance as there will be lesser tables to scan while performing tasks. &lt;br /&gt;&lt;br /&gt;This adopts star, snowflake or fact constellation mode of subject-oriented database design. Back up and System Administration OLTP: Regular Database backup and system administration can do the job. &lt;br /&gt;&lt;br /&gt;OLAP: Reloading the OLTP data is good considered as good backup option.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-8208729297375347227?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/8208729297375347227/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=8208729297375347227' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8208729297375347227'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8208729297375347227'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/oltp-vs-olap-difference.html' title='OLTP vs OLAP : Difference'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-8508456908226678105</id><published>2009-01-18T01:10:00.001-08:00</published><updated>2009-01-18T01:10:36.471-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : OLAP</title><content type='html'>OLAP is abbreviation of Online Analytical Processing. &lt;br /&gt;&lt;br /&gt;This system is an application that collects, manages, processes and presents multidimensional data for analysis and management purposes.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-8508456908226678105?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/8508456908226678105/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=8508456908226678105' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8508456908226678105'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8508456908226678105'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-olap.html' title='Define : OLAP'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-4266470217065403859</id><published>2009-01-18T01:09:00.000-08:00</published><updated>2009-01-18T01:10:09.909-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : OLTP</title><content type='html'>OLTP is abbreviation of On-Line Transaction Processing. &lt;br /&gt;&lt;br /&gt;This system is an application that modifies data the instance it receives and has a large number of concurrent users.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-4266470217065403859?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/4266470217065403859/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=4266470217065403859' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4266470217065403859'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/4266470217065403859'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-oltp.html' title='Define : OLTP'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-3686993173747753812</id><published>2009-01-18T01:08:00.002-08:00</published><updated>2009-01-18T01:09:35.882-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Loading Dimension Tables in Data warehousing</title><content type='html'>Conventional (Slow) : All the constraints and keys are validated against the data before, it is loaded, this way data integrity is maintained.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Direct (Fast) : All the constraints and keys are disabled before the data is loaded. Once data is loaded, it is validated against all the constraints and keys. &lt;br /&gt;&lt;br /&gt;If data is found invalid or dirty it is not included in index and all future processes are skipped on this data.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-3686993173747753812?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/3686993173747753812/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=3686993173747753812' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3686993173747753812'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/3686993173747753812'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/loading-dimension-tables-in-data.html' title='Loading Dimension Tables in Data warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-8901569825265281260</id><published>2009-01-18T01:08:00.001-08:00</published><updated>2009-01-18T01:08:24.247-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Define : Dimension Table</title><content type='html'>Dimensional table contains textual attributes of measurements stored in the facts tables. &lt;br /&gt;&lt;br /&gt;Dimensional table is a collection of hierarchies, categories and logic which can be used for user to traverse in hierarchy nodes.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-8901569825265281260?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/8901569825265281260/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=8901569825265281260' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8901569825265281260'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/8901569825265281260'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/define-dimension-table.html' title='Define : Dimension Table'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-5733133815568946209</id><published>2009-01-18T01:07:00.001-08:00</published><updated>2009-01-18T01:07:54.109-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Fact Table in Data Warehousing</title><content type='html'>Fact table contains measurements of business process. &lt;br /&gt;&lt;br /&gt;Fact table contains the foreign keys for the dimension tables. Example, if you are business process is "paper production", "average production of paper by one machine" or "weekly production of paper" will be considered as measurement of business process.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-5733133815568946209?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/5733133815568946209/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=5733133815568946209' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5733133815568946209'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/5733133815568946209'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/fact-table-in-data-warehousing.html' title='Fact Table in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-234603499818873876</id><published>2009-01-18T01:06:00.002-08:00</published><updated>2009-01-18T01:07:23.290-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Dimensional Modeling in Data Warehousing</title><content type='html'>Dimensional data model concept involves two types of tables and it is different from the 3rd normal form. &lt;br /&gt;&lt;br /&gt;This concepts uses Facts table which contains the measurements of the business and Dimension table which contains the context(dimension of calculation) of the measurements.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-234603499818873876?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/234603499818873876/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=234603499818873876' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/234603499818873876'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/234603499818873876'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/dimensional-modeling-in-data.html' title='Dimensional Modeling in Data Warehousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-9110288236009646712</id><published>2009-01-18T01:06:00.001-08:00</published><updated>2009-01-18T01:06:52.843-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Fundamental Stages of Data Wareshousing</title><content type='html'>Offline Operational Databases -Data warehouses in this initial stage are developed by simply copying the database of an operational system to an off-line server where the processing load of reporting does not impact on the operational system's performance. &lt;br /&gt;&lt;br /&gt;Offline Data Warehouse -Data warehouses in this stage of evolution are updated on a regular time cycle (usually daily, weekly or monthly) from the operational systems and the data is stored in an integrated reporting-oriented data structure Real Time Data Warehouse - Data warehouses at this stage are updated on a transaction or event basis, every time an operational system performs a transaction (e.g. an order or a delivery or a booking etc.) &lt;br /&gt;&lt;br /&gt;Integrated Data Warehouse - Data warehouses at this stage are used to generate activity or transactions that are passed back into the operational systems for use in the daily activity of the organization.;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-9110288236009646712?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/9110288236009646712/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=9110288236009646712' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/9110288236009646712'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/9110288236009646712'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/fundamental-stages-of-data-wareshousing.html' title='Fundamental Stages of Data Wareshousing'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1786747298065401647.post-2569697388655929365</id><published>2009-01-18T01:04:00.000-08:00</published><updated>2009-01-18T01:06:01.722-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tips'/><title type='text'>Datawarehousing : Intro</title><content type='html'>&lt;span style="font-weight:bold;"&gt;What is Data Warehousing?&lt;/span&gt; &lt;br /&gt;&lt;br /&gt;A data warehouse is the main repository of an organization's historical data, its corporate memory. &lt;br /&gt;&lt;br /&gt;It contains the raw material for management's decision support system. The critical factor leading to the use of a data warehouse is that a data analyst can perform complex queries and analysis, such as data mining, on the information without slowing down the operational systems (Ref:Wikipedia). &lt;br /&gt;&lt;br /&gt;Data warehousing collection of data designed to support management decision making.&lt;br /&gt;&lt;br /&gt;Data warehouses contain a wide variety of data that present a coherent picture of business conditions at a single point in time. &lt;br /&gt;&lt;br /&gt;It is a repository of integrated information, available for queries and analysis.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1786747298065401647-2569697388655929365?l=datawarehousingtips.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://datawarehousingtips.blogspot.com/feeds/2569697388655929365/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1786747298065401647&amp;postID=2569697388655929365' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2569697388655929365'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1786747298065401647/posts/default/2569697388655929365'/><link rel='alternate' type='text/html' href='http://datawarehousingtips.blogspot.com/2009/01/datawarehousing-intro.html' title='Datawarehousing : Intro'/><author><name>Thenmozhi</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
