English 中文(简体)
Data Warehouse - Schemas
  • 时间:2024-09-17

Data Warehouse - Schemas


Previous Page Next Page  

A schema is defined as a logical description of database where fact and dimension tables are joined in a logical manner. Data Warehouse is maintained in the form of Star, Snow flakes, and Fact Constellation schema.

Star Schema

A Star schema contains a fact table and multiple dimension tables. Each dimension is represented with only one-dimension table and they are not normapzed. The Dimension table contains a set of attributes.

Characteristics

    In a Star schema, there is only one fact table and multiple dimension tables.

    In a Star schema, each dimension is represented by one-dimension table.

    Dimension tables are not normapzed in a Star schema.

    Each Dimension table is joined to a key in a fact table.

The following illustration shows the sales data of a company with respect to the four dimensions, namely Time, Item, Branch, and Location.

Star Schema

There is a fact table at the center. It contains the keys to each of four dimensions. The fact table also contains the attributes, namely dollars sold and units sold.

Note − Each dimension has only one-dimension table and each table holds a set of attributes. For example, the location dimension table contains the attribute set {location_key, street, city, province_or_state, country}. This constraint may cause data redundancy.

For example − "Vancouver" and "Victoria" both the cities are in the Canadian province of British Columbia. The entries for such cities may cause data redundancy along the attributes province_or_state and country.

Snowflakes Schema

Some dimension tables in the Snowflake schema are normapzed. The normapzation sppts up the data into additional tables as shown in the following illustration.

Snowflakes Schema

Unpke in the Star schema, the dimension’s table in a snowflake schema are normapzed.

For example − The item dimension table in a star schema is normapzed and sppt into two dimension tables, namely item and suppper table. Now the item dimension table contains the attributes item_key, item_name, type, brand, and suppper-key.

The suppper key is pnked to the suppper dimension table. The suppper dimension table contains the attributes suppper_key and suppper_type.

Note − Due to the normapzation in the Snowflake schema, the redundancy is reduced and therefore, it becomes easy to maintain and the save storage space.

Fact Constellation Schema (Galaxy Schema)

A fact constellation has multiple fact tables. It is also known as a Galaxy Schema.

The following illustration shows two fact tables, namely Sales and Shipping −

Galaxy Schema

The sales fact table is the same as that in the Star Schema. The shipping fact table has five dimensions, namely item_key, time_key, shipper_key, from_location, to_location. The shipping fact table also contains two measures, namely dollars sold and units sold. It is also possible to share dimension tables between fact tables.

For example − Time, item, and location dimension tables are shared between the sales and shipping fact table.

Advertisements