Computer Science, asked by Mayb8488, 1 year ago

Explain the need of separate data staging area in data warehouse

Answers

Answered by bhasinanshp9dlfy
0

"We have a simple data warehouse that takes data from a few RDBMS source systems and load the data in dimension and fact tables of the warehouse. I wonder why we have a staging layer in between. Why can’t we process everything on the fly and push them in the data warehouse?"

Last night, I received this question from one of the members of DWBIConcepts community over email and thought of discussing the pros and cons of having a staging layer in this article.

Really staging area is not a necessity if we can handle it on the fly. But can we? Here are a few reasons why you can’t avoid a staging area:

Source systems are only available for extraction during a specific time slot which is generally lesser than your overall data loading time. It’s a good idea to extract and keep things at your end before you lose the connection to the source systems.

You want to extract data based on some conditions which require you to join two or more different systems together. E.g. you want to only extract those customers who also exist in some other system. You will not be able to perform a SQL query joining two tables from two physically different databases.

Answered by DudeItzKay
0

Answer:

Here's ur Answer Mate:-

The staging area is mainly used to quickly extract data from its data sources, minimizing the impact of the sources. After data has been loaded into the staging area, the staging area is used to combine data from multiple data sources, transformations, validations, data cleansing.

Hope it helps uh!

Similar questions