Recent Posts
-
Natural keys can be complex. It may involve multiple columns making its use prone to error. A column may be omitted in the join causing run-away queries. It is not efficient as it requires significantly more work to compare two strings over two binary integers. A surrogate is simple and easy to understand: The key…
-
The optimal data schema for parallelization is a Star Schema. Normalized data models are very poor for such systems because all tables are based on a unique primary key. Vendors that encouraged such modeling (Teradata) included an extensive array of bizarre indexing strategies to overcome the issue. So, the machine required a lot of handholding…
Welcome!
This site is focused on the use of Dimensional Modeling in creating a modern, flexible, and high performance Analytic Repository.
My blog are my own observations in the industry and impact of Cloud services on how you achieve cost efficiency and better performance.
I encourage you to join our Discussion Forum on all things Data Modeling. I encourage professionals involved in all aspects of data bases for analytic systems, its use, its challenges, and solutions.
Post Archive