Data is Dimensional

Dimensional Modelling for Advanced Data Analytics and Cloud Solutions

Category: Architecture

  • The optimal data schema for parallelization is a Star Schema. Normalized data models are very poor for such systems because all tables are based on a unique primary key.  Vendors that encouraged such modeling (Teradata) included an extensive array of bizarre indexing strategies to overcome the issue.  So, the machine required a lot of handholding…