Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
In Business Intelligence, your dashboards, reports, and predictive models are only as good as the data feeding them. Data quality dimensions give you a framework for evaluating whether your data is actually fit for purpose—and when exam questions ask you to diagnose why a BI initiative failed or recommend improvements, these dimensions are your diagnostic toolkit. You're being tested on your ability to identify which dimension is compromised in a given scenario and explain the downstream business impact.
Think of data quality dimensions as falling into three categories: intrinsic quality (is the data itself correct?), contextual quality (is it useful for this specific purpose?), and accessibility quality (can stakeholders actually use it?). Don't just memorize definitions—know how each dimension connects to data governance, ETL processes, and decision-making reliability. When you see a case study about conflicting reports or missed business opportunities, your first instinct should be to ask: which quality dimension broke down?
These dimensions evaluate whether data accurately represents reality, independent of how it's being used. Intrinsic quality failures corrupt everything downstream—no amount of sophisticated analytics can fix fundamentally flawed inputs.
Compare: Accuracy vs. Validity—both assess "correctness," but accuracy asks "is this the right value?" while validity asks "is this value properly formatted?" A birthdate of 02/30/1990 is invalid (impossible date), while 02/28/1991 for someone born 02/28/1990 is inaccurate (wrong year). FRQs love this distinction.
These dimensions ensure data maintains its quality across systems, time, and transformations. Structural failures often emerge during data integration when multiple sources collide.
Compare: Consistency vs. Integrity—consistency is about uniformity across systems (horizontal alignment), while integrity is about preservation over time (vertical alignment). A database could have perfect integrity but still be inconsistent with other systems using different naming conventions.
These dimensions evaluate whether data serves the specific business need at hand. Data that's perfect for one use case may be worthless for another.
Compare: Completeness vs. Relevance—these create tension in data strategy. Completeness pushes toward "capture everything," while relevance argues "capture only what matters." The best BI architectures balance both by defining clear data requirements before collection begins.
This dimension determines whether quality data actually reaches the people who need it. The best data in the world is worthless if decision-makers can't access it.
Compare: Accessibility vs. Timeliness—both affect whether data reaches users when needed, but timeliness is about data freshness while accessibility is about delivery mechanisms. Real-time data that's locked in a system only IT can query fails on accessibility, not timeliness.
| Concept | Best Examples |
|---|---|
| Intrinsic Quality | Accuracy, Validity, Precision |
| Structural Quality | Consistency, Integrity, Reliability |
| Contextual Quality | Completeness, Timeliness, Relevance |
| Accessibility Quality | Accessibility |
| ETL Validation Focus | Validity, Consistency, Completeness |
| Governance Priority | Integrity, Reliability, Accuracy |
| User-Facing Concerns | Timeliness, Accessibility, Relevance |
| Compliance-Related | Accuracy, Integrity, Relevance |
A company's CRM shows a customer's address as "123 Main St" while the billing system shows "123 Main Street, Suite 100." Which two dimensions are potentially compromised, and how would you distinguish between them?
Your sales dashboard shows last month's figures, but executives need to respond to a competitor's price change announced yesterday. Which dimension has failed, and what architectural change would address it?
Compare and contrast accuracy and precision using an example of customer age data. How might data be precise but inaccurate, or accurate but imprecise?
An FRQ describes a merger where two companies' product databases use different category taxonomies. Which dimensions are at risk, and what data management strategy would you recommend?
A data analyst complains that the information they need exists but is trapped in a legacy system requiring IT tickets to extract. Which dimension is failing, and how does this differ from a timeliness problem?