📊Probability and Statistics

Types of Data

Study smarter with Fiveable

Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.

Get Started

Why This Matters

In Probability and Statistics, everything starts with understanding what kind of data you're working with. The type of data you have determines which statistical methods you can use, which graphs are appropriate, and which summary statistics make sense. You're being tested on your ability to classify data correctly and choose appropriate analyses—not just define terms. A common exam mistake is applying the wrong statistical test because a student misidentified their data type.

Think of data classification as a decision tree: first, is it qualitative or quantitative? If qualitative, is it nominal or ordinal? If quantitative, is it discrete or continuous, and does it have a true zero? These distinctions matter because they unlock or restrict what you can do mathematically. Don't just memorize definitions—know what operations each data type permits and why.

Qualitative Data: Describing Categories

Qualitative data describes attributes or characteristics rather than quantities. The key principle: you can count how many fall into each category, but you can't perform meaningful arithmetic on the categories themselves.

Nominal Data

No inherent order or ranking—categories are simply different from one another with no "greater than" or "less than" relationship
Only mode is meaningful as a measure of central tendency; mean and median cannot be calculated
Examples include blood type, eye color, and zip codes—even numeric codes like zip codes are nominal because arithmetic on them is meaningless

Ordinal Data

Categories have a meaningful order but the intervals between ranks are not necessarily equal
Median is appropriate but mean is problematic because you can't assume equal spacing between levels
Examples include survey responses (strongly disagree to strongly agree), letter grades, and socioeconomic status categories

Compare: Nominal vs. Ordinal—both are categorical, but ordinal has a logical sequence while nominal does not. If an FRQ asks which measure of center is appropriate, remember: nominal gets mode only, ordinal can use median.

Quantitative Data: Measuring Amounts

Quantitative data represents numerical values where arithmetic operations can be meaningful. The distinguishing feature: you can add, subtract, and calculate means with these values.

Discrete Data

Takes only specific, countable values—typically integers representing counts of items or occurrences
Cannot be subdivided into smaller meaningful units; you can't have 2.7 students
Examples include number of siblings, dice rolls, and defective items in a batch—anything you count rather than measure

Continuous Data

Can take any value within a range—theoretically infinite precision is possible
Measured rather than counted, limited only by the precision of your measuring instrument
Examples include height, time, and blood pressure—the value 5.2847 seconds is just as valid as 5 seconds

Compare: Discrete vs. Continuous—both are quantitative, but discrete data has gaps between possible values while continuous data can take any value in an interval. On exams, ask yourself: "Can this be 3.5?" If yes, it's continuous.

Measurement Scales: What Math Is Allowed?

The distinction between interval and ratio data determines which mathematical operations produce meaningful results. This matters because ratios and percentages only make sense when zero means "none."

Interval Data

Equal intervals between values but no true zero point—zero doesn't mean absence of the quantity
Ratios are not meaningful; $40°F$ is not "twice as hot" as $20°F$
Examples include temperature in Celsius/Fahrenheit, calendar years, and IQ scores—you can say how much more, but not how many times more

Ratio Data

True zero point exists, meaning zero represents complete absence of the quantity
All mathematical operations are valid—you can meaningfully say someone earning $\$80,000$ makes twice as much as someone earning $\$40,000$
Examples include weight, height, age, and income—most physical measurements fall here

Compare: Interval vs. Ratio—both allow addition and subtraction, but only ratio data supports meaningful multiplication and division. Classic exam question: "Is $80°F$ twice as warm as $40°F$ ?" No—temperature in Fahrenheit is interval, not ratio.

Data Collection Structure: When Was It Gathered?

How data is collected over time affects which analyses are appropriate. Time structure in your data determines whether you're comparing groups or tracking change.

Cross-Sectional Data

Snapshot at a single point in time across multiple subjects or groups
Useful for comparing different populations but cannot establish causation or trends
Examples include a survey of voter preferences before an election or comparing test scores across schools in one semester

Time Series Data

Collected repeatedly over time, often at regular intervals from the same source
Reveals trends, cycles, and seasonal patterns—essential for forecasting
Examples include daily stock prices, monthly unemployment rates, and annual GDP—any data tracked chronologically

Compare: Cross-sectional vs. Time Series—cross-sectional compares different subjects at one time, while time series tracks the same measure across time. FRQs often ask you to identify which type supports conclusions about trends (time series) versus group differences (cross-sectional).

Quick Reference Table

Concept	Best Examples
Nominal (categorical, no order)	Blood type, eye color, zip code
Ordinal (categorical, ordered)	Survey ratings, class rank, education level
Discrete (countable quantities)	Number of children, dice outcomes, defects
Continuous (measurable quantities)	Height, time, temperature
Interval (no true zero)	Celsius temperature, IQ, calendar year
Ratio (true zero exists)	Weight, income, distance, age
Cross-sectional (one time point)	Election polls, census snapshots
Time series (over time)	Stock prices, monthly sales, annual growth

Self-Check Questions

A researcher records the number of text messages each student sends per day. Is this discrete or continuous data? What if they recorded the time spent texting instead?
Which two data types both involve categories but differ in whether order matters? Give an example of each from a medical context.
Temperature measured in Kelvin has a true zero (absolute zero). Is Kelvin temperature interval or ratio data? How does this differ from Celsius?
A study compares income levels across five countries in 2024. Another study tracks one country's income from 2000–2024. Classify each data structure and explain what conclusions each can support.
FRQ-style: A survey asks respondents to rate their satisfaction as very dissatisfied, dissatisfied, neutral, satisfied, or very satisfied. What type of data is this? Can you calculate a meaningful average? Justify your answer.

📊Probability and Statistics

Types of Data

Why This Matters

Qualitative Data: Describing Categories

Nominal Data

Ordinal Data

Quantitative Data: Measuring Amounts

Discrete Data

Continuous Data

Measurement Scales: What Math Is Allowed?

Interval Data

Ratio Data

Data Collection Structure: When Was It Gathered?

Cross-Sectional Data

Time Series Data

Quick Reference Table

Self-Check Questions

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes