upgrade
upgrade

📊Probability and Statistics

Types of Data

Study smarter with Fiveable

Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.

Get Started

Why This Matters

In Probability and Statistics, everything starts with understanding what kind of data you're working with. The type of data you have determines which statistical methods you can use, which graphs are appropriate, and which summary statistics make sense. You're being tested on your ability to classify data correctly and choose appropriate analyses—not just define terms. A common exam mistake is applying the wrong statistical test because a student misidentified their data type.

Think of data classification as a decision tree: first, is it qualitative or quantitative? If qualitative, is it nominal or ordinal? If quantitative, is it discrete or continuous, and does it have a true zero? These distinctions matter because they unlock or restrict what you can do mathematically. Don't just memorize definitions—know what operations each data type permits and why.


Qualitative Data: Describing Categories

Qualitative data describes attributes or characteristics rather than quantities. The key principle: you can count how many fall into each category, but you can't perform meaningful arithmetic on the categories themselves.

Nominal Data

  • No inherent order or ranking—categories are simply different from one another with no "greater than" or "less than" relationship
  • Only mode is meaningful as a measure of central tendency; mean and median cannot be calculated
  • Examples include blood type, eye color, and zip codes—even numeric codes like zip codes are nominal because arithmetic on them is meaningless

Ordinal Data

  • Categories have a meaningful order but the intervals between ranks are not necessarily equal
  • Median is appropriate but mean is problematic because you can't assume equal spacing between levels
  • Examples include survey responses (strongly disagree to strongly agree), letter grades, and socioeconomic status categories

Compare: Nominal vs. Ordinal—both are categorical, but ordinal has a logical sequence while nominal does not. If an FRQ asks which measure of center is appropriate, remember: nominal gets mode only, ordinal can use median.


Quantitative Data: Measuring Amounts

Quantitative data represents numerical values where arithmetic operations can be meaningful. The distinguishing feature: you can add, subtract, and calculate means with these values.

Discrete Data

  • Takes only specific, countable values—typically integers representing counts of items or occurrences
  • Cannot be subdivided into smaller meaningful units; you can't have 2.7 students
  • Examples include number of siblings, dice rolls, and defective items in a batch—anything you count rather than measure

Continuous Data

  • Can take any value within a range—theoretically infinite precision is possible
  • Measured rather than counted, limited only by the precision of your measuring instrument
  • Examples include height, time, and blood pressure—the value 5.2847 seconds is just as valid as 5 seconds

Compare: Discrete vs. Continuous—both are quantitative, but discrete data has gaps between possible values while continuous data can take any value in an interval. On exams, ask yourself: "Can this be 3.5?" If yes, it's continuous.


Measurement Scales: What Math Is Allowed?

The distinction between interval and ratio data determines which mathematical operations produce meaningful results. This matters because ratios and percentages only make sense when zero means "none."

Interval Data

  • Equal intervals between values but no true zero point—zero doesn't mean absence of the quantity
  • Ratios are not meaningful; 40°F40°F is not "twice as hot" as 20°F20°F
  • Examples include temperature in Celsius/Fahrenheit, calendar years, and IQ scores—you can say how much more, but not how many times more

Ratio Data

  • True zero point exists, meaning zero represents complete absence of the quantity
  • All mathematical operations are valid—you can meaningfully say someone earning $80,000\$80,000 makes twice as much as someone earning $40,000\$40,000
  • Examples include weight, height, age, and income—most physical measurements fall here

Compare: Interval vs. Ratio—both allow addition and subtraction, but only ratio data supports meaningful multiplication and division. Classic exam question: "Is 80°F80°F twice as warm as 40°F40°F?" No—temperature in Fahrenheit is interval, not ratio.


Data Collection Structure: When Was It Gathered?

How data is collected over time affects which analyses are appropriate. Time structure in your data determines whether you're comparing groups or tracking change.

Cross-Sectional Data

  • Snapshot at a single point in time across multiple subjects or groups
  • Useful for comparing different populations but cannot establish causation or trends
  • Examples include a survey of voter preferences before an election or comparing test scores across schools in one semester

Time Series Data

  • Collected repeatedly over time, often at regular intervals from the same source
  • Reveals trends, cycles, and seasonal patterns—essential for forecasting
  • Examples include daily stock prices, monthly unemployment rates, and annual GDP—any data tracked chronologically

Compare: Cross-sectional vs. Time Series—cross-sectional compares different subjects at one time, while time series tracks the same measure across time. FRQs often ask you to identify which type supports conclusions about trends (time series) versus group differences (cross-sectional).


Quick Reference Table

ConceptBest Examples
Nominal (categorical, no order)Blood type, eye color, zip code
Ordinal (categorical, ordered)Survey ratings, class rank, education level
Discrete (countable quantities)Number of children, dice outcomes, defects
Continuous (measurable quantities)Height, time, temperature
Interval (no true zero)Celsius temperature, IQ, calendar year
Ratio (true zero exists)Weight, income, distance, age
Cross-sectional (one time point)Election polls, census snapshots
Time series (over time)Stock prices, monthly sales, annual growth

Self-Check Questions

  1. A researcher records the number of text messages each student sends per day. Is this discrete or continuous data? What if they recorded the time spent texting instead?

  2. Which two data types both involve categories but differ in whether order matters? Give an example of each from a medical context.

  3. Temperature measured in Kelvin has a true zero (absolute zero). Is Kelvin temperature interval or ratio data? How does this differ from Celsius?

  4. A study compares income levels across five countries in 2024. Another study tracks one country's income from 2000–2024. Classify each data structure and explain what conclusions each can support.

  5. FRQ-style: A survey asks respondents to rate their satisfaction as very dissatisfied, dissatisfied, neutral, satisfied, or very satisfied. What type of data is this? Can you calculate a meaningful average? Justify your answer.