Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Image enhancement sits at the foundation of nearly every computer vision pipeline you'll encounter. Before any algorithm can detect objects, recognize faces, or segment scenes, the input image often needs preprocessing to correct for poor lighting, sensor noise, or low contrast. You're being tested on your understanding of how these techniques manipulate pixel values and when to apply each oneโnot just what they're called.
The techniques in this guide demonstrate core principles: intensity transformations, spatial domain operations, frequency domain analysis, and adaptive processing. Exam questions will ask you to select the appropriate technique for a given scenario, explain the mathematical basis behind a method, or compare approaches for noise reduction versus edge preservation. Don't just memorize definitionsโknow what problem each technique solves and the tradeoffs involved.
These methods operate directly on pixel values using mathematical functions, transforming the input intensity to an output intensity without considering neighboring pixels. The key principle: a point operation applies the same transformation function to every pixel independently.
Compare: Histogram Equalization vs. Contrast Stretchingโboth improve contrast, but histogram equalization adapts to the image's actual distribution while contrast stretching applies a fixed linear mapping. If an exam question mentions "adaptive contrast improvement," histogram equalization is your answer.
Spatial filters modify pixel values based on the values of neighboring pixels within a defined kernel or window. The underlying mechanism: convolution of the image with a filter mask determines whether you smooth, sharpen, or detect features.
Compare: Median Filter vs. Gaussian Filterโboth reduce noise, but median filtering is nonlinear and excels at removing salt-and-pepper noise while preserving edges. Gaussian filtering is linear and better for Gaussian-distributed noise but blurs edges. FRQs often ask which filter to choose for a specific noise type.
Edge detection identifies locations where intensity changes rapidly, marking boundaries between regions. The mathematical basis: edges correspond to high values of the first derivative (gradient) or zero-crossings of the second derivative (Laplacian).
Compare: Global Thresholding vs. Adaptive Thresholdingโglobal works when lighting is uniform, but adaptive thresholding handles shadows and gradients by computing thresholds from local neighborhoods. If an exam scenario mentions "varying illumination," adaptive is the correct choice.
Frequency domain methods transform images using the Fourier Transform, allowing manipulation of specific frequency components. The core insight: low frequencies carry overall structure and smooth variations; high frequencies encode edges, noise, and fine details.
Compare: Spatial Domain vs. Frequency Domain Filteringโspatial filtering is intuitive and efficient for small kernels, while frequency domain filtering excels for large kernels and allows precise control over which frequencies to modify. Know that convolution in spatial domain equals multiplication in frequency domain.
| Concept | Best Examples |
|---|---|
| Point operations (intensity transforms) | Histogram Equalization, Contrast Stretching, Gamma Correction |
| Linear spatial filtering | Gaussian Smoothing, Mean Filter, Sharpening Kernels |
| Nonlinear spatial filtering | Median Filter, Adaptive Filtering |
| Edge detection | Sobel, Canny, Prewitt |
| Segmentation | Image Thresholding, Otsu's Method |
| Frequency domain | Fourier Transform, Low-pass/High-pass Filters |
| Detail enhancement | Unsharp Masking, High-pass Filtering |
| Noise-specific solutions | Median (salt-and-pepper), Gaussian filter (Gaussian noise) |
Which two techniques both improve contrast but differ in whether they adapt to the image's histogram distribution? Explain when you'd choose one over the other.
You're given an image corrupted by salt-and-pepper noise. Compare the effectiveness of mean filtering versus median filtering, and justify which you'd select.
Explain how unsharp masking achieves sharpening without directly computing derivatives. What parameter controls the strength of the effect?
An FRQ presents an image with uneven lighting across the frame. Compare global thresholding with adaptive thresholdingโwhich would you recommend and why?
Describe the relationship between spatial domain convolution and frequency domain multiplication. Why might you choose frequency domain filtering for a very large smoothing kernel?