Fiveable

👁️Computer Vision and Image Processing Unit 12 Review

QR code for Computer Vision and Image Processing practice questions

12.2 Medical imaging

👁️Computer Vision and Image Processing
Unit 12 Review

12.2 Medical imaging

Written by the Fiveable Content Team • Last updated September 2025
Written by the Fiveable Content Team • Last updated September 2025
👁️Computer Vision and Image Processing
Unit & Topic Study Guides

Medical imaging is a critical application of computer vision and image processing in healthcare. It combines physics, mathematics, and computer science to create visual representations of internal body structures, enabling non-invasive diagnosis, treatment planning, and monitoring of various medical conditions.

This field encompasses various imaging types, including X-ray, CT, MRI, ultrasound, and nuclear medicine. Each modality requires specialized acquisition techniques, digital representation methods, and processing algorithms to extract valuable diagnostic information and support clinical decision-making.

Fundamentals of medical imaging

  • Medical imaging serves as a crucial component in computer vision and image processing applications within healthcare
  • Integrates principles of physics, mathematics, and computer science to create visual representations of internal body structures
  • Enables non-invasive diagnosis, treatment planning, and monitoring of various medical conditions

Types of medical imaging

  • X-ray imaging uses high-energy electromagnetic radiation to produce 2D images of internal structures
  • Computed Tomography (CT) combines multiple X-ray images to create detailed cross-sectional views
  • Magnetic Resonance Imaging (MRI) utilizes strong magnetic fields and radio waves to generate high-resolution images of soft tissues
  • Ultrasound imaging employs high-frequency sound waves to visualize internal organs and blood flow
  • Nuclear medicine imaging (PET, SPECT) uses radioactive tracers to highlight specific physiological processes

Image acquisition techniques

  • Projection radiography captures 2D images by passing X-rays through the body onto a detector
  • Tomographic imaging reconstructs 3D images from multiple 2D projections taken at different angles
  • Emission tomography detects gamma rays emitted by radioactive tracers injected into the body
  • Diffusion-weighted imaging measures the movement of water molecules in tissues to assess cellular structure
  • Functional imaging techniques (fMRI, PET) visualize metabolic activity and brain function

Digital image representation

  • Pixels represent the smallest units of a digital image, storing intensity or color information
  • Voxels extend pixels to 3D space, representing volume elements in tomographic imaging
  • Bit depth determines the number of possible intensity values for each pixel (8-bit, 16-bit)
  • Image resolution defines the number of pixels per unit area, affecting image detail and file size
  • DICOM (Digital Imaging and Communications in Medicine) standardizes the format for storing and transmitting medical images

Image enhancement for diagnosis

  • Image enhancement techniques improve the visual quality and diagnostic value of medical images
  • Plays a crucial role in computer vision algorithms for automated analysis and detection
  • Facilitates more accurate interpretation of medical images by healthcare professionals

Contrast adjustment methods

  • Histogram equalization redistributes pixel intensities to enhance overall image contrast
  • Adaptive histogram equalization applies contrast enhancement locally to different regions of the image
  • Gamma correction adjusts the brightness and contrast of an image using a non-linear transformation
  • Window-level adjustment optimizes the display of specific intensity ranges in medical images
  • Unsharp masking enhances edge contrast by subtracting a blurred version of the image from the original

Noise reduction techniques

  • Gaussian filtering applies a weighted average to smooth out random noise in images
  • Median filtering replaces pixel values with the median of neighboring pixels, effective for salt-and-pepper noise
  • Anisotropic diffusion reduces noise while preserving important edges and structures
  • Wavelet denoising decomposes the image into multiple frequency bands and applies thresholding to remove noise
  • Non-local means filtering averages similar patches across the image to reduce noise while preserving details

Edge detection in medical images

  • Gradient-based methods (Sobel, Prewitt) compute intensity changes to identify edges
  • Laplacian of Gaussian (LoG) detects edges by finding zero-crossings in the second derivative of the image
  • Canny edge detection combines multiple steps to provide accurate and thin edges
  • Active contours (snakes) evolve a curve to fit object boundaries in medical images
  • Phase congruency edge detection utilizes phase information to detect edges invariant to image contrast

Segmentation in medical imaging

  • Segmentation divides medical images into distinct regions or structures of interest
  • Crucial for quantitative analysis, 3D visualization, and computer-aided diagnosis in medical imaging
  • Enables automated measurement of organ volumes, tumor sizes, and other anatomical features

Region-based segmentation

  • Region growing expands from seed points to segment connected areas with similar properties
  • Split-and-merge techniques recursively divide and combine regions based on homogeneity criteria
  • Watershed segmentation treats the image as a topographic surface and floods it to separate regions
  • Fuzzy c-means clustering groups pixels into segments based on fuzzy set theory
  • Graph-cut segmentation formulates image segmentation as an energy minimization problem

Threshold-based segmentation

  • Global thresholding separates foreground from background using a single intensity threshold
  • Otsu's method automatically determines the optimal threshold by maximizing inter-class variance
  • Adaptive thresholding applies different thresholds to various parts of the image based on local statistics
  • Multi-thresholding segments the image into multiple classes using multiple threshold values
  • Hysteresis thresholding uses two thresholds to reduce noise and improve edge connectivity

Atlas-based segmentation

  • Utilizes a pre-labeled atlas (template) to guide the segmentation of new images
  • Registration aligns the atlas with the target image to transfer labels
  • Multi-atlas segmentation combines information from multiple atlases to improve accuracy
  • Probabilistic atlas-based methods incorporate statistical information about anatomical variability
  • Patch-based segmentation uses local similarity between atlas and target image patches

3D reconstruction techniques

  • 3D reconstruction creates volumetric representations from 2D medical image slices
  • Essential for visualizing complex anatomical structures and planning surgical procedures
  • Integrates computer vision algorithms for accurate spatial representation of medical data

Volume rendering

  • Ray casting projects rays through the volume to create 2D projections of 3D data
  • Maximum Intensity Projection (MIP) displays the highest intensity voxels along each ray
  • Transfer functions map voxel intensities to colors and opacities for enhanced visualization
  • Shading techniques (Phong, Blinn-Phong) add depth and realism to volume-rendered images
  • GPU-accelerated volume rendering utilizes graphics hardware for real-time interactive visualization
Types of medical imaging, Frontiers | Hybrid Imaging: Instrumentation and Data Processing | Physics

Surface rendering

  • Marching cubes algorithm extracts isosurfaces from volumetric data
  • Mesh simplification reduces the complexity of 3D models while preserving important features
  • Texture mapping applies 2D images onto 3D surfaces to enhance visual realism
  • Smooth shading techniques (Gouraud, Phong) interpolate surface normals for improved appearance
  • Ambient occlusion simulates soft shadows to enhance depth perception in 3D renderings

Multiplanar reconstruction

  • Orthogonal plane reconstruction displays axial, sagittal, and coronal views of 3D data
  • Oblique plane reconstruction allows visualization of arbitrary slices through the volume
  • Curved planar reformation follows curved anatomical structures (blood vessels)
  • Maximum Intensity Projection (MIP) slab rendering combines multiple slices for enhanced visualization
  • Minimum Intensity Projection (MinIP) highlights low-density structures (lung airways)

Medical image registration

  • Image registration aligns multiple medical images to a common coordinate system
  • Crucial for comparing images from different modalities, time points, or patients
  • Enables fusion of complementary information from various imaging techniques

Rigid vs non-rigid registration

  • Rigid registration applies global transformations (translation, rotation) to align images
  • Affine registration extends rigid registration with scaling and shearing transformations
  • Non-rigid registration allows local deformations to account for tissue elasticity and anatomical variations
  • Deformable models (B-splines, thin-plate splines) represent complex non-rigid transformations
  • Diffeomorphic registration ensures smooth, invertible transformations between images

Feature-based registration

  • Landmark-based registration aligns images using corresponding points identified by experts
  • Scale-Invariant Feature Transform (SIFT) detects and matches distinctive image features
  • Speeded Up Robust Features (SURF) provides a faster alternative to SIFT for feature detection
  • Mutual Information (MI) measures the statistical dependency between image intensities
  • Normalized Cross-Correlation (NCC) quantifies the similarity between image patches

Intensity-based registration

  • Sum of Squared Differences (SSD) minimizes the intensity differences between aligned images
  • Correlation Coefficient (CC) measures the linear relationship between image intensities
  • Mutual Information (MI) maximizes the shared information between images
  • Normalized Mutual Information (NMI) provides robustness to changes in image overlap
  • Demons algorithm uses optical flow principles for non-rigid registration

Computer-aided diagnosis (CAD)

  • CAD systems assist radiologists in detecting and characterizing abnormalities in medical images
  • Integrates computer vision and machine learning techniques to improve diagnostic accuracy
  • Reduces the workload on radiologists and helps prioritize cases for review

Detection of abnormalities

  • Lung nodule detection in chest CT scans uses segmentation and shape analysis
  • Mammographic mass detection employs texture analysis and machine learning classifiers
  • Brain tumor detection in MRI utilizes multi-modal image analysis and deep learning
  • Bone fracture detection combines edge detection and pattern recognition techniques
  • Retinal abnormality detection analyzes fundus images using image processing and AI algorithms

Classification of lesions

  • Benign vs. malignant tumor classification uses texture features and machine learning
  • Alzheimer's disease classification analyzes brain MRI volumes and cortical thickness
  • Skin lesion classification employs dermoscopic image analysis and convolutional neural networks
  • Breast cancer subtype classification integrates genomic data with imaging features
  • Pulmonary embolism classification analyzes CT angiography images using deep learning

Quantitative analysis techniques

  • Tumor volume measurement uses segmentation and 3D reconstruction techniques
  • Bone density assessment analyzes X-ray attenuation in dual-energy X-ray absorptiometry (DXA)
  • Cardiac function analysis measures left ventricular volumes and ejection fraction in cardiac MRI
  • Brain atrophy quantification tracks changes in brain volume over time in neurodegenerative diseases
  • Perfusion analysis calculates blood flow parameters from dynamic contrast-enhanced imaging

Machine learning in medical imaging

  • Machine learning algorithms learn patterns from large datasets of medical images
  • Enables automated analysis, classification, and prediction in various medical imaging tasks
  • Continually improves as more data becomes available and algorithms are refined

Supervised vs unsupervised learning

  • Supervised learning trains models on labeled data to predict outcomes or classify new instances
  • Unsupervised learning discovers patterns and structures in unlabeled data
  • Semi-supervised learning combines labeled and unlabeled data to improve model performance
  • Reinforcement learning optimizes decision-making processes through trial and error
  • Active learning selectively queries experts to label the most informative samples

Convolutional neural networks

  • Convolutional layers extract hierarchical features from medical images
  • Pooling layers reduce spatial dimensions and provide translation invariance
  • Fully connected layers combine high-level features for classification or regression
  • Transfer learning adapts pre-trained networks to specific medical imaging tasks
  • Data augmentation techniques (rotation, scaling, flipping) increase training dataset diversity
Types of medical imaging, How machine learning will transform the way we look at medical images

Transfer learning for medical images

  • Fine-tuning adapts pre-trained networks to specific medical imaging tasks
  • Feature extraction uses pre-trained networks as fixed feature extractors
  • Domain adaptation addresses differences between source and target domains
  • Multi-task learning leverages shared representations across related medical imaging tasks
  • Few-shot learning enables learning from limited labeled medical image data

Modality-specific processing

  • Each imaging modality requires specialized processing techniques to extract relevant information
  • Integrates physics principles and image formation models specific to each modality
  • Enables optimal visualization and analysis of different anatomical structures and pathologies

X-ray image processing

  • Scatter correction removes the effects of scattered radiation to improve image contrast
  • Bone suppression enhances soft tissue visibility in chest radiographs
  • Dual-energy subtraction separates bone and soft tissue components
  • Image stitching combines multiple X-ray images to create full-body radiographs
  • Tomosynthesis reconstruction creates pseudo-3D images from limited-angle projections

CT image analysis

  • Hounsfield unit calibration ensures consistent intensity values across different CT scanners
  • Beam hardening correction reduces artifacts caused by polychromatic X-ray spectra
  • Iterative reconstruction algorithms improve image quality and reduce radiation dose
  • Dual-energy CT analysis differentiates materials based on their attenuation properties
  • Perfusion CT analysis quantifies blood flow parameters from dynamic contrast-enhanced scans

MRI data processing

  • B0 field inhomogeneity correction improves image uniformity
  • Motion correction reduces artifacts caused by patient movement during scanning
  • Diffusion tensor imaging (DTI) analyzes water diffusion to map white matter tracts
  • Functional MRI (fMRI) processing detects brain activation patterns
  • Quantitative susceptibility mapping (QSM) measures tissue magnetic susceptibility

Medical image compression

  • Compression reduces the storage and transmission requirements for large medical image datasets
  • Balances the trade-off between file size reduction and preservation of diagnostic information
  • Crucial for efficient storage, retrieval, and sharing of medical images in clinical workflows

Lossless vs lossy compression

  • Lossless compression preserves all original image information (ZIP, JPEG-LS)
  • Lossy compression achieves higher compression ratios at the cost of some information loss (JPEG, JPEG2000)
  • Near-lossless compression allows small, controlled deviations from the original image
  • Region of Interest (ROI) coding applies different compression levels to different image regions
  • Scalable compression enables progressive reconstruction of images at multiple quality levels

DICOM file format

  • Stores medical images along with associated metadata (patient information, acquisition parameters)
  • Supports various image types (CT, MRI, ultrasound) and modalities
  • Includes a header with metadata and a data element containing the pixel data
  • Allows for multi-frame storage of image sequences (cine loops, 3D volumes)
  • Supports both uncompressed and compressed (lossless and lossy) image storage

Compression standards for medical images

  • JPEG2000 provides superior compression performance and scalability for medical images
  • JPEG-LS offers efficient lossless and near-lossless compression for medical imaging
  • H.265/HEVC enables high-efficiency compression of medical video sequences
  • DICOM native formats include Run-Length Encoding (RLE) for lossless compression
  • DEFLATE algorithm (used in ZIP) provides lossless compression for DICOM files

Ethical considerations

  • Ethical considerations in medical imaging ensure patient safety, privacy, and fair treatment
  • Addresses the responsible development and deployment of AI in healthcare
  • Balances the benefits of advanced imaging technologies with potential risks and biases

Patient privacy and data security

  • De-identification removes personally identifiable information from medical images
  • Encryption protects patient data during storage and transmission
  • Access control mechanisms restrict image access to authorized personnel
  • Audit trails track all accesses and modifications to medical image data
  • Secure data sharing protocols enable collaborative research while protecting patient privacy

Bias in medical image analysis

  • Dataset bias can lead to poor performance on underrepresented patient populations
  • Algorithm bias may perpetuate or amplify existing healthcare disparities
  • Fairness metrics assess and mitigate biases in medical image analysis algorithms
  • Diverse and representative training data improves algorithm generalization
  • Explainable AI techniques provide transparency in medical image analysis decisions

Regulatory compliance in healthcare

  • HIPAA (Health Insurance Portability and Accountability Act) governs patient data privacy in the US
  • GDPR (General Data Protection Regulation) regulates data protection and privacy in the EU
  • FDA (Food and Drug Administration) oversees the approval of medical imaging devices and software
  • CE marking ensures compliance with EU health, safety, and environmental protection standards
  • ISO 13485 specifies quality management system requirements for medical devices