Fundamentals of Algorithm Design
An algorithm is a precise, finite sequence of instructions designed to solve a specific problem or perform a task. Algorithm design sits at the heart of computational thinking because it forces you to move from vague problem descriptions to exact, repeatable procedures. This section covers what makes an algorithm work, how to evaluate one, and the major strategies you'll encounter.
Definition and Purpose
Think of an algorithm as a blueprint: it takes defined inputs, follows unambiguous steps, and produces a predictable output. Algorithms bridge the gap between abstract math and practical computation. You'll find them everywhere, from cryptography to data analysis to route planning.
A valid algorithm must have:
- Defined inputs and outputs so you know exactly what goes in and what comes out
- Unambiguous instructions where each step has one clear meaning
- Finiteness, meaning it always terminates after a bounded number of steps
- Correctness, producing the right output for every valid input
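A toy example makes these properties concrete. The sketch below (the function name is ours, not from the text) finds the maximum of a list while exhibiting defined inputs/outputs, unambiguous steps, finiteness, and correctness:

```python
def find_max(values):
    """Return the largest element of a non-empty list.

    Defined input: a non-empty list of comparable values.
    Defined output: the maximum element.
    """
    if not values:
        raise ValueError("input must be non-empty")
    best = values[0]
    for v in values[1:]:   # finiteness: exactly one pass over the input
        if v > best:       # unambiguous: a single comparison per step
            best = v
    return best            # correctness: holds for every valid input
```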
Key Components
- Input specification defines the data or parameters the algorithm processes
- Output description states the expected results
- Control flow determines execution order: sequential (one step after another), conditional (if/else branching), or iterative (loops)
- Efficiency measures performance in terms of time and space, which becomes critical as input sizes grow
The Algorithmic Thinking Process
Designing an algorithm isn't just writing steps. It follows a structured process:
- Problem analysis: Break the complex issue into manageable pieces
- Abstraction: Identify the essential features and strip away irrelevant details
- Pattern recognition: Notice similarities to problems with known solutions
- Decomposition: Divide the main problem into smaller, independently solvable subtasks
- Design: Synthesize a step-by-step solution using appropriate techniques
- Implementation: Translate the algorithm into pseudocode or a programming language
- Testing and refinement: Verify correctness and improve efficiency
Algorithm Analysis
Algorithm analysis gives you a framework for comparing solutions before you commit to one. Without it, you're guessing which approach will scale. Quantitative analysis lets you predict how an algorithm will behave on large inputs.
Time Complexity
Time complexity measures how the running time of an algorithm grows as input size increases. It's expressed using asymptotic notation, which describes the growth rate rather than exact seconds.
- Best-case represents the minimum time for the most favorable input
- Average-case considers the expected time across typical inputs
- Worst-case provides an upper bound, guaranteeing the algorithm never takes longer than this
Common growth rates, from fastest to slowest: O(1), O(log n), O(n), O(n log n), O(n²), O(2ⁿ).
For example, doubling the input size for an O(n) algorithm roughly doubles the time. For an O(n²) algorithm, it quadruples the time. That difference matters enormously at scale.
Space Complexity
Space complexity quantifies how much memory an algorithm requires. It includes both the input space and any auxiliary space (temporary storage the algorithm creates).
- Space complexity uses the same asymptotic notation as time complexity
- In-place algorithms use only O(1) extra space, modifying data within the existing structure
- Recursive algorithms often have higher space complexity because each recursive call adds a frame to the call stack
- A common design decision is the time-space trade-off: you can sometimes speed up an algorithm by using more memory, or reduce memory use at the cost of slower execution
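The time-space trade-off can be illustrated with duplicate detection. Both functions below (our own hypothetical names) solve the same problem; one spends memory to save time:

```python
def has_duplicate_slow(items):
    # O(1) extra space, O(n^2) time: compare every pair
    n = len(items)
    for i in range(n):
        for j in range(i + 1, n):
            if items[i] == items[j]:
                return True
    return False

def has_duplicate_fast(items):
    # O(n) extra space, O(n) expected time: remember what we've seen
    seen = set()
    for x in items:
        if x in seen:
            return True
        seen.add(x)
    return False
```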
Big O Notation
Big O describes the upper bound of an algorithm's growth rate as input size approaches infinity. It focuses on the dominant term and drops constant factors and lower-order terms.
For instance, if an algorithm takes 3n² + 5n + 2 steps, its Big O is O(n²) because the n² term dominates for large n.
Common complexities ranked from most to least efficient:
| Big O | Name | Example |
|---|---|---|
| O(1) | Constant | Array index lookup |
| O(log n) | Logarithmic | Binary search |
| O(n) | Linear | Linear search |
| O(n log n) | Linearithmic | Merge sort |
| O(n²) | Quadratic | Bubble sort |
| O(2ⁿ) | Exponential | Brute-force subset enumeration |
Common Algorithm Paradigms
These paradigms are general strategies for attacking classes of problems. Recognizing which paradigm fits your problem is often the hardest and most important step.
Divide and Conquer
This paradigm splits a problem into smaller subproblems, solves each one recursively, then combines the results.
The three phases:
- Divide: Split the problem into smaller instances of the same problem
- Conquer: Solve each subproblem recursively (or directly if small enough)
- Combine: Merge the subproblem solutions into a solution for the original problem
Classic examples include merge sort (split array in half, sort each half, merge), quicksort (partition around a pivot, sort each partition), and binary search (eliminate half the search space each step).
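The three phases above can be sketched with merge sort, a minimal version written for clarity rather than performance:

```python
def merge_sort(arr):
    if len(arr) <= 1:                # conquer directly: already sorted
        return arr
    mid = len(arr) // 2
    left = merge_sort(arr[:mid])     # divide, then conquer each half
    right = merge_sort(arr[mid:])
    merged = []                      # combine: merge two sorted halves
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged
```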
Dynamic Programming
Dynamic programming (DP) solves problems where the same subproblems appear repeatedly. Instead of recomputing them, DP stores their solutions and reuses them.
Two requirements for DP to apply:
- Overlapping subproblems: The same smaller problems come up again and again
- Optimal substructure: An optimal solution to the whole problem can be built from optimal solutions to its subproblems
Two approaches exist:
- Bottom-up (tabulation): Build solutions starting from the smallest subproblems, filling in a table
- Top-down (memoization): Use recursion but cache results so each subproblem is solved only once
Applications include shortest path problems, sequence alignment, and the knapsack problem.
Note: Divide and conquer also breaks problems into subproblems, but DP is specifically for when those subproblems overlap. If subproblems are independent, divide and conquer is usually the better fit.
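Both DP approaches can be shown on the Fibonacci sequence, a standard example with overlapping subproblems (function names are ours):

```python
from functools import lru_cache

@lru_cache(maxsize=None)            # top-down: cache each subproblem's result
def fib_memo(n):
    if n < 2:
        return n
    return fib_memo(n - 1) + fib_memo(n - 2)

def fib_table(n):                   # bottom-up: fill a table smallest-first
    table = [0, 1] + [0] * max(0, n - 1)
    for i in range(2, n + 1):
        table[i] = table[i - 1] + table[i - 2]
    return table[n]
```

Without caching, the naive recursion recomputes the same subproblems exponentially many times; either approach reduces this to O(n) subproblem evaluations.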
Greedy Algorithms
A greedy algorithm makes the locally optimal choice at each step, hoping this leads to a globally optimal solution. It never reconsiders past choices.
Greedy works when the problem has:
- Greedy choice property: A globally optimal solution can be reached by making locally optimal choices
- Optimal substructure: An optimal solution contains optimal solutions to subproblems
Examples include Huffman coding (for data compression), Dijkstra's algorithm (shortest paths with non-negative weights), and job scheduling (minimizing total completion time).
Greedy algorithms are typically fast, but they don't always produce optimal solutions. You need to verify that the greedy choice property holds for your specific problem.
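A minimal sketch of the job scheduling example: the shortest-processing-time-first rule, a classic greedy strategy that minimizes total completion time (the function name is ours):

```python
def schedule_spt(durations):
    """Greedy: run the shortest job first; never reconsider the order."""
    order = sorted(durations)        # locally optimal choice at each step
    total = 0                        # sum of completion times
    t = 0                            # current clock
    for d in order:
        t += d
        total += t
    return order, total
```

For durations [3, 1, 2], running jobs in order [1, 2, 3] gives completion times 1, 3, 6, and this total (10) is provably the minimum over all orderings.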
Data Structures in Algorithms
The data structure you choose directly affects your algorithm's performance. A poor choice can turn an efficient algorithm into a slow one.
Arrays and Linked Lists
Arrays store elements in contiguous memory, giving you O(1) access to any element by index. This contiguous layout is also cache-friendly, which matters for real-world speed. Dynamic arrays (like Python lists or C++ vectors) resize automatically, with amortized O(1) insertion at the end.
Linked lists store elements as nodes, where each node holds data and a pointer to the next node. Insertion and deletion at the head are O(1), but accessing the i-th element requires walking through i nodes (O(n)). Doubly linked lists add a pointer to the previous node, enabling bidirectional traversal.
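A bare-bones singly linked list illustrating the trade-off described above (names are ours, not a standard library type):

```python
class Node:
    def __init__(self, value, next=None):
        self.value = value
        self.next = next

def prepend(head, value):
    # O(1): the new node simply points at the old head
    return Node(value, head)

def nth(head, i):
    # O(n): must walk i links from the head
    node = head
    for _ in range(i):
        node = node.next
    return node.value
```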

Stacks and Queues
Stacks follow Last-In-First-Out (LIFO) order. Push adds an element to the top; pop removes from the top. Both operations run in O(1). Stacks are used in function call management, expression evaluation, and backtracking.
Queues follow First-In-First-Out (FIFO) order. Enqueue adds to the back; dequeue removes from the front. Both are O(1). Queues appear in breadth-first search, task scheduling, and buffer management.
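In Python, both structures are available without writing any classes: a plain list works as a stack, and `collections.deque` gives O(1) operations at both ends for a queue (a list used as a queue would make dequeue O(n)):

```python
from collections import deque

stack = []
stack.append("a")        # push
stack.append("b")        # push
top = stack.pop()        # pop: returns "b" (LIFO)

queue = deque()
queue.append("a")        # enqueue
queue.append("b")        # enqueue
front = queue.popleft()  # dequeue: returns "a" (FIFO)
```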
Trees and Graphs
Trees are hierarchical structures where each node has zero or more children. Binary trees (at most two children per node) enable efficient searching and sorting. Balanced trees like AVL and Red-Black trees maintain O(log n) height, keeping operations fast even as data grows.
Graphs consist of vertices connected by edges and can represent complex relationships like social networks or road maps. Edges can be directed (one-way) or undirected (two-way). Graph algorithms solve problems like shortest paths, connectivity, and network flow.
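As a small illustration of graph traversal (and of the queue's role in breadth-first search mentioned earlier), here is BFS over an adjacency-list graph; the dict-of-lists representation and function name are our choices:

```python
from collections import deque

def bfs_order(graph, start):
    """Visit vertices reachable from start in breadth-first order.

    graph: dict mapping each vertex to a list of its neighbors.
    """
    visited = {start}
    order = []
    frontier = deque([start])
    while frontier:
        v = frontier.popleft()       # FIFO: closest vertices first
        order.append(v)
        for w in graph.get(v, []):
            if w not in visited:
                visited.add(w)
                frontier.append(w)
    return order
```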
Searching and Sorting Algorithms
Searching and sorting are among the most fundamental operations in computing. Nearly every application that handles data relies on them.
Linear vs. Binary Search
Linear search checks each element one by one. It works on unsorted data and runs in O(n) time. Simple but slow for large datasets.
Binary search requires sorted data but runs in O(log n). At each step, it compares the target to the middle element and eliminates half the remaining search space. For a sorted array of 1,000,000 elements, binary search needs at most about 20 comparisons, while linear search could need all 1,000,000.
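The halving step can be sketched in a few lines (the function name is ours; Python's standard library offers `bisect` for the same job):

```python
def binary_search(sorted_arr, target):
    """Return an index of target in sorted_arr, or -1 if absent."""
    lo, hi = 0, len(sorted_arr) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if sorted_arr[mid] == target:
            return mid
        elif sorted_arr[mid] < target:
            lo = mid + 1          # discard the lower half
        else:
            hi = mid - 1          # discard the upper half
    return -1
```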
Comparison-Based Sorting
These algorithms sort by comparing pairs of elements.
- Simple sorts like bubble sort, insertion sort, and selection sort run in O(n²) and are mainly useful for small datasets or nearly sorted data
- Efficient sorts like merge sort and quicksort achieve O(n log n) average-case complexity
- Heapsort provides worst-case O(n log n) with in-place sorting
- Timsort (used in Python and Java) adapts to partially sorted input for better real-world performance
There's a proven lower bound: no comparison-based sort can do better than Ω(n log n) comparisons in the worst case.
Non-Comparison Sorting
These algorithms avoid element-to-element comparisons entirely, which lets them beat the Ω(n log n) barrier under certain conditions.
- Counting sort tallies the frequency of each value to determine positions. Runs in O(n + k), where k is the range of values
- Radix sort processes elements digit by digit, suitable for fixed-length integers or strings
- Bucket sort distributes elements into buckets, sorts each bucket, then concatenates
These can achieve O(n) time but only work well when the input has a limited range or fixed-length keys.
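Counting sort is the simplest to sketch. This version assumes non-negative integers no larger than a known bound k (function name and signature are ours):

```python
def counting_sort(values, k):
    """Sort non-negative integers in range(k + 1) in O(n + k) time."""
    counts = [0] * (k + 1)
    for v in values:                 # tally each value's frequency
        counts[v] += 1
    out = []
    for v, c in enumerate(counts):   # emit each value in order, c times
        out.extend([v] * c)
    return out
```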
Optimization Techniques
Optimization is about finding the best solution from a set of possibilities. Many real-world problems have so many possible solutions that checking all of them is impractical.
Heuristics
Heuristics are rule-of-thumb strategies that find good (but not necessarily optimal) solutions quickly. They're especially valuable for NP-hard problems where no known algorithm finds the exact answer in polynomial time.
Examples include:
- Hill climbing: Repeatedly move to a better neighboring solution until no improvement is possible
- Simulated annealing: Like hill climbing but occasionally accepts worse solutions to escape local optima, inspired by the cooling of metals
- Genetic algorithms: Evolve a population of candidate solutions through selection, crossover, and mutation
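Hill climbing is simple enough to sketch directly. This toy version maximizes a one-dimensional integer function by moving to a better neighbor until none exists; all names and parameters are our illustrative choices:

```python
def hill_climb(f, x0, step=1, max_iters=1000):
    """Repeatedly move to a better neighbor; may stop at a local optimum."""
    x = x0
    for _ in range(max_iters):
        neighbors = [x - step, x + step]
        best = max(neighbors, key=f)
        if f(best) <= f(x):          # no better neighbor: stop here
            return x
        x = best
    return x
```

On a function with a single peak, such as f(x) = -(x - 3)², this finds the global maximum; on multimodal functions it can stall at whichever local peak is nearest to the start, which is exactly what simulated annealing's occasional downhill moves are designed to escape.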
Approximation Algorithms
Unlike heuristics, approximation algorithms come with provable guarantees. They produce solutions within a known factor of the optimal. For example, a 2-approximation algorithm for the traveling salesman problem guarantees its solution is at most twice the length of the optimal route.
These are used when exact solutions are too expensive to compute but you still need a performance guarantee.
Local vs. Global Optimization
Local optimization finds the best solution in the neighborhood of a current point. Methods like gradient descent and Newton's method are fast but can get stuck at local optima (a solution that's best nearby but not overall).
Global optimization searches the entire solution space. Methods like simulated annealing and genetic algorithms explore more broadly but take longer.
Hybrid approaches often work best: use global methods to find a promising region, then switch to local methods to refine the solution.
Algorithm Correctness
Writing an algorithm that seems to work isn't enough. You need to prove it produces the correct output for all valid inputs, not just the test cases you tried.
Proof Techniques
- Mathematical induction proves correctness for all input sizes by establishing a base case and an inductive step
- Loop invariants demonstrate that iterative algorithms maintain a true condition throughout execution
- Proof by contradiction assumes the algorithm is incorrect and derives a logical impossibility
- Testing complements formal proofs by checking specific inputs and edge cases, but testing alone can never prove correctness for all inputs
Loop Invariants
A loop invariant is a condition that holds true before the loop starts, remains true after each iteration, and still holds when the loop terminates. Proving a loop invariant has three parts:
- Initialization: Show the invariant is true before the first iteration
- Maintenance: Show that if the invariant is true before an iteration, it remains true after that iteration
- Termination: Show the loop ends, and the invariant at termination implies the algorithm's correctness
Loop invariants are commonly used to prove the correctness of sorting and searching algorithms.
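The three parts can be seen in a tiny example. The function below computes a sum while checking its invariant at every iteration (the in-loop `assert` is for illustration only; it makes the loop quadratic):

```python
def prefix_sum(values):
    """Invariant: after processing i elements, total == sum(values[:i])."""
    total = 0                        # initialization: sum of zero elements is 0
    for i, v in enumerate(values):
        total += v
        # maintenance: the invariant still holds after this iteration
        assert total == sum(values[: i + 1])
    return total                     # termination: total == sum(values)
```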

Inductive Reasoning
Mathematical induction proves statements about all natural numbers (or all input sizes):
- Base case: Prove the statement holds for the smallest value (often n = 0 or n = 1)
- Inductive step: Assume the statement holds for some value k, then prove it holds for k + 1
Strong induction is a variant where you assume the statement holds for all values up to k in order to prove it for k + 1. This is particularly useful for recursive algorithms, where a problem of size n might depend on subproblems of various sizes, not just size n - 1.
Advanced Algorithm Topics
These topics go beyond classical sequential algorithms and address the demands of modern computing environments.
Randomized Algorithms
Randomized algorithms incorporate random choices during execution to achieve better average-case performance or simpler implementations.
- Las Vegas algorithms always produce the correct answer but have randomized running time (e.g., randomized quicksort)
- Monte Carlo algorithms run in fixed time but have a small probability of producing an incorrect result (e.g., randomized primality testing)
Randomization is useful in cryptography, load balancing, and avoiding worst-case inputs that adversaries might exploit.
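A minimal Monte Carlo sketch: the Fermat primality test. It runs a fixed number of trials; "composite" verdicts are always correct, while "probably prime" carries a small error probability (and can be fooled systematically by Carmichael numbers, which is why practical libraries use Miller-Rabin instead):

```python
import random

def probably_prime(n, trials=20):
    """Monte Carlo (Fermat) test: False is certain, True is only probable."""
    if n < 4:
        return n in (2, 3)
    for _ in range(trials):
        a = random.randrange(2, n - 1)
        if pow(a, n - 1, n) != 1:    # Fermat witness: n is definitely composite
            return False
    return True                      # no witness found: probably prime
```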
Parallel Algorithms
Parallel algorithms execute multiple instructions simultaneously, exploiting multi-core processors or distributed systems. They divide problems into independent subtasks that run concurrently.
Techniques include divide-and-conquer parallelism and data parallelism (applying the same operation to many data elements at once). Parallel merge sort and bitonic sort are classic examples.
Challenges include load balancing (keeping all processors busy), communication overhead (cost of sharing data between processors), and synchronization (ensuring correct ordering of dependent operations).
Distributed Algorithms
Distributed algorithms run across multiple networked computers, each with limited local information. They must handle communication delays, partial failures, and the absence of a global clock.
Key applications include consensus algorithms (getting multiple machines to agree on a value), distributed hash tables for peer-to-peer data storage, and blockchain protocols for decentralized consensus. These are essential for cloud computing, large-scale web services, and IoT systems.
Real-World Applications
Machine Learning Algorithms
- Supervised learning (decision trees, support vector machines) learns from labeled data to make predictions
- Unsupervised learning (k-means clustering, principal component analysis) discovers patterns in unlabeled data
- Deep learning (convolutional neural networks, recurrent neural networks) handles complex tasks like image recognition and language processing
- Reinforcement learning trains agents to make decisions by rewarding desired behaviors
- Optimization algorithms like gradient descent and stochastic gradient descent are used to train all of these models
Cryptographic Algorithms
- Symmetric encryption (AES) uses a single shared key for both encryption and decryption
- Public-key cryptography (RSA, elliptic curve) uses a key pair for secure key exchange and digital signatures
- Hash functions (SHA-256) produce fixed-size outputs from arbitrary inputs, used for data integrity verification and password storage
- Key exchange protocols (Diffie-Hellman) let two parties establish a shared secret over an insecure channel
- Post-quantum cryptography is an active area developing algorithms resistant to quantum computer attacks
Network Routing Algorithms
- Dijkstra's algorithm finds shortest paths in graphs with non-negative edge weights
- Bellman-Ford algorithm handles graphs that include negative edge weights
- Distance vector routing has routers share routing tables with neighbors (used in RIP)
- Link-state routing has each router build a complete map of the network topology (used in OSPF)
- Border Gateway Protocol (BGP) manages routing between large autonomous systems across the internet
Ethical Considerations
As algorithms increasingly make decisions that affect people's lives, the ethical dimensions of algorithm design deserve serious attention.
Bias in Algorithm Design
Algorithmic bias can perpetuate or amplify existing societal prejudices. This often stems from training data bias, where historical data reflects past discrimination, and selection bias, where data collection doesn't represent all groups equally.
Mitigation strategies include data preprocessing to balance representation, fairness-aware learning algorithms, regular audits of algorithm outputs across demographic groups, and maintaining diverse development teams.
Privacy and Security Concerns
Algorithms processing personal data must comply with privacy regulations like GDPR and CCPA. Technical approaches to protecting privacy include:
- Differential privacy: Adding calibrated noise to query results so individual records can't be identified
- Secure multi-party computation: Multiple parties compute a function over their combined data without revealing their individual inputs
- Homomorphic encryption: Performing computations on encrypted data without ever decrypting it
- Anonymization and pseudonymization: Removing or replacing identifying information in datasets
Societal Impact of Algorithms
Algorithms now influence decisions in finance, healthcare, criminal justice, hiring, and many other domains. Key principles for responsible algorithm deployment include:
- Transparency: People affected by algorithmic decisions should be able to understand how those decisions are made
- Fairness: Algorithms should treat different demographic groups equitably
- Accountability: There should be clear responsibility when algorithmic decisions cause harm
- Human oversight: People should retain the ability to review and contest algorithmic decisions