Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Cloud storage isn't just about "where your data lives"—it's about matching storage architecture to workload requirements. Exam questions will test whether you understand the tradeoffs between latency, durability, cost, and access patterns. You'll need to recognize when an application needs block storage's raw performance versus object storage's infinite scalability, or why a company might choose cold storage over hot storage for compliance data.
The key insight here is that storage types exist on spectrums: access frequency (hot vs. cold), data structure (structured vs. unstructured), and persistence (durable vs. ephemeral). Don't just memorize definitions—know what concept each storage type illustrates and when an architect would choose one over another.
How data is organized determines which storage type fits best. Unstructured data (images, logs, backups) needs flexible schemas, while structured data (transactions, user records) requires rigid organization for query efficiency.
Compare: Object Storage vs. File Storage—both store unstructured data, but object storage uses flat namespaces with API access while file storage maintains hierarchical paths with protocol-based access. If an exam question mentions "legacy application migration," file storage is usually the answer.
Access frequency is the primary cost driver in cloud storage. Architects must predict how often data will be read or written to select the appropriate storage tier.
Compare: Cold Storage vs. Archive Storage—both target infrequent access, but archive storage assumes even rarer retrieval (annually vs. monthly) with longer restoration times. FRQ tip: if the scenario mentions "regulatory compliance" or "7-year retention," archive storage is your answer.
Some workloads demand raw I/O performance that only certain storage architectures can deliver. Block storage operates at the lowest level, providing the speed databases and VMs require.
Compare: Block Storage vs. Ephemeral Storage—both offer high performance, but block storage persists independently of compute instances while ephemeral storage vanishes when the instance stops. Exam questions often test whether you'd use ephemeral storage for a database (you wouldn't—data would be lost).
Modern architectures often span multiple locations or combine cloud with on-premises infrastructure. These storage types address geographic distribution and hybrid requirements.
Compare: CDN vs. Hybrid Storage—CDN distributes copies of content for read performance, while hybrid storage manages primary data placement across environments. CDN is about latency reduction; hybrid storage is about infrastructure flexibility.
| Concept | Best Examples |
|---|---|
| Unstructured data at scale | Object Storage, File Storage |
| High-performance workloads | Block Storage, Hot Storage, Ephemeral Storage |
| Cost-optimized retention | Cold Storage, Archive Storage |
| Structured data management | Database Storage |
| Geographic distribution | CDN |
| Multi-environment flexibility | Hybrid Storage |
| Temporary/disposable data | Ephemeral Storage |
| Legacy application support | File Storage (NFS/SMB protocols) |
Which two storage types would you compare when deciding how to store millions of user-uploaded images that need web API access—and what's the key differentiator?
A financial services company must retain transaction records for 7 years but only accesses them during annual audits. Which storage type is most cost-effective, and why wouldn't hot storage be appropriate?
Compare and contrast block storage and object storage: what type of workload suits each, and how do their access methods differ?
An application uses temporary files for video transcoding that can be regenerated if lost. The files need maximum I/O speed. Which storage type fits, and what's the critical tradeoff the architect accepts?
A company wants to migrate to the cloud gradually while keeping latency-sensitive data on-premises. Which storage approach supports this, and how does it differ from using a CDN?