Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Protein structure prediction sits at the heart of computational genomics because structure determines function. Whether you're analyzing disease-causing mutations, designing therapeutics, or understanding evolutionary relationships, you need to know how a protein folds and why that shape matters. These tools represent fundamentally different computational approaches—from sequence alignment to neural networks to physics-based simulations—and understanding when to use each method is critical for any genomics workflow.
You're being tested not just on what these tools do, but on the underlying principles that make them work: homology modeling, threading, ab initio prediction, and deep learning architectures. Don't just memorize tool names—know what type of prediction each tool performs, what input it requires, and when it's the right choice for your research question.
These tools leverage the principle that evolutionarily related proteins share similar structures. By identifying sequence similarity, we can infer structural and functional relationships without directly modeling 3D coordinates.
Compare: BLAST vs. HHpred—both identify homologous sequences, but HHpred uses profile-profile comparisons that detect more distant evolutionary relationships. When BLAST returns no significant hits, HHpred should be your next step.
Before modeling full 3D structures, predicting local structural elements (alpha-helices, beta-sheets, coils) provides crucial constraints. Neural networks trained on solved structures can predict these elements from sequence alone with ~80% accuracy.
When a template structure exists (typically >30% sequence identity), homology modeling produces the most reliable predictions. These methods align target sequences to templates and transfer structural coordinates.
Compare: MODELLER vs. SWISS-MODEL—both perform homology modeling, but MODELLER offers fine-grained control for expert users while SWISS-MODEL prioritizes automation. For quick exploratory modeling, use SWISS-MODEL; for publication-quality models requiring custom restraints, use MODELLER.
For proteins with no detectable homologs, threading (fold recognition) and ab initio methods attempt to predict structure from physical and statistical principles alone.
Compare: I-TASSER vs. RaptorX—both handle difficult targets, but I-TASSER relies more heavily on fragment assembly while RaptorX emphasizes contact prediction via deep learning. RaptorX often performs better on proteins with few homologs; I-TASSER excels when partial templates exist.
These approaches model protein folding using energy functions (Rosetta) or learned representations (AlphaFold) that capture the fundamental physics and patterns of protein architecture.
Compare: Rosetta vs. AlphaFold—Rosetta uses physics-based energy functions requiring significant computational resources, while AlphaFold uses deep learning for rapid, highly accurate predictions. For pure structure prediction, AlphaFold is now the default; for protein design and engineering tasks, Rosetta remains essential.
| Concept | Best Examples |
|---|---|
| Sequence similarity search | BLAST, HHpred |
| Secondary structure prediction | PSIPRED |
| Homology/comparative modeling | MODELLER, SWISS-MODEL, Phyre2 |
| Threading/fold recognition | I-TASSER, Phyre2, RaptorX |
| Ab initio prediction | I-TASSER, Rosetta |
| Deep learning approaches | AlphaFold, RaptorX |
| Protein design & engineering | Rosetta |
| Remote homology detection | HHpred, Phyre2 |
Which two tools would you use sequentially if BLAST returns no significant hits but you suspect your protein has a known fold? What makes them more sensitive than BLAST?
Compare and contrast homology modeling (SWISS-MODEL) with threading (I-TASSER)—when is each approach appropriate, and what determines which you should choose?
A colleague's AlphaFold model shows low pLDDT scores (<50) in a 40-residue region. What does this likely indicate about that region, and what tool might help characterize it further?
You need to predict how a point mutation affects protein stability and want to model alternative conformations. Which tool is best suited for this task, and why?
Rank these scenarios by expected model accuracy and explain your reasoning: (a) 60% sequence identity to a solved structure, (b) no detectable homologs but strong coevolutionary signal, (c) 25% identity to a distant homolog detected only by HHpred.