SARS-CoV-2 5’UTR RNA Folding
ISEF Category: Biochemistry
Ready to Turn This Idea Into a Real Project?
This guide was put together with the help of AI research tools to give you a solid starting point. But a competitive science fair project lives in the details: refining your research question, fine-tuning your variables, analyzing your data, and presenting your findings like a seasoned scientist.
For next steps tailored to your interests, skill level, and timeline, work one-on-one with a MehtA+ mentor. Learn more about MehtA+ Science & Engineering Research Mentorship →
Subcategory: Other · Difficulty: Advanced · Setup: Home Setup · Time: 1 to 2 Months
The Hook
A tiny RNA hairpin can act like a gate for the ribosome. In SARS-CoV-2, the 5'UTR helps control how fast viral RNA gets translated into protein. You can test whether more stable RNA folds match lower translation output using public data and ViennaRNA.
What Is It?
Think of RNA like a string that folds back on itself, more like a bent paper clip than a straight wire. Secondary structure is the pattern of stems and loops, and ΔG is the energy score that estimates how easy that fold is to form. More negative ΔG usually means the structure is more stable.
The 5'UTR sits at the front of an RNA message, where the ribosome starts translation. In SARS-CoV-2, changes in this region can open or block access to the start site, which can shift how much protein gets made. ViennaRNA predicts those folds from sequence, and public ribosome-profiling data can show how much translation happened in real cells. Your project asks whether the predicted structure and the observed output move together.
Why This Is a Good Topic
This is a strong science fair topic because you can do real biology with public data and a laptop. The question is testable, because you can compare predicted folding scores against measured translation values across variants. It also connects to viral gene regulation, which gives your project a clear real-world angle. You can learn sequence analysis, structure prediction, and statistics without needing a wet lab.
Research Questions
- How does predicted minimum free energy change across reported SARS-CoV-2 5'UTR variants?
- What is the effect of stem-disrupting mutations on local ΔG in the 5'UTR?
- Does predicted RNA folding stability correlate with translation efficiency from public ribosome-profiling datasets?
- To what extent do synonymous changes near the start codon alter predicted accessibility of the translation initiation region?
- Which ViennaRNA settings produce the strongest agreement with reported translation efficiency values?
- How does the relationship between ΔG and translation efficiency change when you compare different public datasets?
Basic Materials
- Laptop or desktop computer with internet access.
- ViennaRNA Package installed locally or accessed through its web tools.
- Python or R for cleaning sequence tables and plotting results.
- Spreadsheet software for tracking sequences, variants, and translation values.
- Public SARS-CoV-2 5'UTR sequence data from NCBI and translation tables from article supplements or GEO.
Advanced Materials
- Linux workstation or campus cluster for batch folding runs.
- ViennaRNA Package with access to ensemble and accessibility commands.
- Raw or processed ribosome-profiling datasets from GEO or SRA.
- Reference genome and annotation files for SARS-CoV-2 and related coronaviruses.
- R or Python environment with regression, resampling, and plotting libraries.
Software & Tools
- ViennaRNA Package: Predicts RNA secondary structure and ΔG values for each variant.
- Python: Cleans sequence tables, runs batch folding, and merges structure scores with translation data.
- R: Fits correlations and regression models and makes publication-style plots.
- Jupyter Notebook: Keeps code, notes, and plots together for repeatable analysis.
Experiment Steps
- Define the exact 5'UTR variant set you will compare, and decide whether you want natural variants, designed mutants, or both.
- Choose one translation readout from public ribosome-profiling studies, and build a clean table that maps each sequence to its reported value.
- Run ViennaRNA on every sequence with the same settings, then decide which structure score you will compare, such as minimum free energy, ensemble diversity, or local accessibility.
- Plan a normalization strategy that controls for sequence length, GC content, and study source before you compare variants.
- Pick a statistical test or model that asks whether structure scores predict translation efficiency better than chance, then decide how you will visualize the relationship.
Common Pitfalls
- Mixing sequence variants from different papers without checking that they refer to the same 5'UTR region, which makes your comparison uneven.
- Comparing raw ΔG values across folds with different sequence lengths, which can make a longer RNA look more stable for the wrong reason.
- Ignoring GC content, which can hide whether structure or base composition drives the translation trend.
- Using one public translation dataset without checking its assay conditions, which can turn a cell-type effect into a false structure effect.
- Reading correlation as causation, which can make a predictive pattern look like proof that folding alone controls translation.
What Makes This Competitive
A stronger version does more than plot ΔG against translation. It tests multiple folding metrics, compares them against a null model, and checks whether the pattern survives after controlling for GC content and sequence context. You can raise the level by comparing several public datasets, splitting the data into training and validation sets, and asking whether one region of the 5'UTR predicts translation better than the full sequence. That kind of careful analysis looks much more like real research than a simple correlation plot.
Project Variations
- Compare wild-type and variant 5'UTR sequences from several SARS-CoV-2 lineages to see whether the same structural motif predicts translation in each group.
- Swap minimum free energy for base-pair accessibility near the start codon and test which metric tracks translation more closely.
- Compare SARS-CoV-2 with another coronavirus 5'UTR to see whether the structure-translation link holds across related viruses.
Learn More
- ViennaRNA Package: Find the official user guide, command-line tools, and reference papers on the ViennaRNA website.
- NCBI GEO: Search public ribosome-profiling datasets and processed tables.
- NCBI SRA: Find raw sequencing reads linked from published studies.
- PubMed: Search review articles on SARS-CoV-2 5'UTR structure, ribosome profiling, and translation control.
- Nucleic Acids Research: Search the journal site for papers on RNA secondary structure, accessibility, and viral translation.
