Comparative Genomics of Longevity Gene Loss

Comparative Genomics of Longevity Gene Loss

ISEF Category: Computational Biology and Bioinformatics

Ready to Turn This Idea Into a Real Project?

This guide was put together with the help of AI research tools to give you a solid starting point. But a competitive science fair project lives in the details: refining your research question, fine-tuning your variables, analyzing your data, and presenting your findings like a seasoned scientist.

For next steps tailored to your interests, skill level, and timeline, work one-on-one with a MehtA+ mentor. Learn more about MehtA+ Science & Engineering Research Mentorship →

Subcategory: Computational Evolutionary Biology  ·  Difficulty: Advanced  ·  Setup: University Lab  ·  Time: Full Year

The Hook

Some animals live fast and die young, while others seem to ignore the clock. Whales, bats, and naked mole-rats all break the usual rules in different ways. You can ask whether their genomes show a shared pattern in DNA-repair genes. That kind of pattern hunt can turn a long shot into a sharp research question.

What Is It?

This project asks whether long-lived mammals show convergent losses, or repeated dropping of the same genes, in DNA-repair pathways. Think of DNA-repair genes like a maintenance crew for a building. If a crew member leaves, the building may run on a different schedule, and that change could affect aging, cancer risk, or stress resistance.

You would compare genomes from long-lived mammals, such as whales, bats, and naked mole-rats, with genomes from closer short-lived relatives. The goal is not to prove a gene causes longevity. The goal is to find candidate genes or pathways that stand out across species, then rank them with clear rules. That kind of analysis uses comparative genomics, which means comparing DNA across species to look for shared changes.

Why This Is a Good Topic

This makes a strong science fair topic because you can ask a real evolutionary question with public data and a clear analysis plan. The project connects to aging, cancer biology, and DNA maintenance, so the real-world stakes are easy to explain. You can learn how to compare sequences, build a phylogenetic context, and filter noisy hits into a shortlist of candidates. That gives you a real research workflow, not just a search-and-copy project.

Research Questions

  • How does lifespan group affect the number of apparent DNA-repair gene losses across mammal genomes?
  • What is the effect of using different short-lived reference species on the list of candidate longevity genes?
  • Does convergent gene loss appear more often in DNA-repair pathways than in other housekeeping pathways?
  • To what extent do whales, bats, and naked mole-rats share the same missing or disrupted DNA-repair genes?
  • Which DNA-repair genes stay intact in long-lived mammals but are lost in short-lived relatives?
  • How does stricter filtering of genome annotation quality change the ranking of candidate longevity targets?

Basic Materials

  • Laptop with reliable internet access.
  • Free storage space for genome files and notes.
  • Spreadsheet software such as Google Sheets or LibreOffice Calc.
  • Reference genome browser access through Ensembl or UCSC Genome Browser.
  • PubMed access for reading review articles on longevity and DNA repair.
  • A notebook or digital document for tracking species, gene names, and inclusion rules.

Advanced Materials

  • High-performance laptop or desktop with enough memory for large genome files.
  • Access to command-line bioinformatics tools on a local machine or server.
  • Reference genome assemblies for target and control mammals.
  • Gene annotation files in GFF or GTF format.
  • Protein and nucleotide sequence databases from NCBI and Ensembl.
  • Phylogenetic analysis software such as MEGA or IQ-TREE.
  • R or Python for filtering, visualization, and statistics.

Software & Tools

  • Ensembl Genome Browser: Finds orthologs, annotations, and comparative genomic data for many mammals.
  • UCSC Genome Browser: Lets you inspect gene neighborhoods and sequence context across species.
  • NCBI Gene: Helps you confirm gene names, function summaries, and linked records.
  • Python: Supports data cleaning, gene list comparisons, and plotting results.
  • R: Helps you run statistics, make comparison plots, and summarize pathway trends.

Experiment Steps

  1. Define a narrow gene set, such as DNA-repair and genome-maintenance genes, so your search stays focused.
  2. Choose long-lived mammals and short-lived relatives that give you fair comparisons across the tree of life.
  3. Set rules for what counts as a gene loss, such as missing annotation, frameshift, or premature stop codon.
  4. Build a comparison table that records each gene, each species, and each evidence source.
  5. Add phylogenetic context so you can tell shared evolutionary changes from random annotation gaps.
  6. Rank the strongest candidate genes with a clear scoring system that favors repeated patterns and good data quality.

Common Pitfalls

  • Treating a missing annotation as a true gene loss, which can happen when the genome assembly is incomplete.
  • Comparing distant species without matching them to close relatives, which makes lifespan effects hard to separate from ancestry.
  • Using gene names alone instead of confirming orthologs, which can mix up different genes with similar labels.
  • Ignoring assembly quality, which can make a bad genome look like it has many losses.
  • Calling a longevity gene too early, which overstates what comparative genomics can prove.

What Makes This Competitive

A strong project does more than list missing genes. It explains why a candidate loss looks real, repeated, and biologically meaningful. You can raise the quality fast by adding close relatives, strict evidence filters, and a ranking system that separates weak hits from strong ones. A better project also tests whether the same pattern holds across different parts of the DNA-repair network, not just one gene list.

Project Variations

  • Focus on telomere maintenance genes instead of the full DNA-repair set.
  • Compare gene losses in long-lived marine mammals against long-lived flying mammals to test whether ecology changes the pattern.
  • Rank candidate genes by pathway class, then ask whether base-excision repair, double-strand break repair, or mismatch repair shows the strongest signal.

Learn More

  • NCBI Gene: Search gene pages for function summaries, ortholog links, and supporting literature.
  • Ensembl Genome Browser: Compare mammal genomes and inspect gene annotations across species.
  • UCSC Genome Browser: Examine sequence neighborhoods and cross-species conservation tracks.
  • NIH PubMed: Search review articles on comparative genomics, longevity, and DNA repair.
  • NCBI Bookshelf: Read free textbook chapters on genomics, evolution, and molecular biology.
  • Molecular Biology of the Cell: Use a library or school copy for clear background on DNA repair and genome maintenance.

For next steps tailored to your interests, skill level, and timeline, work one-on-one with a MehtA+ mentor. Learn more about MehtA+ Science & Engineering Research Mentorship →

To discover more projects, visit the MehtA+ Science Fair Project Discovery Hub​ →

Shopping Cart