NSF requires disclosure of AI tool usage in proposal preparation. Ensure you disclose the use of FindGrants' AI drafting in your application.
NSF
Information about the evolutionary relationships between species and populations is critical for many biological analyses. For example, evolutionary trees and networks are commonly used to study the evolution and genetic basis of traits, with applications in fields ranging from medicine to agriculture and conservation. Despite significant progress over the past decade, many important parts of evolutionary history remain unresolved or are subject to debate. An opportunity to address these open questions has recently emerged. Technological advances have led to more complete and high quality genome assemblies than was previously possible, even resolving highly repetitive regions, like the 30-50% of vertebrate genomes made up of retrotransposons--DNA sequences that when active can copy-paste themselves to new locations. This development is significant because retrotransposons are major drivers of genome expansion and innovation--and their presence or absence at related positions across the genomes of different species are powerful markers of evolution. However, new computational methods are needed that leverage retrotransposon data for species tree and network reconstruction, an emerging field called retrophylogenomics. Only a few methods have been developed to date, but they are based on simple models that do not reflect retrotransposon dynamics. This project will address such challenges, building the computational infrastructure needed for fast and accurate evolutionary analyses of retrotransposons. This research will be integrated with an education plan to provide hands-on research and training opportunities in interdisciplinary computing and data science for undergraduate and graduate students. The software tools and curated data sets from this project will be incorporated into outreach for middle and high school students, connecting computing fundamentals with scientific discovery. The objective of this project is to address the core algorithmic and statistical barriers to retrophylogenomics, an emerging field concerned with leveraging retrotransposons to understand the evolution of species and populations. This goal will be pursued through the following activities. (1) Development of fast and user-friendly data processing workflows for calling retrotransposon variants from whole genome alignments, alleviating the need for manual curation. (2) Development of realistic models of retrotransposon evolution, integrating knowledge of retrotransposon dynamics with evolutionary models acting on multiple scales from the species/population-level to the molecular level. (3) Data simulations and model validation. (4) Design and analysis of methods for inferring species trees and networks from retrotransposons, blending statistical techniques with discrete algorithms. (5) Implementation of methods in open-source software. (6) Evaluation of method accuracy and scalability on simulated and real data. (7) Application of methods to vertebrate genomes to generate the largest catalog of retrotransposons variants, along with reconstructed species trees or networks. Graduate students will fully participate and be mentored in these activities. This project will create research opportunities for undergraduate students focused on data exploration and error detection via visualization and machine learning. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Up to $348K
2030-04-30
Detailed requirements not yet analyzed
Have the NOFO? Paste it below for AI-powered requirement analysis.
One-time $749 fee · Includes AI drafting + templates + PDF export
Research Infrastructure: National Geophysical Facility (NGF): Advancing Earth Science Capabilities through Innovation - EAR Scope
NSF — up to $26.6M
AmLight: The Next Frontier Towards Discovery in the Americas and Africa
NSF — up to $9M
CREST Phase II Center for Complex Materials Design
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Energy Technologies
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Post-Transcriptional Regulation
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Semiconductors Research
NSF — up to $7.5M