-
Notifications
You must be signed in to change notification settings - Fork 0
Core Project Overview
Flu-GLUE is structured around a minimal 'core' project, which can be expanded with additional data and tools as needed. This layered project structure allows users to build on a robust foundation while adding specialized data and logic for specific research purposes. The core project includes:
- Influenza genome feature definitions that describe key regions in the genomes of all influenza species.
- Reference genome sequences for each species and subtype, ensuring comprehensive coverage.
- Genome feature annotations linked to the master reference sequences.
- A hierarchically arranged set of multiple sequence alignments (MSAs) representing sequence homology among reference sequences.
These core components support essential comparative genomics tasks such as feature coverage mapping, alignment generation, genotyping, phylogenetic reconstruction, mutation frequency calculations, and mutation profiling. This approach enables both local instance setup and pre-built project deployment, offering flexibility in how the system is used.
Flu-GLUE's layered project structure allows users to selectively import sequences based on specific research needs, preventing computational overload while maintaining analytical precision. This ensures that analyses remain feasible even with large datasets, by incorporating only the data relevant to the specific study at hand.
In addition to its flexible design, Flu-GLUE integrates diverse publicly available influenza genome data and offers a suite of powerful tools to support comprehensive genomic analysis:
- Genotyping: Identify and classify influenza virus strains with automated genotyping for influenza sequences.
- Mutation Profiling: Detect and analyze mutations in virus genomes, helping track viral evolution and adaptation.
- Phylogenetic Reconstruction: Build evolutionary trees to track the spread and genetic diversification of influenza viruses.
Flu-GLUE also offers validated and standardized data through its GenBank filtering tools, which process and clean influenza virus sequence data from GenBank. These tools help manage sequence-associated metadata, resolve redundancies, and ensure high-quality data is imported for further analysis.