Microbiome Analysis Pipeline

This repository contains scripts and workflows for conducting microbiome analyses using QIIME2 and R. The workflows are designed to handle metagenomic and taxonomic data, performing tasks such as sequence processing, differential abundance analysis, and visualization.

Overview

This repository supports two primary analysis pipelines:

QIIME2 Workflow: Focused on sequence processing, taxonomic classification, and phylogenetic diversity analysis.
R Workflow: Performs differential abundance analysis using ANCOMBC2 and creates detailed taxonomic visualizations.

These workflows are complementary and provide end-to-end support for microbiome data analysis.

Prerequisites

Software and Dependencies

QIIME2 (v2021.8 or later)
- Ensure all plugins like dada2, feature-classifier, phylogeny, and taxa are installed.
R (v4.0 or later)
- Required R packages: tidyverse, phyloseq, qiime2R, microbiome, ANCOMBC, microViz, ggplot2, lme4.

Input Files

QIIME2 Workflow:
- Raw paired-end sequences in CasavaOneEightSingleLanePerSampleDirFmt format.
- A pre-trained classifier for taxonomic classification (e.g., Silva database).
- Metadata file in .tsv format.
R Workflow:
- Exported .qza and .qzv files from QIIME2.
- Metadata file with relevant grouping columns.

Pipeline Details

QIIME2 Workflow

The QIIME2 pipeline includes the following steps:

Data Import: Converts raw paired-end sequence data into QIIME2's SampleData[PairedEndSequencesWithQuality] format.
Denoising with DADA2: Removes noise and generates a feature table and representative sequences.
Taxonomic Classification: Classifies sequences using a pre-trained classifier (e.g., Silva database).
Diversity Analysis: Computes alpha and beta diversity metrics based on phylogenetic trees.
Visualization: Produces bar plots, tables, and diversity summaries.

R Workflow

The R pipeline is designed for downstream statistical and visualization tasks:

Import Data: Converts exported QIIME2 .qza files into phyloseq objects.
Differential Abundance Analysis:
- Performs ANCOMBC2 at various taxonomic levels (e.g., Phylum, Genus).
- Identifies significant taxa with adjusted p-values and log fold changes.
Visualization:
- Generates bar plots with error bars for log fold changes.
- Outputs results as .tsv files and high-resolution images.

Usage

Running the QIIME2 Workflow

Update the QIIME2 paths in the Bash script (qiime2_workflow.sh).

Execute the script:

bash qiime2_workflow.sh

             _-=====-_
           /         \
 ___  ___ //   UNIPI  \\ ___  ___
|   ||   | ------------ |   ||   |
|___||___|     ___      |___||___|
   ( o o )    ( o )        ( o o )

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
ANCOMBC2_analysis		ANCOMBC2_analysis
QIIME2_toolkit		QIIME2_toolkit
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Microbiome Analysis Pipeline

Table of Contents

Overview

Prerequisites

Software and Dependencies

Input Files

Pipeline Details

QIIME2 Workflow

R Workflow

Usage

Running the QIIME2 Workflow

About

Releases

Packages

Languages

JackDettoLoco/PhD

Folders and files

Latest commit

History

Repository files navigation

Microbiome Analysis Pipeline

Table of Contents

Overview

Prerequisites

Software and Dependencies

Input Files

Pipeline Details

QIIME2 Workflow

R Workflow

Usage

Running the QIIME2 Workflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages