Any 16S Microbial Abundance Network Analysis Suggestions?

07 July 2018 3 491 Report

I am an undergraduate neuroscience and bioinformatics research assistant and my personal project has been to explore the gut microbiota of an EAE mouse model. There are three treatment groups: untreated control, Complete Freund's Adjuvant only, and MOG+CFA. Samples were taken from 6 control, 6 CFA, and 5 CFA+MOG mice over 5 time points (1 before and 4 after starting the EAE experiment).

I have since been focusing on analyzing a network created from the abundance data. Counts were normalized with DESeq2 and split into each treatment group. These count matrices were used to calculate a Spearman's rank correlation coefficient matrix for each group. This technique was applied to permutation testing with 100 randomized matrices generated. From the original matrix, all coefficients that fell below a 5% significance level in their corresponding distribution were set to 0 and correlations above were set to 1. In addition, any coefficients below 0.5 in magnitude were set to 0. This binary matrix was used to create unweighted network for each group.

I have since been focused on using over-representation testing on various features in the networks. Of key interest is how the networks are divided into communities/modules. I am using the fast greedy algorithm from igraph due to time/computing constraints but that could change based on suggestions.

Currently, I have been testing for whether a certain taxa-taxa interaction is over-represented in one module versus the rest of the network. I am using fisher's exact test where the 2x2 matrix can be split into within the network vs outside the network and the taxa-taxa interaction vs every other interaction. The counts correspond to each edge in the network that fulfills the criteria of each cell.

The data I get back is a matrix of a taxon-taxon interaction per each row and a module for each column. The values are p-values from the hypothesis testing. There are 3 such matrixes, one for each network/treatment group. I also have a matrix of the counts for each interaction/edge in each module.

My question is how can I better use the results of this data to derive biological insight? I have looked into dividing up bacteria into functional classes and potentially machine learning applications, but there are no standout programs that I know of that could readily take this data. The goals of this project are to better understand the structural changes in the gut microbiota during EAE and possibly to discover specific features like keystone taxa or co-occurrence groups that are gained or lost in the MOG+CFA group. Of particular interest are any OTUs related to the Lactobacillus/Bacilli lineage.

Ajit kumar Roy

I think u r proceeding in the right path.

Valerie Diane Valeriano

Hi David,

I also agree with the approach you are considering. Understanding the functional potential of the microbial groups is indeed important, complementary to understanding the abundance and structure for each treatment group. Given so, maybe a probable approach is to utilize PICRUSt (http://picrust.github.io/picrust/) to infer the metabolic capacity of your 16s libraries. This may help to understand why the keystone taxa are present or co-occurring (contributes to stability of the microbiota? etc.)

Steffen Kiel

Hi, in my experience, the Louvain approach performs slightly better than 'fast greedy' for identifying clusters in large networks. Re the taxon-taxon interactions you might check how the 'edge betweenness centrality' can help (also implemented in igraph).

Cheers, Steffen

Can I pool 16S data from all individuals at the pre-treatment time point?

How to learn more about SPSS and its Application?

Is there a problem with my RNA pellet?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

Strugglling with m6A dot blot any suugesstion ?

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

Request Python code?

GC-MS retention index prediticon?