Network construction and structure detection with metagenomic count data

Zhenqiu Liu, Shili Lin, Steven Piantadosi

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Background: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. Results: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. Conclusions: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/.

Original languageEnglish (US)
Article number72
JournalBioData Mining
Volume8
Issue number1
DOIs
StatePublished - Dec 12 2015

Fingerprint

Metagenomics
Count Data
Distance Measure
Network Structure
Sequencing
Software
Biodiversity
Biostatistics
Ecology
Electric network analysis
MATLAB
Microbiota
Bacteria
Large Data
Network Analysis
Software Tools
Similarity Measure
Health
Isolation
Pairwise

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology
  • Genetics
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

Liu, Zhenqiu ; Lin, Shili ; Piantadosi, Steven. / Network construction and structure detection with metagenomic count data. In: BioData Mining. 2015 ; Vol. 8, No. 1.
@article{8e76129fe8614b2da6998ec0a91b53ad,
title = "Network construction and structure detection with metagenomic count data",
abstract = "Background: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. Results: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. Conclusions: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/.",
author = "Zhenqiu Liu and Shili Lin and Steven Piantadosi",
year = "2015",
month = "12",
day = "12",
doi = "10.1186/s13040-015-0072-2",
language = "English (US)",
volume = "8",
journal = "BioData Mining",
issn = "1756-0381",
publisher = "BioMed Central",
number = "1",

}

Network construction and structure detection with metagenomic count data. / Liu, Zhenqiu; Lin, Shili; Piantadosi, Steven.

In: BioData Mining, Vol. 8, No. 1, 72, 12.12.2015.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Network construction and structure detection with metagenomic count data

AU - Liu, Zhenqiu

AU - Lin, Shili

AU - Piantadosi, Steven

PY - 2015/12/12

Y1 - 2015/12/12

N2 - Background: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. Results: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. Conclusions: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/.

AB - Background: The human microbiome plays a critical role in human health. Massive amounts of metagenomic data have been generated with advances in next-generation sequencing technologies that characterize microbial communities via direct isolation and sequencing. How to extract, analyze, and transform these vast amounts of data into useful knowledge is a great challenge to bioinformaticians. Microbial biodiversity research has focused primarily on taxa composition and abundance and less on the co-occurrences among different taxa. However, taxa co-occurrences and their relationships to environmental and clinical conditions are important because network structure may help to understand how microbial taxa function together. Results: We propose a systematic robust approach for bacteria network construction and structure detection using metagenomic count data. Pairwise similarity/distance measures between taxa are proposed by adapting distance measures for samples in ecology. We also extend the sparse inverse covariance approach to a sparse inverse of a similarity matrix from count data for network construction. Our approach is efficient for large metagenomic count data with thousands of bacterial taxa. We evaluate our method with real and simulated data. Our method identifies true and biologically significant network structures efficiently. Conclusions: Network analysis is crucial for detecting subnetwork structures with metagenomic count data. We developed a software tool in MATLAB for network construction and biologically significant module detection. Software MetaNet can be downloaded from http://biostatistics.csmc.edu/MetaNet/.

UR - http://www.scopus.com/inward/record.url?scp=84960491077&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84960491077&partnerID=8YFLogxK

U2 - 10.1186/s13040-015-0072-2

DO - 10.1186/s13040-015-0072-2

M3 - Article

VL - 8

JO - BioData Mining

JF - BioData Mining

SN - 1756-0381

IS - 1

M1 - 72

ER -