A computational framework to study microbial genes of unknown function

  • Microbes have an immense and varied functional potential that influences and is influenced by the surrounding environment. Microbial processes affect global biogeochemical cycles and numerous medical, biotechnological, and industrial activities. Over the centuries, the study of microbial systems has progressed through technological and methodological revolutions that greatly expanded our understanding of the microbial world. These discoveries provided insights into the role of microbial communities in the environment, and helped identify and develop beneficial industrial and biotechnological applications. However, the functional characterization of the microbial genetic repertoire has not kept pace with the constant growth of sequenced genomes and metagenomes. This discrepancy has opened a gap between the known and unknown coding sequence space. Several challenges hinder the bridging of this gap. Consequently, the unknown fraction is often excluded from functional microbiome analyses, resulting in a loss of valuable information and limiting our understanding of the functional roles of microbes. In the last decade, several methods have been proposed to address the challenge of uncharacterized genes. However, despite the advances brought by previous studies, an integrated and scalable solution that organizes unknown genes into biologically meaningful categories is still missing, as well as the development of a standard partitioning scale capable of unifying genomic and metagenomic data maximizing the information for the unknown fraction and facilitating its inclusion in the analyses of microbial systems. The work presented in this thesis addresses these challenges by developing the conceptual and computational basis to enable the study of the large pool of genes with unknown function and their inclusion in the analyses of microbial systems.

Download full text

Cite this publication

  • Export Bibtex
  • Export RIS

Citable URL (?):

Search for this publication

Search Google Scholar Search Catalog of German National Library Search OCLC WorldCat Search Bielefeld Academic Search Engine
Meta data
Publishing Institution:IRC-Library, Information Resource Center der Jacobs University Bremen
Granting Institution:Jacobs Univ.
Author:Chiara Vanni
Referee:Marc Thorsten Hütt, Johannes Söding, Antonio Fernandez-Guerra
Advisor:Frank Oliver Glöckner
Persistent Identifier (URN):urn:nbn:de:gbv:579-opus-1010001
Document Type:PhD Thesis
Language:English
Date of Successful Oral Defense:2021/07/21
Date of First Publication:2021/08/10
Academic Department:Life Sciences & Chemistry
PhD Degree:Bioinformatics
Focus Area:Health
Other Organisations Involved:Max Planck Institute for Marine Microbiology
Other Countries Involved:Denmark
Call No:2021/10

$Rev: 13581 $