Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
12345674 of 23
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Does genome size matter?: Comparative (meta)genomics to investigate the ecological meaning of genome size in aquatic prokaryotes
Stockholm University, Faculty of Science, Department of Ecology, Environment and Plant Sciences.ORCID iD: 0000-0002-1649-6894
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The Streamlining Theory explains that the success of certain prokaryotic species in marine environments might be linked to their highly reduced genomes and minimal nutritional requirements. Still, the ecological implications of genome size variability remain understudied. In this thesis, I provide novel insights to explore how we can use genome size information to understand different ecological patterns in prokaryotes.

The first part of my thesis explores patterns of genome size variability across ecosystems. In Chapter I we investigate prokaryotic genome size across 17,834 species-clusters (ANI > 95%) retrieved from three major biomes (host-associated, terrestrial and aquatic), and 8,267 species-clusters obtained from laboratory-grown isolates. We found that host-associated and aquatic prokaryotes (averaging 3.0 Mbp and 3.1 Mbp, respectively) hold reduced genome sizes when compared to those retrieved from soils (averaging 3.7 Mbp) and laboratory-grown isolates (averaging 4.3 Mbp). Moreover, only a minority of the species-clusters had been previously grown in laboratory cultures (~3.03%). In Chapter II we observed that differences in genome size also happen between and within aquatic environments: MAGs retrieved from the water column of brackish and marine environments (averaging 2.97 Mbp and 3.10 Mbp, respectively) have smaller genome sizes than those retrieved from pelagic freshwaters (averaging 3.48 Mbp). Differences in genome size are also observed between benthic and pelagic prokaryotes in the Baltic Sea (averaging 3.47 Mbp and 2.97 Mbp, respectively). Interestingly, we found that genome size in both brackish environments correlated negatively with the metabolic potential involved in numerous functions, with the only exception of genes involved in the nitrogen cycle.

In the second part of my thesis work, we focused on freshwater prokaryotes to explore genome size variability. In Chapter III we sampled and sequenced 17 new metagenomes collected from eight different freshwater lakes in the Stockholm region, with a particular focus on lake Mälaren. In total, this project compiles a total of 2,378 MAGs (completeness >30% and contamination <10%) grouped into 514 species-clusters from 19 different phyla. This data, together with other re-binned MAGs and publicly available genomes, were compiled to study the relation between genome size, prevalence and relative abundance in Chapter IV. In this project, we included 80,561 genomes of medium-to-high quality (>50% completeness and contamination <5%) retrieved from ~590 publicly available BioProjects and research articles, and were clustered into 24,050 species-clusters. After competitive mapping against a dataset of 636 globally-distributed freshwater metagenomes, we detected the presence of 9,028 species-clusters on at least one metagenomic sample. Our results show that the estimated genome size correlates negatively with the prevalence and the average relative abundance, reflecting that prokaryotes with reduced genomes are present in a larger number of metagenomes, and at a higher relative abundance. Furthermore, we found that species-clusters with reduced genomes have a higher tendency to co-occur with other prokaryotes, probably in relation to strong metabolic dependencies. Lastly, in Chapter V we selected the genus Rhodoferax (phylum Pseudomonadota) as a case study to investigate the ecological implications of intragenus genome size variability. After subsetting the results from the previous chapter, we compiled 345 high-quality genomes (>90% completeness and contamination <5%) classified as Rhodoferax. These genomes clustered into 96 species-clusters, from which 80 were detected on at least one freshwater metagenome. We found that intragenus genome size ranged from 2.41 Mbp to 6.92 Mbp, and its variability was linked to the number and length of genomic islands, the metabolic potential, and the depth stratification of lakes.

The projects presented combine newly generated metagenomic data with the re-use of public archived data to provide new insights of the ecological implications of prokaryotic genome size variability. We used comparative (meta)genomics to analyze and compare genomes from numerous species-clusters and from various environments, with a specific focus on aquatic environments.

Place, publisher, year, edition, pages
Stockholm: Department of Ecology, Environment and Plant Sciences, Stockholm University , 2024. , p. 108
Keywords [en]
genome size, metagenomics, bacteria, archaea, microbial genomics, microbiome, freshwaters, Baltic Sea, Rhodoferax
National Category
Microbiology Ecology Bioinformatics and Systems Biology Genetics
Research subject
Ecology and Evolution
Identifiers
URN: urn:nbn:se:su:diva-234929ISBN: 978-91-8014-995-2 (print)ISBN: 978-91-8014-996-9 (electronic)OAI: oai:DiVA.org:su-234929DiVA, id: diva2:1908567
Public defence
2024-12-13, Vivi Täckholmsalen (Q-salen), NPQ-huset, Svante Arrhenius väg 20 and online via Zoom, public link will be available a week before the event, Stockholm, 09:00 (English)
Opponent
Supervisors
Available from: 2024-11-20 Created: 2024-10-28 Last updated: 2024-11-11Bibliographically approved
List of papers
1. A Genomic Perspective Across Earth’s Microbiomes Reveals That Genome Size in Archaea and Bacteria Is Linked to Ecosystem Type and Trophic Strategy
Open this publication in new window or tab >>A Genomic Perspective Across Earth’s Microbiomes Reveals That Genome Size in Archaea and Bacteria Is Linked to Ecosystem Type and Trophic Strategy
Show others...
2022 (English)In: Frontiers in Microbiology, E-ISSN 1664-302X, Vol. 12, article id 761869Article, review/survey (Refereed) Published
Abstract [en]

Our view of genome size in Archaea and Bacteria has remained skewed as the data has been dominated by genomes of microorganisms that have been cultivated under laboratory settings. However, the continuous effort to catalog Earth’s microbiomes, specifically propelled by recent extensive work on uncultivated microorganisms, provides an opportunity to revise our perspective on genome size distribution. We present a meta-analysis that includes 26,101 representative genomes from 3 published genomic databases; metagenomic assembled genomes (MAGs) from GEMs and stratfreshDB, and isolates from GTDB. Aquatic and host-associated microbial genomes present on average the smallest estimated genome sizes (3.1 and 3.0 Mbp, respectively). These are followed by terrestrial microbial genomes (average 3.7 Mbp), and genomes from isolated microorganisms (average 4.3 Mbp). On the one hand, aquatic and host-associated ecosystems present smaller genomes sizes in genera of phyla with genome sizes above 3 Mbp. On the other hand, estimated genome size in phyla with genomes under 3 Mbp showed no difference between ecosystems. Moreover, we observed that when using 95% average nucleotide identity (ANI) as an estimator for genetic units, only 3% of MAGs cluster together with genomes from isolated microorganisms. Although there are potential methodological limitations when assembling and binning MAGs, we found that in genome clusters containing both environmental MAGs and isolate genomes, MAGs were estimated only an average 3.7% smaller than isolate genomes. Even when assembly and binning methods introduce biases, estimated genome size of MAGs and isolates are very similar. Finally, to better understand the ecological drivers of genome size, we discuss on the known and the overlooked factors that influence genome size in different ecosystems, phylogenetic groups, and trophic strategies. 

Keywords
microbial ecology, genome size, bacteria, archaea, genomics
National Category
Microbiology Evolutionary Biology Ecology
Research subject
Ecology and Evolution; Microbiology
Identifiers
urn:nbn:se:su:diva-201673 (URN)10.3389/fmicb.2021.761869 (DOI)000745167600001 ()
Funder
Science for Life Laboratory, SciLifeLabThe Royal Swedish Academy of SciencesScience for Life Laboratory, SciLifeLabThe Royal Swedish Academy of Sciences
Available from: 2022-02-01 Created: 2022-02-01 Last updated: 2024-10-28Bibliographically approved
2. Linking prokaryotic genome size variation to metabolic potential and environment 
Open this publication in new window or tab >>Linking prokaryotic genome size variation to metabolic potential and environment 
Show others...
2023 (English)In: ISME Communications, E-ISSN 2730-6151, Vol. 3, no 1, article id 25Article in journal (Refereed) Published
Abstract [en]

While theories and models have appeared to explain genome size as a result of evolutionary processes, little work has shown that genome sizes carry ecological signatures. Our work delves into the ecological implications of microbial genome size variation in benthic and pelagic habitats across environmental gradients of the brackish Baltic Sea. While depth is significantly associated with genome size in benthic and pelagic brackish metagenomes, salinity is only correlated to genome size in benthic metagenomes. Overall, we confirm that prokaryotic genome sizes in Baltic sediments (3.47 Mbp) are significantly bigger than in the water column (2.96 Mbp). While benthic genomes have a higher number of functions than pelagic genomes, the smallest genomes coded for a higher number of module steps per Mbp for most of the functions irrespective of their environment. Some examples of this functions are amino acid metabolism and central carbohydrate metabolism. However, we observed that nitrogen metabolism was almost absent in pelagic genomes and was mostly present in benthic genomes. Finally, we also show that Bacteria inhabiting Baltic sediments and water column not only differ in taxonomy, but also in their metabolic potential, such as the Wood-Ljungdahl pathway or the presence of different hydrogenases. Our work shows how microbial genome size is linked to abiotic factors in the environment, metabolic potential and taxonomic identity of Bacteria and Archaea within aquatic ecosystems. 

National Category
Microbiology Ecology
Research subject
Microbiology; Environmental Sciences
Identifiers
urn:nbn:se:su:diva-227655 (URN)10.1038/s43705-023-00231-x (DOI)001052229500001 ()
Funder
Science for Life Laboratory, SciLifeLab
Available from: 2024-03-25 Created: 2024-03-25 Last updated: 2024-10-28Bibliographically approved
3. Shotgun metagenomes from productive lakes in an urban region of Sweden
Open this publication in new window or tab >>Shotgun metagenomes from productive lakes in an urban region of Sweden
Show others...
2023 (English)In: Scientific Data, E-ISSN 2052-4463, Vol. 10, article id 810Article in journal (Refereed) Published
Abstract [en]

Urban lakes provide multiple benefits to society while influencing life quality. Moreover, lakes and their microbiomes are sentinels of anthropogenic impact and can be used for natural resource management and planning. Here, we release original metagenomic data from several well-characterized and anthropogenically impacted eutrophic lakes in the vicinity of Stockholm (Sweden). Our goal was to collect representative microbial community samples and use shotgun sequencing to provide a broad view on microbial diversity of productive urban lakes. Our dataset has an emphasis on Lake Mälaren as a major drinking water reservoir under anthropogenic impact. This dataset includes short-read sequence data and metagenome assemblies from each of 17 samples collected from eutrophic lakes near the greater Stockholm area. We used genome-resolved metagenomics and obtained 2378 metagenome assembled genomes that de-replicated into 514 species representative genomes. This dataset adds new datapoints to previously sequenced lakes and it includes the first sequenced set of metagenomes from Lake Mälaren. Our dataset serves as a baseline for future monitoring of drinking water reservoirs and urban lakes.

National Category
Oceanography, Hydrology and Water Resources
Identifiers
urn:nbn:se:su:diva-225401 (URN)10.1038/s41597-023-02722-x (DOI)001126323300001 ()37978200 (PubMedID)2-s2.0-85176963740 (Scopus ID)
Available from: 2024-01-17 Created: 2024-01-17 Last updated: 2024-10-28Bibliographically approved
4. The effect of genome size on ecological success is mediated by genomic features and biotic factors in aquatic prokaryotes
Open this publication in new window or tab >>The effect of genome size on ecological success is mediated by genomic features and biotic factors in aquatic prokaryotes
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

While genome size is linked to abiotic factors in the environment, metabolic potential and taxonomic identity, the link between ubiquity, relative abundance and genome size in freshwater prokaryotes has not been investigated comprehensively beyond specific case studies. Here, we provide the first systematic evaluation at a global scale of the ecological implications of genome size variability in the abundance, prevalence, co-occurrence and biosynthesis of essential metabolites in freshwater Archaea and Bacteria. In this work, we leveraged 80,561 medium-to-high quality genomes (>50% completeness and <5% contamination) to investigate the ecological implications of genome size variability across freshwater prokaryotes. This dataset of genomes was de-replicated into 24,050 species-level clusters (ANI >95%), and mapped against a manually curated dataset 636 freshwater metagenomes distributed across the globe to estimate their relative abundance and prevalence. Our results show the presence of 9,028 species-clusters on at least one metagenomic sample. Their estimated genome size correlated negatively with their prevalence and relative abundance, indicating that reduced genomes are found in a larger number of freshwater metagenomes and at larger relative abundances than prokaryotic genomes of bigger size. By inferring a co-occurrence network, we also observed that prokaryotes with reduced genomes and high prevalence tend to co-occur in cohorts. Metabolic annotation shows that these co-occurrence cohorts might be supported by the exchange of essential metabolites (either passive or active), as observed the vitamin B12. Lastly, we hypothesize that prokaryotes with reduced genomes could follow two different strategies, either by increasing their coding density and compacting their genomes, or by losing a large number of the functions involved in catabolism and structure.

National Category
Bioinformatics and Systems Biology Ecology Microbiology
Research subject
Ecology and Evolution; Microbiology
Identifiers
urn:nbn:se:su:diva-234927 (URN)
Available from: 2024-10-28 Created: 2024-10-28 Last updated: 2024-10-28
5. The role of genomic islands, metabolic potential and environment in genome size intragenus variability: genus Rhodoferax as case study.
Open this publication in new window or tab >>The role of genomic islands, metabolic potential and environment in genome size intragenus variability: genus Rhodoferax as case study.
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Genome size variability reflects ecological patterns in aquatic bacterial phyla, but the interplay at the genus level between the environment and different genomic features such as genomic islands and metabolic potential, is still to be explored. In this work, we selected 340 publicly available high-quality genomes (>90% completeness and <5% contamination, including MAGs and genomes from isolates) from genus Rhodoferax (family Burkholderaceae) to better understand the implications of intragenus genome size variability. To explore the importance of genomic islands, we complemented this dataset with high-quality genomes from other six bacterial genera present in freshwaters. We found that not only bacterial genera with larger average estimated genome sizes (such as Mycobacterium, Rhodoferax and Flavypsychrobacter) are more variable in genome size, but they also contain more genomic islands and they span for larger segments of the genome than genera with more reduced genomes. Moreover, we observed that genus Rhodoferax hosts 96 different species-level clusters (ANI >95% similarity) with average estimated genome sizes ranging from 2.41 Mbp to 6.92 Mbp, distributed mainly in freshwater environments, but also across terrestrial and host-associated. The metabolic potential and completeness of biosynthetic pathways for essential metabolites of these genomes are linked with their estimated genome sizes, indicating that Rhodoferax species-clusters with larger genomes contain less auxotrophies and more metabolic capabilities, such as for nitrogen cycling. Lastly, we observed that Rhodoferax species-clusters are present across global freshwater lakes, with the only exception of the African continent, and are abundant at different depths in freshwater lakes, reaching relative abundances of up to 20.4% in lake Lomtjärnan (Sweden). Overall, the genus Rhodoferax is a highly diverse and prevalent freshwater genus and offers valuable insights to understand the importance of genome size variability in freshwater Bacteria.

National Category
Microbiology Bioinformatics and Systems Biology Ecology
Research subject
Microbiology; Ecology and Evolution
Identifiers
urn:nbn:se:su:diva-234928 (URN)
Available from: 2024-10-28 Created: 2024-10-28 Last updated: 2024-10-28

Open Access in DiVA

Does genome size matter?: Comparative (meta)genomics to investigate the ecological meaning of genome size in aquatic prokaryotes(4502 kB)50 downloads
File information
File name FULLTEXT01.pdfFile size 4502 kBChecksum SHA-512
53d37441dff14cb70b24b346bf43f6d6993bf9f10f12dca3f7b2dc19c940bba96c474be40a4421e22193f1a094986b616a86f20bdfe8655138c96bbfa31fc31a
Type fulltextMimetype application/pdf

Authority records

Rodríguez Gijón, Alejandro

Search in DiVA

By author/editor
Rodríguez Gijón, Alejandro
By organisation
Department of Ecology, Environment and Plant Sciences
MicrobiologyEcologyBioinformatics and Systems BiologyGenetics

Search outside of DiVA

GoogleGoogle Scholar
Total: 50 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 500 hits
12345674 of 23
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf