Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Novel candidate genes for environmental stresses response in Synechocystis sp. PCC 6803 revealed by machine learning algorithms
Stockholm University, Faculty of Science, Department of Molecular Biosciences, The Wenner-Gren Institute.ORCID iD: 0000-0001-7239-4457
Number of Authors: 42024 (English)In: Brazilian journal of microbiology (Impresso), ISSN 1517-8382, E-ISSN 1678-4405, Vol. 55, p. 1219-1229Article in journal (Refereed) Published
Abstract [en]

Cyanobacteria have developed acclimation strategies to adapt to harsh environments, making them a model organism. Understanding the molecular mechanisms of tolerance to abiotic stresses can help elucidate how cells change their gene expression patterns in response to stress. Recent advances in sequencing techniques and bioinformatics analysis methods have led to the discovery of many genes involved in stress response in organisms. The Synechocystis sp. PCC 6803 is a suitable microorganism for studying transcriptome response under environmental stress. Therefore, for the first time, we employed two effective feature selection techniques namely and support vector machine recursive feature elimination (SVM-RFE) and LASSO (Least Absolute Shrinkage Selector Operator) to pinpoint the crucial genes responsive to environmental stresses in Synechocystis sp. PCC 6803. We applied these algorithms of machine learning to analyze the transcriptomic data of Synechocystis sp. PCC 6803 under distinct conditions, encompassing light, salt and iron stress conditions. Seven candidate genes namely sll1862, slr0650, sll0760, slr0091, ssl3044, slr1285, and slr1687 were selected by both LASSO and SVM-RFE algorithms. RNA-seq analysis was performed to validate the efficiency of our feature selection approach in selecting the most important genes. The RNA-seq analysis revealed significantly high expression for five genes namely sll1862, slr1687, ssl3044, slr1285, and slr0650 under ion stress condition. Among these five genes, ssl3044 and slr0650 could be introduced as new potential candidate genes for further confirmatory genetic studies, to determine their roles in their response to abiotic stresses.

Place, publisher, year, edition, pages
2024. Vol. 55, p. 1219-1229
Keywords [en]
Cyanobacteria, LASSO, SVM- RFE, Gene expression, Environmental stresses, Machine learning
National Category
Genetics and Genomics
Identifiers
URN: urn:nbn:se:su:diva-229287DOI: 10.1007/s42770-024-01338-6ISI: 001214213700002PubMedID: 38705959Scopus ID: 2-s2.0-85192096584OAI: oai:DiVA.org:su-229287DiVA, id: diva2:1859805
Available from: 2024-05-22 Created: 2024-05-22 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Emami, Seyedeh Noushin

Search in DiVA

By author/editor
Emami, Seyedeh Noushin
By organisation
Department of Molecular Biosciences, The Wenner-Gren Institute
In the same journal
Brazilian journal of microbiology (Impresso)
Genetics and Genomics

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 64 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf