Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A New Census of Protein Tandem Repeats and Their Relationship with Intrinsic Disorder
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0002-7115-9751
Show others and affiliations
Number of Authors: 52020 (English)In: Genes, E-ISSN 2073-4425, Vol. 11, no 4, article id 407Article in journal (Refereed) Published
Abstract [en]

Protein tandem repeats (TRs) are often associated with immunity-related functions and diseases. Since that last census of protein TRs in 1999, the number of curated proteins increased more than seven-fold and new TR prediction methods were published. TRs appear to be enriched with intrinsic disorder and vice versa. The significance and the biological reasons for this association are unknown. Here, we characterize protein TRs across all kingdoms of life and their overlap with intrinsic disorder in unprecedented detail. Using state-of-the-art prediction methods, we estimate that 50.9% of proteins contain at least one TR, often located at the sequence flanks. Positive linear correlation between the proportion of TRs and the protein length was observed universally, with Eukaryotes in general having more TRs, but when the difference in length is taken into account the difference is quite small. TRs were enriched with disorder-promoting amino acids and were inside intrinsically disordered regions. Many such TRs were homorepeats. Our results support that TRs mostly originate by duplication and are involved in essential functions such as transcription processes, structural organization, electron transport and iron-binding. In viruses, TRs are found in proteins essential for virulence.

Place, publisher, year, edition, pages
2020. Vol. 11, no 4, article id 407
Keywords [en]
tandem repeat, homorepeat, domain repeat, protein repeat, repeat prediction, intrinsic disorder, protein function, Swiss-Prot
National Category
Biological Sciences
Identifiers
URN: urn:nbn:se:su:diva-183161DOI: 10.3390/genes11040407ISI: 000537224600101PubMedID: 32283633OAI: oai:DiVA.org:su-183161DiVA, id: diva2:1450505
Available from: 2020-07-01 Created: 2020-07-01 Last updated: 2024-07-04Bibliographically approved
In thesis
1. Intrinsic disorder and tandem repeats - match made in evolution: Computational studies of molecular evolution
Open this publication in new window or tab >>Intrinsic disorder and tandem repeats - match made in evolution: Computational studies of molecular evolution
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Proteins are both the building blocks and workers of the cell, carrying out most of the important functions. For a long time, their structure has been regarded as the primary factor for their function, but intrinsically disordered proteins demonstrate an alternative to this paradigm. Disordered proteins can temporarily assume different forms based on their interactions with other molecules and play critical roles in several biological processes, including cell signaling and regulation of gene expression.

Tandem repeats are repeated patterns in genetic sequence. The role of tandem repeats in many protein structures is well documented today, but their role in disordered proteins is not entirely clear. This thesis aims to shed light on the mechanisms by which protein disorder and tandem repeats are linked.

Only 2.5% of residues in all known protein sequences are characterized by the overlap of tandem repeats and protein disorder as described in Paper III, but many of these proteins have crucial functions and are linked to human diseases. Short tandem repeats emerge in this study as most frequently occurring in disordered regions. Genetic variation in disordered proteins accounts for length differences in eukaryotic genes (Paper I) and many orphan, recently evolved proteins, are disordered due to high GC content (Paper II). 

A medical application of this research is illustrated in the thesis with examples of variations in short tandem repeats (STRs) and their role in human diseases. Paper IV presents a comprehensive resource of human STR variation and Paper V illustrates how it can be used to identify specific STRs of interest, such as in the case of colorectal cancer where variations in certain STRs lead to altered gene expression patterns in tumors.

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2023. p. 49
Keywords
Protein evolution, intrinsically disordered proteins (IDPs), tandem repeats, short tandem repeats (STRs), genetic variation, orphan proteins, GC content, human STR variation, colorectal cancer, gene expression
National Category
Bioinformatics (Computational Biology)
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-223099 (URN)978-91-8014-559-6 (ISBN)978-91-8014-560-2 (ISBN)
Public defence
2023-12-11, hörsal 7, hus D, Universitetsvägen 10 D and online via Zoom, public link is available at the department website, Stockholm, 09:00 (English)
Opponent
Supervisors
Available from: 2023-11-16 Created: 2023-10-19 Last updated: 2023-12-06Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMed

Authority records

Lundström, OxanaElofsson, Arne

Search in DiVA

By author/editor
Delucchi, MatteoSchaper, ElkeLundström, OxanaElofsson, ArneAnisimova, Maria
By organisation
Department of Biochemistry and BiophysicsScience for Life Laboratory (SciLifeLab)
In the same journal
Genes
Biological Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 231 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf