Change search
ReferencesLink to record
Permanent link

Direct link
Identifying and Quantifying Orphan Protein Sequences in Fungi
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics.
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics.ORCID iD: 0000-0002-7115-9751
2010 (English)In: Journal of Molecular Biology, ISSN 0022-2836, E-ISSN 1089-8638, Vol. 396, no 2, 396-405 p.Article in journal (Refereed) Published
Abstract [en]

For large regions of many proteins, and even entire proteins, no homology to known domains or proteins can be detected. These sequences are often referred to as orphans. Surprisingly, it has been reported that the large number of orphans is sustained in spite of a rapid increase of available genomic sequences. However, it is believed that de novo creation of coding sequences is rare in comparison to mechanisms such as domain shuffling and gene duplication; hence, most sequences should have homologs in other genomes. To investigate this, the sequences of 19 complete fungi genomes were compared. By using the phylogenetic relationship between these genomes, we could identify potentially de novo created orphans in Saccharomyces cerevisiae. We found that only a small fraction, <2%, of the S. cerevisiae proteome is orphan, which confirms that de novo creation of coding sequences is indeed rare. Furthermore, we found it necessary to compare the most closely related species to distinguish between de novo created sequences and rapidly evolving sequences where homologs are present but cannot be detected. Next, the orphan proteins (OPs) and orphan domains (ODs) were characterized. First, it was observed that both OPs and ODs are short. In addition, at least some of the OPs have been shown to be functional in experimental assays, showing that they are not pseudogenes. Furthermore, in contrast to what has been reported before and what is seen for older orphans, S. cerevisiae specific ODs and proteins are not more disordered than other proteins. This might indicate that many of the older, and earlier classified, orphans indeed are fast-evolving sequences. Finally, >90% of the detected ODs are located at the protein termini, which suggests that these orphans could have been created by mutations that have affected the start or stop codons.

Place, publisher, year, edition, pages
2010. Vol. 396, no 2, 396-405 p.
Keyword [en]
evolution, protein domain, orphan protein, fungi
National Category
Natural Sciences
URN: urn:nbn:se:su:diva-49277DOI: 10.1016/j.jmb.2009.11.053ISI: 000274980400013OAI: diva2:379359
Swedish Foundation for Strategic Research Swedish Research CouncilEU, FP7, Seventh Framework Programme, 503567; 512092

authorCount :2

Available from: 2010-12-17 Created: 2010-12-13 Last updated: 2014-11-10Bibliographically approved

Open Access in DiVA

Fulltext(1238 kB)143 downloads
File information
File name FULLTEXT02.pdfFile size 1238 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Ekman, DianaElofsson, Arne
By organisation
Department of Biochemistry and Biophysics
In the same journal
Journal of Molecular Biology
Natural Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 143 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 31 hits
ReferencesLink to record
Permanent link

Direct link