.
The Open Protein Structure Annotation Network
PDB Keyword
.

DUFs

PFAM DUF families solved by PSI centers

 

The genome projects have unearthed an enormous diversity of novel genes of unknown function that require biological and biochemical characterization to assess their role in the organism(s) from which they were derived. These genes, like all others, can be grouped into families based on sequence similarity.

duf_sizes.PNG

The PFAM database 23.0 contains over 2200 such families, referred to as Domains of Unknown Function (DUF). In a coordinated effort, the four large-scale centers of the NIH Protein Structure Initiative have determined the first three‑dimensional structures for more than 250 of these DUF families. Analysis of the first 248, solved until October 2008, reveals that they significantly vary in size (with an average of  252 proteins) and in contributions from sequenced genomes and from metagenomic data (see the chart on the right). It also shows that about two thirds of the DUF families likely represent very divergent branches of already known and well-characterized families, which allows us to propose hypotheses about their biological function. The remainder can be formally categorized as new folds or topologies, although about one third of these show significant sub-structure similarity to previously characterized folds. The homology to functionally annotated protein families remains an important clue in proposing hypotheses about functions of DUF families but it is usually not sufficient for a very reliable functional annotation. The chart below shows overall percentages of DUF families with new folds, new folds partially similar to previously known folds, putative analogs, putative homologs and recognizable homologs. homology.PNGThe inset pie charts show the percentage of DUF families with proposed hypothesis about function in each of these six categories. From a more general perspective, our results infer that, despite the enormous increase in the number and the diversity of new genes being uncovered, the fold space of proteins encoded by those genes is gradually becoming saturated. These previously unexplored sectors of the protein universe are, therefore, primarily shaped by extreme diversification of known protein families, which enables organisms to evolve new functions and adapt to particular niches and habitats. Notwithstanding, these DUF families still constitute the richest source for discovery of the remaining protein folds and topologies.We recently published a paper on the structural analysis of DUF families solved by PSI centers, which was published in Plos Biology.

http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1000205  

A list of PFAM DUF families solved by PSI centers 

DUFs: 248

Displaying: 0 - 10

Next
Representative Structure
Annotation Solved by Fold Type

PF01519: Pfam family PF01519 {Protein of unknown function DUF16 } with 42 members in NR database and an additional 2 members in metagenomic datasets, is represented in Archaea and Bacteria. The first structur... BSGC Homolog

PF01796: Pfam family PF01796 {Domain of unknown function DUF35 } with 1010 members in NR database and an additional 561 members in metagenomic datasets, is represented in Archaea and Bacteria. The first struc... JCSG Homolog

PF01861: Pfam family PF01861 {Protein of unknown function DUF43} with 30 members in NR database. The first structural representative solved (PDB Id: 2qm3 ) was subjected to a FATCAT structural similarity sear... MCSG Homolog

PF01865: Pfam family PF01865 {Protein of unknown function DUF47 } with 595 members in NR database and an additional 352 members in metagenomic datasets, is represented in Archaea and Bacteria. The first struc... JCSG Homolog

PF01877: Pfam family PF01877 {Protein of unknown function DUF54 } with 175 members in NR database and an additional 89 members in metagenomic datasets, is represented in Archaea. The first structural represen... NYSGXRC Putative Analog

PF01883: Pfam family PF01883 {Domain of unknown function DUF59 } with 3219 members in NR database and an additional 2322 members in metagenomic datasets, is represented in Archaea, Bacteria, and Eukaryota. Th... JCSG Putative Homolog

PF01893: Pfam family PF01893 {Uncharacterized protein family UPF0058 } with 41 members in NR database and an additional 5 members in metagenomic datasets, is represented in Archaea. The first structural repre... NESG Putative Analog

PF01904: Pfam family PF01904 {Protein of unknown function DUF72 } with 478 members in NR database and an additional 168 members in metagenomic datasets, is represented in Archaea, Bacteria, and Eukaryota. The... JCSG Putative Homolog

PF01906: Pfam family PF01906 {Domain of unknown function DUF74 } with 756 members in NR database and an additional 468 members in metagenomic datasets, is represented in Archaea, Bacteria, and Eukaryota. The ... MCSG Putative Analog

PF01908: Pfam family PF01908 {Protein of unknown function DUF75} with 848 members in NR database and an additional 664 members in metagenomic datasets, is represented in Archaea, Bacteria, and Eukaryota. The ... MCSG Putative Analog
Next
 
 

 
 
 
 

Reviews

References

 

No references found.

Tag page
Related pages for 'DUFs': Groups/PF06684, Groups/PF05899, Groups/PF02130, Groups/PF01796, Groups/PF06821, Groups/PF01865, Groups/PF01934, Groups/PF05430, Groups/PF04525, Groups/PF04378, Groups/PF04303, Groups/PF03658, Groups/PF04229, Groups/PF06201, Groups/PF05618, Groups/PF07045, Groups/PF04445, Groups/PF04672, Groups/PF06865, Groups/PF04273, Groups/PF08754, Groups/PF07080, Groups/PF05891, Groups/PF09351, Groups/PF06824, Groups/PF06908, Groups/PF08786, Groups/PF07049, Groups/PF07350, Groups/PF04041, Groups/PF09209, Groups/PF05838, Groups/PF01989, Groups/PF08538, Groups/PF06764, Groups/PF07090, Groups/PF06228, Groups/PF07031, Groups/PF07313, Groups/PF01983, Groups/PF09391, Groups/PF04013, Groups/PF06520, Groups/PF06352, Groups/PF05082, Groups/PF07191, Groups/PF08982, Groups/PF04919, Groups/PF07315, Groups/PF06627, Groups/PF01994, Groups/PF01519, Groups/PF06283, Groups/PF07755, Groups/PF08950, Groups/PF01947, Groups/PF08921, Groups/PF08939, Groups/PF01861, Groups/PF08851, Groups/PF09123, Groups/PF09002, Groups/PF06948, Groups/PF09152, Groups/PF01883, Groups/PF04524, Groups/PF03479, Groups/PF03928, Groups/PF03713, Groups/PF01904, Groups/PF04379, Groups/PF01936, Groups/PF01987, Groups/PF09179, Groups/PF03886, Groups/PF04359, Groups/PF03884, Groups/PF03629, Groups/PF08681, Groups/PF05962, Groups/PF01931, Groups/PF08768, Groups/PF05913, Groups/PF01958, Groups/PF06108, Groups/PF04337, Groups/PF04255, Groups/PF09410, Groups/PF07997, Groups/PF09012, Groups/PF07237, Groups/PF03671, Groups/PF06267, Groups/PF06973, Groups/PF08924, Groups/PF08866, Groups/PF04016, Groups/PF06849, Groups/PF01982, Groups/PF04289, Groups/PF01949, Groups/PF09130, Groups/PF08940, Groups/PF08838, Groups/PF09413, Groups/PF09393, Groups/PF07955, Groups/PF08975, Groups/PF08981, Groups/PF08965, Groups/PF08966, Groups/PF08682, Groups/PF04126, Groups/PF06557, Groups/PF08985, Groups/PF09640, Groups/PF09224, Groups/PF09183, Groups/PF09151, Groups/PF09641, Groups/PF02410, Groups/PF04040, Groups/PF07338, Groups/PF01906, Groups/PF01910, Groups/PF01908, Groups/PF04237, Groups/PF06877, Groups/PF06855, Groups/PF06041, Groups/PF06004, Groups/PF05979, Groups/PF07369, Groups/PF04327, Groups/PF09148, Groups/PF04634, Groups/PF04287, Groups/PF05167, Groups/PF06014, Groups/PF08002, Groups/PF04222, Groups/PF01877, Groups/PF07063, Groups/PF07515, Groups/PF06526, Groups/PF06998, Groups/PF06475, Groups/PF03885, Groups/PF06619, Groups/PF07566, Groups/PF06335, Groups/PF08860, Groups/PF04010, Groups/PF06006, Groups/PF07408, Groups/PF01995, Groups/PF09400, Groups/PF08807, Groups/PF08930, Groups/PF09167, Groups/PF06572, Groups/PF07892, Groups/PF01954, Groups/PF01893, Groups/PF05303, Groups/PF03685, Groups/PF09211, Groups/PF04591, Groups/PF08854, Groups/PF04242, Groups/PF06304, Groups/PF08962, Groups/PF08848, Groups/PF08922, Groups/PF08680, Groups/PF09171, Groups/PF08973, Groups/PF08968, Groups/PF09450, Groups/PF09082, Groups/PF09155, Groups/PF05638, Groups/PF02457, Groups/PF03937, Groups/PF07005, Groups/PF09149, Groups/PF07072, Groups/PF06078, Groups/PF07050, Groups/PF05256, Groups/PF06938, Groups/PF08796, Groups/PF08830, Groups/PF08933, Groups/PF04416, Groups/PF08956, Groups/PF08958, Groups/PF09001, Groups/PF09633, Groups/PF09630, Groups/PF08989, Groups/PF01933, Groups/PF04398, Groups/PF01980, Groups/PF04296, Groups/PF04430, Groups/PF01937, Groups/PF04167, Groups/PF04751, Groups/PF06742, Groups/PF06863, Groups/PF04356, Groups/PF07286, Groups/PF08927, Groups/PF07336, Groups/PF01951, Groups/PF08719, Groups/PF06133, Groups/PF06794, Groups/PF08984, Groups/PF05907, Groups/PF06032, Groups/PF07208, Groups/PF09234, Groups/PF06844, Groups/PF06748, Groups/PF08837, Groups/PF07100, Groups/PF08827, Groups/PF08929, Groups/PF08974, Groups/PF06924, Groups/PF07166, Groups/PF08980, Groups/PF04038, Groups/PF08942, Groups/PF08986, Groups/PF04036, Groups/PF08963, Groups/PF07262, Groups/PF09634, Groups/PF09218, Groups/PF08987, Groups/PF09187, Groups/PF09407, Groups/PF09449, Groups/PF09185, Groups/PF09188

Files (2)

FileSizeDateAttached by 
 duf_sizes.PNG
No description
6.42 kB20:45, 1 Oct 2009lukaszActions
 homology.PNG
No description
12.02 kB21:54, 30 Sep 2009lukaszActions
You must login to post a comment.

ABOUT SSL CERTIFICATES
All content on this site is licensed under a Creative Commons Attribution 3.0 License