Структура и функционирование белков. Применение методов биоинформатики - Джон Ригден 2014
Распознавание фолда
Литература
Altschul SF, Madden TL, Schaffer AA, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402
Bateman A and Finn RD (2007) SCOOP: a simple method for identification of novel protein super- family relationships. Bioinformatics 23:809-814
Bennett-Lovsey RM, Herbert AD, Sternberg MJ, et al. (2008) Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre. Proteins. 70:611-625
Berman HM, Westbrook J, Feng Z, et al. (2000) The protein data bank. Nucleic Acids Res 28:235- 242
Bowie JU, Lüthy R, Eisenberg D (1991) A method to identify protein sequences that fold into a known three-dimensional structure. Science 253:164-170
Bradford JR, Westhead DR (2005) Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics 21:1487-1494
Bryant SH (1996) Evaluation of threading specificity and accuracy. Proteins 26(2): 172-185
Busuttil S, Abela J, and Pace GJ (2004) Support vector machines with profile-based kernels for remote protein homology detection. Genome Inform Ser Workshop Genome Inform 15:191—200
Chivian D, Baker D (2006) Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection. Nucleic Acids Res 34:el 12
Copley RR, Bork P (2000) Homology among (beta/alpha)(8) barrels: implications for the evolution of metabolic pathways. J Mol Biol 303:627-641
Dodson G Wlodawer A (1998) Catalytic triads and their relatives. Trends Biochem Sci 23:347-352
Elofsson A (2002) A study on protein sequence alignment quality. Proteins 46:330-339 Fisher D (2003) 3D-SHOTGUN: a novel, cooperative, fold-recognition meta-predictor. Proteins 51:434—441
Garg A, Bhasin M, Raghava GP (2005) Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order and similarity search. J Biol Chem 280:14427-14432
Ginalski К, Elofsson А, Fischer D, et al. (2003) 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics 19:1015-1018
Heger A, Mallick S, Wilton C, et al. (2008) The global trace graph, a novel paradigm for searching protein sequence databases. Bioinformatics 23:2361-2367
Hou Y, Hsu W, Lee ML, et al. (2003) Efficient remote homology detection using local structure. Bioinformatics 19:2294-2301.
Jaakkola T, Diekhans M, Haussier D (2000) A discriminative framework for detecting remote protein homologies. J Comput Biol 7:95-114
Jain AK, Duin RPW, Mao JC (2000) Statistical pattem recognition: A review. IEEE Trans Pattem Anal 22:4-37
Jaroszewski L, Li W, Godzik A (2002) In search for more accurate alignments in the twilight zone. Prot Sci 11:1702-1713
Jones DT (1999a) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292:195-202.
Jones DT (1999b) GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J Mol Biol 287:797-815
Jones DT, Taylor WR, Thornton JM (1992) A new approach to protein fold recognition. Nature 358:86-89
Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577-2637
Kelley LA, MacCallum RM, Sternberg MJ (2000) Enhanced genome annotation using structural profiles in the program 3D-PSSM. J Mol Biol 299:499-520
Kim H, Park H (2003) Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor. Proteins 54:557-562
Kumar M, Bhasin M, Natt NK, et al. (2005) BhairPred: prediction of beta-hairpins in a protein from multiple alignment information using ANN and SVM techniques. Nucleic Acids Res 33 (Web Server issue): 154-159
Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51:181—207
Lathrop RH (1999) An anytime local-to-global optimization algorithm for protein threading in theta (m2n2) space. J Comput Biol 6(3-4):405-418
Lathrop RH, Smith TF (1996) Global optimum protein threading with gapped alignment and empirical pair potentials. J Mol Biol 255:641-665
Leslie C, Eskin E, Noble WS (2002) The spectrum kernel: a string kernel for SVM protein classification. Рас Symp Biocomput 564-575
Leslie CS, Eskin E, Cohen A, et al. (2004) Mismatch string kernels for discriminative protein classification. Bioinformatics 20:467-476
Liao L, Noble WS (2003) Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol 10:857-868 Madej T, Gilbrat J-F, Bryant SH (1995) Threading a database of protein cores. Proteins 23:356-369
Marsden RL, Lee D, Maibaum M, et al. (2006) Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res 34:1066-1080
McGuffin LJ (2008) The ModFOLD server for the quality assessment of protein structural models. Bioinformatics 24:586-587
Miyazawa S, Jemigan RL (1996) Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 256(3):623-644
Moult J, Fidelis К, Kryshtafovych A, et al. (2007) Critical assessment of methods of protein structure prediction - Round VII. Proteins 69 S8:3-9
Murzin AG Brenner SE, Hubbard T, et al. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247:536-540
Nguyen MN, Rajapakse JC (2003) Multi-class support vector machines for protein secondary structure prediction. Genome Inform Ser Workshop Genome Inform 14:218-227
Ohlson T, Wallner B, Elofsson A (2004) Profile-profile methods provide improved fold-recognition: a study of different profile-profile alignment methods. Proteins 57:188-197
Park J, Teichmann SA, Hubbard T, et al. (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol 273:349-354
Pearson WR (1998) Empirical statistical estimates for sequence similarity searches. J Mol Biol 276:71-84
Ponting CP, Russell RB (2000) Identification of distant homologues of fibroblast growth factors suggests a common ancestor for all beta-trefoil proteins. J Mol Biol 302:1041-1047
Prasad JC, Vajda S, Camacho CJ (2004) Consensus alignment server for reliable comparative modeling with distant templates. Nucleic Acids Res 32:W50-W54
Richmond TJ (1984) Solvent accessible surface area and excluded volume in proteins. Analytical equations for overlapping spheres and implications for the hydrophobic effect. J Mol Biol 178:63-89
Rychlewski L, Jaroszewski L, Li W, Godzik A (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 9:232-241
Science Editorial (2005) So much more to know. Science 309:78-102
Seringhaus M, Gerstein M (2007) Chemistry Nobel rich in structure. Science 315:40-41
Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859-883
Skolnick J, Kihara D (2000) Defrosting the frozen approximation: PROSPECTOR - a new approach to threading. Proteins 42:319-331
Soeding J (2005) Protein homology detection by HMM-HMM comparison. Bioinformatics 21:951— 960
Tanaka S, Scheraga HA (1976) Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 9:945-950 Tang CL, Xie L, Koh IY, et al. (2003) On the role of structural information in remote homology detection and sequence alignment: new methods using hybrid sequence profiles. J Mol Biol 334:1043-1062
Tress ML, Jones D, Valencia A (2003) Predicting reliable regions in protein alignments from sequence profiles. J Mol Biol 330:705-718
Venclovas C, Margelevicius M (2005) Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment. Proteins(Suppl 7):99-105
Wallner B, Elofsson A (2005) Pcons5: combining consensus, structural evaluation and fold recognition scores. Bioinformatics 21:4248-4254
Wallner B, Elofsson A (2006) Dentification of correct regions in protein models using structural, alignment, and consensus information. Prot Sei 15:900-913
Westhead DR, Collura VP, Eldridge MD, et al. (1995) Protein fold recognition by threading: comparison of algorithms and analysis of results. Protein Eng 8:1197-1204
Weston J, Elisseeff A, Zhou D, et al. (2004) Protein ranking: from local to global structure in the protein similarity network. PNAS 101:6559-6563
Xia Y, Levitt M (2000) Extracting knowledge-based energy functions from protein structures by error rate minimization. Comparison of methods using lattice model. J Chem Phys 113:9318— 9330
Xu J, Li M, Kim D, et al. (2003) RAPTOR: optimal protein threading by linear programming. J Bioinform Comput Biol 1:95—117
Xu Y, Xu D, Uberbacher EC (1998) An efficient computational method for globally optimal threading. J Comput Biol 5:597-614
Zachariah MA, Crooks GE, Holbrook SR, Brenner SE (2005) A generalized affine gap model significantly improves protein sequence alignment accuracy. Proteins 58:329-338
Zhang Y (2007) Template-based modeling and free modeling by I-TASSER in CASP7. Proteins(Suppl 8): 108-117
Zhang Y, Skolnick J (2005) The protein structure prediction problem could be solved using the current PDB library. Proc Natl Acad Sci USA 102:1029-1034
Zhou H, Zhou Y (2005) Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments. Proteins 58:321-328