GE Hui-hua, HUANG Ke-jun, ZHANG Guang-ya. Amino Acid Signatures of Different Hypersaline Adaptation Proteomes and Their Classification[J]. Journal of Huaqiao University (Natural Science), 2014, 35(2): 169-174. [doi:10.11830/ISSN.1000-5013.2014.02.0169]





Amino Acid Signatures of Different Hypersaline Adaptation Proteomes and Their Classification
GE Hui-hua (葛慧华), HUANG Ke-jun (黄可君), ZHANG Guang-ya (张光亚)
College of Chemical Engineering, Huaqiao University, Xiamen 361021, Fujian, China
Keywords: halophile; non-halophile; proteome; amino acid; support vector machine; discrimination
We selected two halophilic proteomes that rely on different haloadaptation mechanisms (salt-in and salt-out) and compared them with a non-halophilic proteome. The difference between the halophilic (salt-in) proteome and the non-halophilic proteome was more pronounced than that between the halophilic (salt-out) proteome and the non-halophilic proteome. In the halophilic (salt-out) proteome, the proportions of His and of small residues were significantly higher than in the non-halophilic proteome, whereas the proportion of Ala was significantly lower. Both halophilic proteomes nevertheless showed a large excess of acidic over basic amino acids. Based on these results, we introduced a novel support vector machine based on the Pearson VII universal kernel function to classify the three kinds of proteins; the overall prediction accuracy reached 84.1%. This method outperformed support vector machines based on other commonly used kernels, as well as other machine learning algorithms.
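To make the two ingredients named in the abstract concrete, the following is a minimal sketch of an amino acid composition feature vector, an acidic-minus-basic excess, and a Pearson VII universal kernel. It is not the authors' code. The function names, the choice of D/E as acidic and K/R as basic residues, and the default values of `sigma` and `omega` are illustrative assumptions; the kernel uses the commonly cited form K(x, y) = 1 / [1 + (2·||x − y||·sqrt(2^(1/ω) − 1) / σ)^2]^ω.

```python
import math
from collections import Counter

# The 20 standard amino acids, in a fixed order used for the feature vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(sequence):
    """Return the fraction of each standard residue in a protein sequence."""
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def acidic_excess(sequence):
    """Fraction of acidic residues minus fraction of basic residues.

    Assumption: acidic = Asp (D) + Glu (E), basic = Lys (K) + Arg (R).
    """
    comp = dict(zip(AMINO_ACIDS, aa_composition(sequence)))
    return (comp["D"] + comp["E"]) - (comp["K"] + comp["R"])


def puk_kernel(x, y, sigma=1.0, omega=1.0):
    """Pearson VII universal kernel between two equal-length feature vectors.

    sigma controls the peak width and omega the tailing of the peak;
    the defaults here are placeholders, not tuned values.
    """
    dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))
    scaled = 2.0 * dist * math.sqrt(2.0 ** (1.0 / omega) - 1.0) / sigma
    return 1.0 / (1.0 + scaled ** 2) ** omega


if __name__ == "__main__":
    # Two made-up toy sequences, used only to exercise the functions.
    seq_a = "MDEEDLLAEEDG"
    seq_b = "MKRKLLAGKRAG"
    print(acidic_excess(seq_a), acidic_excess(seq_b))
    print(puk_kernel(aa_composition(seq_a), aa_composition(seq_b)))
```

A kernel written this way could be passed as a callable to an SVM implementation that accepts custom kernels after wrapping it to return a full Gram matrix; `sigma` and `omega` would then need to be selected by cross-validation.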




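Received: 2013-03-28
Corresponding author: GE Hui-hua (b. 1979), female, experimentalist; her research focuses on enzyme engineering and molecular dynamics simulation. E-mail: zhgyghh@hqu.edu.cn
Funding: Program for New Century Excellent Talents in Fujian Province Universities (07176C02)
Last Update: 2014-03-20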