参考文献/References:
[1] ARTILES J,GONZALO J,VERDEJO F.A testbed for people searching strategies in the WWW[C]//Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.Piscataway:ACM,2005:569-570.
[2] BAGGA A,BALDWIN B.Entity-based cross-document coreferencing using the vector space model[C]//Proceedings of the 17th International Conference on Computational Linguistics.Boston:Association for Computational Linguistics,1998:79-85.
[3] MANN G S,YAROWSKY D.Unsupervised personal name disambiguation[C]//Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL.Edmonton:Association for Computational Linguistics,2003:33-40.
[4] PEDERSEN T,PURANDARE A,KULKARNI A.Name discrimination by clustering similar contexts[C]//Computational Linguistics and Intelligent Text Processing.Berlin:Springer Berlin Heidelberg,2005:226-237.
[5] CHEN Y,MARTUB J.Towards robust unsupervised personal name disambiguation[C]//EMNLP-CoNLL.Washington D C:IEEE Press,2007:190-198.
[6] IKEDA M,ONO S,SATO I,et al.Person name disambiguation on the web by two-stage clustering[C]//2nd Web People Search Evaluation Workshop.New York:Association for Computing Machinery,2009:33-38.
[7] YANG Xia, JIN Peng, XIANG Wei.Exploring word similarity to improve Chinese personal name disambiguation[C]//Web Intelligence and Intelligent Agent Technology.Washington D C:IEEE Press,2011:197-200.
[8] SALTON G,WONG A,YANG C S.A vector space model for automatic indexing[J].Communications of the ACM,1975,18(11):613-620.
[9] 董振东,董强.知网简介[EB/OL][2014-03-16] .http://www.keenage.com.
[10] 刘群,李素建.基于《知网》的词汇语义相似度计算[J].中文计算语言学,2002,7(2):59-76.
[11] WAGNER R A,FISCHER M J.The string-to-string correction problem[J].Journal of the ACM(JACM),1974,21(1):168-173.
[12] HIRSCHBERG D S.A linear space algorithm for computing maximal common subsequences[J].Communications of the ACM,1975,18(6):341-343.
[13] 施聪莺,徐朝军,杨晓江.TFIDF 算法研究综述[J].计算机应用,2009,29(B6):167-170.
[14] HIRSCHDERG D S.Algorithms for the longest common subsequence problem[J].Journal of the ACMWeb Intelligence and Intelligent Agent Technology.Washington D C:IEEE Press,1977,24(4):664-675.
[15] 全方磊.数据特征提取在高铁车地传输中的应用研究[D].杭州:浙江大学,2013:39-40.
[16] 牛永洁,张成.多种字符串相似度算法的比较研究[J].计算机与数字工程,2012,40(3):14-17.
[17] 张鑫.人名消歧关键技术研究与实现[D].哈尔滨:哈尔滨工业大学,2012:32-33.