◎電機所專題英文演講(6月11日)-演講者:頂D廉教授(中央研究院資訊科學研究所)◎

  • 國立臺北大學 電機工程學系
  • 六月 9, 2009

主講人:頂D廉(中央研究院資訊科學研究所)
講題:Predicting the Positions of Proteins in the Cell through Document Classification Techniques.
時間:6月11日(星期四)下午13:30
地點:人文大樓11樓會議室
摘要:
Prediction of protein subcellular localization (PSL) is important for genome annotation, protein function prediction, and drug discovery. Many computational approaches for PSL prediction based on protein sequences have been proposed in recent years including expert system, k-nearest neighbors, artificial neural networks, support vector machines, and Bayesian networks. In this talk we shall describe PSLDoc, a method based on gapped-dipeptides and probabilistic latent semantic analysis (PLSA) to solve this problem. A protein is considered as a term string composed by gapped-dipeptides, which are defined as any two residues separated by one or more positions. The weighting scheme of gapped-dipeptides is calculated according to a position specific score matrix, which includes sequence evolutionary information. Then, PLSA is applied for feature reduction, and reduced vectors are input to five one-versus-rest support vector machine classifiers. Our approach compares favorably with all other approaches and demonstrates that the specific feature representation for proteins can be successfully applied to the prediction of protein subcellular localization and improves prediction accuracy.

歡迎有興趣的同學踴躍參加!

電機工程研究所敬邀