Similarity/dissimilarity analysis of protein sequences using the spatial median as a descriptor

PP. 142-148
DOI: 10.4236/jbpc.2012.32016

ABSTRACT

A novel 3-D graphical representation of protein sequences is introduced. A right cone of unit base and unit height is used to represent protein sequences on its surface. The twenty amino acids are represented by 20 circles, and all of a protein's residues are represented by n lines on the cone's surface. All the spots that represent the protein's residues are shown in the cone's top view. The spatial median of these spots is used as a new descriptor of a protein sequence. The approach was applied to two short protein segments from the yeast Saccharomyces cerevisiae. An examination of the similarities/dissimilarities among eight ND5 proteins and among six β-globin proteins illustrates the utility of the approach. A linear correlation and significance analysis is provided to compare our results with the percentage sequence alignment identity.
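To make the descriptor concrete, the following is a minimal sketch of how a spatial median of 2-D spots could be computed. It uses the Weiszfeld iteration, which is an assumption: the abstract does not say which algorithm the paper uses. The function names `spatial_median` and `total_distance` are hypothetical, and the input points are placeholders, since the mapping from residues to spots on the cone is not specified here.

```python
import math


def total_distance(points, center):
    """Sum of Euclidean distances from center to every point."""
    cx, cy = center
    return sum(math.hypot(px - cx, py - cy) for px, py in points)


def spatial_median(points, tol=1e-9, max_iter=1000, eps=1e-12):
    """Approximate the spatial median of 2-D points by Weiszfeld iteration.

    Assumption: this is one standard way to compute a spatial median,
    not necessarily the procedure used in the paper.
    """
    n = len(points)
    # Start from the centroid (arithmetic mean) of the points.
    x = sum(p[0] for p in points) / n
    y = sum(p[1] for p in points) / n
    for _ in range(max_iter):
        wsum = wx = wy = 0.0
        for px, py in points:
            # Clamp the distance so a spot that coincides with the current
            # estimate does not cause a division by zero (a simplification).
            d = max(math.hypot(px - x, py - y), eps)
            w = 1.0 / d
            wsum += w
            wx += w * px
            wy += w * py
        nx, ny = wx / wsum, wy / wsum
        moved = math.hypot(nx - x, ny - y)
        x, y = nx, ny
        if moved < tol:
            break
    return x, y


# Placeholder spots; these are NOT data from the paper.
spots = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (50.0, 50.0)]
median = spatial_median(spots)
centroid = (sum(p[0] for p in spots) / len(spots),
            sum(p[1] for p in spots) / len(spots))
```

Unlike the centroid, the spatial median minimizes the sum of distances to the spots, so a single outlying spot pulls it far less. Two sequences could then be compared, for example, by the distance between their medians, although the abstract does not state which comparison measure the paper uses.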

Share and Cite:

Abo-Elkhier, M.M. (2012) Similarity/dissimilarity analysis of protein sequences using the spatial median as a descriptor. Journal of Biophysical Chemistry, 3, 142-148. doi: 10.4236/jbpc.2012.32016.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.