2005 Technical Reports

Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks

Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang

Technical Report CS-2005-13

York University

October 2005


Motivation: One of the purposes of studying and analyzing protein-protein interaction networks (PINs) is to identify new protein complexes that guide the workings of a cell. Clustering algorithms for PIN data presented in the literature often ignore the layered structure of protein complexes and instead produce a flat clustering.

Results: We propose MULIC, a clustering algorithm that produces layered clusters of PIN data. We applied MULIC clustering to five PINs, including three yeast PINs. MULIC clusters correlate with known protein complexes in the MIPS database. For example, a large cluster of 79 proteins significantly overlaps with a known complex of 88 proteins.

Conclusions: MULIC clustering can assist in predicting protein complexes. Given the layered structure of the MULIC clusters, the proteins in top layers tend to be more representative of protein complexes than proteins in bottom layers. Lab experiments aimed at finding an unknown complex or determining the potential effects of a drug can start with the proteins in top layers and later move to the bottom layers of clusters.

Keywords: Clustering, multiple layer, protein interaction network, complex.

Notice: The work presented in the paper above is covered by pending patents and copyright. Publication of this paper does not grant rights to any intellectual property. All rights reserved.


The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.