High-throughput inference of protein-protein interfaces from unassigned NMR data

Ramgopal R. Mettu, Ryan H. Lilien, Bruce Randall Donald

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

1 Citation (Scopus)

Abstract

Summary: We cast the problem of identifying protein-protein interfaces, using only unassigned NMR spectra, into a geometric clustering problem. Identifying protein-protein interfaces is critical to understanding inter- and intra-cellular communication, and NMR allows the study of protein interaction in solution. However it is often the case that NMR studies of a protein complex are very time-consuming, mainly due to the bottleneck in assigning the chemical shifts, even if the apo structures of the constituent proteins are known. We study whether it is possible, in a high-throughput manner, to identify the interface region of a protein complex using only unassigned chemical shifts and residual dipolar coupling (RDC) data. We introduce a geometric optimization problem where we must cluster the cells in an arrangement on the boundary of a 3-manifold. The arrangement is induced by a spherical quadratic form, which in turn is parameterized by a SO(3)xR2. We show that this formalism derives directly from the physics of RDCs. We present an optimal algorithm for this problem that runs in O(n3log n time for an n-residue protein. We then use this clustering algorithm as a subroutine in a practical algorithm for identifying the interface region of a protein complex from unassigned NMR data. We present the results of our algorithm on NMR data for seven proteins from five protein complexes, and show that our approach is useful for high-throughput applications in which we seek to rapidly identify the interface region of a protein complex.

Original languageEnglish
Pages (from-to)i292-i301
JournalBioinformatics
Volume21
Issue numberSUPPL. 1
DOIs
Publication statusPublished - Jun 2005
Externally publishedYes

Fingerprint

Dive into the research topics of 'High-throughput inference of protein-protein interfaces from unassigned NMR data'. Together they form a unique fingerprint.

Cite this