-->

Protein Interaction Network

- May 17, 2017

File:The protein interaction network of Treponema pallidum.png ...
photo src: commons.wikimedia.org

Protein-protein interactions (PPIs) are the physical contacts of high specificity established between two or more protein molecules as a result of biochemical events steered by electrostatic forces including the hydrophobic effect. Many are physical contacts with molecular associations between chains that occur in a cell or in a living organism in a specific biomolecular context.

Proteins rarely act alone as their functions tend to be regulated. Many molecular processes within a cell are carried out by molecular machines that are built from a large number of protein components organized by their PPIs. These interactions make up the so-called interactomics of the organism, while aberrant PPIs are the basis of multiple aggregation-related diseases, such as Creutzfeldt-Jakob, Alzheimer's disease, and may lead to cancer.

PPIs have been studied from different perspectives: biochemistry, quantum chemistry, molecular dynamics, signal transduction, among others. All this information enables the creation of large protein interaction networks - similar to metabolic or genetic/epigenetic networks - that empower the current knowledge on biochemical cascades and molecular etiology of disease, as well as the discovery of putative protein targets of therapeutic interest.


File:Protein Interaction Network for TMEM8A.png - Wikimedia Commons
photo src: commons.wikimedia.org


Maps, Directions, and Place Reviews



Examples

Signal transduction

The activity of the cell is regulated by extracellular signals. Signals propagation to inside and/or along the interior of cells depends on PPIs between the various signaling molecules. The recruitment of signaling pathways through PPIs is called signal transduction and plays a fundamental role in many biological processes and in many diseases including Parkinson's disease and cancer.

Transport across membranes

A protein may be carrying another protein (for example, from cytoplasm to nucleus or vice versa in the case of the nuclear pore importins).

Cell metabolism

In many biosynthetic processes enzymes interact with each other to produce small compounds or other macromolecules.

Muscle contraction

Physiology of muscle contraction involves several interactions. Myosin filaments act as molecular motors and by binding to actin enables filament sliding. Furthermore, members of the skeletal muscle lipid droplet-associated proteins family associate with other proteins, as activator of adipose triglyceride lipase and its coactivator comparative gene identification-58, to regulate lipolysis in skeletal muscle.


Protein Interaction Network Video



Types

To describe the types of protein-protein interactions (PPIs) it is important to consider that proteins can interact in a "transient" way (to produce some specific effect in a short time) or to interact with other proteins in a "stable" way to build multiprotein complexes that are molecular machines within the living systems. A protein complex assembly can result in the formation of homo-oligomeric or hetero-oligomeric complexes. In addition to the conventional complexes, as enzyme-inhibitor and antibody-antigen, interactions can also be established between domain-domain and domain-peptide. Another important distinction to identify protein-protein interactions is the way they have been determined, since there are techniques that measure direct physical interactions between protein pairs, named "binary" methods, while there are other techniques that measure physical interactions among groups of proteins, without pairwise determination of protein partners, named "co-complex" methods.

Homo-oligomers vs. hetero-oligomers

Homo-oligomers are macromolecular complexes constituted by only one type of protein subunit. Protein subunits assembly is guided by the establishment of non-covalent interactions in the quaternary structure of the protein. Disruption of homo-oligomers in order to return to the initial individual monomers often requires denaturation of the complex. Several enzymes, carrier proteins, scaffolding proteins, and transcriptional regulatory factors carry out their functions as homo-oligomers. Distinct protein subunits interact in hetero-oligomers, which are essential to control several cellular functions. The importance of the communication between heterologous proteins is even more evident during cell signaling events and such interactions are only possible due to structural domains within the proteins (as described below).

Stable interactions vs. transient interactions

Stable interactions involve proteins that interact for a long time, taking part of permanent complexes as subunits, in order to carry out structural or functional roles. These are usually the case of homo-oligomers (e.g. cytochrome c), and some hetero-oligomeric proteins, as the subunits of ATPase. On the other hand, a protein may interact briefly and in a reversible manner with other proteins in only certain cellular contexts - cell type, cell cycle stage, external factors, presence of other binding proteins, etc. - as it happens with most of the proteins involved in biochemical cascades. These are called transient interactions. For example, some G protein-coupled receptors only transiently bind to Gi/o proteins when they are activated by extracellular ligands, while some Gq-coupled receptors, such as muscarinic receptor M3, pre-couple with Gq proteins prior to the receptor-ligand binding. Interactions between intrinsically disordered protein regions to globular protein domains (i.e. MoRFs) are transient interactions.

Covalent vs. non-covalent

Covalent interactions are those with the strongest association and are formed by disulphide bonds or electron sharing. Although being rare, these interactions are determinant in some posttranslational modifications, as ubiquitination and SUMOylation. Non-covalent bonds are usually established during transient interactions by the combination of weaker bonds, such as hydrogen bonds, ionic interactions, Van der Waals forces, or hydrophobic bonds.

Role of water

Water molecules play a significant role in the interactions between proteins. The crystal structures of complexes, obtained at high resolution from different but homologous proteins, have shown that some interface water molecules are conserved between homologous complexes. The majority of the interface water molecules make hydrogen bonds with both partners of each complex. Some interface amino acid residues or atomic groups of one protein partner engage in both direct and water mediated interactions with the other protein partner. Doubly indirect interactions, mediated by two water molecules, are more numerous in the homologous complexes of low affinity. Carefully conducted mutagenesis experiments, e.g. changing a tyrosine residue into a phenylalanine, have shown that water mediated interactions can contribute to the energy of interaction. Thus, water molecules may facilitate the interactions and cross-recognitions between proteins.


Functional and proteinâ€'protein interaction network analysis of ...
photo src: www.spandidos-publications.com


Structure

The molecular structures of many protein complexes have been unlocked by the technique of X-ray crystallography. The first structure to be solved by this method was that of sperm whale myoglobin by Sir John Cowdery Kendrew. In this technique the angles and intensities of a beam of X-rays diffracted by crystalline atoms are detected in a film, thus producing a three-dimensional picture of the density of electrons within the crystal.

Later, nuclear magnetic resonance also started to be applied with the aim of unravelling the molecular structure of protein complexes. One of the first examples was the structure of calmodulin-binding domains bound to calmodulin. This technique is based on the study of magnetic properties of atomic nuclei, thus determining physical and chemical properties of the correspondent atoms or the molecules. Nuclear magnetic resonance is advantageous for characterizing weak PPIs.

Domains

Proteins hold structural domains that allow their interaction with and bind to specific sequences on other proteins:

  • Src homology 2 (SH2) domain
  • Src homology 3 (SH3) domain
  • Phosphotyrosine-binding (PTB) domain
  • LIM domain
  • Sterile alpha motif (SAM) domain
  • PDZ domain
  • FERM domain
  • Calponin homology (CH) domain
  • Pleckstrin homology domain
  • WW domain
  • WSxWS motif

Apoptosis database
photo src: sbi.imim.es


Properties of the interface

The study of the molecular structure can give fine details about the interface that enables the interaction between proteins. When characterizing PPI interfaces it is important to take into account the type of complex.

Parameters evaluated include size (measured in absolute dimensions Å2 or in solvent-accessible surface area (SASA)), shape, complementarity between surfaces, residue interface propensities, hydrophobicity, segmentation and secondary structure, and conformational changes on complex formation.

The great majority of PPI interfaces reflects the composition of protein surfaces, rather than the protein cores, in spite of being frequently enriched in hydrophobic residues, particularly in aromatic residues. PPI interfaces are dynamic and frequently planar, although they can be globular and protruding as well. Based on three structures - insulin dimer, trypsin-pancreatic trypsin inhibitor complex, and oxyhaemoglobin - Cyrus Chothia and Joel Janin found that between 1,130 and 1,720 Å2 of surface area was removed from contact with water indicating that hydrophobicity is a major factor of stabilization of PPIs. Later studies refined the buried surface area of the majority of interactions to 1,600±350 Å2. However, much larger interaction interfaces were also observed and were associated with significant changes in conformation of one of the interaction partners. PPIs interfaces exhibit both shape and electrostatic complementarity.


Integrated analysis of differential gene expression profiles in ...
photo src: www.spandidos-publications.com


Regulation

  • Protein concentration, which in turn are affected by expression levels and degradation rates;
  • Protein affinity for proteins or other binding ligands;
  • Ligands concentrations (substrates, ions, etc.);
  • Presence of other proteins, nucleic acids, and ions;
  • Electric fields around proteins.
  • Occurrence of covalent modifications;

HOME
photo src: www.iscb.org


Measurement

There are a multitude of methods to detect them. Each of the approaches has its own strengths and weaknesses, especially with regard to the sensitivity and specificity of the method. The most conventional and widely used high-throughput methods are yeast two-hybrid screening and affinity purification coupled to mass spectrometry.

Yeast two-hybrid screening

This system was firstly described in 1989 by Fields and Song using Saccharomyces cerevisiae as biological model. Yeast two hybrid allows the identification of pairwise PPIs (binary method) in vivo, indicating non-specific tendencies towards sticky interactions.

Yeast cells are transfected with two plasmids: the bait (protein of interest fused with the DNA-binding domain of a yeast transcription factor, like Gal4), and the prey (a library of cDNA fragments linked to the activation domain of the transcription factor. Transcription of reporter genes does not occur unless bait and prey interact with each other and form a functional transcription factor. Thus, the interaction between proteins can be inferred by the presence of the products resultant of the reporter gene expression.

Despite its usefulness, the yeast two-hybrid system has limitations: specificity is relatively low; uses yeast as main host system, which can be a problem when studying other biological models; the number of PPIs identified is usually low because some transient PPIs are lost during purification steps; and, understates membrane proteins, for example. Limitations have been overcoming by the emergence of yeast two-hybrid variants, such as the membrane yeast two-hybrid (MYTH) and the split-ubiquitin system, which are not limited to interactions that occur in the nucleus; and, the bacterial two-hybrid system, performed in bacteria;

Affinity purification coupled to mass spectrometry

Affinity purification coupled to mass spectrometry mostly detects stable interactions and thus better indicates functional in vivo PPIs. This method starts by purification of the tagged protein, which is expressed in the cell usually at in vivo concentrations, and its interacting proteins (affinity purification). One of the most advantageous and widely used method to purify proteins with very low contaminating background is the tandem affinity purification, developed by Bertrand Seraphin and Mathias Mann and respective colleagues. PPIs can then be quantitatively and qualitatively analysed by mass spectrometry using different methods: chemical incorporation, biological or metabolic incorporation (SILAC), and label-free methods.

Other potential methods

Diverse techniques to identify PPIs have been emerging along with technology progression. These include co-immunoprecipitation, protein microarrays, analytical ultracentrifugation, light scattering, fluorescence spectroscopy, luminescence-based mammalian interactome mapping (LUMIER), resonance-energy transfer systems, mammalian protein-protein interaction trap, electro-switchable biosurfaces, protein-fragment complementation assay, as well as real-time label-free measurements by surface plasmon resonance, and calorimetry.

Text mining methods

Publicly available information from biomedical research is readily accessible through the internet and is becoming a powerful resource for predictive protein-protein interactions and protein docking. Text mining is much less time costly and consuming compared to other high-throughput techniques. Currently, these methods generally detect binary relations between interacting protein from individual sentences using machine learning and rule/pattern-based information extraction and machine learning approaches. A wide variety of text mining predicting PPIs applications are available for public use, as well as repositories which often stores manually validated and/or computationally predicted PPIs. The principal stages of text mining divides the implementation into two stages: information retrieval, where literature abstracts containing names of either or both proteins complexes are selected and information extraction, where detecting occurrences of residues are retrieved. The extraction is automated by searching for co-existing sentences, abstracts or paragraphs within textual context.

There are also studies using phylogenetic profiling, basing their functionalities on the theory that proteins involved in common pathways co-evolve in a correlated fashion across large number of species. More complex text mining methodologies use advanced dictionaries and generate networks by Natural Language Processing (NLP) of text, considering gene names as nodes and verbs as edges, other developments involve kernel methods to predict protein interactions.

Machine learning methods

These methods use machine learning to distinguish how interacting protein pairs differ from non-interacting protein pairs in terms of pairwise features such as cellular colocalization, gene co-expression, how closely located on a DNA are the genes that encode the two proteins, and so on. Random Forest has been found to be most-effective machine learning method for protein interaction prediction. Such methods have been applied for discovering protein interactions on human interactome, specifically the interactome of Membrane proteins and the interactome of Schizophrenia-associated proteins.


Identification of therapeutic targets for breast cancer using ...
photo src: www.spandidos-publications.com


Databases

Large scale identification of PPIs generated hundreds of thousands interactions, which were collected together in specialized biological databases that are continuously updated in order to provide complete interactomes. The first of these databases was the Database of Interacting Proteins (DIP). Since that time, the number of public databases has been increasing. Databases can be subdivided into primary databases, meta-databases, and prediction databases.

Primary databases collect information about published PPIs proven to exist via small-scale or large-scale experimental methods. Examples: DIP, Biomolecular Interaction Network Database (BIND), Biological General Repository for Interaction Datasets (BioGRID), Human Protein Reference Database (HPRD), IntAct Molecular Interaction Database, Molecular Interactions Database (MINT), MIPS Protein Interaction Resource on Yeast (MIPS-MPact), and MIPS Mammalian Protein-Protein Interaction Database (MIPS-MPPI).

Meta-databases normally result from the integration of primary databases information, but can also collect some original data. Examples: Agile Protein Interactomes Dataserver (APID), The Microbial Protein Interaction Database (MPIDB), and Protein Interaction Network Analysis (PINA) platform, (GPS-Prot).

Prediction databases include many PPIs that are predicted using several techniques (main article). Examples: Human Protein-Protein Interaction Prediction Database (PIPs), Interlogous Interaction Database (I2D), Known and Predicted Protein-Protein Interactions, and Unified Human Interactive (UniHI).


Functional and proteinâ€'protein interaction network analysis of ...
photo src: www.spandidos-publications.com


Interaction networks

Information found in PPIs databases supports the construction of interaction networks. Although the PPI network of a given query protein can be represented in textbooks, diagrams of whole cell PPIs are frankly complex and difficult to generate.

One example of a manually produced molecular interaction map is the Kurt Kohn's 1999 map of cell cycle control. Drawing on Kohn's map, Schwikowski et al. in 2000 published a paper on PPIs in yeast, linking 1,548 interacting proteins determined by two-hybrid screening. They used a layered graph drawing method to find an initial placement of the nodes and then improved the layout using a force-based algorithm.

Bioinformatic tools have been developed to simplify the difficult task of visualizing molecular interaction networks and complement them with other types of data. For instance, Cytoscape is an open-source software widely used and lots of plugins are currently available. Pajek software is advantageous for the visualization and analysis of very large networks.

Identification of functional modules in PPI networks is an important challenge in bioinformatics. Functional modules means a set of proteins that are highly connected to each other in PPI network. It is almost similar problem as community detection in social networks. There are some methods such as Jactive modules and MoBaS. Jactive modules integrate PPI network and gene expression data where as MoBaS integrate PPI network and Genome Wide association Studies.

The awareness of the major roles of PPIs in numerous physiological and pathological processes has been driving the challenge of unravel many interactomes. Examples of published interactomes are the thyroid specific DREAM interactome and the PP1? interactome in human brain.

Protein-protein relationships are often the result of multiple types of interactions or are deduced from different approaches, including co-localization, direct interaction, suppressive genetic interaction, additive genetic interaction, physical association, and other associations.

Signed interaction networks

Protein-protein interactions often result in one of the interacting proteins either being 'activated' or 'repressed'. Such effects can be indicated in a PPI network by "signs" (e.g. "activation" or "inhibition"). Although such attributes have been added to networks for a long time, Vinayagam et al. (2014) coined the term Signed network for them. Signed networks are often expressed by labeling the interaction as either positive or negative. A positive interaction is one where the interaction results in one of the proteins being activated. Conversely a negative interaction indicates that one of the proteins being inactivated.

Protein-protein interaction networks are often constructed as a result of lab experiments such as yeast two hybrid screens or 'affinity purification and subsequent mass spectrometry techniques. However these methods do not provide the layer of information needed in order to determine what type of interaction is present in order to be able to attribute signs to the network diagrams.

RNA interference screens

RNA interference (RNAi) screens (repression of individual proteins between transcription and translation) are one method that can be utilized in the process of providing signs to the protein-protein interactions. Individual proteins are repressed and the resulting phenotypes are analyzed. A correlating phenotypic relationship (i.e. where the inhibition of either of two proteins results in the same phenotype) indicates a positive, or activating relationship. Phenotypes that do not correlate (i.e. where the inhibition of either of two proteins results in two different phenotypes) indicate a negative or inactivating relationship. If protein A is dependent on protein B for activation then the inhibition of either protein A or B will result in a cell losing the service that is provided by protein A and the phenotypes will be the same for the inhibition of either A or B. If, however, protein A is inactivated by protein B then the phenotypes will differ depending on which protein is inhibited (inhibit protein B and it can no longer inactivate protein A leaving A active however inactivate A and there is nothing for B to activate since A is inactive and the phenotype changes). Multiple RNAi screens need to be performed in order to reliably appoint a sign to a given protein-protein interaction. Vinayagam et al. who devised this technique state that a minimum of nine RNAi screens are required with confidence increasing as one carries out more screens.


Cytoscape App Store - PINA4MS
photo src: apps.cytoscape.org


As therapeutic targets

Modulation of PPI is challenging and is receiving increasing attention by the scientific community. Several properties of PPI such as allosteric sites and hotspots, have been incorporated into drug-design strategies. The relevance of PPI as putative therapeutic targets for the development of new treatments is particularly evident in cancer, with several ongoing clinical trials within this area. The consensus among these promising targets is, nonetheless, denoted in the already available drugs on the market to treat a multitude of diseases. Examples are Titrobifan, inhibitor of the glycoprotein IIb/IIIa, used as a cardiovascular drug, and Maraviroc, inhibitor of the CCR5-gp120 interaction, used as anti-HIV drug. Recently, Amit Jaiswal and others were able to develop 30 peptides using protein-protein interaction studies to inhibit telomerase recruitment towards telomeres.

Source of the article : Wikipedia



EmoticonEmoticon

 

Start typing and press Enter to search