Pages

Thursday, November 6, 2014

A census of human RNA-binding proteins

Nature Reviews Genetics | Analysis

 Analyses of post-transcriptional gene regulation and the protein factors involved have been substantially driven forward by technological advances such as next-generation sequencing and modern protein mass spectrometry. This Analysis provides a census of 1,542 manually curated RNA-binding proteins, for which the authors have investigated interactions with different classes of RNA, evolutionary conservation, abundance and tissue-specific expression.
 

Abstract
 
Post-transcriptional gene regulation (PTGR) concerns processes involved in the maturation, transport, stability and translation of coding and non-coding RNAs. RNA-binding proteins (RBPs) and ribonucleoproteins coordinate RNA processing and PTGR. The introduction of large-scale quantitative methods, such as next-generation sequencing and modern protein mass spectrometry, has renewed interest in the investigation of PTGR and the protein factors involved at a systems-biology level. Here, we present a census of 1,542 manually curated RBPs that we have analysed for their interactions with different classes of RNA, their evolutionary conservation, their abundance and their tissue-specific expression. Our analysis is a critical step towards the comprehensive characterization of proteins involved in human RNA metabolism. 

Key points

  • Recent advances in next-generation sequencing methods and quantitative mass spectrometry have renewed the interest in RNA biology and the genome-wide investigation of post-transcriptional gene regulatory proteins. A global census that systematically lists the number of factors involved in post-transcriptional gene regulation (PTGR) is currently not available. Here, we provide an overall summary of the proteins involved in interactions with all classes of RNAs based on our current knowledge of PTGR; this will guide future systems-wide studies of PTGR.
  • RNA-binding proteins (RBPs) are evolutionarily deeply conserved, and their structural domains diversified early in evolution.
  • RBPs are among the most abundant proteins in the cell and are generally ubiquitously expressed, which mirrors their central and conserved role in gene regulation.
  • Only ~2% of RBPs are tissue-specific, and most of these are mRNA- and non-coding RNA-binding proteins.
  • Diseases involving RBPs show characteristic phenotypes depending on the type of RNA (for example, mRNA, ribosomal RNA and tRNA) predominantly bound by the RBPs.
  • Correlated expression of RBPs across developmental processes can identify factors in shared PTGR pathways.

Introduction

Post-transcriptional gene regulation (PTGR) is essential to sustain cellular metabolism, coordinating maturation, transport, stability and degradation of all classes of RNAs (Fig. 1). Mechanistically, each of these events is regulated by the formation of different ribonucleoprotein (RNP) complexes with RNA-binding proteins (RBPs) at their core. Initially, it was thought that RNA mainly served either as the template, in the form of mRNA, or as an adaptor or a structural component during protein synthesis, provided by tRNAs and ribosomal RNAs. With the discovery of catalytic RNAs and a multitude of non-coding RNA (ncRNA) species, it was recognized that RNA is a highly versatile molecule that carries out many regulatory functions in the cell, either by acting as a guide to recognize RNA sequence motifs or RNA recognition elements present in their target RNAs, or by functioning as a scaffold and assembly platform for recruiting proteins to act synergistically1. The characterization of the proteins transiently or stably interacting with RNAs is a prerequisite for the dissection of RNA regulatory processes.
Figure 1: Overview of the main post-transcriptional gene regulation pathways in eukaryotes.
An overview is given for the biogenesis, decay and function of the most abundant RNAs: tRNAs, ribosomal RNAs, small nuclear RNAs (snRNAs), small nucleolar RNAs (snoRNAs), mRNAs, microRNAs (miRNAs), PIWI-interacting RNAs (piRNAs) and long non-coding RNAs (lncRNAs). Processes are described from left to right. Referenced gene names and complexes in the figure are listed in Supplementary information S3 (table) and within the listed references. a | tRNAs are transcribed by RNA polymerase III (Pol III); the 5′ leader and 3′ trailer sequences are removed, introns are spliced, and the ends are joined. CCA nucleotides are added to 3′ ends, and nucleotide modifications — such as methylation (M), pseudouridylation (ψ) and deamination of adenosines to inosines (I) — are introduced before tRNA aminoacylation195. b | The 5S rRNA is transcribed by Pol III, whereas 28S, 18S and 5.8S rRNAs are transcribed as one transcript by Pol I. The precursor is processed by RNA exonucleases, endonucleases and the ribonucleoprotein (RNP) RNase MRP, guided by U3 small nucleolar RNP (snoRNP). Nucleotide modifications are introduced by snoRNPs. rRNAs are assembled together with ribosomal proteins into ribosomal precursor complexes in the nucleus and transported to the cytoplasm, where they mature to functional ribosomes92, 196, 197. c | Most snRNAs are transcribed by Pol II, capped and processed in the nucleus. When exported to the cytoplasm, they undergo methylation and assemble with LSM proteins into small nuclear ribonucleic particles (snRNPs) in a process aided by the survival motor neuron 1 (SMN1). These snRNPs are re-imported into the Cajal body (CB) within the nucleus, where they undergo final maturation and snRNP assembly81. U6 and U6atac snRNAs are transcribed by Pol III and are alternatively processed in the nucleus and the nucleolus198. Mature snRNPs form the core of the spliceosome. d | snoRNAs and small Cajal body-specific RNAs (scaRNAs) are processed from mRNA introns, capped and modified before they assemble into snoRNPs or scaRNPs in the CB. snoRNPs and scaRNPs carry out methylation and pseudouridylation of rRNAs, snoRNAs and snRNAs, or function in rRNA processing (for example, processing of U3 snoRNA)81. e | mRNAs are transcribed by Pol II, capped, spliced, edited and polyadenylated in the nucleus. Correctly matured mRNAs are exported into the cytoplasm. Regulatory RNA-binding proteins (RBPs) control correct translation, monitor stability, decay and localization, and shuttle mRNAs between actively translating ribosomes, stress granules and P bodies37, 141, 142, 199, 200, 201, 202. f | miRNAs are either transcribed from separate genes by Pol II as long primary miRNA (pri-miRNA) transcripts or expressed from mRNA introns (mirtrons) and processed into hairpin pre-miRNAs in the nucleus. After transport into the cytoplasm, they are processed into 21-nucleotide-long double-stranded RNAs. One strand is incorporated into Argonaute (AGO) proteins (forming miRNA-containing RNPs (miRNPs)) and guides them to partially complementary target mRNAs to recruit deadenylases and repress translation203. g | piRNAs are ~28-nucleotides-long, germline-specific small RNAs. Primary piRNAs are directly processed and assembled from long, Pol II-transcribed precursor transcripts, whereas secondary piRNAs are generated in the 'ping pong' cycle by the cleavage of complementary transcripts by PIWI proteins. Mature piRNAs are 2′-O-methylated and incorporated into PIWI proteins. The piRNA–PIWI complexes (piRNPs) silence transposable elements (TEs) either by endonucleolytic cleavage in the cytoplasm or through transcriptional silencing at their genomic loci in the nucleus107. h | Most lncRNAs are transcribed and processed in a similar way to mRNAs. Nuclear lncRNAs play an active part in gene regulation by directing proteins to specific gene loci, where they recruit chromatin modification complexes and induce transcriptional silencing or activation185. Other non-coding RNAs (for example, 7SK RNA) regulate transcription elongation rates204 or induce the formation of paraspeckles (PS)205. Cytoplasmic non-coding RNAs can modulate mRNA translation206. i | Incorrectly processed RNAs are recognized by several complexes in the nucleus and cytoplasm that initiate and execute their degradation207, 208. CPSF, cleavage and polyadenylation specificity factor; EJC, exon junction complex; hnRNP, heterogeneous nuclear RNP; NGD, no-go decay; NMD, nonsense-mediated RNA decay; NSD, non-stop decay; PABP, poly(A)-binding protein.
The recent development of large-scale quantitative methods, especially next-generation sequencing and modern protein mass spectrometry2, 3, 4, 5, 6, facilitates genome-wide identification of RBPs, their protein cofactors and their RNA targets. Deep-sequencing approaches using immunoprecipitation of RBPs, with or without in vivo RNA–protein crosslinking (crosslinking and immunoprecipitation followed by sequencing (CLIP–seq) and RNA immunoprecipitation and sequencing (RIP-seq), respectively)2, 3, as well as in vitro evolution methods7, 8, revealed the binding ranges of RBPs and showed that many RBPs bind to thousands of transcripts in cells at defined binding sites.
 
Despite the growing amount of data collected on RBPs, many questions remain to be answered. Researchers still have an incomplete understanding of how binding specificity is achieved and how the regulatory function of an individual RBP is influenced by synergy and competition with other RBPs. We argue that a balanced approach of detailed biochemical and functional studies paired with complex systems-biology methods will ultimately lead to an understanding of the principles underlying PTGR networks.
 
Although much of the published research centres on mRNA-binding proteins (mRBPs) and messenger RNPs, PTGR is not limited to mRNA maturation and regulation; it also includes processes acting on ncRNAs. In this respect, it may not be surprising that, among the ~150 RBPs listed in the Online Mendelian Inheritance in Man (OMIM) database as being linked to human diseases, only one-third are described as directly binding mRNAs; the others target diverse ncRNAs9.
 
Here, we present a census of 1,542 human RBPs that interact with all known classes of RNAs, detail their families and evolutionary conservation across species, and analyse their expression across tissues and their potential roles in developmental processes. This catalogue of RBPs will guide future analyses of RBPs and provide an overview of known RNA pathways and their protein components.
 
......................................................................................................................................................................
...................................................................................................................................................................... 

Conclusions

A census of human RBPs is essential for organizing our current molecular and genetic understanding of the role of RNA in general gene expression and PTGR. This catalogue provides researchers with a newly curated resource to guide their investigations of PTGR processes and to systematically study RBPs. An analogous catalogue that assesses the abundance of all expressed RNAs (that is, the RBP targets) and that classifies them across tissues and cell types is still missing. Such a catalogue would be a useful complementary document to this census.
 
Of the ~20,500 protein-coding genes in humans, we determined that 7.5% are directly involved in RNA metabolism by binding to and/or processing RNA, or by constituting essential components of RNPs. RBPs are structurally diverse and include many distinct classes of RBDs. Indeed, whereas the three most abundant DNA-binding domains account for 80% of all TFs58, the three most abundant RBDs accounted for only 20% of all RBPs in our census. Based on target-RNA categorization, we found that nearly 50% of RBPs acted in mRNA metabolic pathways and 11% constituted ribosomal proteins, while the rest were involved in the diverse number of ncRNA metabolic processes. The target-based categorization of RBPs can assist interpretation of disease phenotypes and mutations emerging from rapidly increasing patient genome sequencing, and may guide future functional studies. When considering abundances, we found that ribosomal proteins and mRBPs were the most abundant RBPs in the cell. Nevertheless, most RBPs were ubiquitously expressed at higher levels than the residual protein-coding transcriptome, and up to 20% of the total expressed protein-coding transcripts encoded RBPs. Therefore, not only is RNA metabolism one of the most conserved cellular processes, but it also has one of the highest protein copy number demands.
 
Many details of PTGR remain to be revealed, including the dissection of newly discovered RNA regulatory processes1, 184, 185. The investigation of PTGR networks is aided by the rapid development of next-generation sequencing-based methods, such as RIP- and CLIP-based methods2, 3, 14, ribosome profiling186, in vivo RNA secondary structure profiling187, 188, 189, small and long RNA-seq6, 190, 191, and 3′-end sequencing methods that profile alternative polyadenylation sites and poly(A) tail lengths158, 192, 193, 194. These studies reveal an unanticipated complexity in RBP binding and targeting, and highlight the need to experimentally dissect PTGR networks in various cellular systems.

Published online

http://www.nature.com/nrg/journal/vaop/ncurrent/full/nrg3813.html?WT.mc_id=FBK_NatureReviews

 



Macrophage subsets in atherosclerosis

Nature Reviews Cardiology

Atherosclerosis is characterized by increased accumulation of macrophages within the vessel wall. In response to stimuli such as modified lipids, cytokines, and senescent erythrocytes present in the atherosclerotic lesion, these macrophages can alter their functional phenotypes. Different macrophage subsets can influence the growth and composition of the atherosclerotic plaque in distinct ways. In this Review, Chinetti-Gbaguidi et al. highlight the diverse range of macrophage phenotypes present in atherosclerotic lesions, and their roles in both plaque progression and stability.
 

Abstract:

Macrophage accumulation within the vascular wall is a hallmark of atherosclerosis. In atherosclerotic lesions, macrophages respond to various environmental stimuli, such as modified lipids, cytokines, and senescent erythrocytes, which can modify their functional phenotypes. The results of studies on human atherosclerotic plaques demonstrate that the relative proportions of macrophage subsets within a plaque might be a better indicator of plaque phenotype and stability than the total number of macrophages. Understanding the function of specific macrophage subsets and their contribution to the composition and growth of atherosclerotic plaques would aid the identification of novel strategies to delay or halt the development of the disease and its associated pathophysiological consequences. However, most studies aimed at characterizing the phenotypes of human macrophages are performed in vitro and, therefore, their functional relevance to human pathology remains uncertain. In this Review, the diverse range of macrophage phenotypes in atherosclerotic lesions and their potential roles in both plaque progression and stability are discussed, with an emphasis on human pathology.

 Key points

  • Only M1 proinflammatory and M2 anti-inflammatory macrophages have been described in vitro—however, a wide spectrum of intermediate phenotypes has been identified in in vivo studies
  • Various stimuli (cytokines, lipids and their derivatives, senescent cells, iron) can influence macrophage phenotypes in atherosclerotic lesions
  • Macrophages with different functional phenotypes are likely to perform different roles in the development of atherosclerosis
  • M1 macrophages are associated with symptomatic and unstable plaques, whereas M2 macrophages are particularly abundant in stable zones of the plaque and asymptomatic lesions
  • Modulation of macrophage phenotypes might be a novel strategy for the pharmacological treatment of atherosclerosis

Introduction


The development of atherosclerosis involves activation of various cell types (including endothelial cells, smooth muscle cells, lymphocytes, monocytes, and macrophages) in the intima of the arteries, which results in a local inflammatory response.1 An increase in circulating LDL-cholesterol levels and the subsequent accumulation of oxidized LDL in the subendothelial space triggers the recruitment and retention of monocytes and lymphocytes in the arterial wall. In the intima, monocytes differentiate into macrophages, which scavenge lipoprotein particles, and eventually become foam cells.2 These macrophage-derived foam cells secrete inflammatory molecules and factors that further promote lipoprotein retention, degrade the extracellular matrix, and sustain inflammation.3, 4
 
Progression of atherosclerosis is characterized by apoptosis of these resident macrophages in the lipid core of the lesion. The clearance of apoptotic cells is mediated by phagocytes, mostly macrophages, which recognize and internalize dead cells in a process termed efferocytosis.5 In early lesions, phagocytes readily clear apoptotic cells, avoiding further progression of atherosclerosis. In chronic, advanced lesions, however, efferocytosis is no longer sufficient to engulf all dead cells, and the gradual accumulation of apoptotic debris results in formation of a necrotic core, which triggers further inflammation, necrosis, and thrombosis.5 Macrophages are crucial in the maintenance of efficient efferocytosis, and thereby contribute to both resolving inflammation and preventing the formation of a necrotic core within the plaque (Figure 1).6
 
Figure 1: Potential role of M2 macrophages in efferocytosis within atherosclerotic plaques.
M2 macrophages localized in areas of neovascularization or outside the lipid core can phagocytose apoptotic M1 macrophages, contributing to the resolution of inflammation. If efferocytosis is insufficient, dead M1 macrophages accumulate and undergo postapoptotic necrosis, leading to the formation of a necrotic core, which contributes to plaque instability and rupture.
Novel observations have challenged these previously well-established concepts. Whereas atherosclerosis was initially considered a type 1 T helper cell (TH1)-driven inflammatory process, the concept of heterogeneity of macrophages resident within lesions has gradually emerged over the past decade.7, 8 Firstly, several studies revealed that both monocytes and macrophages comprise heterogeneous cell populations that adapt their functional phenotype in response to specific microenvironmental signals and molecules.9, 10 These different monocyte and macrophage subtypes can be identified based on their differential expression of surface markers and chemokine receptors.9, 10 Secondly, the general dogma that tissue-resident macrophages are incapable of proliferating was challenged by investigators who observed macrophages in mouse lungs that proliferated independently of monocyte recruitment.11 Both resident and recruited macrophages can be induced to proliferate by IL-4.11 Moreover, in mice, macrophages in early atherosclerotic lesions are predominantly derived from recruited monocytes, whereas macrophage proliferation is a preponderant feature of advanced plaques and is influenced by microenvironment signals.12 Accordingly, differences in resident macrophage phenotypes can also influence their capacity to proliferate locally, which can alter the abundance of macrophages with a given phenotype.11, 13
 
In this Review, we describe and discuss the reported functional phenotypes of macrophages in atherosclerotic plaques, focusing mainly on human pathology. The correlative studies suggesting a link between such macrophage phenotypes and the structure or progression of atherosclerotic lesions will also be discussed.
Figure 2: Main macrophage subtypes found in atherosclerotic lesions.
Stimuli present in atherosclerotic lesions drive the differentiation of monocytes towards different macrophage phenotypes. a | M1 macrophages release proinflammatory cytokines. b | M(Hb), Mhem, and M2 macrophages are resistant to lipid accumulation, possess iron-handling capacities, and have anti-inflammatory effects. c | Mox macrophages display an antioxidant gene expression profile. d | M4 macrophages, like M1 macrophages, are proinflammatory, but lack the capacity for phagocytosis. Abbreviations: COX-2, cyclooxygenase; CXCL4, C-X-C motif chemokine 4; HMOX-1, haem oxygenase (decycling) 1; LDL, low-density lipoprotein; LXR, liver X receptor; MMP-7, matrix metalloproteinase-7; NFE2L2, nuclear factor (erythroid-derived 2)-like 2; NF-κB, nuclear factor kappa-light-chain-enhancer of activated B cells; TLR, toll-like receptor; TNF, tumour necrosis factor.
Figure 3: Localization of macrophage subsets in human atherosclerotic lesions.
M1 macrophages are predominantly found in the plaque shoulder and lipid core, whereas M2 macrophages are most abundant in the adventitia and areas of neovascularization, which also contain iron deposits. Fibrous caps contain similar amounts of both macrophage subsets.

Conclusions

 Macrophages are important in the development of atherosclerotic lesions because they participate in all stages of plaque formation and progression.77 During their lifetime, macrophages are exposed to a plethora of microenvironmental signals and stimuli (including cytokines modified lipids, senescent erythorocytes, and iron) that influence their transcriptional programme and functional phenotype. The intensity of these signals changes during plaque progression, and varies between plaque regions. Therefore, macrophages adapt their phenotype both over time and in response to their physical location, contributing to and modulating plaque progression and composition. The local accumulation of macrophage subpopulations that are highly competent in clearing apoptotic cells (such as M2 macrophages) can sustain efferocytosis, thereby contributing to the resolution of inflammation and prevention of necrotic core formation within plaques.
 
Whether the distinct macrophage phenotypes represent different stages of differentiation of a single population of phenotypically and functionally plastic resident macrophages, or are consequences of the recruitment and differentiation of specific monocyte subpopulations in the lesion, is still a matter of debate. However, despite rapid advances in this field, novel studies and research methods need to be developed to determine whether monocyte subpopulations give rise to specific macrophage phenotypes. Our current knowledge suggests that M2 macrophages, owing to their localization within human plaques and their intrinsic anti-inflammatory properties, are primarily associated with plaque stability. However, one of the most important challenges in this field will be to demonstrate a direct causative link between macrophage phenotype and the structure or progression of atherosclerotic lesions. Given that only correlation studies are feasible in humans, specific mouse models need to be developed.
 
Further identification of novel phenotypic and functional markers, and the use of novel large-scale expression profiling approaches and imaging technology, is expected to lead to an improved definition and understanding of the specific functions of the different macrophage subtypes identified within the plaque. The identification of biological stimuli that can modulate macrophage phenotypes could lead to the development of novel therapeutic approaches for the treatment of atherosclerosis.

Published online

http://www.nature.com/nrcardio/journal/vaop/ncurrent/full/nrcardio.2014.173.html?WT.mc_id=FBK_NatureReviews