- Open Access
Functions and properties of nuclear lncRNAs—from systematically mapping the interactomes of lncRNAs
Journal of Biomedical Science volume 27, Article number: 44 (2020)
Protein and DNA have long been considered the major components of chromatin. Beyond these, an increasing number of studies show that RNA makes up a substantial fraction of chromatin and acts as a regulator of nuclear architecture. A significant fraction of long non-coding RNAs (lncRNAs) is retained in the nucleus and cooperates with protein complexes to modulate epigenetic regulation, phase separation, compartment formation, and nuclear organization. An RNA strand can also invade double-stranded DNA to form RNA:DNA hybrids (R-loops) in living cells, contributing to the regulation of gene expression and to genomic instability. In this review, we discuss how nuclear lncRNAs orchestrate cellular processes through their interactions with proteins and DNA, and we summarize recent genome-wide techniques for studying lncRNA functions by revealing their interactomes in vivo.
Over the past decade, intensive studies have demonstrated that long non-coding RNAs (lncRNAs) are key regulators of embryonic development, DNA damage responses, and human diseases such as neuronal disorders, heart dysfunction, and cancers [5, 6]. LncRNAs can act as effectors that direct biological processes. Compared with DNA, RNA is more mobile and flexible, can fold into distinct structures, and can interact with DNA or other RNAs by base pairing. Revealing the molecular mechanisms by which lncRNAs regulate these biological functions relies on studying their interactions with DNA and proteins. Several methodologies have been developed to systematically identify RNA-chromatin, RNA-protein, and RNA-RNA interactions in vivo, thus uncovering the protein-RNA-DNA network (Table 1).
Chromatin associated RNAs
The first studies showing that RNA constitutes a significant fraction of chromatin date back more than 50 years [41, 42]. Later studies supported the idea that the nuclear matrix of an interphase nucleus is made of insoluble proteins and RNA. RNase A treatment leads to clumping of chromatin onto the nuclear lamina and nucleolus [44, 45], indicating an indispensable role for RNA in nuclear architecture. However, before the 1990s, attributing a specific nuclear phenotype to a specific RNA faced technical challenges. More recently, a burst of studies has used next-generation sequencing to characterize the interactions between lncRNAs and chromatin. Together with lncRNA ablation experiments, accumulating evidence suggests that RNA-protein complexes carry out various functions, such as the formation of chromatin compartments, gene regulation, and inter- or intra-chromosomal interactions [44, 45] (Fig. 1).
It has been reported that the CCCTC-binding transcription factor (CTCF), which promotes chromatin loop formation, binds to thousands of RNAs. Depletion of the steroid receptor RNA activator (SRA), a noncoding RNA associated with CTCF, reduced CTCF-mediated insulator activity at the IGF2/H19 imprinting control region and increased IGF2 expression. It was also reported that the lncRNA Jpx activates Xist expression by evicting CTCF from the Xist promoter. These results imply that nuclear lncRNAs cooperate with RNA-binding proteins to regulate gene expression, perhaps by modulating chromatin conformation.
It has been proposed that lncRNA transcription guides genome organization. For example, Oakes et al. demonstrated that inhibition of rRNA transcription leads to nucleolus disassembly. Inducing the transcription of rRNA genes inserted at new chromosomal locations can generate nucleolus-like structures. These results suggest that rRNA transcription could be responsible for nucleolus organization. More recently, Nozawa et al. suggested that chromatin-associated RNAs can form a chromatin mesh with scaffold attachment factor A (SAF-A), also known as heterogeneous nuclear ribonucleoprotein U (HNRNP-U). They showed that SAF-A oligomerization, which depends on ATP and RNA, drives chromatin decompaction, whereas its monomerization compacts large-scale chromatin organization. The global change in large-scale chromatin caused by SAF-A depletion did not dramatically alter gene expression; nevertheless, it resulted in excessive DNA damage and genomic instability.
Xist RNA regulates gene silencing and chromatin conformation
Xist RNA is one of the best-studied lncRNAs involved in epigenetic regulation and gene silencing. It is a 17-kb lncRNA expressed from the inactive X chromosome (Xi) that coats the entire X chromosome and represses gene expression by recruiting repressive complexes such as the polycomb complexes PRC1 and PRC2 [54,55,56]. In addition, Xist plays an important role in three-dimensional (3D) conformation and maintains the heterochromatin structure of the Xi (Fig. 2). When Xist was depleted, topologically associated domains (TADs) were restored in cis and the Xi adopted a conformation resembling that of the active X chromosome (Xa). Lee et al. also showed that Xist repels positive chromatin factors such as BRG1 (also known as SMARCA4) and cohesin from the Xi [20, 57] to prevent the formation of TADs and chromatin superloops. At a higher order of chromatin structure, mammalian chromosomes are organized into alternating “A/B compartments” in 3D space. Spatial compartments usually contain chromatin of similar states, with A compartments being actively transcribed and gene-rich, and B compartments being transcriptionally inactive and gene-poor [58, 59]. Remarkably, ablating SMCHD1 (an architectural protein) revealed another layer of compartments, S1/S2, in the Xi. These results indicate that SMCHD1 binds to Xist and facilitates the folding of the S1/S2 compartments into a compartment-less (more compact) structure in the Xi (Fig. 2). Altogether, robust evidence shows that a lncRNA can regulate gene expression and 3D chromatin conformation through its ability to recruit epigenetic factors, or to act as a decoy or a scaffold for protein complexes.
Enhancer RNAs promote chromatin looping
Enhancer RNAs were first described in two early studies that used genome-wide sequencing approaches such as RNA-seq and ChIP-seq [61, 62]. These studies demonstrated that enhancers are occupied by RNA polymerase II (RNAP II) and transcribed into a class of ncRNAs termed enhancer RNAs (eRNAs). The epigenetic features of enhancers include histone 3 lysine 4 monomethylation (H3K4me1), histone 3 lysine 27 acetylation (H3K27ac), histone variants (H2AZ, H3.3), co-activators (mediator complex), and an open-chromatin architecture (DNase I hypersensitivity). The expression of eRNAs can be regulated by various stimuli, such as estrogen or androgen acting through the estrogen receptor (ER) or androgen receptor (AR) [64,65,66]. It is generally believed that eRNAs are unstable: they are transcribed quickly after induction and degraded rapidly. Several lines of evidence suggest that eRNAs can promote enhancer-promoter looping or facilitate RNAP II loading, thus upregulating their target genes [65, 67]. For example, the gonadotropin hormone α-subunit gene is regulated by an eRNA in a cell-type-specific manner [68, 69]. Depletion of this eRNA results in a loss of interactions between the enhancers and the promoter of the gonadotropin gene. These results further support the idea that lncRNAs can direct chromatin looping.
lncRNAs mediate interchromosomal interactions
Homologous chromosome pairing rarely occurs in somatic cells under normal growth conditions with only a few exceptions. Transvection was first observed in 1908 in Drosophila, where homologous chromosomes were closely paired in somatic nuclei . It is an epigenetic phenomenon that can lead to either gene activation or repression. X chromosome pairing is one of the best-known examples of somatic homologous pairing in mammals . Tsix, a lncRNA, is highly expressed in undifferentiated ES cells, and antagonizes Xist action. During ES differentiation in female cells, transient homologous pairing occurs, and the Xist RNA expression increases to initiate chromosome-wide silencing. It was reported that Tsix RNA mediates the homologous pairing between two X chromosomes, thus breaking epigenetic symmetry between the two X chromosomes, as well as driving the random choices for the selection of inactive or active X chromosomes [72, 73]. Moreover, another lncRNA derived from the ends of sex chromosomes, dubbed PAR-TERRA (telomeric repeat-containing RNA), facilitates the pairing by clustering the ends of the sex chromosomes, and creates a hub to constrain the DNA loci in 3D space . Not limited to X chromosomes, several studies indicated that somatic allelic pairing also occurs in a number of autosomal loci, such as Oct4 and various cytokine genes [74,75,76,77]. Although how interchromosomal pairing impacts gene expression and epigenetic regulation remains elusive, it has been proposed that the alignment of the two homologous alleles could allow bi-allelically bound transcription factors to redistribute onto one allele to achieve their lowest free energy state (the aggregated state) [72, 78, 79], thus inducing the transition from biallelic to monoallelic expression.
Methods for RNA-chromatin interactions
To map RNA-associated chromatin in vivo, Chu et al. and Simon et al. used biotinylated antisense DNA oligo probes to capture a specific chromatin-associated RNA [7, 8] (Table 1 and Fig. 3). Their methods are named ChIRP-seq (Chromatin isolation by RNA purification) and CHART-seq (Capture Hybridization Analysis of RNA Targets), respectively. Cells are first fixed with either glutaraldehyde (ChIRP) or formaldehyde (CHART), and chromatin is sheared into small pieces by sonication. To minimize the noise caused by non-specific interactions of the DNA probes with chromatin DNA or proteins, CHART elutes the RNA-chromatin complexes with RNase H, which specifically degrades the RNA strand of RNA-DNA hybrids. This ensures that only lncRNA complexes hybridized to the DNA probes are eluted. Later, a hybrid method called CHIRT-seq, which combines glutaraldehyde fixation with RNase H elution, was developed to identify the genomic binding sites of TERRA RNA. ChIRP-seq has since been used to determine the genomic binding sites of many lncRNAs, including NEAT1, MALAT1, HOTAIR, MEG3, TERC, and LTR ERV-9 [80, 81]. CHART-seq has also successfully identified the genomic binding sites of Xist, NEAT1, and MALAT1 [8, 9].
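The antisense-probe principle shared by ChIRP and CHART can be sketched in a few lines of Python. This is only an illustration, not the published probe-design procedure: the probe length, the spacing between probes, and the function names are assumptions made for the example, and real designs also filter probes for specificity and repeats.

```python
# Watson-Crick complement of each RNA base, written as DNA.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}

def antisense_dna_probe(rna_segment: str) -> str:
    """Return the DNA oligo (5'->3') that base-pairs with an RNA segment."""
    return "".join(COMPLEMENT[base] for base in reversed(rna_segment))

def tile_probes(rna: str, probe_len: int = 20, step: int = 100) -> list:
    """Tile antisense probes along an RNA.

    probe_len and step are illustrative defaults, not values taken from
    the ChIRP or CHART protocols.
    """
    last_start = len(rna) - probe_len
    return [antisense_dna_probe(rna[i:i + probe_len])
            for i in range(0, last_start + 1, step)]

# Toy example: two non-overlapping 4-nt probes on an 8-nt RNA.
probes = tile_probes("AUGCAUGC", probe_len=4, step=4)
```

In practice each probe would carry a biotin so that the hybridized RNA and its associated chromatin can be retrieved on streptavidin beads.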
Recently, several methods were developed to reveal all interactions between RNA and DNA in the nucleus, including MARGI-seq (Mapping RNA-genome interactions), ChAR-seq (Chromatin-Associated RNA sequencing), and GRID-seq (Global RNA Interactions with DNA) [11,12,13]. The idea behind these methods is to ligate chromatin-associated RNAs to their target genomic sequences by proximity ligation through a linker, thus forming RNA-DNA chimeric fragments. These techniques revealed hundreds of chromatin-associated RNAs, including previously known lncRNAs and a large set of non-coding RNAs that are bound to active promoters, enhancers, and super-enhancers in a tissue-specific manner [11,12,13]. In principle, all-to-all mapping can discover all RNA-chromatin interactions; in practice, the coverage depends on the ligation efficiency, the distance between the RNA and the genomic DNA, and the abundance of the RNAs.
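To make the chimeric-fragment idea concrete, the following Python sketch splits a sequenced read into its RNA-derived and DNA-derived parts. It is not the MARGI-seq, ChAR-seq, or GRID-seq pipeline: the linker sequence is hypothetical, the RNA-linker-DNA orientation is assumed, and the linker is matched exactly, whereas real pipelines tolerate mismatches and use each protocol's own linker design.

```python
from typing import Optional, Tuple

# Hypothetical linker sequence, for illustration only.
LINKER = "GTTGGATTCAACGG"

def split_chimeric_read(read: str, linker: str = LINKER,
                        min_len: int = 5) -> Optional[Tuple[str, str]]:
    """Split a read of the assumed form RNA-linker-DNA into (rna, dna).

    Returns None if the linker is absent or if either side is shorter
    than min_len, because such a read cannot be mapped to both partners.
    """
    pos = read.find(linker)
    if pos == -1:
        return None
    rna_part = read[:pos]
    dna_part = read[pos + len(linker):]
    if len(rna_part) < min_len or len(dna_part) < min_len:
        return None
    return rna_part, dna_part

reads = [
    "ACGTACGTAC" + LINKER + "TTGACCGGTA",  # usable chimera
    "ACGTACGTACGTACGT",                    # no linker
    "ACG" + LINKER + "TTGACCGGTA",         # RNA side too short
]
pairs = [split_chimeric_read(r) for r in reads]
```

Each surviving pair would then be aligned separately, the RNA part to the transcriptome and the DNA part to the genome, to give one RNA-chromatin contact.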
As discussed above, long non-coding RNAs may serve as scaffolds that bring two or more distant DNA loci into spatial proximity. To understand how a specific RNA interacts with chromatin in 3D space, Mumbach et al. developed a method named HiChIRP, which combines Hi-C and ChIRP. They incorporated azido-CTP into chromatin contacts, captured RNA-chromatin complexes using biotinylated probes, used copper-free dibenzocyclooctyne (DIBO) ‘CLICK’ chemistry to covalently conjugate a biotin for subsequent enrichment of the contacts, and sequenced the DNA contacts. They performed HiChIRP on 7SK small nuclear RNA (snRNA), telomerase RNA component (TERC), and lincRNA-EPS. They found that thousands of loops were enriched in 7SK HiChIRP, some of them at promoters and active regulatory elements. They also showed that TERC is associated not only with loops formed between telomeres but also with enhancer-promoter loops at some oncogenes, implying a role for TERC outside of telomeres. Their results therefore provide insights into how lncRNAs mediate inter- or intra-chromosomal looping.
Methods for RNA-protein interactions
RNA binding proteins (RBPs) are associated with a large number of human disorders, such as autoimmune and neurologic diseases [82,83,84]. Notable examples include the Fragile X mental retardation protein (FMRP) in Fragile X syndrome, the neuron-specific Nova and Hu proteins in paraneoplastic neurologic degenerations, and the small nuclear ribonucleoproteins (snRNPs) in systemic lupus erythematosus. To identify RNAs that interact with these proteins in vivo, Ule et al. combined UV cross-linking with immunoprecipitation (CLIP) to pull down RNA-protein complexes (Table 1 and Fig. 4), after which the captured RNAs are sequenced. CLIP requires reverse transcriptase to read through the residual amino acids that remain covalently attached to the RNA at the cross-link site, but cDNAs tend to truncate prematurely, immediately before the cross-linked nucleotide. This problem was later resolved by self-circularizing the truncated cDNAs so that they can be PCR-amplified, giving rise to individual-nucleotide resolution UV cross-linking and immunoprecipitation (iCLIP), which maps protein-RNA interactions precisely. Because RNA-protein crosslinking by 254 nm UV light in CLIP is inefficient, Tuschl and colleagues developed PAR-CLIP (photoactivatable ribonucleoside-enhanced crosslinking and immunoprecipitation), which improves the crosslinking efficiency by incorporating 4-thiouridine (4SU) into nascent RNA transcripts in living cells. The cells are then irradiated with 365 nm UV light to induce efficient crosslinking of the photoreactive nucleoside (4SU)-labeled RNAs to their interacting proteins (Table 1). PAR-CLIP has been applied to identify the transcriptome-wide binding sites of several RBPs and microRNA-containing ribonucleoprotein complexes.
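Because truncated cDNAs end immediately before the cross-linked nucleotide, an iCLIP-style analysis can infer the cross-link site from where each read begins. The Python sketch below illustrates that inference under stated assumptions: reads are given as 0-based, half-open genomic intervals, each read is reported on the strand of the RNA it derives from, and the function names are hypothetical rather than taken from any iCLIP software.

```python
from collections import Counter
from typing import Iterable, Tuple

def crosslink_site(start: int, end: int, strand: str) -> int:
    """Infer the cross-linked nucleotide from one aligned read.

    [start, end) is the aligned interval. The cDNA truncates one
    nucleotide short of the cross-link, so the site lies immediately
    upstream of the read's 5' end in the direction of the RNA.
    """
    if strand == "+":
        return start - 1
    if strand == "-":
        return end
    raise ValueError("strand must be '+' or '-'")

def crosslink_counts(reads: Iterable[Tuple[str, int, int, str]]) -> Counter:
    """Count truncation events per (chromosome, position, strand)."""
    counts = Counter()
    for chrom, start, end, strand in reads:
        counts[(chrom, crosslink_site(start, end, strand), strand)] += 1
    return counts

reads = [
    ("chr1", 100, 130, "+"),
    ("chr1", 100, 125, "+"),  # different length, same truncation point
    ("chr1", 200, 240, "-"),
]
counts = crosslink_counts(reads)
```

Positions supported by many independent truncation events are the candidate protein contact sites.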
Recently, researchers have developed methods for pulling down RNA-protein complexes with antisense oligos that are complementary to the targeted RNA sequence. This type of capture is therefore not limited by the need for antibodies against the proteins of interest. In 2015, three groups established RNA-centric capture methods, namely iDRiP-MS (identification of direct RNA interacting proteins), RAP-MS (RNA antisense purification coupled with mass spectrometry), and ChIRP-MS (comprehensive identification of RNA-binding proteins by mass spectrometry), to determine the proteins that interact with Xist RNA [18,19,20] (Table 1 and Fig. 4). In these three studies, cells were first crosslinked to preserve RNA-protein interactions, and RNA-protein complexes were then purified with biotinylated antisense oligos under highly denaturing conditions. ChIRP-MS uses formaldehyde to crosslink RNA-protein complexes, whereas RAP-MS and iDRiP-MS use UV light (Table 1 and Fig. 4). Given that UV light is a short-range crosslinker, the RNA-protein interactions revealed by RAP-MS and iDRiP-MS tend to be direct. In contrast, formaldehyde crosslinking fixes much larger macromolecular networks and generally identifies both direct and indirect factors.
To identify all RNA-binding proteins (RBPs), RBR-ID (proteomic identification of RNA-binding regions) introduces 4SU (4-thiouridine) into RNAs, crosslinks RNA to proteins with UV light, and compares the mass spectra of 4SU-treated and untreated samples. An RNA-crosslinked peptide has a different mass, so the signal intensity of the unmodified peptide is generally lower in the crosslinked sample than in the non-crosslinked sample (Table 1 and Fig. 4). Using RBR-ID, about 800 proteins, including both previously known and previously unknown RNA-binding proteins and chromatin factors such as CTCF, ATRX, HDAC1, DNMT3, EZH2, TET1, TET2, and HP1, were identified as RBPs in mouse embryonic stem cells.
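The comparison at the heart of RBR-ID, a peptide whose signal drops in the 4SU-treated sample, can be illustrated with a deliberately simplified Python sketch. The peptide names, intensity values, pseudocount, and threshold below are invented for the example, and the calculation uses only a log2 ratio of mean intensities; it should not be read as the scoring scheme of the published method.

```python
import math
from statistics import mean

def log2_depletion(plus_4su, minus_4su, pseudocount=1.0):
    """log2 ratio of mean peptide intensity, 4SU-treated over untreated.

    Negative values mean the unmodified peptide is depleted after
    crosslinking, as expected for an RNA-contacting region.
    """
    return math.log2((mean(plus_4su) + pseudocount) /
                     (mean(minus_4su) + pseudocount))

# Made-up replicate intensities: (with 4SU, without 4SU).
peptides = {
    "PEPTIDE_A": ([250.0, 240.0, 262.0], [1000.0, 1010.0, 990.0]),
    "PEPTIDE_B": ([980.0, 1020.0, 1000.0], [1000.0, 1010.0, 990.0]),
}

THRESHOLD = -1.0  # illustrative cut-off: at least a two-fold drop
candidates = [name for name, (plus, minus) in peptides.items()
              if log2_depletion(plus, minus) <= THRESHOLD]
```

A real analysis would also test whether the depletion is statistically consistent across replicates before calling a region RNA-binding.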
Methods for RNA-RNA interactions
Unlike DNA, RNA is single-stranded and can fold into 3D structures that range from simple helical elements to complex tertiary structures and quaternary ribonucleoprotein assemblies. Changes in RNA structure in response to cellular conditions directly affect RNA function. Recently, high-throughput techniques that combine nuclease digestion [22, 23, 86] or chemical probing [24,25,26, 28] with next-generation sequencing have revealed the single-stranded and double-stranded regions of RNA molecules. FragSeq (fragmentation sequencing) uses nuclease P1, which specifically cleaves single-stranded nucleic acids (Table 1 and Fig. 5). PARS (parallel analysis of RNA structure) maps RNA structure by digesting RNA with RNase V1 or S1 nuclease to determine the double-stranded or single-stranded regions, respectively. In SHAPE-Seq (selective 2′-hydroxyl acylation analyzed by primer extension sequencing) [24,25,26], RNAs are treated with chemical probes (such as 1M7) that covalently modify the RNA in loops and bulges, thus blocking reverse transcription at the modified sites. However, because these methods rely on in vitro transcription and in vitro folding, they may not reflect RNA structures in vivo.
To profile genome-wide RNA structures in vivo, Ding et al. developed a method called DMS-seq (Table 1 and Fig. 5). In DMS-seq, cells are treated with dimethyl sulfate (DMS), which methylates unprotected adenines (As) and cytosines (Cs) of RNA, so that reverse transcriptase stalls one nucleotide before DMS-modified As and Cs during cDNA synthesis. A year later, Chang’s group developed icSHAPE (in vivo click selective 2′-hydroxyl acylation and profiling experiment) to probe RNA secondary structures in living cells for all four bases. They used a cell-permeable SHAPE reagent, NAI-N3, which is NAI (2-methylnicotinic acid imidazolide) carrying an added azide group, to label flexible regions of RNA. After RNA isolation, a biotin moiety is selectively added to NAI-N3-modified RNA by copper-free click chemistry, allowing biotin-streptavidin purification of the modified RNA after fragmentation. By comparing in vivo and in vitro icSHAPE, they observed that all RNAs are less folded in vivo, suggesting that RNA structures largely depend on the intracellular environment. Moreover, they found that regulatory RNAs, such as lncRNAs and primary microRNA (miRNA) precursors, preserve their structures better than mRNAs in vivo. To selectively enrich specific RNA molecules, COMRADES (cross-linking of matched RNAs and deep sequencing) combines RNA capture and CLICK chemistry to probe RNA structures and RNA-RNA interactions. In COMRADES, cells are crosslinked with azide-modified psoralen, the RNA of interest is captured using biotinylated probes, and a copper-free click-chemistry reaction then links a biotin moiety to the cross-linked regions, allowing a second selection of the cross-linked regions for sequencing (Table 1 and Fig. 5).
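The DMS-seq readout described above, in which reverse transcriptase stops one nucleotide before a methylated A or C, translates into a simple per-position profile. The Python sketch below is a minimal illustration under stated assumptions: reads are aligned to a single transcript, each read is summarized by the 0-based transcript position of its 5' end in the sense of the RNA, and the normalization (fraction of all informative stops) is a choice made for the example rather than the published analysis.

```python
from collections import Counter

def dms_stop_profile(transcript: str, read_starts) -> dict:
    """Fraction of informative RT stops assigned to each A or C.

    A read starting at position p implies a modified base at p - 1.
    Stops that point to G or U, or that fall before the transcript
    start, are ignored because DMS-seq reports on As and Cs only.
    """
    counts = Counter()
    for start in read_starts:
        pos = start - 1
        if pos >= 0 and transcript[pos] in "AC":
            counts[pos] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pos: n / total for pos, n in sorted(counts.items())}

# Toy transcript (0-based indices): G0 G1 A2 C3 U4 U5 A6 C7 G8 A9
transcript = "GGACUUACGA"
read_starts = [3, 3, 4, 7, 5, 0]
profile = dms_stop_profile(transcript, read_starts)
```

Positions with a high share of stops are candidate unpaired or unprotected nucleotides; comparing against an untreated control would be needed to remove background stops.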
In addition, an RNA molecule can base-pair with other RNA molecules to form RNA duplexes (such as a miRNA and its target RNA) that are bound by RBPs. To identify miRNA targets, Tollervey et al. developed a method named CLASH (crosslinking, ligation, and sequencing of hybrids) [30, 31] and used it to identify RNA duplexes that interact with human AGO1 (Table 1 and Fig. 5). In CLASH, RNA-protein complexes are UV-crosslinked and purified with antibodies against the RNA binding protein, the two strands of each RNA-RNA hybrid are then ligated, and the resulting chimeric RNAs are deep sequenced. Similarly, hiCLIP can also map RNA-RNA duplexes bound by RBPs. In hiCLIP, a linker-adapter is introduced at the ligation step to improve the ligation efficiency of RNA-RNA duplexes (Table 1 and Fig. 5). Moreover, ligation of the two RNAs into a chimera has been applied in other methods, including MARIO (Mapping RNA interactome in vivo), PARIS, and SPLASH, which map all RNA-RNA interactions (RNA structures and interactome) in living cells.
R-loops in gene regulation and genomic instability
R-loops are three-stranded nucleic acid structures in which a strand of RNA hybridizes with one strand of DNA while the other DNA strand loops out. The R-loop structure was first described in 1976, when Thomas et al. showed that RNA could hybridize to double-stranded DNA in the presence of 70% formamide in vitro. A year later, Roberts and Sharp used the R-loop hybridization technique to map an adenovirus 2 (Ad2) mRNA to its genome and found that some DNA sequences of the Ad2 coding gene did not hybridize with the mature RNA, suggesting that the Ad2 genome contains non-coding sequences, later known as introns [88, 89]. Recently, genome-wide studies have shown that R-loops exist in vivo and are especially enriched in promoter regions. Ginno et al. demonstrated that R-loop formation is involved in gene regulation through its potential to protect DNA from methylation. Moreover, recent reports showed that the antisense lncRNA TARID (TCF21 antisense RNA inducing promoter demethylation) forms an R-loop at the TCF21 promoter, thus facilitating GADD45A binding, local DNA demethylation, and TCF21 expression [90, 91]. Another example is the lncRNA GATA3-AS1, which forms an R-loop structure with the central intron of GATA3-AS1 and tethers the MLL H3K4 methyltransferase to the GATA3 gene locus, thereby regulating GATA3 expression. Furthermore, it has recently been proposed that R-loops act as intrinsic Pol II promoters that induce lncRNA transcription. Depleting R-loops by overexpressing RNase H1 selectively reduces antisense lncRNA transcription.
It is generally believed that R-loops form in cis during transcription, when a nascent RNA hybridizes to the DNA template behind the moving RNA polymerase. However, research in yeast suggests that RNA:DNA hybrids can also form in trans, meaning that an RNA transcribed at one locus hybridizes with homologous DNA at another locus, and that these hybrids are likely involved in homologous recombination. Moreover, several studies have shown that genome instability arises from lesions generated by R-loop formation [95, 96]. Because transcription and replication share a common DNA template, replication forks that encounter the transcription machinery cause transcription-replication collisions that lead to DNA damage. Persistent RNA:DNA hybrids can therefore promote genomic rearrangements.
Methods for mapping R-loops
To map R-loops on a genome-wide scale, Ginno et al. developed a method named DRIP-seq (DNA-RNA immunoprecipitation coupled to high-throughput sequencing), which uses an antibody (S9.6) to specifically purify RNA:DNA hybrids (Table 1 and Fig. 6); the captured DNA fragments are then sequenced. Conventional DRIP-seq generally produces a robust signal but has limited (approximately kilobase) resolution, a higher background, and no strand specificity. Another method, DRIPc-seq (DNA:RNA immunoprecipitation followed by cDNA conversion coupled to high-throughput sequencing), builds on DRIP but performs strand-specific RNA sequencing to profile R-loops. DRIPc-seq increases the resolution of R-loop profiling and provides strand specificity. However, the ability of DRIP to reveal authentic in vivo R-loops has been questioned, because the immunoprecipitation is usually performed on isolated genomic DNA without any crosslinking, so R-loops could be disrupted or formed in vitro after cell lysis. Dumelie et al. developed a method named bisDRIP-seq (bisulfite-based DRIP-seq), which selectively labels the single-stranded DNA that loops out from an R-loop. They used bisulfite, applied at the time of cell lysis, to convert cytosine residues into uracil residues within genomic regions that contain single-stranded DNA. Their results showed that bisDRIP-seq can map R-loops at near-nucleotide resolution and detect the boundaries of R-loops. In contrast to DRIP, R-ChIP (RNase H chromatin immunoprecipitation) maps R-loops using RNase H, which specifically recognizes DNA:RNA hybrids in vivo. By expressing a catalytically dead enzyme, R-ChIP captures R-loops with a standard ChIP-seq protocol, which involves both fixation to stabilize the R-loop/RNase H complex and sonication to increase the resolution (Table 1 and Fig. 6).
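The bisulfite readout used by bisDRIP-seq can be illustrated by scoring, at each reference cytosine, how often reads show a T (the sequencing readout of a converted C). The Python sketch below is a toy version under stated assumptions: reads are aligned without gaps, they are on the same strand as the reference, each read is given as an offset plus a sequence, and the function name is hypothetical.

```python
def conversion_fraction(reference: str, reads) -> dict:
    """Per-cytosine C-to-T conversion fraction.

    reads is a list of (offset, sequence) pairs aligned gaplessly to
    the reference. Only reference C positions covered by at least one
    read are reported.
    """
    converted = [0] * len(reference)
    covered = [0] * len(reference)
    for offset, seq in reads:
        for i, base in enumerate(seq):
            pos = offset + i
            if reference[pos] != "C":
                continue
            covered[pos] += 1
            if base == "T":
                converted[pos] += 1
    return {pos: converted[pos] / covered[pos]
            for pos in range(len(reference)) if covered[pos]}

# Toy reference (0-based indices): A0 C1 G2 T3 C4 C5 A6 T7 C8 G9
reference = "ACGTCCATCG"
reads = [
    (0, "ACGTTTATCG"),  # C4 and C5 converted
    (0, "ACGTTCATCG"),  # only C4 converted
]
fractions = conversion_fraction(reference, reads)
```

A run of highly converted cytosines flanked by unconverted ones would mark a single-stranded stretch, which is how the boundaries of the displaced strand could be estimated in this simplified picture.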
lncRNAs mediate phase separation of membrane-less organelles
Cellular organelles such as mitochondria and the Golgi apparatus are enclosed by lipid bilayer membranes, which help form compartments and separate biological processes within a cell. In contrast, membraneless structures are formed through a process known as liquid-liquid phase separation (LLPS) and are made of RNA-protein droplets. One of the most prominent membraneless structures in the nucleus is the nucleolus, which produces ribosomal RNA and consists of a variety of proteins and RNAs. Studies have shown that rRNA transcription is important for nucleolar assembly [50, 51, 99, 100]. Other membraneless structures, such as paraspeckles, Cajal bodies (CBs), histone locus bodies (HLBs), and promyelocytic leukemia (PML) bodies, are also found in the nucleoplasm as phase-separated-like droplets (Fig. 7), whereas stress granules and processing bodies are RNA granules in the cytoplasm. DNA is typically absent from the interior of these liquid-like droplets, whereas lncRNAs serve as scaffolds for their formation and maintenance [101,102,103].
How do RNAs promote phase separation synergistically with protein-protein interactions? In vitro studies reported that RNA repeats, such as trinucleotide repeats and G-quadruplexes, can undergo a phase transition to form either a condensed liquid or a gel-like state through multivalent base-pairing between RNA molecules. Droplet-like assemblies of RNA are associated with certain neurodegenerative diseases, including Huntington disease, muscular dystrophy, and amyotrophic lateral sclerosis. It has been proposed that such gel-like RNA foci may contribute to neuronal dysfunction in vivo. A recent study showed that RNA plays an important role in the phase behavior of prion-like RBPs, such as TDP43 and FUS, which are largely soluble in the nucleus but form solid pathological aggregates when mislocalized to the cytoplasm. In vitro studies indicated that the RNA/protein ratio is important for droplet formation and phase separation. Remarkably, reducing nuclear RNAs or disrupting RNA binding leads to excessive phase separation and the formation of cytosolic assemblies in cells, suggesting that the higher RNA concentration in the nucleus acts as a buffer that prevents the RBP aggregation seen in the cytoplasm. This raises the question of how the RNA/protein ratio is regulated in such droplets. Hondele et al. recently reported that RNA flux into and out of phase-separated organelles is controlled by RNA-dependent DEAD-box ATPases (DDXs), which contain low-complexity domains (LCDs) that are crucial for the formation of multivalent meshworks within membraneless droplets. In addition, Ries et al. showed that methylation of mRNAs triggers phase separation of endogenous compartments, such as P-bodies, stress granules, or neuronal RNA granules. These studies indicate that the abundance of nuclear RNAs and the modifications of RNA contribute to the dynamics of such membraneless organelles. However, several intriguing questions remain open.
How do RNA structures affect phase separation? How are RNA-mediated droplets involved in chromatin organization and gene regulation? What signals trigger the reorganization of such structures?
The Human Genome and ENCODE Projects have shown that more than 90% of the genome is transcribed [1, 72] into non-coding RNAs. However, the functions of most non-coding RNAs remain largely unknown. In the last decade, new techniques combined with high-throughput sequencing have profiled the interactions between DNA, RNA, and proteins, thus facilitating the study of lncRNA functions in cells. LncRNAs can act as a guide, a decoy, or a scaffold for protein complexes to mediate epigenetic regulation. Chromatin-associated lncRNAs are involved in nuclear architecture and chromatin conformation. Given that lncRNAs are long and mobile, they could serve as bridges that mediate chromatin looping and drive inter- or intra-chromosomal interactions [44, 45]. Moreover, RNA can hybridize with DNA to form R-loops, which have been found to contribute to gene regulation and genomic instability. RNA also mediates liquid-liquid phase separation by acting as a multivalent scaffold for the binding of RBPs, thus regulating the sizes and dynamics of the membraneless organelles in which biological processes take place. We have only begun to scratch the surface of the lncRNA world: much remains unknown, and plenty of work remains to be done. Because RNA molecules have specific sequences, it is feasible to target them therapeutically by blocking their actions with antisense oligos, RNA interference, or aptamers. We hope that a better understanding of the mechanisms of lncRNA action will make RNA-centric therapies potential options for treating human diseases.
Abbreviations
- lncRNAs: Long non-coding RNAs
- CTCF: CCCTC-binding transcription factor
- TADs: Topologically associated domains
- RBPs: RNA binding proteins
- IGF2: Insulin-like growth factor 2
- ATRX: Alpha Thalassemia/Mental Retardation Syndrome X-Linked
- HDAC1: Histone Deacetylase 1
- DNMT3: DNA methyltransferase 3
- EZH2: Enhancer Of Zeste Homolog 2
- TET1: Ten-eleven translocation methylcytosine dioxygenase 1
- TET2: Ten-eleven translocation methylcytosine dioxygenase 2
- HP1: Heterochromatin Protein 1
- PRC1: Polycomb Repressive Complex 1
- PRC2: Polycomb Repressive Complex 2
- AGO1: Argonaute RISC Component 1
- 1M7: 1-methyl-7-nitroisatoic anhydride
- GADD45A: Growth arrest and DNA damage inducible alpha
- GATA3: GATA binding protein 3
We thank Tse-Chun Kuo and Hsing-Jien Kung for inviting this review article. We apologize to colleagues whose work could not be cited here because of space constraints.
This work was supported by the Ministry of Science and Technology (MOST) grants 106-2311-B-002-039-MY2 (H-P.C.) and 108-2628-B-002-004 (H-P.C.), and by National Taiwan University grants NTU-CDP-108L7732, NTU-AS-108L104305, and NTU-CC-107L892003.
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Cite this article
Guh, CY., Hsieh, YH. & Chu, HP. Functions and properties of nuclear lncRNAs—from systematically mapping the interactomes of lncRNAs. J Biomed Sci 27, 44 (2020). https://doi.org/10.1186/s12929-020-00640-3
- Long non-coding RNA
- Nuclear architecture
- Phase separation