Intermolecular base stacking mediates RNA-RNA interaction in a crystal structure of the RNA chaperone Hfq

Schulz, Eike C.; Seiler, Markus; Zuliani, Cecilia; Voigt, Franka; Rybin, Vladimir; Pogenberg, Vivian; Mücke, Norbert; Wilmanns, Matthias; Gibson, Toby J.; Barabas, Orsolya

doi:10.1038/s41598-017-10085-8

Download PDF

Article
Open access
Published: 29 August 2017

Intermolecular base stacking mediates RNA-RNA interaction in a crystal structure of the RNA chaperone Hfq

Eike C. Schulz^1,2^nAff5,
Markus Seiler¹^nAff6,
Cecilia Zuliani¹,
Franka Voigt¹^nAff7,
Vladimir Rybin³,
Vivian Pogenberg ORCID: orcid.org/0000-0002-1021-6804²,
Norbert Mücke⁴,
Matthias Wilmanns²,
Toby J. Gibson¹ &
…
Orsolya Barabas¹

Scientific Reports volume 7, Article number: 9903 (2017) Cite this article

4004 Accesses
11 Citations
Metrics details

Subjects

Abstract

The RNA-chaperone Hfq catalyses the annealing of bacterial small RNAs (sRNAs) with target mRNAs to regulate gene expression in response to environmental stimuli. Hfq acts on a diverse set of sRNA-mRNA pairs using a variety of different molecular mechanisms. Here, we present an unusual crystal structure showing two Hfq-RNA complexes interacting via their bound RNA molecules. The structure contains two Hfq₆:A₁₈ RNA assemblies positioned face-to-face, with the RNA molecules turned towards each other and connected via interdigitating base stacking interactions at the center. Biochemical data further confirm the observed interaction, and indicate that RNA-mediated contacts occur between Hfq-RNA complexes with various (ARN)_X motif containing RNA sequences in vitro, including the stress response regulator OxyS and its target, fhlA. A systematic computational survey also shows that phylogenetically conserved (ARN)_X motifs are present in a subset of sRNAs, some of which share similar modular architectures. We hypothesise that Hfq can co-opt RNA-RNA base stacking, an unanticipated structural trick, to promote the interaction of (ARN)_X motif containing sRNAs with target mRNAs on a “speed-dating” fashion, thereby supporting their regulatory function.

RNA compaction and iterative scanning for small RNA targets by the Hfq chaperone

Article Open access 07 March 2024

Diversity of bacterial small RNAs drives competitive strategies for a mutual chaperone

Article Open access 04 May 2022

RNA binding of Hfq monomers promotes RelA-mediated hexamerization in a limiting Hfq environment

Article Open access 21 April 2021

Introduction

Non-coding RNAs play key roles in regulating gene expression in all domains of life. In bacteria, sRNAs control almost every aspect of bacterial physiology including metabolism, quorum sensing, and virulence^1,2,3,4. During stress and environmental changes, sRNAs orchestrate a complex and dynamic response, allowing the bacteria to rapidly adapt to new conditions. Thus, they play critical roles in the lifestyle switching of bacteria that are able to inhabit variable environments, as well as during infection and disease^5,6,7,8,9,10.

Bacterial sRNAs are ~50–300 nucleotides long, and act by modulating the stability and translation of diverse mRNAs. They are expressed mostly independently from their mRNA targets and many of them can simultaneously act on several different mRNAs^11,12,13, regulating the translation of all of these with specificity and precision. Target recognition was shown to be generally initiated at short complementary ‘seed’ regions in the two RNAs^{14, 15} and for most sRNA-mRNA pairs it is critically dependent on the RNA-chaperoning protein Hfq^{7, 16,17,18,19}. Consequently, bacteria with mutations in the hfq gene show reduced virulence and reduced adaptation potential^{16, 18}.

The Hfq protein is a homo-hexameric ring-shaped RNA-binding protein of the Sm/LSm family that has several distinct ways of interacting with RNA^20,21,22. In sRNAs, Hfq was shown to preferentially bind 3′ to seed regions, whereas it interacts 5′ to sRNA-target regions in mRNAs²³. Furthermore, Rho-independent terminators display a universal recognition motif for Hfq²³. The Hfq hexamer (Hfq₆) has three distinct RNA binding sites referred to as ‘proximal’, ‘distal’ and ‘lateral’ (reviewed in refs 19, 22 and 24). In addition, its flexible C-terminal tail can contribute to binding and regulation of some RNAs^25,26,27,28. The ‘lateral’ binding site is located on the rim of the Hfq₆ ring and has accessory roles in RNA binding, with a preference for UA-rich sequences^{27, 29,30,31}. The ‘proximal’ site on one face of the ring preferentially binds to U-rich RNA sequences, such as the poly(U) tracts present at the 3′ termini of most sRNAs^{32, 33}. At these poly(U) tails, Hfq also directly interacts with the free 3′-OH group, which helps trigger a constricted RNA conformation required for efficient sRNA binding and recognition³². The ‘distal’ site is located on the opposite face of the ring and has high affinity to A-rich sequences, which are commonly found in the 5′ untranslated regions (UTR) of mRNAs^{19, 34}. Crystal structures revealed that the distal site of each Hfq subunit can accommodate a triplet of RNA nucleotides (ARN or AAN) with differing specificities: the A-site binds specifically adenines, the R-site can accommodate both adenine and guanine with preference for A, while the third base points away from Hfq towards the solvent and can be any nucleotide (N)^{26, 35}. Six such sites come together in the hexamer to form a circular binding site accommodating an 18nt long A-rich RNA segment³⁶. In agreement, genomic SELEX experiments revealed a specific enrichment of A-rich sequences among Hfq-bound RNAs and in vivo UV-crosslinking demonstrated that Hfq specifically binds to repeated ARN triplets (referred to as (ARN)_X motifs) in the 5′-UTR of mRNAs^{34, 37}.

However, Hfq-RNA binding is not restricted to a single binding site. Recent reports indicate that Hfq-RNA interactions can simultaneously involve multiple sites on the RNA and/or the protein^{21, 30, 38, 39}. A remarkable example is demonstrated in the crystal structure of the Hfq-RydC complex. Here, the 3′ U-rich tail of the sRNA binds to the proximal face of Hfq, while the 5′ end binds to the lateral surface. In addition, the external part of the rim as well as Hfq’s intrinsically disordered C-terminal tail are involved in contacts with RydC²⁷. Furthermore, whereas most sRNAs primarily bind to Hfq’s proximal site, some also contain (ARN)_X-like motifs, which contribute to their stability and can bind to the distal site of Hfq^{21, 40,41,42,43}. One prominent example is the oxidative stress response regulator, OxyS that is induced upon oxidative stress and acts on multiple mRNAs to fine-tune the expression of various stress response pathways^{5, 44, 45}. In addition to proximal site binding regions^{39, 43}, OxyS contains an extended (ARN)_X motif (positions 59–86) that is essential for its regulatory function, and biochemical studies and crystal structures have shown that it binds to Hfq’s distal site^{42, 46}.

To catalyse the annealing of diverse sets of RNA pairs, Hfq has been shown to employ a variety of mechanistic strategies (reviewed in refs 19, 24 and 47). For example, Hfq binding can reduce RNA motility and flexibility, which increases the chance for two RNA molecules to meet and the on-rate of their interaction. Hfq can also alter RNA secondary structure and thereby expose complementary regions in sRNAs and mRNAs, enabling their pairing or helping to form more stable s/mRNA pairs compared to the ones formed spontaneously. In addition, the distinct specificities of Hfq’s proximal and distal binding sites allow sRNAs and mRNAs to bind simultaneously to opposite faces of a single Hfq hexamer, which increases their local concentration and facilitates annealing. Arginine-rich patches along the rim of the protein are proposed to guide and catalyse base pairing between complementary strands^{29, 30}. Moreover, the repetitive binding surfaces of the Hfq hexamer can accommodate multiple RNA molecules on the same surface, which was proposed to enable cycling of different RNA substrates on the ring and facilitate RNA release and turnover⁴⁸. RNA turnover is further supported by Hfq’s C-terminal tail that helps displace RNA duplexes from the core binding sites⁴⁹. Finally, Hfq can also interact with various proteins involved in RNA metabolism and translation, which help to mediate its function^{50, 51}. It appears that Hfq uses different mechanisms to catalyse annealing depending on the exact sRNA-mRNA pair, and the variety of the documented, partially complementary, mechanistic pathways enables this global ribo-regulator to act on many sRNA substrates and mRNA targets rapidly and accurately in the crowded milieu of the cell^{41, 48, 50,51,52,53,54}. Nevertheless, the exact mechanisms of pairing remain incompletely understood for many sRNA-mRNA target pairs.

Here, we present a crystal structure of Escherichia coli Hfq in complex with A₁₈ RNA that shows an unanticipated quaternary architecture with two Hfq₆:A₁₈ assemblies interacting via their RNA molecules. Remarkably, the RNA molecules are held together by base stacking of every third base, the N bases of the (ARN)_X motif, that are flipped out by Hfq. Consistent with the structure, biochemical data with RNA probes that lack the base at the N-site and a systematic computational survey support the notion that base stacking of the N-site bases can help mediate RNA-RNA interaction between Hfq-bound (ARN)_X motif-containing RNA molecules. We hypothesize that Hfq co-opts the N-site bases to initiate low-affinity interactions between RNA substrates so as to facilitate their partner search, adding yet another tool to the toolbox of this versatile RNA chaperone.

Results

Crystal structure of an Hfq-A₁₈ RNA complex shows base stacking between two Hfq-bound RNA molecules

Several crystal structures of Hfq have been described previously alone or in complex with various RNA substrates^{22, 27, 32, 35, 46, 55, 56}. These data revealed how Hfq recognizes various RNA molecules and suggested mechanistic models for their annealing, however the structural basis of Hfq-mediated RNA-RNA interaction remains incompletely understood. Here, we present the crystal structure of an Hfq₆-A₁₈ RNA complex at 2.5 Å resolution (Fig. 1 and Table 1) that reveals an unanticipated quaternary structure. The crystals resulted from an experiment aimed at co-crystallizing Escherichia coli Hfq72 (containing amino acids 1–72) with A₃₀ RNA and poly(A)-polymerase 1, but they contain only Hfq72 and an 18 nucleotide long poly(A) RNA segment. Hfq72, that lacks most of the intrinsically disordered C-terminal tail^{25, 57}, was used to facilitate crystallization. Recent data showed that deletion of the highly variable C-terminus^{25, 58} has no effect on the affinity or annealing of A₁₈-containing RNAs and indicate that its main function is to promote RNA turnover⁴⁹. In the resulting crystal structure, the protein itself looks very similar to previously published structures^{35, 55} and only small changes can be observed (Figure S1a). Consistent with previous reports^{55, 59}, both the proximal and the distal sites of the Hfq72 hexamer are occupied with RNA (Figure S1b,c). The electron density at the proximal site is weak, probably indicating partial occupancy (Figure S1c). While this made the identification of the bases ambiguous, they were interpreted as uridines because they exhibit the shape of pyrimidine bases and Hfq is known to preferentially bind U-rich RNA at this site. Since no uridine containing RNA or nucleotides were added in the crystallization experiments, this density probably originated from the cellular lysate or from contamination in the synthetic RNA samples. At the distal face, we detect strong electron density for the A₁₈ RNA segment (Figure S1b), whereas the remaining 12 nts of the A₃₀ RNA substrate are not visible. The A₁₈ chain adopts a very similar binding geometry as previously reported, with most nucleotides in the C2′-endo configuration, the A- and R-site bases tightly bound to the surface of Hfq, and the N-site bases pointing away from the protein surface³⁵. However, unlike in previous structures, the N-site bases are not freely exposed to the solvent, instead they form interdigitating base stacking interactions (ring-to-ring distances ~3.8 Å) with a neighbouring Hfq72₆:A₁₈ complex resulting in a (Hfq72₆:A₁₈)₂ dimer (Fig. 1). In this sandwich-shaped supramolecular assembly, two A₁₈ RNA molecules are enclosed between two hexameric Hfq72 protein rings and the stacking of the N-site bases provides the glue to hold the assembly together. With respect to other Hfq-poly(A) structures, the N-site adenines are tilted only slightly - approximately by 15 degrees - towards the surface of Hfq (Figure S1a), and their stacking do not induce significant conformational changes. In addition to the base stacking, the N₁ atom of each N-site adenine makes an electrostatic interaction with a phosphate group (distance to O_2P 2.7 Å) in the sugar-phosphate backbone of the A₁₈ chain of the partner Hfq72₆:A₁₈ ring (Fig. 1b). Considering the low pH (4.2) of the crystallization solution, it is possible that the adenine base is protonated or tautomerized in the crystals, allowing a proper hydrogen bond to form between its N₁ and the phosphate oxygen of the partner RNA. These interactions stabilize the conformation of the stacked bases and the dimeric assembly. There are no direct protein-protein interactions between the Hfq72 hexamers and the dimer is held together solely by RNA-RNA stacking.

Table 1 X-ray Data Collection and Refinement Statistics.

Full size table

Base stacking between Hfq-RNA complexes in solution

To explore if the base stacking mediated dimerization observed in the crystal structure also occurs in solution, we analysed the oligomeric state of Hfq72-A₂₀ complexes by analytical ultracentrifugation (AUC). This revealed a shift in the sedimentation coefficient (s = 5.1S) compared to the RNA-free Hfq72 control (s = 3.1S), indicating that larger molecular assemblies have formed. Notably, the shift was significantly larger than expected for a simple monomeric Hfq72₆:A₂₀ complex (s = 3.9S, calculated based on our crystal structure) (Fig. 2a and Table S1) and its exact value was dependent on the concentration; it approached the values calculated for Hfq72₆:A₂₀ monomers at low complex concentrations, but increased gradually with increasing concentration (data not shown). These results indicate that Hfq72₆:A₂₀ complexes can form dimeric assemblies. The reduced s-value probably indicates a dynamic equilibrium with the monomeric species. Such dynamic oligomerization equilibrium would result in average sedimentation coefficients that are between the predicted s-values of monomers and dimers and increase with the abundance of the larger assemblies as concentration increases⁶⁰.

While AUC can provide information about the relative size of the Hfq-RNA complex, it does not reveal their intermolecular arrangement. Thus, to determine if the observed higher order complexes are arranged face to face, held together by the flipped-out N-site bases of the RNA molecules as in the crystal structure, we tested how the removal of these bases affects oligomerization in AUC. We used a synthetic A₂₀ RNA derivative, which contained an intact sugar-phosphate backbone, but every third nucleotide (the N-site equivalent) was substituted with an abasic nucleotide (‘AA0’). This ‘AA0’ RNA was able to bind to Hfq72 equally well as A₂₀ (Figures S2–3), consistent with previous observations that the N-site bases do not contribute to Hfq binding³⁵. On the other hand, removal of the N-site bases abolished formation of supramolecular assemblies in AUC experiments: the Hfq72-‘AA0’ complex sedimented as a single specie with a sedimentation coefficient consistent with a monomeric Hfq72₆:‘AA0’ assembly (measured s = 3.8 S to be compared with the expected s = 3.9 S), and no shift to larger assemblies could be observed (Fig. 2a and Table S1).

Next, to further confirm Hfq₆:RNA dimerization and its dependence on the N-site bases, we performed isothermal titration calorimetry (ITC, Figure S3) and fluorescence anisotropy (FA, Figure S4) experiments. These revealed a single binding event for the Hfq72-‘AA0’ RNA interaction, while showing two consecutive binding events with A₂₀ RNA. The binding affinities measured for the first event (K_D1-ITC = 1.3 nM and K_D1-FA = 0.4 nM) are consistent with previously reported Hfq-poly(A) binding constants, as well as with the single binding constant measured for the ‘AA0’ RNA (K_D-ITC = 40 nM)³⁵. In contrast, the second binding has lower affinity (K_D2-ITC = K_D2-FA = 2.2 µM) and is only observed with A₂₀ but not with ‘AA0’. This implies that the first high-affinity association event corresponds to primary Hfq-RNA binding, while the second A₂₀-specific moderate-affinity event may represent Hfq-A₂₀ dimerization. The moderate dimerization affinity observed with the Hfq72₆:A₂₀ complex is also consistent with our size-exclusion data where the micro-molar affinity dimers cannot be observed (Figure S2), and with our AUC data showing a sedimentation coefficient slightly smaller than expected for dimers (as described above)⁶⁰.

We confirmed these results using electrophoretic mobility shift assays (EMSA) with full length Hfq (Hfq102). To avoid protein aggregation in EMSA, we used the Hfq102^R16A,R17A mutant^{30, 61, 62}. These experiments revealed two shifted bands with the A₂₀ RNA, one likely corresponding to monomeric Hfq102^R16A,R17A ₆:A₂₀ complexes and the second to a slower migrating larger species. Consistent with the AUC, ITC and FA data, the slower migrating (‘super-shifted’) band was greatly reduced in the Hfq102^R16A,R17A-‘AA0’ complexes (Fig. 2b).

Finally, to further explore the impact of the N-site bases on Hfq:RNA oligomerization, we performed EMSA experiments with A₂₀ variants, where every N-site base was replaced with G, C or U (‘AAG’, ‘AAC’ and ‘AAU’ derivative). Since base stacking can occur with any base, we predicted that dimers can form with diverse RNA sequences, but their affinity might differ depending on the base stacking efficiencies of different bases⁶³. Consistently, we observed significant amount of supershift with ‘AAG’ that contains strongly stacking purine bases at the N-sites, but detected smaller amount of larger assemblies with pyrimidine bases as in ‘AAC’ and ‘AAU’ (Fig. 2b,c). Interestingly, the supershifted band was practically absent with C at the N-site, consistent with its lowest base stacking efficiency⁶³. The observed selectivity might also be supported by electrostatic or hydrogen bonding interaction between the base and the phosphate group of the partner RNA as seen in our crystal structure: guanine can naturally form a strong hydrogen bond at the N₁ position, whereas pyrimidines might not suitably reach the partner phosphate backbone.

Together, these data indicate that RNA-mediated Hfq-RNA dimers form in solution and their assembly requires the flipped-out N-site bases. While the biophysical data cannot directly reveal the exact architecture of the detected supramolecular assemblies, the results are in perfect agreement with our crystal structure. Especially, the peculiar dependence of the interaction on the N-site bases is uniquely explained by the structural data, whereas absence of these bases would not be expected to affect other Hfq-RNA assemblies.

Base stacking brings together (ARN)_X motifs from OxyS and fhlA

Our structural and biochemical data imply that Hfq can mediate RNA-RNA interactions via base stacking between A-rich RNA sequences. To test if this interaction can occur with physiological sRNAs and target mRNAs, we selected the prominent sRNA-mRNA pair, OxyS and fhlA. The fhlA mRNA encodes a transcriptional activator of formate metabolism⁶⁴ that is controlled by the central oxidative stress response regulator OxyS. Both OxyS and fhlA contain A-rich (ARN)_X motifs that are essential for Hfq-binding and RNA pairing in vivo ^{42, 65}. Curiously, OxyS and fhlA share little sequence complementarity; two short (7–9nt long) complementary seed regions can be found at the tips of stable stem-loop structures in both RNAs that were proposed to interact via a “kissing complex”⁶⁶, but the mechanism of OxyS-fhlA pairing remains incompletely understood.

We synthesized oligonucleotides containing the (ARN)_X motifs from OxyS (positions 57–86) and fhlA (the complementary seed regions were excluded to circumvent interaction by base pairing; see Methods for details). To test the importance of the flipped-out N-site bases, we also created an OxyS variant, Oxy0 where the predicted N-site nucleotides were replaced with abasic linkages (as for ‘AA0’ above). Since the sequence of the OxyS (ARN)_X motif is complex and its exact binding mode on Hfq is difficult to predict from the available crystal structures with short OxyS fragments⁴⁶, we manually inspected ARN triplets in the sequence to identify the N-site bases. We focused on the previously annotated ARN region⁴², searched for two purine bases followed by a variable nucleotide and removed the base at this putative N position. The oligonucleotides were differentially labelled with fluorescent probes (Cy5, Cy3), complexed with full length Hfq (Hfq102) alone or in combinations, and their oligomeric states were analysed by AUC (Fig. 3 and Table S1). As expected, Hfq102 alone sedimented as a single hexamer (s = 3.5S) and all RNA molecules revealed a monomeric state (s = 2.0–2.1S). When the (ARN)_X segments of OxyS and fhlA were mixed without Hfq, they also sedimented as separate monomeric species (s = 2.1S) and did not pair. Remarkably, the Hfq102-fhlA and Hfq102-OxyS complexes also revealed simple monomeric Hfq102₆:RNA complex species (s = 4.5S for both) and did not self-dimerize. This was surprising because Hfq72₆:A₂₀ complexes readily dimerized by themselves in our previous experiments. In contrast, an additional faster sedimenting peak appeared for the ternary Hfq102-fhlA-OxyS complex (s = 5.9S), indicating the formation of larger molecular assemblies. Importantly, the Hfq102-fhlA-Oxy0 complex did not dimerize (s = 4.4S), again highlighting the importance of the N-site bases.

These results indicate that the Hfq-mediated interactions between the N-site bases of (ARN)_X motifs seen in our crystal structure can also occur in OxyS and fhlA.

Conserved (ARN)_X motifs are present in a number of sRNAs

The observation of an unanticipated interaction between A-rich sequences in our crystal structure and in solution, prompted us to further explore (ARN)_X motifs in Hfq-regulated sRNAs and mRNAs. An increasing body of evidence already recognizes the importance of these motifs for riboregulation in bacteria. For example, it was demonstrated that Hfq binding is specifically enriched at (ARN)_X motifs in the 5′ UTR of mRNAs in vivo ^{34, 37} and these motifs are essential for their regulation^{42, 65, 67, 68}. Several sRNAs were also shown to bind Hfq at (ARN)_X motifs that contribute to their stability^{21, 40,41,42,43}. To explore (ARN)_X motifs in sRNAs more broadly, we screened 67 experimentally confirmed sRNAs from E. coli ⁶⁹. Since existing annotations of (ARN)_X containing regions were incomplete and the constraints defining an (ARN)_X motif were not clear, we constructed an iterative bioinformatics pipeline that consists of explorative pattern searches with various pattern definitions, secondary structure inspection, and conservation analysis (Figure S5). Based on this, our final pattern described the (ARN)_X motif as the concomitant presence of at least 4 ARN triplets within a sequence window of 20 nucleotides, also allowing maximally 2 non-adjacent non-functional triplets and separated single gaps. This pattern is consistent with previous findings that stable RNA binding at Hfq’s distal site involves at least four ARN triplets^{30, 35}. All 67 sRNAs were screened with this search pattern independently of whether they are known to interact with Hfq, and the results are summarized in Table 2. From the 67 sRNAs, we have identified matches to the (ARN)_X motifs in 25 instances.

Table 2 (ARN)_X motifs found in E. coli sRNAs.

Full size table

The matching sRNAs include many known Hfq interactors and several previously documented examples of (ARN)_X motif-containing sRNAs, such as OxyS⁴² and MicM (also known as ChiX)⁴⁰, as well as several additional instances, where the role of (ARN)_X motifs has not yet been implicated. In some sRNAs, we found multiple (two or four) non-adjacent (ARN)_X motifs. Interestingly, most identified (ARN)_X motifs contained at least 5 ARN triplets, even though our pattern searches required only 4 triplets. Analysing sequence conservation within the ARN triplets, also revealed a preference for A in the R position (Figs 4 and S6–S9, and data not shown), consistent with previous structural³⁵ and tryptophan fluorescence quenching data²⁶. Of note, we did not find (ARN)_X motifs in 42 out of the 67 sRNAs tested, which include several well-studied sRNAs (e.g. RybB, DsrA, and RydC)^{27, 30, 55} that were shown to bind to the proximal and rim sites of Hfq and anneal with mRNAs bound to the distal site of the same Hfq hexamer²¹. The presence of conserved (ARN)_X motifs in a distinct subset of sRNAs suggests that these sequence elements may have specific roles in the function of these sRNAs and would merit further investigation.

Several (ARN)_x motif-containing sRNAs share common structural features

To further explore the role of (ARN)_X motifs in the above identified set of (ARN)_X motif-containing sRNAs, we analysed their secondary structure and the arrangement of known functional modules (e.g. mRNA complementary seed regions) within their sequences. From the 25 sRNAs with predicted (ARN)_X motifs, we selected 17 that - for simplicity - contain one single (ARN)_X motif (Table 2). With these, we performed secondary structure predictions using three independent thermodynamic folding simulations and mapped the position of the (ARN)_X motif relative to secondary structure elements. This showed that the (ARN)_X motifs are often flanked by predicted secondary structure elements such as stem loops on one side or both.

Interestingly, we also found that four of the analysed sRNAs (MicM, MgrR, RyjA and OhsC) closely resemble OxyS in their overall structure (Fig. 4). They all feature two stem loops tightly embracing the (ARN)_X motif in a spatial arrangement that is so conserved that the different sRNA folds can be directly superimposed. To analyse these five examples (including OxyS) further, we prepared multiple sequence alignments for the sRNAs from related bacteria, which revealed high conservation of the (ARN)_X motifs, further supporting their functional importance (Figs 4 and S6–S9). In three out of the five sRNAs, the (ARN)_X motif also overlapped with experimentally determined Hfq binding sites (J. Vogel, personal communication)^{23, 42}. Next, we mapped functionally relevant sequence regions on these five selected sRNAs. We found that the (ARN)_X motif is positioned 20–40 nts away from the 3′ poly(U) tail in all cases (Table S2). This distance appears sufficient to reach between the distal and proximal faces of Hfq, likely allowing the (ARN)_X motif and the U-rich tail to bind simultaneously to Hfq. Using complementary search algorithms, we also identified the regions in the five selected sRNAs that are complementary to their well-known target mRNAs (Table S3). We searched with nine mRNAs: fhlA, rpoS, shoB, ybfM, dpiB, eptB, rsxE, tig, and nuoG and mapped the complementary regions onto the sRNA structure. For eight out of the nine sRNA-mRNA pairs (with the exception of the OxyS-rpoS pair), complementary regions localized to stem loops flanking the two sides of the (ARN)_X motif (Figs 4 and S6–S9).

Taken together, these analyses reveal that unrelated (ARN)_X motif-containing sRNAs share a common functional architecture, with a conserved localization of (ARN)_X motifs and seed regions within an overall similar structural arrangement.

mRNA targets of (ARN)_X motif-containing sRNAs display common architectural features

fhlA was previously shown to have a modular architecture, where several short seed regions flank a bipartite (ARN)_X motif involved in Hfq binding^{65, 66}. Based on our observation that several (ARN)_X motif-containing sRNAs share a common architecture, we wondered if the mRNA targets of these RNAs also share a similar architecture. To check this, we visually located (ARN)_X motif containing regions in the respective target mRNAs (Table S4) and mapped these against sRNA complementary regions (Table S3), the ribosome-binding site (RBS), the start codon, and secondary structure elements. The resulting general pattern appears to be more complicated than for (ARN)_X motif-containing sRNAs, but a common topology of functional elements can still be observed in eight out of the nine analysed mRNAs (again excluding rpoS). In contrast to the one complete (ARN)_X motif in (ARN)_X motif-containing sRNAs, generally two shorter (ARN)_X regions were found in the 5′ UTRs of the target mRNAs (Table S4 and Figure S10). As observed before, one (ARN)_X region was typically found close to the start codon and the RBS, while the other is located further upstream (−50 to −140)^{23, 37}. In several cases (fhlA, eptB, and ybfM), the predicted (ARN)_X regions also overlapped with experimentally identified Hfq-binding sites^{23, 65}. The spacing between the two (ARN)_X regions was ~60 nts in all cases and often contained stem loops or other folded elements, suggesting that these regions may constitute two parts of a bipartite (ARN)_X motif, which could come together in space upon folding of the mRNA (data not shown)⁶⁵. In addition, common features extended to sRNA complementary regions: multiple short seed regions were found in the proximity of (ARN)_X motifs, either upstream of the first (ARN)_X region, between the two (ARN)_X regions, or downstream of the second (ARN)_X region at the beginning of the coding sequence (Figure S10). In some cases, seed regions were found overlapping with (ARN)_X regions (also observed by Tree et al.³⁷). Of note, rpoS was a clear outlier in our analysis: it contains a long complementary region with OxyS, an (ARN)_X region far upstream in the 5′UTR, and a quite different secondary structure (data not shown). However, we observed marked structural similarities in the other mRNA targets of our selected (ARN)_X motif-containing sRNAs (Figure S10).

Discussion

An increasing body of evidence indicates the functional importance of (ARN)_X sequence motifs in Hfq-dependent riboregulation in bacteria in vivo. These motifs are widespread in Hfq-regulated RNAs in general; they are particularly abundant in the 5′UTRs of mRNAs and are also present in several sRNAs^{21, 34, 37, 40, 42}. Previous research has shown that (ARN)_X motifs provide essential Hfq binding sites and interact with Hfq’s distal site^{31, 42, 65, 67, 68, 70, 71}. Hfq binding involves up to six ARN triplets and occurs on a circular fashion³⁶, as seen in the crystal structures (Fig. 1 and Link et al.³⁵). Interestingly, every third base at the N-site is excluded from Hfq binding and points towards the solvent. In this study, we present a crystal structure of an E. coli Hfq-A₁₈ RNA complex, which reveals an additional structural feature of (ARN)_X motifs. It shows that, when bound to Hfq, these motifs can create base-stacking interactions between two RNA molecules (Fig. 1). Surprisingly, the observed interaction is mediated by the flipped-out N-site bases, proposing a functional role for these so far enigmatic residues and their unusual positioning on Hfq’s surface. Compared to previously reported Hfq-poly(A) RNA structures, the orientations of the N-site bases are practically unchanged, suggesting that stacking interactions can be formed without requiring any significant conformational changes after Hfq binding. Remarkably, rotation of the flipped-out base is restricted by the proximity of the protein surface to only a few tens of degrees, suggesting that Hfq actively prepares the observed RNA configuration.

Using abasic RNA probes that specifically lack the N-site bases, we provide several lines of biophysical evidence that support the occurrence of the structurally observed supramolecular interaction in solution and confirm its dependence on the N-site bases. Although in EMSA, ITC and FA experiments the exact composition of the higher order complexes could not be directly determined and the formation of e.g. 2:1 Hfq₆:RNA complexes that have been observed previously by others^{38, 43} could not be excluded, our AUC experiments strongly suggest a 2:2 complex. 2:1 Hfq₆:RNA complexes are also thought to have low abundance and little relevance at physiological Hfq-RNA ratios^{30, 39, 43, 72}. Furthermore, we show that the assemblies strongly depend on the presence of flipped-out N-site bases in the RNA and their stability scales with the base stacking affinity of these bases. This agrees well with the base-stacking mediated 2:2 assembly in our crystal structure, but is difficult to recapitulate with 2:1 Hfq₆:RNA complexes as absence of the N-site bases would not be expected to influence tandem binding of two Hfq hexamers on one RNA (binding affinity is not affected; Figures S3 and S4). Finally, our results with the Hfq-fhlA-OxyS complex can only be explained with a 2:2 Hfq₆:RNA assembly (i.e. 2 Hfq₆: 1 fhlA: 1 OxyS), as neither of the two RNAs formed higher order complexes when binding to Hfq individually.

Our bioinformatical analysis of a large set of E. coli sRNAs revealed that (ARN)_X motifs are present in many sRNAs, where they are highly conserved and in some cases co-occur with a specific arrangement of characteristic sequence and secondary structure elements. These observations indicate that (ARN)_X motifs can play a role not only in mRNAs, but also in some (ARN)_X motif-containing sRNAs. Based on our structural data, we hypothesise that (ARN)_X motif-containing sRNAs may bind to Hfq’s distal site and interact with mRNAs that are bound to a separate Hfq hexamer using interlocking base stacking of the flipped-out N-site bases as seen in our crystal structure (Figs 1 and 5). Such interaction between preformed Hfq-RNA complexes may enable association between diverse RNA molecules, allowing them to quickly probe their complementarity; and, in case of a positive match, trigger further annealing of upstream and downstream segments of the affected (ARN)_X motif-containing sRNA-mRNA (Fig. 5). If true, this mechanism can provide a platform for rapid partner search on a ‘speed-dating’ fashion.

Of note, the interaction observed in our crystal structure is well suited to initiate RNA-RNA interactions transiently as (i) it occurs between appropriately pre-organized protein-RNA assemblies, (ii) it has low sequence specificity and can bring together a variety of RNAs, (iii) it positions the two RNA molecules in antiparallel orientation, as required for proper pairing⁶², (iv) it has only micro-molar affinity enabling a rapid turnover⁷³, (v) it requires additional sequence-specific interactions to create a stable pair for a proper gene regulatory response. This putative mechanism may act in concert with other known annealing pathways, supporting or specifying the function of specific (ARN)_X motif containing sRNA. Due to its specific physicochemical properties, ARN base stacking can be particularly beneficial for sRNAs that act on multiple target mRNAs, with most of which they share only little sequence complementarity. Here, base stacking can enable interaction with many potential target RNAs and allow them to find even short complementary matches. In addition, the (ARN)_X interactions can also help increase the affinity of these multifaceted sRNAs towards one or another of their targets and thus contribute to their specificity. Consistent with this idea, we find several multi-target sRNA in our list of (ARN)_X motif-containing sRNAs (Table 2).

One example of a (ARN)_X motif-containing sRNA-mRNA pair is the central oxidative stress response regulator OxyS and its prominent target fhlA. Various studies on OxyS-fhlA suggested a so-called ‘kissing loop’ annealing model, where short seed regions in stem loops flanking the Hfq binding sites interact with complementary segments in the partner RNA^{65, 66}. Now, our structural and biochemical results indicate that Hfq-bound OxyS and fhlA can interact via their (ARN)_X motifs, perhaps initiating and/or facilitating the full RNA pairing. These results are consistent with previous studies showing that both OxyS and fhlA interact with the distal site of Hfq^{21, 42, 65} and can help elaborate their non-canonical mechanism of pairing.

Interestingly, our bioinformatics analysis also revealed that several other (ARN)_X-containing sRNA-mRNA pairs contain similar architectural features including stem loops and seed sequences flanking the (ARN)_X motifs. It will be interesting to test if these pairs can also associate via their (ARN)_X motifs and follow annealing mechanisms that are similar to the OxyS-fhlA pair.

Our results are consistent with the work of others showing that (ARN)_X motifs contribute to Hfq binding and riboregulation in several sRNAs (e.g. OxyS, MgrR, and MicM/ChiX)^{21, 42, 68, 74, 75}, whereas these motifs are absent and counterproductive to function if introduced in others (e.g. RyhB, DsrA, RydC, etc.)^{21, 27}. In fact, a recent study by Schu et al.²¹ showed that the stability and function of a number of sRNAs is compromised in Hfq distal face mutants and suggested that (ARN)_X motifs define the stability, target choice and functional role of a specific class of sRNAs (defined as Class II). In agreement with these studies, we find (ARN)_X motif matches in prominent Class II sRNAs (such as MicM and MgrR). Furthermore, Schu et al. also proposed that Class II sRNAs interact with mRNA targets bound to the rim of Hfq, in contrast to sRNAs without (ARN)_X motifs (Class I) that bind to the proximal and rim sites and anneal with mRNA targets bound to Hfq’s distal site. As the rim site of Hfq is smaller and weaker than the other binding sites and many Class II mRNA targets also have (ARN)_X motifs²¹, the ARN-ARN interactions observed here may help initiate or stabilize sRNA-mRNA contacts, thereby contributing to Class II sRNA function. In addition, several sRNAs, including OxyS, showed intermediate behaviours in the studies of Schu et al.²¹, suggesting that the mechanistic diversity of sRNA-mRNA pairing may be even greater. Our ARN pattern search protocol can help identify (ARN)_X motifs in bacterial RNAs more broadly, thus helping to classify sRNAs and derive testable hypotheses for their functional and mechanistic features. In accord, our ARN-containing RNA set also contains sRNAs that have not yet been implicated to interact with Hfq, and it will be interesting to test if these RNAs may rely on Hfq under specific cellular conditions.

We speculate that the proposed RNA interaction model may be relevant in different Gram - negative bacteria, as the binding mode of poly(A) RNA is shared and our bioinformatics analysis revealed conservation of (ARN)_X motifs in these species. Hfq proteins in Gram - positive bacteria bind RNA differently at their distal site, relying on a bipartite RNA-binding motif with no flipped-out bases; thus, our model is probably not applicable to these species. However, to investigate the exact impact of Hfq-mediated ARN base stacking in vivo and its species-specific features, further studies will be required.

One of our most surprising results is that Hfq can use base stacking to mediate RNA-RNA interactions. Base stacking is prominent in DNA, where it provides a major force stabilizing the structure of the double helix. In RNA, it was observed in structured tRNAs, rRNAs, ribozymes and in the ribosome^76,77,78 and has accessory roles in organizing the tertiary fold. Now, we show that base stacking is not a sole property of complex folded RNAs, but it can also occur between two separate single-stranded RNAs if supported by the RNA-chaperone protein Hfq.

The putative role of N-site base stacking in sRNA-mRNA interactions naturally raises the question if the identity of these bases matters for sRNA-mRNA pairing. In other words, are all bases and base combinations at the N-site able to interact equally well or does the identity of these bases convey a hidden code? If such a hidden code exists, this could contribute to specificity of sRNA-mRNA pairing and help ensure the selectivity of gene regulation. Our observations that poly(A) and ‘AAG’ containing RNAs interact more strongly than ‘AAC’ or ‘AAU’ sequences support this hidden code idea. Such preference for the pyrimidine bases can be explained by their advantageous base stacking and hydrogen bonding properties that help keep the RNA-mediated dimeric complex together. Consistently, we find that neither Hfq-OxyS nor Hfq-fhlA complexes can self-pair, only their ternary complex forms base stacked Hfq₆:RNA dimers. However, elucidating the exact role of base stacking in sRNA-mRNA pairing and the principles of their specificity will require substantial further analysis.

Materials and Methods

RNA oligonucleotides

All RNA oligonucleotides were synthesized by Integrated DNA Technologies (IDT; Leuven –Belgium). A₃₀ and A₂₀ contain 30 and 20 consecutive adenine nucleotides, respectively. The ‘AA0’ RNA had the following sequence: (AA0)₆AA, where 0 denotes an abasic nucleotide. Similarly, ‘AAC’, ‘AAU’ and ‘AAG’ sequences were (AAC)₆AA, (AAU)₆AA and (AAG)₆AA, respectively. The oligonucleotide representing the ARN motifs of OxyS was derived from Gottesman et al.⁶⁴ and comprises nucleotides 57 to 86 of full length OxyS, giving rise to the sequence: 5′-UCAACUCGAAUAACUAAAGCCAACGUGAAC-3′. In Oxy0, presumed N-site bases were exchanged for abasic nucleotides, denoted by 0: 5′-UCAA0UC0AA0AA0UAA0GCCAA0GU0AA-3′. The fhlA ARN segment oligonucleotide was constructed based on Salim et al.⁶⁵ and comprises the two (ARN)_X regions (nucleotides −78 to −65 and −14 to +5) directly fused to each other, giving rise to the sequence: 5′-CUAAUAAAAUUCUACCUAGAAGAACAAAAUGUC-3′. Residues −64 to −13 were replaced by a CC-dinucleotide, G at position −11 was replaced for A. For analytical ultracentrifugation, OxyS, Oxy0 and fhlA were synthesized with Cy5- and Cy3 fluorescence labels at their 3′-end, respectively. A modified A₂₀ RNA, with an ATTO488-dye at the 3′-end was used for fluorescence anisotropy measurements.

Protein production and crystallization

DNA encoding Escherichia coli Hfq72 (containing amino acids 1–72), Hfq102 (full length, aa 1–102), Hfq102^R16A,R17A (full length, R16A-R17A solubility mutant), and poly(A)-polymerase 1 (PAP-1; aa 19–478) were cloned into pETM28-SUMO vector and the 6xHis-SUMO-tagged proteins were expressed in E. coli BL21(DE3) cells in TB medium at 37 °C for 20 h (all three Hfq constructs) or 4 h (the PAP-1 construct). The cell lysate was applied to Ni-Sepharose column (His-Trap, GE Healthcare) in 0.1 M Hepes/NaOH pH 8.0, 0.5 M NaCl, 0.005 M TCEP. To remove nucleic acid contamination, the proteins were washed with 1 M LiCl on the column before eluting with imidazole. The eluate was then incubated with SenP2 protease (1:100) for 18 h at 4 °C and the cleaved SUMO-tag was removed via a second Ni purification. Proteins were further purified by size exclusion chromatography on a Superdex 200 column (0.05 M Hepes/NaOH pH 8.0, 0.5 M NaCl), concentrated to 10 mg/ml, and stored in 0.05 M Hepes/NaOH pH 8.0, 0.5 M NaCl at −80 °C until further use. For poly(A)-polymerase 1, a Heparin-Sepharose purification was included (using 0.05 M Hepes/NaOH pH 8.0, 0.5 M NaCl–2 M NaCl) after SenP2 cleavage to better remove the cleaved SUMO-tag and nucleic acid contaminations.

For crystallization, complexes were formed by mixing Hfq72, A₃₀ RNA and PAP-1 in a 1:1.2:1 molar ratio in HS-buffer (2 M NaCl, 0.02 M Hepes pH 8.0, 0.005 M MgCl₂, 5% Glycerol) and dialyzing the solution against CX-buffer (0.25 M NaCl, 0.02 M Hepes pH 8.0, 0.05 M MgCl₂, 10% Glycerol), and concentrated to 5 mg/ml. Crystals were grown at 20 °C in hanging drop vapor diffusion plates combining equal volumes of the complex solution with the well solution containing 0.1 M phosphate-citrate buffer pH 4.2, 27% PEG 1000, and 0.2 M LiSO₄.

Data collection and structure determination

Crystals were cryo-protected with 12% 2,3-butanediol in the well solution and flash frozen in liquid nitrogen. X-ray data collection was performed at 100 K; diffraction images were collected at BM30A (ESRF, Grenoble). Diffraction data was processed to 2.5 Å resolution with XDS⁷⁹. Even though the signal to noise ratio was still quite high at this resolution, the data was cut due to low completeness in the high resolution range (Table 1). The latter was probably caused by suboptimal placement of the X-ray detector during data collection, precluding collection of all diffraction data to the highest possible resolution. The structure was solved by molecular replacement in PHASER using the unliganded E. coli Hfq structure as a search model (PDB-ID:1HK9)^{80, 81}. The crystals belonged to space group R32 and the asymmetric unit contained two Hfq72 subunits bound to two adenine oligonucleotides, with crystallographic symmetry generating the biologically relevant homo-hexamers and the complete A₁₈ chain. Model building in COOT⁸² was alternated with refinement in PHENIX⁸³ until the R-values converged (Table 1). The structure was validated with MOLPROBITY⁸⁴. Structure factors and coordinates have been deposited with the Protein Data Bank under accession number 5NEW. Molecular images were generated in PyMOL⁸⁵.

Size exclusion chromatography (SEC)

Hfq72-RNA complexes were prepared at 10 µM concentration in AUC-buffer (0.25 M NaCl, 0.02 M Hepes/NaOH pH 8.0, 0.005 M MgCl₂, 2% Glycerol) and run at 0.05 ml/min on a Superdex S200 10/300 (GE Healthcare) pre-equilibrated in AUC buffer. UV-absorbance data were collected at 280 nm and 254 nm respectively.

Analytical ultracentrifugation (AUC)

Hfq72-RNA and Hfq102-RNA complexes were prepared in AUC-buffer (0.25 M NaCl, 0.02 M Hepes pH 8.0, 0.005 M MgCl₂, 2% Glycerol) at a concentration of 33 µM. Sedimentation velocity experiments were performed at 20 °C in a Beckman OptimaXL-A centrifuge fitted with a four-hole AN-60 rotor and double-sector Epon centerpieces at 45 000 rpm. To unambiguously assign the composition of the complexes, absorbance data were collected at 280 nm for RNA-free Hfq, at 254 nm for samples containing unlabelled RNA or at 548 nm or 650 nm for fluorescent oligos. Data were analysed by the c(s) method using the Sedfit software package⁸⁶. The observed s-values were compared with theoretical sedimentation coefficients calculated from our Hfq72-A₂₀ crystal structure using HYDROPRO 5a^{87, 88}. The viscosity (1.087 mPa·s) and the density (1.015 g/ml) of the AUC-buffer were calculated using the program SEDNTERP V1.09 (J.Philo, D. Hayes, T. Laue). The partial specific volumes were 0.530 ml/g for the RNA, 0.747 ml/g for the protein and 0.721 ml/g for the complex.

Electrophoretic mobility shift assays (EMSA)

Previous EMSA experiments indicated that full length Hfq102 migrates more effectively into native polyacrylamide gels than the truncated Hfq72 variant (see e.g. Updegrove et al.³⁹). It was also previously shown that mutation of two arginine residues, R16 and R17 to alanine in E. coli Hfq reduces non-specific protein aggregation. Therefore, to ensure that potential higher order assemblies do not result from protein aggregation in electrophoretic mobility shift assays, we used a Hfq102^R16A,R17A mutant^{30, 61, 62}. These mutations at the Hfq’s rim site do not affect binding of A-rich RNA²¹. Hfq102^R16A,R17A was expressed and purified as described above. ssRNA substrates were 5′-³²P-labelled using T4 Polynucleotide Kinase (NEB) and [γ-³²P]-ATP (Hartmann Analytic), and then purified on a Bio-Spin 6 column (Bio-Rad) following the manufacturers recommendations. Radiolabelled RNA was then incubated with varying amounts of Hfq102^R16A,R17A in EMSA-buffer (0.25 M NaCl, 0.05 M Tris pH 7.5, 10% (v/v) Glycerol) for 30 min at 25 °C. Each 10 µl reaction contained 1 µl of 200 nM labelled ssRNA and an increasing excess of Hfq102^R16A,R17A (1 μl of 20 nM–2000 nM). Complexes were separated via polyacrylamide gel electrophoresis using native 4–20% gradient gels in 1xTBE running buffer and results were imaged on a Typhoon FLA 9500 phosphoimager.

Isothermal titration calorimetry (ITC)

ITC experiments were carried out with an ITC₂₀₀ microcalorimeter (GE-Healthcare; Microcal) at 25 °C in AUC buffer (0.25 M NaCl, 0.02 M Hepes/NaOH pH 8.0, 0.005 M MgCl₂, 2% Glycerol) after intensive dialysis of both Hfq72 and RNA overnight. The RNA was loaded in the sample cell at a concentration of 10 µM and was titrated with 150 µM protein solution from the injection syringe. The heat of dilution was measured in control titrations with buffer and subtracted from the binding data. Data were analysed using the Origin 7.0 (Microcal) software. After testing several binding models, the Hfq72-‘AA0’ binding data was best fit by a ‘one-set-of-binding-sites’ model, while the Hfq72-A₂₀ ITC data corresponded best to a ‘two-set-of-binding-sites’ model.

Fluorescence anisotropy measurements

For fluorescence anisotropy, Hfq72 was dialyzed against AUC buffer. Starting from a concentration of 195 µM, Hfq72 was serially diluted by a factor of 0.66 in AUC buffer supplemented with BSA at a final concentration of 1 g/l. The resulting solutions were mixed with 3′-ATTO488-labelled A₂₀ (2 nM) in a final volume of 150 µl. Samples were prepared in triplicates in 96 - well plates. Anisotropy measurements were conducted in an Infinite M1000 plate reader (TECAN) at 25 °C. Excitation wavelength was 470 nm and the emitted light was recorded at 530 nm. Data were processed and fit according to a ‘two-set-of sites’-model in the GraphPad Prism software package.

Creation of permuted ARN pattern sets for sRNA sequence analysis

Since the rules defining an (ARN)_X motif were unknown, in order to survey the occurrence of (ARN)_X motifs in sRNAs comprehensively we designed and implemented a custom algorithm that created distinct sets of ARN patterns with several different pattern definitions.

Our pattern definitions varied primarily in their degrees of ambiguity. The starting point was a conservative ARN pattern definition containing six consecutive ARN triplets directly following each other. Then, several properties of the pattern were defined in a more permissive manner, in a way that was consistent with known examples of ARN motif sequences. First, one or two non-functional (non-ARN) triplets were allowed within the pattern. Second, one or two single nucleotide gaps were introduced next to ARN triplets. Third, single gaps were allowed anywhere in the pattern. Forth, combinations of the above-described different ambiguity properties were also allowed.

Using these pattern definitions, we then algorithmically created comprehensive pattern sets by permuting the combinations and positions of non-ARN elements (triplets or gaps) within the patterns. The resulting ensemble of pattern sets allowed us to cover the entire possible diversity that may occur in a potential (ARN)_X motif and was used for pattern matching in sRNA sequences to produce distinct sets of results for each pattern definition.

ARN pattern search and bioinformatic analysis

E. coli sRNA sequences were extracted from the Storz lab resource⁶⁹ and homologous sRNA sequences in other bacteria were identified using BLASTN in NCBI and KEGG^89,90,91. We did not attempt to sort the sRNA dataset into positive and negative interactors, as negatives under one condition may interact under different cell or experimental conditions, as was recently seen with McaS^{21, 92}. Sequences were aligned with Clustal W 2.0⁹³ and displayed in Jalview⁹⁴. E. coli mRNA sequence data was retrieved from Genolist⁹⁵ and analysed within a sequence window containing the 5′ UTR and the first 80 nts (+1–+80) of the coding sequence. In cases where 5′ UTR annotation was not available (e.g. for downstream genes in multigene operons), position −80 was used as a default starting point. These coordinate ranges were chosen to include all known functional regions of well-annotated mRNAs (such as fhlA) and exceed them by a safety margin.

Bioinformatic analysis was conducted in several steps using custom designed Perl algorithms and the overall pipeline is illustrated in Figure S5. For sRNAs, ARN pattern searches were performed in iterative fashion. To generate the initial patterns, we analysed experimentally validated (ARN)_X regions as well as the requirements of strong Hfq binding at this region, and algorithmically generated a number of ARN pattern definitions containing different numbers of ARN triplets in combination with non-ARN triplets and distinct single gaps. With these, we performed pattern matching in sRNAs to get an overview of the distribution of different kinds of ARN motifs in these sequences. This survey identified commonly observed patterns, which were then further refined iteratively by analysing similarities in the identified candidate (ARN)_X regions, conservation of their sequence, and their structural features. The resulting refined ARN pattern was then used to annotate (ARN)_X motifs in 67 sRNAs. Overlapping matches were merged into a single (ARN)_X region (Table 2).

Secondary structure predictions were performed using KineFold, Mfold, and RNAfold^96,97,98, where (ARN)_X motifs were protected from base-pairing. Plots were generated with Mfold and colour-coding was added manually. For identifying complementary regions between sRNAs and their target mRNAs, we defined the seed region in our search algorithm to minimally contain either seven consecutive base pairs or six consecutive base pairs flanked by a single gap followed by two base pairs.

Accession codes

Coordinates and structure factors have been deposited in the Protein Data Bank under accession code 5NEW.

References

Lenz, D. H. et al. The small RNA chaperone Hfq and multiple small RNAs control quorum sensing in Vibrio harveyi and Vibrio cholerae. Cell 118, 69–82 (2004).
Article CAS PubMed Google Scholar
Shakhnovich, E. A., Davis, B. M. & Waldor, M. K. Hfq negatively regulates type III secretion in EHEC and several other pathogens. Molecular microbiology 74, 347–363, doi:10.1111/j.1365-2958.2009.06856.x (2009).
Article CAS PubMed PubMed Central Google Scholar
Papenfort, K., Sun, Y., Miyakoshi, M., Vanderpool, C. K. & Vogel, J. Small RNA-mediated activation of sugar phosphatase mRNA regulates glucose homeostasis. Cell 153, 426–437, doi:10.1016/j.cell.2013.03.003 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bobrovskyy, M. & Vanderpool, C. K. Regulation of bacterial metabolism by small RNAs using diverse mechanisms. Annu Rev Genet 47, 209–232, doi:10.1146/annurev-genet-111212-133445 (2013).
Article CAS PubMed Google Scholar
Altuvia, S., Weinstein-Fischer, D., Zhang, A., Postow, L. & Storz, G. A small, stable RNA induced by oxidative stress: role as a pleiotropic regulator and antimutator. Cell 90, 43–53 (1997).
Article CAS PubMed Google Scholar
Chao, Y. & Vogel, J. The role of Hfq in bacterial pathogens. Current opinion in microbiology 13, 24–33, doi:10.1016/j.mib.2010.01.001 (2010).
Article CAS PubMed Google Scholar
Storz, G., Vogel, J. & Wassarman, K. M. Regulation by small RNAs in bacteria: expanding frontiers. Molecular cell 43, 880–891, doi:10.1016/j.molcel.2011.08.022 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wilson, J. W. et al. Space flight alters bacterial gene expression and virulence and reveals a role for global regulator Hfq. Proceedings of the National Academy of Sciences of the United States of America 104, 16299–16304, doi:10.1073/Pnas.0707155104 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Chambers, J. R. & Sauer, K. Small RNAs and their role in biofilm formation. Trends Microbiol 21, 39–49, doi:10.1016/j.tim.2012.10.008 (2013).
Article CAS PubMed Google Scholar
Jorgensen, M. G. et al. Small regulatory RNAs control the multi-cellular adhesive lifestyle of Escherichia coli. Mol Microbiol 84, 36–50, doi:10.1111/j.1365-2958.2012.07976.x (2012).
Article PubMed Google Scholar
Sharma, C. M. & Vogel, J. Experimental approaches for the discovery and characterization of regulatory small RNA. Curr Opin Microbiol 12, 536–546, doi:10.1016/j.mib.2009.07.006 (2009).
Article CAS PubMed Google Scholar
Melamed, S. et al. Global Mapping of Small RNA-Target Interactions in Bacteria. Mol Cell 63, 884–897, doi:10.1016/j.molcel.2016.07.026 (2016).
Article CAS PubMed PubMed Central Google Scholar
Han, K., Tjaden, B. & Lory, S. GRIL-seq provides a method for identifying direct targets of bacterial small regulatory RNA by in vivo proximity ligation. Nat Microbiol 2, 16239, doi:10.1038/nmicrobiol.2016.239 (2016).
Article PubMed Google Scholar
Papenfort, K., Bouvier, M., Mika, F., Sharma, C. M. & Vogel, J. Evidence for an autonomous 5′ target recognition domain in an Hfq-associated small RNA. Proc Natl Acad Sci USA 107, 20435–20440, doi:10.1073/pnas.1009784107 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Peterman, N., Lavi-Itzkovitz, A. & Levine, E. Large-scale mapping of sequence-function relations in small regulatory RNAs reveals plasticity and modularity. Nucleic Acids Res 42, 12177–12188, doi:10.1093/nar/gku863 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ansong, C. et al. Global systems-level analysis of Hfq and SmpB deletion mutants in Salmonella: implications for virulence and global protein translation. PloS one 4, e4809, doi:10.1371/journal.pone.0004809 (2009).
Article ADS PubMed PubMed Central Google Scholar
Bilusic, I., Popitsch, N., Rescheneder, P., Schroeder, R. & Lybecker, M. Revisiting the coding potential of the E. coli genome through Hfq co-immunoprecipitation. RNA Biol 11, 641–654 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chambers, J. R. & Bender, K. S. The RNA Chaperone Hfq Is Important for Growth and Stress Tolerance in Francisella novicida. PloS one 6, doi:10.1371/journal.pone.0019797 (2011).
Vogel, J. & Luisi, B. F. Hfq and its constellation of RNA. Nature reviews. Microbiology 9, 578–589, doi:10.1038/nrmicro2615 (2011).
Article CAS PubMed PubMed Central Google Scholar
Moller, T. et al. Hfq: a bacterial Sm-like protein that mediates RNA-RNA interaction. Molecular cell 9, 23–30 (2002).
Article ADS CAS PubMed Google Scholar
Schu, D. J., Zhang, A., Gottesman, S. & Storz, G. Alternative Hfq-sRNA interaction modes dictate alternative mRNA recognition. EMBO J 34, 2557–2573, doi:10.15252/embj.201591569 (2015).
Article CAS PubMed PubMed Central Google Scholar
Weichenrieder, O. RNA binding by Hfq and ring-forming (L)Sm proteins: a trade-off between optimal sequence readout and RNA backbone conformation. RNA Biol 11, 537–549, doi:10.4161/rna.29144 (2014).
Article CAS PubMed PubMed Central Google Scholar
Holmqvist, E. et al. Global RNA recognition patterns of post-transcriptional regulators Hfq and CsrA revealed by UV crosslinking in vivo. EMBO J 35, 991–1011, doi:10.15252/embj.201593360 (2016).
Article CAS PubMed PubMed Central Google Scholar
Updegrove, T. B., Zhang, A. & Storz, G. Hfq: the flexible RNA matchmaker. Curr Opin Microbiol 30, 133–138, doi:10.1016/j.mib.2016.02.003 (2016).
Article CAS PubMed PubMed Central Google Scholar
Beich-Frandsen, M. et al. Structural insights into the dynamics and function of the C-terminus of the E. coli RNA chaperone Hfq. Nucleic acids research 39, 4900–4915, doi:10.1093/nar/gkq1346 (2011).
Article CAS PubMed PubMed Central Google Scholar
Robinson, K. E., Orans, J., Kovach, A. R., Link, T. M. & Brennan, R. G. Mapping Hfq-RNA interaction surfaces using tryptophan fluorescence quenching. Nucleic acids research 42, 2736–2749, doi:10.1093/nar/gkt1171 (2014).
Article CAS PubMed Google Scholar
Dimastrogiovanni, D. et al. Recognition of the small regulatory RNA RydC by the bacterial Hfq protein. Elife 3, doi:10.7554/eLife.05375 (2014).
Vecerek, B., Rajkowitsch, L., Sonnleitner, E., Schroeder, R. & Blasi, U. The C-terminal domain of Escherichia coli Hfq is required for regulation. Nucleic Acids Res 36, 133–143, doi:10.1093/nar/gkm985 (2008).
Article CAS PubMed Google Scholar
Panja, S., Schu, D. J. & Woodson, S. A. Conserved arginines on the rim of Hfq catalyze base pair formation and exchange. Nucleic Acids Res 41, 7536–7546, doi:10.1093/nar/gkt521 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sauer, E., Schmidt, S. & Weichenrieder, O. Small RNA binding to the lateral surface of Hfq hexamers and structural rearrangements upon mRNA target recognition. Proceedings of the National Academy of Sciences of the United States of America, doi:10.1073/pnas.1202521109 (2012).
Zhang, A., Schu, D. J., Tjaden, B. C., Storz, G. & Gottesman, S. Mutations in interaction surfaces differentially impact E. coli Hfq association with small RNAs and their mRNA targets. J Mol Biol 425, 3678–3697, doi:10.1016/j.jmb.2013.01.006 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sauer, E. & Weichenrieder, O. Structural basis for RNA 3′-end recognition by Hfq. Proceedings of the National Academy of Sciences of the United States of America 108, 13065–13070, doi:10.1073/pnas.1103420108 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Otaka, H., Ishikawa, H., Morita, T. & Aiba, H. PolyU tail of rho-independent terminator of bacterial small RNAs is essential for Hfq action. Proc Natl Acad Sci USA 108, 13059–13064, doi:10.1073/pnas.1107050108 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Lorenz, C. et al. Genomic SELEX for Hfq-binding RNAs identifies genomic aptamers predominantly in antisense transcripts. Nucleic acids research 38, 3794–3808, doi:10.1093/nar/gkq032 (2010).
Article CAS PubMed PubMed Central Google Scholar
Link, T. M., Valentin-Hansen, P. & Brennan, R. G. Structure of Escherichia coli Hfq bound to polyriboadenylate RNA. Proceedings of the National Academy of Sciences of the United States of America 106, 19292–19297, doi:10.1073/pnas.0908744106 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
de Haseth, P. L. & Uhlenbeck, O. C. Interaction of Escherichia coli host factor protein with oligoriboadenylates. Biochemistry 19, 6138–6146 (1980).
Article PubMed Google Scholar
Tree, J. J., Granneman, S., McAteer, S. P., Tollervey, D. & Gally, D. L. Identification of Bacteriophage-Encoded Anti-sRNAs in Pathogenic Escherichia coli. Molecular cell. doi:10.1016/j.molcel.2014.05.006 (2014).
PubMed PubMed Central Google Scholar
Wang, W. et al. Cooperation of Escherichia coli Hfq hexamers in DsrA binding. Genes Dev 25, 2106–2117, doi:10.1101/gad.16746011 (2011).
Article CAS PubMed PubMed Central Google Scholar
Updegrove, T. B., Correia, J. J., Chen, Y., Terry, C. & Wartell, R. M. The stoichiometry of the Escherichia coli Hfq protein bound to RNA. RNA 17, 489–500, doi:10.1261/rna.2452111 (2011).
Article CAS PubMed PubMed Central Google Scholar
Figueroa-Bossi, N., Valentini, M., Malleret, L., Fiorini, F. & Bossi, L. Caught at its own game: regulatory small RNA inactivated by an inducible transcript mimicking its target. Genes & development 23, 2004–2015, doi:10.1101/gad.541609 (2009).
Article CAS Google Scholar
Moon, K. & Gottesman, S. Competition among Hfq-binding small RNAs in Escherichia coli. Molecular microbiology 82, 1545–1562, doi:10.1111/j.1365-2958.2011.07907.x (2011).
Article CAS PubMed Google Scholar
Zhang, A., Wassarman, K. M., Ortega, J., Steven, A. C. & Storz, G. The Sm-like Hfq protein increases OxyS RNA interaction with target mRNAs. Molecular cell 9, 11–22 (2002).
Article PubMed Google Scholar
Henderson, C. A. et al. Hfq binding changes the structure of Escherichia coli small noncoding RNAs OxyS and RprA, which are involved in the riboregulation of rpoS. RNA 19, 1089–1104, doi:10.1261/rna.034595.112 (2013).
Article CAS PubMed PubMed Central Google Scholar
Altuvia, S., Zhang, A., Argaman, L., Tiwari, A. & Storz, G. The Escherichia coli OxyS regulatory RNA represses fhlA translation by blocking ribosome binding. The EMBO journal 17, 6069–6075, doi:10.1093/emboj/17.20.6069 (1998).
Article CAS PubMed PubMed Central Google Scholar
Zhang, A. et al. The OxyS regulatory RNA represses rpoS translation and binds the Hfq (HF-I) protein. The EMBO journal 17, 6061–6068, doi:10.1093/emboj/17.20.6061 (1998).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. et al. Structural insights into the recognition of the internal A-rich linker from OxyS sRNA by Escherichia coli Hfq. Nucleic Acids Res 43, 2400–2411, doi:10.1093/nar/gkv072 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sobrero, P. & Valverde, C. The bacterial protein Hfq: much more than a mere RNA-binding factor. Crit Rev Microbiol 38, 276–299, doi:10.3109/1040841X.2012.664540 (2012).
Article CAS PubMed Google Scholar
Fender, A., Elf, J., Hampel, K., Zimmermann, B. & Wagner, E. G. RNAs actively cycle on the Sm-like protein Hfq. Genes & development 24, 2621–2626, doi:10.1101/gad.591310 (2010).
Article CAS Google Scholar
Santiago-Frangos, A., Kavita, K., Schu, D. J., Gottesman, S. & Woodson, S. A. C-terminal domain of the RNA chaperone Hfq drives sRNA competition and release of target RNA. Proc Natl Acad Sci USA 113, E6089–E6096, doi:10.1073/pnas.1613053113 (2016).
Article CAS PubMed PubMed Central Google Scholar
Masse, E., Escorcia, F. E. & Gottesman, S. Coupled degradation of a small regulatory RNA and its mRNA targets in Escherichia coli. Genes & development 17, 2374–2383, doi:10.1101/gad.1127103 (2003).
Article CAS Google Scholar
Bandyra, K. J. & Luisi, B. F. Licensing and due process in the turnover of bacterial RNA. RNA Biol 10, 627–635, doi:10.4161/rna.24393 (2013).
Article CAS PubMed PubMed Central Google Scholar
Papenfort, K. et al. SigmaE-dependent small RNAs of Salmonella respond to membrane stress by accelerating global omp mRNA decay. Mol Microbiol 62, 1674–1688 (2006).
Article CAS PubMed PubMed Central Google Scholar
Vanderpool, C. K. & Gottesman, S. Involvement of a novel transcriptional activator and small RNA in post-transcriptional regulation of the glucose phosphoenolpyruvate phosphotransferase system. Mol Microbiol 54, 1076–1089, doi:10.1111/j.1365-2958.2004.04348.x (2004).
Article CAS PubMed Google Scholar
Fei, J. et al. RNA biochemistry. Determination of in vivo target search kinetics of regulatory noncoding RNA. Science 347, 1371–1374, doi:10.1126/science.1258849 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, W., Wang, L., Wu, J., Gong, Q. & Shi, Y. Hfq-bridged ternary complex is important for translation activation of rpoS by DsrA. Nucleic acids research 41, 5938–5948, doi:10.1093/nar/gkt276 (2013).
Article CAS PubMed PubMed Central Google Scholar
Schumacher, M. A., Pearson, R. F., Moller, T., Valentin-Hansen, P. & Brennan, R. G. Structures of the pleiotropic translational regulator Hfq and an Hfq-RNA complex: a bacterial Sm-like protein. The EMBO journal 21, 3546–3556, doi:10.1093/emboj/cdf322 (2002).
Article CAS PubMed PubMed Central Google Scholar
Vincent, H. A. et al. Characterization of Vibrio cholerae Hfq Provides Novel Insights into the Role of the Hfq C-Terminal Region. Journal of molecular biology, doi:10.1016/j.jmb.2012.03.028 (2012).
Sun, X., Zhulin, I. & Wartell, R. M. Predicted structure and phyletic distribution of the RNA-binding protein Hfq. Nucleic acids research 30, 3662–3671 (2002).
Article CAS PubMed PubMed Central Google Scholar
Rajkowitsch, L. & Schroeder, R. Dissecting RNA chaperone activity. RNA 13, 2053–2060, doi:10.1261/rna.671807 (2007).
Article CAS PubMed PubMed Central Google Scholar
Howlett, G. J., Minton, A. P. & Rivas, G. Analytical ultracentrifugation for the study of protein association and assembly. Curr Opin Chem Biol 10, 430–436, doi:10.1016/j.cbpa.2006.08.017 (2006).
Article CAS PubMed Google Scholar
Panja, S. & Woodson, S. A. Hexamer to monomer equilibrium of E. coli Hfq in solution and its impact on RNA annealing. Journal of molecular biology 417, 406–412, doi:10.1016/j.jmb.2012.02.009 (2012).
Article CAS PubMed PubMed Central Google Scholar
Panja, S. & Woodson, S. A. Hfq proximity and orientation controls RNA annealing. Nucleic acids research, doi:10.1093/nar/gks618 (2012).
Friedman, R. A. & Honig, B. A free energy analysis of nucleic acid base stacking in aqueous solution. Biophys J 69, 1528–1535, doi:10.1016/S0006-3495(95)80023-8 (1995).
Article CAS PubMed PubMed Central Google Scholar
Gottesman, S. The small RNA regulators of Escherichia coli: roles and mechanisms. Annual review of microbiology 58, 303–328, doi:10.1146/annurev.micro.58.030603.123841 (2004).
Salim, N. N. & Feig, A. L. An upstream Hfq binding site in the fhlA mRNA leader region facilitates the OxyS-fhlA interaction. PloS one 5, doi:10.1371/journal.pone.0013028 (2010).
Argaman, L. & Altuvia, S. fhlA repression by OxyS RNA: kissing complex formation at two sites results in a stable antisense-target RNA complex. Journal of molecular biology 300, 1101–1112, doi:10.1006/jmbi.2000.3942 (2000).
Article CAS PubMed Google Scholar
Soper, T. J. & Woodson, S. A. The rpoS mRNA leader recruits Hfq to facilitate annealing with DsrA sRNA. RNA 14, 1907–1917, doi:10.1261/rna.1110608 (2008).
Article CAS PubMed PubMed Central Google Scholar
Updegrove, T., Wilf, N., Sun, X. & Wartell, R. M. Effect of Hfq on RprA-rpoS mRNA pairing: Hfq-RNA binding and the influence of the 5′ rpoS mRNA leader region. Biochemistry 47, 11184–11195, doi:10.1021/bi800479p (2008).
Article CAS PubMed Google Scholar
Storz, G. E. coli small RNAs http://cbmp.nichd.nih.gov/segr/ecoli_rnas.html (2010).
Salim, N. N., Faner, M. A., Philip, J. A. & Feig, A. L. Requirement of upstream Hfq-binding (ARN)x elements in glmS and the Hfq C-terminal region for GlmS upregulation by sRNAs GlmZ and GlmY{dagger}. Nucleic acids research, doi:10.1093/nar/gks392 (2012).
Beisel, C. L., Updegrove, T. B., Janson, B. J. & Storz, G. Multiple factors dictate target selection by Hfq-binding small RNAs. EMBO J 31, 1961–1974, doi:10.1038/emboj.2012.52 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ribeiro Ede, A. Jr. et al. Structural flexibility of RNA as molecular basis for Hfq chaperone function. Nucleic acids research 40, 8072–8084, doi:10.1093/nar/gks510 (2012).
Article PubMed Google Scholar
Lim, W., Mayer, B. & Pawson, T. Cell Signaling: principles and mechanisms. (Garland Science, 2014).
Ellis, M. J., Trussler, R. S. & Haniford, D. B. Hfq binds directly to the ribosome-binding site of IS10 transposase mRNA to inhibit translation. Mol Microbiol 96, 633–650, doi:10.1111/mmi.12961 (2015).
Article CAS PubMed PubMed Central Google Scholar
Malecka, E. M., Strozecka, J., Sobanska, D. & Olejniczak, M. Structure of bacterial regulatory RNAs determines their performance in competition for the chaperone protein Hfq. Biochemistry 54, 1157–1170, doi:10.1021/bi500741d (2015).
Article CAS PubMed Google Scholar
Toor, N., Keating, K. S., Taylor, S. D. & Pyle, A. M. Crystal structure of a self-spliced group II intron. Science 320, 77–82, doi:10.1126/science.1153803 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Quigley, G. J. & Rich, A. Structural domains of transfer RNA molecules. Science 194, 796–806 (1976).
Article ADS CAS PubMed Google Scholar
Noller, H. F. RNA structure: reading the ribosome. Science 309, 1508–1514, doi:10.1126/science.1111771 (2005).
Article ADS CAS PubMed Google Scholar
Kabsch, W. Xds. Acta Crystallographica Section D-Biological Crystallography 66, 125–132, doi:10.1107/S0907444909047337 (2010).
Article CAS PubMed Central Google Scholar
Sauter, C., Basquin, J. & Suck, D. Sm-like proteins in Eubacteria: the crystal structure of the Hfq protein from Escherichia coli. Nucleic acids research 31, 4091–4098 (2003).
Article CAS PubMed PubMed Central Google Scholar
Mccoy, A. J. et al. Phaser crystallographic software. Journal of Applied Crystallography 40, 658–674, doi:10.1107/S0021889807021206 (2007).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallographica Section D-Biological Crystallography 66, 486–501, doi:10.1107/S0907444910007493 (2010).
Article CAS PubMed Central Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr 66, 213–221, doi:10.1107/S0907444909052925 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallographica Section D-Biological Crystallography 66, 12–21, doi:10.1107/S0907444909042073 (2010).
Article CAS Google Scholar
DeLano, W. L. The PyMol Molecular Viewer. DeLano Scientific, San Carlos, California, USA www.pymol.org (2002).
Schuck, P. Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and lamm equation modeling. Biophysical journal 78, 1606–1619, doi:10.1016/S0006-3495(00)76713-0 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Garcia de la Torre, J., Huertas, M. L. & Carrasco, B. Calculation of hydrodynamic properties of globular proteins from their atomic-level structure. Biophys J 78, 719–730, doi:10.1016/S0006-3495(00)76630-6 (2000).
Ortega, A., Amoros, D. & Garcia de la Torre, J. Prediction of hydrodynamic and other solution properties of rigid proteins from atomic- and residue-level models. Biophys J 101, 892–898, doi:10.1016/j.bpj.2011.06.046 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. et al. Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic acids research 42, D199–205, doi:10.1093/nar/gkt1076 (2014).
Article CAS PubMed Google Scholar
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic acids research 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. Journal of molecular biology 215, 403–410, doi:10.1016/S0022-2836(05)80360-2 (1990).
Article CAS PubMed Google Scholar
Zhang, A. et al. Global analysis of small RNA and mRNA targets of Hfq. Mol Microbiol 50, 1111–1124 (2003).
Article CAS PubMed Google Scholar
Larkin, M. A. et al. Clustal W and Clustal X version 2.0. Bioinformatics 23, 2947–2948, doi:10.1093/bioinformatics/btm404 (2007).
Article CAS PubMed Google Scholar
Waterhouse, A. M., Procter, J. B., Martin, D. M., Clamp, M. & Barton, G. J. Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191, doi:10.1093/bioinformatics/btp033 (2009).
Article CAS PubMed PubMed Central Google Scholar
Lechat, P., Hummel, L., Rousseau, S. & Moszer, I. GenoList: an integrated environment for comparative analysis of microbial genomes. Nucleic acids research 36, D469–474, doi:10.1093/nar/gkm1042 (2008).
Article CAS PubMed Google Scholar
Xayaphoummine, A., Bucher, T. & Isambert, H. Kinefold web server for RNA/DNA folding path and structure prediction including pseudoknots and knots. Nucleic acids research 33, W605–610, doi:10.1093/nar/gki447 (2005).
Article CAS PubMed PubMed Central Google Scholar
Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic acids research 31, 3406–3415 (2003).
Article CAS PubMed PubMed Central Google Scholar
Lorenz, R. et al. ViennaRNA Package 2.0. Algorithms Mol Biol 6, 26, doi:10.1186/1748-7188-6-26 (2011).
Article PubMed PubMed Central Google Scholar
Diederichs, K. & Karplus, P. A. Improved R-factors for diffraction data analysis in macromolecular crystallography. Nature structural biology 4, 269–275 (1997).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by: the European Molecular Biology Laboratory; the EMBL International PhD Programme (fellowship to FV); and the European Commission under the Marie Curie Intra-European Fellowship Programme (to ECS, PIEF-GA-2011-301489). The authors thank members of the Barabas lab for discussions and technical advice on the biochemical experiments. We also thank Drs. Piotr Neumann, Fred Dyda, Kiran R. Patil, Teresa Carlomagno, Isambert Herve (Kinefold) for helpful discussions, the Protein Expression and Purification Core Facility and the Crystallization Facility at EMBL Heidelberg for materials and support, and Dr. Brigitte Altenberg for molecular visualization. X-ray diffraction data were collected at ESRF (beamline BM30A), we thank the staff of ESRF and EMBL-Grenoble for their assistance and support in using the beamline.

Author information

Eike C. Schulz
Present address: Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761, Hamburg, Germany
Markus Seiler
Present address: Buchmann Institute for Molecular Life Sciences, Max-von-Laue-Str. 15, 60438, Frankfurt a.M., Germany
Franka Voigt
Present address: Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, 4058, Basel, Switzerland

Authors and Affiliations

Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117, Heidelberg, Germany
Eike C. Schulz, Markus Seiler, Cecilia Zuliani, Franka Voigt, Toby J. Gibson & Orsolya Barabas
Hamburg Outstation, European Molecular Biology Laboratory, Hamburg, 22603, Germany
Eike C. Schulz, Vivian Pogenberg & Matthias Wilmanns
Protein Expression and Purification Core Facility, European Molecular Biology Laboratory, 69117, Heidelberg, Germany
Vladimir Rybin
Division Biophysics of Macromolecules, German Cancer Research Center, Heidelberg, 69120, Germany
Norbert Mücke

Authors

Eike C. Schulz
View author publications
You can also search for this author in PubMed Google Scholar
Markus Seiler
View author publications
You can also search for this author in PubMed Google Scholar
Cecilia Zuliani
View author publications
You can also search for this author in PubMed Google Scholar
Franka Voigt
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir Rybin
View author publications
You can also search for this author in PubMed Google Scholar
Vivian Pogenberg
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Mücke
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Wilmanns
View author publications
You can also search for this author in PubMed Google Scholar
Toby J. Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Orsolya Barabas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.C.S.: designed and conducted research including biochemical, biophysical and crystallography experiments, analysed and evaluated results, contributed to RNA sequence analysis, prepared tables, figures and manuscript; M.S.: designed algorithms for sequence analysis, constructed bioinformatic pipelines, performed computational analysis, structural predictions and data integration, evaluated results, prepared tables, figures and manuscript; C.Z.: purified protein, performed AUC and EMSA experiments, and contributed to writing the manuscript; F.V.: conducted EMSA and assisted with AUC; V.R.: conducted ITC experiments; V.P.: assisted with fluorescence anisotropy; N.M. and V.R.: conducted and supervised AUC experiments and analyzed data; M.W.: provided resources and advice; T.J.G.: supervised computational analysis, contributed to writing the manuscript; O.B.: lead the project, designed research, performed data analysis, evaluated results, and wrote the manuscript.

Corresponding author

Correspondence to Orsolya Barabas.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schulz, E.C., Seiler, M., Zuliani, C. et al. Intermolecular base stacking mediates RNA-RNA interaction in a crystal structure of the RNA chaperone Hfq. Sci Rep 7, 9903 (2017). https://doi.org/10.1038/s41598-017-10085-8

Download citation

Received: 12 April 2017
Accepted: 02 August 2017
Published: 29 August 2017
DOI: https://doi.org/10.1038/s41598-017-10085-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.