Rhamnosylation by EarP as a model system for studying molecular complications in horizontal gene transfer
In order to investigate if any interaction occurs between EF-P PTM systems and EF-Ps that are not their natural targets, we first had to pick one of the three known PTMs of EF-P to focus on ((R)-β-lysylation, 5-aminopentanolylation, or α-rhamnosylation). We selected α-rhamnosylation for three reasons. One, the rhamnosylation pathway is comparatively straightforward and particularly amenable to HGT, as it is carried out by a single enzyme, EarP. Additionally, the enzymes required to synthesize the substrate of EarP, dTDP-β-L-rhamnose, are well distributed throughout the bacterial tree of life [7] as rhamnose is a common component of bacterial cell walls [33]. Two, EarP appears to be relatively promiscuous with regard to its EF-P substrates; EF-P which co-occur with EarP exhibit a relatively high degree of heterogeneity in the sequence of the conserved loop [20,34]. EarP can tolerate several substitutions in this region and still successfully produce rhamnosylated EF-P [20]. Three, there are genomic clues that the transfer of EarP into a new species may lead to off-target effects. Specifically, the horizontal transfer of EarP into the phylum Thermotogota was associated with loss of well-conserved polyproline motifs, a potential sign of global EF-P dysfunction [29].
Once we decided to focus on EarP, we next made a phylogenetic tree of EF-P sequences across the bacterial tree of life in order to find EF-P that were not natively rhamnosylated yet potentially could be. These EF-Ps should encode an arginine at the tip of the conserved loop in domain I of EF-P but should not co-occur with EarP. These analyses are complicated by the fact that while unmodified EF-P have been verified experimentally [5,27,28], an unmodified EF-P and an EF-P modified by an unknown pathway are impossible to differentiate with purely genomic data. In our dataset of 3588 EF-P from across 30 phyla, we identified 741 EF-P which encoded an arginine at the conserved position (Figure 2). We eliminated 535 of these EF-P as potential targets as they either co-occurred with EarP (291 proteins) or were identified as the paralogous EF-P like protein EfpL, which we did not initially consider as a valid target (244 proteins) [5]. This left a pool of 206 arginine-type EF-Ps across 14 distinct phyla that had no known post-translational modification system. From this pool, we selected 8 EF-P for further study, prioritizing varied sequence composition in the loop region and phylogenetic diversity (Table 1, and annotated on the tree in Figure 2). Two of these EF-P, those from G. ferrihydriticus and N. communis, co-occurred with another EF-P while all others were the sole encoded EF-P of the genome.
Table 1) Table with further details on each species and the exact EF-P loop sequence of each EF-P. The first two entries are two examples of the known arginine EF-P variants for comparison. The conserved arginine residue that is modified by rhamnosylation in EarP type EF-Ps is in bold.
New arginine-type EF-P variants bypass the need for post-translational activation
To analyze our target EF-P, we first needed to ensure that they could be expressed and were functionally active without an accompanying modification system in our model organism E. coli. We confirmed successful expression through mass spectrometry (Figure 5 & Supplemental Figure 3) for 6 of the 8 EF-P. Notably, despite several tactics, we could not express the EF-P from the Verrucomicrobia A. muciniphila, nor the Firmicute C. inoculum. Next, we screened the remaining EF-P for activity in E. coli using two different methods.
First, we tested the ability of these EF-P to restore a normal growth phenotype to an E. coli mutant strain deficient in polyproline synthesis. Specifically, we used the E. coli double knockout strain DefpDuup, which is missing the primary EF-P of E. coli, as well as the backup polyproline synthesis system Uup [35–37]. This strain has a more pronounced phenotype compared to the E. coli single knockout strain Defp (doubling time of 48.9 minutes for DefpDuup compared to 27.3 minutes in Defp) [5]. We trans-complemented this strain with our six remaining EF-P and measured colony size of all combinations after 18 hours of growth at 37° C (Figure 3). Strikingly, the EF-P from Deinococcus radiodurans (EF-PDera) and Nitrosomonas communis (EF-PNico) replaced EF-PEco nearly to wildtype levels, restoring colony size on average to 80.1% and 71.4% of the EF-PEco trans-complementation, respectively.
Next, we screened our heterologously expressed EF-Ps specifically for activity in polyproline synthesis in a Δefp E. coli BW25113 background using our well described in vivo luminescent reporter (Figure 4) [5,38]. With this assay, we can test the “strength” of specific amino acid motifs by linking translational pausing to luminescence (further details in Methods). In practice, this means the stronger the detected luminescence, the stronger the translational pause caused by a particular combination of amino acids. We tested the ability of each EF‑P to rescue ribosomal pausing at a diverse set of motifs: a strong stalling motif featuring three consecutive prolines (LPPP), a weak stalling motif featuring two consecutive prolines (TPPH), and a control motif featuring a single proline which does not cause any stalling (RPDG) [5]. The results of these in vivo assays echoed our findings in the colony size complementation assay; three EF-P were hardly functional (those from H. aurantiacus, M. prima, and D. acetiphilus) and three could rescue polyproline synthesis in E. coli (those from D. radiodurans, N. communis, and G. ferrihydriticus). Interestingly, for the TPPH stalling motif, EF-PDera and EF-PNico performed significantly better than EF-PEco (P < 0.05, T test). However, these same EF-Ps performed poorly when tested against the LPPP motif (Figure 4). Variance in EF-P performance across different amino acid motifs has been observed in other EF-P subtypes [5,27]. For example, while the primary EF-P of E. coli rescues ribosomal stalling at most motifs, a subset of motifs are rescued more efficiently by the EF-P paralog, EfpL [5]. Similarly, unmodified lysine EF-Ps from the PGKGP subgroup can restore most, but not all polyproline synthesis when expressed heterologously in E. coli [27]. These observations suggest that differences in the loop region among EF-P subtypes leads to functional variability, particularly in their capacity to reposition specific tRNA combinations within the peptidyl transferase center.
Generally speaking, we found that three EF-P functioned well in E. coli (EF-PDera, EF-PNico, and EF-PGefe), and three struggled to function (those from H. aurantiacus, M. prima, and D. acetiphilus; EF-PHeau, EF-PMepr, and EF-PDeac). In some cases, these latter EF-Ps even had a detrimental impact on polyproline synthesis (see Figure 4, LPPP). These findings imply that either these EF-Ps 1) need an unknown modification to function properly, similar to the detrimental effect of un-rhamnosylated EF-P from P. putida and S. oneidensis on E. coli [20], or that 2) these EF-P are simply too distantly related to function effectively in E. coli. We found stronger support for the second possibility: the functional performance of our foreign EF-P in E. coli was generally inversely correlated with their phylogenetic distance from E. coli (Supplemental Figure 1). In other words, EF-P homologs from closely related species, particularly other Gammaproteobacteria such as S. oneidensis and N. communis, tended to function more effectively in E. coli than those from more distantly related organisms like M. prima and H. aurantiacus. This pattern mirrors findings from a similar study involving heterologous replacement of EF-Tu, where functionality in E. coli was typically limited to homologs from within the Gammaproteobacteria [39].
However, there were two clear exceptions to this general trend. In the first, we found that despite the substantial phylogenetic distance between D. radiodurans and E. coli (Supplemental Figure 1), EF-PDera consistently performed well in our assays. One possible explanation for these results is that EF-PDera may have been acquired via horizontal gene transfer from a lineage more closely related to E. coli, which could explain its higher level of function in our host background and its relatively high amino acid percent identity to EF-PEco (44.9%, Supplemental Figure 2). In the second, we found that the link between phylogenetic distance and EF-P function broke down in the specific context of our TPPH motif rescue experiments (Supplemental Figure 1). Here, we found no correlation between EF-P function and phylogenetic distance (P = 0.915, cor = -0.046, Pearson’s correlation). This divergence from the broader pattern was largely driven by the strong performance of EF-PDera and the relatively good performance of distantly related EF-P (EF-PHeau and EF-PMepr). These findings suggest that, although phylogenetic distance can be a useful predictor of heterologous EF-P functionality in E. coli, specific tRNA within the peptidyl transferase center can override these general trends and produce more nuanced, context-dependent outcomes.
Promiscuous EarP activity targets the non-cognate EF-P from M. prima
Next, we wanted to know whether our six target EF-P could be post-translationally modified by EarP. Here, we aimed to simulate complications resulting from the horizontal transfer of a PTM system into a new physiological background. Therefore, we purified each EF-P after co-expression with EarP (from Shewanella oneidensis, EarPShon) in a wildtype LMG194 E. coli background and measured the molecular weight of these EF-P with mass spectrometry. We excluded the EF-P from D. acetiphilus from these experiments, as it consistently failed to rescue polyproline synthesis in E. coli (Figure 4). As a control, we also included the EarP-type EF-P from Pseudomonas putida (EF-PPpu) to test the efficiency of cross-species post-translational modification with EarPShon.
We found that the molecular weight of four of our target EF-P matched the figure calculated from their sequence and did not vary upon co-expression with EarPShon (Supplemental Figure 3), indicating that no modification occurred. These EF-P included those from D. radiodurans, H. aurantiacus, N. communis, and G. ferrihydriticus. We also found no change in molecular weight upon co-expression of EarPShon for our negative control, EF-PEco. This EF-P encodes a lysine at the position of post-translational modification and therefore cannot be modified by EarP. These results verified that EF-PDera and EF-PNico function well in E. coli without the need for any post-translational modification and imply that these EF-P may also be unmodified in their native host species.
Interestingly, one of our heterologous EF-P showed an altered mass upon co-expression of EarPShon; that of M. prima (EF-PMepr, Figure 5). While the His6-tagged EF-PMepr has an expected monoisotopic mass of 21935.07 Da, upon co-expression of EarPShon a peak appeared at a mass of 22081.15 Da. This corresponds to an increase in mass of 146.08 Da, equivalent to the attachment of a rhamnose moiety (146.06 Da) [7]. We found a similar pattern in our positive control for rhamnosylation—EF-PPpu also showed an increase in mass of roughly 146 Da upon co-expression of EarPShon (Figure 5). However, while EF-PPpu was fully rhamnosylated—as shown by the disappearance of the molecular weight corresponding to its unmodified form (Figure 5)—EF-PMepr was partially rhamnosylated, with a relative abundance of 20.06% compared to that of the unmodified mass.
These results raise the question: why were so few EF-P variants successfully modified by EarPShon, despite prior evidence that EarP can tolerate variation in the EF-P loop region [20,26]? One hypothesis is that evolutionary optimization for rhamnosylation and for unmodified EF-P function impose conflicting selective pressures. Mutations that enhance the efficiency of rhamnosylation or improve the function of the modified EF-P may simultaneously impair the activity of its unmodified counterpart. This tradeoff is supported by the poor performance of unrhamnosylated EF-PShon and EF-PPpu in E. coli [7,20,26], suggesting that these EF-P variants have evolved a dependency on their modification for proper function. In the absence of rhamnosylation, their activity drops dramatically. Together, these findings suggest that EF-P subtypes become functionally locked into their respective post-translational modification (PTM) systems, and that switching between modification pathways or from a natively unmodified to a modified form may incur fitness costs.
Off-target rhamnosylation has diverging impacts on the function of unmodified arginine-type EF-Ps
To assess the impact of the non-native rhamnosylation of EF-PMepr in more detail, we repeated previous experiments with co-expression of EarPShon or an empty control plasmid for EF-PMepr, EF-PEco (negative control), EF-PShon and EF-PPpu (positive controls). For the complementation assay, co-expression of EarPShon and EF-PEco had no impact on colony size, as expected (Figure 6). On the other hand, co-expression of EarPShon greatly enhanced colony size in the EF-PShon and EF-PPpu controls (Figure 6). This is also to be expected, as these EF-P are natively rhamnosylated and can be detrimental to E. coli in their un-rhamnosylated forms [7,20]. Lastly, co-expression with EarPShon significantly affected the EF-PMepr trans-complementation but resulted in only a very slight increase in average colony size (Figure 6).
Next, we repeated the in vivo stalling‐rescue assay with our luminescent reporter. Co‐expression of EarPShon had no significant impact on EF-PEco (Figure 7), but dramatically improved EF-PPpu rescue: unmodified EF-PPpu had a detrimental impact on polyproline synthesis (LPPP = 0.60, TPPH = 0.77), whereas rhamnosylated EF-PPpu performed well (LPPP = 2.48, TPPH = 1.57), consistent with Lassak et al. (2015) [7]. EF-PMepr also gained a modest benefit from EarPShon in the assay with LPPP (from 0.38 → 1.18), but only to levels equivalent to those seen in the Δefp strain (Figure 7). In other words, rhamnosylation of EF-PMepr merely mitigated its deleterious effect and did not enhance activity. These data suggest that adding a non-native rhamnosyl modification disrupts the interaction between EF-PMepr and the E. coli ribosome, possibly due to incompatibility at the peptidyl-transferase center.
Lastly, we sought to explore our proposed explanation for how rhamnosylation affects EF-PMepr on the molecular level, specifically in its interaction with the tRNA at the P-site of the peptidyl transferase center. Due to the lack of experimental structures of EF-PMepr and in particular rhamnosylations, we used structural modeling to suggest and compare the positioning of β3Ωβ4 loop tip regions in the following three variants: unmodified EF-PMepr, rhamnosylated EF-PMepr, and the naturally rhamnosylated EF-P from Pseudomonas aeruginosa (EF-PPaer), in complex with tRNAPro (Figure 8). Our models suggest that while the unmodified EF-PMepr can form several favorable contacts with tRNAPro, these contacts are abolished upon the addition of the rhamnose modification, which introduces a steric clash (Figure 8). By contrast, based on the available high-resolution crystal structure [40] the β3Ωβ4 loop of EF-PPaer is oriented in such a way as to take full advantage of the addition of the rhamnose moiety, leading to additional favorable contacts (Figure 8). While a full proof for these models can only come from respective experimental structures, these observations suggest that rhamnosylation supports EF-P function when co-evolved with the protein structure, as is the case with EF-PPaer. In contrast, the rhamnosylation of EF-PMepr is predicted to disrupt the protein’s function despite a high overall structural conservation among EF-P across species. These results are in line with our experimental data: while EF-PMepr shows some limited activity in polyproline synthesis (Figure 7 TPPH) and provides minor trans-complementation benefits in E. coli (Figure 6), rhamnosylation did not improve its function, but merely reduced detrimental effects (Figure 7 LPPP, RPDG). Altogether, these results support our hypothesis that the beneficial application of rhamnosylation requires adaptation of the critical EF-P loop region, and that simply adding a non-native modification is unlikely to immediately enhance function.
It is only natural that EF-PMepr is adapted to support translation in M. prima, rather than in E. coli. Given the evolutionary distance between these species, it’s likely that the ribosome of M. prima differs structurally from that of E. coli. We have shown that upon interaction with EarPShon, a substantial fraction of EF-PMepr is rhamnosylated and that this modification has a significant impact on the function of EF-PMepr, inasmuch as it can be measured in E. coli. This set of experiments, measuring the impact of EarPShon on EF-PMepr, mimics historical horizontal gene transfer events that actually occurred in the phylum Thermotogota. Long ago, members of the genera Geotoga and Oceanotoga acquired an EarP and an EarP-type EF-P which has since replaced their ancestral form of EF-P [29]. The ancestral EF-P in these genera was likely similar to the EF-P in their sister family, the Kosmotogaceae, to which M. prima belongs. It is therefore reasonable to infer that the initial introduction of an “invading” EarP into the Geotoga and Oceanotoga caused off-target rhamnosylation of their native EF-P, likely with negative functional consequences. These molecular mismatches would help explain the genomic signatures which coincide with this horizontal transfer event [29].
A contrasting illustration of this “invading-gene” scenario comes from the EF-P paralog EfpL, which we initially excluded as a target as it is a “secondary EF-P”, and always paired with an EF-P with a wider motif spectrum [5]. Although it is not natively post-translationally modified in E. coli, we surprisingly found that exposure to heterologously expressed EarP also leads to partial rhamnosylation of EfpL (Supplemental Figure 4). Importantly, in our in silico model, this modification had no detectable effect on EfpL activity in E. coli, presumably as a result of its unique β3Ωβ4 architecture [5] (Supplemental Figure 5). On the one hand, this indicates that, unlike EF-PMepr, the rhamnosylation of EfpL does not compromise its function. On the other hand, EfpL does not gain a benefit from the rhamnose moiety, as it is predicted to be positioned too far from the tRNA for any meaningful interaction (Supplemental Figure 5). To test whether our model could be supported experimentally, we investigated the effect of rhamnosylation of EfpL in our trans-complementation experiment. Once again, we measured colony size of our ΔefpΔuup strain in the presence of earP alone, efpL alone or upon co-expression of both. Expression of efpL increased colony size significantly. In line with our in silico model, we also found that rhamnosylation did not affect EfpL’s complementation efficiency (Supplemental Figure 6). This observation also aligns with the phylogenetic placement of EfpL as a sister clade to the EarP type EF-P (Figure 2) and evolutionary observations inferred from former analyses [20], which suggest that an EfpL-like ancestor was initially tolerant to rhamnosylation. Only later, a shortening of the extended EfpL loop—presumably by a single proline—created new structural contacts and, likely, a new EF-P functionality. This evolutionary change may have enabled the subsequent fixation of EarP-dependent EF-Ps in some lineages. Thus, the fate of an “invading” PTM system can depend not only on potential incompatibility, but also on whether it creates an opportunity for novel function.
Taken together, our findings indicate that the horizontal transfer of PTM machinery can produce complex, host-dependent outcomes: initial interference with host proteins followed by successful integration after a period of adaption in some lineages, such as the Thermotogota, and evolutionary opportunity in others, such as EfpL. This duality adds a rich new dimension to our understanding of HGT and cellular adaptation.