Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Peak emission wavelength and fluorescence lifetime are coupled in far-red, GFP-like fluorescent proteins

  • Laura Canty,

    Roles Formal analysis, Investigation, Methodology, Visualization, Writing – original draft

    Affiliation Department of Biological Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada

  • Santosh Hariharan,

    Roles Formal analysis, Methodology, Software, Writing – review & editing

    Affiliations Department of Biological Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada, Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada

  • Qian Liu,

    Roles Investigation, Methodology, Writing – review & editing

    Affiliation Department of Biological Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada

  • Steven A. Haney,

    Roles Conceptualization, Funding acquisition, Project administration, Writing – review & editing

    Affiliation Department of Oncology and Translational Research, Eli Lilly and Company, Indianapolis, Indiana, United States of America

  • David W. Andrews

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Visualization, Writing – review & editing

    david.andrews@sri.utoronto.ca

    Affiliations Department of Biological Sciences, Sunnybrook Research Institute, Toronto, Ontario, Canada, Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada

Abstract

The discovery and use of fluorescent proteins revolutionized cell biology by allowing the visualization of proteins in living cells. Advances in fluorescent proteins, primarily through genetic engineering, have enabled more advanced analyses, including Förster resonance energy transfer (FRET) and fluorescence lifetime imaging microscopy (FLIM) and the development of genetically encoded fluorescent biosensors. These fluorescence protein-based sensors are highly effective in cells grown in monolayer cultures. However, it is often desirable to use more complex models including tissue explants, organoids, xenografts, and whole animals. These types of samples have poor light penetration owing to high scattering and absorption of light by tissue. Far-red light with a wavelength between 650-900nm is less prone to scatter, and absorption by tissues and can thus penetrate more deeply. Unfortunately, there are few fluorescent proteins in this region of the spectrum, and they have sub-optimal fluorescent properties including low brightness and short fluorescence lifetimes. Understanding the relationships between the amino-acid sequences of far-red fluorescence proteins and their photophysical properties including peak emission wavelengths and fluorescence lifetimes would be useful in the design of new fluorescence proteins for this region of the spectrum. We used both site-directed mutagenesis and gene-shuffling between mScarlet and mCardinal fluorescence proteins to create new variants and assess their properties systematically. We discovered that for far-red, GFP-like proteins the emission maxima and fluorescence lifetime have a strong inverse correlation.

Introduction

Fluorescent proteins are powerful tools that enable the monitoring of gene expression, protein localization, and physiological processes in living cells [13]. Since the successful cloning of green fluorescent protein (GFP) [3], many efforts have been devoted to improving its fluorescent properties including increasing brightness and stability and expanding the color palette[414]. Fluorescent proteins with a red-shifted spectrum are of particular importance for two major reasons. First, they allow multiplexing with other fluorescent proteins in the blue, green, and orange region of the spectrum, enabling simultaneous monitoring of multiple proteins or processes. Second, far-red fluorescence proteins are advantageous for organoid, tissue, and animal imaging. Tissues absorb and scatter light in the visible region of the spectrum thus effectively limiting penetrance to a few microns. By comparison, light with wavelengths between 650–900 nm is better able to penetrate tissues because molecules such as water, oxygenated hemoglobin, and melanin absorb substantially less light in this region of the spectrum [15]. The discovery of other naturally occurring GFP-like proteins with red-shifted spectra, (e.g. red fluorescent protein from the coral Discosoma striada, dsRFP [16] and red fluorescent protein from the sea anemone Entacmaea quadrucolor, eqFP578 [13]) facilitated the expansion of fluorescent proteins into the red and far-red region of the spectrum. However, far-red fluorescent proteins tend to have a low quantum yield meaning they are dimmer, and they tend to have shorter fluorescence lifetimes than blue-shifted fluorescent proteins (Table 1), which reduces their efficacy for many applications including FRET-based assays.

thumbnail
Table 1. Properties of fluorescent proteins in the far-red region of the spectrum.

https://doi.org/10.1371/journal.pone.0208075.t001

There is another class of fluorescent proteins that have emission spectra that fall well into the optical window of tissues (between 650 and 900 nm). These fluorescent proteins are derived from bacterial phytochrome receptors [29] and have a completely different structure than GFP and GFP-like proteins. Members of this class of fluorescent protein have peak emission wavelengths of 670–720 nm [30]. Similar to the far-red fluorescent proteins, these fluorescent proteins have relatively low quantum yields and fluorescence lifetimes between 0.5 and 1 ns (Table 1; [28]). However, for use as acceptors the fluorescence lifetime is not critical and these near-infrared fluorescent proteins are of the appropriate wavelength for use as acceptor proteins for far-red fluorescence proteins in FRET-based biosensors [31,32].

One of the brightest GFP-type fluorescent proteins with a peak emission beyond 650 nm is mCardinal, with excitation and emission maxima at 604nm and 659nm respectively [24] and a brightness, calculated by taking the product of the molecular extinction coefficient and quantum yield, of 16.5. However, compared to mScarlet with an emission maximum of 594 nm, a brightness of 70 and a fluorescence lifetime of 3.9 ns, mCardinal is much dimmer and has a relatively short fluorescence lifetime of 1.3 ns [24]. To date, it is not clear to what extent these properties are linked and what limits the fluorescence lifetime of far-red fluorescence proteins.

Fluorescence lifetime is the average amount of time the donor remains in the excited state before releasing a photon and is characteristic of each fluorophore. Fluorescence lifetime is particularly useful for measuring FRET in live cells and can be used with genetically encoded fluorescent biosensors. FRET results in a decrease in fluorescence lifetime of the donor fluorophore and emission of a photon from the acceptor rather than the donor fluorophore [33]. Both lifetime and intensity measurements have been used to evaluate FRET.

Fluorescence lifetime is a powerful way to measure FRET because it is independent of excitation intensity and fluorophore concentration making quantitative measurements in live cells practical even when the two fluorophores are expressed as separate proteins rather than linked in a single protein [34]. Because the signals are noisy and there can be both bound and unbound proteins in a single pixel it is important for the donor to have as close to a monoexponential decay as possible. Furthermore, an extended fluorescence lifetime of the donor fluorophore results in a larger dynamic range assay [34,35]. For example, mClover, with a fluorescence lifetime of 3.2 ns has been found to be a superior donor for use in FLIM-based assays because it results in FLIM measurements with a better dynamic range relative enhanced GFP (EGFP), with a fluorescent lifetime of only 2.4 ns [35]. In practice a donor with a fluorescence lifetime greater than 3ns is required to distinguish random collisions between colocalized molecules and bona fide protein binding. Therefore, to have optimized FRET-based biosensors for use in FLIM-based assays, a donor with a fluorescence lifetime of at least 3.0 ns is highly desirable. While most fluorescent proteins have a fluorescence lifetime between 0.1 and 4.0 ns [36], the few fluorescent proteins in the far-red region of the spectrum have short fluorescence lifetimes (< 1.5 ns) making them poor candidates for donor fluorophores in FRET-based biosensors for applications using FLIM.

Despite numerous successful improvements in the photophysical properties of fluorescence proteins our understanding of how to systematically modify fluorescent proteins to optimize specific fluorescent properties [13] remains rudimentary. For example, it is known that quantum yield and fluorescence lifetime are highly correlated [36] and rigidifying the chromophore is thought to increases the quantum yield, as well as increasing the fluorescence lifetime [36]. A rigid chromophore tends to improve quantum yield, brightness, and fluorescence lifetime because it reduces non-radiative energy loss resulting from vibrations, rotations, and other conformational changes [36]. These non-radiative means of energy loss are a form of intramolecular quenching which results in decreased brightness and fluorescence lifetime [36]. Thus, preventing non-radiative energy loss results in brighter fluorescent proteins with higher quantum yields and extended fluorescence lifetimes.

Many factors can affect the brightness of a fluorescent protein when it is expressed in cells. These include how well the fluorophore is able to absorb a photon, (the extinction coefficient), the probability that the a photon will be released once the fluorophore is in the excited state (quantum yield), and concentration of the fluorophore, which can be affected by the folding efficiency and stability of the protein. Although increased quantum yield and brightness are highly desirable characteristics they are complicated and time-consuming measurement not compatible with screening large numbers of mutants. Furthermore, fluorescent proteins that appear to be brighter in cells do not necessarily have high quantum yields. One example of this is mScarlet and mScarlet-I. mScarlet does not appear as bright in cells relative to mScarlet-I (313 and 363% brighter than mCherry respectively [17]). However, mScarlet has a higher quantum yield compared to mScarlet-I (0.70 and 0.54, respectively). The apparent difference between relative brightness in cells and the quantum yield is due to the difference in accumulation of the fluorescent proteins resulting from differences in the maturation times. mScarlet-I has an apparent delay in maturation relative to mTurquoise of 0.6 hours and an accumulation of 129% in cells normalized to mCherry, while mScarlet has an apparent delay in maturation relative to mTurquoise of 2.9 hours and an accumulation of 89% in cells normalized to mCherry [17]. The Fluorescence lifetimes for these proteins correlate with the quantum yield as mScarlet has a longer fluorescence lifetime relative to mScarlet-I (3.9 ns and 3.1 ns, respectively [17]). Therefore, we measured fluorescence lifetime as the primary read-out from our assays.

To date, the brightest red fluorescent protein, with the most extended fluorescence lifetime (3.9 ns) is mScarlet [17]. While mScarlet performs well in terms of fluorescence lifetime, it is not ideal for applications in tissue or whole animal imaging because its excitation and emission peaks, at 569 nm and 594nm respectively [17], fall short of the optical window of tissues. However, because mScarlet is unique in its brightness and fluorescence lifetime among other fluorescent proteins in the red to far-red region of the spectrum, it may provide insight into the requirements necessary to improve the brightness in other far-red fluorescent proteins. Other fluorescent proteins in this region of the spectrum may be less informative in this context because they generally do not perform very well.

To better understand the relationship between emission maximum and lifetime of far-red fluorescence proteins we used site-directed mutagenesis of mCardinal and gene shuffling between mCardinal and mScarlet [37,38]. Our rationale for the gene shuffling was to mix two existing fluorescent proteins, one with a long lifetime but shorter emission maximum and the other with a short lifetime but longer emission maximum to find a fluorescent protein with both properties optimized. However, we discovered that there is a strong negative correlation between fluorescence lifetime and emission maximum suggesting there is a spectral limit to what can be achieved using GFP-like proteins.

Methods and materials

Tissue culture

HEK293T cells were used to test the point mutations because the ultimate goal was to use the optimized fluorescent protein in mammalian cells. HEK293T are good model cell line to apply for this purpose because they are particularly easy to grow and transfect. The HEK293T cells [39] were a gift from Frank Graham at McMaster University. Cells were grown in DMEM complete media. Cells were transiently transfected using calcium phosphate [40] for 16 hours. Cells were allowed to grow for another 24 hours after media change before imaging or harvesting. For spectral analysis experiments, cells were grown and transfected in 10 cm dishes. 24 hours after transfection cells were washed three times with PBS and once with hypotonic lysis buffer (1.5 mM MgCl2, 100nM HEPES, pH = 7.4, and protease inhibitors). Cells were incubated in 1 mL hypotonic lysis buffer on ice for 10 mins before homogenization using a Potter homogenizer. The suspension was centrifuged at 200 g for 10 mins at 4°C, and the supernatant was used for spectral analysis on a PTI QuantaMaster Fluorimeter from Horiba using FelixGX software (v. 4).

Mutants

All mCardinal and mKate2 mutants were synthesized by DNA2.0 (Now ATUM; Newark, California) in mammalian expression vectors under a CMV promoter. The mScarlet/mCardinal shuffle library was created by ATUM with products cloned into the pD144-SR expression vector with an IPTG inducible T5 promoter.

Fluorescence lifetime imaging microscopy (FLIM)

Cells were seeded in Cell Carrier 384-Ultra plates (6057308, Perkin Elmer) for imaging. Fluorescence lifetime images were taken as described [41]. Briefly, we used an ISS-Alba FLIM/FCS confocal microscope with a 60x water immersion objective (NA = 1.2) and a PDL 800-D pulsed laser (PicoQuant) at a repetition rate of 20MHz. We used a 445/505/588nm excitation filter and a 628/30 nm emission filter. VistaVision Software was used for acquisition and analysis of the FLIM data.

The gene–shuffle screen

The gene-shuffle library was screened in bacteria because it is faster and easier to screen a large number of colonies and recover the DNA of potential hits in the library in bacteria than it would be using a mammalian cell line. The mScarlet/mCardinal shuffle library was randomly generated as a pooled library [30] by ATUM (Newark, California) in an E. coli expression vector under an IPTG inducible T5 promoter. The library was transformed into T7 shuffle E. coli (C3029, NEB). All colonies arising from the transformation were selected and cultured overnight at 37°C in 96-well Megablocks (83.1972.002, Sarstedt) in the presences of ITPG (IPT002.50, Bioshop). 100 μL of bacterial suspension was transferred to non-binding, 96-well half area assay plates (3881, Corning) for emission scans. Fluorescent emission was scanned from 620–750 nm on a TECAN infinity M1000pro fluorescence plate reader with excitation at 600 nm. Plasmid DNA was recovered and sequenced for all bacteria expressing proteins with fluorescent intensity > 1000 A.U. between 620 and 750 nm with excitation at 600 nm (very close to the excitation maximum of mCardinal at 604 nm). These mutants were also analyzed for fluorescence lifetime, and full excitation and emission scans were recorded.

Excitation and emission scans

Excitation and emission scans were performed on PTI QuantaMaster Fluorimeter from Horiba using FelixGX software (v. 4). 300 μL of cell lysate or bacterial suspension were placed in special optical glass cuvettes with 4 mm path length (23–4.45-SOG-4, Starna Cell, Inc.). Measurements were taken at 25°C. Measurements were taken at 1 nm intervals with a 1 second integration time.

Lifetime measurement

Decay curves were obtained on a PTI QuantaMaster Fluorimeter from Horiba using FelixGX software (v. 4) and a whitelase SC450 pulsed laser using a repetition rate of 20MHz. 300 μL of cell lysate or bacterial suspension were placed in special optical glass cuvettes with 4mm path length (23–4.45-SOG-4, Starna Cell, Inc.). Measurements were taken at 25°C until peak photon count reached 10,000. Lifetimes were calculated from the decay cures using the data analysis package from FelixGX. All fluorescence lifetimes were calculated by fitting a mono-exponential decay. Selected decay curves are shown in (S3 Fig).

Random forest regression

We performed a Random Forest regression [42] to determine residues that are important for determining emission wavelength or fluorescence lifetime. The individual positions were used as features while the amino acid in each position was used as a categorical variable. Importance values were scaled as a proportion of the maximum value. For this analysis, we used only the protein sequences generated from the gene-shuffled library. We aligned the sequences of all the unique fluorescent proteins and used only the positions where the amino acids in mCardinal and mScarlet were different. Because this library was generated via gene shuffling, most loci had one of two amino acids (either the amino acid from mCardinal or the amino acid from mScarlet); however, there were a few loci that also included novel amino acids (i.e. not found in mCardinal or mScarlet). This appears to be a result of a mid-codon shuffling event resulting in a third amino acid (S1 Table).

Many of the residues that differ between mCardinal and mScarlet are in unstructured loop regions or on β-strands with their sidechains facing outwards and thus away from the chromophore making it difficult to predict how they might affect the fluorescent properties of the fluorescence protein (S4 Fig). However, it is possible that these residues are important for structural aspects of the protein, including overall rigidity that could potentially impact the fluorescent protein. Thus, we included all residues that differed between mCardinal and mScarlet in the analysis. For measuring the model performance, we used standard leave one out analyses where, in every run, one sequence was left out for validation while the rest of the sequences were used for generating the regression model.

Results and discussion

Site-directed mutagenesis of mCardinal

We made 14 single site-directed mutants of mCardinal aimed at extending the fluorescence lifetime by modifying residues that were close to the chromophore and likely to impact fluorescence lifetime (S1 Fig). We based these mutations on mutations that had been shown to increase fluorescence lifetime in other fluorescent proteins, by comparing the amino acid sequence between mCardinal and other fluorescent proteins with longer fluorescence lifetimes and selecting residues expected to stabilize the chromophore or facilitate hydrogen bonding with the chromophore. Of these mutants, nine (mCardinal-W144I, W144L, T147H, T150Y, C162T, C162N, A165R, R201T, and 201H) had no effect on fluorescence lifetime, two of the mutants, mCardinal-T147N and mCardinal-R201Y reduced the fluorescence lifetime to 0.9 ns and 0.95 ns respectively while one mutant, mCardinal-C162Y, was no longer fluorescent (Fig 1A). Two mutants, mCardinal-Y121N, and mCardinal-Y121Q increased the fluorescence lifetime to 2.7 ns and 2 ns, respectively (Fig 1A). However, these mutants also had a 64 nm blue-shift in their spectra (Fig 1C, S2 Fig).

thumbnail
Fig 1.

A Fluorescence lifetime measurements of mutants from site-directed mutagenesis of mCardinal measured in HEK393T cells by FLIM. B Fluorescence lifetime measurement of mutants from site-directed mutagenesis of mCardinal and mKate2 (in HEK293T cell lysates) and SL72 from the mScarlet/mCardinal shuffle library (in bacterial suspension) measured on the PTI fluorimeter. C Excitation and D Emission spectra of fluorescence proteins with the longest fluorescence lifetimes in HEK293T cell lysates, made by site-directed mutagenesis and SL72 from the mCardinal/mScarlet Shuffle library (measured in bacterial suspension).

https://doi.org/10.1371/journal.pone.0208075.g001

The above mutagenesis was based on insights from fluorescent proteins including mCerulean3 [4] and mTurquoise [43], which are substantially blue-shifted relative to mCardinal. In an attempt to minimize the blue-shift, we focused on fluorescent proteins with extended fluorescence lifetimes that that were closer in the spectrum to mCardinal.

mScarlet is a very bright, red fluorescence protein with peak excitation and emission wavelength at 569 and 594 nm and fluorescence lifetime of 3.9 ns [17]. Based on the residues hypothesized by Bindels et al., to be important for the extended lifetime in mScarlet, we made the mCardinal-PRYSS (T64P, K71R, F84Y, T147S, A221S) mutant. While this mutant did have an extended lifetime (2.7 ns; Fig 1B) relative to wildtype mCardinal, it had a 139 nm blue-shift in the emission spectrum (Fig 1C; S2 Fig).

Site-directed mutagenesis of mKate2

Kate2 has a peak excitation and emission wavelengths of 588 and 633 nm [44] and a fluorescence lifetime of 2.5 ns [17], as well as high sequence similarity to mCardinal [44] meaning that there are very few residues that could potentially account for the difference in fluorescence lifetime. Of the residues that differ, residue 45 (S1 Fig) stood out as being potentially important for at least one of these properties because it is close to the chromophore and there is a polar residue (glutamine) in mCardinal versus a hydrophobic residue (methionine) in mKate2. We hypothesized that a change in hydrophobicity close to the chromophore could be responsible for the differences in spectra and/or fluorescent lifetime found between mKate2 and mCardinal. Thus, we mutated the methionine at residue 45 in mKate2 to glutamine (mKate2-M45Q) as found in mCardinal at this position to see the effect this residue has on the emission maximum and/or fluorescent lifetime. Surprisingly, this mutation resulted in a 35 nm blue-shift relative to wildtype mKate2 (Fig 1C; S2 Fig) as well as an increase in fluorescence lifetime to 3.8 ns (Fig 1B).

Together with the data from the site-directed mutagenesis in mCardinal, our results suggest that we cannot easily apply the same mutations that resulted in an increase in fluorescence lifetime for other fluorescent proteins to fluorescent proteins in the far-red region on the spectrum. Even though the structures are very similar across the GFP-like proteins, the primary sequence can differ substantially, and specific mutations may not have the same outcome in a different environmental context created by a difference in primary sequence elsewhere in the protein.

One mechanism that could explain the blue-shift in the spectrum is that the mutations act to increase the energy gap between the ground state and the excited state of the chromophore. This would be likely in mutations that resulted in a moderate blue-shift in the spectrum. For example, the methionine at position 45 in mKate2 and the tyrosine at position 121 in mCardinal are both very close to the chromophore (S1 Fig) and could interact with it directly, thus mutating these residues to glutamine and asparagine, respectively, could alter the energy required reach the excited state and thus account for the 38 and 64 nm blue-shifts in the spectrum of mKate2-M45Q and mCardinal-Y121N mutants resulting from a single, non-conservative amino acid substitution.

The structure the chromophore can greatly impact the spectral properties of the fluorescent protein [45]. The mCardinal-PRYSS mutant is highly blue shifted but this is almost certainly due to a change in chromophore structure. That is, this variant forms a GFP-type chromophore rather than a DsRed-type chromophore resulting in the extreme blue-shift in the spectrum of 139 nm. Thus, another possible explanation for the moderate to large blue-shifts observed in the mutants is that some of the mutations we introduced affected the maturation of the chromophore resulting in GFP-like chromophore structures rather than the DsRed structure. Further, as described in Bindels et al., 2017, many residues in different regions of the protein were thought to be necessary for the extended fluorescence lifetime seen in mScarlet. The same could also be true for residues responsible for the spectral properties of other far-red fluorescent proteins. For these reasons, we moved away from a site-directed approach and towards a random approach.

Gene-shuffling between mCardinal and mScarlet

We screened a library of fluorescent proteins generated via gene-shuffling between mScarlet and mCardinal. We performed gene-shuffling because it has been proposed that this method covers a larger phenotypic distribution versus random mutagenesis of mCardinal or mScarlet alone [38]. We chose to shuffle mScarlet and mCardinal because mScarlet has the longest fluorescence lifetime of the red fluorescent proteins, and mCardinal has an emission maximum that falls within the optical window while still being relatively bright. Of approximately 10,000 proteins screened from the gene-shuffle library, many were not fluorescent when excited with light at 600 nm to select for far-red fluorescent proteins. We selected all the proteins that had a fluorescence intensity of at least 1000 A.U. at some wavelength between 620 and 700nm for further analysis. From this subset of proteins, we identified both mScarlet and mCardinal, along with 81 unique fluorescent proteins with peak emission wavelengths and fluorescent lifetimes that fell between mScarlet and mCardinal (S1 Table). The most red-shifted protein with fluorescence lifetime approaching 3.0 ns (SL72 highlighted in S1 Table) has a peak excitation and emission wavelengths of 578 and 621 nm (Fig 1C; S2 Fig) and a fluorescence lifetime of 2.8 ns (Fig 1B). However, this protein is not as red-shifted as mKate2 with excitation and emission maxima at 588 and 633 nm and has only a very slight increase in fluorescence lifetime from 2.5 ns for mKate2 to 2.8 ns for SL72.

Interestingly, when we plotted peak emission wavelength against fluorescence lifetime (Fig 2), we observed a strong and significant (p < 0.0001) negative correlation (R2 = 0.6998) between the peak emission wavelength of the fluorescent proteins and the fluorescent lifetime suggesting that peak emission wavelength and fluorescence lifetime cannot be independently manipulated. Another surprising finding was that we did not find any new fluorescent proteins with peak emission wavelength greater than 625 nm (pink circles in Fig 2; S1 Table). We had expected a continuous distribution of peak emission wavelength between mScarlet (594 nm) and mCardinal (659 nm). The fact that we did not observe any novel proteins with peak emission wavelengths beyond 625 nm suggests that we are approaching the spectral limit of this fluorophore. While it is possible that there were fluorescent proteins in the gene-shuffle library with peak emission wavelengths beyond 659 nm that did not meet our fluorescence intensity cut-off of 1000 A.U., this still represents a potential limit in the spectrum beyond which the fluorescence is not bright enough to be useful. Thus, a very specific set of properties at particular residues are likely required to have a detectable fluorescence emission maximum beyond 625 nm. We speculate that even minor changes to these residues could result in a blue-shift in the spectrum or complete loss of fluorescence. This hypothesis would also explain the unexpected results we obtained from the site-directed mutagenesis we performed.

thumbnail
Fig 2. Fluorescence lifetime plotted against peak emission wavelength for all unique fluorescent proteins generated either by site-directed mutagenesis or from the mScarlet/mCardinal shuffle library.

https://doi.org/10.1371/journal.pone.0208075.g002

Random forest regression

To address these hypotheses further, we used random forest regression [42] using the gene-shuffling data to test the null hypothesis and identify regions of the protein that might be important for wavelength or fluorescence lifetime separately. This analysis predicts regions of the protein that are important but has no directionality (Fig 3). Reassuringly, there were residues that were predicted by the model that were also independently identified by Bindels et al., 2017, and experimentally determined by us as being important for fluorescence lifetime. However, the overall conclusion from this analysis is that most regions predicted to be important for fluorescence lifetime were also predicted to be important for emission wavelength.

thumbnail
Fig 3. Residue importance for determining fluorescence lifetime (grey bars) or on peak emission wavelength (black bars) scaled as a proportion of the maximum importance for each property.

Positive numbers indicate that the residue is of relative importance and negative values mean the residue is not important. Numbers on the x-axis correspond to residue number in mCardinal. The secondary structure of the protein is illustrated along the bottom of the graph. Residues 58–79 make up the alpha helix and loop that hold the chromophore (Red pentagon) inside the β-barrel. Numbers highlighted in red indicated those residues on the β-sheet or loop regions that face inside the β-barrel and thus, could interact with the chromophore. Red arrows indicate residues that were independently selected for the site-directed mutagenesis in mCardinal or mKate2.

https://doi.org/10.1371/journal.pone.0208075.g003

Residues important for fluorescence lifetime.

Residues 157–184 were identified in our model as candidates for being important for only fluorescence lifetime. Interestingly, these residues make up two adjacent β-sheets. Unfortunately, only three residues in this region had been independently predicted to be important for fluorescence lifetime either by our group or Bindels et al. [17]. Residue 178 was identified by Bindels et al. [17] and we independently selected residues T150, and C162 for mutagenesis in mCardinal (Fig 3). Unfortunately, the point mutations, T150Y, C162N, and C162T in mCardinal did not result in a change in fluorescence lifetime, and the C162Y mutation in mCardinal resulted in a complete loss of fluorescence (Fig 1A). However, these results do not mean the model was wrong. Given that this model was based on the mScarlet/mCardinal shuffle data and these residues were mutated to amino acids that are not found in mScarlet or mCardinal at these positions it is possible that we did not select the correct amino acids to elicit a change in fluorescence lifetime. However, our results suggest that if further mutagenesis of this area were carried out we would likely only find amino acids with negative effects on fluorescent properties and so it was not pursued further.

Residues important for emission spectra.

There was only one region that was predicted as being important only for determining the emission maxima. That sequence included residues 22–32 close to the N-terminus of the protein with a relatively low probability of affecting emission wavelength. In our screen, we only found two novel fluorescent proteins had the amino acids from mCardinal in this region, and they both had emission maxima above 605 nm (S1 Table). However, there were many other fluorescent proteins with emission maxima 605 nm and greater, which suggested there may be other regions in the protein that are important for red-shifting the spectrum, including residues at positions 83 and 84, although, these residues are also predicted to be important for fluorescence lifetime. Thus, our data and model suggest that with the chromophore in these fluorescence proteins it may not be possible to have a far-red fluorescent protein with an extended fluorescence lifetime.

Notably, none of the novel fluorescent protein from the gene-shuffle library had an emission maximum that came close to mCardinal, and very few fluorescent proteins had residues from mCardinal in the N-terminal region particularly from residue 22–51, which includes the region identified by the model as being important for emission. To see if this entire region could account for the extra red-shift seen in mCardinal, we replaced the residues 22–51 in SL72 with the residues found in mCardinal and found that this mutation resulted in a complete loss of fluorescence (data not shown). This data supports the hypothesis that we are hitting the spectral limit of this fluorescent protein and it suggests that it is only possible to achieve the emission maximum seen in mCardinal under a specific context dependent on the amino acids present across multiple regions of the protein.

Residues important for both emission peak and fluorescence lifetime.

Some of the remaining residues were predicted to be important for both the emission spectra and for fluorescence lifetime. One region of note was from residue 71 to residue 77. These residues make the α-helix that holds the chromophore inside the β-barrel. Because of their proximity to the chromophore, it is not surprising that this region is important in determining the spectral and photophysical properties such as fluorescence lifetime.

Three of the residues in this region had been predicted by Bindels et al., as being important for the extended fluorescence lifetime of mScarlet [17]. As mentioned previously, we mutated all three of these amino acids in mCardinal along with two other residues that were identified as being important for the extended lifetime of mScarlet to make the mCardinal-PRYSS mutant (S1 Fig). This mutant had an increase in fluorescence lifetime to 2.7 ns but also had a 64 nm blue-shift in the spectrum–resulting in a protein even more blue-shifted than mScarlet (Fig 1B and 1C, S2 Fig). These results support the prediction by the model that this region is important for both fluorescence lifetime and spectral properties.

The reason far-red fluorescent proteins have short fluorescence lifetimes is most likely related to the relatively small energy gap between the excited and the ground state. As stated previously, mutations near the chromophore may result in an increase in the energy gap between the excited state and the ground state, resulting in the smaller blue-shifts in the spectrum. This mechanism could explain the observation that the mutations we tested that successfully increased the lifetime also resulted in a blue-shift in the spectrum.

Alternatively, it is also possible that the some of the spectral properties in fluorescent proteins resulting from the shuffle library resulted from a change in the structure of the chromophore from the DsRed structure to GFP-like structure. Consistent with this hypothesis there are eight amino acids (residues 17, 45, 64, 65, 71, 147, 162, and 174 mapped to mCardinal; S1 Table residues with an asterisk) that differ between mScarlet and mCardinal and are very close to the chromophore (S4 Fig). These residues, when mutated, are the most likely candidates to elicit spectral changes due to alteration of the chromophore structure. Furthermore, none of the novel fluorescent proteins in the shuffle library contain the residues from mCardinal at all eight of the positions that are near the chromophore and listed above. This could suggest the structure of the DsRed chromophore is highly dependent on the surrounding amino acids and changes to some of these key amino acids may result in GFP-like structure rather than the DsRed structure, which is important for the extreme red-shift seen in mCardinal. However, further studies are needed to fully test this hypothesis and investigate the extent to which these mutations are affecting the structure of the chromophore.

Conclusions

Our results suggest that we have identified the spectral limit of the chromophore in current far-red fluorescence proteins. As a result, amino acid sequences that result in peak emission wavelengths beyond 625 nm will be rare and will have short fluorescence lifetimes. We also hypothesize that very red-shifted fluorescent proteins, like mKate2 and mCardinal, are inherently more sensitive to spectral changes when mutations aimed at improving brightness or fluorescence lifetime are introduced even though similar mutations have been shown to have no impact on emission in more blue-shifted fluorescent proteins[4,17,44]. We, therefore, conclude that for FRET-based sensors mScarlet or mKate2 are the proteins of choice even though their emission maxima are not sufficiently red-shifted for use in animals. However, we predict that they will have reasonable properties for use in 3D tissue cultures. For imaging studies in which only localization is desired it will be important to counterbalance the further red emission offered by mCardinal against the much brighter mScarlet.

Supporting information

S1 Fig. Structural representation of mCardinal or mKate2 depicted using PyMOL [46] to visualize location of residues mutated in the site-directed mutagenesis (A) in the mCardianal-PRYSS mutant (B) shown in mCardinal (PDB ID: 4OQW) and M45 in mKate2 (C) shown on mKate (PDB ID: 4OQW).

Note that the residues that are on the inner α-Helix are not shown for A and C, and a couple of β-strands have been removed from B for better visualization of the highlighted residues.

https://doi.org/10.1371/journal.pone.0208075.s001

(TIF)

S2 Fig. Peak-normalized excitation and emission spectra of the mCardinal (A), mCardinal-Y121N(B), mCardinal-PRYSS(C), mKate2 (D), mKate2-M54Q (E), mScarlet (F), and SL72 from the mScarlet/mCardinal shuffle library(G).

https://doi.org/10.1371/journal.pone.0208075.s002

(TIF)

S3 Fig. Example decay curves of select fluorescent proteins, either expressed in cell lysates (mCardinal and mKate2 mutants) or E. coli (mScarlet and SL72), taken on the PTI.

https://doi.org/10.1371/journal.pone.0208075.s003

(TIF)

S4 Fig. Structural representation of mCardinal with amino acids that differ from mScarlet depicted using PyMOL [46].

All amino acids that are on the β-strands that facing outward away from the chromophore (A), inward toward the chromophore (B), or the nine closest inward facing amino acids (C). All residues are shown on mCardinal (PDB ID: 4OQW). Note that the residues that are on the inner α-Helix are not shown. D The two amino acids that are from mCardinal and close to the chromophore in SL72. These residues are shown on mScarlet (PDB ID: 5LK4) with the mCardinal residues highlighted because SL72 is more similar to mScarlet than mCardinal.

https://doi.org/10.1371/journal.pone.0208075.s004

(TIF)

S1 Table. Amino acid sequence alignment of unique sequences from the mScarlet/mCardinal shuffle library with their respective excitation/emission maxima and fluorescence lifetime.

Residue number is based on the mCardinal sequence and is indicated at the top of the chart with green numbers indicating residues that make up the inner α-helix, red numbers being residues in the β-barrel that point inward towards the chromophore and black numbers are residues in the β-barrel that point outward and away from the chromophore. Numbers with an asterisk are those residues whose sidechain is close to the chromophore and could potentially interact directly. Only residues that are different between mCardinal and mScarlet are shown. Those residues that are the same as mCardinal are highlighted in blue, and those residues that are in mScarlet are highlighted in Red. Residues that are not found in mCardinal or mScarlet are not highlighted (white background).

https://doi.org/10.1371/journal.pone.0208075.s005

(XLSX)

References

  1. 1. Newman RH, Fosbrink MD, Zhang J. Genetically encodable fluorescent biosensors for tracking signaling dynamics in living cells. Chem Rev. 2011;111: 3614–3666. pmid:21456512
  2. 2. Miyawaki A. Development of probes for cellular functions using fluorescent proteins and fluorescence resonance energy transfer. Annu Rev Biochem. 2011;80: 357–373. pmid:21529159
  3. 3. Chalfie M, Tu Y, Euskirchen G, Ward W, Prasher D. Green fluorescent protein as a marker for gene expression. Science (80-). 1994;263: 803–805.
  4. 4. Markwardt ML, Kremers GJ, Kraft CA, Ray K, Cranfill PJC, Wilson KA, et al. An improved cerulean fluorescent protein with enhanced brightness and reduced reversible photoswitching. PLoS One. 2011;6. pmid:21479270
  5. 5. Heim R, Tsien RY. Engineering green fluorescent protein for improved brightness, longer wavelengths and fluorescence resonance energy transfer. Curr Biol. 1996;6: 178–182. pmid:8673464
  6. 6. Bajar BT, Wang ES, Lam AJ, Kim BB, Jacobs CL, Howe ES, et al. Improving brightness and photostability of green and red fluorescent proteins for live cell imaging and FRET reporting. Sci Rep. Nature Publishing Group; 2016;6: 20889. pmid:26879144
  7. 7. Shaner NC, Lin MZ, McKeown MR, Steinbach PA, Hazelwood KL, Davidson MW, et al. Improving the photostability of bright monomeric orange and red fluorescent proteins. Nat Methods. 2008;5: 545–551. pmid:18454154
  8. 8. Kremers G-J, Goedhart J, van Munster EB, Gadella TWJ. Cyan and Yellow Super Fluorescent Proteins with Improved Brightness, Protein Folding, and FRET Förster Radius. Biochemistry. 2006;45: 6570–6580. pmid:16716067
  9. 9. Pandelieva AT, Baran MJ, Calderini GF, McCann JL, Tremblay V, Sarvan S, et al. Brighter Red Fluorescent Proteins by Rational Design of Triple-Decker Motif. ACS Chem Biol. 2016;11: 508–517. pmid:26697759
  10. 10. Zhao Y, Araki S, Wu J, Teramoto T, Chang Y-F, Nakano M, et al. An Expanded Palette of Genetically Encoded Ca2+ Indicators. Science (80-). 2011;333: 1888–1891. pmid:21903779
  11. 11. Day RN, Davidson MW. The fluorescent protein palette: tools for cellular imaging. Chem Soc Rev. 2009;38: 2887. pmid:19771335
  12. 12. Rodriguez EA, Campbell RE, Lin JY, Lin MZ, Miyawaki A, Palmer AE, et al. The Growing and Glowing Toolbox of Fluorescent and Photoactive Proteins. Trends in Biochemical Sciences. 2017. pmid:27814948
  13. 13. Hense A, Nienhaus K, Nienhaus GU. Exploring color tuning strategies in red fluorescent proteins. Photochem Photobiol Sci. Royal Society of Chemistry; 2015;14: 200–212. pmid:25597270
  14. 14. Eason MG, Damry AM, Chica RA. Structure-guided rational design of red fluorescent proteins: towards designer genetically-encoded fluorophores. Curr Opin Struct Biol. 2017;45: 91–99. pmid:28038355
  15. 15. Konig K. Multiphoton microscopy in life sciences. J Microsc. 2000;200: 83–104. pmid:11106949
  16. 16. Baird GS, Zacharias DA, Tsien RY. Biochemistry, mutagenesis, and oligomerization of DsRed, a red fluorescent protein from coral. Proc Natl Acad Sci. 2000;97: 11984–11989. pmid:11050229
  17. 17. Bindels DS, Haarbosch L, van Weeren L, Postma M, Wiese KE, Mastop M, et al. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nature Methods. Nature Publishing Group; 2017; 1–12. pmid:27869816
  18. 18. Shcherbo D, Murphy CS, Ermakova G V., Solovieva EA, Chepurnykh T V., Shcheglov AS, et al. Far-red fluorescent tags for protein imaging in living tissues. Biochem J. 2009;418: 567–574. pmid:19143658
  19. 19. Wang L, Jackson WC, Steinbach PA, Tsien RY. Evolution of new nonantibody proteins via iterative somatic hypermutation. Proc Natl Acad Sci. 2004;101: 16745–16749. pmid:15556995
  20. 20. Shu X, Royant A, Lin MZ, Aguilera TA, Lev-Ram V, Steinbach PA, et al. Mammalian Expression of Infrared Fluorescent Proteins Engineered from a Bacterial Phytochrome. Science (80-). 2009;324: 804–807. Available: http://science.sciencemag.org/content/324/5928/804.abstract
  21. 21. Shcherbo D, Shemiakina II, Ryabova A V, Luker KE, Schmidt BT, Souslova EA, et al. Near-infrared fluorescent proteins. Nat Methods. 2010;7: 827–829. pmid:20818379
  22. 22. Morozova KS, Piatkevich KD, Gould TJ, Zhang J, Bewersdorf J, Verkhusha V V. Far-red fluorescent protein excitable with red lasers for flow cytometry and superresolution STED nanoscopy. Biophys J. Biophysical Society; 2010;99: L13–L15. pmid:20643047
  23. 23. Bajar BT, Lam AJ, Badiee RK, Oh YH, Chu J, Zhou XX, et al. Fluorescent indicators for simultaneous reporting of all four cell cycle phases. Nat Methods. 2016;13: 993–996. pmid:27798610
  24. 24. Chu J, Haynes RD, Corbel SY, Li P, González-González E, Burg JS, et al. Non-invasive intravital imaging of cellular differentiation with a bright red-excitable fluorescent protein. Nat Methods. 2014;11: 572–8. pmid:24633408
  25. 25. Matela G, Gao P, Guigas G, Eckert AF, Nienhaus K, Ulrich Nienhaus G. A far-red emitting fluorescent marker protein, mGarnet2, for microscopy and STED nanoscopy. Chem Commun. Royal Society of Chemistry; 2017;53: 979–982. pmid:28044150
  26. 26. Piatkevich KD, Subach F V, Verkhusha V V. Far-red light photoactivatable near-infrared fluorescent proteins engineered from a bacterial phytochrome. Nat Commun. Nature Publishing Group; 2013;4: 2153. pmid:23842578
  27. 27. Shcherbakova DM, Baloban M, Verkhusha V V. Near-infrared fluorescent proteins engineered from bacterial phytochromes. Current Opinion in Chemical Biology. 2015. pmid:26115447
  28. 28. Rice WL, Shcherbakova DM, Verkhusha V V., Kumar ATN. In vivo tomographic imaging of deep-seated cancer using fluorescence lifetime contrast. Cancer Res. 2015;75: 1236–1243. pmid:25670171
  29. 29. Filonov GS, Piatkevich KD, Ting LM, Zhang J, Kim K, Verkhusha V V. Bright and stable near-infrared fluorescent protein for in vivo imaging. Nat Biotechnol. Nature Publishing Group; 2011;29: 757–761. pmid:21765402
  30. 30. Shcherbakova DM, Verkhusha V V. Near-infrared fluorescent proteins for multicolor in vivo imaging. Nat Methods. 2013;10: 751–4. pmid:23770755
  31. 31. Lin L-T, Wang B-S, Chen J-C, Liu C-H, Chou C, Chiu S-J, et al. mPlum-IFP 1.4 fluorescent fusion protein may display Förster resonance energy transfer associated properties that can be used for near-infrared based reporter gene imaging. J Biomed Opt. 2013;18: 126013. pmid:24343444
  32. 32. Zlobovskaya OA, Sergeeva TF, Shirmanova M V., Dudenkova V V., Sharonov G V., Zagaynova E V., et al. Genetically encoded far-red fluorescent sensors for caspase-3 activity. Biotechniques. 2016;60. pmid:26842350
  33. 33. Becker W. Fluorescence lifetime imaging—techniques and applications. J Microsc. 2012;247: 119–136. pmid:22621335
  34. 34. Liu Q, Leber B, Andrews DW. family proteins measured in live MCF-7 cells using FLIM FRET. 2012;11: 3536–3542. pmid:22895112
  35. 35. Martin KJ, McGhee EJ, Schwarz JP, Drysdale M, Brachmann SM, Stucke V, et al. Accepting from the best donor; analysis of long-lifetime donor fluorescent protein pairings to optimise dynamic FLIM-based FRET experiments. PLoS One. 2018;13: 1–25. pmid:29293509
  36. 36. Berezin MY, Achilefu S. Fluorescence lifetime measurements and biological imaging. Chem Rev. 2010;110: 2641–2684. pmid:20356094
  37. 37. Sawano a, Miyawaki a. Directed evolution of green fluorescent protein by a new versatile PCR strategy for site-directed and semi-random mutagenesis. Nucleic Acids Res. 2000;28: E78. pmid:10931937
  38. 38. Meyer AJ, Ellefson JW, Ellington AD. Library Generation by Gene Shuffling. Current Protocols in Molecular Biology. Hoboken, NJ, USA: John Wiley & Sons, Inc.; 2014. p. 15.12.1–15.12.7. https://doi.org/10.1002/0471142727.mb1512s105 pmid:24510437
  39. 39. Graham F.L. S J. Characteristics of a Human Cell Line Transformed by D N A from Human Adenovirus Type 5. J.genVirol. 1977;36: 59–72.
  40. 40. Jordan M, Schallhorn A, Wurm FM. Transfecting mammalian cells: Optimization of critical parameters affecting calcium-phosphate precipitate formation. Nucleic Acids Res. 1996;24: 596–601. pmid:8604299
  41. 41. Osterlund EJ, Liu Q, Andrews DW. The Use of FLIM-FRET for the Detection of Mitochondria-Associated Protein Interactions. Methods in Molecular Biology. 2015. pp. 395–419. pmid:25631031
  42. 42. Breiman L. Random forests. Mach Learn. 2001;45: 5–32.
  43. 43. Goedhart J, von Stetten D, Noirclerc-Savoye M, Lelimousin M, Joosen L, Hink MA, et al. Structure-guided evolution of cyan fluorescent proteins towards a quantum yield of 93%. Nat Commun. 2012;3: 751. pmid:22434194
  44. 44. Shcherbo D, Merzlyak EM, Chepurnykh T V, Fradkov AF, Ermakova G V, Solovieva EA, et al. Bright far-red fluorescent protein for whole-body imaging. Nat Methods. 2007;4: 741–746. pmid:17721542
  45. 45. Miyawaki A, Shcherbakova DM, Verkhusha V V. Red fluorescent proteins: Chromophore formation and cellular applications. Curr Opin Struct Biol. Elsevier Ltd; 2012;22: 679–688. pmid:23000031
  46. 46. DeLano W. Pymol: An open-source molecular graphics tool. CCP4 Newsl Protein Crystallogr. 2002;700.