Adapter ligation is a critical first step in many microRNA analysis methods including microarray, qPCR, and sequencing. Previous studies have shown that ligation bias can have dramatic effects on both the fidelity of expression profiles and reproducibility across samples. We have developed a method for high efficiency and low bias microRNA capture by 3′ adapter ligation using T4 RNA ligase that does not require pooled adapters. Using a panel of 20 microRNA, we investigated the effects of ligase type, PEG concentration, ligase amount, adapter concentration, incubation time, incubation temperature, and adapter design on capture efficiency and bias. Of these factors, high PEG% was found to be critical in suppressing ligation bias. We obtained high average capture efficiency and low CV across the 20 microRNA panel, both in idealized buffer conditions (86%±10%) and total RNA spiking conditions (64%±17%). We demonstrate that this method is reliable across microRNA species that previous studies have had difficulty capturing and that our adapter design performs significantly better than the common adapter designs. Further, we demonstrate that the optimization methodology must be specifically designed for minimizing bias in order to obtain the ideal reaction parameters.
Citation: Song Y, Liu KJ, Wang T-H (2014) Elimination of Ligation Dependent Artifacts in T4 RNA Ligase to Achieve High Efficiency and Low Bias MicroRNA Capture. PLoS ONE 9(4): e94619. https://doi.org/10.1371/journal.pone.0094619
Editor: Baochuan Lin, Naval Research Laboratory, United States of America
Received: November 28, 2013; Accepted: March 18, 2014; Published: April 10, 2014
Copyright: © 2014 Song et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the National Institutes of Health [R21CA173390, R43GM103360, R01CA155305, U54CA151838]; and the Maryland Technology Development Corporation [2012- MTTCF-0291-00]. Funding for open access charge: National Institutes of Health. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: Kelvin Liu is currently a full-time employee of Circulomics Inc. A patent has been filed by Johns Hopkins University covering portions of this technology which may be incorporated into future Circulomics products. This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
MicroRNA and other small RNA have added a new dimension to the connection between genotype and phenotype. These new mechanisms for gene expression regulation have led to a wealth of studies detailing the pervasive roles of microRNA in areas such as developmental biology , , , stem cell biology , , , cancer , , , , and plant genomics , , . MicroRNA are studied both to elucidate their roles in fundamental mechanistic pathways as well as to develop novel disease biomarkers ,  and therapeutics , . The majority of microRNA assay techniques been adapted from existing mRNA analysis methods. However, due to their short length, the first step of nearly all microRNA assays is to modify the microRNA through reverse-transcription , , , poly(A)-tailing ,  or ligation , . Among these methods, microRNA capture through adapter ligation is a pervasive first step in many PCR- , , , microarray- , , , bead- and sequencing- based assays , , .
Due to the rising popularity of 2nd generation sequencing for small RNA detection and discovery, a number of studies have sought to benchmark microRNA expression profiles across various detection platforms and systematically look for sources of bias , , , , . Sequencing based methods are enjoying rising popularity due to their ability to identify small RNA species de novo and due to their ability to distinguish closely related isoforms. Although these sequencing approaches typically involve many sequential enzymatic steps including reverse transcription, PCR amplification, ligation, and poly(A) extension, a number of recent studies have pinpointed adapter ligation as the main contributor to expression profile bias , , , .
Ligation bias is critical because it underlies such a large number of microRNA analysis methods. Ligation can introduce two distinct levels of bias to microRNA expression profiles. First, bias can be introduced across samples when different adapters are used on different individual samples. Alon et al showed that consistent differential expression profiles can be seen across samples when the same adapter sequence is used but that large variations are seen when different adapter sequences are used even within the same sample . This can be a significant problem when comparisons are made across assay platforms that use different adapter sequences or when adapters are used to barcode individual samples such as in multiplexed deep sequencing applications. Second, bias can be introduced within each sample across various microRNA species, distorting the resultant expression profiles. Hafner et al demonstrated that microRNA species can appear over or under-expressed by multiple orders of magnitude due to biases in ligation efficiency , . This is less of an issue in differential expression analysis but is a significant issue when comparisons are made across microRNA species to rank expression levels. Though the majority of recent studies have examined bias in the context of sequencing based methods, this ligation bias will have similar effects on other miRNA assays such as PCR and array based methods that incorporate 3′ ligation.
Recent studies have sought to identify the cause of ligation bias and remediate it , , , , , . All of these recent studies have focused on improving adapter design to reduce bias. Jayaprakash et al found that two terminal bases on the 3′ adapter can have dramatic effect on ligation efficiency . Zhuang et al and Hafner et al demonstrated that secondary structure interactions can contribute significantly to variations in ligation efficiency , . Remediation strategies have included optimization of ligase choice, optimizing secondary structure interactions, and incorporating adapter pools , , , , . To our knowledge, few studies other than Zhang et al have investigated the optimization of reaction conditions for bias suppression . Furthermore, nearly all have resorted to the use of randomized adapter pools , , , , . This may be due to the commonly held perception that T4 RNA ligase is inherently biased and difficult to use.
In this study, we have designed a microRNA capture method based on 3′ adapter ligation that achieves very high efficiency (86% AVG) and low bias (10% SD) across all microRNA species tested. High efficiency capture is demonstrated even with microRNA that previous studies have had difficulty capturing and even based on the standard 3′ modban adapter that previous studies have shown to exhibit high ligation bias. Using a panel of 20 microRNA, we studied key assay parameters such as PEG%, enzyme selection, adapter saturation, and design and show that they can be used to suppress bias and nearly eliminate ligation preference given suitable optimization methodology. We demonstrate that optimization must be done in the presence of total RNA using a microRNA panel to minimize global bias, as erroneous conditions can be found if optimization is done using only a single microRNA or in synthetic conditions.
Material and Methods
MicroRNA targets were synthesized by Integrated DNA Technologies (Coralville, IA). They consist of HPLC purified, DNA oligonucleotides with 5′-Ph and either 3′-ddC blocking group or 3′-Cy5 label. The adapters are based on modified 3′ modban adapters . The adapters were enzymatically pre-adenylated with T4 RNA ligase using a process similar to Thomson et al . Additional adapters were also synthesized for comparison purposes based on the SR1 and SR1-S sequences reported by .
MicroRNA were synthesized by Integrated DNA Technologies (Coralville, IA). The sequences were taken from miRBase (www.mirbase.org) and are listed in Table S1 in File S1. They targets consist of HPLC purified RNA oligonucleotides derivatized with 3′-OH and 5′-Cy3 end groups.
Unless otherwise indicated, ligation was performed by mixing 1.25 µL of 2 µM adenylated adapter, 1 µL of T4 RNA Ligase buffer (New England Biolabs, Ipswich, MA), 5 µL of 50% PEG8000, 1 µL of synthetic target, 0.5 µL of total RNA, 1 µL of T4 RNA Ligase 2 truncated K227Q (New England Biolabs, Ipswich, MA) and water into a 20 µL reaction volume. The reaction was then incubated at 25°C for 4 hours and heat denatured at 65°C for 20 minutes in a thermal cycler. In the experiments where different ligases were investigated, T4 RNA Ligase 2 truncated, T4 RNA Ligase 2 truncated R55K K227Q, and Thermostable 5′ App DNA/RNA Ligase were all obtained from New England Biolabs. In spiking experiments, 500 ng of human brain total RNA (Ambion, Austin, TX) was added to each sample.
The samples were analyzed on precast 15% TBE-urea polyacrylamide gels (Bio-Rad, Hercules, CA). 5 µL of sample was mixed with 5 µL of loading buffer and heated for 5 minutes at 95°C. The sample was then loaded into the gel and run for either 30 min or 50 min at 300V. The separated gels were scanned using a Typhoon 9410 variable mode imager (GE Healthcare, Piscataway, NJ). The gel images were analyzed using ImageQuant (GE Healthcare, Piscataway, NJ) to obtain lane profiles. These profiles were then curve-fit with Gaussian curves using Origin (OriginLab, Northampton, MA) to precisely determine band position and intensity.
Results and Discussion
Bias in Ligation Based microRNA Capture
MicroRNA consist of short RNA sequences that are typically 17–23 nt in length. Due to their short length, 5′ and/or 3′ adapter ligation is often used to label, capture, or lengthen the microRNA before downstream detection. A number of studies have suggested a pooled adapter approach to average out the intrinsic effects of ligation bias. However, by improving ligation reaction design and optimization methodology, we have developed a ligation based method that achieves high efficiency and low bias microRNA capture without the need for adapter pools.
As shown in Figure 1, an adapter oligonucleotide is ligated to the 3′-OH of each microRNA using T4 RNA ligase 2. To reduce side product formation, the adapter is first enzymatically pre-adenylated such that the ligation reaction can be performed in the absence of ATP. This prevents the microRNA in the sample from undergoing self-circularization, self-polymerization, and ligation to RNA species other than the adapter. Second, the 3′ end of the adapter is blocked with dideoxycytosine (ddC), a fluorophore, or other moiety to prevent self-circularization and adapter concatenation. Finally, a recombinant mutant ligase is used. These enzymes lack the domain necessary for ATP incorporation and contain point mutations that further suppress side product formation. Such an approach is commonly used in miRNA analysis ahead of reverse transcription , , , , . Using this general reaction design, we investigated the effects of specific reaction conditions in suppressing ligation bias.
The 19 nt, enzymatically pre-adenlyated adapter is ligated to the 3′ OH of microRNA using T4 RNA ligase 2. The reaction is run at 25°C for 4 hours in the absence of ATP. In order to characterize capture efficiency, the microRNA is end labeled with Cy3. The 3′ end of the adapter is blocked by –ddC, a fluorophore, or other moiety to prevent the formation of concatemers and circularized products.
In order to characterize overall ligation efficiency and ligation bias, we synthesized a panel of 20 representative microRNA. Ten of the microRNA were selected based on their reported roles as important cancer-related microRNA (let-7a, miR-16, miR-21, miR-26a, miR-29b, miR-34a, miR-15a, miR-17p, miR-92a, and miR-155) , , . Seven of the microRNA were chosen to enable comparison against recent publications (miR-31, miR-338, miR-567, miR-4803, miR-5183, miR-712, and miR-106b). For example, Zhuang et al reported difficulties capturing miR-4803, miR-5183, and miR-567 while Hafner et al reported low capture efficiencies for miR-31, miR-712, and miR-338 , . Jayaprakash et al reported that miR-106b could not be captured consistently under any of their experimental conditions. The final three microRNA were randomly selected (miR-25, miR-125b, miR-19b). Each target microRNA was labeled with a Cy3 dye at the 5′ end to enable quantification of ligation efficiency under spiking conditions in total RNA. The reaction products were analyzed using denaturing PAGE, and the gels were scanned using a multimode imager. Image analysis was then used to obtain band positions and DNA quantity. Using this PAGE analysis method, we obtain excellent quantification and reproducibility. Quantification is linear from <5 amols to >10 pmols with an experiment to experiment CV of 10% (Figure S1 in File S1).
The first parameter we investigated was ligase type, as the ligases themselves likely have different intrinsic bias. Figure 2 shows a denaturing PAGE analysis of adapter ligation on our 20 microRNA test panel using 4 different RNA ligases. For each ligase, we used the manufacturer recommended ligation conditions. As each microRNA is labeled with Cy3, the capture efficiency was easily quantified using image analysis to compare the band intensities between the free microRNA band at ∼20 nt and the ligated microRNA band at ∼40 nt. A quantitative analysis of ligation efficiency for each microRNA and each ligase is provided in Figure S2 in File S1.
The ligation products were analyzed by 15% denaturing urea-PAGE. Capture efficiency was determined by performing a Cy3 scan and comparing the intensities of the ∼40 nt captured microRNA band versus the ∼20 nt free microRNA band. T4 RNA Ligase 2 truncated (T4 Rnl2 T) had high average capture efficiency and low bias but many randomly sized background products. The point mutant enzymes T4 RNA Ligase 2 truncated K227Q (T4 Rnl2 TK) and T4 RNA Ligase 2 truncated KQ (T4 Rnl2 TKQ) had decreased side product formation but also lower average capture efficiency and higher bias. Thermostable 5′ App DNA/RNA Ligase (Mth Rnl), which was performed at 65°C instead of 25°C, had similar average capture efficiency and bias but with distinct ligation efficiency pattern.
T4 RNA ligase 2 truncated (T4 Rnl2 T) is a mutant enzyme that lacks the domain necessary for ATP incorporation, which should significantly reduce side product formation when used with pre-adenylated adapters in the absence of ATP . As evidenced by the bright uniform bands in Figure 2A, this enzyme gave high ligation efficiency (66% AVG) and low bias (11% SD) with every microRNA species being captured at >40%. However, a significant number of background products can also be seen on the gel. A number of unique and randomly sized bands, which run faster than both the captured microRNA and the free adapter, are visible in each lane. It is unclear what these products are as typically only circularized products can run faster than linear products of the same size, but the same 5′-Cy3 that enables visualization should also prevent the formation of such circularized products.
T4 RNA ligase 2 truncated K227Q (T4 Rnl 2 TK) contains a point mutation that is designed to further reduce side product formation , . This effect is clearly seen in Figure 2B, where side product formation is suppressed and only the desired products are visible. Smaller, randomly sized bands are no longer apparent. Yet, the overall ligation efficiency has decreased dramatically (20% AVG) and ligation bias across the microRNA panel is quite significant (25% SD). Six microRNA were captured at <2% efficiency.T4 RNA ligase 2 truncated KQ (T4 Rnl 2 TKQ) is a double-point mutant that is also designed to have low side product formation but with increased ligation activity that is restored to the levels of T4 Rnl2 T . In our experiments, little difference was seen between T4 Rnl 2 TK and T4 Rnl 2 TKQ, which had a capture efficiency of 17%±24%. T4 Rnl2 TKQ had 7 microRNA that were poorly captured at <2% efficiency.
Finally, we tried Thermostable 5′ App DNA/RNA Ligase (MthRnl) from New England Biolabs, which is a point mutant of RNA ligase isolated from Methanobacterium thermoautotrophicum. This thermostable ligase is unable to incorporate ATP and works optimally at 65°C. Ligation at an elevated incubation temperature could serve to reduce bias from secondary structure interactions. Though the same adapters and microRNAs were used, a different pattern of ligation bias was seen. This likely arose from ligation preferences intrinsic to the ligase itself. In addition, despite the higher reaction temperature, neither ligation efficiency (30% AVG) nor bias (28% SD) was significantly improved over the T4 Rnl2 variants.
Although T4 Rnl2 T had high ligation efficiency and low bias, the large amounts of background products complicate downstream processes and were deemed unacceptable. We chose to use T4 Rnl2 TK moving forward due to the belief that it would be easier to increase reaction efficiency and reduce bias than to suppress side product formation.
Additives such as polyethylene glycol (PEG) ,  and dimethyl sulfoxide (DMSO) , ,  are commonly added to ligation reactions to increase reaction efficiency. In our preliminary experiments, we saw minimal effect with DMSO addition (data not shown). However, we saw dramatic effects on ligation efficiency and bias due to PEG. PEG is thought to increase molecular crowding , , , and many studies as well as manufacturer protocols have recommended ∼15% PEG as an ideal concentration , . Ligation efficiency is said to plateau or even decrease at high PEG levels.
Based on the initial results of Figure 2B, we optimized the effects of PEG on a subset of our 20 microRNA panel. We tested three microRNAs that were initially poorly captured by T4 Rnl2 TK, miR-31 (2.5% capture efficiency), miR-155 (28% capture efficiency), and miR-4803 (3.5% capture efficiency). 10 nM of each microRNA was individually spiked into 500 ng of total RNA. This represents about ∼3–4 million microRNA copies per cell and is sufficient to approximate the aggregate expression of all microRNA within the cell. As the PEG concentration was varied from 0–35%, Figure 3 and Figure S3 in File S1 show that the capture efficiency generally increased as a function of PEG level and then decreased at high PEG levels. MiR-31 and miR-4803 behaved similarly, reaching the highest capture efficiency at 25% PEG, while miR-155 reached maximum efficiency at 15% PEG. The variation seen between these microRNA underscores the importance of optimizing reaction conditions across multiple microRNA species. The ideal conditions for a single microRNA do not necessarily extrapolate to other microRNA.
miR-155 and miR-4803 display similar behavior with respect to PEG percentage while miR-31 behaves distinctly. Different behavior is also seen between idealized buffer conditions and total RNA spiking conditions, illustrating the importance of optimization methodology in extrapolating assay performance. Optimizations performed using a single microRNA species in idealized buffer may not extrapolate to other microRNA under actual assay conditions.
When the microRNAs were spiked into idealized buffer conditions rather than total RNA, a different behavior was seen. Maximum capture efficiency was reached at lower PEG levels with miR-31 and miR-4803 behaving similarly again, reaching plateau at 20% PEG, and miR-155 reaching plateau at 15% PEG. The overall ligation efficiencies also increased significantly, particularly for miR-155. Total RNA likely contains inhibitors that prevent ligation from reaching completion such as: 1) RNA species that bind and sequester the microRNA, 2) RNA species that ligate competitively to the microRNA, 3) RNA species that ligate competitively to the adapter, 4) RNA species that ligate competitively to each other, and 5) inhibitors of the ligase. Though our and reaction design should minimize effects 2), 3), and 4), the large amount of background RNA can still occupy the ligase binding site even if the actual ligation cannot occur (i.e. ligase shaking hands but not making deals). In addition, the decrease in ligation efficiency seen at high PEG levels under spiking conditions is also absent or greatly reduced under idealized buffer conditions. It is unclear what the exact mechanism of PEG is and why different effects would be seen with and without total RNA spiking. Furthermore, the discrepancy in ligation behavior across the three microRNA also appears to decrease, with all three microRNA being well captured at 20% PEG.
This data illustrates that the optimal assay parameters determined using 1) a single microRNA vs. a panel and 2) in idealized buffer vs. spiking conditions are quite different, highlighting the critical importance of optimization methodology and design. Most previous publications, as well as manufacturer protocols, have recommended 12–15% PEG. At this PEG level, only miR-155 is optimally captured under spiking conditions. In spiking conditions, ligation at 20–25% PEG is optimal whereas ligation at the recommended 15% PEG leads to very low efficiency. Of all the parameters investigated, PEG had the most dramatic effect on ligation bias and efficiency.
Adapter Concentration and Ligase Amount
Next we tested the effects of adapter concentration and ligase amount in conjunction with one another. Adapter concentration must be in excess to the ligated species to drive ligation forward. In addition to microRNA, samples often contain other RNA species such as mRNA, rRNA, and siRNA that can also be ligated. Having too few adapters will limit the ligation efficiency and increase bias. Having too large an excess of adapters will promote side product formation. In addition, large excesses of free adapters may also complicate downstream assay processes. The ligase amount also needs to be sufficient to obtain a high ligation efficiency in a reasonable amount of time. However, high ligase amounts can promote side product formation as well. Practically speaking, ligase is the most expensive reaction component and should be minimized to reduce costs.
Adapter concentration and ligase amount were optimized by quantifying their effects on the capture of miR-31, both in presence and absence of total RNA. Figure 4 and Figure S4 in File S1 show the effects of concurrently varying adapter concentration from 50 nM to 400 nM and ligase amount from 100 units to 400 units. In the absence of total RNA, little effect was seen by increasing either adapter or enzyme levels. Even at 50 nM adapter and 100 units of enzyme, ligation efficiency was 88% due to their high excess. However, under the same conditions in the presence of total RNA, ligation efficiency drops to 34% due to the large amounts of other RNA species in solution that the adapters can be ligated to and that ligase can spend time shaking hands with. Under these realistic spiking conditions, much larger amounts of adapter and enzyme were needed; 200–300 nM of adapter and 200 units of enzyme were needed to reach saturation. Although the highest capture efficiency was obtained with high amounts of both adapter and enzyme, high levels of at least one component also gave relatively high efficiencies.
The experiment was performed under idealized buffer conditions and total RNA spiking conditions which lead to distinct conditions for optimum capture efficiency. Under spiking conditions, greater amounts of adapter and enzyme are necessary to obtain high capture efficiency.
For the previous reactions, a 4 hour incubation was performed. We investigated whether this time was sufficient and whether it could be reduced. Using miR-31 as a model and the optimized protocol developed thus far, we tested incubation times from 30 minutes to 18 hours both in the presence and absence of total RNA. Figure 5 and Figure S5 in File S1 illustrate the microRNA capture efficiency as a function of time. As expected, the ligation reaction proceeded much faster in the absence of total RNA, reaching >80% in only 30 minutes and reaching plateau in <2 hours. When spiked into total RNA, plateau was not reached until >8 hours. This was likely due to the greater number of RNA species within the sample that the adapters could be ligated to. By 4 hours, ligation reached 92% of the plateau value, striking a good compromise between incubation time and ligation efficiency.
Secondary structure has been proposed as a main contributor to ligation bias , . We investigated the effect of incubation temperature on ligation efficiency and ligation bias under the premise that elevated incubation temperatures could potentially reduce secondary structure interactions and alleviate bias. Initially, when we performed ligation experiments at 65°C using MthRnl, we did not see any significant difference when compared to ligation at 25°C using the T4 Rnl2 variants. However, in this case both the temperature and the ligase were changed. The intrinsic bias of the MthRnl could have swamped out any effects due to temperature.
We performed a second analysis using T4 Rnl2 TK while incubating at 4°C for 18 hours, 25°C for 4 hours, or 37°C for 4 hours. T4 Rnl2 TK is not thermostable and is denatured at 65°C so temperatures beyond 37°C were not tried. Incubation at 4°C allows for the reaction to proceed for an extended amount of time to compensate for decreased enzyme activity. Most enzymes recommend incubation at 25°C. We also performed ligation at 37°C to see if a modest increase in incubation temperature would have any effect on bias. Figure 6 and Figure S6 in File S1 show a graph and gel images, respectively, of the ligation efficiency for all 20 microRNA in our test panel using the 4°C, 25°C, and 37°C conditions when spiked into 500 ng of total RNA.
The ligation reaction was incubated for 18°C, 4 hours at 25°C or 4 hours at 37°C under total RNA spiking conditions.
The highest efficiency and lowest bias were seen at the 25°C condition where ligation efficiency across the 20 microRNA panel was 53.1% AVG ±13.7% SD. When the incubation temperature was decreased to 4°C, the ligation efficiency across the panel decreased to 23.6% AVG ±7.2% SD. Despite the increased ligation time (18 hours vs. 4 hours), the decreased enzyme activity at 4°C led to significantly reduced ligation efficiency. Interestingly, the capture efficiency CV for the 4°C condition (31% CV) was only slightly worse than at 25°C (26% CV), indicating that temperature does not have a large impact on bias across this range. The pattern in capture efficiency for each individual microRNA in the panel was fairly similar for the two conditions except that the capture efficiency at 25°C was 2× higher in most cases. When the incubation temperature was increased to 37°C, the ligation efficiency across the panel dropped to 34.7% AVG ±17.7% SD. The higher incubation temperature reduced the ligase activity but unexpectedly increased the bias over both the 4°C and 25°C conditions to nearly 51% CV. Incubation temperature likely results in a combination of effects on ligase activity, ligase degradation, and secondary structure formation that impact ligation efficiency in a complex manor. With a few exceptions, the pattern in ligation efficiency across the panel was generally similar to the 4°C and 25°C conditions.
Thus far, we've investigated the use of reaction conditions to suppress the intrinsic bias of T4 Rnl2 TK. Adapter design can also play a large part in ligation bias due to primary sequence and/or secondary structure effects. However, the differences in microRNA capture efficiency seen across published studies illustrate the unpredictable nature of these interactions , , , . For example, even when adapters were logically designed to either eliminate inhibitory secondary structures or promote favorable interactions, only modest improvements, if any, were seen . In Figure 7 and Figure S7 in File S1, we performed adapter ligation on the 20 microRNA panel in spiking conditions (500 ng total RNA) using four different adapter sequences. First, we synthesized two versions of our modified modban adapter  to test whether having RNA or DNA as the 5′ residue would affect ligation efficiency or bias. As the ligases used herein are RNA ligases used for single stranded blunt end ligations, it is possible that the ligases will have a preference for ligating RNA versus DNA. The rA version contains a ribo-A as the 5′ base while the dA version contains a dexoyribo-A as the 5′ base. As seen in Figure 7 left, both of our modified modban adapters achieved high ligation efficiencies with low bias. No significant difference was seen between the rA or dA versions of the adapter, indicating that T4 Rnl2 TK displays no real preference for RNA-RNA ligation or RNA-DNA ligation. The rA adapter had a capture efficiency of 68.5%±17.6% (AVG ± SD) across the 20 microRNA panel while the dA adapter had a capture efficiency of 71.6%±15.4% (AVG ± SD). Even under spiking conditions, the capture efficiencies surpass what previous publications were able to achieve using complex adapter pool strategies in idealized buffer conditions. Using the dA adapter, 19 microRNA were captured at >50% with the lowest still being captured at 31%. We attribute the low capture efficiency of miR-155 to degradation or synthesis problems. We re-synthesized miR-155 multiple times. Immediately after receiving the target, high ligation efficiencies were obtained which would slowly degrade over time. MiR-155 was the only target we saw this effect with.
The rA and dA adapters have identical sequences based on modified modban design except the 5′ base is either RNA or DNA as indicated. The SR1 and SR1-S adapters are taken from Zhuang et al (39). T4 Rnl2 TK shows no preference for DNA or RNA at the ligation site. However, overall capture efficiency and bias were significantly worse for the SR1 and SR1-S adapters.
Encouraged by the previous results, we synthesized the SR1 and SR1-S adapters used by Zhuang  and Hafner  to test whether our reaction would work equally well with other adapter designs. SR1 is a standard adapter commonly used in Illumina's sequencing products. SR1-S was designed by Zhuang to reduce ligation bias by eliminating inhibitory secondary structure interactions. As seen in Figure 7 right, the overall ligation efficiency and bias were much poorer with both of these adapter designs. The SR1 capture efficiency across the 20 microRNA panel was 49.4%±20.6% with only 7 microRNA captured at >50%. The SR1-S adapter fared even worse with only 1 microRNA being captured at >50%. SR1-S had a capture efficiency of 16.9%±15.2% (AVG ± SD) across the panel. This result parallels that reported by Zhuang where the SR1-S adapter unexpectedly performed much worse than the SR1 adapter and failed to improve ligation efficiencies despite eliminating secondary structure interactions. Given the current reaction conditions and microRNA test panel, our modified modban adapter appears to work significantly better than the SR1 adapter. It is possible that the ligation reaction conditions can be optimized specifically for the SR1 adapter but we did not attempt this.
High Efficiency and Low Bias microRNA Capture
Based on the optimized conditions determined in previous experiments, we performed ligation on the 20 microRNA panel using 300 units of T4 Rnl 2 TK, 25% PEG, 200 nM adapter incubated for 4 h at 25°C. Ligation was performed both in idealized buffer and in total RNA spiking conditions as shown in Figure 8. Each experiment was performed in triplicate. In the absence of total RNA, the capture efficiency across the 20 microRNA panel was 86% AVG ±10% SD. No microRNA was captured at less than 60% with 18 of 20 microRNA being captured at 70% or greater. This is a considerable improvement over the initial conditions of 20%±25% (AVG ± SD). Optimization of various parameters such as PEG, adapter, enzyme levels, and incubation time led to an increase in overall ligation efficiency and the concurrent benefit of effectively reducing ligation bias. Saturating levels of each parameter tend to push all of the competing reactions toward completion, regardless of moderate differences in thermodynamic equilibrium and kinetics. Thus, as the capture efficiency of all the microRNA increases and approaches 100%, bias is concurrently reduced since capture saturates at 100%.
The data is presented as representative gel images and as graphs based on image analysis of 3 sets of independent experiments. High capture efficiency and low bias are obtained across the panel both in idealized buffer conditions and total RNA spiking conditions.
When compared to previous studies performed in idealized buffer background, a significant improvement in average capture efficiency and bias was seen. For example Hafner et al reported that miR-16 (92%), miR-21(66%), miR-155 (90%), and miR-567 (73%) were well captured while miR-31 (48%) and miR-338 (3%) were less well captured . In our process, all of these microRNA are uniformly captured at 87%, 91%, 60%, 74%, 74%, and 93%, respectively. Similar results are seen when compared to Zhuang et al . They reported let-7a (100%), miR-31 (87%), and miR-567 (56%) to be well captured while miR-4803 (1%), miR-5183 (42%), and miR-712 (8%) were poorly captured. All of these microRNA species are well captured with our methods at 97%, 74%, 74%, 90%, 97% and 69%, respectively. In addition, miR-106, which could not be captured consistently by Jayaprakash et al under any conditions, is now captured at 90% .
These differences are further pronounced when comparing microRNA capture in realistic spiking conditions. The presence of total RNA likely reduces capture efficiency due to the aforementioned competitive reactions. Under realistic spiking conditions in presence of total RNA, our capture efficiency is slightly decreased and bias is slightly increased with a capture efficiency of 64% AVG ±17% SD across the panel. 16 of 20 microRNA have capture efficiencies greater than 50%. Only miR-155 had a low capture efficiency (27%). We experienced repeatability problems with miR-155, likely due to synthesis and degradation, as previously described. In comparison, Zhuang et el reported significantly reduced capture efficiencies under spiking conditions with efficiencies of 19%, 17%, 9%, 2% and 4% for let-7a, miR-31, miR-567, miR-4803, and miR-5183, respectively . To resolve this, they proposed a random adapter pool approach to average out the low capture efficiencies. However, the capture efficiencies for the pooled approach still remained relatively low at 83%, 27%, 8%, 9%, and 14%, respectively. A recent study by Zhang et al  has also used randomized adapter pools to reduce bias. Bias was reduced to1.8-fold from the expected frequency across a 29 microRNA panel. In comparison, the current method uses only a single adapter and achieves highly efficient and low bias capture levels of 74%, 63%, 61%, 73%, and 83% for let-7a, miR-31, miR-567, miR-4803, and miR-5183. In fact, our spiking capture efficiencies surpass what has been demonstrated by previous studies under idealized buffer conditions.
The experiments shown in Figure 8 were performed using a dA adapter labeled with Cy5. Additionally, we also performed this experiment in triplicate using an rA adapter labeled with Cy5 and an rA adapter blocked with ddC (data not shown). In each case, similar results were obtained, demonstrating that the adapter ligation process is highly repeatable and robust. This is further evidenced by the low average experiment-to-experiment SD for each microRNA seen in Figure 8, which was 3% in idealized buffer and 7% in total RNA spiking. miRNA capture efficiency is also consistent across a broad range of miRNA input levels as seen in Figure S8 in File S1.
Based on results across the 20 microRNA panel, we believe that ligation bias can be largely suppressed through suitable ligation optimization methodology, as opposed to solely focusing on adapter design. It is critical to design the reaction methodology to optimize for bias rather than just ligation efficiency. It is also critical to optimize using a panel of microRNA targets, rather than a single species, and to perform experiments both in idealized buffer and in realistic spiking conditions. Total RNA spiking has a large impact on the optimal reaction conditions, likely due to a number of competitive and inhibitive effects. Using this methodology, we were able to capture a panel of 20 microRNA at 86% AVG ±10% SD (12% CV) in buffer and 64%±16.5% SD (26% CV) in spiking conditions. This variability is below the typical variability in PCR amplification efficiency and means that expression profiles will likely vary very little due to this step. In contrast to previous methods that incorporated randomized adapter pools, the current method achieves low bias using only a single adapter sequence to capture all 20 microRNA. The elimination of adapter pools can greatly simplify downstream assay design. Key to this process was the use of very high PEG levels (25%) to suppress bias, particularly under total RNA spiking conditions. This PEG level is about double that which is typically recommended (10-15%). Molecular crowding by PEG has been demonstrated to affect both RNA enzyme activity  and RNA folding , . As discussed, previous studies have that indicated secondary structure interactions as a primary contributor to ligation bias. Thus, it is likely that the stabilization of RNA folding by high PEG can play a significant role in suppressing bias.
Through this process we have also discovered that variability in the adapter and microRNA synthesis can lead to significant artifacts. The oligonucleotides that we ordered were always HPLC purified and had specified yields of >80%, yet we saw large variations in capture efficiency across different batches of adapters and microRNA targets. 3 microRNA targets that initially appeared poorly captured became consistently well captured after the targets were re-synthesized. However, miR-155 and, to a lesser degree miR-712, behaved increasingly erratically as the targets aged and required multiple re-synthesis rounds to obtain optimal results. Interestingly, all the other microRNA targets remained very stable over time. It is possible that some of the variability and inconsistencies seen in previous papers could be also attributed to synthesis issues. While the 20 microRNA panel used herein is not exhaustive of all microRNA species, we feel it is a good representative snapshot of what can be expected.
Conceived and designed the experiments: YS KJL THW. Performed the experiments: YS. Analyzed the data: YS KJL. Wrote the paper: YS KJL THW. Provided scientific advisory: THW.
- 1. Kloosterman WP, Wienholds E, de Bruijn E, Kauppinen S, Plasterk RH (2006) In situ detection of miRNAs in animal embryos using LNA-modified oligonucleotide probes. Nat Methods 3: 27–29.
- 2. Mineno J, Okamoto S, Ando T, Sato M, Chono H, et al. (2006) The expression profile of microRNAs in mouse embryos. Nucleic Acids Res 34: 1765–1771.
- 3. Wienholds E, Kloosterman WP, Miska E, Alvarez-Saavedra E, Berezikov E, et al. (2005) MicroRNA expression in zebrafish embryonic development. Science 309: 310–311.
- 4. Tang F, Hajkova P, Barton SC, Lao K, Surani MA (2006) MicroRNA expression profiling of single whole embryonic stem cells. Nucleic Acids Res 34: e9.
- 5. Morin RD, O'Connor MD, Griffith M, Kuchenbauer F, Delaney A, et al. (2008) Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res 18: 610–621.
- 6. Hatfield SD, Shcherbata HR, Fischer KA, Nakahara K, Carthew RW, et al. (2005) Stem cell division is regulated by the microRNA pathway. Nature 435: 974–978.
- 7. Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, et al. (2005) MicroRNA expression profiles classify human cancers. Nature 435: 834–838.
- 8. Volinia S, Calin GA, Liu CG, Ambs S, Cimmino A, et al. (2006) A microRNA expression signature of human solid tumors defines cancer gene targets. Proc Natl Acad Sci U S A 103: 2257–2261.
- 9. Garzon R, Marcucci G, Croce CM (2010) Targeting microRNAs in cancer: rationale, strategies and challenges. Nat Rev Drug Discov 9: 775–789.
- 10. Ventura A, Jacks T (2009) MicroRNAs and cancer: short RNAs go a long way. Cell 136: 586–591.
- 11. Meyers BC, Souret FF, Lu C, Green PJ (2006) Sweating the small stuff: microRNA discovery in plants. Curr Opin Biotechnol 17: 139–146.
- 12. Zhang B, Pan X, Cobb GP, Anderson TA (2006) Plant microRNA: a small regulatory molecule with big impact. Dev Biol 289: 3–16.
- 13. Voinnet O (2009) Origin, biogenesis, and activity of plant microRNAs. Cell 136: 669–687.
- 14. Keller A, Leidinger P, Bauer A, Elsharawy A, Haas J, et al. (2011) Toward the blood-borne miRNome of human diseases. Nat Methods 8: 841–843.
- 15. Mitchell PS, Parkin RK, Kroh EM, Fritz BR, Wyman SK, et al. (2008) Circulating microRNAs as stable blood-based markers for cancer detection. Proc Natl Acad Sci U S A 105: 10513–10518.
- 16. Kota J, Chivukula RR, O'Donnell KA, Wentzel EA, Montgomery CL, et al. (2009) Therapeutic microRNA delivery suppresses tumorigenesis in a murine liver cancer model. Cell 137: 1005–1017.
- 17. Valastyan S, Reinhardt F, Benaich N, Calogrias D, Szasz AM, et al. (2009) A pleiotropically acting microRNA, miR-31, inhibits breast cancer metastasis. Cell 137: 1032–1046.
- 18. Chen C, Ridzon DA, Broomer AJ, Zhou Z, Lee DH, et al. (2005) Real-time quantification of microRNAs by stem-loop RT-PCR. Nucleic Acids Res 33: e179.
- 19. Liu CG, Calin GA, Meloon B, Gamliel N, Sevignani C, et al. (2004) An oligonucleotide microchip for genome-wide microRNA profiling in human and mouse tissues. Proc Natl Acad Sci U S A 101: 9740–9744.
- 20. Liu CG, Calin GA, Volinia S, Croce CM (2008) MicroRNA expression profiling using microarrays. Nat Protoc 3: 563–578.
- 21. Shingara J, Keiger K, Shelton J, Laosinchai-Wolf W, Powers P, et al. (2005) An optimized isolation and labeling platform for accurate microRNA expression profiling. Rna 11: 1461–1470.
- 22. Chen J, Lozach J, Garcia EW, Barnes B, Luo S, et al. (2008) Highly sensitive and specific microRNA expression profiling using BeadArray technology. Nucleic Acids Res 36: e87.
- 23. Van Nieuwerburgh F, Soetaert S, Podshivalova K, Ay-Lin Wang E, Schaffer L, et al. (2011) Quantitative bias in Illumina TruSeq and a novel post amplification barcoding strategy for multiplexed DNA and small RNA deep sequencing. PLoS One 6: e26969.
- 24. Vigneault F, Sismour AM, Church GM (2008) Efficient microRNA capture and bar-coding via enzymatic oligonucleotide adenylation. Nat Methods 5: 777–779.
- 25. Lu DP, Read RL, Humphreys DT, Battah FM, Martin DI, et al. (2005) PCR-based expression analysis and identification of microRNAs. J RNAi Gene Silencing 1: 44–49.
- 26. Poy MN, Eliasson L, Krutzfeldt J, Kuwajima S, Ma X, et al. (2004) A pancreatic islet-specific microRNA regulates insulin secretion. Nature 432: 226–230.
- 27. Wang H, Ach RA, Curry B (2007) Direct and sensitive miRNA profiling from low-input total RNA. Rna 13: 151–159.
- 28. Thomson JM, Parker J, Perou CM, Hammond SM (2004) A custom microarray platform for analysis of microRNA gene expression. Nat Methods 1: 47–53.
- 29. Barad O, Meiri E, Avniel A, Aharonov R, Barzilai A, et al. (2004) MicroRNA expression detected by oligonucleotide microarrays: system establishment and expression profiling in human tissues. Genome Res 14: 2486–2494.
- 30. Hafner M, Landgraf P, Ludwig J, Rice A, Ojo T, et al. (2008) Identification of microRNAs and other small regulatory RNAs using cDNA library sequencing. Methods 44: 3–12.
- 31. Lu C, Meyers BC, Green PJ (2007) Construction of small RNA cDNA libraries for deep sequencing. Methods 43: 110–117.
- 32. Ach RA, Wang H, Curry B (2008) Measuring microRNAs: comparisons of microarray and quantitative PCR measurements, and of different total RNA prep methods. BMC Biotechnol 8: 69.
- 33. Chen Y, Gelfond JA, McManus LM, Shireman PK (2009) Reproducibility of quantitative RT-PCR array in miRNA expression profiling and comparison with microarray analysis. BMC Genomics 10: 407.
- 34. Git A, Dvinge H, Salmon-Divon M, Osborne M, Kutter C, et al. (2010) Systematic comparison of microarray profiling, real-time PCR, and next-generation sequencing technologies for measuring differential microRNA expression. Rna 16: 991–1006.
- 35. Willenbrock H, Salomon J, Sokilde R, Barken KB, Hansen TN, et al. (2009) Quantitative miRNA expression analysis: comparing microarrays with next-generation sequencing. Rna 15: 2028–2034.
- 36. Jayaprakash AD, Jabado O, Brown BD, Sachidanandam R (2011) Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing. Nucleic Acids Res 39: e141.
- 37. Alon S, Vigneault F, Eminaga S, Christodoulou DC, Seidman JG, et al. (2011) Barcoding bias in high-throughput multiplex sequencing of miRNA. Genome Res 21: 1506–1511.
- 38. Sorefan K, Pais H, Hall AE, Kozomara A, Griffiths-Jones S, et al. (2012) Reducing ligation bias of small RNAs in libraries for next generation sequencing. Silence 3: 4.
- 39. Linsen SE, de Wit E, Janssens G, Heater S, Chapman L, et al. (2009) Limitations and possibilities of small RNA digital gene expression profiling. Nat Methods 6: 474–476.
- 40. Hafner M, Renwick N, Brown M, Mihailovic A, Holoch D, et al. (2011) RNA-ligase-dependent biases in miRNA representation in deep-sequenced small RNA cDNA libraries. Rna 17: 1697–1712.
- 41. Zhuang F, Fuchs RT, Sun Z, Zheng Y, Robb GB (2012) Structural bias in T4 RNA ligase-mediated 3′-adapter ligation. Nucleic Acids Res 40: e54.
- 42. Sun G, Wu X, Wang J, Li H, Li X, et al. (2011) A bias-reducing strategy in profiling small RNAs using Solexa. RNA 17: 2256–2262.
- 43. Zhang Z, Lee JE, Riemondy K, Anderson EM, Yi R (2013) High-efficiency RNA cloning enables accurate quantification of miRNA expression by deep sequencing. Genome Biol 14: R109.
- 44. Lau NC, Lim LP, Weinstein EG, Bartel DP (2001) An abundant class of tiny RNAs with probable regulatory roles in Caenorhabditis elegans. Science 294: 858–862.
- 45. Pfeffer S, Lagos-Quintana M, Tuschl T (2005) Cloning of small RNA molecules. Curr Protoc Mol Biol Chapter 26: Unit 26 24.
- 46. Pak J, Fire A (2007) Distinct populations of primary and secondary effectors during RNAi in C. elegans. Science 315: 241–244.
- 47. Garzon R, Calin GA, Croce CM (2009) MicroRNAs in Cancer. Annu Rev Med 60: 167–179.
- 48. Viollet S, Fuchs RT, Munafo DB, Zhuang F, Robb GB (2011) T4 RNA ligase 2 truncated active site mutants: improved tools for RNA analysis. BMC Biotechnol 11: 72.
- 49. Yin S, Ho CK, Shuman S (2003) Structure-function analysis of T4 RNA ligase 2. J Biol Chem 278: 17601–17608.
- 50. Harrison B, Zimmerman SB (1984) Polymer-stimulated ligation: enhanced ligation of oligo- and polynucleotides by T4 RNA ligase in polymer solutions. Nucleic Acids Res 12: 8235–8251.
- 51. Tessier DC, Brousseau R, Vernet T (1986) Ligation of single-stranded oligodeoxyribonucleotides by T4 RNA ligase. Anal Biochem 158: 171–178.
- 52. Miyoshi D, Sugimoto N (2008) Molecular crowding effects on structure and stability of DNA. Biochimie 90: 1040–1051.
- 53. Nakano S, Karimata HT, Kitagawa Y, Sugimoto N (2009) Facilitation of RNA enzyme activity in the molecular crowding media of cosolutes. J Am Chem Soc 131: 16881–16888.
- 54. Kilburn D, Roh JH, Behrouzi R, Briber RM, Woodson SA (2013) Crowders perturb the entropy of RNA energy landscapes to favor folding. J Am Chem Soc 135: 10055–10063.
- 55. Kilburn D, Roh JH, Guo L, Briber RM, Woodson SA (2010) Molecular crowding stabilizes folded RNA structure by the excluded volume effect. J Am Chem Soc 132: 8690–8696.