Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated (cas) genes constitute the CRISPR-Cas systems of various bacteria and archaea, which degrade invading nucleic acids containing sequences (protospacers) that are complementary to the spacers located between repeats. It has been demonstrated that the sequence identity between a protospacer and its cognate spacer, and the presence of a protospacer adjacent motif (PAM), influence the efficiency of CRISPR-mediated interference. Using an original transformation assay with plasmids targeted by a resident spacer, we show here that natural CRISPR-mediated immunity against invading DNA occurs in wild-type Escherichia coli. Unexpectedly, the strongest activity is observed with protospacer-adjoining nucleotides (interference motifs) that differ from the PAM in both sequence and location. Hence, our results document for the first time native CRISPR activity in E. coli and demonstrate that positions next to the PAM in invading DNA influence its recognition and degradation by these prokaryotic immune systems.
Citation: Almendros C, Guzmán NM, Díez-Villaseñor C, García-Martínez J, Mojica FJM (2012) Target Motifs Affecting Natural Immunity by a Constitutive CRISPR-Cas System in Escherichia coli. PLoS ONE 7(11): e50797. doi:10.1371/journal.pone.0050797
Editor: Igor Mokrousov, St. Petersburg Pasteur Institute, Russian Federation
Received: September 21, 2012; Accepted: October 25, 2012; Published: November 26, 2012
Copyright: © 2012 Almendros et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported by the Spanish Ministerio de Ciencia e Innovación (BIO2011-24417). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Arrays of regularly spaced DNA repeats are present in 85% of sequenced archaea and about 50% of bacteria (CRISPRdb at http://crispr.u-psud.fr/crispr/). Even though the repeat sequence may vary greatly among arrays, the regular spacing between repeats allows them to be recognized as members of a family, at present known by the acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats). At least some of the sequences between repeats (spacers) are acquired from identical DNA fragments (protospacers) in bacteriophages and plasmids. Functionally related to CRISPR, and usually in close proximity to them, are the cas (CRISPR-associated) genes, which together with the arrays constitute the CRISPR-Cas systems. Diverse systems, currently classified into three main types (I, II and III), each including several subtypes, are distinguished mainly by the presence of particular signature cas genes. An increasing number of biochemical and genetic studies indicate that CRISPR-Cas provides adaptive immunity against molecules carrying protospacers. Indeed, specific Cas endonucleases cleave protospacers after these base pair with complementary spacers carried in small CRISPR RNA (crRNA) molecules. Additional sequence elements adjacent to repeat-spacer arrays (i.e. the leader) or to protospacers (i.e. the protospacer adjacent motif, denoted PAM) participate in this activity. Notably, the leader contains promoters for the transcription of the adjacent CRISPR array and is required for insertion of repeat-spacer units at the leader-proximal edge of the repeat cassette. PAMs are short (2–5 nt) signatures located next to one end of the protospacers. The sequence and location of the PAM, relative to that of the corresponding spacer in the CRISPR array, are conserved for systems with similar repeats (belonging to the same CRISPR type according to Kunin et al.) but both may vary among CRISPR types.
PAMs are required for efficient interference by at least some CRISPR-Cas systems, and their occurrence strongly suggests that they are recognized by the acquisition machinery during the selection of spacer precursors.
Two CRISPR-Cas systems, belonging to subtypes I-E and I-F, also known as Ecoli and Ypest respectively, have been identified in E. coli strains. With two exceptions, represented by E. coli strain B7A and Shigella sp. D9, which contain both systems, either none or just one apparently functional system (with repeats and a complete set of associated cas genes) is retained. Subtype I-E is prevalent within the species and has been the subject of multiple studies. In contrast, subtype I-F is almost exclusively present (apart from the two aforementioned exceptions) in a few members of the phylogenetic group B2 (4 out of 15 B2 strains of the ECOR collection) and functional studies on this system have only been performed in Pseudomonas aeruginosa.
The observation that most E. coli isolates harbour either I-E or I-F suggests that the two systems can replace each other. Indeed, they are strikingly similar structurally and mechanistically. First, they belong to the same main type (i.e. Type I), defined by characteristic mechanistic details of processing and interference as well as by the presence of the cas3 gene, which is fused to cas2 in the I-F subtype. Their cas1 genes are more closely related to each other than to homologs in any other subtype. Apart from Cas1, Cas2 and Cas3, which occur in the two E. coli systems, the remaining Cas proteins do not show evident sequence homology, yet they constitute interference complexes (named Cascade and Csy-complex for the I-E and I-F systems respectively) that are similar in structure and topology and appear to be functionally analogous. Furthermore, although I-E and I-F repeats belong to distinct sequence types, both are partially palindromic and have the shortest repeat periodicities (60 and 61 respectively) among known CRISPR.
In this work we analyzed interference by the CRISPR-Cas I-F system of E. coli LF82. This system is made up of two arrays of CRISPR-4 repeats, referred to as the CRISPR4.1 and CRISPR4.2 arrays, separated by six cas genes (namely cas1, the cas2/cas3 fusion, csy1, csy2, csy3 and cas6f). Putative leader sequences have been identified adjoining each array. Here we show that, in contrast to subtype I-E, which is silent under normal laboratory growth conditions, the Cas I-F genes are constitutively expressed and produce interference against target DNA under native conditions. The previously predicted PAM for CRISPR-4 repeats is also observed in E. coli, but the most efficient interference motif (the sequence in the PAM region causing interference) differs from that signature, being shifted one nucleotide away from the protospacer.
Identification of the PAM Associated with the CRISPR-Cas I-F System of E. coli
In a previous study, the alignment of regions containing protospacers associated with CRISPR-4 repeats from Pseudomonas aeruginosa, Yersinia pestis and Shewanella spp. revealed the conservation of the PAM signature GG adjacent to the end of the protospacer that becomes leader proximal in the corresponding CRISPR array. Here we performed a similar analysis of regions containing the protospacers of 36 CRISPR-4 spacers from E. coli strains (Figure S1). Their alignment confirmed the previously observed PAM, also located towards the leader side (Figure 1). As shown in Figure 1, positions are numbered starting from the PAM-proximal end of the protospacer and increasing towards the 3′ end (i.e. 5′ protospacer-G1G2 3′). In addition to the conservation of G1G2, exclusion of guanine at position 3 and of thymine at position -1 was perceptible. However, the latter was dismissed, as no nucleotide preference was evident in a further alignment of over 130 CRISPR-4 spacers for which similar sequences outside CRISPR loci were not found (Figure S2).
Figure 1. Protospacer and PAM positions are shaded in blue and red respectively. A crRNA molecule with an undefined spacer (N nucleotides) and the surrounding CRISPR sequences (5′ and 3′ handles) is drawn to illustrate the orientation of the PAM region with respect to the crRNA (aligned with adenine nucleotides at the 3′ end of the 5′ handle) when the spacer anneals to the cognate protospacer during target recognition.
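The position-wise conservation that a sequence logo displays can be approximated by counting nucleotides at each numbered position of the aligned flanks. The following is only a minimal sketch of that idea, not the WebLogo procedure used for Figure 1; the function name `flank_counts` and the example sequences are hypothetical and were invented for illustration.

```python
from collections import Counter


def flank_counts(flanks):
    """Count nucleotides at each position of equally long PAM-side flanks.

    flanks[i][0] is position 1 (the nucleotide adjoining the protospacer),
    flanks[i][1] is position 2, and so on. No gaps are allowed.
    """
    length = len(flanks[0])
    if any(len(f) != length for f in flanks):
        raise ValueError("all flanks must have the same length (no gaps)")
    return [Counter(f[i].upper() for f in flanks) for i in range(length)]


# Hypothetical flanks, invented for illustration (NOT data from Figure S1).
example_flanks = ["GGC", "GGA", "GGT", "GGC", "AGC"]
counts = flank_counts(example_flanks)

for position, counter in enumerate(counts, start=1):
    total = sum(counter.values())
    summary = ", ".join(
        f"{nt}={n / total:.2f}" for nt, n in sorted(counter.items())
    )
    print(f"position {position}: {summary}")
```

With real data, a position where G approaches a frequency of 1 would correspond to a conserved PAM nucleotide, and a position where G is absent would correspond to the exclusion noted at position 3.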
The Cas I-F genes of E. coli are Expressed Under Normal Laboratory Growth Conditions
While an array with two repeats is the only remnant of the CRISPR-Cas I-E system present in other E. coli strains, LF82 carries a typical CRISPR-Cas subtype I-F system and its CRISPR4.1 and CRISPR4.2 arrays contain 9 and 22 spacers respectively (Figure 2). Using BLASTN, we searched non-CRISPR loci for sequences with over 90% identity to these spacers and found two matches within plasmids of the species and one in an enterophage (Figure 2 and Figure S1), suggesting that the CRISPR-Cas system of LF82 could act as an immune system. In order to determine whether this system might be active under normal laboratory growth conditions, reverse transcription PCR (RT-PCR) experiments were performed with RNA purified from LF82 cultures grown in LB medium to logarithmic and stationary growth phase. The distances between the open reading frames (ORFs) of the cas genes and the fact that they all have the same direction of transcription (see Figure 2) strongly suggest that they are organized into two transcription units (the cas2/cas3 and csy1 ORFs are separated by 330 bp), the first one including cas1 and cas2/cas3 (ORFs separated by 3 bp) and the second spanning from csy1 to cas6f (these ORFs either overlap or are separated by less than 11 bp). Moreover, the proteins encoded by csy1, csy2, csy3 and cas6f form a functional Csy-complex, and it has been reported for the analogous Cascade (Cas complex for antiviral defense) of the subtype I-E system that the corresponding genes form part of an operon. First, the possibility that the six cas genes were co-transcribed was dismissed, as no product was obtained by reverse transcription of total RNA followed by PCR amplification with primers targeting cas2/cas3 and csy1 (data not shown).
Subsequently, the first gene of each transcription unit (cas1 and csy1) was targeted with primers in PCR reactions on cDNA samples obtained from logarithmic and stationary phase cultures. Amplification was obtained in all cases (Figure 3 and data not shown), demonstrating their expression under the conditions assayed.
Figure 2. Spacers with and without identified protospacers as well as repeats are represented as gray, white and black rectangles respectively. Spacer #1 is labeled. Cas genes are shown as boxes pointing towards the direction of transcription.
Figure 3. Agarose gel electrophoresis of PCR products obtained using as template total DNA (lanes 1 to 3), cDNA (lanes 5 to 7) or RNA (lanes 8 to 10) of LF82 strain grown in LB medium at logarithmic phase (results of samples from stationary phase cultures were similar and are not shown). In addition to cas1 (lanes 1, 5 and 8) and csy1 (lanes 2, 6 and 9) the highly expressed tufB transcript (lanes 3, 7 and 10) was probed as a control of DNA contamination in RNA samples. A molecular weight marker is included (lane 4) for fragment size estimation.
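The reasoning used above to propose two transcription units (co-directional genes split wherever the intergenic distance is large) can be sketched as a simple grouping step. This is an illustrative sketch only: the function `group_by_gap` and the 50 bp threshold are assumptions, and the three csy gaps are placeholders because the text states only that those ORFs overlap or lie less than 11 bp apart. Only the 3 bp and 330 bp distances come from the text.

```python
def group_by_gap(genes, gaps, max_gap=50):
    """Split an ordered list of co-directional genes into candidate units.

    gaps[i] is the distance in bp between genes[i] and genes[i + 1];
    a negative value denotes overlapping ORFs. max_gap is an assumed,
    arbitrary threshold chosen for illustration.
    """
    if len(gaps) != len(genes) - 1:
        raise ValueError("need exactly one gap per adjacent gene pair")
    units = [[genes[0]]]
    for gene, gap in zip(genes[1:], gaps):
        if gap > max_gap:
            units.append([gene])      # large gap: start a new unit
        else:
            units[-1].append(gene)    # small gap or overlap: same unit
    return units


genes = ["cas1", "cas2/cas3", "csy1", "csy2", "csy3", "cas6f"]
# 3 and 330 are from the text; -4, 10 and 5 are placeholder values.
gaps = [3, 330, -4, 10, 5]
units = group_by_gap(genes, gaps)
print(units)
```

Such a grouping only generates a hypothesis; in the study it was tested experimentally by RT-PCR across the cas2/cas3 to csy1 junction.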
The CRISPR-Cas I-F System of E. coli Produces Interference Against Target Plasmids
Expression of the cas genes strongly suggests that the CRISPR-Cas I-F system of LF82 is active, prompting us to investigate interference by this system. First, we tested natural interference by the first spacer after the leader of the CRISPR4.1 array of LF82, hereinafter referred to as spacer #1 (see Figure 2). LF82 cells were transformed with mixtures containing equivalent concentrations of two plasmids that differ in the presence or absence of protospacer #1, a sequence identical to spacer #1. The proportion of transformants carrying the targeted plasmid in each experiment is an indication of the interference activity driven by spacer #1: stronger activity implies a lower proportion of transformants with the target plasmid (see Materials and Methods). Initial transformation experiments were performed with plasmid pCR2.1, which has no sequence matching LF82 spacers, and pCAR-GGC, which carries the tri-nucleotide GGC at the 3′ end of protospacer #1. Interference with this PAM was evident as, on average, 22% of transformants (p<0.01) carried pCAR-GGC (see Figure 4). Hence, the CRISPR-Cas I-F system of E. coli LF82 is naturally active against target plasmids.
Figure 4. Data correspond to the proportion of transformants carrying a target plasmid (containing protospacer #1) to transformant colonies carrying pCR2.1 (see Materials and Methods for details). Target plasmids differ in sequence at positions 1 to 3 of the PAM region (the three nucleotides are indicated under the bars in that order). The mean of three independent competition experiments for each targeted plasmid is shown with its standard deviation. Values significantly below 1 imply CRISPR interference.
Identification of Nucleotides at the PAM Affecting Interference
Once CRISPR activity against protospacers with the predicted PAM was confirmed, we set out to identify the nucleotides in the PAM region required for such interference, which define the interference motif. With this aim, we performed competition assays with mixtures containing the pCR2.1 vector and a derived plasmid (target plasmid) containing protospacer #1. The target plasmids in the different transformation mixtures differed in the sequence at positions 1 to 3 relative to the protospacer. Position 3 was included in the analysis because guanine was apparently excluded at this location and hence could form part of an extended PAM, i.e. GGH (see Figure 1). Average data from three independent experiments for each plasmid pair are shown in Figure 4. Significance values for results deemed to indicate interference were in all cases lower than 0.01. Strikingly, in addition to GGC and GGG, interference was observed with AGC, implying that G2 is enough for protospacer targeting. In contrast, the presence of guanine at position 1 had a limited effect on the transformation balance (see GAC in Figure 4). Unexpectedly, G3 also supported interference even in the absence of the canonical PAM G1G2 (see AAG and TTG) and, moreover, the lowest percentages of transformants carrying the target plasmid were invariably found when this third G was present (GGG, AAG, TTG). It is worth noting that the effect of G at positions 2 and 3 was cumulative (compare GGG with AGC and AAG). Taken together, these results show that at least one G at either position 2 or 3 is required for interference, G3 being the most effective. Strikingly, despite the high conservation of G1 in the CRISPR-4 PAM, it is neither sufficient nor necessary for such activity, a conclusion further supported by the equivalent transformation rates observed with GGC and AGC.
The potential influence of base pairing between the PAM region and the crRNA was also addressed. It has been demonstrated for the CRISPR-Cas type III system of Staphylococcus epidermidis that base pairing between the target region flanking the protospacer and the crRNA sequence beyond the spacer prevents interference. As illustrated in Figure 1, PAM positions 1 to 3 face adenine nucleotides in the 5′ handle of the crRNA after spacer-protospacer hybridization during target recognition in the interference stage. Although no interference was observed when the tri-nucleotide TTT adjoined the protospacer, a result compatible with prevention of interference by base pairing, this could also be explained by the absence of guanine residues in the PAM region. In the same context, the interference observed when guanines are present in the PAM region might be exclusively due to the absence of base pairing (they face adenine residues). However, this possibility is ruled out, as neither AAC nor CCC produces interference, confirming that guanines are specifically required in the PAM region, as predicted from the results discussed above. Yet prevention of interference by base pairing may still occur, as the interference supported by G2 is abolished when surrounding nucleotides are complementary to the corresponding positions in the crRNA (compare TGT and TGC with AGC, GGC and GGG in Figure 4). In contrast, and in line with the strong interference supported by G3, base pairing at positions 1 and 2 did not affect interference when this G was present (compare AAG with TTG). Further studies will be required to assess the contribution of crRNA-PAM annealing to interference.
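The qualitative outcomes described in this section for the eleven tested tri-nucleotides can be condensed into a rule of thumb. The sketch below is an assumption made for illustration: the function `classify_motif`, its category labels and the rule itself are not a model proposed or validated in this work, they merely reproduce the outcomes reported in the text (G3 gives the strongest interference; G2 gives interference unless position 1 is a T that could pair with the crRNA adenine). Untested tri-nucleotides may well behave differently.

```python
def classify_motif(tri):
    """Classify a PAM-region tri-nucleotide by the rule of thumb above.

    tri[0], tri[1] and tri[2] are positions 1, 2 and 3 counted from the
    protospacer. Returns 'strong', 'interference' or 'none_or_limited'.
    """
    tri = tri.upper()
    if len(tri) != 3 or set(tri) - set("ACGT"):
        raise ValueError("expected three DNA nucleotides")
    if tri[2] == "G":
        return "strong"               # G3 present: lowest transformant share
    if tri[1] == "G" and tri[0] != "T":
        return "interference"         # G2 without a pairing T at position 1
    return "none_or_limited"


# Outcomes as described in the text for the tested motifs.
described = {
    "GGG": "strong", "AAG": "strong", "TTG": "strong",
    "GGC": "interference", "AGC": "interference",
    "GAC": "none_or_limited", "TTT": "none_or_limited",
    "AAC": "none_or_limited", "CCC": "none_or_limited",
    "TGT": "none_or_limited", "TGC": "none_or_limited",
}

for motif, outcome in described.items():
    print(motif, classify_motif(motif), classify_motif(motif) == outcome)
```

The rule treats only a T at position 1 as blocking G2, because TGT and TGC are the only tested cases; whether a T at position 3 alone would have the same effect is not addressed by the data described here.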
In this work we show for the first time the occurrence of CRISPR-Cas activity against foreign DNA in wild-type E. coli. Natural immunity conferred by a resident spacer of the CRISPR-Cas I-F system of strain LF82 was demonstrated using an original assay (the competition test) based on the transformation efficiency of a target plasmid compared with that of the non-targeted parental vector. Instead of the widely used method that relies on independent transformation experiments for each plasmid, in the competition test cells are transformed with a mixture of both plasmids, each with equivalent concentration, purity and topology. Electroporation variables, and therefore transformation rates, can differ greatly among experiments. The competition test circumvents the influence of these variables across independent experiments because the number of transformants carrying the targeted plasmid is normalized with respect to an internal control (the non-targeted vector) instead of to data from a separate assay. In addition, competition tests using the same internal control are comparable. This advantage is especially relevant when small differences between plasmids are expected, as a high experimental error may conceal subtle variations. In this context, the plasmids used for the competition assay are high copy number (pUC origin) and, as a consequence, the screening of transformant colonies will reveal interference only when it happens soon after the target plasmid enters the cell (before it reaches a high copy number). Our assay provides statistically significant data for differences in interference between targeted plasmids as low as two-fold.
It is expected that the selection of spacer precursors through the recognition of an adjoining motif (i.e. the PAM) has a functional meaning, and nucleotides at the PAM have been shown to be important for interference. However, in contrast with the PAM, the interference motif of at least some systems admits a certain flexibility, as shown here for a CRISPR-Cas I-F system as well as in previous studies with other subtypes. Although we have not tested the 64 possible tri-nucleotides in the PAM region, our results obtained with selected combinations clearly demonstrate that just one guanine residue, at either position 2 or 3, is sufficient to support interference, whereas G at position 1 is not required, defining interference motifs that are shifted one position from the predicted PAM.
Previous works inferred PAM positions important for interference based on the selection of mutants that escape CRISPR interference. In this case, variations in the PAM that increase interference efficiency cannot be detected. In contrast, our study explores alternative nucleotides at PAM positions revealing both strong and weak interference motifs. Further, we show that positions outside the protospacer and PAM influence interference.
Two aspects related to the identity of the interference motifs versus the PAM are intriguing: the dispensability of G1 and the involvement of G3. Perhaps the role of G1 that justifies its presence in the PAM is just to avoid base pairing with adenine in the crRNA, hence allowing interference by G2. The fact that interference is supported by G3 on its own and, moreover, that the strongest interference is observed when this guanine is present is enigmatic, because it is excluded from the PAM region of CRISPR-4 protospacers of E. coli (only 3 out of 36 protospacers carry it) and, notably, also of Shewanella spp. This apparent paradox could be explained from a biological perspective: strong interference might be less advantageous for the cell than relaxed interference, which would provide the opportunity for harmless and possibly beneficial foreign DNA to be acquired and maintained. In this vein, we defined in a previous work the PAMs of 6 CRISPR repeat types. CRISPR-4 was the only one associated with the PAM G1G2, but CRISPR-1 and CRISPR-7 repeats, which like CRISPR-4 were linked to Type I cas genes forming subtype I-B and I-A CRISPR-Cas systems respectively, had N1G2G3. It is suggestive that the most active interference motif of CRISPR-4 differs from its PAM and coincides with that of closely related repeats. This might reflect an evolutionary alternative that could be of particular benefit for the homogeneous group (mostly within phylogenetic group B2) of E. coli strains containing CRISPR-Cas I-F systems. From a biochemical point of view, the involvement of G3 in interference may simply be a consequence of the same Cas protein(s) being responsible for the detection of guanine in the PAM region during both acquisition (G1G2) and interference (G2G3), the two motifs being displaced by one position with respect to the protospacer, possibly as a consequence of a slight displacement of the relevant site of that protein when it forms part of distinct nucleic acid-protein complexes.
The first report documenting interference activity by a CRISPR-Cas I-F system has recently been published for Pseudomonas aeruginosa (strain PA14). The cas gene content and layout of the CRISPR-Cas I-F loci in P. aeruginosa PA14 and E. coli LF82 (hereinafter referred to as the PaeIF and EcoIF systems respectively) are alike. Moreover, the percentage of amino acid identity between the Cas proteins of both systems (from about 40% to 65%) concurs with the phylogeny of the species. Yet the consensus CRISPR sequences are very similar (26/28 nt identity). Such relatedness between the two systems anticipates further mechanistic and functional analogy. Cady and collaborators have now experimentally confirmed by spacer acquisition assays that sequences adjacent to the predicted PAM (GG) are selected as spacer precursors by this system, and reported interference when this motif was present. Furthermore, in good agreement with our data, of three spacers conferring protection against phage infection whose interference efficiency was estimated in P. aeruginosa, the target of the spacer showing the strongest interference adjoins G1G2G3. However, while our competition assays did not reveal a substantial effect of G to A substitutions at position 1, interference-evading phages were obtained during infection of P. aeruginosa with phages targeted by a PaeIF spacer in which this nucleotide replacement was the only change observed in the protospacer region. This observation may suggest that, in contrast to EcoIF, G1 is essential for interference by the PaeIF system. Nevertheless, one evading phage lacking mutations in the target gene was also detected in this set of experiments, implying that a different cause is responsible for this resistance phenotype. That could also apply to the G1 to A1 change. A systematic analysis akin to the one employed in this work (i.e. independent of selection) would be required to confirm the requirement of G1 for interference by the PaeIF system.
In conclusion, the PAM of the naturally active I-F system of E. coli differs from the interference motifs. The presence of just one particular nucleotide in the PAM sustains immunity and adjacent positions affect its activity. These results could apply to other CRISPR-Cas systems, explaining why different targets with the same PAM show varied susceptibility.
Materials and Methods
E. coli Strains and Plasmids
E. coli strain LF82 used as a host for competition tests belongs to the phylogenetic group B2  and contains a complete CRISPR-Cas I-F system.
Plasmids used in this work are described in Table S1. pCR2.1 (Invitrogen) is a high-copy-number cloning vector that confers resistance to ampicillin and kanamycin. pCR2.1 derivative plasmids carrying protospacer #1 and diverse adjacent sequences were constructed by ligation of PCR fragments, obtained with partially complementary oligonucleotide pairs, to the 3′-T overhangs of the linearized vector as supplied by the manufacturer (see Tables S1 and S2). PCR reactions were performed using Taq DNA polymerase (Roche) in a Mastercycler Gradient thermal cycler (Eppendorf, Wesseling-Berzdorf, Germany). Ligation of DNA fragments was performed with T4 Ligase (Roche) following the recommendations of the manufacturer and restriction enzymes were purchased from Fermentas.
Plasmid DNA Purification and Quality Analysis
Plasmids were purified with the High Pure Plasmid Isolation Kit (Roche) following the manufacturer’s instructions. The DNA concentration and purity of samples were estimated with a Nanodrop ND-1000 (Nanodrop Technologies) and plasmid topology was analyzed by UV visualization of samples in EtBr-stained agarose gels after electrophoresis in 1×TAE buffer.
Transformations were carried out by electroporation (2.45 kV, 25 µF, 200 Ω) using an Electroporator 2510 (Eppendorf). Electrocompetent cells were prepared following the procedure described by Shi et al. Transformant colonies carrying pCR2.1 and derived plasmids were selected on LB agar plates containing 100 µg/ml ampicillin.
Definition of the CRISPR-4 Protospacer Adjacent Motif (PAM) of E. coli
For the identification of the E. coli CRISPR-4 protospacer adjacent motif (PAM), regions of non-CRISPR loci containing sequences with over 90% identity to spacers of CRISPR-Cas I-F systems of E. coli strains were searched with the BLASTN program run against the nr/nt database at the NCBI Website (http://blast.ncbi.nlm.nih.gov/Blast.cgi). When protospacers of different origin were found for a single spacer, the sequence with the highest identity was selected for the analysis. Spacers were detected in E. coli sequences available through the coliBASE Website (http://www.xbase.ac.uk/colibase/) and in GenBank, using the CRISPR Finder application at http://crispr.u-psud.fr/. The DNA strands carrying the protospacer nucleotides complementary to the corresponding spacer sequence in the crRNA were aligned using the WebLogo application at http://weblogo.berkeley.edu/logo.cgi/ to obtain sequence logos. The ends of the protospacers were used as reference for alignments and no gaps were introduced.
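The identity criterion described above (over 90% identity, highest-identity candidate retained, no gaps) can be illustrated with a simple ungapped comparison. This sketch is a simplification and not a substitute for BLASTN, which reports identity over local alignments; the function names and the example sequences are hypothetical.

```python
def percent_identity(spacer, candidate):
    """Percentage of identical positions between two equally long sequences."""
    if len(spacer) != len(candidate):
        raise ValueError("ungapped comparison requires equal lengths")
    matches = sum(a == b for a, b in zip(spacer.upper(), candidate.upper()))
    return 100.0 * matches / len(spacer)


def best_protospacer(spacer, candidates, cutoff=90.0):
    """Return the candidate with the highest identity above the cutoff, or None."""
    scored = [(percent_identity(spacer, c), c) for c in candidates]
    passing = [item for item in scored if item[0] > cutoff]
    return max(passing)[1] if passing else None


# Hypothetical 32-nt spacer and candidates, invented for illustration.
spacer = "ACGT" * 8
two_mismatches = "T" + spacer[1:5] + "A" + spacer[6:]
four_mismatches = "TTTT" + spacer[4:]   # differs at positions 0, 1 and 2 only
print(percent_identity(spacer, two_mismatches))
print(best_protospacer(spacer, [two_mismatches, "T" * 32]))
```

When several candidates pass the cutoff, `max` keeps the one with the highest identity, mirroring the selection rule stated in the text; ties are broken arbitrarily by sequence order, which is a choice of this sketch.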
Competition Test Used for the Detection of Interference Activity
Interference by the native spacer #1 of E. coli strain LF82 was explored by competition tests. Briefly, in each experiment, LF82 cells were electroporated with plasmid mixtures (extracted from that strain) composed of the pCR2.1 vector and a derived construct carrying a sequence identical to spacer #1 (target plasmid). The DNA concentration, purity and proportion of the three topological forms (i.e. supercoiled, relaxed open circle and full-length linear) of each plasmid in each mixture were equivalent. Interference activity was estimated as the proportion of transformants carrying the target plasmid with respect to those carrying pCR2.1, established by PCR screening. In the absence of interference, 50% of colonies would be expected to harbour each plasmid. If spacer #1 produces interference against the protospacer carrier, a lower percentage of cells transformed with this plasmid would be obtained, decreasing as interference activity increases. For each plasmid pair, three independent electroporation experiments, using different plasmid preparations and stocks of freshly prepared electrocompetent cells, were carried out. Twenty colonies were randomly selected from each experiment for PCR amplification. PCR reactions were performed under standard conditions using primers T7 (5′ GTAATACGACTCACTATAGGGC 3′) and M13 (5′ GGAAACAGCTATGACCATG 3′).
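The arithmetic behind the competition test can be sketched as follows: for each experiment, the colonies scored by PCR are split into target-plasmid and pCR2.1 carriers, and the resulting fraction and ratio are averaged over the independent experiments. The function name and the colony counts below are hypothetical and are not data from this study; the sketch assumes every screened colony carries exactly one of the two plasmids.

```python
from statistics import mean, stdev


def competition_summary(experiments):
    """Summarize competition tests from (target, vector) colony counts.

    Returns the mean and standard deviation of the fraction of colonies
    carrying the target plasmid and of the target-to-vector ratio.
    Without interference the expected fraction is 0.5 and the ratio is 1.
    """
    fractions, ratios = [], []
    for target, vector in experiments:
        if vector == 0:
            raise ValueError("ratio undefined without vector-carrying colonies")
        fractions.append(target / (target + vector))
        ratios.append(target / vector)
    return {
        "mean_fraction": mean(fractions),
        "sd_fraction": stdev(fractions),
        "mean_ratio": mean(ratios),
        "sd_ratio": stdev(ratios),
    }


# Hypothetical counts for three experiments of 20 colonies each.
hypothetical = [(6, 14), (5, 15), (7, 13)]
summary = competition_summary(hypothetical)
for key, value in summary.items():
    print(f"{key}: {value:.3f}")
```

A mean ratio well below 1 would point to interference, but whether a given difference is significant still has to be established with the statistical tests described in this section.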
Reverse Transcription Polymerase Chain Reaction Analysis (RT-PCR)
Total RNA was isolated from E. coli LF82 cells using the Trizol reagent (Invitrogen) as indicated by the manufacturer. About 200 ng of total RNA was reverse transcribed with random hexamer primers and the SuperScript III reverse transcriptase provided in the SuperScript III kit (Invitrogen), according to the supplied protocol. cDNA was PCR amplified using primers for the cas1, csy1 and tufB genes (see Table S2). RT-PCR products were analyzed by UV visualization of EtBr-stained agarose gels.
Plasmid constructions were verified by sequencing with the Big Dye Terminator Cycle Sequencing kit in an ABI PRISM 310 DNA Sequencer following the manufacturer’s instructions (Servicios Técnicos de Investigación, Universidad de Alicante, Spain).
Statistical analyses (ANOVA, Kruskal-Wallis and Mann-Whitney U tests) were performed using SPSS software version 17.0 (SPSS Inc., Chicago, IL, USA). A p-value of less than 0.05 was considered significant.
Sequence of protospacer regions of E. coli CRISPR-4 spacers used to generate the WebLogo shown on Figure 1. The protospacer sequences are underlined and mismatches with respect to the corresponding spacer are labeled in red. Nucleotides matching the PAM are bolded. Protospacer regions of LF82 spacers are marked with an asterisk.
WebLogo generated by the alignment of 168 E. coli CRISPR-4 spacers.
Schematic representation of the strategy used for synthesizing artificial CRISPR-4 arrays carrying a spacer identical to a P1 sequence. The construction of the fragment carrying spacer P1.1 is shown to exemplify the general procedure. The leader of the CRISPR4.1 array, repeats and spacers are shown as green, black and blue boxes respectively. Relevant restriction sites as well as primers used for amplification of the leader-CRISPR region of the CRISPR4.1 of ED1a (C1.F and C2.R) and for synthesizing a fragment containing spacer P1.1 and a CRISPR unit (C3P1.F and C11.R) are indicated. Vertical lines connecting the 3′ ends of primers C3P1.F and C11.R illustrate sequence complementarity in this region.
Plasmids constructed in this work carrying inserts with protospacer#1 and distinct PAM regions.
Primers used for PCR reactions.
We thank Arlette Darfeuille-Michaud (Clermont Université, Université d’Auvergne, France) for strain LF82.
Conceived and designed the experiments: FJMM CA CDV. Performed the experiments: CA NMG. Analyzed the data: CA CDV JGM FJMM. Wrote the paper: CA FJMM.
- 1. Grissa I, Vergnaud G, Pourcel C (2007) The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats. BMC Bioinformatics 8: 172. doi: 10.1186/1471-2105-8-172
- 2. Kunin V, Sorek R, Hugenholtz P (2007) Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biology 8: R61. doi: 10.1186/gb-2007-8-4-r61
- 3. Mojica FJ, Díez-Villaseñor C, Soria E, Juez G (2000) Biological significance of a family of regularly spaced repeats in the genomes of Archaea, Bacteria and mitochondria. Mol Microbiol 36: 244–246. doi: 10.1046/j.1365-2958.2000.01838.x
- 4. Jansen R, Embden JD, Gaastra W, Schouls LM (2002) Identification of genes that are associated with DNA repeats in prokaryotes. Mol Microbiol 43: 1565–1575. doi: 10.1046/j.1365-2958.2002.02839.x
- 5. Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, et al. (2007) CRISPR provides acquired resistance against viruses in prokaryotes. Science 315: 1709–1712. doi: 10.1126/science.1138140
- 6. Datsenko KA, Pougach K, Tikhonov A, Wanner BL, Severinov K, et al. (2012) Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system. Nat Commun 3: 945. doi: 10.1038/ncomms1937
- 7. Erdmann S, Garrett RA (2012) Selective and hyperactive uptake of foreign DNA by adaptive immune systems of an archaeon via two distinct mechanisms. Mol Microbiol 85: 1044–1056. doi: 10.1111/j.1365-2958.2012.08171.x
- 8. Swarts DC, Mosterd C, van Passel MW, Brouns SJ (2012) CRISPR interference directs strand specific spacer acquisition. PLoS One 7: e35888. doi: 10.1371/journal.pone.0035888
- 9. Yosef I, Goren MG, Qimron U (2012) Proteins and DNA elements essential for the CRISPR adaptation process in Escherichia coli. Nucleic Acids Res 40: 5569–5576. doi: 10.1093/nar/gks216
- 10. Haft DH, Selengut J, Mongodin EF, Nelson KE (2005) A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput Biol 1: 474–483. doi: 10.1371/journal.pcbi.0010060
- 11. Makarova KS, Grishin NV, Shabalina SA, Wolf YI, Koonin EV (2006) A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol Direct 1: 7.
- 12. Makarova KS, Haft DH, Barrangou R, Brouns SJJ, Charpentier E, et al. (2011) Evolution and classification of the CRISPR-Cas systems. Nat Rev Microbiol 9: 467–477. doi: 10.1038/nrmicro2577
- 13. Bikard D, Marraffini LA (2012) Innate and adaptive immunity in bacteria: mechanisms of programmed genetic variation to fight bacteriophages. Curr Opin Immunol 24: 15–20. doi: 10.1016/j.coi.2011.10.005
- 14. Wiedenheft B, Sternberg SH, Doudna JA (2012) RNA-guided genetic silencing systems in bacteria and archaea. Nature 482: 331–338. doi: 10.1038/nature10886
- 15. Lillestøl RK, Redder P, Garrett RA, Brügger K (2006) A putative viral defence mechanism in archaeal cells. Archaea 2: 59–72. doi: 10.1155/2006/542818
- 16. Brouns SJ, Jore MM, Lundgren M, Westra ER, Slijkhuis RJ, et al. (2008) Small CRISPR RNAs guide antiviral defense in prokaryotes. Science 321: 960–964. doi: 10.1126/science.1159689
- 17. Pougach K, Semenova E, Bogdanova E, Datsenko KA, Djordjevic M, et al. (2010) Transcription, processing and function of CRISPR cassettes in Escherichia coli. Mol Microbiol 77: 1367–1379. doi: 10.1111/j.1365-2958.2010.07265.x
- 18. Pul U, Wurm R, Arslan Z, Geissen R, Hofmann N, et al. (2010) Identification and characterization of E. coli CRISPR-cas promoters and their silencing by H-NS. Mol Microbiol 75: 1495–1512. doi: 10.1111/j.1365-2958.2010.07073.x
- 19. Westra ER, Pul U, Heidrich N, Jore MM, Lundgren M, et al. (2010) H-NS-mediated repression of CRISPR-based immunity in Escherichia coli K12 can be relieved by the transcription activator LeuO. Mol Microbiol 77: 1380–1393. doi: 10.1111/j.1365-2958.2010.07315.x
- 20. Wurtzel O, Sapra R, Chen F, Zhu Y, Simmons BA, et al. (2010) A single-base resolution map of an archaeal transcriptome. Genome Res 20: 133–141. doi: 10.1101/gr.100396.109
- 21. Bolotin A, Quinquis B, Sorokin A, Ehrlich SD (2005) Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology 151: 2551–2561. doi: 10.1099/mic.0.28048-0
- 22. Deveau H, Barrangou R, Garneau JE, Labonte J, Fremaux C, et al. (2008) Phage response to CRISPR-encoded resistance in Streptococcus thermophilus. J Bacteriol 190: 1390–1400. doi: 10.1128/jb.01412-07
- 23. Horvath P, Romero DA, Coute-Monvoisin AC, Richards M, Deveau H, et al. (2008) Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus. J Bacteriol 190: 1401–1412. doi: 10.1128/jb.01415-07
- 24. Lillestøl RK, Shah SA, Brügger K, Redder P, Phan H, et al. (2009) CRISPR families of the crenarchaeal genus Sulfolobus: bidirectional transcription and dynamic properties. Mol Microbiol 72: 259–272. doi: 10.1111/j.1365-2958.2009.06641.x
- 25. Mojica FJM, Díez-Villaseñor C, García-Martínez J, Almendros C (2009) Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology 155: 733–740. doi: 10.1099/mic.0.023960-0
- 26. Semenova E, Nagornykh M, Pyatnitskiy M, Artamonova II, Severinov K (2009) Analysis of CRISPR system function in plant pathogen Xanthomonas oryzae. FEMS Microbiol Lett 296: 110–116. doi: 10.1111/j.1574-6968.2009.01626.x
- 27. Gudbergsdottir S, Deng L, Chen Z, Jensen JVK, Jensen LR, et al. (2011) Dynamic properties of the Sulfolobus CRISPR/Cas and CRISPR/Cmr systems when challenged with vector-borne viral and plasmid genes and protospacers. Mol Microbiol 79: 35–49. doi: 10.1111/j.1365-2958.2010.07452.x
- 28. Garneau JE, Dupuis ME, Villion M, Romero DA, Barrangou R, et al. (2010) The CRISPR/cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature 468: 67–71. doi: 10.1038/nature09523
- 29. Semenova E, Jore MM, Datsenko KA, Semenova A, Westra ER, et al. (2011) Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc Natl Acad Sci U S A 108: 10098–10103. doi: 10.1073/pnas.1104144108
- 30. Díez-Villaseñor C, Almendros C, García-Martínez J, Mojica FJM (2010) Diversity of CRISPR loci in Escherichia coli. Microbiology 156: 1351–1361. doi: 10.1099/mic.0.036046-0
- 31. Westra ER, van Erp PB, Künne T, Wong SP, Staals RH, et al. (2012) CRISPR immunity relies on the consecutive binding and degradation of negatively supercoiled invader DNA by Cascade and Cas3. Mol Cell 46: 595–605. doi: 10.1016/j.molcel.2012.03.018
- 32. Zegans ME, Wagner JC, Cady KC, Murphy DM, Hammond JH, et al. (2009) Interaction between bacteriophage DMS3 and host CRISPR region inhibits group behaviors of Pseudomonas aeruginosa. J Bacteriol 191: 210–219. doi: 10.1128/jb.00797-08
- 33. Haurwitz RE, Jinek M, Wiedenheft B, Zhou K, Doudna JA (2010) Sequence- and structure-specific RNA processing by a CRISPR endonuclease. Science 329: 1355–1358. doi: 10.1126/science.1192272
- 34. Cady KC, O’Toole GA (2011) Non-identity-mediated CRISPR-bacteriophage interaction mediated via the Csy and Cas3 proteins. J Bacteriol 193: 3433–3445. doi: 10.1128/jb.01411-10
- 35. Wiedenheft B, van Duijn E, Bultema JB, Waghmare SP, Zhou K, et al. (2011) RNA-guided complex from a bacterial immune system enhances target recognition through seed sequence interactions. Proc Natl Acad Sci U S A 108: 10092–10097. doi: 10.1073/pnas.1102716108
- 36. Cady KC, Bondy-Denomy J, Heussler GE, Davidson AR, O’Toole GA (2012) The CRISPR/Cas adaptive immune system of Pseudomonas aeruginosa mediates resistance to naturally occurring and engineered phages. J Bacteriol 194: 5728–5738. doi: 10.1128/jb.01184-12
- 37. Haurwitz RE, Sternberg SH, Doudna JA (2012) Csy4 relies on an unusual catalytic dyad to position and cleave CRISPR RNA. EMBO J 31: 2824–2832. doi: 10.1038/emboj.2012.107
- 38. Jore MM, Lundgren M, Van Duijn E, Bultema JB, Westra ER, et al. (2011) Structural basis for CRISPR RNA-guided DNA recognition by Cascade. Nat Struct Mol Biol 18: 529–536. doi: 10.1038/nsmb.2019
- 39. van Duijn E, Barbu IM, Barendregt A, Jore MM, Wiedenheft B, et al. (2012) Native tandem and ion mobility mass spectrometry highlight structural and modular similarities in CRISPR-associated protein complexes from Escherichia coli and Pseudomonas aeruginosa. Mol Cell Proteomics. doi: 10.1074/mcp.M112.02026.
- 40. Mojica FJM, Díez-Villaseñor C (2010) The on-off switch of CRISPR immunity against phages in Escherichia coli. Mol Microbiol 77: 1341–1345. doi: 10.1111/j.1365-2958.2010.07326.x
- 41. Miquel S, Peyretaillade E, Claret L, de Vallée A, Dossat C, et al. (2010) Complete genome sequence of Crohn’s disease-associated adherent-invasive E. coli strain LF82. PLoS One 5: e12714. doi: 10.1371/journal.pone.0012714
- 42. Marraffini LA, Sontheimer EJ (2010) Self versus non-self discrimination during CRISPR RNA-directed immunity. Nature 463: 568–571. doi: 10.1038/nature08703
- 43. Fischer S, Maier LK, Stoll B, Brendel J, Fischer E, et al. (2012) An archaeal immune system can detect multiple protospacer adjacent motifs (PAMs) to target invader DNA. J Biol Chem 287: 33351–33363. doi: 10.1074/jbc.m112.377002
- 44. Brown JR, Volker C (2004) Phylogeny of gamma-proteobacteria: resolution of one branch of the universal tree? BioEssays 26: 463–468. doi: 10.1002/bies.20030
- 45. Shi X, Karkut T, Alting-Mees M, Chamankhah M, Hemmingsen SM, et al. (2003) Enhancing Escherichia coli electrotransformation competency by invoking physiological adaptations to stress and modifying membrane integrity. Anal Biochem 320: 152–155. doi: 10.1016/s0003-2697(03)00352-x
- 46. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402. doi: 10.1093/nar/25.17.3389
- 47. Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14: 1188–1190. doi: 10.1101/gr.849004