CRISPR/CAS-RELATED METHODS AND COMPOSITIONS FOR TREATING SICKLE CELL DISEASE
CRISPR/CAS-related compositions and methods for treatment of Sickle Cell Disease (SCD) are disclosed.
Latest EDITAS MEDICINE, INC. Patents:
- Genome editing systems comprising repair-modulating enzyme molecules and methods of their use
- COMPOSITIONS AND METHODS FOR TREATING CEP290-ASSOCIATED DISEASE
- Compositions and methods for treating CEP290 associated disease
- CRISPR/CAS-RELATED METHODS AND COMPOSITIONS FOR TREATING LEBER'S CONGENITAL AMAUROSIS 10 (LCA10)
- CRISPR-RELATED METHODS AND COMPOSITIONS WITH GOVERNING gRNAS
The present application is a continuation of U.S. application Ser. No. 15/129,367, filed Sep. 26, 2016, which is a national phase of International Application No. PCT/US2015/022856, filed Mar. 26, 2015, which claims the benefit of U.S. Provisional Application No. 61/970,588, filed Mar. 26, 2014, and U.S. Provisional Application No. 62/084,487, filed Nov. 25, 2014, the contents of each of which are hereby incorporated by reference in their entirety.
FIELD OF THE INVENTIONThe invention relates to CRISPR/CAS-related methods and components for editing of a target nucleic acid sequence, or modulating expression of a target nucleic acid sequence, and applications thereof in connection with Sickle Cell Disease (SCD).
SEQUENCE LISTINGThis application contains a Sequence Listing, which was submitted in ASCII format via EFS-Web, and is hereby incorporated by reference in its entirety. The ASCII copy, created on Jan. 19, 2022, is named SequenceListing.txt and is 3,905 KB in size.
BACKGROUNDSickle Cell Disease (SCD), also known as Sickle Cell Anemia (SCA), is a common inherited hematologic disease. It affects 80,000-90,000 people in the United States. It is common in people of African descent and in Hispanic-Americans with the prevalence of SCD being 1 in 500 and 1 in 1,000, respectively.
SCD is caused by a mutation in the beta-globin (HBB) gene. HBB is located on chromosome 11 within the HBB gene cluster, which includes genes encoding the delta globin chain, A gamma chain, G gamma chain. The alpha-globin gene is located on chromosome 16. A point mutation (e.g., GAG→GTG) results in the substitution of valine for glutamic acid at amino acid position 6 in exon 1 of the HBB gene. Beta hemoglobin chains with this mutation are expressed as HbS. The disease is inherited in an autosomal recessive manner, so that only patients with two HbS alleles have SCD. Subjects who have sickle cell trait (are heterozygous for HbS) only display a phenotype if they are severely dehydrated or oxygen deprived.
Normal adult hemoglobin (Hb) is composed of a tetramer made from two alpha-globin chains and two beta-globin chains. In SCD, the valine at position 6 of the beta-chain is hydrophobic and causes a change in conformation of the beta-globin protein when it is not bound to oxygen. HbS is more likely to polymerize and leads to the characteristic sickle shaped red blood cells (RBCs) found in SCD.
Sickle shape RBCs cause multiple manifestations of disease, which include, e.g., anemia, sickle cell crises, vaso-occlusive crises, aplastic crises and acute chest syndrome. The disease has varous manifestations, e.g., vaso-occlusive crisis, splenic sequestration crisis and anemia. Subjects may also suffer from acute chest crisis and infarcts of extremities, end organs and central nervous system. Treatment includes, e.g., hydration, transfusion and analgesics. Treatment of SCD also includes, e.g., the use of hydroxyurea, supplementation with folic acid, and penicillin prophylaxis during childhood. Bone marrow transplants have been demonstrated to cure SCD.
Thus, there remains a need for additional methods and compositions that can be used to treat SCD.
SUMMARY OF THE INVENTIONMethods and compositions discussed herein, provide for the treatment and prevention of Sickle Cell Disease (SCD), also known as Sickle Cell Anemia (SCA). SCD is an inherited hematologic disease.
In healthy individuals, two beta-globin molecules pair with two alpha-globin molecules to form normal hemoglobin (Hb). In SCD, mutations in the beta-globin (HBB) gene, e.g., a point mutation (GAG→GTG) that results in the substitution of valine for glutamic acid at amino acid position 6 of the beta-globin molecule, cause production of sickle hemoglobin (HbS). HbS is more likely to polymerize and leads to the characteristic sickle shaped red blood cells (RBCs). Sickle shaped RBCs give rise to multiple manifestations of disease, such as, anemia, sickle cell crises, vaso-occlusive crises, aplastic crises and acute chest syndrome. Alpha-globin can also pair with fetal hemoglobin (HbF), which significantly moderates the severe anemia and other symptoms of SCD. However, the expression of HbF is negatively regulated by the BCL11A gene product.
Methods and compositions disclosed herein provide a number of approaches for treating SCD. As is discussed in more detail below, methods described herein provide for treating SCD by correcting a target position in the HBB gene to provide corrected, or functional, e.g., wild type, beta-globin. Methods and compositions discussed herein can be used to treat or prevent SCD by altering the BCL11A gene (also known as B-cell CLL/lymphoma 11A, BCL11A-L, BCL11A-S, BCL11A-XL, CTIP1, HBFQTL5 and ZNF). BCL11A encodes a zinc-finger protein that is involved in the regulation of globin gene expression. By altering the BCL11A gene (e.g., one or both alleles of the BCL11A gene), the levels of gamma globin can be increased. Gamma globin can replace beta globin in the hemoglobin complex and effectively carry oxygen to tissues, thereby ameliorating SCD disease phenotypes.
In one aspect, methods and compositions discussed herein, provide for the correction of the underlying genetic cause of SCD, e.g., the correction of a mutation at a target position in the HBB gene, e.g., correction of a mutation at amino acid position 6, e.g., an E6V substitution in the HBB gene.
Mutations in the HBB gene (also known as beta-globin and CD113t-C) have been shown to cause SCD. Mutations leading to SCD can be described based on their target positions in the HBB gene. In an embodiment, the target position is E6, e.g., E6V, in the HBB gene.
“SCD target point position”, as used herein, refers to a target position in the HBB gene, typically a single nucleotide, which, if mutated, can result in a protein having a mutant amino acid and give rise to SCD. In an embodiment, the SCD target position is the target position at which a change can give rise to an E6 mutant protein, e.g., a protein having an E6V substitution.
While much of the disclosure herein is presented in the context of the mutation in the HBB gene that gives rise to an E6 mutant protein (e.g., E6V mutant protein), the methods and compositions herein are broadly applicable to any mutation, e.g., a point mutation or a deletion, in the HBB gene that gives rise to SCD.
While not wishing to be bound by theory, it is believed that, in an embodiment, a mutation at an SCD target point position in the HBB gene is corrected, e.g., by homology directed repair (HDR), as described herein.
In one aspect, methods and compositions discussed herein may be used to alter the BCL11A gene to treat or prevent SCD, by targeting the BCL11A gene, e.g., coding or non-coding regions of the BCL11A gene. Altering the BCL11A gene herein refers to reducing or eliminating (1) BCL11A gene expression, (2) BCL11A protein function, or (3) the level of BCL11A protein.
In an embodiment, the coding region (e.g., an early coding region) of the BCL11A gene is targeted for alteration. In an embodiment, a non-coding sequence (e.g., an enhancer region, a promoter region, an intron, 5′UTR, 3′UTR, or polyadenylation signal) is targeted for alteration.
In an embodiment, the method provides an alteration that comprises disrupting the BCL11A gene by the insertion or deletion of one or more nucleotides mediated by Cas9 (e.g., enzymatically active Cas9 (eaCas9), e.g., Cas9 nuclease or Cas9 nickase) as described below. This type of alteration is also referred to as “knocking out” the BCL11A gene.
In another embodiment, the method provides an alteration that does not comprise nucleotide insertion or deletion in the BCL11A gene and is mediated by enzymatically inactive Cas9 (eiCas9) or an eiCas9-fusion protein, as described below. This type of alteration is also referred to as “knocking down” the BCL11A gene.
In an embodiment, the methods and compositions discussed herein may be used to alter the BCL11A gene to treat or prevent SCD by knocking out one or both alleles of the BCL11A gene. In an embodiment, the coding region (e.g., an early coding region) of the BCL11A gene, is targeted to alter the gene. In an embodiment, a non-coding region of the BCL11A gene (e.g., an enhancer region, a promoter region, an intron, 5′ UTR, 3′UTR, polyadenylation signal) is targeted to alter the gene. In an embodiment, an enhancer (e.g., a tissue-specific enhancer, e.g., a myeloid enhancer, e.g., an erythroid enhancer) is targeted to alter the gene. BCL11A erythroid enhancer comprises an approximate 12.4 kb fragment of BCL11A intron2, located between approximate+52.0 to +64.4 kilobases (kb) from the Transcription Start Site (TSS+52 kb to TSS+64.4 kb, see
“SCD target knockout position”, as used herein, refers to a position in the BCL11A gene, which if altered, e.g., disrupted by insertion or deletion of one or more nucleotides, e.g., by NHEJ-mediated alteration, results in reduction or elimination of expression of functional BCL11A gene product. In an embodiment, the position is in the BCL11A coding region, e.g., an early coding region. In an embodiment, the position is in the BCL11A non-coding region, e.g., an enhancer region.
In an embodiment, methods and compositions discussed herein, provide for altering (e.g., knocking out) the BCL11A gene. In an embodiment, knocking out the BCL11A gene herein refers to (1) insertion or deletion (e.g., NHEJ-mediated insertion or deletion) of one or more nucleotides in close proximity to or within the early coding region of the BCL11A gene, or (2) deletion (e.g., NHEJ-mediated deletion) of a genomic sequence including the erythroid enhancer of the BCL11A gene,
In an embodiment, the SCD target knockout position is altered by genome editing using the CRISPR/Cas9 system. The SCD target knockout position may be targeted by cleaving with either a single nuclease or dual nickases, e.g., to induce insertion or deletion (e.g., NHEJ-mediated insertion or deletion) of one or more nucleotides in close proximity to or within the early coding region of the SCD target knockout position or to delete (e.g., mediated by NHEJ) a genomic sequence including the erythroid enhancer of the BCL11A gene.
In an embodiment, the methods and compositions described herein introduce one or more breaks in close proximity to or within the early coding region in at least one allele of the BCL11A gene. In an embodiment, a single strand break is introduced in close proximity to or within the early coding region in at least one allele of the BCL11A gene. In an embodiment, the single strand break will be accompanied by an additional single strand break, positioned by a second gRNA molecule.
In an embodiment, a double strand break is introduced in close proximity to or within the early coding region in at least one allele of the BCL11A gene. In an embodiment, a double strand break will be accompanied by an additional single strand break positioned by a second gRNA molecule. In an embodiment, a double strand break will be accompanied by two additional single strand breaks positioned by a second gRNA molecule and a third gRNA molecule.
In an embodiment, a pair of single strand breaks is introduced in close proximity to or within the early coding region in at least one allele of the BCL11A gene. In an embodiment, the pair of single strand breaks will be accompanied by an additional double strand break, positioned by a third gRNA molecule. In an embodiment, the pair of single strand breaks will be accompanied by an additional pair of single strand breaks positioned by a third gRNA molecule and a fourth gRNA molecule.
In an embodiment, two double strand breaks are introduced to flank the erythroid enhancer at the in the BCL11A gene (one 5′ and the other one 3′ to the erythroid enhancer) to remove (e.g., delete) the genomic sequence including the erythroid enhancer. It is contemplated herein that in an embodiment the deletion of the genomic sequence including the erythroid enhancer is mediated by NHEJ. In an embodiment, the breaks (i.e., the two double strand breaks) are positioned to avoid unwanted deletion of certain elements, such as endogenous splice sites. The breaks, i.e., two double strand breaks, can be positioned upstream and downstream of the erythroid enhancer, as discussed herein.
In an embodiment, two sets of breaks (e.g., one double strand break and a pair of single strand breaks) are introduced to flank the erythroid enhancer in the BCL11A gene (one set 5′ and the other set 3′ to the erythroid enhancer) to remove (e.g., delete) the genomic sequence including the erythroid enhancer. It is contemplated herein that in an embodiment the deletion of the genomic sequence including the erythroid enhancer is mediated by NHEJ. In an embodiment, the breaks (i.e., the double strand break and the pair of single strand breaks) are positioned to avoid unwanted deletion of certain chromosome elements, such as endogenous splice sites. The breaks, e.g., the double strand break and the pair of single strand breaks, can be positioned upstream and downstream of the erythroid enhancer, as discussed herein.
In an embodiment, two sets of breaks (e.g., two pairs of single strand breaks) are introduced to flank the erythroid enhancer at the SCD target position in the BCL11A gene (one set 5′ and the other set 3′ to the erythroid enhancer) to remove (e.g., delete) the genomic sequence including the erythroid enhancer. It is contemplated herein that in an embodiment the deletion of the genomic sequence including the erythroid enhancer is mediated by NHEJ. In an embodiment, the breaks (i.e., the two pairs of single strand breaks) are positioned to avoid unwanted deletion of certain chromosome elements, such as endogenous splice sites. The breaks, e.g., the two pairs of single strand breaks, can be positioned upstream and downstream of the erythroid enhancer, as discussed herein.
In an embodiment, the methods and compositions discussed herein may be used to alter the BCL11A gene to treat or prevent SCD by knocking down one or both alleles of the BCL11A gene. In one embodiment, the coding region of the BCL11A gene, is targeted to alter the gene. In another embodiment, a non-coding region (e.g., an enhancer region, a promoter region, an intron, 5′ UTR, 3′UTR, polyadenylation signal) of the BCL11A gene is targeted to alter the gene. In an embodiment, the promoter region of the BCL11A gene is targeted to knock down the expression of the BCL11A gene. A targeted knockdown approach alters, e.g., reduces or eliminates the expression of the BCL11A gene. As described herein, in an embodiment, a targeted knockdown is mediated by targeting an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fused to a transcription repressor domain or chromatin modifying protein to alter transcription, e.g., to block, reduce, or decrease transcription, of the BCL11A gene.
“SCD target knockdown position”, as used herein, refers to a position, e.g., in the BCL11A gene, which if targeted by an eiCas9 or an eiCas9 fusion described herein, results in reduction or elimination of expression of functional BCL11A gene product. In an embodiment, transcription is reduced or eliminated. In an embodiment, the position is in the BCL11A promoter sequence. In an embodiment, a position in the promoter sequence of the BCL11A gene is targeted by an enzymatically inactive Cas9 (eiCas9) or an eiCas9-fusion protein, as described herein.
In an embodiment, one or more gRNA molecule comprising a targeting domain configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to a SCD target knockdown position to reduce, decrease or repress expression of the BCL11A gene.
“SCD target position”, as used herein, refers to any of an SCD target point position, SCD target knockout position, or SCD target knockdown position, as described herein.
In one aspect, disclosed herein is a gRNA molecule, e.g., an isolated or non-naturally occurring gRNA molecule, comprising a targeting domain which is complementary with a target domain from the HBB gene or BCL11A gene.
When two or more gRNAs are used to position two or more cleavage events, e.g., double strand or single strand breaks, in a target nucleic acid, it is contemplated that the two or more cleavage events may be made by the same or different Cas9 proteins. For example, when two gRNAs are used to position two double strand breaks, a single Cas9 nuclease may be used to create both double strand breaks. When two or more gRNAs are used to position two or more single stranded breaks (single strand breaks), a single Cas9 nickase may be used to create the two or more single strand breaks. When two or more gRNAs are used to position at least one double strand break and at least one single strand break, two Cas9 proteins may be used, e.g., one Cas9 nuclease and one Cas9 nickase. It is contemplated that when two or more Cas9 proteins are used that the two or more Cas9 proteins may be delivered sequentially to control specificity of a double strand versus a single strand break at the desired position in the target nucleic acid.
In an embodiment, the targeting domain of the first gRNA molecule and the targeting domain of the second gRNA molecule hybridize to the target domain through complementary base pairing to opposite strands of the target nucleic acid molecule. In an embodiment, the gRNA molecule and the second gRNA molecule are configured such that the PAMs are oriented outward.
In an embodiment, the targeting domain of a gRNA molecule is configured to avoid unwanted target chromosome elements, such as repeat elements, e.g., an Alu repeat, or the endogenous splice sites, in the target domain. The gRNA molecule may be a first, second, third and/or fourth gRNA molecule.
In an embodiment, the targeting domain of a gRNA molecule is configured to position a cleavage event sufficiently far from a preselected nucleotide, e.g., the nucleotide of a coding region, such that the nucleotide is not altered. In an embodiment, the targeting domain of a gRNA molecule is configured to position an intronic cleavage event sufficiently far from an intron/exon border, or naturally occurring splice signal, to avoid alteration of the exonic sequence or unwanted splicing events. The gRNA molecule may be a first, second, third and/or fourth gRNA molecule, as described herein.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 1A-1D. In an embodiment, the targeting domain is selected from those in Tables 1A-1D. For example, in an embodiment, the targeting domain is:
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 1A-1D.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 13A-13D. In an embodiment, the targeting domain is selected from those in Tables 13A-13D. For example, in an embodiment, the targeting domain is:
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 13A-13D.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 14A-14C. In an embodiment, the targeting domain is selected from those in Tables 14A-14C.
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, each guide RNA is selected from one of Tables 14A-14C.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 24A-24D. In an embodiment, the targeting domain is selected from those in Tables 24A-24D.
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 24A-24D.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 25A-25B. In an embodiment, the targeting domain is selected from those in Tables 25A-25B.
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 25A-25B.
In an embodiment, a point mutation in the HBB gene, e.g., at E6, e.g., E6V, is targeted, e.g., for correction. In an embodiment, the targeting domain of a gRNA molecule comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from Table 26. In an embodiment, the targeting domain is selected from those in Table 26.
In an embodiment, when the SCD target point position is E6, e.g., E6V, and two gRNAs are used to position two breaks, e.g., two single stranded breaks, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from Table 26. In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 2A-2F. In an embodiment, the targeting domain is selected from those in Tables 2A-2F. In another embodiment, the targeting domain is:
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single stranded breaks or two double stranded breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 2A-2F.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 4A-4E. In an embodiment, the targeting domain is selected from those in Table 4A-4E. In another embodiment, the targeting domain is:
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 4A-4E.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 5A-5E. In an embodiment, the targeting domain is selected from those in Table 5A-5E.
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 5A-5E.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 6A-6B. In an embodiment, the targeting domain is selected from those in Table 6A-6B.
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 6A-6B.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 15A-15D. In an embodiment, the targeting domain is selected from those in Table 15A-15D.
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 15A-15D.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 16A-16E. In an embodiment, the targeting domain is selected from those in Table 16A-16E.
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 16A-16E.
In another embodiment, a position in the coding region, e.g., the early coding region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Table 17A-17B. In an embodiment, the targeting domain is selected from those in Table 17A-17B.
In an embodiment, when the SCD target knockout position is the BCL11A coding region, e.g., early coding region, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create one or more indels, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 17A-17B.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 7A-7D. In an embodiment, the targeting domain is selected from those in Tables 7A-7D. In another embodiment, the targeting domain is:
In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 7A-7D.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 8A-8D. In an embodiment, the targeting domain is selected from those in Tables 8A-8D.
In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 8A-8D.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from Table 9. In an embodiment, the targeting domain is selected from those in Table 9.
In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 9.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 21A-21E. In an embodiment, the targeting domain is selected from those in Tables 21A-21E. In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 21A-21E.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 22A-22E. In an embodiment, the targeting domain is selected from those in Tables 22A-22E. In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 22A-22E.
In another embodiment, a position in the non-coding region, e.g., the enhancer region, of the BCL11A gene is targeted, e.g., for knockout. In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 23A-23C. In an embodiment, the targeting domain is selected from those in Tables 23A-23C.
In an embodiment, when the SCD target knockout position is the non-coding region, e.g., the enhancer region, of the BCL11A gene, and more than one gRNA is used to position breaks, e.g., two single strand breaks or two double strand breaks, or a combination of single strand and double strand breaks, e.g., to create a deletion, in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Table 23A-23C.
In an embodiment, the targeting domain of the gRNA molecule is configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to an SCD knockdown target position to reduce, decrease or repress expression of the BCL11A gene. In an embodiment, the targeting domain is configured to target the promoter region of the BCL11A gene to block transcription initiation, binding of one or more transcription enhancers or activators, and/or RNA polymerase. One or more gRNA may be used to target an eiCas9 to the promoter region of the BCL11A gene.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 3A-3C. In an embodiment, the targeting domain is selected from those in Tables 3A-3C.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, the targeting domain of each guide RNA is selected from one of Tables 3A-3C.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 10A-10D. In an embodiment, the targeting domain is selected from those in Tables 10A-10D. In another embodiment, the targeting domain is:
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from one of Tables 10A-10D.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 11A-11D. In an embodiment, the targeting domain is selected from those in Tables 11A-11D.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from one of Tables 11A-11D.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from Table 12. In an embodiment, the targeting domain is selected from those in Table 12.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from Table 12.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 18A-18C. In an embodiment, the targeting domain is selected from those in Tables 18A-18C.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from one of Tables 18A-18C.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 19A-19E. In an embodiment, the targeting domain is selected from those in Tables 19A-19E.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from one of Tables 19A-19E.
In an embodiment, when the BCL11A promoter region is targeted, e.g., for knockdown, the targeting domain can comprise a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 20A-20C. In an embodiment, the targeting domain is selected from those in Tables 20A-20C.
In an embodiment, when the SCD target knockdown position is the BCL11A promoter region and more than one gRNA is used to position an eiCas9 or an eiCas9-fusion protein (e.g., an eiCas9-transcription repressor domain fusion protein), in the target nucleic acid sequence, each guide RNA is selected from one of Tables 20A-20C.
In an embodiment, the targeting domain comprises a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence selected from any one of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. In an embodiment, the targeting domain is selected from those in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
In an embodiment, the targeting domain which is complementary with the BCL11A gene is 16 nucleotides or more in length. In an embodiment, the targeting domain is 16 nucleotides in length. In an embodiment, the targeting domain is 17 nucleotides in length. In another embodiment, the targeting domain is 18 nucleotides in length. In still another embodiment, the targeting domain is 19 nucleotides in length. In still another embodiment, the targeting domain is 20 nucleotides in length. In still another embodiment, the targeting domain is 21 nucleotides in length. In still another embodiment, the targeting domain is 22 nucleotides in length. In still another embodiment, the targeting domain is 23 nucleotides in length. In still another embodiment, the targeting domain is 24 nucleotides in length. In still another embodiment, the targeting domain is 25 nucleotides in length. In still another embodiment, the targeting domain is 26 nucleotides in length.
In an embodiment, the targeting domain comprises 16 nucleotides.
In an embodiment, the targeting domain comprises 17 nucleotides.
In an embodiment, the targeting domain comprises 18 nucleotides.
In an embodiment, the targeting domain comprises 19 nucleotides.
In an embodiment, the targeting domain comprises 20 nucleotides.
In an embodiment, the targeting domain comprises 21 nucleotides.
In an embodiment, the targeting domain comprises 22 nucleotides.
In an embodiment, the targeting domain comprises 23 nucleotides.
In an embodiment, the targeting domain comprises 24 nucleotides.
In an embodiment, the targeting domain comprises 25 nucleotides.
In an embodiment, the targeting domain comprises 26 nucleotides.
In an embodiment, the gRNA, e.g., a gRNA comprising a targeting domain, which is complementary with the HBB gene or BCL11A gene, is a modular gRNA. In another embodiment, the gRNA is a unimolecular or chimeric gRNA.
HBB gRNA as described herein may comprise from 5′ to 3′: a targeting domain (comprising a “core domain”, and optionally a “secondary domain”); a first complementarity domain; a linking domain; a second complementarity domain; a proximal domain; and a tail domain. In an embodiment, the proximal domain and tail domain are taken together as a single domain.
In an embodiment, a gRNA comprises a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 20 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In another embodiment, a gRNA comprises a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 25 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In another embodiment, a gRNA comprises a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 30 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In another embodiment, a gRNA comprises a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 40 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
A cleavage event, e.g., a double strand or single strand break, is generated by a Cas9 molecule. The Cas9 molecule may be an enzymatically active Cas9 (eaCas9) molecule, e.g., an eaCas9 molecule that forms a double strand break in a target nucleic acid or an eaCas9 molecule forms a single strand break in a target nucleic acid (e.g., a nickase molecule). Alternatively, in an embodiment, the Cas9 molecule may be an enzymatically inactive Cas9 (eiCas9) molecule or a modified eiCas9 molecule, e.g., the eiCas9 molecule is fused to Krüppel-associated box (KRAB) to generate an eiCas9-KRAB fusion protein molecule.
In an embodiment, the eaCas9 molecule catalyzes a double strand break.
In an embodiment, the eaCas9 molecule comprises HNH-like domain cleavage activity but has no, or no significant, N-terminal RuvC-like domain cleavage activity. In this case, the eaCas9 molecule is an HNH-like domain nickase, e.g., the eaCas9 molecule comprises a mutation at D10, e.g., D10A. In another embodiment, the eaCas9 molecule comprises N-terminal RuvC-like domain cleavage activity but has no, or no significant, HNH-like domain cleavage activity. In an embodiment, the eaCas9 molecule is an N-terminal RuvC-like domain nickase, e.g., the eaCas9 molecule comprises a mutation at H840, e.g., H840A. In an embodiment, the eaCas9 molecule is an N-terminal RuvC-like domain nickase, e.g., the eaCas9 molecule comprises a mutation at N863, e.g., N863A.
In an embodiment, a single strand break is formed in the strand of the target nucleic acid to which the targeting domain of said gRNA is complementary. In another embodiment, a single strand break is formed in the strand of the target nucleic acid other than the strand to which the targeting domain of said gRNA is complementary.
In another aspect, disclosed herein is a nucleic acid, e.g., an isolated or non-naturally occurring nucleic acid, e.g., DNA, that comprises (a) a sequence that encodes a gRNA molecule comprising a targeting domain that is complementary with a target domain, e.g., with an SCD target position, in the HBB gene or BCL11A gene as disclosed herein.
In an embodiment, the nucleic acid encodes a gRNA molecule, e.g., a first gRNA molecule, comprising a targeting domain configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to an SCD target position in the HBB gene or BCL11A gene to allow alteration, e.g., alteration associated with HDR or NHEJ, of the an SCD target position in the HBB gene or BCL11A gene.
In an embodiment, the nucleic acid encodes a gRNA molecule, e.g., a first gRNA molecule, comprising a targeting domain configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to an SCD knockdown target position to reduce, decrease or repress expression of the BCL11A gene.
In an embodiment, the nucleic acid encodes a gRNA molecule, e.g., the first gRNA molecule, comprising a targeting domain comprising a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any one of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. In an embodiment, the nucleic acid encodes a gRNA molecule comprising a targeting domain is selected from those in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
In an embodiment, the nucleic acid encodes a modular gRNA, e.g., one or more nucleic acids encode a modular gRNA. In another embodiment, the nucleic acid encodes a chimeric gRNA. The nucleic acid may encode a gRNA, e.g., the first gRNA molecule, comprising a targeting domain comprising 16 nucleotides or more in length. In one embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 16 nucleotides in length. In another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 17 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 18 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 19 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 20 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 21 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 22 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 23 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 24 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 25 nucleotides in length. In still another embodiment, the nucleic acid encodes a gRNA, e.g., the first gRNA molecule, comprising a targeting domain that is 26 nucleotides in length.
In an embodiment, the targeting domain comprises 16 nucleotides.
In an embodiment, the targeting domain comprises 17 nucleotides.
In an embodiment, the targeting domain comprises 18 nucleotides.
In an embodiment, the targeting domain comprises 19 nucleotides.
In an embodiment, the targeting domain comprises 20 nucleotides.
In an embodiment, the targeting domain comprises 21 nucleotides.
In an embodiment, the targeting domain comprises 22 nucleotides.
In an embodiment, the targeting domain comprises 23 nucleotides.
In an embodiment, the targeting domain comprises 24 nucleotides.
In an embodiment, the targeting domain comprises 25 nucleotides.
In an embodiment, the targeting domain comprises 26 nucleotides.
In an embodiment, a nucleic acid encodes a gRNA comprising from 5′ to 3′: a targeting domain (comprising a “core domain”, and optionally a “secondary domain”); a first complementarity domain; a linking domain; a second complementarity domain; a proximal domain; and a tail domain. In an embodiment, the proximal domain and tail domain are taken together as a single domain.
In an embodiment, a nucleic acid encodes a gRNA e.g., the first gRNA molecule, comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 20 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a gRNA e.g., the first gRNA molecule, comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 30 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a gRNA e.g., the first gRNA molecule, comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 30 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a gRNA comprising e.g., the first gRNA molecule, a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 40 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid comprises (a) a sequence that encodes a gRNA molecule e.g., the first gRNA molecule, comprising a targeting domain that is complementary with a target domain in the HBB gene or BCL11A gene as disclosed herein, and further comprising (b) a sequence that encodes a Cas9 molecule.
The Cas9 molecule may be an enzymatically active Cas9 (eaCas9) molecule, e.g., an eaCas9 molecule that forms a double strand break in a target nucleic acid or an eaCas9 molecule forms a single strand break in a target nucleic acid (e.g., a nickase molecule). Alternatively, in an embodiment, the Cas9 molecule may be an enzymatically inactive Cas9 (eiCas9) molecule or a modified eiCas9 molecule, e.g., the eiCas9 molecule is fused to Krüppel-associated box (KRAB) to generate an eiCas9-KRAB fusion protein molecule.
A nucleic acid disclosed herein may comprise (a) a sequence that encodes a gRNA molecule comprising a targeting domain that is complementary with a target domain in the HBB gene or BCL11A gene as disclosed herein; (b) a sequence that encodes a Cas9 molecule; and further comprises (c)(i) a sequence that encodes a second gRNA molecule described herein having a targeting domain that is complementary to a second target domain of the HBB gene or BCL11A gene, and optionally, (c)(ii) a sequence that encodes a third gRNA molecule described herein having a targeting domain that is complementary to a third target domain of the HBB gene or BCL11A gene; and optionally, (c)(iii) a sequence that encodes a fourth gRNA molecule described herein having a targeting domain that is complementary to a fourth target domain of the HBB gene or BCL11A gene.
In an embodiment, a nucleic acid encodes a second gRNA molecule comprising a targeting domain configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to an SCD target position in the HBB gene or BCL11A gene, to allow alteration, e.g., alteration associated with HDR or NHEJ, of an SCD target position in the HBB gene or BCL11A gene, either alone or in combination with the break positioned by said first gRNA molecule.
In an embodiment, the nucleic acid encodes a second gRNA molecule comprising a targeting domain configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to an SCD knockdown target position to reduce, decrease or repress expression of the BCL11A gene.
In an embodiment, a nucleic acid encodes a third gRNA molecule comprising a targeting domain configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to an SCD target position in the HBB gene or BCL11A gene to allow alteration, e.g., alteration associated with HDR or NHEJ, of an SCD target position in the HBB gene or BCL11A gene, either alone or in combination with the break positioned by the first and/or second gRNA molecule.
In an embodiment, the nucleic acid encodes a third gRNA molecule comprising a targeting domain configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to an SCD knockdown target position to reduce, decrease or repress expression of the BCL11A gene.
In an embodiment, a nucleic acid encodes a fourth gRNA molecule comprising a targeting domain configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to an SCD target position in the HBB gene or BCL11A gene to allow alteration, e.g., alteration associated with HDR or NHEJ, of an SCD target position in the HBB gene or BCL11A gene, either alone or in combination with the break positioned by the first gRNA molecule, the second gRNA molecule and/or the third gRNA molecule.
In an embodiment, the nucleic acid encodes a fourth gRNA molecule comprising a targeting domain configured to target an enzymatically inactive Cas9 (eiCas9) or an eiCas9 fusion protein (e.g., an eiCas9 fused to a transcription repressor domain), sufficiently close to an SCD knockdown target position to reduce, decrease or repress expression of the BCL11A gene.
In an embodiment, the nucleic acid encodes a second gRNA molecule. The second gRNA is selected to target the same SCD target position as the first gRNA molecule. Optionally, the nucleic acid may encode a third gRNA, and further optionally, the nucleic acid may encode a fourth gRNA molecule. The third gRNA molecule and the fourth gRNA molecule are selected to target the same SCD target position as the first and/or second gRNA molecules.
In an embodiment, the nucleic acid encodes a second gRNA molecule comprising a targeting domain comprising a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from one of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. In an embodiment, the nucleic acid encodes a second gRNA molecule comprising a targeting domain selected from those in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. In an embodiment, when a third or fourth gRNA molecule are present, the third and fourth gRNA molecules may independently comprise a targeting domain comprising a sequence that is the same as, or differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from one of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. In a further embodiment, when a third or fourth gRNA molecule are present, the third and fourth gRNA molecules may independently comprise a targeting domain selected from those in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
In an embodiment, the nucleic acid encodes a second gRNA which is a modular gRNA, e.g., wherein one or more nucleic acid molecules encode a modular gRNA. In another embodiment, the nucleic acid encoding a second gRNA is a chimeric gRNA. In another embodiment, when a nucleic acid encodes a third or fourth gRNA, the third and/or fourth gRNA may be a modular gRNA or a chimeric gRNA. When multiple gRNAs are used, any combination of modular or chimeric gRNAs may be used.
A nucleic acid may encode a second, a third, and/or a fourth gRNA comprising a targeting domain comprising 16 nucleotides or more in length. In an embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 16 nucleotides in length. In another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 17 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 18 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 19 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 20 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 21 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 22 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 23 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 24 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 25 nucleotides in length. In still another embodiment, the nucleic acid encodes a second gRNA comprising a targeting domain that is 26 nucleotides in length.
In an embodiment, the targeting domain comprises 16 nucleotides.
In an embodiment, the targeting domain comprises 17 nucleotides.
In an embodiment, the targeting domain comprises 18 nucleotides.
In an embodiment, the targeting domain comprises 19 nucleotides.
In an embodiment, the targeting domain comprises 20 nucleotides.
In an embodiment, the targeting domain comprises 21 nucleotides.
In an embodiment, the targeting domain comprises 22 nucleotides.
In an embodiment, the targeting domain comprises 23 nucleotides.
In an embodiment, the targeting domain comprises 24 nucleotides.
In an embodiment, the targeting domain comprises 25 nucleotides.
In an embodiment, the targeting domain comprises 26 nucleotides.
In an embodiment, a nucleic acid encodes a second, a third, and/or a fourth gRNA comprising from 5′ to 3′: a targeting domain (comprising a “core domain”, and optionally a “secondary domain”); a first complementarity domain; a linking domain; a second complementarity domain; a proximal domain; and a tail domain. In an embodiment, the proximal domain and tail domain are taken together as a single domain.
In an embodiment, a nucleic acid encodes a second, a third, and/or a fourth gRNA comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 20 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a second, a third, and/or a fourth gRNA comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 30 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a second, a third, and/or a fourth gRNA comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 35 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, a nucleic acid encodes a second, a third, and/or a fourth gRNA comprising a linking domain of no more than 25 nucleotides in length; a proximal and tail domain, that taken together, are at least 40 nucleotides in length; and a targeting domain equal to or greater than 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, when the HBB gene is corrected, e.g., by HDR, the nucleic acid encodes (a) a sequence that encodes a gRNA molecule comprising a targeting domain that is complementary with a target domain in the HBB gene as disclosed herein; (b) a sequence that encodes a Cas9 molecule; optionally, (c)(i) a sequence that encodes a second gRNA molecule described herein having a targeting domain that is complementary to a second target domain of the HBB gene, and further optionally, (c)(ii) a sequence that encodes a third gRNA molecule described herein having a targeting domain that is complementary to a third target domain of the HBB gene; and still further optionally, (c)(iii) a sequence that encodes a fourth gRNA molecule described herein having a targeting domain that is complementary to a fourth target domain of the HBB gene; and further may comprise (d) a template nucleic acid (in an embodiment where an exogenous template is used).
In an embodiment, a mutation in the HBB gene is corrected, e.g., by HDR, using an exogenously provided template nucleic acid.
In an embodiment, the template nucleic acid is a single stranded nucleic acid. In another embodiment, the template nucleic acid is a double stranded nucleic acid. In an embodiment, the template nucleic acid comprises a nucleotide sequence, e.g., of one or more nucleotides, that will be added to or will template a change in the target nucleic acid. In another embodiment, the template nucleic acid comprises a nucleotide sequence that may be used to modify the target position. In another embodiment, the template nucleic acid comprises a nucleotide sequence, e.g., of one or more nucleotides, that corresponds to wild type sequence of the target nucleic acid, e.g., of the target position.
The template nucleic acid may comprise a replacement sequence, e.g., a replacement sequence from the Table 27. In an embodiment, the template nucleic acid comprises a 5′ homology arm, e.g., a 5′ homology arm from Table 27. In another embodiment, the template nucleic acid comprises a 3′ homology arm, e.g., a 3′ homology arm from Table 27.
In another embodiment, a mutation in the HBB gene is corrected, e.g., by HDR, without using an exogenously provided template nucleic acid. While not wishing to be bound by theory, it is believed that an endogenous region of homology can mediate HDR-based correction. In an embodiment, alteration of the target sequence occurs by HDR with an endogenous genomic donor sequence. In an embodiment, the endogenous genomic donor sequence is located on the same chromosome as the target sequence. In another embodiment, the endogenous genomic donor sequence is located on a different chromosome from the target sequence. In an embodiment, the endogenous genomic donor sequence comprises one or more nucleotides derived from the HBD gene. Mutations in the HBB gene that can be corrected (e.g., altered) by HDR with an endogenous genomic donor sequence include, e.g., a point mutation at E6, e.g., E6V.
As described above, a nucleic acid may comprise (a) a sequence encoding a gRNA molecule comprising a targeting domain that is complementary with a target domain in the HBB gene or BCL11A gene, and (b) a sequence encoding a Cas9 molecule.
In an embodiment, (a) and (b) are present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., the same adeno-associated virus (AAV) vector. In an embodiment, the nucleic acid molecule is an AAV vector. Exemplary AAV vectors that may be used in any of the described compositions and methods include an AAV2 vector, a modified AAV2 vector, an AAV3 vector, a modified AAV3 vector, an AAV6 vector, a modified AAV6 vector, an AAV8 vector and an AAV9 vector.
In another embodiment, (a) is present on a first nucleic acid molecule, e.g. a first vector, e.g., a first viral vector, e.g., a first AAV vector; and (b) is present on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. The first and second nucleic acid molecules may be AAV vectors.
In another embodiment, the nucleic acid may further comprise (c) a sequence that encodes a second, third and/or fourth gRNA molecule as described herein. In an embodiment, the nucleic acid comprises (a), (b) and (c). Each of (a) and (c) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., the same adeno-associated virus (AAV) vector. In an embodiment, the nucleic acid molecule is an AAV vector.
In another embodiment, (a) and (c) are on different vectors. For example, (a) may be present on a first nucleic acid molecule, e.g. a first vector, e.g., a first viral vector, e.g., a first AAV vector; and (c) may be present on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. In an embodiment, the first and second nucleic acid molecules are AAV vectors.
In another embodiment, each of (a), (b), and (c) are present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, one of (a), (b), and (c) is encoded on a first nucleic acid molecule, e.g., a first vector, e.g., a first viral vector, e.g., a first AAV vector; and a second and third of (a), (b), and (c) is encoded on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. The first and second nucleic acid molecule may be AAV vectors.
In an embodiment, (a) is present on a first nucleic acid molecule, e.g., a first vector, e.g., a first viral vector, a first AAV vector; and (b) and (c) are present on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. The first and second nucleic acid molecule may be AAV vectors.
In another embodiment, (b) is present on a first nucleic acid molecule, e.g., a first vector, e.g., a first viral vector, e.g., a first AAV vector; and (a) and (c) are present on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. The first and second nucleic acid molecule may be AAV vectors.
In another embodiment, (c) is present on a first nucleic acid molecule, e.g., a first vector, e.g., a first viral vector, e.g., a first AAV vector; and (b) and (a) are present on a second nucleic acid molecule, e.g., a second vector, e.g., a second vector, e.g., a second AAV vector. The first and second nucleic acid molecule may be AAV vectors.
In another embodiment, each of (a), (b) and (c) are present on different nucleic acid molecules, e.g., different vectors, e.g., different viral vectors, e.g., different AAV vector. For example, (a) may be on a first nucleic acid molecule, (b) on a second nucleic acid molecule, and (c) on a third nucleic acid molecule. The first, second and third nucleic acid molecule may be AAV vectors.
In another embodiment, when a third and/or fourth gRNA molecule are present, each of (a), (b), (c)(i), (c)(ii) and (c)(iii) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, each of (a), (b), (c)(i), (c)(ii) and (c)(iii) may be present on the different nucleic acid molecules, e.g., different vectors, e.g., the different viral vectors, e.g., different AAV vectors. In further embodiments, each of (a), (b), (c)(i), (c)(ii) and (c)(iii) may be present on more than one nucleic acid molecule, but fewer than five nucleic acid molecules, e.g., AAV vectors.
In another embodiment, when (d) a template nucleic acid is present, each of (a), (b), and (d) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, each of (a), (b), and (d) may be present on the different nucleic acid molecules, e.g., different vectors, e.g., the different viral vectors, e.g., different AAV vectors. In further embodiments, each of (a), (b), and (d) may be present on more than one nucleic acid molecule, but fewer than three nucleic acid molecules, e.g., AAV vectors.
In another embodiment, when (d) a template nucleic acid is present, each of (a), (b), (c)(i) and (d) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, each of (a), (b), (c)(i) and (d) may be present on the different nucleic acid molecules, e.g., different vectors, e.g., the different viral vectors, e.g., different AAV vectors. In further embodiments, each of (a), (b), (c)(i) and (d) may be present on more than one nucleic acid molecule, but fewer than four nucleic acid molecules, e.g., AAV vectors.
In another embodiment, when (d) a template nucleic acid is present, each of (a), (b), (c)(i), (c)(ii) and (d) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, each of (a), (b), (c)(i), (c)(ii) and (d) may be present on the different nucleic acid molecules, e.g., different vectors, e.g., the different viral vectors, e.g., different AAV vectors. In further embodiments, each of (a), (b), (c)(i), (c)(ii) and (d) may be present on more than one nucleic acid molecule, but fewer than five nucleic acid molecules, e.g., AAV vectors.
In another embodiment, when (d) a template nucleic acid is present, each of (a), (b), (c)(i), (c)(ii), (c)(iii) and (d) may be present on the same nucleic acid molecule, e.g., the same vector, e.g., the same viral vector, e.g., an AAV vector. In an embodiment, the nucleic acid molecule is an AAV vector. In an alternate embodiment, each of (a), (b), (c)(i), (c)(ii), (c)(iii) and (d) may be present on the different nucleic acid molecules, e.g., different vectors, e.g., the different viral vectors, e.g., different AAV vectors. In further embodiments, each of (a), (b), (c)(i), (c)(ii), (c)(iii) and (d) may be present on more than one nucleic acid molecule, but fewer than six nucleic acid molecules, e.g., AAV vectors.
The nucleic acids described herein may comprise a promoter operably linked to the sequence that encodes the gRNA molecule of (a), e.g., a promoter described herein. The nucleic acid may further comprise a second promoter operably linked to the sequence that encodes the second, third and/or fourth gRNA molecule of (c), e.g., a promoter described herein. The promoter and second promoter differ from one another. In an embodiment, the promoter and second promoter are the same.
The nucleic acids described herein may further comprise a promoter operably linked to the sequence that encodes the Cas9 molecule of (b), e.g., a promoter described herein.
In another aspect, disclosed herein is a composition comprising (a) a gRNA molecule comprising a targeting domain that is complementary with a target domain in the HBB gene or BCL11A gene, as described herein. The composition of (a) may further comprise (b) a Cas9 molecule, e.g., a Cas9 molecule as described herein. A composition of (a) and (b) may further comprise (c) a second, third and/or fourth gRNA molecule, e.g., a second, third and/or fourth gRNA molecule described herein. A composition of (a), (b) and (c) may further comprise (d) a template nucleic acid (in an embodiment where an exogenous template is used). In an embodiment, the composition is a pharmaceutical composition. The Compositions described herein, e.g., pharmaceutical compositions described herein, can be used in treating SCD in a subject, e.g., in accordance with a method disclosed herein.
In another aspect, disclosed herein is a method of altering a cell, e.g., altering the structure, e.g., altering the sequence, of a target nucleic acid of a cell, comprising contacting said cell with: (a) a gRNA that targets the HBB gene or BCL11A gene, e.g., a gRNA as described herein; (b) a Cas9 molecule, e.g., a Cas9 molecule as described herein; and optionally, (c) a second, third and/or fourth gRNA that targets HBB gene or BCL11A gene, e.g., a gRNA; and optionally, (d) a template nucleic acid, as described herein.
In an embodiment, the method comprises contacting said cell with (a) and (b).
In an embodiment, the method comprises contacting said cell with (a), (b), and (c).
In an embodiment, the method comprises contacting said cell with (a), (b), (c) and (d).
In an embodiment, the gRNA targets the HBB gene and no exogenous template nucleic acid is contacted with the cell.
The gRNA of (a) and optionally (c) may be selected from any of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, or a gRNA that differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
In an embodiment, the method comprises contacting a cell from a subject suffering from or likely to develop SCD. The cell may be from a subject having a mutation at an SCD target position in the HBB gene or a subject which would benefit from having a mutation at an SCD target position in the BCL11A gene.
In an embodiment, the cell being contacted in the disclosed method is an erythroid cell. The contacting may be performed ex vivo and the contacted cell may be returned to the subject's body after the contacting step. In another embodiment, the contacting step may be performed in vivo.
In an embodiment, the method of altering a cell as described herein comprises acquiring knowledge of the sequence at an SCD target position in said cell, prior to the contacting step. Acquiring knowledge of the sequence at an SCD target position in the cell may be by sequencing the HBB gene or BCL11A gene, or a portion of the HBB gene or BCL11A gene.
In an embodiment, the contacting step of the method comprises contacting the cell with a nucleic acid, e.g., a vector, e.g., an AAV vector, that expresses at least one of (a), (b), and (c). In an embodiment, the contacting step of the method comprises contacting the cell with a nucleic acid, e.g., a vector, e.g., an AAV vector, that expresses each of (a), (b), and (c). In another embodiment, the contacting step of the method comprises delivering to the cell a Cas9 molecule of (b) and a nucleic acid which encodes a gRNA (a) and optionally, a second gRNA (c)(i) (and further optionally, a third gRNA (c)(iv) and/or fourth gRNA (c)(iii).
In an embodiment, the contacting step of the method comprises contacting the cell with a nucleic acid, e.g., a vector, e.g., an AAV vector, that expresses at least one of (a), (b), (c) and (d). In an embodiment, the contacting step of the method comprises contacting the cell with a nucleic acid, e.g., a vector, e.g., an AAV vector, that expresses each of (a), (b), and (c). In another embodiment, the contacting step of the method comprises delivering to the cell a Cas9 molecule of (b), a nucleic acid which encodes a gRNA of (a) and a template nucleic acid of (d), and optionally, a second gRNA (c)(i) (and further optionally, a third gRNA (c)(iv) and/or fourth gRNA (c)(iii).
In an embodiment, contacting comprises contacting the cell with a nucleic acid, e.g., a vector, e.g., an AAV vector, e.g., an AAV2 vector, a modified AAV2 vector, an AAV3 vector, a modified AAV3 vector, an AAV6 vector, a modified AAV6 vector, an AAV8 vector or an AAV9 vector.
In an embodiment, contacting comprises delivering to the cell a Cas9 molecule of (b), as a protein or an mRNA, and a nucleic acid which encodes (a) and optionally a second, third and/or fourth gRNA of (c).
In an embodiment, contacting comprises delivering to the cell a Cas9 molecule of (b), as a protein or an mRNA, said gRNA of (a), as an RNA, and optionally said second, third and/or fourth gRNA of (c), as an RNA.
In an embodiment, contacting comprises delivering to the cell a gRNA of (a) as an RNA, optionally said second, third and/or fourth gRNA of (c) as an RNA, and a nucleic acid that encodes the Cas9 molecule of (b).
In another aspect, disclosed herein is a method of treating or preventing a subject suffering from or likely to develop SCD, e.g., altering the structure, e.g., sequence, of a target nucleic acid of the subject, comprising contacting the subject (or a cell from the subject) with:
(a) a gRNA that targets the HBB gene or BCL11A gene, e.g., a gRNA disclosed herein;
(b) a Cas9 molecule, e.g., a Cas9 molecule disclosed herein; and
optionally, (c)(i) a second gRNA that targets the HBB gene or BCL11A gene, e.g., a second gRNA disclosed herein, and
further optionally, (c)(ii) a third gRNA, and still further optionally, (c)(iii) a fourth gRNA that target the HBB gene or BCL11A gene, e.g., a third and fourth gRNA disclosed herein.
The method of treating a subject may further comprise contacting the subject (or a cell from the subject) with (d) a template nucleic acid (in an embodiment where an exogenous template is used), e.g., a template nucleic acid disclosed herein.
In an embodiment, a template nucleic acid is used when the method of treating a subject uses HDR to alter the sequence of the target nucleic acid of the subject. In an embodiment, the gRNA targets the HBB gene and no exogenous template nucleic acid is contacted with the subject (or a cell from the subject).
In an embodiment, contacting comprises contacting with (a) and (b).
In an embodiment, contacting comprises contacting with (a), (b), and (c)(i).
In an embodiment, contacting comprises contacting with (a), (b), (c)(i) and (c)(ii).
In an embodiment, contacting comprises contacting with (a), (b), (c)(i), (c)(ii) and (c)(iii).
In an embodiment, contacting comprises contacting with (a), (b), (c)(i) and (d).
In an embodiment, contacting comprises contacting with (a), (b), (c)(i), (c)(ii) and (d).
In an embodiment, contacting comprises contacting with (a), (b), (c)(i), (c)(ii), (c)(iii) and (d).
The gRNA of (a) or (c) (e.g., (c)(i), (c)(ii), or (c)(iii) may be selected from any of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, or a gRNA that differs by no more than 1, 2, 3, 4, or 5 nucleotides from, a targeting domain sequence from any of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
In an embodiment, the method comprises acquiring knowledge of the sequence (e.g., a mutation) of an SCD target position in said subject.
In an embodiment, the method comprises acquiring knowledge of the sequence (e.g., a mutation) of an SCD target position in said subject by sequencing the HBB gene or BCL11A gene or a portion of the HBB gene or BCL11A gene.
In an embodiment, the method comprises correcting a mutation at an SCD target position in the HBB gene.
In an embodiment, the method comprises correcting a mutation at an SCD target position in the HBB gene by HDR.
In an embodiment, the method comprises introducing a mutation at an SCD target position in the BCL11A gene.
In an embodiment, the method comprises introducing a mutation at an SCD target position in the BCL11A gene by NHEJ.
When the method comprises correcting the mutation at an SCD target position by HDR, a Cas9 of (b), at least one guide RNA, e.g., a guide RNA of (a) and a template nucleic acid of (d) are included in the contacting step.
In an embodiment, a cell of the subject is contacted ex vivo with (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, said cell is returned to the subject's body.
In an embodiment, a cell of the subject is contacted is in vivo with (a), (b) (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, the cell of the subject is contacted in vivo by intravenous delivery of (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, the cell of the subject is contacted in vivo by intramuscular delivery of (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, the cell of the subject is contacted in vivo by subcutaneous delivery of (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, the cell of the subject is contacted in vivo by intra-bone marrow (IBM) delivery of (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises contacting the subject with a nucleic acid, e.g., a vector, e.g., an AAV vector, described herein, e.g., a nucleic acid that encodes at least one of (a), (b), (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises delivering to said subject said Cas9 molecule of (b), as a protein or mRNA, and a nucleic acid which encodes (a), a nucleic acid of (d) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises delivering to the subject the Cas9 molecule of (b), as a protein or mRNA, the gRNA of (a), as an RNA, a nucleic acid of (d) and optionally the second, third and/or fourth gRNA of (c), as an RNA.
In an embodiment, contacting comprises delivering to the subject the gRNA of (a), as an RNA, optionally said second, third and/or fourth gRNA of (c), as an RNA, a nucleic acid that encodes the Cas9 molecule of (b), and a nucleic acid of (d).
When the method comprises (1) introducing a mutation at an SCD target position by NHEJ or (2) knocking down expression of the BCL11A gene by targeting the promoter region, a Cas9 of (b) and at least one guide RNA, e.g., a guide RNA of (a) are included in the contacting step.
In an embodiment, a cell of the subject is contacted ex vivo with (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, said cell is returned to the subject's body.
In an embodiment, a populations of cells from a subject is contacted ex vivo with (a), (b) and optionally (c) to correct the E6V mutation in the HBB gene and a second population of cells from the subject is contacted ex vivo with (a), (b) and optionally (c) to introduce a mutation in the BCL11A gene to knockout the BCL11A gene. A mixture of the two cell populations may be returned to the subject's body to treat or prevent SCD.
In an embodiment, a cell of the subject is contacted is in vivo with (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, the cell of the subject is contacted in vivo by intravenous delivery of (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, the cell of the subject is contacted in vivo by intramuscular delivery of (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, the cell of the subject is contacted in vivo by subcutaneous delivery of (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii). In an embodiment, the cell of the subject is contacted in vivo by intra-bone marrow (IBM) delivery of (a), (b) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises contacting the subject with a nucleic acid, e.g., a vector, e.g., an AAV vector, described herein, e.g., a nucleic acid that encodes at least one of (a), (b), and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises delivering to said subject said Cas9 molecule of (b), as a protein or mRNA, and a nucleic acid which encodes (a) and optionally (c)(i), further optionally (c)(ii), and still further optionally (c)(iii).
In an embodiment, contacting comprises delivering to the subject the Cas9 molecule of (b), as a protein or mRNA, the gRNA of (a), as an RNA, and optionally the second, third and/or fourth gRNA of (c), as an RNA.
In an embodiment, contacting comprises delivering to the subject the gRNA of (a), as an RNA, optionally said second, third and/or fourth gRNA of (c), as an RNA, and a nucleic acid that encodes the Cas9 molecule of (b).
In another aspect, disclosed herein is a reaction mixture comprising a gRNA, a nucleic acid, or a composition described herein, and a cell, e.g., a cell from a subject having, or likely to develop SCD, or a subject having a mutation at an SCD target position in the HBB gene, or a cell from a subject which would benefit from having a mutation at an SCD target position in the BCL11A gene.
In another aspect, disclosed herein is a kit comprising, (a) gRNA molecule described herein, or nucleic acid that encodes the gRNA, and one or more of the following:
(b) a Cas9 molecule, e.g., a Cas9 molecule described herein, or a nucleic acid or mRNA that encodes the Cas9;
(c)(i) a second gRNA molecule, e.g., a second gRNA molecule described herein or a nucleic acid that encodes (c)(i);
(c)(ii) a third gRNA molecule, e.g., a second gRNA molecule described herein or a nucleic acid that encodes (c)(ii);
(c)(iii) a fourth gRNA molecule, e.g., a second gRNA molecule described herein or a nucleic acid that encodes (c)(iii);
(d) a template nucleic acid (in an embodiment where an exogenous template is used), e.g., a template nucleic acid described herein.
In an embodiment, the kit comprises nucleic acid, e.g., an AAV vector, that encodes one or more of (a), (b), (c)(i), (c)(ii), (c)(iii) and (d).
In an aspect, the disclosure features a gRNA molecule, referred to herein as a governing gRNA molecule, comprising a targeting domain which is complementary to a target domain on a nucleic acid that encodes a component of the CRISPR/Cas system introduced into a cell or subject. In an embodiment, the governing gRNA molecule targets a nucleic acid that encodes a Cas9 molecule or a nucleic acid that encodes a target gene gRNA molecule. In an embodiment, the governing gRNA comprises a targeting domain that is complementary to a target domain in a sequence that encodes a Cas9 component, e.g., a Cas9 molecule or target gene gRNA molecule. In an embodiment, the target domain is designed with, or has, minimal homology to other nucleic acid sequences in the cell, e.g., to minimize off-target cleavage. For example, the targeting domain on the governing gRNA can be selected to reduce or minimize off-target effects. In an embodiment, a target domain for a governing gRNA can be disposed in the control or coding region of a Cas9 molecule or disposed between a control region and a transcribed region. In an embodiment, a target domain for a governing gRNA can be disposed in the control or coding region of a target gene gRNA molecule or disposed between a control region and a transcribed region for a target gene gRNA. While not wishing to be bound by theory, it is believed that altering, e.g., inactivating, a nucleic acid that encodes a Cas9 molecule or a nucleic acid that encodes a target gene gRNA molecule can be effected by cleavage of the targeted nucleic acid sequence or by binding of a Cas9 molecule/governing gRNA molecule complex to the targeted nucleic acid sequence.
The compositions, reaction mixtures and kits, as disclosed herein, can also include a governing gRNA molecule, e.g., a governing gRNA molecule disclosed herein.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
Headings, including numeric and alphabetical headings and subheadings, are for organization and presentation and are not intended to be limiting.
Other features and advantages of the invention will be apparent from the detailed description, drawings, and from the claims.
“Alt-HDR” or “alternative HDR”, or alternative homology-directed repair, as used herein, refers to the process of repairing DNA damage using a homologous nucleic acid (e.g., an endogenous homologous sequence, e.g., a sister chromatid, or an exogenous nucleic acid, e.g., a template nucleic acid). Alt-HDR is distinct from canonical HDR in that the process utilizes different pathways from canonical HDR, and can be inhibited by the canonical HDR mediators, RAD51 and BRCA2. Also, alt-HDR uses a single-stranded or nicked homologous nucleic acid for repair of the break.
“Canonical HDR”, or canonical homology-directed repair, as used herein, refers to the process of repairing DNA damage using a homologous nucleic acid (e.g., an endogenous homologous sequence, e.g., a sister chromatid, or an exogenous nucleic acid, e.g., a template nucleic acid). Canonical HDR typically acts when there has been significant resection at the double strand break, forming at least one single stranded portion of DNA. In a normal cell, HDR typically involves a series of steps such as recognition of the break, stabilization of the break, resection, stabilization of single stranded DNA, formation of a DNA crossover intermediate, resolution of the crossover intermediate, and ligation. The process requires RAD51 and BRCA2, and the homologous nucleic acid is typically double-stranded.
Unless indicated otherwise, the term “HDR” as used herein encompasses canonical HDR and alt-HDR.
“Domain”, as used herein, is used to describe segments of a protein or nucleic acid. Unless otherwise indicated, a domain is not required to have any specific functional property.
Calculations of homology or sequence identity between two sequences (the terms are used interchangeably herein) are performed as follows. The sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The optimal alignment is determined as the best score using the GAP program in the GCG software package with a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frame shift gap penalty of 5. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences.
“Governing gRNA molecule”, as used herein, refers to a gRNA molecule that comprises a targeting domain that is complementary to a target domain on a nucleic acid that comprises a sequence that encodes a component of the CRISPR/Cas system that is introduced into a cell or subject. A governing gRNA does not target an endogenous cell or subject sequence. In an embodiment, a governing gRNA molecule comprises a targeting domain that is complementary with a target sequence on: (a) a nucleic acid that encodes a Cas9 molecule; (b) a nucleic acid that encodes a gRNA which comprises a targeting domain that targets the HBB or BCL11A gene (a target gene gRNA); or on more than one nucleic acid that encodes a CRISPR/Cas component, e.g., both (a) and (b). In an embodiment, a nucleic acid molecule that encodes a CRISPR/Cas component, e.g., that encodes a Cas9 molecule or a target gene gRNA, comprises more than one target domain that is complementary with a governing gRNA targeting domain. While not wishing to be bound by theory, it is believed that a governing gRNA molecule complexes with a Cas9 molecule and results in Cas9 mediated inactivation of the targeted nucleic acid, e.g., by cleavage or by binding to the nucleic acid, and results in cessation or reduction of the production of a CRISPR/Cas system component. In an embodiment, the Cas9 molecule forms two complexes: a complex comprising a Cas9 molecule with a target gene gRNA, which complex will alter the HBB or BCL11A gene; and a complex comprising a Cas9 molecule with a governing gRNA molecule, which complex will act to prevent further production of a CRISPR/Cas system component, e.g., a Cas9 molecule or a target gene gRNA molecule. In an embodiment, a governing gRNA molecule/Cas9 molecule complex binds to or promotes cleavage of a control region sequence, e.g., a promoter, operably linked to a sequence that encodes a Cas9 molecule, a sequence that encodes a transcribed region, an exon, or an intron, for the Cas9 molecule. In an embodiment, a governing gRNA molecule/Cas9 molecule complex binds to or promotes cleavage of a control region sequence, e.g., a promoter, operably linked to a gRNA molecule, or a sequence that encodes the gRNA molecule. In an embodiment, the governing gRNA, e.g., a Cas9-targeting governing gRNA molecule, or a target gene gRNA-targeting governing gRNA molecule, limits the effect of the Cas9 molecule/target gene gRNA molecule complex-mediated gene targeting. In an embodiment, a governing gRNA places temporal, level of expression, or other limits, on activity of the Cas9 molecule/target gene gRNA molecule complex. In an embodiment, a governing gRNA reduces off-target or other unwanted activity. In an embodiment, a governing gRNA molecule inhibits, e.g., entirely or substantially entirely inhibits, the production of a component of the Cas9 system and thereby limits, or governs, its activity.
“Modulator”, as used herein, refers to an entity, e.g., a drug, that can alter the activity (e.g., enzymatic activity, transcriptional activity, or translational activity), amount, distribution, or structure of a subject molecule or genetic sequence. In an embodiment, modulation comprises cleavage, e.g., breaking of a covalent or non-covalent bond, or the forming of a covalent or non-covalent bond, e.g., the attachment of a moiety, to the subject molecule. In an embodiment, a modulator alters the, three dimensional, secondary, tertiary, or quaternary structure, of a subject molecule. A modulator can increase, decrease, initiate, or eliminate a subject activity.
“Large molecule”, as used herein, refers to a molecule having a molecular weight of at least 2, 3, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100 kD. Large molecules include proteins, polypeptides, nucleic acids, biologics, and carbohydrates.
A “polypeptide”, as used herein, refers to a polymer of amino acids having less than 100 amino acid residues. In an embodiment, it has less than 50, 20, or 10 amino acid residues.
“Non-homologous end joining” or “NHEJ”, as used herein, refers to ligation mediated repair and/or non-template mediated repair including canonical NHEJ (cNHEJ), alternative NHEJ (altNHEJ), microhomology-mediated end joining (MMEJ), single-strand annealing (SSA), and synthesis-dependent microhomology-mediated end joining (SD-MMEJ).
A “reference molecule”, e.g., a reference Cas9 molecule or reference gRNA, as used herein, refers to a molecule to which a subject molecule, e.g., a subject Cas9 molecule of subject gRNA molecule, e.g., a modified or candidate Cas9 molecule is compared. For example, a Cas9 molecule can be characterized as having no more than 10% of the nuclease activity of a reference Cas9 molecule. Examples of reference Cas9 molecules include naturally occurring unmodified Cas9 molecules, e.g., a naturally occurring Cas9 molecule such as a Cas9 molecule of S. pyogenes, S. aureus or S. thermophilus. In an embodiment, the reference Cas9 molecule is the naturally occurring Cas9 molecule having the closest sequence identity or homology with the Cas9 molecule to which it is being compared. In an embodiment, the reference Cas9 molecule is a sequence, e.g., a naturally occurring or known sequence, which is the parental form on which a change, e.g., a mutation has been made.
“Replacement”, or “replaced”, as used herein with reference to a modification of a molecule does not require a process limitation but merely indicates that the replacement entity is present.
“Small molecule”, as used herein, refers to a compound having a molecular weight less than about 2 kD, e.g., less than about 2 kD, less than about 1.5 kD, less than about 1 kD, or less than about 0.75 kD.
“Subject”, as used herein, may mean either a human or non-human animal. The term includes, but is not limited to, mammals (e.g., humans, other primates, pigs, rodents (e.g., mice and rats or hamsters), rabbits, guinea pigs, cows, horses, cats, dogs, sheep, and goats). In an embodiment, the subject is a human. In another embodiment, the subject is poultry.
“Treat”, “treating” and “treatment”, as used herein, mean the treatment of a disease in a mammal, e.g., in a human, including (a) inhibiting the disease, i.e., arresting or preventing its development; (b) relieving the disease, i.e., causing regression of the disease state; and (c) curing the disease.
“Prevent”, “preventing” and “prevention”, as used herein, means the prevention of a disease in a mammal, e.g., in a human, including (a) avoiding or precluding the disease; (2) affecting the predisposition toward the disease, e.g., preventing at least one symptom of the disease or to delay onset of at least one symptom of the disease.
“X” as used herein in the context of an amino acid sequence, refers to any amino acid (e.g., any of the twenty natural amino acids) unless otherwise specified.
Methods of Repairing Mutation(s) in the HBB Gene
One approach to treat or prevent SCD is to repair (i.e., correct) one or more mutations in the HBB gene, e.g., by HDR. In this approach, mutant HBB allele(s) are corrected and restored to wild type state. While not wishing to be bound by theory, it is believed that correction of the glutamic acid to valine substitution at amino acid 6 in the beta-globin gene restores wild type beta-globin production within erythroid cells. The method described herein can be performed in all cell types. Beta-globin is expressed in cells of erythroid cell lineage. In an embodiment, an erythroid cell is targeted.
In an embodiment, one HBB allele is repaired in the subject. In another embodiment, both HBB alleles are repaired in the subject. In either situation, the subject can be cured of disease. As the disease only displays a phenotype when both alleles are mutated, repair of a single allele is adequate for a cure.
In one aspect, methods and compositions discussed herein, provide for the correction of the underlying genetic cause of SCD, e.g., the correction of a mutation at a target position in the HBB gene, e.g., correction of a mutation at amino acid position 6, e.g., an E6V substitution in the HBB gene.
In an embodiment, the method provides for the correction of a mutation at a target position in the HBB gene, e.g., correction of a mutation at amino acid position 6, e.g., an E6V substitution in the HBB gene. As described herein, in one embodiment, the method comprises the introduction of one or more breaks (e.g., single strand breaks or double strand breaks) sufficiently close to (e.g., either 5′ or 3′ to) the target position in the HBB gene, e.g., E6V.
In an embodiment, the targeting domain of the gRNA molecule is configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to (e.g., either 5′ or 3′ to) the target position in the HBB gene, e.g., E6V to allow correction, e.g., an alteration in the HBB gene, e.g., an alternation associated with HDR. In an embodiment, the targeting domain is configured such that a cleavage event, e.g., a double strand or single strand break, is positioned within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of a the target position in the HBB gene, e.g., E6V. The break, e.g., a double strand or single strand break, can be positioned upstream or downstream of the target position in the HBB gene, e.g., E6V.
In an embodiment, a second, third and/or fourth gRNA molecule is configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to (e.g., either 5′ or 3′ to) the target position in the HBB gene, e.g., E6V to allow correction, e.g., an alteration associated with HDR in the HBB gene. In an embodiment, the targeting domain is configured such that a cleavage event, e.g., a double strand or single strand break, is positioned within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of a the target position in the HBB gene, e.g., E6V. The break, e.g., a double strand or single strand break, can be positioned upstream or downstream of the target position in the HBB gene, e.g., E6V.
In an embodiment, a single strand break is accompanied by an additional single strand break, positioned by a second, third and/or fourth gRNA molecule, as discussed below. For example, The targeting domains bind configured such that a cleavage event, e.g., the two single strand breaks, are positioned within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position in the HBB gene, e.g., E6V. In an embodiment, the first and second gRNA molecules are configured such, that when guiding a Cas9 nickase, a single strand break will be accompanied by an additional single strand break, positioned by a second gRNA, sufficiently close to one another to result in an alteration of the target position in the HBB gene, e.g., E6V. In an embodiment, the first and second gRNA molecules are configured such that a single strand break positioned by said second gRNA is within 10, 20, 30, 40, or 50 nucleotides of the break positioned by said first gRNA molecule, e.g., when the Cas9 is a nickase. In an embodiment, the two gRNA molecules are configured to position cuts at the same position, or within a few nucleotides of one another, on different strands, e.g., essentially mimicking a double strand break.
In an embodiment, a double strand break can be accompanied by an additional double strand break, positioned by a second, third and/or fourth gRNA molecule, as is discussed below. For example, the targeting domain of a first gRNA molecule is configured such that a double strand break is positioned upstream of the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position; and the targeting domain of a second gRNA molecule is configured such that a double strand break is positioned downstream the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position.
In an embodiment, a double strand break can be accompanied by two additional single strand breaks, positioned by a second gRNA molecule and a third gRNA molecule. For example, the targeting domain of a first gRNA molecule is configured such that a double strand break is positioned upstream of the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position; and the targeting domains of a second and third gRNA molecule are configured such that two single strand breaks are positioned downstream of the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position. In an embodiment, the targeting domain of the first, second and third gRNA molecules are configured such that a cleavage event, e.g., a double strand or single strand break, is positioned, independently for each of the gRNA molecules.
In an embodiment, a first and second single strand breaks can be accompanied by two additional single strand breaks positioned by a third gRNA molecule and a fourth gRNA molecule. For example, the targeting domain of a first and second gRNA molecule are configured such that two single strand breaks are positioned upstream of the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position in the HBB gene, e.g., E6V; and the targeting domains of a third and fourth gRNA molecule are configured such that two single strand breaks are positioned downstream of the target position in the HBB gene, e.g., E6V, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position in the HBB gene, e.g., E6V.
In an embodiment, a mutation in the HBB gene, e.g., E6V is corrected using an exogenously provided template nucleic acid, e.g., by HDR. In another embodiment, a mutation in the HBB gene, e.g., E6V is corrected without using an exogenously provided template nucleic acid, e.g., by HDR. In an embodiment, alteration of the target sequence occurs with an endogenous genomic donor sequence, e.g., by HDR. In an embodiment, the endogenous genomic donor sequence comprises one or more nucleotides derived from the HBD gene. In an embodiment, a mutation in the HBB gene, e.g., E6V is corrected by an endogenous genomic donor sequence (e.g, an HBD gene). In an embodiment, an eaCas9 molecule, e.g., an eaCas9 molecule described herein, is used. In an embodiment, the eaCas9 molecule comprises HNH-like domain cleavage activity but has no, or no significant, N-terminal RuvC-like domain cleavage activity. In an embodiment, the eaCas9 molecule is an HNH-like domain nickase. In an embodiment, the eaCas9 molecule comprises a mutation at D10 (e.g., D10A). In an embodiment, the eaCas9 molecule comprises N-terminal RuvC-like domain cleavage activity but has no, or no significant, HNH-like domain cleavage activity. In an embodiment, the eaCas9 molecule is an N-terminal RuvC-like domain nickase. In an embodiment, the eaCas9 molecule comprises a mutation at H840 (e.g., H840A) or N863 (e.g., N863A).
Methods of Altering BCL11A
One approach to increase the expression of HbF involves identification of genes whose products play a role in the regulation of globin gene expression. One such gene is BCL11A. It plays a role in the regulation of γ globin expression. It was first identified because of its role in lymphocyte development. BCL11A encodes a zinc finger protein that is thought to be involved in the stage specific regulation of γ globin expression. The BCL11A gene product is expressed in adult erythroid precursor cells and down-regulation of its expression leads to an increase in 7 globin expression. In addition, it appears that the splicing of the BCL11A mRNA is developmentally regulated. In embryonic cells, it appears that the shorter BCL11A mRNA variants, known as BCL11A-S and BCL11A-XS are primary expressed, while in adult cells, the longer BCL11A-L and BCL11A-XL mRNA variants are predominantly expressed. See, Sankaran et al (2008) Science 322 p. 1839. The BCL11A protein appears to interact with the β globin locus to alter its conformation and thus its expression at different developmental stages. Thus, if BCL11A expression is altered e.g., disrupted (e.g., reduced or eliminated), it results in the elevation of γ globin and HbF production.
Disclosed herein are methods for altering the SCD target position in the BCL11A gene. Altering the SCD target position is achieved, e.g., by:
(1) knocking out the BCL11A gene:
-
- (a) insertion or deletion (e.g., NHEJ-mediated insertion or deletion) of one or more nucleotides in close proximity to or within the early coding region of the BCL11A gene, or
- (b) deletion (e.g., NHEJ-mediated deletion) of a genomic sequence including the erythroid enhancer of the BCL11A gene, or
(2) knocking down the BCL11A gene mediated by enzymatically inactive Cas9 (eiCas9) molecule or an eiCas9-fusion protein by targeting the promoter region of the gene.
All approaches give rise to alteration of the BCL11A gene.
In one embodiment, methods described herein introduce one or more breaks near the early coding region in at least one allele of the BCL11A gene. In another embodiment, methods described herein introduce two or more breaks to flank the erythroid enhancer of SCD target knockout position. The two or more breaks remove (e.g., delete) genomic sequence including the erythorid enhancer. In another embodiment, methods described herein comprises knocking down the BCL11A gene mediated by enzymatically inactive Cas9 (eiCas9) molecule or an eiCas9-fusion protein by targeting the promoter region of SCD target knockdown position. All methods described herein result in alteration of the BCL11A gene.
NHEJ-Mediated Introduction of an Indel in Close Proximity to or within the Early Coding Region of the SCD Knockout Position
In an embodiment, the method comprises introducing a NHEJ-mediated insertion or deletion of one more nucleotides in close proximity to the SCD target knockout position (e.g., the early coding region) of the BCL11A gene. As described herein, in one embodiment, the method comprises the introduction of one or more breaks (e.g., single strand breaks or double strand breaks) sufficiently close to (e.g., either 5′ or 3′ to) the early coding region of the SCD target knockout position, such that the break-induced indel could be reasonably expected to span the SCD target knockout position (e.g., the early coding region). While not wishing to be bound by theory, it is believed that NHEJ-mediated repair of the break(s) allows for the NHEJ-mediated introduction of an indel in close proximity to within the early coding region of the SCD target knockout position.
In an embodiment, the targeting domain of the gRNA molecule is configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to the early coding region in the BCL11A gene to allow alteration, e.g., alteration associated with NHEJ in the BCL11A gene. In an embodiment, the targeting domain is configured such that a cleavage event, e.g., a double strand or single strand break, is positioned within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of a SCD target knockout position. The break, e.g., a double strand or single strand break, can be positioned upstream or downstream of a SCD target knockout position in the BCL11A gene.
In an embodiment, a second gRNA molecule comprising a second targeting domain is configured to provide a cleavage event, e.g., a double strand break or a single strand break, sufficiently close to the early coding region in the BCL11A gene, to allow alteration, e.g., alteration associated with NHEJ in the BCL11A gene, either alone or in combination with the break positioned by said first gRNA molecule. In an embodiment, the targeting domains of the first and second gRNA molecules are configured such that a cleavage event, e.g., a double strand or single strand break, is positioned, independently for each of the gRNA molecules, within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position. In an embodiment, the breaks, e.g., double strand or single strand breaks, are positioned on both sides of a nucleotide of a SCD target knockout position in the BCL11A gene. In an embodiment, the breaks, e.g., double strand or single strand breaks, are positioned on one side, e.g., upstream or downstream, of a nucleotide of a SCD target knockout position in the BCL11A gene.
In an embodiment, a single strand break is accompanied by an additional single strand break, positioned by a second gRNA molecule, as discussed below. For example, The targeting domains bind configured such that a cleavage event, e.g., the two single strand breaks, are positioned within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the early coding region in the BCL11A gene. In an embodiment, the first and second gRNA molecules are configured such, that when guiding a Cas9 nickase, a single strand break will be accompanied by an additional single strand break, positioned by a second gRNA, sufficiently close to one another to result in alteration of the early coding region in the BCL11A gene. In an embodiment, the first and second gRNA molecules are configured such that a single strand break positioned by said second gRNA is within 10, 20, 30, 40, or 50 nucleotides of the break positioned by said first gRNA molecule, e.g., when the Cas9 is a nickase. In an embodiment, the two gRNA molecules are configured to position cuts at the same position, or within a few nucleotides of one another, on different strands, e.g., essentially mimicking a double strand break.
In an embodiment, a double strand break can be accompanied by an additional double strand break, positioned by a second gRNA molecule, as is discussed below. For example, the targeting domain of a first gRNA molecule is configured such that a double strand break is positioned upstream of the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position; and the targeting domain of a second gRNA molecule is configured such that a double strand break is positioned downstream of the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position.
In an embodiment, a double strand break can be accompanied by two additional single strand breaks, positioned by a second gRNA molecule and a third gRNA molecule. For example, the targeting domain of a first gRNA molecule is configured such that a double strand break is positioned upstream of the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position; and the targeting domains of a second and third gRNA molecule are configured such that two single strand breaks are positioned downstream of the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the target position. In an embodiment, the targeting domain of the first, second and third gRNA molecules are configured such that a cleavage event, e.g., a double strand or single strand break, is positioned, independently for each of the gRNA molecules.
In an embodiment, a first and second single strand breaks can be accompanied by two additional single strand breaks positioned by a third gRNA molecule and a fourth gRNA molecule. For example, the targeting domain of a first and second gRNA molecule are configured such that two single strand breaks are positioned upstream of the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the early coding region in the BCL11A gene; and the targeting domains of a third and fourth gRNA molecule are configured such that two single strand breaks are positioned downstream of a SCD target knockout position in the early coding region in the BCL11A gene, e.g., within 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450 or 500 nucleotides of the early coding region in the BCL11A gene.
NHEJ-Mediated Deletion of the Erythroid Enhancer at the SCD Target Position
In an embodiment, the method comprises introducing a NHEJ-mediated deletion of a genomic sequence including the erythroid enhancer. As described herein, in one embodiment, the method comprises the introduction of two double strand breaks-one 5′ and the other 3′ to (i.e., flanking) the SCD target position (e.g., the erythroid enhancer). Two gRNAs, e.g., unimolecular (or chimeric) or modular gRNA molecules, are configured to position the two double strand breaks on opposite sides of the SCD target knockdown position (e.g., the erythroid enhancer) in the BCL11A gene. In an embodiment, the first double strand break is positioned upstream of the erythroid enhancer within intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), and the second double strand break is positioned downstream of the erythroid enhancer within intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb) (see
The first double strand break may be positioned as follows:
-
- (1) upstream of the 5′ end of the erythroid enhancer in intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb), and the second double strand break to be paired with the first double strand break may be positioned as follows:
- (1) downstream the 3′ end of the erythroid enhancer in intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb).
For example, the first double strand break may be positioned in the BCL11A gene:
(1) between TSS+0.75 kb to TSS+10 kb,
(2) between TSS+10 kb to TSS+20 kb,
(3) between TSS+20 kb to TSS+30 kb,
(4) between TSS+30 kb to TSS+40 kb,
(5) between TSS+40 kb to TSS+45 kb,
(6) between TSS+45 kb to TSS+47.5 kb,
(7) between TSS+47.5 kb to TSS+50 kb,
(8) between TSS+50 kb to TSS+51 kb,
(9) between TSS+51 kb to TSS+51.1 kb,
(10) between TSS+51.1 kb to TSS+51.2 kb,
(11) between TSS+51.2 kb to TSS+51.3 kb,
(12) between TSS+51.3 kb to TSS+51.4 kb,
(13) between TSS+51.4 kb to TSS+51.5 kb,
(14) between TSS+51.5 kb to TSS+51.6 kb,
(15) between TSS+51.6 kb to TSS+51.7 kb,
(16) between TSS+51.7 kb to TSS+51.8 kb,
(17) between TSS+51.8 kb to TSS+51.9 kb,
(18) between TSS+51.9 kb to TSS+52 kb,
(19) between TSS+52 kb to TSS+53 kb,
(20) between TSS+53 kb to TSS+54 kb,
(21) between TSS+54 kb to TSS+55 kb,
(22) between TSS+55 kb to TSS+56 kb,
(23) between TSS+56 kb to TSS+57 kb,
(24) between TSS+57 kb to TSS+58 kb,
(25) between TSS+58 kb to TSS+59 kb,
(26) between TSS+59 kb to TSS+60 kb,
(27) between TSS+60 kb to TSS+61 kb,
(28) between TSS+61 kb to TSS+62 kb,
(29) between TSS+62 kb to TSS+63 kb,
(30) between TSS+63 kb to TSS+64 kb, or
(31) between TSS+64 kb to TSS+64.4 kb,
and the second double strand break to be paired with the first double strand break may be positioned in the BCL11A gene:
(1) between TSS+52 kb to TSS+53 kb,
(2) between TSS+53 kb to TSS+54 kb,
(3) between TSS+54 kb to TSS+55 kb,
(4) between TSS+55 kb to TSS+56 kb,
(5) between TSS+56 kb to TSS+57 kb,
(6) between TSS+57 kb to TSS+58 kb,
(7) between TSS+58 kb to TSS+59 kb,
(8) between TSS+59 kb to TSS+60 kb,
(9) between TSS+60 kb to TSS+61 kb,
(10) between TSS+61 kb to TSS+62 kb,
(11) between TSS+62 kb to TSS+63 kb,
(12) between TSS+63 kb to TSS+64 kb,
(13) between TSS+64 kb to TSS+64.4 kb,
(14) between TSS+64.4 kb to TSS+65 kb,
(15) between TSS+65 kb to TSS+65.1 kb,
(16) between TSS+65.1 kb to TSS+65.2 kb,
(17) between TSS+65.2 kb to TSS+65.3 kb,
(18) between TSS+65.3 kb to TSS+65.4 kb,
(19) between TSS+65.4 kb to TSS+65.5 kb,
(20) between TSS+65.5 kb to TSS+65.7 kb,
(21) between TSS+65.7 kb to TSS+65.8 kb,
(22) between TSS+65.8 kb to TSS+65.9 kb,
(23) between TSS+65.9 kb to TSS+66 kb,
(24) between TSS+66 kb to TSS+67 kb,
(25) between TSS+67 kb to TSS+68 kb,
(26) between TSS+68 kb to TSS+69 kb,
(27) between TSS+69 kb to TSS+70 kb,
(28) between TSS+70 kb to TSS+75 kb,
(29) between TSS+75 kb to TSS+80 kb, or
(30) between TSS+80 kb to TSS+84.4 kb.
While not wishing to be bound by theory, it is believed that the two double strand breaks allow for NHEJ-mediated deletion of erythroid enhancer in the BCL11A gene.
In an embodiment, the method comprises introducing a NHEJ-mediated deletion of a genomic sequence including the erythroid enhancer. As described herein, in one embodiment, the method comprises the introduction of two sets of breaks (e.g., one double strand break and a pair of single strand breaks)—one 5′ and the other 3′ to (i.e., flanking) the SCD target position (e.g., the erythroid enhancer). Two gRNAs, e.g., unimolecular (or chimeric) or modular gRNA molecules, are configured to position the two sets of breaks (either the double strand break or the pair of single strand breaks) on opposite sides of the SCD target knockdown position (e.g., the erythroid enhancer) in the BCL11A gene. In an embodiment, the first set of breaks (either the double strand break or the pair of single strand breaks) is positioned upstream of the erythroid enhancer within intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), and the second set of breaks (either the double strand break or the pair of single strand breaks) is positioned downstream of the erythroid enhancer within intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb) (see
The first set of breaks (either the double strand break or the pair of single strand breaks) may be positioned as follows:
-
- (1) upstream of the 5′ end of the erythroid enhancer in intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb),
and the second set of breaks (either the double strand break or the pair of single strand breaks) to be paired with the first set of breaks (either the double strand break or the pair of single strand breaks) may be positioned as follows: - (1) downstream the 3′ end of the erythroid enhancer in intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb).
For example, the first set of breaks (either the double strand break or the pair of single strand breaks) may be positioned in the BCL11A gene:
(1) between TSS+0.75 kb to TSS+10 kb,
(2) between TSS+10 kb to TSS+20 kb,
(3) between TSS+20 kb to TSS+30 kb,
(4) between TSS+30 kb to TSS+40 kb,
(5) between TSS+40 kb to TSS+45 kb,
(6) between TSS+45 kb to TSS+47.5 kb,
(7) between TSS+47.5 kb to TSS+50 kb,
(8) between TSS+50 kb to TSS+51 kb,
(9) between TSS+51 kb to TSS+51.1 kb,
(10) between TSS+51.1 kb to TSS+51.2 kb,
(11) between TSS+51.2 kb to TSS+51.3 kb,
(12) between TSS+51.3 kb to TSS+51.4 kb,
(13) between TSS+51.4 kb to TSS+51.5 kb,
(14) between TSS+51.5 kb to TSS+51.6 kb,
(15) between TSS+51.6 kb to TSS+51.7 kb,
(16) between TSS+51.7 kb to TSS+51.8 kb,
(17) between TSS+51.8 kb to TSS+51.9 kb,
(18) between TSS+51.9 kb to TSS+52 kb,
(19) between TSS+52 kb to TSS+53 kb,
(20) between TSS+53 kb to TSS+54 kb,
(21) between TSS+54 kb to TSS+55 kb,
(22) between TSS+55 kb to TSS+56 kb,
(23) between TSS+56 kb to TSS+57 kb,
(24) between TSS+57 kb to TSS+58 kb,
(25) between TSS+58 kb to TSS+59 kb,
(26) between TSS+59 kb to TSS+60 kb,
(27) between TSS+60 kb to TSS+61 kb,
(28) between TSS+61 kb to TSS+62 kb,
(29) between TSS+62 kb to TSS+63 kb,
(30) between TSS+63 kb to TSS+64 kb, or
(31) between TSS+64 kb to TSS+64.4 kb,
and the second set of breaks (either the double strand break or the pair of single strand breaks) to be paired with the first set of breaks (either the double strand break or the pair of single strand breaks) may be positioned in the BCL11A gene:
(1) between TSS+52 kb to TSS+53 kb,
(2) between TSS+53 kb to TSS+54 kb,
(3) between TSS+54 kb to TSS+55 kb,
(4) between TSS+55 kb to TSS+56 kb,
(5) between TSS+56 kb to TSS+57 kb,
(6) between TSS+57 kb to TSS+58 kb,
(7) between TSS+58 kb to TSS+59 kb,
(8) between TSS+59 kb to TSS+60 kb,
(9) between TSS+60 kb to TSS+61 kb,
(10) between TSS+61 kb to TSS+62 kb,
(11) between TSS+62 kb to TSS+63 kb,
(12) between TSS+63 kb to TSS+64 kb,
(13) between TSS+64 kb to TSS+64.4 kb,
(14) between TSS+64.4 kb to TSS+65 kb,
(15) between TSS+65 kb to TSS+65.1 kb,
(16) between TSS+65.1 kb to TSS+65.2 kb,
(17) between TSS+65.2 kb to TSS+65.3 kb,
(18) between TSS+65.3 kb to TSS+65.4 kb,
(19) between TSS+65.4 kb to TSS+65.5 kb,
(20) between TSS+65.5 kb to TSS+65.7 kb,
(21) between TSS+65.7 kb to TSS+65.8 kb,
(22) between TSS+65.8 kb to TSS+65.9 kb,
(23) between TSS+65.9 kb to TSS+66 kb,
(24) between TSS+66 kb to TSS+67 kb,
(25) between TSS+67 kb to TSS+68 kb,
(26) between TSS+68 kb to TSS+69 kb,
(27) between TSS+69 kb to TSS+70 kb,
(28) between TSS+70 kb to TSS+75 kb,
(29) between TSS+75 kb to TSS+80 kb, or
(30) between TSS+80 kb to TSS+84.4 kb.
While not wishing to be bound by theory, it is believed that the two sets of breaks (either the double strand break or the pair of single strand breaks) allow for NHEJ-mediated deletion of erythroid enhancer in the BCL11A gene.
In an embodiment, the method comprises introducing a NHEJ-mediated deletion of a genomic sequence including the erythroid enhancer. As described herein, in one embodiment, the method comprises the introduction of two sets of breaks (e.g., two pairs of single strand breaks)-one 5′ and the other 3′ to (i.e., flanking) the SCD target position (e.g., the erythroid enhancer). Two gRNAs, e.g., unimolecular (or chimeric) or modular gRNA molecules, are configured to position the two sets of breaks on opposite sides of the SCD target knockdown position (e.g., the erythroid enhancer) in the BCL11A gene. In an embodiment, the first set of breaks (i.e., the first pair of single strand breaks) is positioned upstream of the erythroid enhancer within intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), and the second set of breaks (i.e., the second pair of single strand breaks) is positioned downstream of the erythroid enhancer within intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb) (see
The first pair of single strand breaks may be positioned as follows:
-
- (1) upstream of the 5′ end of the erythroid enhancer in intron 2 (e.g., between TSS+0.75 kb to TSS+52.0 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb),
and the second pair of single strand breaks to be paired with the first pair of single strand breaks may be positioned as follows: - (1) downstream the 3′ end of the erythroid enhancer in intron 2 (e.g., between TSS+64.4 kb to TSS+84.7 kb), or
- (2) within the erythroid enhancer provided that a portion of the erythroid enhancer is removed resulting in disruption of one or more DHSs (e.g., between TSS+52.0 kb to TSS+64.4 kb).
For example, the pair of single strand breaks may be positioned in the BCL11A gene:
(1) between TSS+0.75 kb to TSS+10 kb,
(2) between TSS+10 kb to TSS+20 kb,
(3) between TSS+20 kb to TSS+30 kb,
(4) between TSS+30 kb to TSS+40 kb,
(5) between TSS+40 kb to TSS+45 kb,
(6) between TSS+45 kb to TSS+47.5 kb,
(7) between TSS+47.5 kb to TSS+50 kb,
(8) between TSS+50 kb to TSS+51 kb,
(9) between TSS+51 kb to TSS+51.1 kb,
(10) between TSS+51.1 kb to TSS+51.2 kb,
(11) between TSS+51.2 kb to TSS+51.3 kb,
(12) between TSS+51.3 kb to TSS+51.4 kb,
(13) between TSS+51.4 kb to TSS+51.5 kb,
(14) between TSS+51.5 kb to TSS+51.6 kb,
(15) between TSS+51.6 kb to TSS+51.7 kb,
(16) between TSS+51.7 kb to TSS+51.8 kb,
(17) between TSS+51.8 kb to TSS+51.9 kb,
(18) between TSS+51.9 kb to TSS+52 kb,
(19) between TSS+52 kb to TSS+53 kb,
(20) between TSS+53 kb to TSS+54 kb,
(21) between TSS+54 kb to TSS+55 kb,
(22) between TSS+55 kb to TSS+56 kb,
(23) between TSS+56 kb to TSS+57 kb,
(24) between TSS+57 kb to TSS+58 kb,
(25) between TSS+58 kb to TSS+59 kb,
(26) between TSS+59 kb to TSS+60 kb,
(27) between TSS+60 kb to TSS+61 kb,
(28) between TSS+61 kb to TSS+62 kb,
(29) between TSS+62 kb to TSS+63 kb,
(30) between TSS+63 kb to TSS+64 kb, or
(31) between TSS+64 kb to TSS+64.4 kb,
and the second pair of single strand breaks to be paired with the first pair of single strand breaks may be positioned in the BCL11A gene:
(1) between TSS+52 kb to TSS+53 kb,
(2) between TSS+53 kb to TSS+54 kb,
(3) between TSS+54 kb to TSS+55 kb,
(4) between TSS+55 kb to TSS+56 kb,
(5) between TSS+56 kb to TSS+57 kb,
(6) between TSS+57 kb to TSS+58 kb,
(7) between TSS+58 kb to TSS+59 kb,
(8) between TSS+59 kb to TSS+60 kb,
(9) between TSS+60 kb to TSS+61 kb,
(10) between TSS+61 kb to TSS+62 kb,
(11) between TSS+62 kb to TSS+63 kb,
(12) between TSS+63 kb to TSS+64 kb,
(13) between TSS+64 kb to TSS+64.4 kb,
(14) between TSS+64.4 kb to TSS+65 kb,
(15) between TSS+65 kb to TSS+65.1 kb,
(16) between TSS+65.1 kb to TSS+65.2 kb,
(17) between TSS+65.2 kb to TSS+65.3 kb,
(18) between TSS+65.3 kb to TSS+65.4 kb,
(19) between TSS+65.4 kb to TSS+65.5 kb,
(20) between TSS+65.5 kb to TSS+65.7 kb,
(21) between TSS+65.7 kb to TSS+65.8 kb,
(22) between TSS+65.8 kb to TSS+65.9 kb,
(23) between TSS+65.9 kb to TSS+66 kb,
(24) between TSS+66 kb to TSS+67 kb,
(25) between TSS+67 kb to TSS+68 kb,
(26) between TSS+68 kb to TSS+69 kb,
(27) between TSS+69 kb to TSS+70 kb,
(28) between TSS+70 kb to TSS+75 kb,
(29) between TSS+75 kb to TSS+80 kb, or
(30) between TSS+80 kb to TSS+84.4 kb.
While not wishing to be bound by theory, it is believed that the two sets of breaks (e.g., the two pair of single strand breaks) allow for NHEJ-mediated deletion of erythroid enhancer in the BCL11A gene.
Knocking Down the BCL11A Gene Mediated by an Enzymatically Inactive Cas9 (eiCas9) Molecule or an eiCas9-Fusion Protein by Targeting the Promoter Region of the Gene.
A targeted knockdown approach reduces or eliminates expression of functional BCL11A gene product. As described herein, a targeted knockdown is mediated by targeting an enzymatically inactive Cas9 (eiCas9) molecule or an eiCas9 fused to a transcription repressor domain or chromatin modifying protein to alter transcription, e.g., to block, reduce, or decrease transcription, of the BCL11A gene. In an embodiment, one or more eiCas9s may be used to block binding of one or more endogenous transcription factors. In another embodiment, an eiCas9 can be fused to a chromatin modifying protein. Altering chromatin status can result in decreased expression of the target gene. One or more eiCas9s fused to one or more chromatin modifying proteins may be used to alter chromatin status.
Methods and compositions discussed herein may be used to alter the expression of the BCL11A gene to treat or prevent SCD by targeting a promoter region of the BCL11A gene. In an embodiment, the promoter region, e.g., at least 2 kb, at least 1.5 kb, at least 1.0 kb, or at least 0.5 kb upstream or downstream of the TSS is targeted to knockdown expression of the BCL11A gene. In an embodiment, the methods and compositions discussed herein may be used to knock down the BCL11A gene to treat or prevent SCD by targeting 0.5 kb upstream or downstream of the TSS. A targeted knockdown approach reduces or eliminates expression of functional BCL11A gene product. As described herein, a targeted knockdown is mediated by targeting an enzymatically inactive Cas9 (eiCas9) molecule or an eiCas9 fused to a transcription repressor domain or chromatin modifying protein to alter transcription, e.g., to block, reduce, or decrease transcription, of the BCL11A gene.
Methods to Treat or Prevent Sickle Cell Disease (SCD)
Disclosed herein are the approaches to treat or prevent SCD, using the compositions and methods described herein.
One approach to treat or prevent SCD is to repair (i.e., correct) one or more mutations in the HBB gene, e.g., by HDR. In this approach, mutant HBB allele(s) are corrected and restored to wild type state. While not wishing to be bound by theory, it is believed that correction of the glutamic acid to valine substitution at amino acid 6 in the beta-globin gene restores wild type beta-globin production within erythroid cells. The method described herein can be performed in all cell types. Beta-globin is expressed in cells of erythroid cell lineage. In an embodiment, an erythroid cell is targeted.
In an embodiment, one HBB allele is repaired in the subject. In another embodiment, both HBB alleles are repaired in the subject. In either situation, the subjects can be cured of disease. As the disease only displays a phenotype when both alleles are mutated, repair of a single allele is adequate for a cure.
In one approach, the BCL11A gene is targeted as a targeted knockout or knockdown, e.g., to increase expression of fetal hemoglobin.
While not wishing to be bound by theory, it is considered that increasing levels of fetal hemoglobin (HbF) in subjects with SCD may ameliorate disease. Fetal hemoglobin can replace beta hemoglobin in the hemoglobin complex, form adequate tetramers with alpha hemoglobin, and effectively carry oxygen to tissues. Subjects with beta-thalassemia who express higher levels of fetal hemoglobin have been found to have a less severe phenotype. Hydroxyurea, often used in the treatment of beta-thalassemia, may exert its mechanism of action via increasing levels of HbF production.
In an embodiment, knockout or knockdown of the BCL11A gene increases fetal hemoglobin levels in beta-thalassemia subjects and improves phenotype and/or reduces or prevents disease progression. BCL11A is a zinc-finger repressor that is involved in the regulation of fetal hemoglobin and acts to repress the synthesis of fetal hemoglobin. Knockout of the BCL11A gene in erythroid cells induces increased fetal hemoglobin (HbF) synthesis and increased HbF can result in more effective oxygen carrying capacity in subjects with beta-thalassemia (HbF will form tetramers with hemoglobin alpha).
In an embodiment, the BCL11A knockout or knockdown is targeted specifically to cells of the erythroid lineage. BCL11A knockout in erythroid cells has been found in in vitro studies to have no effect on erythroid growth, maturation and function. In an embodiment, erythroid cells are preferentially targeted, e.g., at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% of the targeted cells are erythroid cells. For example, in the case of in vivo delivery, erythroid cells are preferentially targeted, and if cells are treated ex vivo and returned to the subject, erythroid cells are preferentially modified.
In an embodiment, the methods described herein result in increased fetal hemoglobin synthesis in beta thalassemia subjects, thereby improving disease phenotype in subjects with SCD. For example, subjects with beta thalassemia major will suffer from less severe anemia and will need fewer blood transfusions. They will therefore have fewer complications arising from transfusions and chelation therapy. In an embodiment, the method described herein increases fetal hemoglobin synthesis and improves the oxygen carrying capacity of erythroid cells. For example, subjects are expected to demonstrate decreased rates of extramedullary erythropoiesis and decreased erythroid hypertrophy within the bone marrow compared to a subject who has not received the therapy. In an embodiment, the method described herein results in reduction of bone fractures, bone abnormalities, splenomegaly, and thrombosis compared to a subject who has not received the therapy.
Knockdown or knockout of one or both BCL11A alleles may be performed prior to disease onset or after disease onset, but preferably early in the disease course.
In an embodiment, the method comprises initiating treatment of a subject prior to disease onset.
In an embodiment, the method comprises initiating treatment of a subject after disease onset.
In an embodiment, the method comprises initiating treatment of a subject well after disease onset, e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 16, 24, 36, 48 or more months after onset of SCD. While not wishing to be bound by theory it is believed that this treatment may be effective if subjects present well into the course of illness.
In an embodiment, the method comprises initiating treatment of a subject in an advanced stage of disease.
Overall, initiation of treatment for subjects at all stages of disease is expected to prevent negative consequences of disease and be of benefit to subjects.
In an embodiment, the method comprises initiating treatment of a subject prior to disease expression. In an embodiment, the method comprises initiating treatment of a subject in an early stage of disease, e.g., when a subject has tested positive for beta-thalassemia mutations but has no signs or symptoms associated with beta-thalassemia major, minor or intermedia.
In an embodiment, the method comprises initiating treatment of a subject at the appearance of microcytic anemia, e.g., in an infant, child, adult or young adult.
In an embodiment, the method comprises initiating treatment of a subject who is transfusion-dependent.
In an embodiment, the method comprises initiating treatment of a subject who has tested positive for a mutation in a beta globin gene.
In an embodiment, the method comprises initiating treatment at the appearance of any one or more of the following findings associated or consistent with beta-thalassemia major or beta-thalassemia minor: anemia, diarrhea, fever, failure to thrive, frontal bossing, broken long bones, hepatomegaly, splenomegaly, thrombosis, pulmonary embolus, stroke, leg ulcer, cardiomyopathy, cardiac arrhythmia, and evidence of extramedullary erythropoiesis.
In an embodiment, a cell is treated, e.g., ex vivo. In an embodiment, an ex vivo treated cell is returned to a subject.
In an embodiment, allogenic or autologous bone marrow or erythroid cells are treated ex vivo. In an embodiment, an ex vivo treated allogenic or autologous bone marrow or erythroid cells are administered to the subject. In an embodiment, an erythroid cell, e.g., an autologous erythroid cell, is treated ex vivo and returned to the subject. In an embodiment, an autologous stem cell, is treated ex vivo and returned to the subject. In an embodiment, the modified HSCs are administered to the patient following no myeloablative pre-conditioning. In an embodiment, the modified HSCs are administered to the patient following mild myeloablative pre-conditioning such that following engraftment, some of the hematopoietic cells are devied from the modified HSCs. In other aspects, the HSCs are administered after full myeloablation such that following engraftment, 100% of the hematopoietic cells are derived from the modified HSCs.
In an embodiment, the method comprises delivery of a gRNA molecule and Cas9 molecule by intravenous injection, intramuscular injection, subcutaneous injection, or intra-bone marrow (IBM) injection.
In an embodiment, the method comprises delivery of a gRNA molecule and/or a Cas9 molecule by an AAV. In an embodiment, the method comprises delivery of a gRNA molecule and/or a Cas9 molecule by a lentivirus. In an embodiment, the method comprises delivery of a gRNA molecule and/or a Cas9 molecule by a nanoparticle. In an embodiment, the method comprises delivery of a gRNA molecule by a parvovirus, e.g., a modified parvovirus specifically designed to target bone marrow cells and/or CD4 cells. In an embodiment, two or more gRNA molecules (e.g., a second, third or fourth gRNA molecules) are delivered.
I. gRNA Molecules
A gRNA molecule, as that term is used herein, refers to a nucleic acid that promotes the specific targeting or homing of a gRNA molecule/Cas9 molecule complex to a target nucleic acid. gRNA molecules can be unimolecular (having a single RNA molecule), sometimes referred to herein as “chimeric” gRNAs, or modular (comprising more than one, and typically two, separate RNA molecules). A gRNA molecule comprises a number of domains. The gRNA molecule domains are described in more detail below.
Several exemplary gRNA structures, with domains indicated thereon, are provided in
In an embodiment, a unimolecular, or chimeric, gRNA comprises, preferably from 5′ to 3′:
-
- a targeting domain (which is complementary to a target nucleic acid in the HBB gene or BCL11A gene, e.g., a targeting domain from any of Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31;
- a first complementarity domain;
- a linking domain;
- a second complementarity domain (which is complementary to the first complementarity domain);
- a proximal domain; and
- optionally, a tail domain.
In an embodiment, a modular gRNA comprises:
-
- a first strand comprising, preferably from 5′ to 3′;
- a targeting domain (which is complementary to a target nucleic acid in the HBB gene or BCL11A gene, e.g., a targeting domain from Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31; and
- a first complementarity domain; and
- a second strand, comprising, preferably from 5′ to 3′:
- optionally, a 5′ extension domain;
- a second complementarity domain;
- a proximal domain; and
- optionally, a tail domain.
- a first strand comprising, preferably from 5′ to 3′;
The domains are discussed briefly below.
The Targeting Domain
The targeting domain comprises a nucleotide sequence that is complementary, e.g., at least 80, 85, 90, or 95% complementary, e.g., fully complementary, to the target sequence on the target nucleic acid. The targeting domain is part of an RNA molecule and will therefore comprise the base uracil (U), while any DNA encoding the gRNA molecule will comprise the base thymine (T). While not wishing to be bound by theory, in an embodiment, it is believed that the complementarity of the targeting domain with the target sequence contributes to specificity of the interaction of the gRNA molecule/Cas9 molecule complex with a target nucleic acid. It is understood that in a targeting domain and target sequence pair, the uracil bases in the targeting domain will pair with the adenine bases in the target sequence. In an embodiment, the target domain itself comprises in the 5′ to 3′ direction, an optional secondary domain, and a core domain. In an embodiment, the core domain is fully complementary with the target sequence. In an embodiment, the targeting domain is 5 to 50 nucleotides in length. The strand of the target nucleic acid with which the targeting domain is complementary is referred to herein as the complementary strand. Some or all of the nucleotides of the domain can have a modification, e.g., a modification found in Section VIII herein.
In an embodiment, the targeting domain is 16 nucleotides in length.
In an embodiment, the targeting domain is 17 nucleotides in length.
In an embodiment, the targeting domain is 18 nucleotides in length.
In an embodiment, the targeting domain is 19 nucleotides in length.
In an embodiment, the targeting domain is 20 nucleotides in length.
In an embodiment, the targeting domain is 21 nucleotides in length.
In an embodiment, the targeting domain is 22 nucleotides in length.
In an embodiment, the targeting domain is 23 nucleotides in length.
In an embodiment, the targeting domain is 24 nucleotides in length.
In an embodiment, the targeting domain is 25 nucleotides in length.
In an embodiment, the targeting domain is 26 nucleotides in length.
In an embodiment, the targeting domain comprises 16 nucleotides.
In an embodiment, the targeting domain comprises 17 nucleotides.
In an embodiment, the targeting domain comprises 18 nucleotides.
In an embodiment, the targeting domain comprises 19 nucleotides.
In an embodiment, the targeting domain comprises 20 nucleotides.
In an embodiment, the targeting domain comprises 21 nucleotides.
In an embodiment, the targeting domain comprises 22 nucleotides.
In an embodiment, the targeting domain comprises 23 nucleotides.
In an embodiment, the targeting domain comprises 24 nucleotides.
In an embodiment, the targeting domain comprises 25 nucleotides.
In an embodiment, the targeting domain comprises 26 nucleotides.
Targeting domains are discussed in more detail below.
The First Complementarity Domain
The first complementarity domain is complementary with the second complementarity domain, and in an embodiment, has sufficient complementarity to the second complementarity domain to form a duplexed region under at least some physiological conditions. In an embodiment, the first complementarity domain is 5 to 30 nucleotides in length. In an embodiment, the first complementarity domain is 5 to 25 nucleotides in length. In an embodiment, the first complementary domain is 7 to 25 nucleotides in length. In an embodiment, the first complementary domain is 7 to 22 nucleotides in length. In an embodiment, the first complementary domain is 7 to 18 nucleotides in length. In an embodiment, the first complementary domain is 7 to 15 nucleotides in length. In an embodiment, the first complementary domain is 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides in length.
In an embodiment, the first complementarity domain comprises 3 subdomains, which, in the 5′ to 3′ direction are: a 5′ subdomain, a central subdomain, and a 3′ subdomain. In an embodiment, the 5′ subdomain is 4 to 9, e.g., 4, 5, 6, 7, 8 or 9 nucleotides in length. In an embodiment, the central subdomain is 1, 2, or 3, e.g., 1, nucleotide in length. In an embodiment, the 3′ subdomain is 3 to 25, e.g., 4 to 22, 4 to 18, or 4 to 10, or 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides in length.
The first complementarity domain can share homology with, or be derived from, a naturally occurring first complementarity domain. In an embodiment, it has at least 50% homology with a first complementarity domain disclosed herein, e.g., an S. pyogenes, S. aureus or S. thermophilus, first complementarity domain.
Some or all of the nucleotides of the domain can have a modification, e.g., a modification found in Section VIII herein.
First complementarity domains are discussed in more detail below.
The Linking Domain
A linking domain serves to link the first complementarity domain with the second complementarity domain of a unimolecular gRNA. The linking domain can link the first and second complementarity domains covalently or non-covalently. In an embodiment, the linkage is covalent. In an embodiment, the linking domain covalently couples the first and second complementarity domains, see, e.g.,
In modular gRNA molecules the two molecules are associated by virtue of the hybridization of the complementarity domains see e.g.,
A wide variety of linking domains are suitable for use in unimolecular gRNA molecules. Linking domains can consist of a covalent bond, or be as short as one or a few nucleotides, e.g., 1, 2, 3, 4, or 5 nucleotides in length. In an embodiment, a linking domain is 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or 25 or more nucleotides in length. In an embodiment, a linking domain is 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 10, or 2 to 5 nucleotides in length. In an embodiment, a linking domain shares homology with, or is derived from, a naturally occurring sequence, e.g., the sequence of a tracrRNA that is 5′ to the second complementarity domain. In an embodiment, the linking domain has at least 50% homology with a linking domain disclosed herein.
Some or all of the nucleotides of the domain can have a modification, e.g., a modification found in Section VIII herein.
Linking domains are discussed in more detail below.
The 5′ Extension Domain
In an embodiment, a modular gRNA can comprise additional sequence, 5′ to the second complementarity domain, referred to herein as the 5′ extension domain, see, e.g.,
The Second Complementarity Domain
The second complementarity domain is complementary with the first complementarity domain, and in an embodiment, has sufficient complementarity to the second complementarity domain to form a duplexed region under at least some physiological conditions. In an embodiment, e.g., as shown in
In an embodiment, the second complementarity domain is 5 to 27 nucleotides in length. In an embodiment, it is longer than the first complementarity region. In an embodiment the second complementary domain is 7 to 27 nucleotides in length. In an embodiment, the second complementary domain is 7 to 25 nucleotides in length. In an embodiment, the second complementary domain is 7 to 20 nucleotides in length. In an embodiment, the second complementary domain is 7 to 17 nucleotides in length. In an embodiment, the complementary domain is 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, the second complementarity domain comprises 3 subdomains, which, in the 5′ to 3′ direction are: a 5′ subdomain, a central subdomain, and a 3′ subdomain. In an embodiment, the 5′ subdomain is 3 to 25, e.g., 4 to 22, 4 to 18, or 4 to 10, or 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides in length. In an embodiment, the central subdomain is 1, 2, 3, 4 or 5, e.g., 3, nucleotides in length. In an embodiment, the 3′ subdomain is 4 to 9, e.g., 4, 5, 6, 7, 8 or 9 nucleotides in length.
In an embodiment, the 5′ subdomain and the 3′ subdomain of the first complementarity domain, are respectively, complementary, e.g., fully complementary, with the 3′ subdomain and the 5′ subdomain of the second complementarity domain.
The second complementarity domain can share homology with or be derived from a naturally occurring second complementarity domain. In an embodiment, it has at least 50% homology with a second complementarity domain disclosed herein, e.g., an S. pyogenes, S. aureus or S. thermophilus, first complementarity domain.
Some or all of the nucleotides of the domain can have a modification, e.g., a modification found in Section VIII herein.
A Proximal domain
In an embodiment, the proximal domain is 5 to 20 nucleotides in length. In an embodiment, the proximal domain can share homology with or be derived from a naturally occurring proximal domain. In an embodiment, it has at least 50% homology with a proximal domain disclosed herein, e.g., an S. pyogenes, S. aureus or S. thermophilus, proximal domain.
Some or all of the nucleotides of the domain can have a modification, e.g., modification found in Section VIII herein.
A Tail Domain
As can be seen by inspection of the tail domains in
In an embodiment, the tail domain is absent or is 1 to 50 nucleotides in length. In an embodiment, the tail domain can share homology with or be derived from a naturally occurring proximal tail domain. In an embodiment, it has at least 50% homology with a tail domain disclosed herein, e.g., an S. pyogenes, S. aureus or S. thermophilus, tail domain.
In an embodiment, the tail domain includes nucleotides at the 3′ end that are related to the method of in vitro or in vivo transcription. When a T7 promoter is used for in vitro transcription of the gRNA, these nucleotides may be any nucleotides present before the 3′ end of the DNA template. When a U6 promoter is used for in vivo transcription, these nucleotides may be the sequence UUUUUU. When alternate pol-III promoters are used, these nucleotides may be various numbers or uracil bases or may include alternate bases.
The domains of gRNA molecules are described in more detail below.
The Targeting Domain
The “targeting domain” of the gRNA is complementary to the “target domain” on the target nucleic acid. The strand of the target nucleic acid comprising the nucleotide sequence complementary to the core domain of the gRNA is referred to herein as the “complementary strand” of the target nucleic acid. Guidance on the selection of targeting domains can be found, e.g., in Fu Y et al., Nat Biotechnol 2014 (doi: 10.1038/nbt.2808) and Sternberg S H et al., Nature 2014 (doi: 10.1038/nature13011).
In an embodiment, the targeting domain is 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, the targeting domain is 16 nucleotides in length.
In an embodiment, the targeting domain is 17 nucleotides in length.
In an embodiment, the targeting domain is 18 nucleotides in length.
In an embodiment, the targeting domain is 19 nucleotides in length.
In an embodiment, the targeting domain is 20 nucleotides in length.
In an embodiment, the targeting domain is 21 nucleotides in length.
In an embodiment, the targeting domain is 22 nucleotides in length.
In an embodiment, the targeting domain is 23 nucleotides in length.
In an embodiment, the targeting domain is 24 nucleotides in length.
In an embodiment, the targeting domain is 25 nucleotides in length.
In an embodiment, the targeting domain is 26 nucleotides in length.
In an embodiment, the targeting domain comprises 16 nucleotides.
In an embodiment, the targeting domain comprises 17 nucleotides.
In an embodiment, the targeting domain comprises 18 nucleotides.
In an embodiment, the targeting domain comprises 19 nucleotides.
In an embodiment, the targeting domain comprises 20 nucleotides.
In an embodiment, the targeting domain comprises 21 nucleotides.
In an embodiment, the targeting domain comprises 22 nucleotides.
In an embodiment, the targeting domain comprises 23 nucleotides.
In an embodiment, the targeting domain comprises 24 nucleotides.
In an embodiment, the targeting domain comprises 25 nucleotides.
In an embodiment, the targeting domain comprises 26 nucleotides.
In an embodiment, the targeting domain is 10+/−5, 20+/−5, 30+/−5, 40+/−5, 50+/−5, 60+/−5, 70+/−5, 80+/−5, 90+/−5, or 100+/−5 nucleotides, in length.
In an embodiment, the targeting domain is 20+/−5 nucleotides in length.
In an embodiment, the targeting domain is 20+/−10, 30+/−10, 40+/−10, 50+/−10, 60+/−10, 70+/−10, 80+/−10, 90+/−10, or 100+/−10 nucleotides, in length.
In an embodiment, the targeting domain is 30+/−10 nucleotides in length.
In an embodiment, the targeting domain is 10 to 100, 10 to 90, 10 to 80, 10 to 70, 10 to 60, 10 to 50, 10 to 40, 10 to 30, 10 to 20 or 10 to 15 nucleotides in length.
In another embodiment, the targeting domain is 20 to 100, 20 to 90, 20 to 80, 20 to 70, 20 to 60, 20 to 50, 20 to 40, 20 to 30, or 20 to 25 nucleotides in length.
Typically the targeting domain has full complementarity with the target sequence. In an embodiment the targeting domain has or includes 1, 2, 3, 4, 5, 6, 7 or 8 nucleotides that are not complementary with the corresponding nucleotide of the targeting domain.
In an embodiment, the target domain includes 1, 2, 3, 4 or 5 nucleotides that are complementary with the corresponding nucleotide of the targeting domain within 5 nucleotides of its 5′ end. In an embodiment, the target domain includes 1, 2, 3, 4 or 5 nucleotides that are complementary with the corresponding nucleotide of the targeting domain within 5 nucleotides of its 3′ end.
In an embodiment, the target domain includes 1, 2, 3, or 4 nucleotides that are not complementary with the corresponding nucleotide of the targeting domain within 5 nucleotides of its 5′ end. In an embodiment, the target domain includes 1, 2, 3, or 4 nucleotides that are not complementary with the corresponding nucleotide of the targeting domain within 5 nucleotides of its 3′ end.
In an embodiment, the degree of complementarity, together with other properties of the gRNA, is sufficient to allow targeting of a Cas9 molecule to the target nucleic acid.
In an embodiment, the targeting domain comprises two consecutive nucleotides that are not complementary to the target domain (“non-complementary nucleotides”), e.g., two consecutive noncomplementary nucleotides that are within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or more than 5 nucleotides away from one or both ends of the targeting domain.
In an embodiment, no two consecutive nucleotides within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or within a region that is more than 5 nucleotides away from one or both ends of the targeting domain, are not complementary to the targeting domain.
In an embodiment, there are no noncomplementary nucleotides within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or within a region that is more than 5 nucleotides away from one or both ends of the targeting domain.
In an embodiment, the targeting domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the targeting domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the targeting domain can be modified with a phosphorothioate, or other modification from Section VIII. In an embodiment, a nucleotide of the targeting domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII.
In an embodiment, the targeting domain includes 1, 2, 3, 4, 5, 6, 7 or 8 or more modifications. In an embodiment, the targeting domain includes 1, 2, 3, or 4 modifications within 5 nucleotides of its 5′ end. In an embodiment, the targeting domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 3′ end.
In an embodiment, the targeting domain comprises modifications at two consecutive nucleotides, e.g., two consecutive nucleotides that are within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or more than 5 nucleotides away from one or both ends of the targeting domain.
In an embodiment, no two consecutive nucleotides are modified within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or within a region that is more than 5 nucleotides away from one or both ends of the targeting domain. In an embodiment, no nucleotide is modified within 5 nucleotides of the 5′ end of the targeting domain, within 5 nucleotides of the 3′ end of the targeting domain, or within a region that is more than 5 nucleotides away from one or both ends of the targeting domain.
Modifications in the targeting domain can be selected to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate targeting domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in a system in Section IV. The candidate targeting domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, all of the modified nucleotides are complementary to and capable of hybridizing to corresponding nucleotides present in the target domain. In another embodiment, 1, 2, 3, 4, 5, 6, 7 or 8 or more modified nucleotides are not complementary to or capable of hybridizing to corresponding nucleotides present in the target domain.
In an embodiment, the targeting domain comprises, preferably in the 5′→3′ direction: a secondary domain and a core domain. These domains are discussed in more detail below.
The Core Domain and Secondary Domain of the Targeting Domain
The “core domain” of the targeting domain is complementary to the “core domain target” on the target nucleic acid. In an embodiment, the core domain comprises about 8 to about 13 nucleotides from the 3′ end of the targeting domain (e.g., the most 3′ 8 to 13 nucleotides of the targeting domain).
In an embodiment, the core domain and targeting domain, are independently, 6+/−2, 7+/−2, 8+/−2, 9+/−2, 10+/−2, 11+/−2, 12+/−2, 13+/−2, 14+/−2, 15+/−2, or 16+−2, nucleotides in length.
In an embodiment, the core domain and targeting domain, are independently, 10+/−2 nucleotides in length.
In an embodiment, the core domain and targeting domain, are independently, 10+/−4 nucleotides in length.
In an embodiment, the core domain and targeting domain are independently 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18 nucleotides in length.
In an embodiment, the core domain and targeting domain are independently 3 to 20, 4 to 20, 5 to 20, 6 to 20, 7 to 20, 8 to 20, 9 to 20 10 to 20 or 15 to 20 nucleotides in length.
In an embodiment, the core domain and targeting domain are independently 3 to 15, e.g., 6 to 15, 7 to 14, 7 to 13, 6 to 12, 7 to 12, 7 to 11, 7 to 10, 8 to 14, 8 to 13, 8 to 12, 8 to 11, 8 to 10 or 8 to 9 nucleotides in length.
The core domain is complementary with the core domain target. Typically the core domain has exact complementarity with the core domain target. In an embodiment, the core domain can have 1, 2, 3, 4 or 5 nucleotides that are not complementary with the corresponding nucleotide of the core domain. In an embodiment, the degree of complementarity, together with other properties of the gRNA, is sufficient to allow targeting of a Cas9 molecule to the target nucleic acid.
The “secondary domain” of the targeting domain of the gRNA is complementary to the “secondary domain target” of the target nucleic acid.
In an embodiment, the secondary domain is positioned 5′ to the core domain.
In an embodiment, the secondary domain is absent or optional.
In an embodiment, if the targeting domain is 26 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 12 to 17 nucleotides in length.
In an embodiment, if the targeting domain is 25 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 12 to 17 nucleotides in length.
In an embodiment, if the targeting domain is 24 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 11 to 16 nucleotides in length.
In an embodiment, if the targeting domain is 23 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 10 to 15 nucleotides in length.
In an embodiment, if the targeting domain is 22 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 9 to 14 nucleotides in length.
In an embodiment, if the targeting domain is 21 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 8 to 13 nucleotides in length.
In an embodiment, if the targeting domain is 20 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 7 to 12 nucleotides in length.
In an embodiment, if the targeting domain is 19 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 6 to 11 nucleotides in length.
In an embodiment, if the targeting domain is 18 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 5 to 10 nucleotides in length.
In an embodiment, if the targeting domain is 17 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 4 to 9 nucleotides in length.
In an embodiment, if the targeting domain is 16 nucleotides in length and the core domain (counted from the 3′ end of the targeting domain) is 8 to 13 nucleotides in length, the secondary domain is 3 to 8 nucleotides in length.
In an embodiment, the secondary domain is 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 nucleotides in length.
The secondary domain is complementary with the secondary domain target. Typically the secondary domain has exact complementarity with the secondary domain target. In an embodiment the secondary domain can have 1, 2, 3, 4 or 5 nucleotides that are not complementary with the corresponding nucleotide of the secondary domain. In an embodiment, the degree of complementarity, together with other properties of the gRNA, is sufficient to allow targeting of a Cas9 molecule to the target nucleic acid.
In an embodiment, the core domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the core domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the core domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the core domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII. Typically, a core domain will contain no more than 1, 2, or 3 modifications.
Modifications in the core domain can be selected to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate core domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described at Section IV. The candidate core domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the secondary domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the secondary domain comprises one or more modifications, e.g., modifications that render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the secondary domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the secondary domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification from Section VIII. Typically, a secondary domain will contain no more than 1, 2, or 3 modifications.
Modifications in the secondary domain can be selected to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate secondary domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described at Section IV. The candidate secondary domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, (1) the degree of complementarity between the core domain and its target, and (2) the degree of complementarity between the secondary domain and its target, may differ. In an embodiment, (1) may be greater than (2). In an embodiment, (1) may be less than (2). In an embodiment, (1) and (2) are the same, e.g., each may be completely complementary with its target.
In an embodiment, (1) the number of modifications (e.g., modifications from Section VIII) of the nucleotides of the core domain and (2) the number of modification (e.g., modifications from Section VIII) of the nucleotides of the secondary domain, may differ. In an embodiment, (1) may be less than (2). In an embodiment, (1) may be greater than (2). In an embodiment, (1) and (2) may be the same, e.g., each may be free of modifications.
The First and Second Complementarity Domains
The first complementarity domain is complementary with the second complementarity domain.
Typically the first domain does not have exact complementarity with the second complementarity domain target. In an embodiment, the first complementarity domain can have 1, 2, 3, 4 or 5 nucleotides that are not complementary with the corresponding nucleotide of the second complementarity domain. In an embodiment, 1, 2, 3, 4, 5 or 6, e.g., 3 nucleotides, will not pair in the duplex, and, e.g., form a non-duplexed or looped-out region. In an embodiment, an unpaired, or loop-out, region, e.g., a loop-out of 3 nucleotides, is present on the second complementarity domain. In an embodiment, the unpaired region begins 1, 2, 3, 4, 5, or 6, e.g., 4, nucleotides from the 5′ end of the second complementarity domain.
In an embodiment, the degree of complementarity, together with other properties of the gRNA, is sufficient to allow targeting of a Cas9 molecule to the target nucleic acid.
In an embodiment, the first and second complementarity domains are:
independently, 6+/−2, 7+/−2, 8+/−2, 9+/−2, 10+/−2, 11+/−2, 12+/−2, 13+/−2, 14+/−2, 15+/−2, 16+/−2, 17+/−2, 18+/−2, 19+/−2, or 20+/−2, 21+/−2, 22+/−2, 23+/−2, or 24+/−2 nucleotides in length;
independently, 6, 7, 8, 9, 10, 11, 12, 13, 14, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26, nucleotides in length; or
independently, 5 to 24, 5 to 23, 5 to 22, 5 to 21, 5 to 20, 7 to 18, 9 to 16, or 10 to 14 nucleotides in length.
In an embodiment, the second complementarity domain is longer than the first complementarity domain, e.g., 2, 3, 4, 5, or 6, e.g., 6, nucleotides longer.
In an embodiment, the first and second complementary domains, independently, do not comprise modifications, e.g., modifications of the type provided in Section VIII.
In an embodiment, the first and second complementary domains, independently, comprise one or more modifications, e.g., modifications that the render the domain less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII.
In an embodiment, the first and second complementary domains, independently, include 1, 2, 3, 4, 5, 6, 7 or 8 or more modifications. In an embodiment, the first and second complementary domains, independently, include 1, 2, 3, or 4 modifications within 5 nucleotides of its 5′ end. In an embodiment, the first and second complementary domains, independently, include as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 3′ end.
In an embodiment, the first and second complementary domains, independently, include modifications at two consecutive nucleotides, e.g., two consecutive nucleotides that are within 5 nucleotides of the 5′ end of the domain, within 5 nucleotides of the 3′ end of the domain, or more than 5 nucleotides away from one or both ends of the domain. In an embodiment, the first and second complementary domains, independently, include no two consecutive nucleotides that are modified, within 5 nucleotides of the 5′ end of the domain, within 5 nucleotides of the 3′ end of the domain, or within a region that is more than 5 nucleotides away from one or both ends of the domain. In an embodiment, the first and second complementary domains, independently, include no nucleotide that is modified within 5 nucleotides of the 5′ end of the domain, within 5 nucleotides of the 3′ end of the domain, or within a region that is more than 5 nucleotides away from one or both ends of the domain.
Modifications in a complementarity domain can be selected to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate complementarity domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described in Section IV. The candidate complementarity domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the first complementarity domain has at least 60, 70, 80, 85%, 90% or 95% homology with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference first complementarity domain, e.g., a naturally occurring, e.g., an S. pyogenes, S. aureus or S. thermophilus, first complementarity domain, or a first complementarity domain described herein, e.g., from
In an embodiment, the second complementarity domain has at least 60, 70, 80, 85%, 90%, or 95% homology with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference second complementarity domain, e.g., a naturally occurring, e.g., an S. pyogenes, S. aureus or S. thermophilus, second complementarity domain, or a second complementarity domain described herein, e.g., from
The duplexed region formed by first and second complementarity domains is typically 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 base pairs in length (excluding any looped out or unpaired nucleotides).
In an embodiment, the first and second complementarity domains, when duplexed, comprise 11 paired nucleotides, for example, in the gRNA sequence (one paired strand underlined, one bolded):
In an embodiment, the first and second complementarity domains, when duplexed, comprise 15 paired nucleotides, for example in the gRNA sequence (one paired strand underlined, one bolded):
In an embodiment the first and second complementarity domains, when duplexed, comprise 16 paired nucleotides, for example in the gRNA sequence (one paired strand underlined, one bolded):
In an embodiment the first and second complementarity domains, when duplexed, comprise 21 paired nucleotides, for example in the gRNA sequence (one paired strand underlined, one bolded):
In an embodiment, nucleotides are exchanged to remove poly-U tracts, for example in the gRNA sequences (exchanged nucleotides underlined):
The 5′ Extension Domain
In an embodiment, a modular gRNA can comprise additional sequence, 5′ to the second complementarity domain. In an embodiment, the 5′ extension domain is 2 to 10, 2 to 9, 2 to 8, 2 to 7, 2 to 6, 2 to 5, or 2 to 4 nucleotides in length. In an embodiment, the 5′ extension domain is 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more nucleotides in length.
In an embodiment, the 5′ extension domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the 5′ extension domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the 5′ extension domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment, a nucleotide of the 5′ extension domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII.
In an embodiment, the 5′ extension domain can comprise as many as 1, 2, 3, 4, 5, 6, 7 or 8 modifications. In an embodiment, the 5′ extension domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 5′ end, e.g., in a modular gRNA molecule. In an embodiment, the 5′ extension domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 3′ end, e.g., in a modular gRNA molecule.
In an embodiment, the 5′ extension domain comprises modifications at two consecutive nucleotides, e.g., two consecutive nucleotides that are within 5 nucleotides of the 5′ end of the 5′ extension domain, within 5 nucleotides of the 3′ end of the 5′ extension domain, or more than 5 nucleotides away from one or both ends of the 5′ extension domain. In an embodiment, no two consecutive nucleotides are modified within 5 nucleotides of the 5′ end of the 5′ extension domain, within 5 nucleotides of the 3′ end of the 5′ extension domain, or within a region that is more than 5 nucleotides away from one or both ends of the 5′ extension domain. In an embodiment, no nucleotide is modified within 5 nucleotides of the 5′ end of the 5′ extension domain, within 5 nucleotides of the 3′ end of the 5′ extension domain, or within a region that is more than 5 nucleotides away from one or both ends of the 5′ extension domain.
Modifications in the 5′ extension domain can be selected so as to not interfere with gRNA molecule efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate 5′ extension domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described at Section IV. The candidate 5′ extension domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the 5′ extension domain has at least 60, 70, 80, 85, 90 or 95% homology with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference 5′ extension domain, e.g., a naturally occurring, e.g., an S. pyogenes, S. aureus or S. thermophilus, 5′ extension domain, or a 5′ extension domain described herein, e.g., from
The Linking Domain
In a unimolecular gRNA molecule the linking domain is disposed between the first and second complementarity domains. In a modular gRNA molecule, the two molecules are associated with one another by the complementarity domains.
In an embodiment, the linking domain is 10+/−5, 20+/−5, 30+/−5, 40+/−5, 50+/−5, 60+/−5, 70+/−5, 80+/−5, 90+/−5, or 100+/−5 nucleotides, in length.
In an embodiment, the linking domain is 20+/−10, 30+/−10, 40+/−10, 50+/−10, 60+/−10, 70+/−10, 80+/−10, 90+/−10, or 100+/−10 nucleotides, in length.
In an embodiment, the linking domain is 10 to 100, 10 to 90, 10 to 80, 10 to 70, 10 to 60, 10 to 50, 10 to 40, 10 to 30, 10 to 20 or 10 to 15 nucleotides in length.
In another embodiment, the linking domain is 20 to 100, 20 to 90, 20 to 80, 20 to 70, 20 to 60, 20 to 50, 20 to 40, 20 to 30, or 20 to 25 nucleotides in length.
In an embodiment, the linking domain is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 17, 18, 19, or 20 nucleotides in length.
In and embodiment, the linking domain is a covalent bond.
In an embodiment, the linking domain comprises a duplexed region, typically adjacent to or within 1, 2, or 3 nucleotides of the 3′ end of the first complementarity domain and/or the 5-end of the second complementarity domain. In an embodiment, the duplexed region can be 20+/−10 base pairs in length. In an embodiment, the duplexed region can be 10+/−5, 15+/−5, 20+/−5, or 30+/−5 base pairs in length. In an embodiment, the duplexed region can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 base pairs in length.
Typically the sequences forming the duplexed region have exact complementarity with one another, though in an embodiment as many as 1, 2, 3, 4, 5, 6, 7 or 8 nucleotides are not complementary with the corresponding nucleotides.
In an embodiment, the linking domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the linking domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the linking domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the linking domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII. In an embodiment, the linking domain can comprise as many as 1, 2, 3, 4, 5, 6, 7 or 8 modifications.
Modifications in a linking domain can be selected so as to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate linking domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated a system described in Section IV. A candidate linking domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the linking domain has at least 60, 70, 80, 85, 90 or 95% homology 30 with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference linking domain, e.g., a linking domain described herein, e.g., from
The Proximal Domain
In an embodiment, the proximal domain is 6+/−2, 7+/−2, 8+/−2, 9+/−2, 10+/−2, 11+/−2, 12+/−2, 13+/−2, 14+/−2, 14+/−2, 16+/−2, 17+/−2, 18+/−2, 19+/−2, or 20+/−2 nucleotides in length.
In an embodiment, the proximal domain is 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length.
In an embodiment, the proximal domain is 5 to 20, 7, to 18, 9 to 16, or 10 to 14 nucleotides in length.
In an embodiment, the proximal domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the proximal domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the proximal domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the proximal domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII.
In an embodiment, the proximal domain can comprise as many as 1, 2, 3, 4, 5, 6, 7 or 8 modifications. In an embodiment, the proximal domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 5′ end, e.g., in a modular gRNA molecule. In an embodiment, the target domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 3′ end, e.g., in a modular gRNA molecule.
In an embodiment, the proximal domain comprises modifications at two consecutive nucleotides, e.g., two consecutive nucleotides that are within 5 nucleotides of the 5′ end of the proximal domain, within 5 nucleotides of the 3′ end of the proximal domain, or more than 5 nucleotides away from one or both ends of the proximal domain. In an embodiment, no two consecutive nucleotides are modified within 5 nucleotides of the 5′ end of the proximal domain, within 5 nucleotides of the 3′ end of the proximal domain, or within a region that is more than 5 nucleotides away from one or both ends of the proximal domain. In an embodiment, no nucleotide is modified within 5 nucleotides of the 5′ end of the proximal domain, within 5 nucleotides of the 3′ end of the proximal domain, or within a region that is more than 5 nucleotides away from one or both ends of the proximal domain.
Modifications in the proximal domain can be selected so as to not interfere with gRNA molecule efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate proximal domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described at Section IV. The candidate proximal domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the proximal domain has at least 60, 70, 80, 85 90 or 95% homology with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference proximal domain, e.g., a naturally occurring, e.g., an S. pyogenes, S. aureus or S. thermophilus, proximal domain, or a proximal domain described herein, e.g., from
In an embodiment, the tail domain is 10+/−5, 20+/−5, 30+/−5, 40+/−5, 50+/−5, 60+/−5, 70+/−5, 80+/−5, 90+/−5, or 100+/−5 nucleotides, in length.
In an embodiment, the tail domain is 20+/−5 nucleotides in length.
In an embodiment, the tail domain is 20+/−10, 30+/−10, 40+/−10, 50+/−10, 60+/−10, 70+/−10, 80+/−10, 90+/−10, or 100+/−10 nucleotides, in length.
In an embodiment, the tail domain is 25+/−10 nucleotides in length.
In an embodiment, the tail domain is 10 to 100, 10 to 90, 10 to 80, 10 to 70, 10 to 60, 10 to 50, 10 to 40, 10 to 30, 10 to 20 or 10 to 15 nucleotides in length.
In another embodiment, the tail domain is 20 to 100, 20 to 90, 20 to 80, 20 to 70, 20 to 60, 20 to 50, 20 to 40, 20 to 30, or 20 to 25 nucleotides in length.
In an embodiment, the tail domain is 1 to 20, 1 to 15, 1 to 10, or 1 to 5 nucleotides in length.
In an embodiment, the tail domain nucleotides do not comprise modifications, e.g., modifications of the type provided in Section VIII. However, in an embodiment, the tail domain comprises one or more modifications, e.g., modifications that it render it less susceptible to degradation or more bio-compatible, e.g., less immunogenic. By way of example, the backbone of the tail domain can be modified with a phosphorothioate, or other modification(s) from Section VIII. In an embodiment a nucleotide of the tail domain can comprise a 2′ modification (e.g., a modification at the 2′ position on ribose), e.g., a 2-acetylation, e.g., a 2′ methylation, or other modification(s) from Section VIII.
In an embodiment, the tail domain can have as many as 1, 2, 3, 4, 5, 6, 7 or 8 modifications. In an embodiment, the target domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 5′ end. In an embodiment, the target domain comprises as many as 1, 2, 3, or 4 modifications within 5 nucleotides of its 3′ end.
In an embodiment, the tail domain comprises a tail duplex domain, which can form a tail duplexed region. In an embodiment, the tail duplexed region can be 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 base pairs in length. In an embodiment, a further single stranded domain, exists 3′ to the tail duplexed domain. In an embodiment, this domain is 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides in length. In an embodiment it is 4 to 6 nucleotides in length.
In an embodiment, the tail domain has at least 60, 70, 80, or 90% homology with, or differs by no more than 1, 2, 3, 4, 5, or 6 nucleotides from, a reference tail domain, e.g., a naturally occurring, e.g., an S. pyogenes, S. aureus or S. thermophilus, tail domain, or a tail domain described herein, e.g., from
In an embodiment, the proximal and tail domain, taken together comprise the following sequences:
In an embodiment, the tail domain comprises the 3′ sequence UUUUUU, e.g., if a U6 promoter is used for transcription.
In an embodiment, the tail domain comprises the 3′ sequence UUUU, e.g., if an H1 promoter is used for transcription.
In an embodiment, tail domain comprises variable numbers of 3′ Us depending, e.g., on the termination signal of the pol-III promoter used.
In an embodiment, the tail domain comprises variable 3′ sequence derived from the DNA template if a T7 promoter is used.
In an embodiment, the tail domain comprises variable 3′ sequence derived from the DNA template, e.g., if in vitro transcription is used to generate the RNA molecule.
In an embodiment, the tail domain comprises variable 3′ sequence derived from the DNA template, e., if a pol-II promoter is used to drive transcription.
Modifications in the tail domain can be selected to not interfere with targeting efficacy, which can be evaluated by testing a candidate modification in the system described in Section IV. gRNAs having a candidate tail domain having a selected length, sequence, degree of complementarity, or degree of modification, can be evaluated in the system described in Section IV. The candidate tail domain can be placed, either alone, or with one or more other candidate changes in a gRNA molecule/Cas9 molecule system known to be functional with a selected target and evaluated.
In an embodiment, the tail domain comprises modifications at two consecutive nucleotides, e.g., two consecutive nucleotides that are within 5 nucleotides of the 5′ end of the tail domain, within 5 nucleotides of the 3′ end of the tail domain, or more than 5 nucleotides away from one or both ends of the tail domain. In an embodiment, no two consecutive nucleotides are modified within 5 nucleotides of the 5′ end of the tail domain, within 5 nucleotides of the 3′ end of the tail domain, or within a region that is more than 5 nucleotides away from one or both ends of the tail domain. In an embodiment, no nucleotide is modified within 5 nucleotides of the 5′ end of the tail domain, within 5 nucleotides of the 3′ end of the tail domain, or within a region that is more than 5 nucleotides away from one or both ends of the tail domain.
In an embodiment a gRNA has the following structure:
5′ [targeting domain]-[first complementarity domain]-[linking domain]-[second complementarity domain]-[proximal domain]-[tail domain]-3′
wherein, the targeting domain comprises a core domain and optionally a secondary domain, and is 10 to 50 nucleotides in length;
the first complementarity domain is 5 to 25 nucleotides in length and, In an embodiment has at least 50, 60, 70, 80, 85, 90 or 95% homology with a reference first complementarity domain disclosed herein;
the linking domain is 1 to 5 nucleotides in length;
the second complementarity domain is 5 to 27 nucleotides in length and, in an embodiment has at least 50, 60, 70, 80, 85, 90 or 95% homology with a reference second complementarity domain disclosed herein;
the proximal domain is 5 to 20 nucleotides in length and, in an embodiment has at least 50, 60, 70, 80, 85, 90 or 95% homology with a reference proximal domain disclosed herein; and
the tail domain is absent or a nucleotide sequence is 1 to 50 nucleotides in length and, in an embodiment has at least 50, 60, 70, 80, 85, 90 or 95% homology with a reference tail domain disclosed herein.
Exemplary Chimeric gRNAs
In an embodiment, a unimolecular, or chimeric, gRNA comprises, preferably from 5′ to 3′:
-
- a targeting domain (which is complementary to a target nucleic acid);
- a first complementarity domain, e.g., comprising 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nucleotides;
- a linking domain;
- a second complementarity domain (which is complementary to the first complementarity domain);
- a proximal domain; and a tail domain,
- wherein,
- (a) the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides;
- (b) there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain; or
- (c) there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the sequence from (a), (b), or (c), has at least 60, 75, 80, 85, 90, 95, or 99% homology with the corresponding sequence of a naturally occurring gRNA, or with a gRNA described herein.
In an embodiment, the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides (e.g., 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the unimolecular, or chimeric, gRNA molecule (comprising a targeting domain, a first complementary domain, a linking domain, a second complementary domain, a proximal domain and, optionally, a tail domain) comprises the following sequence in which the targeting domain is depicted as 20 Ns but could be any sequence and range in length from 16 to 26 nucleotides and in which the gRNA sequence is followed by 6 Us, which serve as a termination signal for the U6 promoter, but which could be either absent or fewer in number: NNNNNNNNNNNNNNNNNNNNGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGG CUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUU (SEQ ID NO: 45). In an embodiment, the unimolecular, or chimeric, gRNA molecule is a S. pyogenes gRNA molecule.
In some embodiments, the unimolecular, or chimeric, gRNA molecule (comprising a targeting domain, a first complementary domain, a linking domain, a second complementary domain, a proximal domain and, optionally, a tail domain) comprises the following sequence in which the targeting domain is depicted as 20 Ns but could be any sequence and range in length from 16 to 26 nucleotides and in which the gRNA sequence is followed by 6 Us, which serve as a termination signal for the U6 promoter, but which could be either absent or fewer in number: NNNNNNNNNNNNNNNNNNNNGUUUUAGUACUCUGGAAACAGAAUCUACUAAAAC AAGGCAAAAUGCCGUGUUUAUCUCGUCAACUUGUUGGCGAGAUUUUUU (SEQ ID NO: 40). In an embodiment, the unimolecular, or chimeric, gRNA molecule is a S. aureus gRNA molecule.
The sequences and structures of exemplary chimeric gRNAs are also shown in
Exemplary Modular gRNAs
In an embodiment, a modular gRNA comprises:
-
- a first strand comprising, preferably from 5′ to 3′;
- a targeting domain, e.g., comprising 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nucleotides;
- a first complementarity domain; and
- a second strand, comprising, preferably from 5′ to 3′:
- optionally a 5′ extension domain;
- a second complementarity domain;
- a proximal domain; and
- a tail domain,
- wherein:
- (a) the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides;
- (b) there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain; or
- (c) there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
- a first strand comprising, preferably from 5′ to 3′;
In an embodiment, the sequence from (a), (b), or (c), has at least 60, 75, 80, 85, 90, 95, or 99% homology with the corresponding sequence of a naturally occurring gRNA, or with a gRNA described herein.
In an embodiment, the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides (e.g., 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or 26 nucleotides in length. In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length.
In an embodiment, the targeting domain has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length.
In an embodiment, the targeting domain has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 5 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 16 nucleotides (e.g., 16 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 16 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain has, or consists of, 17 nucleotides (e.g., 17 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 17 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain has, or consists of, 18 nucleotides (e.g., 18 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 18 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 19 nucleotides (e.g., 19 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 19 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 20 nucleotides (e.g., 20 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 20 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 21 nucleotides (e.g., 21 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 21 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 22 nucleotides (e.g., 22 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 22 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 23 nucleotides (e.g., 23 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 23 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 24 nucleotides (e.g., 24 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 24 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 25 nucleotides (e.g., 25 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 25 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and the proximal and tail domain, when taken together, comprise at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and there are at least 15, 18, 20, 25, 30, 31, 35, 40, 45, 49, 50, or 53 nucleotides 3′ to the last nucleotide of the second complementarity domain.
In an embodiment, the targeting domain comprises, has, or consists of, 26 nucleotides (e.g., 26 consecutive nucleotides) having complementarity with the target domain, e.g., the targeting domain is 26 nucleotides in length; and there are at least 16, 19, 21, 26, 31, 32, 36, 41, 46, 50, 51, or 54 nucleotides 3′ to the last nucleotide of the second complementarity domain that is complementary to its corresponding nucleotide of the first complementarity domain.
II. Methods for Designing gRNAs
Methods for designing gRNAs are described herein, including methods for selecting, designing and validating target domains. Exemplary targeting domains are also provided herein. Targeting Domains discussed herein can be incorporated into the gRNAs described herein.
Methods for selection and validation of target sequences as well as off-target analyses are described, e.g., in Mali et al., 2013 Science 339(6121): 823-826; Hsu et al. Nat Biotechnol, 31(9): 827-32; Fu et al., 2014 Nat Biotechnol, doi: 10.1038/nbt.2808. PubMed PMID: 24463574; Heigwer et al., 2014 Nat Methods 11(2):122-3. doi: 10.1038/nmeth.2812. PubMed PMID: 24481216; Bae et al., 2014 Bioinformatics PubMed PMID: 24463181; Xiao A et al., 2014 Bioinformatics PubMed PMID: 24389662.
For example, a software tool can be used to optimize the choice of gRNA within a user's target sequence, e.g., to minimize total off-target activity across the genome. Off target activity may be other than cleavage. For each possible gRNA choice using S. pyogenes Cas9, software tools can identify all potential off-target sequences (preceding either NAG or NGG PAMs) across the genome that contain up to a certain number (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) of mismatched base-pairs. The cleavage efficiency at each off-target sequence can be predicted, e.g., using an experimentally-derived weighting scheme. Each possible gRNA can then ranked according to its total predicted off-target cleavage; the top-ranked gRNAs represent those that are likely to have the greatest on-target and the least off-target cleavage. Other functions, e.g., automated reagent design for gRNA vector construction, primer design for the on-target Surveyor assay, and primer design for high-throughput detection and quantification of off-target cleavage via next-generation sequencing, can also be included in the tool. Candidate gRNA molecules can be evaluated by art-known methods or as described in Section IV herein.
Guide RNAs (gRNAs) for use with S. pyogenes, S. aureus and N. meningitidis Cas9s were identified using a DNA sequence searching algorithm. Guide RNA design was carried out using a custom guide RNA design software based on the public tool cas-offinder (reference:Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases., Bioinformatics. 2014 Feb. 17. Bae S, Park J, Kim J S. PMID:24463181). Said custom guide RNA design software scores guides after calculating their genomewide off-target propensity. Typically matches ranging from perfect matches to 7 mismatches are considered for guides ranging in length from 17 to 24. Once the off-target sites are computationally determined, an aggregate score is calculated for each guide and summarized in a tabular output using a web-interface. In addition to identifying potential gRNA sites adjacent to PAM sequences, the software also identifies all PAM adjacent sequences that differ by 1, 2, 3 or more nucleotides from the selected gRNA sites. Genomic DNA sequence for each gene was obtained from the UCSC Genome browser and sequences were screened for repeat elements using the publically available RepeatMasker program. RepeatMasker searches input DNA sequences for repeated elements and regions of low complexity. The output is a detailed annotation of the repeats present in a given query sequence.
Following identification, gRNAs were ranked into tiers based on their distance to the target site, their orthogonality or presence of a 5′ G (based on identification of close matches in the human genome containing a relavant PAM (e.g., in the case of S. pyogenes, a NGG PAM, in the case of S. aureus, a NNGRRT or NNGRRV PAM, and in the case of N. meningitidis, a NNNNGATT or NNNNGCTT PAM). Orthogonality refers to the number of sequences in the human genome that contain a minimum number of mismatches to the target sequence. A “high level of orthogonality” or “good orthogonality” may, for example, refer to 20-mer gRNAs that have no identical sequences in the human genome besides the intended target, nor any sequences that contain one or two mismatches in the target sequence. Targeting domains with good orthogonality are selected to minimize off-target DNA cleavage.
As an example, for S. pyogenes and N. meningitidis targets, 17-mer, or 20-mer gRNAs were designed. As another example, for S. aureus targets, 18-mer, 19-mer, 20-mer, 21-mer, 22-mer, 23-mer and 24-mer gRNAs were designed. Targeting domains, disclosed herein, may comprise the 17-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 18 or more nucleotides may comprise the 17-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 18-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 19 or more nucleotides may comprise the 18-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 19-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 20 or more nucleotides may comprise the 19-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 20-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 21 or more nucleotides may comprise the 20-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 21-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 22 or more nucleotides may comprise the 21-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 22-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 23 or more nucleotides may comprise the 22-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 23-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 24 or more nucleotides may comprise the 23-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31. Targeting domains, disclosed herein, may comprises the 24-mer described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31, e.g., the targeting domains of 25 or more nucleotides may comprise the 24-mer gRNAs described in Tables 1A-1D, 2A-2F, 3A-3C, 4A-4E, 5A-5E, 6A-6B, 7A-7D, 8A-8D, 9, 15 10A-10D, 11A-11D, 12, 13A-13D, 14A-14C, 15A-15D, 16A-16E, 17A-17B, 18A-18C, 19A-19E, 20A-20C, 21A-21E, 22A-22E, 23A-23C, 24A-24D, 25A-25B, 26, or 31.
gRNAs were identified for both single-gRNA nuclease cleavage and for a dual-gRNA paired “nickase” strategy. Criteria for selecting gRNAs and the determination for which gRNAs can be used for the dual-gRNA paired “nickase” strategy is based on two considerations:
-
- 1. gRNA pairs should be oriented on the DNA such that PAMs are facing out and cutting with the D10A Cas9 nickase will result in 5′ overhangs.
- 2. An assumption that cleaving with dual nickase pairs will result in deletion of the entire intervening sequence at a reasonable frequency. However, cleaving with dual nickase pairs can also result in indel mutations at the site of only one of the gRNAs. Candidate pair members can be tested for how efficiently they remove the entire sequence versus causing indel mutations at the site of one gRNA.
The targeting domains discussed herein can be incorporated into the gRNAs described herein.
Strategies to Identify gRNAs for S. pyogenes, S. aureus, and N. meningitidis to Correct a Mutation in the HBB Gene
gRNAs were designed for use with S. pyogenes, and S. aureus Cas9 enzymes to target the E6V mutation in the HBB gene. As an example, three strategies were utilized to identify gRNAs for use with S. pyogenes, S. aureus and N. meningitidis Cas9 enzymes.
In one strategy, the gRNAs were identified and ranked into 3 tiers for S. pyogenes (Tables 1A-1C). The targeting domains for tier 1 gRNA molecules for use with the S. pyogenes Cas9 to target the E6V mutation in the HBB gene were selected based on (1) a reasonable distance to the target position, and (2) a high level of orthogonality. Tier 2 gRNAs were selected based on (1), a reasonable distance to the target position, and (2) presence of a 5′G. Tier 3 used the same distance restriction, but removed the requirement of good orthogonality and the 5′G. Note that tiers are non-inclusive (each gRNA is listed only once). gRNAs for use with the S. aureus (Table 1D), Cas9s were identified manually by scanning genomic DNA sequence for the presence of PAM sequences. These gRNAs were not separated into tiers, but were listed in a single list.
In a second strategy, the gRNAs were identified and ranked into 4 tiers for S. pyogenes (Tables 13A-13D) and 5 tiers for S. aureus (Tables 14A-14C). The targeting domain for tier 1 gRNA molecules to use with S. pyogenes Cas9 were selected based on (1) a short distance to the target position, e.g., within 100 bp upstream and 100 bp downstream of the mutation, (2) a high level of orthogonality, and (3) the presence of a 5′ G. For selection of tier 2 gRNAs, a short distance and high orthogonality were required but the presence of a 5′G was not required. Tier 3 uses the same distance restriction and the requirement for a 5′G, but removes the requirement of good orthogonality. Tier 4 uses the same distance restriction but removes the requirement of good orthogonality and the 5′G. The targeting domain for tier 1 gRNA molecules to use with S. aureus Cas9 were selected based on (1) a short distance to the target position, e.g., within 100 bp upstream and 100 bp downstream of the mutation, (2) a high level of orthogonality, and (3) the presence of a 5′ G. For selection of tier 2 gRNAs, a short distance and high orthogonality were required but the presence of a 5′G was not required. Tier 3 uses the same distance restriction and the requirement for a 5′G, but removes the requirement of good orthogonality. Tier 4 uses the same distance restriction but removes the requirement of good orthogonality and the 5′G. Tier 5 is selected based on (1) a short distance to the target position, e.g., within 100 bp upstream and 100 bp downstream of the mutation and (2) PAM is NNGRRV. Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier. In some instances, there are no corresponding exemplary gRNAs in certain tiers.
In a third strategy, the gRNAs were identified and ranked into 3 tiers for S. pyogenes (Tables 24A-24D), 4 tiers for S. aureus (Tables 25A-25B) and 3 tiers for N. meningitidis (Tables 26). The targeting domain for tier 1 gRNA molecules to use with S. pyogenes Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) a high level of orthogonality. The targeting domain for tier 2 gRNA molecules to use with S. pyogenes Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) the presence of a 5′G. The targeting domain for tier 3 gRNA molecules to use with S. pyogenes Cas9 were selected based on distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation. The targeting domain for tier 1 gRNA molecules to use with S. aureus Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation, (2) a high level of orthogonality and (3) PAM is NNGRRT. The targeting domain for tier 2 gRNA molecules to use with S. aureus Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation, (2) the presence of a 5′G, and (3) PAM is NNGRRT. The targeting domain for tier 3 gRNA molecules to use with S. aureus Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) PAM is NNGRRT. The targeting domain for tier 4 gRNA molecules to use with S. aureus Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) PAM is NNGRRV. The targeting domain for tier 1 gRNA molecules to use with N. meningitidis Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) a high level of orthogonality. The targeting domain for tier 2 gRNA molecules to use with N. meningitidis Cas9 were selected based on (1) distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation and (2) the presence of a 5′G. The targeting domain for tier 3 gRNA molecules to use with N. meningitidis Cas9 were selected based on distance to the target position, e.g., within 200 bp upstream and 200 bp downstream of the mutation.
In an embodiment, dual targeting (e.g., dual nicking) is used to create two nicks on opposite DNA strands by using S. pyogenes, S. aureus and N. meningitidis Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp. Exemplary nickase pairs including selecting a targeting domain from Group A and a second targeting domain from Group B in Table 24D (for S. pyogenes). It is contemplated herein that in an embodiment a targeting domain of Group A can be combined with any of the targeting domains of Group B in Table 24D (for S. pyogenes). For example, HBB-9, HBB-20can be combined with HBB-11, HBB-39.
Strategies to Identify gRNAs for S. pyogenes, S. aureus, and N. meningitidis to Knock Out the BCL11A Gene
gRNAs were designed for use with S. pyogenes, S. aureus and N. meningitidis Cas9 enzymes to induce an insertion or deletion of one or more nucleotides mediated by NHEJ in close proximity to or within the early coding region. As an example, three strategies were utilized to identify gRNAs for use with S. pyogenes, S. aureus and N. meningitidis Cas9 enzymes.
In one strategy, the gRNAs were identified and ranked into 4 tires for S. pyogenes (Tables 2A-2D). The targeting domains for tier 1 gRNA molecules for use with the S. pyogenes Cas9 to knockout the BCL11A gene were selected based on (1) a reasonable distance to the target position, and (2) a high level of orthogonality. Tier 2 gRNAs were selected based on (1), a reasonable distance to the target position, and (2) presence of a 5′G. Tier 3 used the same distance restriction, but removed the requirement of good orthogonality and the 5′G. Tier 4 only required the presence in the coding sequence. Note that tiers are non-inclusive (each gRNA is listed only once). gRNAs for use with the S. aureus (Table 2E), and N. meningitidis (Table 2F) Cas9s were identified manually by scanning genomic DNA sequence for the presence of PAM sequences. These gRNAs were not separated into tiers, but were listed in a single list. Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
In a second strategy, the gRNAs were identified and ranked into 5 tiers for S. pyogenes (Tables 4A-4E), and S. aureus (Tables 5A-5E); and 2 tiers for N. meningitidis (Tables 6A-6B). For S. pyogenes, and S. aureus, the targeting domain for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon), (2) a high level of orthogonality and (3) the presence of 5′G. The targeting domain for tier 2 gRNA molecules were selected based on (1) distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon) and (2) a high level of orthogonality. The targeting domain for tier 3 gRNA molecules were selected based on (1) distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon) and (2) the presence of 5′G. The targeting domain for tier 4 gRNA molecules were selected based on distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon). The targeting domain for tier 5 gRNA molecules were selected based on distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon). For N. meningitidis, the targeting domain for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon). The targeting domain for tier 2 gRNA molecules were selected based on distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon). Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
In a third strategy, the gRNAs were identified and ranked into 3 tiers for S. pyogenes (Tables 15A-15D), and N. meningitidis (Tables 17A-17B); and 5 tiers for S. aureus (Tables 16A-16D). The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon) and (2) a high level of orthogonality. The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon). The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 3 gRNA molecules were selected based on distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon). The gRNAs were identified and ranked into 5 tiers for S. aureus, when the relevant PAM was NNGRRT or NNGRRV. The targeting domain to be used with S. aureus Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon), (2) a high level of orthogonality, and (3) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon), and (2) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 3 gRNA molecules were selected based on (1) distance to a the target site (e.g., start codon) mutation, e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon), and (2) PAM is NNGRRV. The targeting domain to be used with S. aureus Cas9 enzymes for tier 4 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon), and (2) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 5 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon), and (2) PAM is NNGRRV. The gRNAs were identified and ranked into 3 tiers for N. meningitidis. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to the target site, e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon) and (2) a high level of orthogonality. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., start codon), e.g., within 500 bp (e.g., downstream) of the target site (e.g., start codon). The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 3 gRNA molecules were selected based on distance to the target site (e.g., start codon), e.g., within reminder of the coding sequence, e.g., downstream of the first 500 bp of coding sequence (e.g., anywhere from +500 (relative to the start codon) to the stop codon). Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
In an embodiment, when a single gRNA molecule is used to target a Cas9 nickase to create a single strand break in close proximity to the BCL11A target position, e.g., the gRNA is used to target either upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position), or downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position) in the BCL11A gene.
In an embodiment, when a single gRNA molecule is used to target a Cas9 nuclease to create a double strand break to in close proximity to the BCL11A target position, e.g., the gRNA is used to target either upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position), or downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position) in the BCL11A gene.
In an embodiment, dual targeting is used to create two double strand breaks to in close proximity to the mutation, e.g., the gRNA is used to target either upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position), or downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position) in the BCL11A gene. In an embodiment, the first and second gRNAs are used to target two Cas9 nucleases to flank, e.g., the first of gRNA is used to target upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position), and the second gRNA is used to target downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position) in the BCL11A gene.
In an embodiment, dual targeting is used to create a double strand break and a pair of single strand breaks to delete a genomic sequence including the BCL11A target position. In an embodiment, the first, second and third gRNAs are used to target one Cas9 nuclease and two Cas9 nickases to flank, e.g., the first gRNA that will be used with the Cas9 nuclease is used to target upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position) or downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position), and the second and third gRNAs that will be used with the Cas9 nickase pair are used to target the opposite side of the mutation (e.g., within 200 bp upstream or downstream of the BCL11A target position) in the BCL11A gene.
In an embodiment, when four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four single strand breaks to delete genomic sequence including the mutation, the first pair and second pair of gRNAs are used to target four Cas9 nickases to flank, e.g., the first pair of gRNAs are used to target upstream of (e.g., within 500 bp, e.g., within 200 bp upstream of the BCL11A target position), and the second pair of gRNAs are used to target downstream of (e.g., within 500 bp, e.g., within 200 bp downstream of the BCL11A target position) in the BCL11A gene.
In an embodiment, dual targeting (e.g., dual nicking) is used to create two nicks on opposite DNA strands by using S. pyogenes, S. aureus and N. meningitidis Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp. Exemplary nickase pairs including selecting a targeting domain from Group A and a second targeting domain from Group B, or including selecting a targeting domain from Group C and a second targeting domain from Group D in Table 15D (for S. pyogenes). It is contemplated herein that in an embodiment a targeting domain of Group A can be combined with any of the targeting domains of Group B; in an embodiment a targeting domain of Group C can be combined with any of the targeting domains of Group D in Table 15D (for S. pyogenes). For example, BCL11A-5355 or BCL11A-5380 can be combined with BCL11A-5321 or BCL11A-5416; or BCL11A-5333, BCL11A-5354, or BCL11A-5329 can be combined with BCL11A-5367 or BCL11A-5341.
Strategies to Identify gRNAs for S. pyogenes, S. aureus, and N. meningitidis to Knock Down the BCL11A Gene
gRNAs were designed for use with S. pyogenes, S. aureus and N. meningitidis one or more Cas9 molecules, e.g., enzymatically inactive Cas9 (eiCas9) molecules or Cas9 fusion proteins (e.g., an eiCas9 fused to a transcription repressor domain or chromatin modifying protein to alter (e.g., to block, reduce, or decrease) the transcription of the BCL11A gene. As an example, three strategies were utilized to identify gRNAs for use with S. pyogenes, S. aureus and N. meningitidis one or more Cas9 molecules.
In one strategy, the targeting domains for gRNA molecules to knockdown the BCL11A gene were designed to target the 1 kb of sequence 3′ of the start codon. They were listed in a single list for S. pyogenes (Table 3A), S. aureus (Table 3B) and N. meningitidis (Table 3C).
In a second strategy, the gRNAs were identified and ranked into 4 tiers for S. pyogenes (Tables 10A-10D), and S. aureus (Tables 11A-11D). The gRNAs were identified and listed in a single list for N. meningitidis (Table 12). For S. pyogenes, and S. aureus, the targeting domain for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., a transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site), (2) a high level of orthogonality and (3) the presence of 5′G. The targeting domain for tier 2 gRNA molecules were selected based on (1) distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site) and (2) a high level of orthogonality. The targeting domain for tier 3 gRNA molecules were selected based on (1) distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site) and (2) the presence of 5′G. The targeting domain for tier 4 gRNA molecules were selected based on distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site).
In a third strategy, gRNAs were designed for use with S. pyogenes, S. aureus and N. meningitidis Cas9 molecules. The gRNAs were identified and ranked into 3 tiers for S. pyogenes (Tables 18A-18C). The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site) and (2) a high level of orthogonality. The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site). The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 3 gRNA molecules were selected based on distance to the target site (e.g., the transcription start site), e.g., within the additional 500 bp upstream and downstream of the transcription start site (i.e., extending to 1 kb upstream and downstream of the transcription start site. The gRNAs were identified and ranked into 5 tiers for S. aureus, when the relevant PAM was NNGRRT or NNGRRV (Tables 19A-19B). The targeting domain to be used with S. aureus Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site), (2) a high level of orthogonality, and (3) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site), and (2) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 3 gRNA molecules were selected based on (1) distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site), and (2) PAM is NNGRRV. The targeting domain to be used with S. aureus Cas9 enzymes for tier 4 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within the additional 500 bp upstream and downstream of the transcription start site (i.e., extending to 1 kb upstream and downstream of the transcription start site, and (2) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 5 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within the additional 500 bp upstream and downstream of the transcription start site (i.e., extending to 1 kb upstream and downstream of the transcription start site, and (2) PAM is NNGRRV. The gRNAs were identified and ranked into 3 tiers for N. meningitidis (Tables 20A-20C). The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) distance to a target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site) and (2) a high level of orthogonality. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) distance to the target site (e.g., the transcription start site), e.g., within 500 bp (e.g., upstream or downstream) of the target site (e.g., the transcription start site). The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 3 gRNA molecules were selected based on distance to the target site (e.g., the transcription start site), e.g., within the additional 500 bp upstream and downstream of the transcription start site (i.e., extending to 1 kb upstream and downstream of the transcription start site. Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
Strategies to Identify gRNAs for S. pyogenes, S. aureus, and N. meningitidis to Remove (e.g., Delete) the Enhancer Region the BCL11A Gene
gRNAs were designed for use with S. pyogenes, S. aureus and N. meningitidis Cas9 enzymes to remove (e.g., delete) the enhancer region in the BCL11A gene. As an example, two strategies were utilized to identify gRNAs for use with S. pyogenes, S. aureus and N. meningitidis one or more Cas9 molecules.
In an strategy, the gRNAs were identified and ranked into 4 tiers for S. pyogenes (Tables 7A-7D) and for S. aureus (Tables 8A-8D). The gRNAs were identified and listed in a single list for N. meningitidis (Table 9). The targeting domains for tier 1 gRNA molecules for use with the S. pyogenes, S. aureus Cas9 were selected based on (1) a reasonable distance to the target position, e.g., within a region 5′ (51.5 to 51.7 kb downstream of transcription start site, TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) a high level of orthogonality and (3) presence of a 5′G. For selection of tier 2 gRNAs, reasonable distance and high orthogonality were required but the presence of a 5′G was not required. Tier 3 uses the same distance restriction and the requirement for a 5′G, but removes the requirement of good orthogonality. Tier 4 uses the same distance restriction but removes the requirement of good orthogonality and the 5′G. Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
In a second strategy, gRNAs were designed for use with S. pyogenes, S. aureus and N. meningitidis Cas9 molecules. The gRNAs were identified and ranked into 4 tiers for S. pyogenes (Tables 21A-21E). The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) a high level of orthogonality and (3) presence of 5′G. The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS) and (2) a high level of orthogonality. The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 3 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS) and (2) presence of 5′G. The targeting domain to be used with S. pyogenes Cas9 enzymes for tier 4 gRNA molecules were selected based on within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS). The gRNAs were identified and ranked into 5 tiers for S. aureus, when the relevant PAM was NNGRRT or NNGRRV (Tables 22A-22E). The targeting domain to be used with S. aureus Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) a high level of orthogonality, (3)) presence of 5′G and (4) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) a high level of orthogonality, and (3) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 3 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) presence of 5′G and (3) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 4 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), and (2) PAM is NNGRRT. The targeting domain to be used with S. aureus Cas9 enzymes for tier 5 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), and (2) PAM is NNGRRV. The gRNAs were identified and ranked into 3 tiers for N. meningitidis (Tables 23A-23C). The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 1 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS), (2) a high level of orthogonality and (3) presence of 5′G. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 2 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS) and (2) a high level of orthogonality. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 3 gRNA molecules were selected based on (1) within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS) and (2) presence of 5′G. The targeting domain to be used with N. meningitidis Cas9 enzymes for tier 4 gRNA molecules were selected based on within a region 5′ (51.5 to 51.7 kb downstream of TSS) or 3′ (65.1 to 65.3 kb downstream of TSS). Note that tiers are non-inclusive (each gRNA is listed only once for the strategy). In certain instances, no gRNA was identified based on the criteria of the particular tier.
In an embodiment, dual targeting (e.g., dual nicking) is used to create two nicks on opposite DNA strands by using S. pyogenes, S. aureus and N. meningitidis Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp. Exemplary nickase pairs including selecting a targeting domain from Group A and a second targeting domain from Group B, or including selecting a targeting domain from Group C and a second targeting domain from Group D in Table 20E (for S. pyogenes). It is contemplated herein that in an embodiment a targeting domain of Group A can be combined with any of the targeting domains of Group B; in an embodiment a targeting domain of Group C can be combined with any of the targeting domains of Group D in Table 20E (for S. pyogenes). For example, BCL11A-13271 or BCL11A-13264 can be combined with BCL11A-13276; or BCL11A-13262 or BCL11A-13282 can be combined with BCL11A-13290 or BCL11A-13280.
In an embodiment, two or more (e.g., three or four) gRNA molecules are used with one Cas9 molecule. In another embodiment, when two or more (e.g., three or four) gRNAs are used with two or more Cas9 molecules, at least one Cas9 molecule is from a different species than the other Cas9 molecule(s). For example, when two gRNA molecules are used with two Cas9 molecules, one Cas9 molecule can be from one species and the other Cas9 molecule can be from a different species. Both Cas9 species are used to generate a single or double-strand break, as desired.
Any of the targeting domains in the tables described herein can be used with a Cas9 nickase molecule to generate a single strand break.
Any of the targeting domains in the tables described herein can be used with a Cas9 nuclease molecule to generate a double strand break.
When two gRNAs designed for use to target two Cas9 molecules, one Cas9 can be one species, the second Cas9 can be from a different species. Both Cas9 species are used to generate a single or double-strand break, as desired.
It is contemplated herein that any upstream gRNA described herein may be paired with any downstream gRNA described herein. When an upstream gRNA designed for use with one species of Cas9 is paired with a downstream gRNA designed for use from a different species of Cas9, both Cas9 species are used to generate a single or double-strand break, as desired.
Exemplary Targeting Domains
Table 1A provides exemplary targeting domains for the E6V target site in the HBB gene selected according to the first tier parameters, and are selected based on the close proximity and orientation to mutation and orthogonality in the human genome. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a Cas9 molecule (e.g., a S. pyogenes Cas9 molecule) that gives double stranded cleavage. Any of the targeting domains in the table can be used with a Cas9 (e.g., a S. pyogenes Cas9 nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using Cas9 nickases (e.g., a S. pyogenes Cas9 nickase) with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position. In an embodiment, two 20-mer guide RNAs are used to target two Cas9 nucleases (e.g., two S. pyogenes Cas9 nucleases) or two Cas9 nickases (e.g., two S. pyogenes Cas9 nickases), e.g., HBB-8 and HBB-25 are used. In an embodiment, two 17-mer RNAs are used to target two Cas9 nucleases or two Cas9 nickases, e.g., HBB-35 and HBB-53 are used.
Table 1B provides exemplary targeting domains for the E6V target site in the HBB gene selected according to the second tier parameters and are selected based on the presence of a 5′ G and reasonable proximity to mutation. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with S. pyogenes single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 1C provides exemplary targeting domains for the E6V target site in the HBB gene selected according to the third tier parameters and are selected based on reasonable proximity to mutation. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with S. pyogenes single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 1D provides exemplary targeting domains for the E6V target site in the HBB gene selected based on close proximity to mutation. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with S. aureus single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 2A provides exemplary targeting domains for knocking out the BCL11A gene selected according to first tier parameters, and are selected based on close proximity to start of the coding sequence and orthogonality in the human genome. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position. In an embodiment, two 20-mer guide RNAs are used to target two S. pyogenes Cas9 nucleases or two S. pyogenes Cas9 nickases, e.g., BCL11A-31 and BCL11A-40, BCL11A-30 and BCL11A-42, or BCL11A-24 and BCL11A-53 are used. In an embodiment, two 17-mer RNAs are used to target two Cas9 nucleases or two Cas9 nickases, e.g., BCL11A-79 and BCL11A-90, BCL11A-77 and BCL11A-92, or BCL11A-71 and BCL11A-103 are used.
Table 2B provides exemplary targeting domains for knocking out the BCL11A gene selected according to the second tier parameters and are selected based on close proximity to start of the coding sequence and presence of a 5′ G. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 2C provides exemplary targeting domains for knocking out the BCL11A gene selected according to the third tier parameters and are selected based on close proximity to start of the coding sequence. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 2D) provides exemplary targeting domains for knocking out the BCL11A gene selected according to the fourth tier parameters and are selected based on presence in the coding sequence. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 2E provides exemplary targeting domains for knocking out the BCL11A gene. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with a S. aureus Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 2F provides exemplary targeting domains for knocking out the BCL11A gene. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with an N. meningitidis Cas9 molecule that gives double stranded cleavage. Any of the targeting domains in the table can be used with an N. meningitidis Cas9 single-stranded break nucleases (nickases). In an embodiment, dual targeting is used to create two nicks. When selecting gRNAs for use in a nickase pair, one gRNA targets a domain in the complementary strand and the second gRNA targets a domain in the non-complementary strand, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain targeting the same target position.
Table 3A provides exemplary targeting domains for repressing (i.e., knocking down or decreasing) expression of the BCL11A gene. In an embodiment, the targeting domain is the exact complement of the target domain. Any of the targeting domains in the table can be used with a S. pyogenes eiCas9 molecule to cause a steric block at the promoter region to block transcription resulting in the repression of the BCL11A gene. Alternatively, any of the targeting domains in the table can be used with a S. pyogenes eiCas9 fused to a transcriptional repressor to decrease transcription and therefore downregulate gene expression.
Table 3B provides exemplary targeting domains for repressing (i.e., knocking down or decreasing) expression of the BCL11A gene. Any of the targeting domains in the table can be used with a S. aureus eiCas9 molecule to cause a steric block in the promoter region to block transcription elongation resulting in the repression of the BCL6A gene. Any of the targeting domains in the table can be used with a S. aureus eiCas9 fused to a transcriptional repressor to decrease transcription and therefore downregulate gene expression.
Table 3C provides exemplary targeting domains for repressing (i.e., knocking down or decreasing) expression of the BCL11A gene. Any of the targeting domains in the table can be used with an N. meningitidis eiCas9 molecule to cause a steric block in the promoter region to block transcription elongation resulting in the repression of the BCL11A gene. Any of the targeting domains in the table can be used with an N. meningitidis eiCas9 fused to a transcriptional repressor to decrease transcription and therefore downregulate gene expression.
Table 4A provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to first tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon, good orthogonality, start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. pyogenes Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. pyogenes Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp. Exemplary gRNA pairs are: BCL11A-2607 and BCL11A-2593, BCL11A-2607 and BCL11A-2598, BCL11A-264 and BCL11A-2593, BCL11A-2614 and BCL11A-2598, BCL11A-2589 and BCL11A-2664, BCL11A-2589 and BCL11A-2666, BCL11A-2596 and BCL11A-2664, BCL11A-2596 and BCL11A-2666, BCL11A-2603 and BCL11A-2664, of BCL11A-2603 and BCL11A-2666.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene. For example, gRNA pairs that target upstream (i.e., 5′) of the enhancer region in the BCL11A gene (e.g., 2607 and BCL11A-2593, BCL11A-2607 and BCL11A-2598, BCL11A-264 and BCL11A-2593, or BCL11A-2614 and BCL11A-2598) can be paired with gRNA pairs that target downstream (i.e., 3′) of the enhancer region in the BCL11A gene (e.g., BCL11A-2589 and BCL11A-2664, BCL11A-2589 and BCL11A-2666, BCL11A-2596 and BCL11A-2664, BCL11A-2596 and BCL11A-2666, BCL11A-2603 and BCL11A-2664, of BCL11A-2603 and BCL11A-2666).
Table 4B provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to second tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon, good orthogonality, and do not start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. pyogenes Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. pyogenes Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 4C provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to third tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon and start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. pyogenes Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. pyogenes Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 4D provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to forth tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon and do not start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. pyogenes Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. pyogenes Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL0A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 4E provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to fifth tier parameters. The targeting domains outside the first 500 bp of coding sequence downstream of start codon. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. pyogenes Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. pyogenes Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. pyogenes Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL23A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL3A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 5A provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to first tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon, good orthogonality, start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. aureus Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. aureus Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 5B provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to second tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon, good orthogonality, and do not start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. aureus Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. aureus Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL43A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL3A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 5C provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to third tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon and start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. aureus Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. aureus Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 5D provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to forth tier parameters. The targeting domains bind within first 500 bp of coding sequence downstream of start codon and do not start with G. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. aureus Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. aureus Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL4A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Table 5E provides exemplary targeting domains for knocking out the BCL11A gene by targeting the early coding sequence the BCL11A gene selected according to fifth tier parameters. The targeting domains target outside the first 500 bp of coding sequence downstream of start codon. It is contemplated herein that the targeting domain hybridizes to the target domain through complementary base pairing. Any of the targeting domains in the table can be used with a S. aureus Cas9 molecule that generates a double strand break (Cas9 nuclease) or a single-strand break (Cas9 nickase). In an embodiment, dual targeting is used to create two double strand breaks to remove the enhancer region in the BCL11A gene, e.g., the first gRNA is used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second gRNA is used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.
Any of the targeting domains in the table can be used with a S. aureus Cas9 (nickase) molecule to generate a single strand break. In an embodiment, dual targeting is used to create two nicks on opposite DNA strands by using S. aureus Cas9 nickases with two targeting domains that are complementary to opposite DNA strands, e.g., a gRNA comprising any minus strand targeting domain may be paired any gRNA comprising a plus strand targeting domain provided that the two gRNAs are oriented on the DNA such that PAMs face outward and the distance between the 5′ ends of the gRNAs is 0-50 bp.
In an embodiment, four gRNAs (e.g., two pairs) are used to target four Cas9 nickases to create four nicks to remove the enhancer region in the BCL11A gene, e.g., the first pair of gRNAs are used to target upstream (i.e., 5′) of the enhancer region in the BCL11A gene and the second pair of gRNAs are used to target downstream (i.e., 3′) of the enhancer region in the BCL11A gene.