Introduction

CRISPR/Cas system has become a significant genome editing tool in the field of molecular biology. CRISPR, an acronym that is explained as a Clustered Regularly Interspaced Short Palindromic Repeats, is inherently equipped by a bacterial defense system to protect itself from an invasive pathogen. CRISPR system works in tandem with an associated protein called Cas proteins (plurality of types like Cas9, Cas12, and Cas13) that are used for targeting and manipulating the deoxyribonucleic acid (DNA) at the specific site in the nucleotide sequence. Genome editing using CRISPR–Cas requires the significant, additional component called guide ribonucleic acid (gRNA) which is assists the CRISPR complex in the target cleavage. Hence, scientists have been persistently bringing out new modifications on the guide RNA for effective genome editing activity. This type of programmable and reprogrammable strategy of gene editing is been explored in various fields such as research, medicine, diagnostics to regulate gene expression, controlling transcription, chromosomal imaging, drug delivery, and even treating the human ophthalmological and hematological diseases fronts in the future. Therefore, in this review, the aspects of different Cas proteins with their associated mechanisms have been briefed out for a better understanding of gene editing.

Historical View

Discovery of CRISPR 80s to Present

CRISPR–Cas is found in prokaryotes and plays significantly a specific role in the bacterial acquired immune system. Yoshizumi Ishino, a Japanese scientist, discovered the repeated sequences interspersed with spacer sequences in Escherichia coli (1987) while analyzing the gene that is responsible for alkaline phosphatase conversion [1]. Van Embden and his colleagues in the Netherlands threw light into different spacer sequences between DNA repeats in different strains of Mycobacterium tuberculosis (1993). Spacer sequences were used to characterize different strains of M. tuberculosis. This method is known as Spoligotyping, and many such sequences have been discovered in bacteria and archaea [2]. CRISPR sequences were identified in Haloferax (1995) [3] and in Cyanobacterium (1996). CRISPR sequences were discovered in the year 2000 in archaea and bacteria [4]. Jansen and his team termed those repeated nucleotide sequences with spacers found in the genomes of prokaryotes as CRISPR sequences (2002) [5].

CRISPR-associated proteins, shortly known as “Cas” enzymes (particularly Cas9, Cas12a, and Cas13), use CRISPR sequences as a guide to recognize and cleave target DNA’s complementary strands. CRISPR arrays allow bacteria or archaea to retrospect about viruses. When bacteria encounter the virus, they copy and integrate its DNA copy into their genome as “spacers” between CRISPR short DNA repeats. The nucleotide sequences in the spacers act as a template for the Bacterial RNA to recognize the DNA of an incoming virus and direct the Cas9 enzyme to act upon it; thereby preventing the virus from infecting the bacteria. Doudna and Banfield began researching CRISPRs in 2005. Years later, in 2012, yet another scientist, Emmanuel Charpentier, joined the former for a collaborative study on Cas9 proteins. In 2015, a new Cas protein, cpf1, was discovered to further enhance gene editing [6]. These scientists were subsequently awarded the Nobel Prize in Chemistry on October 7, 2020, in recognition of their contributions to CRISPR–Cas9. These studies have been further being scaled up later on in order to improve the accuracy and efficiency of CRISPR–Cas9 technique (Fig. 1).

Fig. 1
figure 1

Schematic representation of CRISPR discoveries from 1987 to 2020

Existing Genetic Tools

Before the CRISPR–Cas system, several types of endonuclease enzymes that could cut DNA were exposed, besides the already existing Meganucleases, ZFNs (Zinc Finger Nucleases), and TALENs (Transcription activator-like effector nucleases) (Table 1). These enzymes can identify and cut specific nucleotide sequences. The restriction enzymes were used to identify a specific site of DNA, cleave it, and incorporate new genes. Those sequences are found multiple times in a genome. As a result, determining the insertion site ahead of time is extremely difficult. So, scientists sought ways to improve the precision of restriction enzymes and transform them so that they could identify a distinct sequence in the genome. So, the example of enzymes having such one-of-a-kind sequences is Meganucleases. Zinc finger nuclease is another type of endonuclease. These synthetic restriction enzymes are made up of a subunit that can identify the preferred DNA sequence and its restriction sites. Scientists engineer the nuclease proteins by combining zinc finger proteins with the Fok I endonuclease DNA cleavage domain [7]. Transcription activator-like effector nucleases (TALENs) are less difficult to synthesis than of zinc finger nucleases. TALENs are made up of a restriction enzyme subunit that cleaves DNA and a sequence recognition component called a TAL effector. TAL effectors have an ability to cover the long and unique sequences in the genome due to the high number of iterations. Scientists can create TAL effectors for specific DNA sequences more easily than zinc finger endonucleases. It remains, however, a difficult task [8].

Table 1 TALENs, ZFNs, and CRISPR

General Mechanism of CRISPR

The CRISPR system requires a group of CRISPR-associated genes known as “Cas,” which are usually found next to the CRISPR sequence. Cas genes typically encode proteins that are required for an adaptive immune response [9]. CRISPR–Cas systems are engineered DNA- or RNA-binding proteins that can bind to specific target sequences by designing crRNAs or gRNAs with complementary spacer sequences. Cas proteins can also act as nucleases, making them useful as programmable genome editing tools [10]. There are three major stages in the CRISPR–Cas defense system: adaptation, expression, and interference. In the first stage, the CRISPR locus is adapted by the insertion of new spacer sequences. During expression, the Cas genes are expressed and the CRISPR sequence is transcribed into a long precursor crRNA. Pre-crRNA is converted into mature crRNA through the involvement of Cas proteins (Cas5d) and other accessory factors. The target viral DNA is identified and destroyed by the combined action of crRNA and Cas proteins during the final stage, Interference [11] (Fig. 2).

Fig. 2
figure 2

Mechanism of CRISPR-mediated immunity in bacteria. 1. Adaptation insertion of new spacers into the locus. 2. Expression processing of the CRISPR RNA. 3. Interference detection of the genetic elements by CRISPR RNA and Cas proteins

The CRISPR/Cas9 technique is widely used in science because of its high degree of flexibility and accuracy in identifying, targeting, and cleaving target nucleic acid. This technique is alluring as it allows for very low-cost genetic engineering. It enables the inclusion or elimination of multiple genes at the same time. This technique can quickly manipulate many different genes in an in vitro or in vivo cell lines, a plant that would otherwise take years to complete. Because it is not species-specific, it can be used on organisms that previously have been resistant to genetic engineering. Jennifer Doudna and Emmanuel Charpentier have been honored with the Nobel Prize in Chemistry on October 7, 2020, in recognition of their contributions to CRISPR–Cas9 (Table 2).

Table 2 Different types of Cas proteins and its functions

Cas9 Protein

Among all the CRISPR effector proteins the well-characterized protein is Cas9, which belongs to the type II CRISPR system which is comprised of subtypes like type II-A, type II-B, and II-C that are sorted on account of Cas9 gene size and the presence of the subtype-specific gene [12]. Cas9 effector protein stemmed from Streptococcus pyogenes, Streptococcus thermophiles, Staphylococcus aureus, Neisseria meningitidis, and Campylobacter jejuni. Cas9 causes double-stranded DNA breaks ensuing in the formation of blunt ends, which may be repaired by a donor template DNA during the non-homologous end joining or homologous recombination (Fig. 3).

Fig. 3
figure 3

Mechanism of action of CRISPR/Cas9 techniques. CRIPSPR/Cas9 gene with cleavage activities creating a double-strand break on the target DNA, which are repaired either by non-homologous end joining (NHEJ) or by homology-directed repair (HDR) with the help of Donor DNA is provided

Mechanisms and Modifications

Cas9 requires crRNA (CRISPR RNA) and tracrRNA (Transactivating crRNA), which complements crRNA’s repeated region [13]. Cas9 binds to a genetically engineered single-guide RNA (sgRNA) made from dual RNA (crRNA:tracrRNA) and recognizes target viral DNA via PAM (Protospacer Adjacent Motif) regions. Cas9 cleaves 3 bp upstream of the PAM sequence after sgRNA base pairs with the target DNA related to the PAM sequence [14]. Cas9 has a bilobed structure with a core cleft that could harbor a crRNA:DNA duplex. A disordered linker that connects the α-helical recognition (REC) and nuclease (NUC) lobes, as well as a highly conserved arginine-rich bridge that forms several interactions with sgRNA. The conserved HNH (Histidine Asparagine Histidine) and RuvC (Resolvase) nuclease domains, as well as a variable C-terminal domain, make up the NUC (Nuclease) lobe which interacts with PAM [15]. The structural conformations of the Cas9 proteins show that guide RNA binding controlling Cas9 activity results in a large conformational reorganization of the protein. As a result of the arrangement of the PAM-interacting residues and the nucleotide sequence of the guide RNA, the protein is capable of DNA binding and PAM recognition [16]. The scrutinized complex binds the guide RNA and scans the DNA; upon recognition of a similar PAM in the non-target strand, it induces the unwinding of the DNA duplex, allowing the guide RNA to probe 10 to 12 nucleotide sequences for complementarily in the target strand’s upstream PAM region [17].

The domain conformational activation of the HNH is subjected to the changes in the linker loops between the HNH and RuvC domains. This allosteric bonding between the nuclease domains results in the cleaving of the HNH domain in the target strand. The RuvC domain conversely cleaves the non-target strand [18]. Among all the Cas9 proteins, spCas9 variant (Streptococcus pyogenes) is the most commonly used genomic tool for gene manipulation, gene disruption, transcriptional activation and repression, and epigenetic modulation. For DNA target recognition, SpCas9 usually recognizes the viral DNA targets and generates a blunt double-strand break (DSB). SpCas9 recognizes a spacer sequence of 20 nucleotides and on the PAM 5-NGG. The PAM 5-NGG recognition limits SpCas9 target sites available in the human genome. Evolutionary approaches have developed variants by altering the PAM specificities for increasing the target site’s availability [19]. The aim of genetically engineered Cas9 variants is to improve target specificity. Several studies have defined how a Cas9 variant is mutated to reduce off-target cleavage activity, resulting in Cas9 and its corresponding gRNA expression from plasmids [20] and their transportation as ribonucleoprotein complexes [21].

Changes in the secondary structures within the short RNA hairpins on the spacer region of gRNAs increase the thermodynamic barrier to crRNA or gRNA strand invasion at off-target sites while generally maintaining on-target action, has increased on-target CRISPR–Cas specificity [22]. The discovery and development of additional Cas9 orthologs that recognize different PAM sequences have increased the number of target sites available. Streptococcus thermophilus Cas9 (stCas9), for example, recognizes the PAM 5′-NNNNGATT and Neisseria meningitidis Cas9 (NmCas9) recognizes 5′-NNNNGATT [23]. These Cas9 orthologs have been repurposed in bacteria and mammalian cells for DNA targeting. Furthermore, PAM sequence 5-NNGRRT (R represents A or G) is recognized by Staphylococcus aureus Cas9 (SaCas9) [23]. Notably, the efficiencies of SaCas9 gene editing are comparable to those of SpCas9, and the smaller size of SaCas9 (1053 amino acids compared to 1368 amino acids of SpCas9) has allowed its use in size-restricted delivery vectors like adeno-associated virus (AAV) [24]. A smaller Cas9 ortholog from Campylobacter jejuni (984 amino acids) was recently reported to recognize the PAM 5-NNNVRYM (V represents A, C, or G; Y represents C or T) and be used for in vivo-targeted genome editing. CasX (980 amino acids), the smallest Cas9 to date, was discovered as a result of additional efforts to identify Cas9 ortholog [25].

Mutations in Cas9 Proteins

Streptococcus pyogenes is the first organism found in the CRISPR/Cas9 system. The binding of gRNA with complementary mismatched target DNA within the bacterial genome leading to off-target activity is one of the major limitations of this system. As a result, shown in Table 3, the modifications of the Cas9 system are made to improve target specification and reduce unexpected cleaning activity [26]. An additional way to reduce off-target cleavage activity increases the proto spacer near the motif length. Nmecas9, a CRISPR–Cas9 system based on Neisseria meningitidis, detects a sequence of 8-nt PAM (5-NNNNGATT) that can reduce off-target cleavage activity [27]. Sacas9 from Staphylococcus aureus detects a 6-nt PAM (5-NNGRRT) sequence of two Cas9 variants from Streptococcus thermophilus (st1cas9 and st3cas9) used to classify two human loci, PRKDC and CARD11, showed lower off-target rates than previously developed SpCas9 [28].

Table 3 CRISPR–Cas9 mutations and the descriptions

Cas9 Ortholog and Their Function

CRISPR/Cas9 orthologs have also been identified in order to improve the targeted specificities. CRISPR/CpfI Prevotella and Francisella were taken from class II, type endonuclease. CpfI, unlike Cas9, requires a single-directed CRISPR RNA (crRNA) that breaks down and produces 4–5 nucleotides fusion and 5-overhangs [29]. Cas9 orthologs also have the ability to identify RNA as well. The cleavage of ssRNA in SaCas9 and Campylobacter jejuni Cas9 are independent of PAM [10]. SaCas9 suppressed gene expression in E. coli when targeted at RNA [30]. The CRISPR RNA (crRNA) dependent but independent binding of PAM and purification of endogenous RNAs wer observed in C. jejuni Cas9 [31]. Francisella novicida Cas9 (FnCas9) was found to identify bacterial mRNA [32]. It modifies the genetic expression and identifies the RNA genome of the hepatitis C virus in eukaryotic cells [33]. FnCas9 is not PAM-dependent and can be used to identify negative-sense ssRNA viruses from the family Filoviridae, Paramyxoviridae, and Orthomyxoviridae (Table 4). More research is needed to determine the potential effects of RNA targeted at Cas9 on eukaryotic cells [10].

Table 4 Orthologs and the organisms involved in CRISPR–Cas9

Applications of Cas9

The CRISPR/Cas technology has a very broad application range in the field of research, development, and diagnostics. Primarily, this technique has been used as a genetic tool for target gene (DNA/RNA) modification in the host. Besides, it plays various roles in the field of Genome library screening, Transcriptional regulation (activation/repression), epigenetic modification, live cell imaging, and also therapeutic applications. In human cells, the Gecko (genome scale CRISPR–Cas9 knock out) library that targets 18,080 genes of having 64,751 specific guide RNA sequences via lentiviral delivery allows both negative and positive selection screening. GeCKO library is mainly used for identifying cancer cell viability [34]. The identification of genes influencing the desired phenotype is often accomplished via genetic screening using RNA interference (RNAi) or knockouts. When the resulting ESC (Embryonic stem cells) mutant libraries were screened for resistance to either Clostridium septicum alpha-toxin or 6-thioguanine, 27 known and 4 previously unknown genes were found to be involved in these phenotypes [35]. sgRNAs recruit dCas9 (nuclease dead Cas9) and Cas9 effector fusion proteins to aim for specific loci for modulating the gene expression [36]. The recruitment of dCas9 to the DNA was directed by sgRNA has been shown to interfere with gene transcription (CRISPRi) by preventing transcriptional elongation, polymerase binding, or transcription factor binding, which influence gene expression levels [37]. When targeted the regions upstream of transcriptional start sites, the sgRNA–dCas9 complex, the effector domain, can dictate the cellular response, resulting in gene repression (CRISPRi) (e.g., KRAB, SID domains) or gene activation (CRISPRa) (e.g., VP16, VP64, p65AD domains) [37, 38].

Targeting the HS2 enhancer with dCas9-KRAB fusions was shown to silence globin gene expression with relatively little off-target effects on non-targeted gene expression [39]. Targeting histone epigenetic modifications has been demonstrated by epigenome editing systems. Gene activation with a nuclease-null dCas9 tethered to an acetyltransferase (HAT) enzyme that catalyzes the direct covalent modification of lysine residues on histone tails [40]. The other study used a nuclease-null dCas9 fused to the Krüpel-associated box (KRAB), a naturally occurring transcriptional repression domain involved in recruiting a heterochromatin-forming complex that mediates histone methylation and deacetylation [39]. dCas9 is fused with GFP (Green Fluorescent Protein) that leads to chromatin live cell imaging.

Genome Editing Technology in Animal Model

The genome editing technology based on CRISPR/Cas9 allows for structural and functional characterization of various diseases in animal models. By means of Cre-loxP Technology, CRISPR/Cas9-based modification of PETN mutations with or without p53 mutations results in knocking out of PETN and P53 genes in mouse liver [41]. The oncogene β-catenin mutation is activated and may be transported to hepatocytes. This could be accomplished by incorporating the Cas9 plasmids [41]. Knocking out of Cas9 in mice model and lung adenocarcinoma models could be generated by inserting a single AAV vector containing loss-of-function mutations in p53, Lkb1, and Kras (G12D) mutations in the lung [42]. Cas9 and sgRNAs are used to target the β-catenin gene. These mutations in oncogenes and tumor suppressor genes develop a new method of using CRISPR/Cas systems in Cancer models. The oncogene β-catenin mutation is activated and may be transported to hepatocytes. This could be accomplished by coinjecting Cas9 plasmids [41]. Knocking of Cas9 in mice model and lung adenocarcinoma models could be generated by inserting a single AAV vector containing loss-of-function mutations in p53, Lkb1, and Kras (G12D) mutations in the lung [42]. Cas9 and sgRNAs are used to target the β-catenin gene. These mutations in oncogenes and tumor suppressor genes develop a new method of using CRISPR/Cas systems in Cancel models. Co-injection of Cas9 mRNA/gRNAs with mutant oligos results in gene knocking mice with precise point mutations in two genes [43]. Cointegration of Cas9mRNA and sgRNA targets the different genes into mice zygotes for generating mutant mouse having biallelic mutations [44]. Microinjection of mRNA encoding Cas9 and specific designed sgRNA into single-celled stage Zebrafish embryo [45]. Many other model organisms, such as Drosophila melanogaster, Caenorhabditis elegans, Axolotl, Xenopus tropicalis, Rattus sp., and Sus barbatus, have used CRISPR–Cas9 technology for efficient genome engineering. The CRISPR/Cas9 system has been demonstrated to be an efficient and reliable method for targeted genome modification.

Therapeutic Applications

CRISPR/Cas9 plays the most exhilarating role in the therapeutic industry for curing many genetic and infectious diseases. Cas9, sgRNA, and a homology-directed repair template were injected into mouse zygotes to correct the dystrophin point mutation in the germline and prevent muscular dystrophy in mutant mice [46]. The researchers used homologous recombination to successfully adjust the cystic fibrosis transmembrane conductor receptor (CFTR) locus in the same diseased human patients cultured intestinal stem cells [47]. CRISPR/Cas9 technology has been used to assess diabetes pathogenesis precisely. To reduce lipid storage in adipocytes, researchers used the CRISPRi technique to silence the expression of Fabp4. An Obese mouse was selected for this study and its body weight was reduced by about 20% as a part of this treatment. It has also reduced type 2 diabetes indicators like insulin resistance, inflammation, high glucose levels, and hepatic steatosis [48]. In Anopheles gambiae, CRISPR/Cas9 enables the inactivation of the FREP1 gene. At the sporozoite and oocyst stages, FREP1 knockout mutants effectively suppressed infection with both rodent and human malaria parasites [49]. The CFTR mutation has been corrected in iPSCs by reprogramming somatic skin fibroblasts from CF patients into embryonic stem cells with the CFTR/Cas9 gRNA vector [50]. CRISPR/Cas9 also plays a major role in preventing HIV by eliminating viral genes from the host cell chromosomes. A sgRNA expression vector targeting the long terminal repeats (LTR) of HIV-1 efficiently cleaves and mutates LTR target sites and suppresses LTR-driven viral gene expression when transfected into HIV-1 provirus-integrated human cells [51]. These findings suggest that the CRISPR–Cas9 technology is the budding technology of the therapeutic industry. Apart from therapeutic applications, CRISPR shows that gene editing technology has the potential to impact a wide range of sectors. Crispr could also be used for pet breeding, allergy-free food production, the production of greener fuels, and pest control which are reviewed in the literature [52,53,54,55].

Cas12 Protein

The sequences extracted from Francisella novicida U112 cultures containing Cas12a-based CRISPR loci revealed that mature crRNAs for Cas12a are 42–44 nucleotides long; with the first 19–20 nucleotides corresponding to the repeat sequence and the remaining 23–25 nucleotides corresponding to the spacer sequence [56]. For type V systems, there are nearly 300,000 putative CRISPR–Cas elements. There is only one predicted RuvC nuclease domain in these Cas nucleases. Rusk (2019) discovered the type V phylogenetic tree to have three major branches: one with Cas12a, Cas12c, Cas12d, and Cas12e; a second branch with Cas12b, Cas12h, and Cas12i; and a third branch with small effectors such as Cas12g, the smallest protein with only 800 amino acids of all the branches [57]. Class 2 type V is further subdivided into the subtypes listed below (V-A, V-B, V-C, and V-U).

However, there is some uncertainty about the protein structure with regard to V-C and V-U. The Cpf1 protein, also known as Cas12a, is encoded by V-A, and its structure paved the way for understanding its mechanism [56]. Cas12a (Cpf1), the prototype of type V effectors, has two RuvC-like nuclease domains but lacks HNH domain. However, a recent structural analysis of the Cas12a–crRNA–target DNA complex elucidates a second nuclease domain with a distinct fold that is functionally equivalent to the Cas9 HNH domain [58].

A T-rich PAM sequence that is (5′-TTN-3′) is required essentially when DNA is cleaved. The positively charged central channel of a nuclease (NUC) lobe may regulate the cleaving of the target strand after catalytic residue mutations in the RuvC domain of Cas12a in the bacterium Acidaminococcus sp. that inhibit the cleaving of the target and non-target strands [59]. Cas12a (cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), and Cas12e (CasX), the latter of which contains a catalytic domain capable of processing its own mRNA. SpCas9, the PAM for Cas12 proteins, on the other hand, contains thymine (T)-rich sequence and is located at the 5' end of the target sequence. The crRNA–target complex accounted for a distorted PAM duplex with a narrow minor groove that serves as a site for base-specific contacts with amino acid side chains via van der Waals and hydrogen bond interactions [20].

Among the variants of Cas12, Cas12c (C2c3), which has a single large protein (100–200 kDa) with a catalytic RNA-guided cleavage activity of DNA/RNA, aids in the identification of a new Class 2 protein classified as Cas12d. Cas12c and Cas12d share structural similarities. Almost 23 distinct Cas12 varieties from microbes from various habitats have been reported; among these microbes, a CPR (Candidate phyla Radiation) bacterium possesses the CRISPR–Cas genome locus that encodes for a Cas12d protein [60]. Cas12b of type V has been primed to acquire genome editing in human cells. The wild-type BhCas12b cleaves the non-target strand at 37 °C, resulting in a double-strand break. We gain loss-of-function mutation for the BhCas12b by modifying it, which overcomes the limitation of lower editing inefficiency. Despite the fact that Cas12b protein is much smaller than Cas9 and Cas12a, the efficient cas12b nuclease (AacCas12b) derived from Alicyclobacillus acidoterrestris has optimal nuclease or cleavage activity at 48 °C [61].

When infected, mature crRNA molecules guide the subtype-specific interference machinery against invading nucleic acids, effectively silencing foreign genetic material. Cas12a and Cas12b cleave the DNA duplex in both strands shortly after the PAM site is recognized and crRNA binds to the target DNA, leaving staggered ends with 5- and 7-nt [62]. The Cas12 has two varieties namely Cas12a (Cpf1), cas12b (C2c1), and Cas12c (C2c2) of which C2c1 and C2c2 contain RuvC-like endonuclease domains that are correlated to Cpf1. While the third system encompasses an effector with two predicted HEN RNase domains [63]. Another Cas12a, Cpf1, originating from Acidaminococcus sp. and recognized as AsCpf1, contains 1307 amino acids, which are a few amino acids shorter than SpCas9 (1368). The crystal structure of AsCpf1 shows that it has two main lobes: (a) a nuclease lobe (REC) and (b) an alpha helical recognition lobe (NUC) [64]. New Cas12a orthologs with improved crRNA scaffolds boost mammalian genome editing. Just three Cas12a orthologs have been used for mammalian genome editing, and editing performance and targeting coverage also need to be improved [65].

CeCas12a, a novel Cas12a derived from Coprococcus eutactus, is an editable nuclease with higher genome editing efficiencies in human cells than AsCas12a and LbCas12a. CeCas12a has shown to be a more rigorous enzyme for PAM recognition both in vitro and in vivo. These orthologous condenses off-target edits at C-containing PAM at various LbCas1-related sites [66]. Mutations in the NUC domain in Francisella novicida resulted in incomplete inactivation of the target strand cleaving [67]. It was shown that the mutations in the CRISPR genes of cotton crops like AsCpf1 (S542R/K607R and S542R/K548V/N552R) and LbCpf1 (G532R/K595R and G532R/K538V/Y542R) aid to recognize TYCV and TATV PAMs [68]. The study of mutation in the RuvC-I motif of Cas12g1 (D513A) demonstrates the functional variety that emerges along with different types of CRISPR–Cas evolution paths and expands the CRISPR toolbox [69].

Applications of Cas12a

Cas12a (Formerly known as Cpf1) is used as an alternative genetic tool of Cas9. Dead Cas12a from Eubacterium eligens (EedCas12a) revealed that it exhibits greater gene repression in the target DNA template strand than deactivated Cas9 from Streptococcus pyogenes (SpdCas9) [67]. It was also demonstrated that Cas12a’s pre-crRNA processing activity makes it a good candidate for multiplex gene regulation [70]. Different literature studies suggest that the multiplex gene regulation of Cas12a was observed in bacteria, plants as well as in mammalian [65, 66]. Cas12a can also be used as a replacement for Cas9 in cell types where Cas9 is toxic, such as some industrial strains of Streptomyces [71]. Targeted mutagenesis in plants can also be accomplished in vivo by co-expression of Cas12a and its related crRNA, as seen in rice. Furthermore, it was demonstrated that mutagenesis was more efficient when pre-crRNAs with full-length direct repeat sequences were used rather than mature crRNAs [72]. Efficient mutagenesis was also observed in soybean and wild tobacco via the delivery of the pre-assembled ribonucleoprotein (RNP) particle. The RNP was constructed using recombinant Cas12a and in vitro transcribed or chemically synthesized crRNAs [67]. Correction of mutations causing Duchene muscular dystrophy (DMD) in patient-derived induced pluripotent stem cells (iPSCs) and mdx mice, a popular model for studying DMD. Dystrophin expression was restored in iPSCs following Cas12a-mediated gene editing, while corrections in the pathophysiological hallmarks of muscular dystrophy were observed in mdx mice [73]. The adenovirus vector was combined with an AsCas12a expression system, resulting in successful mutations in humanized mice with chimeric liver primary human hepatocytes [74]. Cas12a-mediated genome editing was also used to create rat models that resemble human atherosclerosis, and this system may have applications in understanding early-stage atherosclerosis [75].

Biotechnology and Biomedical Engineering

A massive effort is currently underway to redesign all of these tools for biomedical and biotechnological applications. However, recent research has suggested that CRISPR–Cas nucleases could be used in the bioengineering of smart materials such as hydrogels [76]. DNA encapsulates these water-filled polymers. Cas12a was recently used in a study to specifically degrade the DNA scaffold of DNA hydrogels, implying that this smart cutter could be turned into a programmable device to deliver the cargo of DNA encaged hydrogels in a specific location at a specific time. The cleavage properties of Cas12a make it an ideal candidate to promote controlled delivery of the cargo [56]. Cas12a uses a multi-checkpoint mechanism to ensure precise targeting of DNA, which is desirable in a genome editing tool in order to have low off-target effects [56].

Diagnosis

DETECTR, a DNA endonuclease-targeted CRISPR trans reporter, is a diagnostic tool that combines isothermal RPA with Cas12a enzymatic activity [69]. Binding of the crRNA–Cas12a complex to target DNA results in indiscriminate cleavage of ssDNA coupled to a fluorescent reporter in this assay. DETECTR was used to differentiate between HPV16 and HPV18 in crude DNA extracts from cultured human cells and clinical samples [69]. DETECTR was used to test crude extracts from 25 human anal swab samples, with the results compared to an approved PCR-based assay. Following DNA extraction, DETECTR was correctly identified. HPV16 and HPV18 were detected in clinical specimens within 1 h, with results agreeing with PCR results in 25/25 and 23/25 samples, respectively. Cas12a has a wide range of applications. Currently, many studies are going on this genomic tool to find its more uses in research, diagnostics, and the health sector field.

Cas13 Protein

RNA editing has several advantages over traditional DNA editing technology. CRISPR/Cas13 is an example of a major RNA editing technology. Cas13 enzymes, unlike Cas9 and Cpf1, do not require a PAM sequence but do require a protospacer flanking site (PFS) to target RNA. However, LWACas13s do not require a PFS site. Cas13 enzymes lack the RuvC and HNH domains required for DNA cleavage, preventing them from directly editing the genome. A CRISPR/Cas13-based RNA editing system would most likely be reversible, avoiding genomic off-targets introduced by non-homologous end joining (NHEJ).

Cas13 proteins such as Cas13a, Cas13b, and Cas13c are found in CRISPR/Cas systems of class 2 subtype VI that use a single effector protein [77]. The Cas13 protein and CRISPR RNA (guide RNA, 60–65nt) recognize a 28–30nt sequence on the target RNA through a spacer sequence which can make up the active endoRNase. Cas13a fundamental studies have shown two catalytic activities [78]. Cas13a first catalyzes the processing of pre-crRNAs to supply mature crRNAs, the cleavage of the target RNA is intervened by a separate catalytic activity. When the guide RNA base pairs with the target RNA, a catalytic site is formed between two HEPN domains of Cas13. Cas13 has several properties that make it an excellent candidate for an RNA-modulating tool. To begin, Cas13 is frequently programmed to target potentially any RNA by changing the crRNA spacer sequence. Second, unlike Cas9 and Cpf1, which need an exact PAM (protospacer adjacent motif) to be present in the DNA, Cas13 (mainly Cas13b) does not need a specific sequence element in the target site called PFS. Third, among the CRISPR/Cas systems, the effector complex is one of the most basic. Cas13-crRNA may be a two-component system that is easier to control and deliver than trimeric or multimeric systems. Fourth, because Cas13 is responsible for both crRNA biogenesis and silencing, multiple crRNAs can be loaded simultaneously [79]. In knockdown experiments, they discovered that PspCas13b is highly specific to on-target sequences. According to preliminary transcriptomic and gene-level analyses, PspCas13b may be more specific in knockdown experiments than short-hairpin RNAs (shRNAs). The efficiency of silencing was also impressive, with 60–90% suppression [80]. However, the central seed tolerates single mismatches to some extent, caution may be required when designing guide RNAs to avoid off-target effects. It is unclear how some Cas13 achieve such high selectivity, and whether the specificity can be maintained in all cellular and molecular contexts (Table 5).

Table 5 Orthologs and the organisms involved in CRISPR Cas13

RNA Editing Technology of Cas13

CRISPR/Cas13 is a RNA-targeting genome editing system. When ssRNA get recognized, Cas13a (C2c2) gets activated [77]. Cas13a is derived from Leptotrichia wadei and Cas13b is derived from Prevotella sp. P5–125 [81]. Cas13 subtype Cas13a cleaves nitrogenous base uracil anywhere after binding to the target RNA. This process is known as collateral cleavage. This cleavage process spreads to neighboring untargeted RNAs. Cas13a has been found to cut specific target mRNAs spontaneously in both bacteria and eukaryotic cells [77, 81]. dCas13a, a Cas13a variant, retains the ability to bind targeted RNA and was used for RNA imaging in live cells [81]. dCas13a, like RCas9, has been targeted to mRNA to visualize the formation of stress granules [82]. The subtypes of Cas13 endonuclease have distinctive functions in various animals or plant model systems. It is extremely difficult to design crRNA due to the large number of Orthologs and subtypes of cas13. LwaCas13a, PspCas13, and RfxCas13d have been described in the table for targeting RNAs of various genes in mammalian cells. The comparison between Cas13a, cas13b, and Cas13d shows that RfxCas143d has the most important knockdown in HEK293T cells [83]. LwaCas13a has been used as a model system for plant or agriculture-based applications. It has a significant knockdown in rice protoplasts (Table 6).

Table 6 Comparative analysis between Cas9, Cas12, and Cas13

Applications of Cas13

Research and Development

Cas13 has multiple applications in the field of research and development. Cas13 requires a Cas13 variant for RNA knockdown and to reduce off-target activity. CRISPR sequence of Cas13 requires mainly PFS sequence to identify and cleave the target RNA. The smallest Cas13 variant, also known as Cas13d (CasRx), is delivered by Adeno-associated virus vector. Cas13d does not require a PFS sequence to identify the target DNA. This variant of cas13d has high specificity and it reduces off-target activities of Cas, Cas13a plays a unique role in RNA imaging. The catalytically inactive Cas13 (dCas13a) is bounded with GFP (Green Fluorescent Protein), ZFN (Zinc finger nucleases), and KRAB (Kruppel-associated box domain) [78]. The fusion leads to the recognition of endogenous, untagged, and unmodified mRNA [84]. Cas13a (Formerly known as C2c2) protects plants from viruses over time. Cas13a codon optimization is required for proper protein functioning, which can reduce off-target activity while increasing multiplex target activities of RNA-targeting Cas protein [85]. Apoptosis (programmed cell death) is activated inside the cell by non-specific RNA degradation of Cas13a in response to the infection by a cellular pathogen [72]. Cas13b, catalytically inactive Cas13, is fused with eukaryotic RNA-modifying enzyme N6-methyladenosine (m6A) bounded with YTHDF1 Protein to trigger the assembly of translational mechanism. Cas13 variant can be fused with YTHDF2, which can induce RNA degradation. In this way, Cas13 regulates gene expression in eukaryotic cells [86].

RNA Editing Technology

Another important application of inactivated Cas13 is RNA editing technology, REPAIRv1 (RNA Editing for Programmable A to I Replacement) [81]. Here, dCas13 was fused with human adenosine deaminase acting on RNA (ADAR) enzymes called ADAR1 and ADAR2. The programmable replacement of adenosine with inosine has been recognized as guanine during complementary base pairing. The replacement of A to I causes temporary alteration of translation of a selected gene. The specificity of this system is improved by REPAIRv2, which introduces a mutation within the ADAR2 enzyme [81].

Diagnosis

An emerging application of Cas13 in the field of diagnosis is SHERLOCK (Specific High-Sensitivity Enzymatic Reporter unLOCKing) Technology. This can be used for in vitro detection of DNA and RNA in a sample [87]. In this method sample, the nucleic acid is amplified to dsDNA by recombinase polymerase. Here, RNA sample undergoes reverse transcription and produces dsDNA, which serves as a template of T7 polymerase that produces ssRNA transcripts. Cas13 scans those ssRNA transcripts. The activated Cas13 cleaves the reporter upon recognition and releases the signal. SHERLOCK Technology is a highly sensitive and effective diagnostic technique used for the detection of Zika and dengue viruses [87]. The improved version of SHERLOCK was SHERLOCKv2. This was a multiplex detection method [88]. Multiple variants of Cas13 prefer different motifs of RNA of nt sequences AU, AC, UC, or GA. Different reporters can be used to detect various RNA in the sample. SHERLOCKv2 uses a Csm6 protein that is allosterically activated by Cas13 via cyclic oligoadenylates upon its own activation to improve signal strength. The combination of Csm6 and Cas13 collateral RNA cleavage results in a stronger signal from cleaved reporters [88].

Role of CRISPR/Cas System in COVID-19

A CRISPR/Cas12-based diagnostic method for the detection of the COVID-19 virus has been developed called SARS_Cov2 DETECTR. The total detection time is around 40 min and it has 95% accuracy. This assay has been started with RNA extraction, then reverse transcription and isothermal amplification by using RT_LAMP. Cas12 and guide RNA target E (Envelope) and N (Nucleoprotein) gene of SARS-Cov2. This assay also includes a laminar flow strip for visual readouts, with a single band close to where the sample was applied indicating a negative test and two higher bands or a single higher band indicating cleavage of the fluorescent probe and thus positive for SARS-Cov2 [89]. By the recently discovered RNA-guided RNA-targeting endonuclease, Cas13 can be used as a potential therapeutic tool against COVID-19.PAC_MAN (Prophylactic Antiviral CRISPR in human cells).

PAC-MAN (Prophylactic Antiviral CRISPR in huMAN cells) was created using the Ruminococcus flavefaciens-derived VI-D CRISPR–Cas13d variant, which was chosen for its small size, which allows for easier packaging in viral vehicles, high specificity, and strong catalytic activity in human cells. Given the evidence indicating relatively high mutation and recombination rates of SARS-CoV-2, this technique was developed to simultaneously target multiple regions for RNA degradation, paving the way for a much-needed pan coronavirus targeting strategy [90].

Future Prospects

CRISPR–Cas technology is transforming the new face of research. As compared with the other genome editing tools, it is very cheap, feasible, and more efficient in manipulating the genes. Many of the future challenges in the field of research, medicine, or agriculture are mainly depend on this technology from simple organisms to complex organisms also from targeting one gene to multiple genes. Although this technology has a lot of promise for medicinal development, efficacy and long-term safety are still major problems in clinical trials. Off-target editing, massive DNA deletions, and other rearrangements are all serious safety concerns that have yet to be effectively addressed in preclinical research. Recent studies have shown that CRISPR/Cas9-mediated in vivo genome editing of disease genes can improve disease phenotype and pathology in animal models without causing serious side effects [91]. Meanwhile, researchers have been contributing towards the significant progress in the CRISPR–Cas system to increase the target specificity effectively.

Technological and ethical concerns are still present, which makes an impact on not having the next level of discoveries. But it has the potential to eradicate genetic disorders, making non-pathogens, bringing back extinct species, treating invasive diseases, etc. Continuing the research will have clear, precise information about the CRISPR–Cas system, and more advancement could occur in the near future. Human genome editing for reproductive purposes, according to bioethicists and experts, should not be tried at this time, but work to make gene therapy safe and successful should continue. The National Academy of Sciences, National Academy of Medicine, Chinese Academy of Sciences, and the Royal Society of the United Kingdom call together an international summit to discuss recent scientific advances in human gene editing as well as the range of ethical and governance issues that these advances raise. Most of the biotech startup has already started working on this system and have a hope to improvise also to develop the cutting-edge research and innovations. The possibilities of keen understanding will create excitement among sciences to develop novel strategies in the genome editing approach.