Introduction

Soybean [Glycine max (L.) Merr.] is an important crop that can be used to prevent and treat human health problems because soybean seeds contain abundant nutrients, such as proteins, oils, sugars, and minerals (Bellaloui et al. 2015). Soybeans contain approximately 40% protein and 20% oil (Clemente and Cahoon 2009). Soybean protein is used mostly in human and animal food. Soybean oil is utilized in food, feed, and some industrial applications, such as biodiesel (Clemente and Cahoon 2009). However, to further expand the utilization of soybean, some problems need to be addressed, including the identification of the appropriate ratio of fatty acids, pest resistance, and allergy issues.

Soybean oil consists of the following five fatty acids: palmitic acid (16:0), stearic acid (18:0), oleic acid (18:1), linoleic acid (18:2), and linolenic acid (18:3). In the past, soybean breeding has focused on the improvement of the overall yield of soybean, including increases in total protein and oil (Mahmoud et al. 2006). The variability of soybean germplasm has contributed to critical progress in the improvement of the quality of the oil for food, feed, and industrial applications, using molecular biology and biotechnology tools (Damude and Kinney 2008). Soybean genetics and breeding programs have been developed to improve both oil content and quality, because of increasing demand for vegetable oils and an explosion of interest in the health issues around dietary fats (Clemente and Cahoon 2009). These technologies have led to improved soybean oil quality with high oxidative stability, to enhance the amount of omega-3 fatty acids and avoid the generation of trans fats (Damude and Kinney 2008; Clemente and Cahoon 2009). Some issues related to improvement in the yield and quality of soybeans still need to be addressed. Loss of productivity of soybean crops can be produced by the delayed root growth and consumption of plant nutrients by the soybean cyst nematode (SCN) (Wrather and Koenning 2006). Current SCN-resistant soybean cultivars were mainly developed using germplasms including PI88788, Peking (PI548402), and PI437654, in the United States (Mitchum 2016). However, it is necessary to identify novel genetic resources involved in SCN resistance, because of reduced genetic variability. Soybean is a major food causing allergic reactions and has the high immunological cross-reactivity with almost all other legumes, peanuts, birch pollens, and cow milk proteins (Watanabe et al. 2017). At least 16 potential proteins causing allergies have been identified in soybeans. The immunodominant protein Gly m Bd 30 K (P34) has been identified and characterized, and introgression of low-P34 soybean germplasm has been attempted in order to develop hypoallergenic soybeans for use in food (Herman et al. 2003; Sewekow et al. 2008). Two natural P34-null soybean germplasms with a low expression of P34 protein were isolated via screening of approximately 16,266 accessions by the US Department of Agriculture (Joseph et al. 2006). The breeding of hypoallergenic soybean germplasm provides a basis for the development of a valuable raw material for soybean food production (Herman et al. 2003; Joseph et al. 2006). The optimum yield of soybean is influenced by agricultural decisions, including planting dates, row spacing, seeding rates, and crop rotation (Iqbal et al. 2019). The yield and seed quality of soybean are significantly affected by various environmental conditions, such as biotic and abiotic stresses (Li et al. 2020). Even though traditional breeding and gene insertion approaches have contributed to the development of soybean cultivars with high yield potential and strong stress tolerance, we still need efficient and effective tools for soybean improvement (Hyten et al. 2006; Li et al. 2020). Recently developed genome editing technology is a promising tool for accelerating the genetic engineering of agronomically important traits in soybean.

Genome editing is a useful technology not only for the identification of gene function but also for improving crop traits. Genome editing has advanced by major three technologies: the use of zinc finger nuclease (ZFN), transcription activator-like effector nuclease (TALEN), and the clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein nuclease (Cas) system (Wada et al. 2020). These technologies generate double-stranded breaks (DSBs) in the DNA sequence of the target genome, using a nuclease, which are then corrected using homology-directed repair or nonhomologous end joining (NHEJ) (Gaj et al. 2013). Both ZFN and TALEN edit the genome using Fok I nuclease, which has high specificity but different specific sequence DNA binding modules. ZFNs induce both deletions and insertions, whereas TALENs are more effective in inducing deletions than insertions (Ul Ain et al. 2015). CRISPR/Cas is a genome-editing system that involves the generation of DSBs by interacting with Cas nuclease and a guide RNA (gRNA), based on DNA sequences complementary to the gRNA (Jacobs et al. 2015). CRISPR/Cas technology is the most widely used genome-editing tool because it is simpler, faster, and more efficient than ZFN and TALEN (Sun et al. 2015). The CRISPR/Cas system is classified into three categories—types I, II, and III—according to the Cas nucleases and CRISPR sequences used (Sun et al. 2015). Type I and III systems are composed of multiple Cas nucleases (e.g., Cas3 and Cas6) for editing foreign DNA/RNA, whereas the type II system directly cleaves the targeted sequence of genomic DNA using a single Cas nuclease, such as Cas9 (Makarova et al. 2011). The CRISPR/Cas9 system progresses through the formation of mature dual CRISPR RNA (crRNA) with trans-encoded small crRNA. When Cas9 loads with a short crRNA sequence near a typical sequence (NGG) in a proto-spacer-adjacent motif (PAM), Cas9 directly cleaves the target sequence (Makarova et al. 2011). A synthetic single gRNA (sgRNA) has been used in place of the mature crRNA in the CRISPR/Cas9 system (Sun et al. 2015). Thus, sgRNA and Cas9 are important components of simple and efficient genome editing, and the CRISPR/Cas9 system is widely used in plant genetic studies and crop biotechnology. This system is used in many crop species, including rice, sorghum, wheat, maize, soybean, tomato, sweet orange, apple, and grapevine (Feng et al. 2013; Jiang et al. 2013; Shan et al. 2013; Brooks et al. 2014; Jia and Wang 2014; Xing et al. 2014; Li et al. 2015; Osakabe et al. 2018). However, the CRISPR/Cas9 system in crops can lead to the generation of unintended alleles after editing via the double-strand cleavage of DNA. To address this problem and decrease the effect of off-target changes caused by Cas9, A-to-G base editing (BE) by adenine base editors and C-to-T BE by cytosine base editors were developed using Cas9 nickase (nCas9) (Kang et al. 2018; Yan et al. 2018). Crop genome editing and sequencing technologies are being continually and rapidly developed, as important tools for improving crop traits.

In this review, we classify and summarize the existing applications of genome-editing systems, including ZFN, TALEN, and CRISPR/Cas9, in soybeans. We analyze and compare the genetic variation caused by genome editing with natural variations in soybeans. We also discuss future directions for the optimization and development of new genome-editing tools in soybeans.

Soybean genome editing using ZFN

In soybeans, genetic tools, such as RNAi-based mutagenesis (Flores et al. 2008) and random mutagenesis using radiation mutagens (Men et al. 2002), chemical mutagens such as ethylmethanosulfonate (Cooper et al. 2008), and transposon-based mutagens (Mathieu et al. 2009), have been used for increasing genetic variability. However, in soybeans with highly duplicated genomes, these mutagenesis methods can produce problems such as the silencing of individual gene copies with no phenotype (Curtin et al. 2011). Site-directed mutagenesis using ZFN has been useful for solving these problems. After ZFN activity tests using GFP transgenes, ZFN-induced mutagenesis began in endogenous soybean genes such as Dicer-like (DCL), RNA-dependent RNA polymerase (RDR), and HUA enhancer 1 (HEN1), using hairy root transformation methods (Curtin et al. 2011; Sander et al. 2011). ZFN using NHEJ-mediated repair was used for the targeted mutagenesis of the Fatty acid desaturase 2-1A (FAD2-1A) gene in soybean (Bonawitz et al. 2019). Although mutagenesis using ZFN has a high success rate, it should be improved for additional applications in soybeans, to create allelic replacements such as similar site-directed approaches in paralogous genome loci.

Soybean genome editing using TALEN

TALEN has the advantage of highly efficient genome modification compared with ZFN (Bedell et al. 2012). In TALEN, the TAL effector (TALE), which has a 13–28 tandem repeat domain, recognizes a single DNA base pair and binds to specific DNA sequences by repeat-variable diresidues (RVDs) (Graham et al. 2020). The duplicated genes of Fatty acid desaturase 2–1 (FAD2-1A and FAD2-1B), which are major regulators of the soybean fatty acid pathway, have been edited with high efficiency using TALEN (Haun et al. 2014). Targeted mutagenesis of fad2-1A, fad2-1B, and fad2-1A fad2-1B in soybeans has led to improved oil quality (Haun et al. 2014). Additional targeted mutagenesis of FAD3A, which is the gene with the greatest effect on the linolenic acid pathway in soybeans, in fad2-1A fad2-1B soybean plants using TALEN significantly reduces linolenic acid levels compared to fad2-1A fad2-1B plants (Demorest et al. 2016). Although TALEN is a simple, efficient, and widely used method, the DNA binding ability of TALE is poor, because of the presence of 5-methylated cytosine (5mC) in endogenous targeted genes (Valton et al. 2012). TALEN also has problems with transfection in cells because the molecular size of TALE is larger than that of ZFN (Park et al. 2019). Thus, the use of TALEN for trait improvement and plant mutagenesis has been restricted (Weeks et al. 2016).

Soybean genome editing using CRISPR/Cas9

CRISPR/Cas9 technology is the most recent platform for genome editing and is a novel alternative to the Fok I-mediated ZFN and TALEN technologies (Jacobs et al. 2015; Du et al. 2016; Curtin et al. 2018). The efficiency of targeted mutagenesis in soybeans using the TALEN and CRISPR/Cas9 systems was recently compared (Du et al. 2016). Mutation by TALEN using the Arabidopsis AtU6-26 promoter was more efficient than that by CRISPR/Cas9. However, targeted gene mutation using CRISPR/Cas9 with the soybean GmU6-16 g-1 promoter was more efficient than that using TALEN (Du et al. 2016). Soybean has a highly duplicated genome, in which approximately 75% of the genes exist as multiple copies (Schmutz et al. 2010). Mutation by CRISPR/Cas9 technology is more appropriate for the editing of multiple target genes in soybeans (Du et al. 2016). Gene knockout using CRISPR/Cas9 can create null alleles of multiple genes because of the use of one common gRNA with common target sequences. CRISPR/Cas9 using Streptococcus pyogenes Cas9 (SpCas9) and gRNAs of nine targeted genes were used with the particle bombardment methods in soybean embryogenic cultures and showed editing efficiencies ranging from 8 to 76% in all nine genes (Li et al. 2015). The soybean Acetolactate synthase 1 (ALS1) gene is a major metabolic enzyme in branched chain amino acid biosynthesis and is involved in resistance to chlorsulfuron herbicides (Walter et al. 2014). CRISPR/Cas9 with SpCas9 and gRNA based on the unique PAM sequence of ALS1 was used to edit proline to serine at position 178 of the amino acid by changing the DNA sequence from CCC to AGC. Simultaneously, other mutations of the ALS1 gene by CRISPR/Cas9 occurred the 5-bp deletion near the PAM sequence on the cleavage site of the gRNA (Li et al. 2015).

CRISPR/Cas9 technology has been successfully applied to both the identification of gene functions and the improvement of agriculturally important traits, including flowering time, maturity, plant architecture, seed quality, and tolerance to environmental stresses in soybeans.

Application of CRISPR/Cas9-mediated gene editing to the regulation of soybean growth and development

Soybean yield is often influenced by changing agricultural traits such as flowering time, maturity, and plant height (Kantolic and Slafer 2007). Twelve genes in soybean play important roles as major regulators in flowering and maturity responses, including E1 to E11 and J (Cao et al. 2017; Lin et al. 2021). The soybean E1, E2, and E3 genes delay the flowering time when dominant and accelerate the flowering time when recessive (Miranda et al. 2020). The mutant allele Long juvenile trait (J) gene influences the flowering response with the E1 gene under short-day (SD) condition (Miranda et al. 2020). Soybean height affects the number of nodes and pods as well as yields, due to its influence on stem architecture (Kilgore-Norquest and Sneller 2000). The Dt1 gene in soybean is a major regulator of stem architecture, and dt1 recessive alleles reduce both the height and the number of nodes by promoting the termination of apical stem growth (Liu et al. 2010; Tian et al. 2010). Targeted mutagenesis of the soybean E1 gene using CRISPR/Cas9 resulted in two types of deletion mutants 11- and 40-bp in the coding region and produced an early flowering phenotype under long-day (LD) conditions (Han et al. 2019).

Targeted mutagenesis of four Late elongated hypocotyl (LHY) genes, a key component of the central oscillator regulating circadian clock in Arabidopsis (Genoud et al. 1998), by CRISPR/Cas9 produced a reduction in soybean height and elongation of internodes (Cheng et al. 2019). The Squamosa promoter binding protein-like 9 (SPL9) transcription factors are involved in the determination of plant architecture and yields in soybean (Bao et al. 2019). CRISPR/Cas9-mediated mutagenesis of four GmSPL9 genes resulted in a 1-bp deletion in the target site of GmSPL9a and GmSPL9b, and spl9a spl9b homozygous double mutants had the shorter plastochrons. Soybean plants with various combinations of GmSPL9s mutations had increased numbers of nodes (Bao et al. 2019). The quadruple mutagenesis of four Apetala 1 (AP1) genes, which function in the determination of floral meristem and floral organs in Arabidopsis (Mandel et al. 1992), using CRISPR/Cas9, changed the development of soybeans, producing effects including delayed flowering, changes in flower morphology, and increased node numbers and internode lengths (Chen et al. 2020b). Site-directed mutagenesis of the soybean floral activators, Flowering locus T 2a (GmFT2a) and Flowering locus T 5a (GmFT5a) genes, using CRISPR/Cas9 resulted in a late flowering phenotype compared with the flowering time of wild-type plants (Cai et al. 2018, 2020b). Null mutant alleles of the GmFT2b gene produced using CRISPR/Cas9 showed the late flowering phenotype under floral repressing LD conditions (Chen et al. 2020). BE mutagenesis of the GmFT2a and GmFT4 genes using CRISPR/Cas9 changed nucleotides, such as C to T or C to G mutations, at their target sites (Cai et al. 2020a). Two independent T1‐ft2a‐BE alleles showed the late flowering phenotype under both SD and LD conditions (Cai et al. 2020a). The soybean Pseudo-response regulator 37 (GmPRR37) gene was mutated using CRISPR/Cas9 in the late flowering soybean cultivar Zigongdongdou (ZGDD) (Wang L. et al., 2020). CRISPR/Cas9-mediated mutagenesis of GmPRR37 in ZGDD produced an early flowering phenotype and higher expression of GmFT2a and GmFT5a, under natural LD conditions (Wang L. et al. 2020).

The Kinase-inducible domain interacting 8–1 (GmKIX8-1) gene in soybean acts as a negative regulator of cell proliferation by controlling the transcriptional expression of D3-type cyclins (CYCLIN D3) (Baekelandt et al. 2018; Nguyen et al. 2021). CRISPR9/Cas9-induced GmKIX8-1 deletion mutants had larger seeds and leaves than those of wild-type plants, produced by increased cell proliferation (Nguyen et al. 2021).

Application of CRISPR/Cas9-mediated gene editing to the improvement of soybean seed quality

CRISPR/Cas9 technology has been also used to improve soybean seed quality by reducing the content of undesirable factors such as allergenic proteins and indigestible sugars. Mutation of three (Glyma.20g148400, Glyma.03g163500, and Glyma.19g164900) of nine major genes for allergenic conglycinin and glycinin proteins using CRISPR/Cas9 was performed using hairy root transformation methods in the soybean cultivar Harosoy 63 (Li et al. 2019). Conglycinin and glycinin proteins constitute approximately 70% of the total seed proteins in soybeans and are important biochemical components influencing the quantity and quality of soybean food products (Li et al. 2019). Sugars in soybean seeds are important for seed longevity (ElSayed et al. 2014). Raffinose family oligosaccharides (RFOs; raffinose, stachyose, and verbascose) are ubiquitous storage products and desiccation-resistance components in soybean seeds (Wang et al. 2009; ElSayed et al. 2014; de Souza et al. 2016). Although raffinose and stachyose RFOs have positive influences on metabolizable energy, these components are regarded as anti-nutritional components in soybean food because they are indigestible by animals (Valentine et al 2017). Soybean galactinol synthase is a major enzyme in the biosynthesis of RFO pathway (de Souza  et al. 2016). CRISPR/Cas9-mediated double mutagenesis of two soybean Galactinol synthase 1 genes, GmGOLS1A and GmGOLS1B, produced a reduction in the total amount of RFOs in soybean seeds, resulting in improved nutritional quality (Le et al. 2020). The Fatty acid desaturase 2 (GmFAD2) changes monounsaturated oleic acid to polyunsaturated linoleic acid, controlling the amount of monounsaturated fats in soybean seeds (Do et al. 2019). Mutant alleles of the GmFAD2-1A and GmFAD2-1B genes produced using CRISPR/Cas9 produced increased amounts of oleic acid (over 80%) and decreased amounts of linoleic acid (approximately 1.3%–1.7%) in soybean seeds (Do et al. 2019). The mutation of FAD2-2 microsomal omega-6 desaturase (GmFAD2-2) using CRISPR/Cas9 increased the content of oleic acid (approximately 45.08%–65.9%) and reduced the levels of linoleic acid (approximately 16.08%–31.95%) in soybean seeds (Al Amin et al. 2019). Simultaneous mutagenesis of the GmFAD2-1A and GmFAD2-2A genes using CRISPR/Cas9 produced an increase in the oleic acid content of soybean seeds (Wu et al. 2020). Three lipoxygenases—LOX1, LOX2, and LOX3—contribute to the preposterous taste of soybeans (Liavonchanka and Feussner 2006). CRISPR/Cas9-mediated triple gmlox1/gmlox2/gmlox3 and double gmlox1/gmlox2 mutagenesis resulted in the reduction of this taste because the activity of lipoxygenases was reduced (Wang J. et al., 2020).

Application of CRISPR/Cas9-mediated gene editing to the verification of gene function in response to environmental stress

CRISPR/Cas9 technology has been successfully applied to the characterization of the genes involved in soybean responses to biotic and abiotic stresses such as pathogens, salt, heat, and drought. The Arabidopsis Constitutive expression of PR genes 5 (CPR5) gene is involved in disease resistance signaling pathways, and cpr5 mutants belong to the “disease lesion mimics” class because of their reaction tendency to as if continuous pathogen attack (Bowling et al. 1997). Targeted mutagenesis of the soybean GmCPR5 gene using CRISPR/Cas9 produced shorter trichomes in soybean leaves (Campbell et al. 2019). Thus, CRISPR/Cas9-mediated gmcpr5 mutants will be suitable for use in the study of the function of trichomes in plant–pathogen interactions in soybean. The Heat shock protein 90 s (Hsp90s), which acts as a molecular chaperone, plays an important role in plant protective responses to various environmental stresses (Xu et al. 2013). CRISPR/Cas9-mediated mutagenesis of the soybean Hsp90A2 (GmHsp90A2) gene decreased plant thermotolerance to heat stress and increased wilting in soybeans (Huang et al. 2019). Plant Sodium/hydrogen exchangers (NHXs) act as ion exchangers in the membrane by catalyzing the movement of Na+ or K+ ions (Barragán et al. 2012). A soybean gmnhk5 knockout mutant produced using CRISPR/Cas9 decreased the salt tolerance of hairy roots, whereas GmNHK5 overexpressing plants showed improved salt tolerance (Sun et al. 2021). Plant transcription factors such as AP2/EREBP, WRKY, NAC, and MYB are associated with plant resistance to abiotic stresses through the regulation of ABA signaling (Yu et al. 2021). Targeted mutagenesis of soybean NAC06 (GmNAC06) gene using CRISPR/Cas9 decreased plant tolerance to salt stress caused by the accumulation of reactive oxygen species (Li et al. 2021). Nuclear factor Y transcription factors, including NF-YA, NF-YB, and NF-YC, are involved in ABA-dependent responses to abiotic stresses, flowering time, and plant development processes (Yamamoto et al. 2009; Liang et al. 2012; Yu et al. 2021). CRISPR/Cas9-induced mutagenesis of the soybean Nuclear factor-YC subunit 14 (GmNF-YC14) gene decreased plant tolerance to salt and drought stresses (Yu et al. 2021). These results indicate that CRISPR/Cas9-mediated gene editing has provided useful clues for verifying gene functions related to stress signaling and the development of stress-tolerant soybean plants.

Application of a multiplex mutagenesis population using a CRISPR/Cas9 library in soybeans

Recent research using CRISPR/Cas9 technology has indicated that plants can be generated via large-scale transformation using CRISPR/Cas9 libraries, to improve crop genetic variation (Jacobs et al. 2017; Lu et al. 2017; Meng et al. 2017; Bai et al. 2020). In soybeans, a pooled mutagenesis population generated 407 double or triple mutants using a single multiplex sgRNA or combination of multiple sgRNAs (Bai et al. 2020). Among them, double mutagenesis of the rhizobia-induced Clavata3/embryo surrounding region-related (CLE) genes, GmRIC1 and GmRIC2, produced increased numbers of nodules in soybean roots. Triple mutagenesis of the nitrate-induced CLE genes, GmRDN1-1, GmRDN1-2, and GmRDN1-3, produced decreased nodulation in soybean roots (Bai et al. 2020). GmRICs and GmRDNs, CLE peptide-encoding genes, are associated with the regulation of nodulation in soybean roots (Reid et al. 2011). Specific gRNAs and the CRISPR/Cas9 have been used to develop an easy, rapid, and highly efficient technique to simultaneously edit various homologous genes in soybeans, an approach that can facilitate the molecular manipulation and identification of genes in soybeans.

Development of cas nucleases in crops

This review summarizes the ways in which the functions of numerous soybean genes have been evaluated using genome-editing technologies (Table 1). In addition, we showed the historical progress of the soybean genome editing to improve growth, seed quality, and stress tolerance (Fig. 1). The ZFN, TALEN, and CRISPR/Cas9 systems have been successfully applied to soybean genome editing, including deletion of DNA fragments, BE, DNA insertion, and gene knockout. Of those technologies, CRISPR/Cas9 is the most efficient. In the CRISPR/Cas9 system, the Cas9 nuclease used is an important component of genome editing (Zhang Y. et al. 2019). For instance, SpCas9, which has a very simple PAM (NGG) has been codon optimized for various species, including human (hCas9; Mali et al. 2013), plant (pcoCas9 and Cas9p; Li et al. 2013; Ma et al. 2015), Arabidopsis (AteCas9; Fauser et al. 2014), maize (zCas9; Lee et al. 2019), and soybean (GmCas9; Michno et al. 2015). Thus, identification of the most efficient type of Cas9 nuclease is important to the development and improvement of this versatile genome-editing tool in soybeans.

Table 1 Summary of soybean genes and traits used in genome editing
Fig. 1
figure 1

Progress and prospects of genome editing-mediated soybean improvement. The target traits and corresponding genes used for soybean genome editing are presented in a chronological order. Prospects for genome editing for further soybean improvement are suggested. The ZFN, TALEN, and CRISPR/Cas9 indicates zinc finger nuclease, transcription activator-like effector nuclease, and clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein nuclease9, respectively

There have been 42 orthologous Cas9 and Cas nucleases identified to date, with various PAMs (Zhang Y. et al. 2019). Recently, the Cpf1 nuclease, known as Cas12a, was identified as a new type of V-A Cas enzyme in class II CRISPR systems (Zetsche et al. 2015). The Cpf1 nuclease targets the T-rich region in the PAM and uses a short guide crRNA, which allows easy synthesis and engineering of crRNA (Zetsche et al. 2015). The Cpf1 nucleases, known as Cas12a, with the PAM TTTV from Lachnospiraceae bacterium (LpCpf1) and Acidaminococcus spp. (AsCpf1) have been used for targeted mutagenesis of the FAD2-1A and FAD2-1B genes in soybean protoplast cells (Kim et al. 2017). The editing efficiency using LpCpf1 and soybean Ubiquitin (GmU) promoter was increased to 91.7%, and eight targeted genes in soybean were simultaneously mutated (Duan et al. 2021). These studies demonstrated that Cpf1 nuclease is as effective and efficient in soybean genome editing as other Cas9 nucleases. Orthologous Cas nucleases from different bacterial species have been identified, including Cpf1, C2c1 (known as Cas12b), Cms1, CasX (known as Cas12e), CasY (known as Cas12d), and Cas13 (Zetsche et al. 2015; Xu et al. 2020). Genome editing using these Cas nucleases has been successful in major crops and model plant species, except soybean (Zetsche et al. 2015; Xu et al. 2020). The applications of various Cas nucleases, and the identification of the most effective Cas nucleases for soybean gene editing, would be valuable for efficient gene editing in the highly duplicated soybean genome.

Prospects for genome editing for soybean improvement

The use of genome editing to produce sophisticated modifications of specific target genes, without altering other traits, can accelerate crop breeding. In soybean, which has duplicated and multiplex genes, genome–editing technology can avert the complicated and time-consuming processes of multiple crossings required by traditional breeding to generate multi-trait soybean cultivars. Thus, it would be valuable to develop new and efficient CRISPR/Cas systems, suitable for simultaneous editing of multiplex gene family members. A multiplex CRISPR/Cas system could be widely applied to the genome editing of various crops with polyploid genomes. In soybeans, gene editing has been mainly applied to the investigation of developmental processes and the improvement of the quality of soybean seeds. In contrast, gene editing for enhancing stress tolerance has not yet been seriously attempted in soybean. There are an increasing number of health-conscious consumers, including vegetarians and vegans, for whom soybean can be used as a plant-based meat substitute. Because of the emergence of the plant-based food industry, the demand for soybeans is expected to grow very rapidly. For the wider utilization of soybeans in the new food industry, the amounts of undesirable factors such as allergenic proteins, phytic acid, and nondigestible saccharides such as melibiose, raffinose, and stachyose should be reduced or removed in soybean seeds. Thus, the elimination of various anti-nutritional factors and the increase of functional materials in soybean seeds using gene editing will lead to advances in the soybean-based food industry and feed manufacturing.