Alejandro Thérèse Navarro, Peter Bourke, Eric van de Weg, Paul Arens, Richard Finkers, Chris Maliepaard. Wageningen University and Research, Wagengingen, the Netherlands.
Linkage mapping is an approach to order markers based on recombination events. Mapping algorithms cannot easily handle genotyping errors, which are common in high-throughput genotyping data. To solve this issue, strategies have been developed, aimed mostly at identifying and eliminating spurious genotypes. One such strategy is SMOOTH (van Os et al. 2005), an iterative algorithm to detect genotyping errors. Unlike other approaches, SMOOTH can also be used to impute the most probable alternative genotypes, but its application is limited to diploid species and to markers heterozygous only in one of the parents. We adapted SMOOTH to expand its use to any marker type and to autopolyploids with the use of identity-by-descent probabilities, naming the updated algorithm Smooth Descent (SD). We applied SD to real and simulated data, showing that in the presence of genotyping errors this method produces better genetic maps in terms of marker order and map length. SD is particularly useful for error rates between 5% and 20% and when error rates are not homogeneous among markers or individuals. Moreover, the simplicity of the algorithm allows hundreds of thousands of markers to be efficiently processed, thus being particularly useful for error detection in high-throughput data. We implemented SD within an R package, SmoothDescent that can perform error detection, genotype imputation, and iterative mapping for diploids and autopolyploids.
Cristiane H. Taniguti1, Lucas M. Taniguti3, Gabriel S. Gesteira2, Thiago P. Oliveira3, Jeekin Lau1, Getulio C. Ferreira3, Rodrigo R. Amadeu3, David Byrne1, Oscar Riera-Lizarazu1, Guilherme S. Pereira2, Marcelo Mollinari2, and Augusto F. Garcia3. 1Texas A&M University, College Station, TX. 2North Carolina State University, Raleigh, NC. 3University of Sao Paulo, Sao Paulo, Brazil.
High-throughput sequencing methods produce millions of sequence reads that need to be processed by bioinformatic tools before being applied in genetics research. For each step of the procedure, such as alignment of reads, SNPs identification, and genotype calling, several tools are available, all with different methods and parameters to be selected by users. Changes in a single parameter in the pipeline can cause downstream consequences in the analysis quality. Because the genetic properties of meiotic events are well-known, it is possible to identify low-quality markers using linkage analysis. Genotyping errors lead to an overestimation of recombination events amount, inflated linkage map distances, and issues while grouping and ordering markers. Thus, good-quality genetic maps validate all upstream procedures and help to identify the best combinations of software and parameters. Here, we present the Reads2Map workflows to build linkage maps from sequencing data of experimental F1 outcrossing populations testing combinations of upstream tools. The workflows are written with Workflow Description Language (WDL) which offers a comprehensive structure and metadata for each step, making it easier for users to adapt specific parameters. WDL also allows interfacing with containers to increase reproducibility, facilitate access to diverse software, and use in high-performance computing or cloud service environments. The final workflow output is the input for the Reads2MapApp, a Shiny app, which allows interactive visualization of the produced genetic maps and selection of the best pipeline. We demonstrate Reads2Map workflows and Reads2MapApp using both simulated and empirical RADseq data.
Jibran Tahir and Cyril Brendolise. The New Zealand Institute of Plant and Food Research.
Polyploidy is a key driver of significant evolutionary changes in plant species. The genus Actinidia (kiwifruit) exhibits multiple ploidy levels, which contribute to novel fruit traits, high yields, and resistance to the pathogen. Breeding programs focus on tetraploid cultivars, because of their resilience to the Psa disease, as well as robust fruit quality traits and size. However, there remains a substantial knowledge gap concerning the chromosomal biology and complex gene-trait associations in polyploid kiwifruit, which also exhibits high heterozygosity due to dioecy, compared with diploid species. Here we present the first case study where we employed CaptureSeq for genotyping 235 F1 heterozygous individuals of a tetraploid kiwifruit mapping population using 10K baits which generated a total of 725,175 raw SNP variants. Using this dataset we first identified caveats in the population structure by studying the kinship and recombination landscape. We then performed dosage counts, using an empirical Bayesian analysis (Updog, “normal model”), and selected 45,436 high-quality SNPs across 188 F1s for generating ultra-dense linkage maps for 29 chromosomes, using polymapR. To further assess the quality of our maps, we estimated the Identity-by-descent (IBD) probabilities. We also tested the nature of pairing among chromosomes and provided the first evidence from a mapping population that the inheritance in polyploid kiwifruit species is mostly polysomic with preferential pairing in a few LGs. Finally, we performed QTL mapping for two key polygenic traits for breeding (bacterial resistance and flowering) and a monogenic trait for breeding operation (sex) and calculated the additive effects from various QTLs for a trait. We also tested an HRM-based SNP marker workflow downstream of QTL mapping, to assess the possibility of marker-assisted selection for these traits. Our study provides a detailed view of multiple challenges and successes in solving the genetic maze of polygenic traits in a polyploid dioecious species.
Mason Chizk1, Margaret Worthington1, Carmen Johns1, Carly Godwin1, John R. Clark1, Rishi Aryal2, and Hamid Ashrafi2. 1University of Arkansas, Fayetteville, AR. 2North Carolina State University, Raleigh, NC.
Until recently, large-scale association mapping studies in autopolyploid crop species have been impractical due to the cost of sequencing with adequate read-depth, computing limitations, and software availability. In a collaborative effort that leverages targeted next-generation sequencing (NGS) technologies and newly available software packages to overcome these obstacles, we report the first genome-wide association study (GWAS) in autotetraploid blackberry (Rubus L. subgenus Rubus Watson), identifying numerous quantitative trait loci (QTL) for potential use in marker-assisted selection (MAS). Using NGS via the Capture-Seq platform, 35,054 well-distributed probes were designed from the ‘Hillquist’ (Rubus argutus Link.) reference genome. From 2019 to 2021, a panel of 307 commercially available fresh-market blackberry cultivars and University of Arkansas Division of Agriculture breeding selections were sequenced and phenotyped for various traits that encompass fruit morphology and postharvest texture acidity, sweetness, seed traits, thorniness, and plant height. Using the designed probe set, 81,150 high-quality single nucleotide polymorphisms (SNPs) were identified with greater than 150x average read-depth. Through phenotypic analyses, high broad-sense heritabilities exceeding 0.75 were observed for multiple quantitative traits including primocane height, thorn density, fruit weight, fruit length, drupelet number, 100-seed weight, and seed width/length ratio. QTL were identified for all for each of these traits except fruit weight. A QTL for fruit firmness was identified on Ra04, less than 0.5 Mb away from two polygalacturonase (PG) homologs and two pectin methylesterase (PME) homologs. Homologs for both candidate genes have been widely implicated in texture variation for other crops. A shared QTL for titratable acidity (TA) and pH was identified on Ra05 near three malate synthases, two vacuolar malate transporters, one MYB transcription factor, and one phosphoenolpyruvate carboxylase PEPC. Other major QTL associated with primocane height and thornlessness were discovered on chromosome Ra04 at 25 and 33 Mb, respectively. These data will not only be used to design the first diagnostic markers for MAS in blackberry but will serve as a foundational training dataset for genomic selection to achieve gains in heritable traits with few to no high-impact QTL, such as soluble solid content (SSC) and fruit weight.
Olivia Angelin-Bonnet1, Susan Thomson2, Patrick J. Biggs1, Matthieu Vignes1, and Samantha Baldwin2. 1Massey University, Palmerston North, NZ. 2Plant and Food Research, Lincoln, NZ.
Tuber bruising of tetraploid potato is an important quality trait as it affects the appearance and flavor of the tubers and thus impacts their fitness for sale. The development of potato lines that are more resistant to bruising is therefore a desirable objective for breeding programs, rendering the genetic analysis of this trait an important task. In this study, we investigated the biological mechanisms underlying tetraploid potato tuber bruising using multi-omics data. Genotype by sequencing using exon capture obtained from a breeding population of half-sibling families was used to uncover regions of interest for the bruising phenotype, as well as other agronomic traits of interest. In addition, we employed a Systems Biology approach to obtain a more holistic and comprehensive view of the molecular mechanisms involved in tuber bruising and bridge the gap between genetic variations and phenotype. To this end, RNA sequencing and metabolomics data were also obtained, and GWAS, differential expression analysis, and multi-omics data integration were leveraged in order to detect molecular features (i.e. genomic variants, genes, and metabolites) involved in tuber bruising. We demonstrate that even as capture sequencing only allows us to measure genetic variations in a subset of the genome, it is possible to uncover interesting and biologically meaningful genotype-phenotype associations, especially when combining the GWAS results with other omics datasets. Moreover, these associations were obtained with samples selected from a breeding program, demonstrating that available data from populations not specifically designed for association study can be used to uncover genomic regions potentially associated with a trait of interest.
Paterne Agre, Kumar Lava, Robert Asiedu, Asrat Asfaw. International Institute of Tropical Agriculture, Ibadan, Nigeria.
Anthracnose disease, caused by the fungus Colletotrichum gloeosporioides, is the primary cause of yield loss in water yam (Dioscorea alata), the widely cultivated species of yam. Resistance to yam anthracnose disease (YAD) is a prime target in breeding initiatives to develop cultivars with durable resistance for the sustainable management of the disease in water yam cultivation. This study aimed at tagging quantitative trait loci (QTL) for anthracnose disease resistance in a bi-parental mapping population of D. alata. Parent genotypes and their recombinant progenies were genotyped using the GBS platform and phenotyped in two crop cycles for two years. A high-density genetic linkage map was built with 3,184 polymorphic SNP markers well distributed across the genome covering 1460.94 cm total length. On average 163 SNP markers were mapped per chromosome with 0.58 genetic distance between SNPs. Four QTL regions related to yam anthracnose disease resistance were identified on 3 chromosomes. The proportion of phenotypic variance explained by these QTLs ranged from 29.54 to 39.40%. The QTL regions identified showed genes that code for known plant defense responses such as GDSL-like Lipase/Acylhydrolase, Protein kinase domain, and an F-box protein. The results from the present study provide valuable insight into the genetic architecture of anthracnose resistance in water yam. The candidate markers identified herewith form a relevant resource to apply marker-assisted selection as an alternative to a conventional labor-intensive screening for anthracnose resistance in water yam.
Ranjana Bhattacharjee1, Asrat Amele1, Jessica B. Lyons2, Jessen Bredeson2, Paul D. Fraser3, Lukas Muller4, Agre Paterne1, P. Lava Kumar1, Michael Abberton1, Patrick Adebola1, Daniel Rokhsar1, and Robert Asiedu1. 1International Institute of Tropical Agriculture, PMB 5320, Ibadan, Nigeria. 2University of California – Berkeley, California, USA. 3Royal Holloway University of London, London, UK. 4Boyce Thompson Institute, Cornell University, Ithaca, USA.
Advances in next-generation sequencing (NGS), genotyping technologies coupled with high-performance computation approaches, and partnerships with advanced research institutes have paved many possibilities to apply genomics tools to advance crop-breeding programs in several model and non-model plant species. This enabled the application of ‘genomics-assisted breeding’ or ‘whole-genome selection’. Breeding for yam improvement targets highly diverse biotic and abiotic constraints, whilst meeting complex end-user quality preferences to improve the livelihoods of beneficiaries in developing countries. Achieving breeding targets and increasing the rate of genetic gains in yam, with long breeding cycles, and genomes with high heterozygosity and different ploidy levels is challenging. However, a variety of genomic approaches has already been incorporated at various levels, including sequencing of draft genomes of some species, and application of NGS to genotype genetic resources (genebank accessions) and breeding lines for further exploitation using genomic approaches, such as genetic diversity, linkage mapping, and QTL (quantitative trait loci) analysis while pipelines are established to test GWAS (genome-wide association study) and GS (genomic selection). The use of GS will become more evident in yams as the tools become more efficient for polyploid genotypes, incorporating dominance and epistatic effects, as well as multi-location environmental effects, and progeny selection. This will need to be coupled with high-throughput phenotyping, knowledge from other -omics approaches (e.g., gene expression via transcriptomics, protein function via proteomics, and metabolic pathways via metabolomics), and a big data platform to allow the identification of molecular markers linked to complex traits, the dissection of genetic variability, identification of putative candidate genes, and their causative alleles for gene expression or gene function. The yam program can expand and benefit by bringing together a diverse range of disciplines and partners, and knowledge sharing from other root and tuber crops. The multi-species nature of the crop provides a huge platform to even learn from different species and expand.
Ao Jiao1, Sanjeev Gautam1, Jeewan Pandey1, Douglas C. Scheuring2, Jeffrey Koym1, and M. Isabel Vales1. 1Department of Horticultural Sciences, Texas A&M University, College Station, TX. 2Texas A&M AgriLife Research and Extension Center, Lubbock, TX.
Potato tuber dormancy is defined as the period after harvest during which tubers do not sprout even under favorable conditions. The length of the dormancy period is measured from vine kill until tubers start sprouting and it is affected by genotype and environmental factors. Premature dormancy break is a major factor causing post-harvest tuber quality reduction. Common methods to prevent sprouting include cold storage and the use of sprout inhibitors. Cold storage causes cold-induced sweetening and results in higher energy costs, whereas sprout inhibitors raise health and environmental concerns. Developing potato varieties with long dormancy could contribute to reducing the use of cold storage and sprout inhibitors. In this study, we evaluated tuber dormancy variation and investigated the genetic background of tuber dormancy. Over 200 clones from the Texas A&M Potato Breeding Program were grown in Dalhart, TX in 2019 and were evaluated for dormancy under room conditions (18 °C, RH 60%, dark). The clones exhibited variation in dormancy ranging from 38 to 155 days, with the Russets having significantly longer dormancy (> 96 days) than other market groups (70 – 80 days). Two Texas A&M varieties, Reveille Russet and Vanguard Russet were among the clones with the longest dormancy. A genome-wide association study was performed using GWASpoly with Infinium Illumina 22K V3 Potato Array to identify genomic regions associated with tuber dormancy. The main QTL identified was on chromosome 9, explaining 11% of the phenotypic variation. Follow-up evaluations will be conducted at additional locations under room temperature and cold storage.
Beatriz Tome Gouveia1, Brian M. Schwartz2, Yangi Wu3, Kevin E. Kenworthy4, Ambika Chandra5, Paul L. Raymer2, Marta T. Pudzianowska6, Esdras M. Carbajal1, Manual R. Chavarria Sanchez4, Jing Zhang2, Bradley M. Batterhsell3, Pamela S. Rowe, Meghyn B. Meeks5, Tianyi Wang5, Chase N. McKeithen4, and Susan R. Milla-Lewis1. 1North Carolina State University, Raleigh, NC. 2University of Georgia, Tifton, GA. 3Oklahoma State University, Stillwater, OK. 4University of Florida, Gainesville, FL. 5Texas A&M University, Dallas, TX. 6University of California, Riverside, CA.
Zoysiagrass (Zoysia Willd. species, 2n = 4x = 40), a complex that encompasses 11 species, is primarily used for home lawns, public parks, and athletic fields, being the second most used warm-season turfgrass on golf courses in the US. Water limitations are currently one of the biggest challenges for the turfgrass industry. Starting in 2010, a Turfgrass Specialty Crop Research Initiative (SCRI) project, funded by USDA-NIFA, has focused on addressing the problems of limited availability and reduced quality of water for irrigating turfgrass areas by breeding warm-season turfgrass species for improved drought and salinity tolerances. The objective of this study was to evaluate the performance of zoysiagrass breeding lines from the breeding programs at the University of Georgia – Tifton, and Griffin, North Carolina State University, Texas A&M University, and the University of Florida under drought conditions. Field trials arranged as randomized complete block designs with three replications were installed at research facilities at Citra, FL, and Stillwater, OK in the summer of 2020. The response variables evaluated were turfgrass quality under normal or non-drought conditions (TQND), and both percent green cover (PGC), evaluated using UAS, and turfgrass quality (NTEP ratings from 1 to 9) (TQD) under drought conditions. The genetic variance was significant and non-significant for all traits in the single-environment and multi-environment analysis, respectively. The genotype-by-environment interaction variance was significant for all traits. Heritability estimates were above 0.50 for all traits in all locations, except for TQD in Citra. A high positive correlation was observed between TQD and PGC in both locations, whereas these traits showed low correlation with TQND. Several breeding lines performed better than the checks for both TQD and PGC at both locations. Evaluation of these genotypes will continue through 2023. Results of this study will support the selection of drought-tolerant elite zoysiagrass genotypes with increased performance stability for the target regions.
Cesar A. Medina1, Harpreet Kaur2, Ian Ray2, and Long-Xi Yu1. 1United States Department of Agriculture-Agricultural Research Service, Plant Germplasm, Introduction and Testing Research, Prosser, WA. 2Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM.
Genomic selection (GS) is a variant of marker-assisted selection, in which genome-wide markers are used to determine the genomic estimated breeding value (GEBV) of individuals in a population to a specific trait. GS is useful in complex traits controlled by many genes with small effects. However, some complex traits such as biomass yield or abiotic stress tolerance have low prediction accuracy (measured as Pearson correlation between GEBV and phenotypic values). There is a need to increase the prediction accuracy to employ GS in breeding programs. In this work, we developed and tested an alternative GS model named weighted GBLUP (WGBLUP). We integrated DNA marker significant values of genome-wide association studies (GWAS) in WGBLUP analyses. We performed a case study using phenotypic data on biomass yield under salt stress of alfalfa and 13 phenotypic traits of potato to validate the WGBLUP model. This approach increased prediction accuracies from 50% to more than 80% for alfalfa yield under salt stress and up to 90% in potato tuber length. The use of the WGBLUP model will allow to implementation of GS in different breeding programs, increasing the selection accuracy in complex traits.
Christopher Benson1, David Huff1, Shaun Bushman2, Peter Maughan3, Rick Jeelen3, and Matthew Robbins2. 1Pennsylvania State University, State College, PA. 2USDA-ARS, Logan, UT. 3Brigham Young University, Provo, UT.
Poa annua (annual bluegrass, 2n=28, AABB) is an allotetraploid grass species that outperforms its diploid progenitors in both diversity of morphologies and geographic range. On golf course putting greens, Poa annua's ability to produce seed under 3mm mowing height has contributed to its discordant reputation as both a noxious weed and a valued commodity. Despite an estimated $40 billion U.S. turfgrass industry, there have been limited successful efforts directed at managing Poa annua, either for or against, primarily due to its complex genetic and epigenetic versatility. Here we present the pseudomolecule-level genome assemblies of Poa annua and its diploid progenitors, Poa infirma, and Poa supina. BRAKER2 annotations of these species with Iso-Seq RNA-evidence yielded 72,034, 37,207, and 35,698 high-confidence proteins, respectively. Both the assemblies and the annotations for all species contained >90% conserved orthologs, corroborating their quality. We demonstrate that the parental diploid genomes accurately represent the A and B subgenomes of Poa annua and characterize genetic exchange between and within the subgenomes of Poa annua. We show that the bifurcating ecological niches of the parents is mirrored by genomic and structural mutations in their diploid genomes. The subgenomes of Poa annua bear a strong resemblance to its progenitors, confirming its status as a neo-allotetraploid. We speculate that Poa annua's global proliferation is conferred through the union of two parental genomes with wide genetic distances for hybridization and contrasting ecological ranges. We plan to incorporate genomic and transcriptomic resources to aid in better targeting Poa annua in turfgrass breeding applications.
Felipe Bitencourt Martins1, Aline Costa Lima Moraes1, Alexandre Hild Aono1, Rebecca Caroline Ulbricht Ferreira1, Lucimara Chiari2, Rosangela Maria Simeão2, Sanzio Carvalho Lima Barrios2, Mateus Figueiredo Santos2, Liana Jank2, Cacilda Borges do Valle2, Bianca Baccili Zanotto Vigna2, Anete Pereira de Souza1. 1University of Campinas, Sao Paulo, Brazil. 2Embrapa Gado de Corte, Mato Grosso do Sul, Brazil.
Artificial hybridization plays a fundamental role in plant breeding programs since it generates new genotypic combinations that can result in desirable phenotypes. Depending on the species and mode of reproduction, controlled crosses may be challenging, and contaminating individuals can be introduced accidentally. In this context, the identification of such contaminants is important to avoid compromising further selection cycles, as well as genetic and genomic studies. The main objective of this work was to propose an automated multivariate methodology for the detection and classification of putative contaminants, including apomictic clones, self-fertilized individuals, half-siblings, and full contaminants, in biparental polyploid progenies of tropical forage grasses. We established a pipeline to identify contaminants in genotyping-by-sequencing (GBS) data encoded as allele dosages of single nucleotide polymorphism (SNP) markers by integrating principal component analysis (PCA), genotypic analysis (GA) measures based on Mendelian segregation and clustering analysis (CA). The combination of these methods allowed the correct identification of all contaminants in all simulated progenies (n=200) with more than 690 markers and the detection of putative contaminants in three real progenies of tropical forage grasses, providing an easy and promising methodology for the identification of contaminants in biparental progenies of tetraploid and hexaploid forages or other species. The proposed pipeline was made available through the polyCID Shiny app, which was developed in R language with a user-friendly interface designed to facilitate its use by plant breeders. Furthermore, it can be easily coupled with traditional genetic approaches, such as linkage map construction and other SNP-based techniques, thereby increasing the efficiency of breeding programs.
Haramrit Gill, Jeekin Lau, Qiuyi Fu, Natalie Anderson, David H. Byrne, and Oscar Riera-Lizarazu. Texas A&M University, College Station, TX.
Flower color is one of the most important breeding traits in ornamental roses. The combination of particular anthocyanins, their co-factors and their concentrations leads to different pigmentation patterns. We observed an interesting characteristic that we call ‘flower color transition’ in two tetraploid rose populations a) ‘Stormy Weather’ (SW) X ‘Brite Eyes’ (BE), b) ‘My Girl’ (MG) X ‘Brite Eyes’ (BE). The roses that exhibit this phenotype have flowers that transition from a light-yellow color to a dark pink/red (accumulation of anthocyanins) as the flower ages leading to bushes peppered with flowers of multiple colors. To our knowledge, the genetic control of this phenotype has not been studied in roses previously. Here, we present studies to better understand the inheritance of this trait and to identify quantitative trait loci (QTL) in two tetraploid bi-parental populations segregating for the ‘flower color transition’ trait. Our analysis suggests the presence of QTL on chromosomes 3 and 4. The location of QTL identified for the flower color transition coincides with the location of some genes involved in flavonoid biosynthetic pathway. Additional studies are underway to validate these results.
Ira A. Herniter and Nicholi Vorsa. Rutgers University, New Brunswick, NJ.
Blueberry (Vaccinium sect Cyanoccocus) is an increasingly important fruit crop native to North America. Collected from the wild for thousands of years, blueberry was only domesticated in the early twentieth century. While blueberry has long been a minor crop, recently interest in its health properties, including high levels of anthocyanins, has led to increased consumption and cultivation around the globe. However, quality genetic resources for blueberry have been lacking, greatly hampering the development of new varieties which can produce well under the constraints of changing climatic and pest regimes. We report the development of several populations being used for trait mapping. Four are diploid interspecific populations resulting from crosses between V. corymbosum, native to temperate climate, and V. darrowi, native to a subtropical climate, as well as a large germplasm collection consisting of released blueberry cultivars and wild relatives, including species from across North America, as well as species from Europe, Pacific Russia, and South America. The populations have great diversity in a range of traits, including leaf shape, berry size, fruit chemistry, and flowering time, among others. The populations are an important resource for the development of new varieties and increasing understanding of blueberry physiology.
Jeekin Lau, Cristiane H. Taniguti, David Byrne, and Oscar Riera-Lizarazu. Texas A&M University, College Station, TX.
Since genotyping with the Axiom WagRhSNP68K SNP array can be cost prohibitive, we explored an approach that would permit robust genotyping of samples in one or two 96-well plates. We have observed that genotyping accuracy via SNP arrays increases as the number of individuals used for genotype calling increases. We reasoned that this increased accuracy may be due to greater sample size and allelic diversity. To test this idea, we conducted an experiment where one bi-parental mapping population of 94 individuals plus two parents were clustered alone (one plate of genotyping) and in combinations with sets of related biparental populations and unrelated germplasm with increasing numbers and various levels of genetic diversity. We then compared both marker statistics and the linkage map quality generated from genotype calls of the target mapping population using the various datasets. As the number of individuals used in clustering increased, the number of useful markers increased nominally. However, the resulting linkage maps revealed that the addition of other genotypes in the marker clustering step resulted in shorter total map length and smaller gap sizes as the number of individuals and diversity increased. The decreased map lengths and gap sizes indicate that the inclusion of other genotypes helped genotyping accuracy. The output of this study will be a core set of genotyped rose germplasm that may be used to improve genotype calling of small samples of genotyped materials.
Jeewan Pandey1, Douglas C. Scheuring1, Jeffrey W. Koym2, and Maria Isabel Vales1. 1Department of Horticultural Sciences, Texas A&M University, College Station, TX. 2Texas A&M AgriLife Research and Extension Center, Lubbock, TX.
The current breeding process to develop a new potato cultivar takes a long time (10–15 years). One way to speed up the process and make it more efficient is to shorten the recurrent selection breeding cycle. This can be achieved by assigning breeding values to clones in the breeding pipeline and bringing those with the most favorable breeding values as parents for the crossing block. The aim of this study was to implement one angle of genomic selection by obtaining breeding values of chipping potato clones and recommend parents for the breeding program. Five hundred and forty-nine unique chipping potato clones were evaluated between 2017 and 2020 near Dalhart, TX, and genotyped using the Illumina Infinium Potato SNP array. Genomic-estimated breeding values (GEBVs) for chip color, chip quality, specific gravity, and total yield were obtained using the package StageWise. Potato clones with the most favorable GEBVs were identified and recommended as parents. The mean reliability of the GEBVs obtained were 0.75, 0.43, 0.61, and 0.33 for chip color, chip quality, specific gravity, and total yield, respectively. Breeders will increase the probability of transferring useful traits from parents to their progeny by choosing parental lines with the most favorable GEBVs. In turn, progeny with the best GEBVs can be re-used as parents or advanced to become new varieties. Thanks to the development of software packages suitable for polyploid species, genomic selection in potatoes is becoming more feasible and attractive.
Jose Guillermo Chacon1, Marcelo Mollinari1, Bode A. Olukolu2, Zhao-Bang Zeng1, Gina E. Fernandez3. 1Department of Horticultural Science, North Carolina State University, Raleigh, NC. 2Department of Entomology and Plant Physiology, University of Tennessee, Knoxville, TN. 3Bioinformatics Research Center, North Carolina State University, Raleigh, NC.
The cultivated strawberry (Fragaria ×ananassa L.) is an allopolyploid (2n= 8x = 28) with a complex genomic composition that hindered genetic and genomic studies, as the similarity between subgenomes, introgressions from a dominant genome, and other issues complicate accurate mapping and variant calling. The recent publication of a full chromosome length reference genome and improvements of deep sequencing for polyploids facilitated overcoming part of those difficulties, including the development of linkage maps. A biparental population was generated crossing the NCSU selections NCS 10-080 × NCS 10-147. The parents and 280 seedlings were sequenced using the reduced representation sequencing OmeSeq protocol resulting in 2.47 billion reads. The ngsComposer application was used for quality control, demultiplexing, and filtering, resulting in 1.84 billion reads. The read alignment resulted in a coverage of 92.32% of the four subgenomes in the allo-octoploid strawberry reference genome ‘Camarosa’ 1.0. The map construction was done using the MAPpoly R package. After a quality control screening, a total of 6133 markers and 212 offspring individuals were used to build a genetic linkage map comprised of 28 linkage groups with a total length of 3154 cM and 4022 SNP makers. The minimum linkage group size was 30.2 cM and 49 markers, and a maximum of 155.69 cM and 168 markers, with an average interval genetic distance of 1.32 cm. The high marker density and the correspondence between the number of assembled linkage groups and the number of expected chromosomes indicate that MAPpoly R is a robust analysis. This analysis provides an excellent framework map for forthcoming studies, including QTL analysis and understanding modes of inheritance in this complex polyploid species.
Lindsay V. Clark, Joyce Njuguna, Alexander E. Lipka, and Erik J. Sacks. Department of Crop Sciences, University of Illinois, Urbana-Champaign, Urbana, IL.
polyRAD is an R package for Bayesian genotype calling from sequence read depth in diploid and polyploid organisms. It can use population structure or mapping population design to inform genotype calls and can export discrete or continuous genotypes. Although the original version of polyRAD allowed inheritance mode to vary across the genome, it still required all individuals to be the same ploidy, limiting its use in staple crops such as banana and yam in which breeding populations typically consist of a mixture of ploidies. polyRAD 2.0 will support multiploidy, allowing simultaneous genotyping of individuals of different ploidies. The “possiblePloidies” slot will still be used to indicate potential inheritance modes for loci. A new slot called “taxaPloidy” contains one integer for each individual to indicate its ploidy, and acts as a multiplier for the values stored in “possiblePloidies”. Examples of how to code this information in various crops will be presented in the digital poster. We will also present Miscanthus sacchariflorus as a use case, in which introgression has occurred among diploid, triploid, and tetraploid populations. The development version of polyRAD 2.0 can be installed from GitHub.
Michael Miller and Laura Shannon. University of Minnesota, Saint Paul, MN.
Potato is an important crop to the global food system; however, adoption of new potato varieties has been slow compared to many other staple food crops. This is in part due to the importance of many traits beyond yield, in particular quality traits, to the marketability of potatoes. Quality traits are often measured using subjective and imprecise visual scales. These scales introduce error due to rater fatigue, rater experience, and differences in scale interpretation between raters. Visual scales also limit differentiation between potato clones which express a trait at similar but not identical levels. We have developed an image analysis platform in the R programming language to provide objective and precise numeric measurements of several potato tuber quality traits. Among these traits are measurements of tuber shape including length to width ratio, roundness, and compactness. Combining the phenotype data provided by this platform with the genomic selection tools provided by Tools for Polyploids could allow for robust genomic selection models of potato tuber shape quality traits. A collection of 82 chip market class potato clones from the University of Minnesota breeding program were evaluated for shape traits over 3 field seasons. Genotyping was preformed using the SolCAP SNP array. Allele dosage calling was performed using the fitPoly R package. The StageWise package was then used to create predictions of genomic estimated breeding values for tuber shape traits.
Neil O. Anderson1, Liesl Bower-Jernigan1, Rajmund Eperjesi1, Robert Suryani1, Steven Gullickson2, and Albert Radloff2. 1University of Minnesota, Minneapolis, MN. 2MGK, Minneapolis, MN.
Many species in the Chrysanthemum complex and its alliance of satellite genera (e.g. Arctanthemum, Leucanthemum, Tanacetum) have been bred and selected since the 15th century BCE. Crops include a diversity of uses and forms from green pesticides (pyrethrum, C. cinerariifolium, C. coccineum; 2n=2x=16, 2n=3x=24), edible shoots (C. carinatum, C. coronarium, C. segetum; 2n=2x=16), salt tolerance (C. arcticum, subsp. Arcticum, polaré; 2n=2x=16, 2n=4x=32), to ornamentals (cut flower, potted plant, garden types, C. xgrandiflorum, C. xhybridum; 2n=6x=54) also used for medicinal, herbal teas, and wine. The most widely cultivated crop, C. xgrandiflorum, is a complex perennial geophyte, an allohexaploid of >10 species (C. zawadskii, etc.), complicated by self incompatibility (3 S loci), pseudo-self compatibility (PSC), aneuploidy, sterility, inbreeding depression, and genetic load. Chrysanthemum populations in the University of Minnesota breeding program were analyzed for genetic structure within/among species and cultivar series to determine alliances within the ploidy complex. Genotypic analyses (GBS; DarTseqLD) was used to identify 389 low density, unique SNPs and determine genetic structure within and among wild (C. arcticum, subsp. Arcticum, polaré; C. zawadskii) and cultivated (C. cinerariifolium, C. xgrandiflorum ‘Minn’ series, C. xhybridum ‘MammothTM’ series) species populations. Principal Coordinates Analysis (PcoA) of all species showed two clusters of C. arcticum/C. cinerariifolium and C. xgrandiflorum/C. xhybridum/C. zawadskii with 74.9% diversity for the principal component (PcoA1) and 8.1% for PcoA2. Surprisingly, the first subgroup had C. cinerariifolium in close dalliance with C.a. subsp. Polaré (Nome, AK), followed by C.a. subsp. Arcticum (Aleutian Islands), C. arcticum (Anchor Point, Kenai, Ninilchik, Valdez, AK) with C.a. subsp. Polaré (Churchill, Manitoba, Canada). Chrysanthemum xgrandiflorum had the greatest level of genetic diversity, although it slightly overlapped with both C. zawadskii and C. xhybridum. The ‘Minn’ and ‘MammothTM’ series had low levels of genetic differentiation due to C. xhybridum being derived from C. xgrandiflorum and C. weyrichii (2n=6x=54). Future research will focus on phenotypic trait/SNP associations in a genome-wide association study (GWAS) to aid in marker-assisted selection.
Sanjeev Gautam and M. Isabel Vales. Department of Horticultural Sciences, Texas A&M University, College Station, TX. Texas A&M AgriLife Research and Extension Center, Lubbock, TX.
Heat stress reduces marketable tuber yield and quality of potatoes. Tuber defects can be external (heat sprouts, chained tubers, knobs) or internal (vascular discoloration, internal heat necrosis). Successful cultivation of potatoes under heat stress requires planting heat-tolerant varieties that can produce high yields of marketable tubers. Heat tolerance is a complex trait and breeding for it possesses several bottlenecks. To facilitate future marker-assisted selection for heat tolerance, a genome-wide association study (GWAS) aimed to identify genomic regions associated with heat tolerance was conducted. Phenotyping for a panel of 217 diverse potato genotypes was conducted near Springlake, Texas (heat stress location) for two years using a randomized complete block design with two replicates. The genotypes differed in their capacity of expressing the external as well as internal defects on tubers under heat stress. GWAS was conducted using GWASpoly with Infinium Illumina 22 K V3 Potato Array. Significantly associated SNPs with external defect traits were located on chromosomes 3, 4, 6, and 7 while those with internal defect traits were located on chromosomes 3 and 10. The identified genomic regions may be important to improve heat tolerance in potatoes. Fine mapping of identified regions and validation of the markers associated with these regions would be required to further understand the mechanism involved in heat tolerance.
Simon Fraher1, Tanner Schwarz2, Bonny Oloka3, Ken Pecota1, Chris Heim1, Adrienne Gorny2, G. Craig Yencho1. 1Department of Horticultural Science, North Carolina State University, Raleigh, NC. 2Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC. 3Root Crops Program, National Agricultural Research Organization, Entebbe, Uganda.
Due to its highly heterozygous hexaploid nature, sweetpotato (I. batatas) lags behind other crops in terms of genomic tools. New tools and strategies have afforded opportunities to associate sequence data with phenotypic data through QTL analysis, especially the publication of a diploid reference genome for I. trifida in 2018, and open-source software packages like QTLpoly and MAPpoly. QTL analysis is expected to be instrumental in accelerating breeding in this crop, notably for nematode resistance and nutritional factors. We analyzed the progeny of the sweetpotato bi-parental mapping population ‘Tanzania’ x ‘Beauregard’ (TB) representing 250 genotypes (including 4 check lines) for resistance to the emergent plant-parasitic nematode, Meloidogyne enterolobii (M.e.). Parent ‘Tanzania’ is highly resistant, and parent ‘Beauregard’ is highly susceptible. Bioassays showed clear bimodal segregation for resistance, suggesting a simplex major allele conferring resistance. Using the R package QTLpoly and the I. trifida reference genome, we discovered a major QTL peak at base pair 7,039,636 (79.21cM) of linkage group 4 of I. batatas associated with resistance to M.e. This analysis suggests variability in M.e. resistance within the TB population can largely be ascribed to genetic differences amongst the progenies (h² =66.9%). The next steps will search for flanking markers associated with these genotypes and attempt to identify markers that can be screened in the seedling stage, reducing the need for costly and laborious bioassays with this quarantined pest.
Tessa Hochhaus, Cristiane H. Taniguti, Jeekin Lau, Patricia E. Klein, David H. Byrne, and Osar Riera-Lizarazu. Department of Horticultural Sciences, Texas A&M University, College Station, TX.
Roses are highly heterozygous and most commonly diploids, triploids, and tetraploids. Genotyping by sequencing (GBS) has been performed in diploid rose populations, however, it has not been done in populations with higher ploidy because of their increased complexity (autopolyploidy). This complexity is due to the greater number of genotypic classes and the difficulty in accurately calling allele dosage. GBS uses restriction enzymes to reduce genome complexity and adapter barcodes to allow the pooling of multiple samples to increase efficiency and to lower the sample cost. In this study, we are optimizing a GBS protocol for tetraploid roses using three populations (Morden Blush x George Vancouver, Stormy Weather x Brite Eyes, and Brite Eyes x My Girl). The optimization will entail varying sequencing read depth and coverage while minimizing missing data, and using in-house workflows to test various combinations of open-source software for quality control, alignment of reads, identifying SNPs, and dosage calling. Through the development of this pipeline, we hope to facilitate cost-effective genotyping in polyploid roses and the use of genomic-assisted breeding.Back to top