Substantial differences in bias between signal-digest and double-digest RAD-seq libraries: A case study

Type of content
Journal Article
Thesis discipline
Degree name
Publisher
Journal Title
Journal ISSN
Volume Title
Language
Date
2018
Authors
Flanagan SP
Jones AG
Abstract

The tradeoffs of using single-digest versus double-digest restriction site associated DNA sequencing (RAD-seq) protocols have been widely discussed. However, no direct empirical comparisons of the two methods have been conducted. Here, we sampled a single population of Gulf pipefish (Syngnathus scovelli) and genotyped 444 individuals using RAD-seq. Sixty individuals were subjected to single-digest RAD-seq (sdRAD-seq), and the remaining 384 individuals were genotyped using a double-digest RAD-seq (ddRAD-seq) protocol. We analyzed the resulting Illumina sequencing data and compared the two genotyping methods when reads were analyzed either together or separately. Coverage statistics, observed heterozygosity, and allele frequencies differed significantly between the two protocols, as did the results of selection components analysis. We also performed an in silico digestion of the Gulf pipefish genome and modeled five major sources of bias: PCR duplicates, polymorphic restriction sites, shearing bias, asymmetric sampling (i.e., genotyping fewer individuals with sdRAD-seq than with ddRAD 26 seq), and higher major allele frequencies. This combination of approaches allowed us to determine that polymorphic restriction sites, an asymmetric sampling scheme, mean allele frequencies, and to some extent PCR duplicates all contribute to different estimates of allele frequencies between samples genotyped using sdRAD-seq versus ddRAD-seq. Our finding that sdRAD-seq and ddRAD-seq can result in different allele frequencies has implications for comparisons across studies and techniques that endeavor to identify genome-wide signatures of evolutionary processes in natural populations.

Description
Citation
Flanagan SP, Jones AG (2018). Substantial differences in bias between signal-digest and double-digest RAD-seq libraries: A case study. Molecular Ecology Resources. 18(2). 264-280.
Keywords
next-generation sequencing, selection components analysis, Syngnathus scovelli, 10 allele dropout, PCR duplicates
Ngā upoko tukutuku/Māori subject headings
ANZSRC fields of research
Fields of Research::31 - Biological sciences
Rights