TY - JOUR
T1 - Analysis of 62 hybrid assembled human Y chromosomes exposes rapid structural changes and high rates of gene conversion
AU - Skov, Laurits
AU - Schierup, Mikkel Heide
AU - Danish Pan Genome Consortium
AU - Sørensen, Lasse Maretty
AU - Petersen, Bent
AU - Sibbesen, Jonas Andreas
AU - Liu, Siyang
AU - Belling, Kirstine G
AU - Have, Christian Theil
AU - Bork-Jensen, Jette
AU - Hansen, Torben
AU - Krogh, Anders
AU - Sørensen, Thorkild I.A.
AU - Pedersen, Oluf Borbye
AU - Rasmussen, Simon
AU - Kristiansen, Karsten
AU - Brunak, Søren
PY - 2017/8
Y1 - 2017/8
N2 - The human Y-chromosome does not recombine across its male-specific part and is therefore an excellent marker of human migrations. It also plays an important role in male fertility. However, its evolution is difficult to fully understand because of repetitive sequences, inverted repeats and the potentially large role of gene conversion. Here we perform an evolutionary analysis of 62 Y-chromosomes of Danish descent sequenced using a wide range of library insert sizes and high coverage, thus allowing large regions of these chromosomes to be well assembled. These include 17 father-son pairs, which we use to validate variation calling. Using a recent method that can integrate variants based on both mapping and de novo assembly, we genotype 10898 SNVs and 2903 indels (max length of 27241 bp) in our sample and show by father-son concordance and experimental validation that the non-recurrent SNP and indel variation on the Y chromosome tree is called very accurately. This includes variation called in a 0.9 Mb centromeric heterochromatic region, which is by far the most variable in the Y chromosome. Among the variation is also longer sequence-stretches not present in the reference genome but shared with the chimpanzee Y chromosome. We analyzed 2.7 Mb of large inverted repeats (palindromes) for variation patterns among the two palindrome arms and identified 603 mutation and 416 gene conversions events. We find clear evidence for GC-biased gene conversion in the palindromes (and a balancing AT mutation bias), but irrespective of this, also a strong bias towards gene conversion towards the ancestral state, suggesting that palindromic gene conversion may alleviate Muller's ratchet. Finally, we also find a large number of large-scale gene duplications and deletions in the palindromic regions (at least 24) and find that such events can consist of complex combinations of simultaneous insertions and deletions of long stretches of the Y chromosome.
AB - The human Y-chromosome does not recombine across its male-specific part and is therefore an excellent marker of human migrations. It also plays an important role in male fertility. However, its evolution is difficult to fully understand because of repetitive sequences, inverted repeats and the potentially large role of gene conversion. Here we perform an evolutionary analysis of 62 Y-chromosomes of Danish descent sequenced using a wide range of library insert sizes and high coverage, thus allowing large regions of these chromosomes to be well assembled. These include 17 father-son pairs, which we use to validate variation calling. Using a recent method that can integrate variants based on both mapping and de novo assembly, we genotype 10898 SNVs and 2903 indels (max length of 27241 bp) in our sample and show by father-son concordance and experimental validation that the non-recurrent SNP and indel variation on the Y chromosome tree is called very accurately. This includes variation called in a 0.9 Mb centromeric heterochromatic region, which is by far the most variable in the Y chromosome. Among the variation is also longer sequence-stretches not present in the reference genome but shared with the chimpanzee Y chromosome. We analyzed 2.7 Mb of large inverted repeats (palindromes) for variation patterns among the two palindrome arms and identified 603 mutation and 416 gene conversions events. We find clear evidence for GC-biased gene conversion in the palindromes (and a balancing AT mutation bias), but irrespective of this, also a strong bias towards gene conversion towards the ancestral state, suggesting that palindromic gene conversion may alleviate Muller's ratchet. Finally, we also find a large number of large-scale gene duplications and deletions in the palindromic regions (at least 24) and find that such events can consist of complex combinations of simultaneous insertions and deletions of long stretches of the Y chromosome.
KW - Chromosomes, Human, Y/genetics
KW - Denmark
KW - Evolution, Molecular
KW - Fathers
KW - Gene Conversion/genetics
KW - Heterochromatin/genetics
KW - Humans
KW - INDEL Mutation/genetics
KW - Infertility, Male/genetics
KW - Inverted Repeat Sequences/genetics
KW - Male
KW - Nuclear Family
KW - Phylogeny
KW - Polymorphism, Single Nucleotide
U2 - 10.1371/journal.pgen.1006834
DO - 10.1371/journal.pgen.1006834
M3 - Journal article
C2 - 28846694
SN - 1553-7390
VL - 13
SP - e1006834
JO - PLOS Genetics
JF - PLOS Genetics
IS - 8
ER -