Persone Apulia Research Gate

A large and complex structural polymorphism at 16p12.1 underlies microdeletion disease risk

There is a complex relationship between the evolution of segmental duplications and rearrangements associated with human disease. We performed a detailed analysis of one region on chromosome 16p12.1 associated with neurocognitive disease and identified one of the largest structural inconsistencies in the human reference assembly. Various genomic analyses show that all examined humans are homozygously inverted relative to the reference genome for a 1.1-Mb region on 16p12.1. We determined that this assembly discrepancy stems from two common structural configurations with worldwide frequencies of 17.6% (S1) and 82.4% (S2). This polymorphism arose from the rapid integration of segmental duplications, precipitating two local inversions within the human lineage over the last 10 million years. The two human haplotypes differ by 333 kb of additional duplicated sequence present in S2 but not in S1. Notably, we show that the S2 configuration harbors directly oriented duplications, specifically predisposing this chromosome to disease-associated rearrangement.

A recurrent 16p12.1 microdeletion supports a two-hit model for severe developmental delay.

We report the identification of a recurrent, 520-kb 16p12.1 microdeletion associated with childhood developmental delay. The microdeletion was detected in 20 of 11,873 cases compared with 2 of 8,540 controls (P = 0.0009, OR = 7.2) and replicated in a second series of 22 of 9,254 cases compared with 6 of 6,299 controls (P = 0.028, OR = 2.5). Most deletions were inherited, with carrier parents more likely to manifest neuropsychiatric phenotypes than non-carrier parents (P = 0.037, OR = 6). Probands were more likely to carry an additional large copy-number variant when compared to matched controls (10 of 42 cases, P = 5.7 × 10⁻⁵, OR = 6.6). The clinical features of individuals with two mutations were distinct from and/or more severe than those of individuals carrying only the co-occurring mutation. Our data support a two-hit model in which the 16p12.1 microdeletion both predisposes to neuropsychiatric phenotypes as a single event and exacerbates neurodevelopmental phenotypes in association with other large deletions or duplications. Analysis of other microdeletions with variable expressivity indicates that this two-hit model might be more generally applicable to neuropsychiatric disease.
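
The odds ratios quoted in this abstract can be reproduced from the carrier counts alone. The sketch below assumes they are plain cross-product ratios from a 2×2 table of carriers and non-carriers; the abstract does not say how they were calculated, and the function name `odds_ratio` is illustrative only.

```python
def odds_ratio(case_carriers, case_total, control_carriers, control_total):
    """Cross-product odds ratio for carrying a variant in cases versus controls.

    Assumption: a simple 2x2 table with no continuity correction.
    """
    case_noncarriers = case_total - case_carriers
    control_noncarriers = control_total - control_carriers
    if case_noncarriers == 0 or control_carriers == 0:
        raise ValueError("odds ratio is undefined when a denominator cell is zero")
    return (case_carriers * control_noncarriers) / (case_noncarriers * control_carriers)


# Counts taken from the abstract: discovery and replication series.
discovery = odds_ratio(20, 11873, 2, 8540)
replication = odds_ratio(22, 9254, 6, 6299)
print(f"discovery OR = {discovery:.1f}")
print(f"replication OR = {replication:.1f}")
```

Rounded to one decimal place, both values agree with the reported OR = 7.2 and OR = 2.5. The P values are not recomputed here because the statistical test used is not named in the abstract.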

Autosomal Dominant Familial Dyskinesia and Facial Myokymia: Single Exome Sequencing Identifies a Mutation in Adenylyl Cyclase 5

Background: Familial dyskinesia with facial myokymia (FDFM) is an autosomal dominant disorder that is exacerbated by anxiety. In a 5-generation family of German ancestry, we previously mapped FDFM to chromosome band 3p21-3q21. The 72.5-Mb linkage region was too large for traditional positional mutation identification. Objective: To identify the gene responsible for FDFM by exome resequencing of a single affected individual. Participants: We performed whole exome sequencing in 1 affected individual and used a series of bioinformatic filters, including functional significance and presence in dbSNP or the 1000 Genomes Project, to reduce the number of candidate variants. Co-segregation analysis was performed in 15 additional individuals in 3 generations. Main Outcome Measures: Unique DNA variants in the linkage region that co-segregate with FDFM. Results: The exome contained 23 428 single-nucleotide variants, of which 9391 were missense, nonsense, or splice site alterations. The critical region contained 323 variants, 5 of which were not present in 1 of the sequence databases. Adenylyl cyclase 5 (ADCY5) was the only gene in which the variant (c.2176G>A) was co-transmitted perfectly with disease status and was not present in 3510 control white exomes. This residue is highly conserved, and the change is nonconservative and predicted to be damaging. Conclusions: ADCY5 is highly expressed in striatum. Mice deficient in Adcy5 develop a movement disorder that is worsened by stress. We conclude that FDFM likely results from a missense mutation in ADCY5. This study demonstrates the power of a single exome sequence combined with linkage information to identify causative genes for rare autosomal dominant mendelian diseases.

Characterization of missing human genome sequences and copy-number polymorphic insertions

The extent of human genomic structural variation suggests that there must be portions of the genome yet to be discovered, annotated and characterized at the sequence level. We present a resource and analysis of 2,363 new insertion sequences corresponding to 720 genomic loci. We found that a substantial fraction of these sequences are either missing, fragmented or misassigned when compared to recent de novo sequence assemblies from short-read next-generation sequence data. We determined that 18-37% of these new insertions are copy-number polymorphic, including loci that show extensive population stratification among Europeans, Asians and Africans. Complete sequencing of 156 of these insertions identified new exons and conserved noncoding sequences not yet represented in the reference genome. We developed a method to accurately genotype these new insertions by mapping next-generation sequencing datasets to the breakpoint, thereby providing a means to characterize copy-number status for regions previously inaccessible to single-nucleotide polymorphism microarrays.

Copy Number Variation Analysis in Single-Suture Craniosynostosis: Multiple Rare Variants Including RUNX2 Duplication in Two Cousins With Metopic Craniosynostosis

Little is known about genes that underlie isolated single-suture craniosynostosis. In this study, we hypothesize that rare copy number variants (CNV) in patients with isolated single-suture craniosynostosis contain genes important for cranial development. Using whole genome array comparative genomic hybridization (CGH), we evaluated DNA from 186 individuals with single-suture craniosynostosis for submicroscopic deletions and duplications. We identified a 1.1 Mb duplication encompassing RUNX2 in two affected cousins with metopic synostosis and hypodontia. Given that RUNX2 is required as a master switch for osteoblast differentiation and interacts with TWIST1, mutations in which also cause craniosynostosis, we conclude that the duplication in this family is pathogenic, albeit with reduced penetrance. In addition, we find that a total of 7.5% of individuals with single-suture synostosis in our series have at least one rare deletion or duplication that contains genes and that has not been previously reported in unaffected individuals. The genes within and disrupted by CNVs in this cohort are potential novel candidate genes for craniosynostosis. © 2010 Wiley-Liss, Inc.

Discovery of large genomic inversions using long range information. (*Last two authors contributed equally as corresponding authors)

Diversity of Human Copy Number Variation and Multicopy Genes.

Copy number variants affect both disease and normal phenotypic variation, but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrated accurate estimation of absolute copy number for duplications as small as 1.9 kilobase pairs, ranging from 0 to 48 copies. We identified 4.1 million "singly unique nucleotide" positions informative in distinguishing specific copies and used them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association.
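
The two ideas in this abstract, copy number from read depth and positions that tell paralogs apart, can be illustrated with a toy sketch. It is not the authors' pipeline. It assumes copy number scales linearly with mean depth relative to a control region of known copy number, and it treats a "singly unique nucleotide" as an aligned position where one copy carries a base found in no other copy. All function names, sequences and depths are hypothetical.

```python
def estimate_copy_number(region_depth, control_depth, control_copies=2):
    """Scale a region's mean read depth by that of a control region of known copy number."""
    if control_depth <= 0:
        raise ValueError("control_depth must be positive")
    return control_copies * region_depth / control_depth


def singly_unique_positions(paralogs):
    """Map each paralog name to the aligned positions where its base differs from every other copy.

    `paralogs` is a dict of name -> sequence; sequences must be pre-aligned and of equal length.
    """
    lengths = {len(seq) for seq in paralogs.values()}
    if len(lengths) != 1:
        raise ValueError("sequences must be aligned to the same length")
    unique = {name: [] for name in paralogs}
    for pos in range(lengths.pop()):
        column = {name: seq[pos] for name, seq in paralogs.items()}
        for name, base in column.items():
            other_bases = [b for other, b in column.items() if other != name]
            if base not in other_bases:
                unique[name].append(pos)
    return unique


# Toy data: three aligned copies of a duplicated segment (made up for illustration).
copies = {"copy_A": "ACGTAC", "copy_B": "ACGTTC", "copy_C": "ACCTAC"}
print(singly_unique_positions(copies))
print(estimate_copy_number(region_depth=90.0, control_depth=30.0))
```

In this toy alignment only `copy_B` and `copy_C` carry a distinguishing position, so reads overlapping those positions could be assigned to a specific copy, while depth over the whole segment gives the aggregate copy number.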

Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility

Evolution and diversity of copy number variation in the great ape lineage

Copy number variation (CNV) contributes to the genetic basis of disease and has significantly restructured the genomes of humans and great apes. The diversity and rate of this process, however, have not been extensively explored among the great ape lineages. We analyzed 97 deeply sequenced great ape and human genomes and estimate that 16% (469 Mbp) of the hominid genome has been affected by recent copy number changes. We identify a comprehensive set of fixed gene deletion (n = 340) and duplication (n = 405) events as well as more than 13.5 Mbp of genomic sequence that has been specifically lost on the human lineage over the last 16 million years of evolution. We compared the diversity and rates of copy number and single nucleotide variation across different time points of the hominid phylogeny. We find that CNV diversity partially correlates with single nucleotide polymorphism diversity (r² = 0.5) and recapitulates the phylogeny of apes with few exceptions. Duplications significantly outpace deletions (2.8-fold), especially along ancestral African great ape branches. The load of segregating duplications remains significantly higher in bonobos, Western chimpanzees, and Sumatran orangutans, populations that have experienced recent genetic bottlenecks (P = 0.0014, 0.02 and 0.0088, respectively). We find that the rate of fixed deletion has been more clocklike with the exception of the chimpanzee lineage, where we observe a twofold increase in the chimpanzee-bonobo ancestor (P = 4.79 × 10⁻⁹) and evidence of increased deletion load among Western chimpanzees (P = 0.002). The latter includes the first evidence of a genomic disorder in a chimpanzee with features resembling Smith-Magenis syndrome mediated by a chimpanzee-specific increase in segmental duplication complexity. We hypothesize that demographic effects, such as bottlenecks, have contributed to larger and more gene-rich segments being deleted in the chimpanzee lineage and that this effect, more generally, may account for episodic bursts in CNV during hominid evolution.

Evolution of Human-Specific Neural SRGAP2 Genes by Incomplete Segmental Duplication.

Gene duplication is an important source of phenotypic change and adaptive evolution. We leverage a haploid hydatidiform mole to identify highly identical sequences missing from the reference genome, confirming that the cortical development gene Slit-Robo RhoGTPase-activating protein 2 (SRGAP2) duplicated three times exclusively in humans. We show that the promoter and first nine exons of SRGAP2 duplicated from 1q32.1 (SRGAP2A) to 1q21.1 (SRGAP2B) ~3.4 million years ago (mya). Two larger duplications later copied SRGAP2B to chromosome 1p12 (SRGAP2C) and to proximal 1q21.1 (SRGAP2D) ~2.4 and ~1 mya, respectively. Sequence and expression analyses show that SRGAP2C is the most likely duplicate to encode a functional protein and is among the most fixed human-specific duplicate genes. Our data suggest a mechanism where incomplete duplication created a novel gene function (antagonizing parental SRGAP2 function) immediately "at birth" 2-3 mya, which is a time corresponding to the transition from Australopithecus to Homo and the beginning of neocortex expansion.

Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog, in contrast to elephant and armadillo, which showed high centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.

Hominoid fission of chromosome 14/15 and the role of segmental duplications

Ape chromosomes homologous to human chromosomes 14 and 15 were generated by a fission event of an ancestral submetacentric chromosome, where the two chromosomes were joined head-to-tail. The hominoid ancestral chromosome most closely resembles the macaque chromosome 7. In this work, we provide insights into the evolution of human chromosomes 14 and 15, performing a comparative study between macaque boundary region 14/15 and the orthologous human regions. We construct a 1.6-Mb contig of macaque BAC clones in the region orthologous to the ancestral hominoid fission site and use it to define the structural changes that occurred on human 14q pericentromeric and 15q subtelomeric regions. We characterize the novel euchromatin-heterochromatin transition region (∼20 Mb) acquired during the neocentromere establishment on chromosome 14, and find it was mainly derived through pericentromeric duplications from ancestral hominoid chromosomes homologous to human 2q14-qter and 10. Further, we show a relationship between evolutionary hotspots and low-copy repeat loci for chromosome 15, revealing a possible role of segmental duplications not only in mediating but also in "stitching" together rearrangement breakpoints.

Identification of a novel recurrent 1q42.2-1qter deletion in high risk MYCN single copy 11q deleted neuroblastomas

Neuroblastoma is an aggressive embryonal tumor that accounts for ~15% of childhood cancer deaths. Hitherto, despite the availability of comprehensive genomic data on DNA copy number changes in neuroblastoma, relatively little is known about the genes driving neuroblastoma tumorigenesis. In this study, high resolution array comparative genome hybridization (CGH) was performed on 188 primary neuroblastoma tumors and 33 neuroblastoma cell lines to search for previously undetected recurrent DNA copy number gains and losses. A new recurrent distal chromosome 1q deletion (del(1)(q42.2qter)) was detected in seven cases. Further analysis of available array CGH datasets revealed 13 additional similar distal 1q deletions. The majority of all detected 1q deletions was found in high risk 11q deleted tumors without MYCN amplification (Fisher exact test p = 5.61 × 10⁻⁵). Using ultra-high resolution (~115 bp resolution) custom arrays covering the breakpoints on 1q for 11 samples, clustering of nine breakpoints was observed within a 12.5-kb region, of which eight were found in a 7-kb copy number variable region, whereas the remaining two breakpoints were colocated 1.4-Mb proximal. The commonly deleted region contains one miRNA (hsa-mir-1537), four transcribed ultra conserved region elements (uc.43-uc.46) and 130 protein coding genes including at least two bona fide tumor suppressor genes, EGLN1 (or PHD2) and FH. This finding further contributes to the delineation of the genomic profile of aggressive neuroblastoma, offers perspectives for the identification of genes contributing to the disease phenotype and may be relevant in the light of assessment of response to new molecular treatments.

Inversion variants in human and primate genomes

Lineage-specific evolution of the vertebrate Otopetrin gene family revealed by comparative genomic analyses

Background: Mutations in the Otopetrin 1 gene (Otop1) in mice and fish produce an unusual bilateral vestibular pathology that involves the absence of otoconia without hearing impairment. The encoded protein, Otop1, is the only functionally characterized member of the Otopetrin Domain Protein (ODP) family; the extended sequence and structural preservation of ODP proteins in metazoans suggest a conserved functional role. Here, we use the tools of sequence- and cytogenetic-based comparative genomics to study the Otop1 and the Otop2-Otop3 genes and to establish their genomic context in 25 vertebrates. We extend our evolutionary study to include the gene mutated in Usher syndrome (USH) subtype 1G (Ush1g), both because of the head-to-tail clustering of Ush1g with Otop2 and because Otop1 and Ush1g mutations result in inner ear phenotypes. Results: We established that OTOP1 is the boundary gene of an inversion polymorphism on human chromosome 4p16 that originated in the common human-chimpanzee lineage more than 6 million years ago. Other lineage-specific evolutionary events included a three-fold expansion of the Otop genes in Xenopus tropicalis and of Ush1g in teleostei fish. The tight physical linkage between Otop2 and Ush1g is conserved in all vertebrates. To further understand the functional organization of the Ush1g-Otop2 locus, we deduced a putative map of binding sites for CCCTC-binding factor (CTCF), a mammalian insulator transcription factor, from genome-wide chromatin immunoprecipitation-sequencing (ChIP-seq) data in mouse and human embryonic stem (ES) cells combined with detection of CTCF-binding motifs. Conclusions: The results presented here clarify the evolutionary history of the vertebrate Otop and Ush1g families, and establish a framework for studying the possible interaction(s) of Ush1g and Otop in developmental pathways.

Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability

Rapid and accurate large-scale genotyping of duplicated genes and discovery of interlocus gene conversions

Reconstructing complex regions of genomes using long-read sequencing technology.

Resolving the complexity of the human genome using single-molecule sequencing

Structural diversity and African origin of the 17q21.31 inversion polymorphism. (*First two authors contributed equally to this work)

The 17q21.31 inversion polymorphism exists either as direct (H1) or inverted (H2) haplotypes with differential predispositions to disease and selection. We investigated its genetic diversity in 2,700 individuals, with an emphasis on African populations. We characterize eight structural haplotypes due to complex rearrangements that vary in size from 1.08-1.49 Mb and provide evidence for a 30-kb H1-H2 double recombination event. We show that recurrent partial duplications of the KANSL1 gene have occurred on both the H1 and H2 haplotypes and have risen to high frequency in European populations. We identify a likely ancestral H2 haplotype (H2') lacking these duplications that is enriched among African hunter-gatherer groups yet essentially absent from West African populations. Whereas H1 and H2 segmental duplications arose independently and before human migration out of Africa, they have reached high frequencies recently among Europeans, either because of extraordinary genetic drift or selective sweeps.

The birth of a human-specific neural gene by incomplete duplication and gene fusion

The evolution and population diversity of human-specific segmental duplications

Role

Organization

Department

Scientific Area

Scientific Disciplinary Sector

ERC Sector Level 1

ERC Sector Level 2

ERC Sector Level 3

23 PUBLICATIONS

0 PROJECTS

0 PATENTS

0 SPIN-OFFS