UCL logo

UCL Discovery

UCL home » Library Services » Electronic resources » UCL Discovery

11,670 whole-genome sequences representative of the Han Chinese population from the CONVERGE project

Cai, N; Bigdeli, TB; Kretzschmar, WW; Li, Y; Liang, J; Hu, J; Peterson, RE; ... Flint, J; + view all (2017) 11,670 whole-genome sequences representative of the Han Chinese population from the CONVERGE project. Scientific Data , 4 , Article 170011. 10.1038/sdata.2017.11. Green open access

[img]
Preview
Text
sdata201711.pdf - ["content_typename_Published version" not defined]

Download (635kB) | Preview

Abstract

The China, Oxford and Virginia Commonwealth University Experimental Research on Genetic Epidemiology (CONVERGE) project on Major Depressive Disorder (MDD) sequenced 11,670 female Han Chinese at low-coverage (1.7X), providing the first large-scale whole genome sequencing resource representative of the largest ethnic group in the world. Samples are collected from 58 hospitals from 23 provinces around China. We are able to call 22 million high quality single nucleotide polymorphisms (SNP) from the nuclear genome, representing the largest SNP call set from an East Asian population to date. We use these variants for imputation of genotypes across all samples, and this has allowed us to perform a successful genome wide association study (GWAS) on MDD. The utility of these data can be extended to studies of genetic ancestry in the Han Chinese and evolutionary genetics when integrated with data from other populations. Molecular phenotypes, such as copy number variations and structural variations can be detected, quantified and analysed in similar ways.

Type: Article
Title: 11,670 whole-genome sequences representative of the Han Chinese population from the CONVERGE project
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/sdata.2017.11
Publisher version: http://dx.doi.org/10.1038/sdata.2017.11
Language: English
Additional information: © 2017 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0 Metadata associated with this Data Descriptor is available at http://www.nature.com/sdata/ and is released under the CC0 waiver to maximize reuse.
UCL classification: UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: http://discovery.ucl.ac.uk/id/eprint/1542401
Downloads since deposit
110Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item