TY - JOUR
T1 - Near-complete Middle Eastern genomes refine autozygosity and enhance disease-causing and population-specific variant discovery
AU - Qatar Genome Program Research Consortium
AU - Consortium Lead Principal Investigators (in alphabetical order)
AU - Data Management and Computing Infrastructure group
AU - Applied Bioinformatics Core
AU - Sequencing and Genotyping group
AU - Biobank and Sample Preparation
AU - Qatar Genome Project Management
AU - Ghorbani, Mohammadmersad
AU - Moosa, Shabir
AU - Siddig, Zenab
AU - Farhad, Radi
AU - Naeem, Haroon
AU - Harvey, William T.
AU - Mastrorosa, Francesco Kumara
AU - Munson, Katherine M.
AU - Mohamad Razali, Rozaimi
AU - Aliyev, Elbay
AU - Diboun, Ilhame
AU - Abouelhassan, Rawan
AU - Tauro, Melissa
AU - Hassan, Sondoss
AU - Mathew, Rebecca
AU - Al Hashmi, Muna
AU - Mathew, Lisa S.
AU - Wang, Kun
AU - Salhab, Abdul Rahman
AU - Vempalli, Fazulur Rehaman
AU - El Khouly, Ahmed
AU - Tatari, Zohreh
AU - Suhre, Karsten
AU - Puthen, Jithesh V.
AU - Fakhro, Khalid
AU - Estivill, Xavier
AU - Chouchane, Lotfi
AU - Badii, Ramin
AU - Alshafai, Mashael
AU - Al-Khodor, Souhaila
AU - Albagha, Omar
AU - Al-Ali, Rashid
AU - Poolat, Shafeeq
AU - Pathare, Tushar
AU - Zaid, Tariq Abu
AU - Hamza, Mehshad
AU - Khatib, Mohammedhusen
AU - Saqri, Tariq Abu
AU - Temanni, Ramzi
AU - Almabrazi, Hakeem
AU - Syed, Najeeb
AU - Lorenz, Stephan
AU - Liu, Wei
AU - Afifi, Nahla
AU - Alkhayat, Eiman
AU - Qafoud, Fatima
AU - Fethnou, Eleni
AU - Althani, Asmaa
AU - Saad, Chadi
AU - Al-Sarraj, Yasser
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/5/5
Y1 - 2025/5/5
N2 - Advances in long-read sequencing have enabled routine complete assembly of human genomes, but much remains to be done to represent broader populations and show impact on disease-gene discovery. Here, we report highly accurate, near-complete and phased genomes from six Middle Eastern (ME) family trios (n = 18) with neurodevelopmental conditions, representing ancestries from Sudan, Jordan, Syria, Qatar and Afghanistan. These genomes revealed 42.2 Mb of new sequence (13.8% impacting known genes), 75 new HLA/KIR alleles and strong signals of inbreeding, with ROH covering up to one-third of chromosomes 6 and 12 in one individual. Using assembly-based variant calling, we identified 23 de novo and recessive variants as strong candidates for causing previously unresolved symptoms in the probands. The ME genomes revealed unique variation relative to existing references, showing enhanced mappability and variant calling. These results underscore the value of de novo assembly for disease variant discovery and the need for sampled ME-specific references to better characterize population-relevant variation.
AB - Advances in long-read sequencing have enabled routine complete assembly of human genomes, but much remains to be done to represent broader populations and show impact on disease-gene discovery. Here, we report highly accurate, near-complete and phased genomes from six Middle Eastern (ME) family trios (n = 18) with neurodevelopmental conditions, representing ancestries from Sudan, Jordan, Syria, Qatar and Afghanistan. These genomes revealed 42.2 Mb of new sequence (13.8% impacting known genes), 75 new HLA/KIR alleles and strong signals of inbreeding, with ROH covering up to one-third of chromosomes 6 and 12 in one individual. Using assembly-based variant calling, we identified 23 de novo and recessive variants as strong candidates for causing previously unresolved symptoms in the probands. The ME genomes revealed unique variation relative to existing references, showing enhanced mappability and variant calling. These results underscore the value of de novo assembly for disease variant discovery and the need for sampled ME-specific references to better characterize population-relevant variation.
KW - American-college
KW - Association
KW - History
KW - Homozygosity
KW - Joint consensus recommendation
KW - Medical genetics
KW - Sequence
KW - Standards
UR - https://www.scopus.com/pages/publications/105004360655
U2 - 10.1038/s41588-025-02173-7
DO - 10.1038/s41588-025-02173-7
M3 - Article
AN - SCOPUS:105004360655
SN - 1061-4036
VL - 57
SP - 1119
EP - 1131
JO - Nature Genetics
JF - Nature Genetics
IS - 5
M1 - 129
ER -