Case Control Study Open Access
Copyright ©The Author(s) 2021. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Diabetes. Dec 15, 2021; 12(12): 2073-2086
Published online Dec 15, 2021. doi: 10.4239/wjd.v12.i12.2073
Genome-wide association study reveals novel loci for adult type 1 diabetes in a 5-year nested case-control study
Yan Gao, Zeng-Li Zhang, Department of Occupational and Environmental Health, School of Public Health, Medical College of Soochow University, Suzhou 215123, Jiangsu Province, China
Yan Gao, Yan Lu, Jun Zhang, Institute of Suzhou Biobank, Suzhou Center for Disease Prevention and Control, Suzhou 215004, Jiangsu Province, China
Shi Chen, Department of Public Health Sciences, University of North Carolina Charlotte, NC 28223, United States
Shi Chen, School of Data Science, University of North Carolina Charlotte, NC 28223, United States
Wen-Yong Gu, Department of Ultrasound, The Second Affiliated Hospital of Soochow University, Suzhou 215004, Jiangsu Province, China
Chen Fang, Department of Endocrinology, The Second Affiliated Hospital of Soochow University, Suzhou 215004, Jiangsu Province, China
Chen Fang, Department of Clinical Nutrition, The Second Affiliated Hospital of Soochow University, Suzhou 215004, Jiangsu Province, China
Yi-Ting Huang, Clinical Nutrition Department, The Second Affiliated Hospital of Soochow University, Suzhou 215004, Jiangsu Province, China
Yue Gao, Ming Xu, Department of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing 210009, Jiangsu Province, China
Yue Gao, Ming Xu, Public Health Research Institute of Jiangsu Province, Nanjing 210009, Jiangsu Province, China
Jian Su, Ming Wu, Department of Chronic Disease Prevention and Control, Jiangsu Provincial Center for Disease Control and Prevention, Nanjing 210009, Jiangsu Province, China
ORCID number: Yan Gao (0000-0002-2293-6083); Shi Chen (0000-0002-2316-111X); Wen-Yong Gu (0000-0001-9032-6175); Chen Fang (0000-0003-4329-9471); Yi-Ting Huang (0000-0002-2950-2413); Yue Gao (0000-0002-2641-782X); Yan Lu (0000-0003-1183-875X); Jian Su (0000-0002-4026-4823); Ming Wu (0000-0002-8798-0288); Jun Zhang (0000-0001-5272-9862); Ming Xu (0000-0001-8846-2880); Zeng-Li Zhang (0000-0002-0108-0263).
Author contributions: Xu M and Zhang ZL designed the present study; Gao Y, Gu WY, Fang C, Huang YT, Lu Y and Su J performed the sample and data collection; Gao Y, Chen S and Xu M finished the analysis of data; Chen S and Gao Y wrote the article; Wu M, Zhang J, Gao Y, Xu M and Wu M mainly supported the costing of GWAS analysis.
Supported by the National Science Foundation for Young Scientists of China (No. 81602919); the National Science Foundation for Young Scientists of China (No. 82070814); the Suzhou Science and Technology Development Plan (No. SYS2018099); and the 5th Suzhou Health Talent Program (No. GSWS2019071).
Institutional review board statement: This study was reviewed and approved by the Research Ethics Committee of Jiangsu Provincial Center for Disease Control and Prevention (No. 2012025).
Informed consent statement: All patients gave informed consent.
Conflict-of-interest statement: The authors declare no conflict of interest in this study.
Data sharing statement: Technical appendix, statistical code, and dataset are available from the corresponding author at zhangzengli@suda.edu.cn with the permission of government.
STROBE statement: The authors have read the STROBE statement—checklist of items, and the manuscript was prepared and revised according to the STROBE statement—checklist of items.
Open-Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Zeng-Li Zhang, MD, PhD, Academic Fellow, Chief Doctor, Department of Occupational and Environmental Health, School of Public Health, Medical College of Soochow University, No. 199 Ren’ai Road, Suzhou 215123, Jiangsu Province, China. zhangzengli@suda.edu.cn
Received: April 21, 2021
Peer-review started: April 21, 2021
First decision: July 15, 2021
Revised: August 3, 2021
Accepted: November 3, 2021
Article in press: October 31, 2021
Published online: December 15, 2021
Processing time: 238 Days and 14.1 Hours

Abstract
BACKGROUND

Type 1 diabetes (T1D) is a severe and prevalent metabolic disease. Due to its high heredity, an increasing number of genome-wide association studies have been performed, most of which were from hospital-based case-control studies with a relatively small sample size. The association of single nucleotide polymorphisms (SNPs) and T1D has been less studied and is less understood in natural cohorts.

AIM

To investigate the significant variants of T1D, which could be potential biomarkers for T1D prediction or even therapy.

METHODS

A genome-wide association study (GWAS) of adult T1D was performed in a nested case-control study (785 cases vs 804 controls) from a larger 5-year cohort study in Suzhou, China. Potential harmful or protective SNPs were evaluated for T1D. Subsequent expression and splicing quantitative trait loci (eQTL and sQTL) analyses were carried out to identify target genes modulated by these SNPs.

RESULTS

A harmful SNP for T1D, rs3117017 [odds ratio (OR) = 3.202, 95% confidence interval (CI): 2.296-4.466, P = 9.33 × 10-4] and three protective SNPs rs55846421 (0.113, 0.081-0.156, 1.76 × 10-9), rs75836320 (0.283, 0.205-0.392, 1.07 × 10-4), rs362071 (0.568, 0.495-0.651, 1.66 × 10-4) were identified. Twenty-two genes were further identified as potential candidates for T1D onset.

CONCLUSION

We identified a potential genetic basis of T1D, both protective and harmful, using a GWAS in a larger nested case-control study of a Chinese population.

Key Words: Type 1 diabetes; Genome-wide association study; Nested case-control study; Polymorphism

Core Tip: Type 1 diabetes (T1D) is a severe and prevalent metabolic disease. Due to its high heredity, an increasing number of genome-wide association studies have been performed, most of which were from hospital-based case-control studies with a relatively small sample size. The aim of this study was to investigate the significant variants of T1D, which could be potential biomarkers for T1D prediction or even therapy. The effects of different polymorphisms in Chinese T1D patients were determined in a healthy population cohort study. The results showed 4 novel variants highly associated with the onset of T1D, namely rs3117017, rs55846421, rs75836320, and rs362071.



INTRODUCTION

Diabetes mellitus, a prevalent endocrine system disease worldwide, is characterized by high blood glucose level and can be life-threatening. Diabetes can be pathologically classified as type 1 diabetes (T1D), type 2 diabetes (T2D), gestational diabetes and other unclassified diabetes[1]. According to the International Diabetes Federation (IDF), T1D accounts for approximately 5%-10% of all diabetes[2]. Globally, the incidence of T1D is increasing, with an overall annual increase of 2%-5%[3,4]. There is also substantial regional disparity, ranging from 0.5% in the Caribbean to 7.0% in Africa[5]. In the IDF 9th edition, the assumption of T1D prevalence to incidence ratio was 6.3 for countries with no available age-specific incidence rates[2]. In developed countries, severe burden in patients and the entire healthcare system are expected as the incidence of T1D is estimated to double every 20 years[6,7]. According to current research, T1D is associated with increased mortality in the population: Its standardized mortality ratio ranges from 0 to 854 in different countries[8]. This implies that T1D patients have 3- to 4-times higher mortality than the general population[4,9].

With regard to the autoimmune feature of T1D, insulin-producing β cells in the pancreas are destroyed by the body due to insufficient islet secretion instead of insulin resistance in T2D[10]. T1D pathobiology is multifactorial, which includes genetic factors, environmental factors, and their potential interactions[11]. It has been demonstrated that first degree relatives of T1D patients had an approximately 6% risk of T1D, which is 15-times higher than the risk in the general population[12]. This finding suggests that genetic factors can be strong determinants of T1D. In China, existing observational studies indicated that approximately two-thirds of new T1D cases were reported in adults over 20 years old. T1D onset in adults is not rare in the Chinese population[13]. However, the exact mechanism of T1D onset in adults remains unknown[14].

Single nucleotide polymorphisms (SNPs) are potential genetic factors of T1D. SNPs have been investigated in population studies of both Caucasian and Chinese populations[15-18]. SNPs of the human leucocyte antigen (HLA) class II gene are identified as major risk factors for T1D[15]. However, these studies are mostly hospital-based case-control studies, which are inevitably prone to sampling bias[19,20]. The profile of T1D-related SNPs in a larger population is rarely addressed or investigated. Consequently, sampling bias in hospital-based studies can have a substantial impact on the subsequent genome-wide association study (GWAS), and may lead to unreliable results.

In order to overcome the challenge due to sampling bias, we performed a GWAS of adult T1D in a nested case-control study from a large 5-year cohort study in Suzhou, China. We aim to comprehensively investigate and quantify the association between T1D and SNPs in the general population. In addition, we will identify novel potential genes that are regulated by the identified SNPs, and evaluate their roles in the pathogenesis of adult T1D.

MATERIALS AND METHODS
Participants, materials and methods

Ethical compliance: This study was reviewed and approved by the Research Ethics Committee of Jiangsu Provincial Center for Disease Control and Prevention (No. 2012025). The study was in full compliance with the Declaration of Helsinki.

Each participant, either in the case or control group, was informed of the nature of this study. Blood samples from the participants were collected after they signed the written informed consents.

Participants and background information on the study: A nested case-control study within a larger cohort study was performed. The original larger cohort included 3466780 residents in Suzhou, China. The complete demographic, clinical, and epidemiological data of the cohort were collected from the Suzhou medical and social insurance system between January 2012 and December 2018. To exclude latent autoimmune diabetes in adults, T2D and other types of diabetes, all corresponding medical records were cross validated by physicians at the First Affiliated Hospital of Soochow University and the Second Affiliated Hospital of Soochow University. Follow-up visits were performed at the Suzhou Center for Disease Prevention and Control, following the procedures recommended by Park et al[21].

According to the medical records, a total of 1088 patients were diagnosed with T1D during this period, of whom 966 were adult patients more than 20 years old at the time of diagnosis. Fifty-eight patients were further excluded in the pre-collection stage for the following reasons: (1) 2 patients passed away before blood sample collection; (2) 21 patients moved and were unable to be contacted; (3) 34 patients declined to participate in the follow-up investigation; and (4) 1 patient was diagnosed with systemic lupus erythematosus with severe hematopoietic dysfunction, and was not eligible for blood sample collection. The T1D patients were divided into the testing and validation groups (414 and 494 patients, respectively), according to their residence.

For GWAS analysis, routine quality control steps included removing SNPs with imputation quality (INFO) scores < 0.4, with minor allele frequency (MAF) < 0.05, and without a valid Hardy-Weinberg test result (P < 10-6). SNPs on mitochondrial DNA and sex chromosomes were also excluded. Five patients were removed from the group. In addition, we further excluded 118 participants with close familial relationship after checking their genetic relationships. Finally, 785 adult T1D patients were included in the study and formed the nested case group. The complete screening process and quality control of T1D patients enrolled in the study are shown in Figure 1.

Figure 1
Figure 1 Study workflow. T1D: Type 1 diabetes; GWAS: Genome-wide association study; CDC: Center for Disease Prevention and Control.

In the control group, a similar number of age-, gender- and residence-matched participants without metabolic system diseases were randomly selected. A total of 804 participants were further divided into the testing (377) and validation (427) groups.

Genotype imputation: Ungenotyped data were imputed using IMPUTE2 software (V2.3.2)[22]. 920636 SNPs were eventually filtered for the GWAS analysis, after excluding invalid imputed SNPs with INFO < 0.4. SHAPEIT V2 software was used to improve the imputation performance following the execution of IMPUTE2. The 1000 Genomes Project Phase III database (http://mathgen.stats.ox.ac.uk/impute/impute_v2.html) was used as the reference dataset to determine population bias.

Gene regulatory network construction: Gene ontology functional annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of host genes of the polymorphisms were carried out with the R package clusterProfiler (R version 3.6.0). The Bayesian algorithm was chosen to optimize the regulatory network model and to identify the most probable gene regulation pathway in diabetes.

STRING software (version 11.0) was used to analyze functional interaction networks of potential downstream regulated genes of potential SNPs.

In Silico bioinformatics analysis: Expression quantitative trait loci (eQTLs) in GTEx (http://gtexportal.org/home/) was used to assess the impacts of SNPs on T1D. QTL overlapping was applied when the SNPs had a high linkage disequilibrium (LD) (r2> 0.8) with the top QTL genes for SNPs.

Statistical analyses: The χ2 test was performed to evaluate gender differences in the nested case-control study. Age and body mass index (BMI) were compared using the student t-test. After adjusting for age, gender, and BMI, the R package GWAS tools was for statistical analyses. Following existing GWAS publications, P < 5 × 10-8 was set as the detection limit to account for Bonferroni adjustment in this study[23].

RESULTS
Characteristics of participants in the nested case-control study

Detailed demographic, clinical, and epidemiological characteristics of the 785 adult T1D cases and 804 control participants are provided in Table 1. There were no statistically significant differences in age, gender and BMI between the two groups in both the testing and validation stage of GWAS.

Table 1 Characters of participants in the present type 1 diabetes genome-wide association study investigation.
Discovery stage
Validation stage
Combination
Cases (n = 381)
Controls (n = 377)
P
Cases (n = 404)
Controls (n = 427)
P
Cases (n = 785)
Controls (n = 804)
P
Sex
Male1972000.711a2092230.887a4064230.722a
Female184177195204379381
Age (yr)
mean ± SD28.45 ± 7.7028.99 ± 8.010.338b28.53 ± 7.9528.91 ± 7.670.496b28.49 ± 7.5128.94 ± 7.980.248b
< 251091010.637a1181140.704a2272150.494a
[25, 35)180174188203368377
≥ 359210298110190212
BMI (kg/m2)
mean ± SD21.77 ± 4.1022.10 ± 4.000.266b21.93 ± 4.2621.52 ± 3.700.146b21.85 ± 4.1821.79 ± 3.850.773b
< 18.560460.371a51660.500a1111120.985a
[18.5, 25)270279303311573590
≥ 2551525050101102
Glucose level (mmol/L)
mean ± SD12.83 ± 2.355.43 ± 1.33< 0.0001b13.42 ± 2.485.53 ± 1.30< 0.0001b13.13 ± 2.445.49 ± 81.32< 0.0001b
Novel T1D-related SNPs from GWAS

Among all SNPs examined, four novel loci significantly associated with T1D were identified (Table 2): rs55846421 at 14q21 (PBonferroni-adjust = 4.28 × 10-11), rs3117017 at 6q13 (PBonferroni-adjust = 6.36 × 10-5), rs75836320 at 12q14 (PBonferroni-adjust = 0.011), and rs362071 at 22q11.2 (PBonferroni-adjust = 4.5 × 10-4). P-values of these four SNPs were significantly lower than the threshold of genome-wide significance (P-value after Bonferroni adjust should be less than 0.05, which usually means original P-value < 10-8). LD analysis further confirmed the independence of these SNPs. In the validation stage, the diabetic susceptibilities of these four loci between 404 adult T1D cases and 427 controls showed similar trends as in the testing stage. After combining participants in both stages, these four SNPs were still significantly associated with the onset of adult T1D: rs55846421 (PBonferroni-adjust = 1.76 × 10-9), rs3117017 (PBonferroni-adjust = 9.33 × 10-4), rs75836320 (PBonferroni-adjust = 1.07 × 10-4), and rs362071 (PBonferroni-adjust = 1.66 × 10-4).

Table 2 Identification of type 1 diabetes risk loci in a Chinese population.
SNPs
Location
Stage
Genotype distribution
ORadd (95%CI)
Pa for Bonferroni
Cases (n = 785)
Controls (n = 804)
rs55846421Chr14. 50310401Discovery10/149/2220/6/3710.024 (0.010-0.056)4.28 × 10-11
Validation8/124/2722/37/3850.232 (0.159-0.337)8.73 × 10-8
Combination18/273/4942/43/7560.113 (0.081-0.156)1.76 × 10-9
rs3117017Chr6. 33095275Discovery2/40/33936/117/2244.958 (3.107-7.912)6.36 × 10-5
Validation6/32/36532/98/2973.202 (2.296-4.466)4.51 × 10-4
Combination8/72/70468/215/5213.794 (2.984-4.823)9.33 × 10-4
rs75836320Chr12. 65156128Discovery8/77/2961/12/3560.153 (0.082-0.285)0.011
Validation8/63/3334/27/3960.429 (0.289-0.638)3.3 × 10-4
Combination16/140/6295/39/7520.283 (0.205-0.392)1.07 × 10-4
rs362071Chr22. 20811645Discovery82/193/10446/101/2300.445 (0.348-0.570)4.5 × 10-4
Validation100/173/13179/149/1990.705 (0.588-0.846)0.0002
Combination182/366/235125/250/4290.568 (0.495-0.651)1.66 × 10-4
eQTL and sQTL analyses of candidate SNPs

As there were no specific data for islet tissue or diabetes in the GTEx database, we comprehensively analyzed these SNPs in all types of tissues (Table 3). Of the four novel SNPs, rs55846421 is located on the intron of NEMF gene. However, based on our eQTL results, rs55846421 showed a potential causal effect on LRR1, RP11-596C23.6, RHOQP1, KLHDC1 and VCPKMT genes, instead of NEMF.

Table 3 The expression and splicing quantitative trait loci analysis of target single nucleotide polymorphisms.
SNP name
Influence type
Gene symbol
Expression tissues
P value
rs55846421eQTLsARF6Cells-cultured fibroblasts0.00015
KLHDC1Brain-cerebellum 0.0000022
Lung0.000034
Thyroid0.00021
LRR1Adipose-subcutaneous8.00 × 10-10
Adipose-visceral 5.20 × 10-7
Breast-mammary tissue8.70 × 10-7
Esophagus-muscularis 0.000054
Lung0.000077
Nerve-tibial 0.00013
Thyroid0.00016
RHOQP1Adipose-subcutaneous3.30 × 10-8
RP11-596C23.6Artery-tibial 1.1 × 10-9
Esophagus-mucosa 2.3 × 10-9
Artery-aorta 6.0 × 10-8
Testis2.9 × 10-7
Esophagus-muscularis0.0000019
Esophagus-gastroesophageal junction0.0000025
Muscle-skeletal 0.0000027
Small intestine-terminal ileum 0.0000042
Lung0.000019
Colon-sigmoid 0.00002
Spleen0.000024
Artery-coronary 0.000031
Thyroid0.000061
Heart-atrial appendage0.000062
sQTLsRPL36ALLiver0.0000063
Brain-nucleus accumbens0.0000071
rs3117017eQTLsCOL11A2Thyroid2.0 × 10-8
HCG24Brain-nucleus accumbens5.0 × 10-25
Brain-putamen6.4 × 10-17
Brain-cortex3.0 × 10-14
Brain-caudate3.4 × 10-14
Brain-frontal cortex4.3 × 10-11
Brain-anterior cingulate cortex0.000022
Brain-hippocampus0.000025
HLA-DOAWhole blood0.00005
HLA-DPA1Cells-cultured fibroblasts0.000071
HLA-DPB1Testis0.000028
HLA-DPB2Brain-nucleus accumbens1.3 × 10-18
Brain-caudate2.4 × 10-13
Brain-putamen3.4 × 10-11
Brain-cortex3.5 × 10-8
Brain-hypothalamus3.6 × 10-7
Brain-cerebellum0.000057
HSD17B8Thyroid1.0 × 10-13
Muscle-skeletal3.5 × 10-11
Esophagus-muscularis1.3 × 10-8
Artery-tibial5.6 × 10-7
Esophagus-gastroesophageal junction0.0000011
Colon-sigmoid0.0000018
Heart-left ventricle0.0000028
Testis0.0000032
Adipose-visceral0.000016
Artery-aorta0.000018
Esophagus-mucosa0.000029
Skin-not sun exposed0.00003
Cells-cultured fibroblasts0.000065
RING1Pancreas0.0000059
RPS18Artery-tibial0.000032
Whole blood0.000084
WDR46Lung0.000018
sQTLsCOL11A2Thyroid3.5 × 10-8
Brain-spinal cord9.0 × 10-7
Pituitary0.0000084
HLA-DPB1Skin-sun exposed (lower leg)6.6 × 10-7
Muscle-skeletal8.5 × 10-7
Skin-not sun exposed (suprapubic)8.6 × 10-7
HLA-DPB2Brain-cerebellum1.9 × 10-12
Skin-sun exposed (lower leg)6.6 × 10-7
Muscle-skeletal8.5 × 10-7
Skin-not sun exposed (suprapubic)8.6 × 10-7
Brain-spinal cord0.000004
Colon-transverse0.000014
Artery-tibial0.000063
Testis0.00012
rs75836320----
rs362071eQTLsMED15Artery-tibial3.1 × 10-13
Nerve-tibial1.2 × 10-11
Esophagus-muscularis5.4 × 10-11
Lung1.4 × 10-9
Adipose-subcutaneous6.5 × 10-9
Thyroid8.2 × 10-9
Spleen1.4 × 10-8
Whole blood3.6 × 10-8
Artery-aorta9.0 × 10-8
Colon-transverse1.2 × 10-7
Adrenal gland2.0 × 10-7
Adipose-visceral5.0 × 10-7
Prostate5.8 × 10-7
Liver9.7 × 10-7
Esophagus-gastroesophageal junction0.0000046
Pancreas0.0000063
Skin-sun exposed (lower leg)0.000015
Cells-cultured fibroblasts0.000052
sQTLsKLHL22Skin-sun exposed (lower leg)0.000003
Skin-not sun exposed (suprapubic)0.0000049
MED15Muscle-skeletal3.0 × 10-7

rs3117017 is located on a haplotype block, including part of the HLA-DPB2 gene. HLA-DPB2 is considered the key gene in both T1D and T2D metabolism[24]. In addition, expression of COL11A2, HSD17B8, HCG24, HLA-DOA, HLA-DPB2, HLA-DPA1, RPS18, and RING1 genes are also associated with rs3117017 in different tissues.

rs75836320 is located on the promoter region of TBC1D30 gene, a key gene for insulin processing and secretion. However, no GTEx data were available for this SNP due to its low frequency (MAF < 1%) in the samples.

rs362071 is located on the intron of KLHL22 gene, which has not been previously reported to be associated with diabetes. The expression of MED15 and KLHL22 genes may be highly correlated with this SNP, according to the outcome of eQTL prediction.

Bioinformatics network analysis of possible influenced genes of SNPs

Gene ontology (GO) and KEGG pathway analyses were performed on the target genes listed in Table 3. As shown in Figure 2, these genes are mainly involved in enriched GO terms of “protein monoubiquitination”, “antigen processing and presentation of exogenous peptide antigen via major histocompatibility complex (MHC) class II”, “antigen processing and presentation of peptide antigen via MHC class II”, “antigen processing and presentation of peptide or polysaccharide antigen via MHC class II” in biological process (Figure 2A), and “MHC class II receptor activity”, “immune receptor activity”, “estradiol 17-beta-dehydrogenase activity”, and “Ribosomal large subunit binding” in molecular function (Figure 2B).

Figure 2
Figure 2 Gene ontology analysis of potential expression quantitative trait loci genes of genome-wide association study outcomes. A: Biological process analysis for expression quantitative trait loci (eQTL) genes; B: Gene ontology molecular function analysis for eQTL genes. GO: Gene ontology; MHC: Major histocompatibility complex.

The results of KEGG analysis are shown in Figure 3. The top five pathways were “Asthma”, “Allograft rejection”, “Graft-versus-host diseases”, “Type 1 diabetes mellitus” and “intestinal immune network for IgA production” shown in Figure 3A. The enrichment map of KEGG in potential target genes is shown in Figure 3B.

Figure 3
Figure 3 Kyoto Encyclopedia of Genes and Genomes pathway for expression quantitative trait loci genes of genome-wide association study outcomes. A: Network of pathways involved in A; B: The potential pathways for expression quantitative trait loci genes. KEEG: Kyoto Encyclopedia of Genes and Genomes.

For the association among potential downstream regulated genes of these SNPs, the protein-protein interaction network is shown in Figure 4. KLHDC1, NEMF, RPS18, HSD17B8, RING1, COL11A2, HLA-DPA1 and HLA-DOA genes form a complicated regulation pathway. The corresponding SNPs on this pathway may have an interactive effect in T1D. In addition, LRR1, KLHL22 and MED15 genes also form a straightforward regulation chain, echoing the eQTL finding of rs362071 in both diabetes-related genes KLHL22 and MED15.

Figure 4
Figure 4 The predictive interaction of expression quantitative trait loci genes in diabetes onset.
DISCUSSION

Through a comprehensive nested case-control study in a Chinese cohort, we identified 4 novel SNPs (one harmful and three protective) with a significant association with T1D via GWAS. This study has established the genetic basis of T1D in the Chinese Han population in Suzhou, one of the largest and fastest growing cities in southeast China.

According to the eQTL and sQTL results, rs55846421, the most significant SNP in this study, is highly associated with the expression of six genes (ARF6, KLHDC1, LRR1, RHOQP1, RP11-596C23.6 and VCPKMT) and splicing quantity of one gene (RPL36AL). However, only ARF6 is currently known to be associated with diabetes or diabetes-related diseases[25-27]. As a type of small G-protein, ARF6 is involved in transporting and docking insulin granules on plasma membrane for exocytotic insulin secretion[27]. rs3117017 is located on the HLA-DPB2 gene region and is highly associated with HLA-DPB2 gene expression, which belongs to the HLA genes. Several studies have demonstrated the importance of HLA class II genes, including HLA-DQA1, HLA-DR, and HLA-DQB1[28,29]. A recent hospital-based GWAS analysis of T1D identified a novel T1D-related SNP rs1770 on the 6p21.3 region of HLA-DQB1[15]. rs1770 is especially important for T1D in the Chinese population. Our findings on rs3117017 also suggest that another HLA class II gene - HLA-DPB2 - is associated with T1D susceptibility and pathogenesis in the Chinese population. As of now, there is no record of rs75836320 regulating downstream gene expression in either the GTEx database or GEO database. The last SNP identified in this study, rs362071, belongs to the KLHL22 gene as an intron variant, and we first report its association with T1D. KLHL22 is usually not associated with diseases, but the product of KLHL22 could be polymerized with CUL3. CUL3/KLHL2 is a notable E3 ubiquitin ligase gene[30-32] and its expression will trigger amino acid-dependent mTORC1 signaling, which may be involved in aging, cancer and diabetes[33,34].

In addition to genetic basis, we suggest that economic growth in the study region (Suzhou, China) may have influenced people’s diet preference, which increased the incidence of T1D. Studies have shown that T1D incidence in adolescents may be associated with gross domestic product (GDP) in Poland[17]. While GDP showed a steady increasing trend in Suzhou from 2012 to 2018, the incidence of adult T1D remained relatively stable during the same period (Supplementary Figure 1, provided by the Suzhou Statistical Yearbook since 2012 to 2018). Therefore, adult T1D is not significantly associated with economic growth in Suzhou, or may have a lag effect which warrants further investigation. In addition, diet may not play an important role in adult T1D in the Chinese population.

Furthermore, during the preprocessing stage of this study, we excluded 118 patients due to their close familial heredity to other T1D patients. Genetic correlation of T1D was more than 13% in our study sample. However, according to studies based on the Caucasian population[16,35], the degree of familial cluster was much lower than in the Chinese population. Another study confirmed a significant difference in GWAS results between Chinese and Caucasian populations[15]. Based on previous studies in both populations and our current study, we suggest that the genetic basis and pathogenesis of T1D may vary between Chinese and Caucasian populations. These distinctions can have profound implications on T1D risk estimation, diagnosis, and treatment across different ethnic groups.

There are some limitations in our study. First, unlike a hospital-based case-control study, the sample size of T1D patients (cases) in our nested case-control study was relatively small due to the low incidence of T1D in the natural cohort. Therefore, the results of this study need to be further validated in larger studies. Second, we only investigated the contribution of genetic factors (SNPs) in this study. Other factors, for example, dietary preferences and habits, could also be important in T1D pathogenesis. We will investigate the influence of these factors on T1D in future studies. Third, T1D is a disease with high heritability. In our study, we excluded individuals with high homology, following the protocol of GWAS. However, this exclusion may conceal the actual effects of some relevant SNPs in T1D.

CONCLUSION

To the best of our knowledge, this is the first study to conduct a GWAS for adult T1D in a nested case-control study. We identified 4 novel SNPs as genetic biomarkers for adult T1D onset. Subsequent basic pedigree can further complement and strengthen our research.

ARTICLE HIGHLIGHTS
Research background

The genome-wide association study (GWAS) of type 1 diabetes (T1D) is valuable.

Research motivation

We hoped to add data from a natural cohort to fill gaps in the knowledge of T1D susceptibility.

Research objectives

We conducted a cohort study to evaluate the variants of genes in order to adjust the bias in hospital-based research.

Research methods

The GWAS analysis used in our research reflected the associations between single nucleotide polymorphisms (SNPs) and T1D with high-throughput sequencing.

Research results

In T1D patients, rs3117017 displayed its damaging role in the onset of T1D, while rs55846421, rs75836320, and rs362071 displayed their protective roles in T1D.

A larger nested case-control study of a Chinese population will be helpful in validating our findings.

Research conclusions

The GWAS analysis from a hospital-based population was a little different to that from normal population.

Research perspectives

The specific function of these SNPs should be investigated in the future.

ACKNOWLEDGEMENTS

We thank the participants enrolled in our study, the Suzhou Bureau of Social Security, and the China National Center for Disease Control and Prevention. We would also like to thank Professor Yao-Rong Ge of the University of North Carolina, Charlotte and Mr. Chao Jiang at Nanjing University of Chinese Medicine for their help. We also appreciate the technical support on statistics from Target-Gene Biotechnology Co., Ltd., Nanjing, and the recommendation of eQTL from Biolight Biotechnology Co., Ltd, Nanjing.

Footnotes

Provenance and peer review: Unsolicited article; Externally peer reviewed

Peer-review model: Single blind

Specialty type: Endocrinology and metabolism

Country/Territory of origin: China

Peer-review report’s scientific quality classification

Grade A (Excellent): 0

Grade B (Very good): B

Grade C (Good): 0

Grade D (Fair): 0

Grade E (Poor): 0

P-Reviewer: Ata F S-Editor: Wang JJ L-Editor: Webster JR P-Editor: Wang JJ

References
1.  Paneni F, Beckman JA, Creager MA, Cosentino F. Diabetes and vascular disease: pathophysiology, clinical consequences, and medical therapy: part I. Eur Heart J. 2013;34:2436-2443.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 592]  [Cited by in F6Publishing: 701]  [Article Influence: 58.4]  [Reference Citation Analysis (0)]
2.  International Diabetes Federation (IDF)  IDF diabetes atlas, 9th ed, 2019.  [PubMed]  [DOI]  [Cited in This Article: ]
3.  DiMeglio LA, Evans-Molina C, Oram RA. Type 1 diabetes. Lancet. 2018;391:2449-2462.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 595]  [Cited by in F6Publishing: 854]  [Article Influence: 122.0]  [Reference Citation Analysis (0)]
4.  Maffi P, Secchi A. The Burden of Diabetes: Emerging Data. Dev Ophthalmol. 2017;60:1-5.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 59]  [Cited by in F6Publishing: 62]  [Article Influence: 7.8]  [Reference Citation Analysis (0)]
5.  Patterson CC, Karuranga S, Salpea P, Saeedi P, Dahlquist G, Soltesz G, Ogle GD. Worldwide estimates of incidence, prevalence and mortality of type 1 diabetes in children and adolescents: Results from the International Diabetes Federation Diabetes Atlas, 9th edition. Diabetes Res Clin Pract. 2019;157:107842.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 239]  [Cited by in F6Publishing: 298]  [Article Influence: 49.7]  [Reference Citation Analysis (0)]
6.  Bach JF, Chatenoud L. The hygiene hypothesis: an explanation for the increased frequency of insulin-dependent diabetes. Cold Spring Harb Perspect Med. 2012;2:a007799.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 63]  [Cited by in F6Publishing: 77]  [Article Influence: 6.4]  [Reference Citation Analysis (0)]
7.  Bach JF. The effect of infections on susceptibility to autoimmune and allergic diseases. N Engl J Med. 2002;347:911-920.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 1881]  [Cited by in F6Publishing: 1741]  [Article Influence: 75.7]  [Reference Citation Analysis (0)]
8.  Morgan E, Cardwell CR, Black CJ, McCance DR, Patterson CC. Excess mortality in Type 1 diabetes diagnosed in childhood and adolescence: a systematic review of population-based cohorts. Acta Diabetol. 2015;52:801-807.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 51]  [Cited by in F6Publishing: 54]  [Article Influence: 5.4]  [Reference Citation Analysis (0)]
9.  Lind M, Svensson AM, Kosiborod M, Gudbjörnsdottir S, Pivodic A, Wedel H, Dahlqvist S, Clements M, Rosengren A. Glycemic control and excess mortality in type 1 diabetes. N Engl J Med. 2014;371:1972-1982.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 575]  [Cited by in F6Publishing: 603]  [Article Influence: 54.8]  [Reference Citation Analysis (0)]
10.  Laron Z. Interplay between heredity and environment in the recent explosion of type 1 childhood diabetes mellitus. Am J Med Genet. 2002;115:4-7.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 26]  [Cited by in F6Publishing: 28]  [Article Influence: 1.2]  [Reference Citation Analysis (0)]
11.  Rodríguez-Rodríguez I, Chatzigiannakis I, Rodríguez JV, Maranghi M, Gentili M, Zamora-Izquierdo MÁ. Utility of Big Data in Predicting Short-Term Blood Glucose Levels in Type 1 Diabetes Mellitus Through Machine Learning Techniques. Sensors (Basel). 2019;19.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 36]  [Cited by in F6Publishing: 38]  [Article Influence: 6.3]  [Reference Citation Analysis (0)]
12.  Dorman JS, Bunker CH. HLA-DQ locus of the human leukocyte antigen complex and type 1 diabetes mellitus: a HuGE review. Epidemiol Rev. 2000;22:218-227.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 71]  [Cited by in F6Publishing: 75]  [Article Influence: 3.1]  [Reference Citation Analysis (1)]
13.  Weng J, Zhou Z, Guo L, Zhu D, Ji L, Luo X, Mu Y, Jia W; T1D China Study Group. Incidence of type 1 diabetes in China, 2010-13: population based study. BMJ. 2018;360:j5295.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 149]  [Cited by in F6Publishing: 196]  [Article Influence: 28.0]  [Reference Citation Analysis (0)]
14.  Chiang JL, Kirkman MS, Laffel LM, Peters AL; Type 1 Diabetes Sourcebook Authors. Type 1 diabetes through the life span: a position statement of the American Diabetes Association. Diabetes Care. 2014;37:2034-2054.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 614]  [Cited by in F6Publishing: 589]  [Article Influence: 53.5]  [Reference Citation Analysis (0)]
15.  Zhu M, Xu K, Chen Y, Gu Y, Zhang M, Luo F, Liu Y, Gu W, Hu J, Xu H, Xie Z, Sun C, Li Y, Sun M, Xu X, Hsu HT, Chen H, Fu Q, Shi Y, Xu J, Ji L, Liu J, Bian L, Zhu J, Chen S, Xiao L, Li X, Jiang H, Shen M, Huang Q, Fang C, Huang G, Fan J, Jiang Z, Jiang Y, Dai J, Ma H, Zheng S, Cai Y, Dai H, Zheng X, Zhou H, Ni S, Jin G, She JX, Yu L, Polychronakos C, Hu Z, Zhou Z, Weng J, Shen H, Yang T. Identification of Novel T1D Risk Loci and Their Association With Age and Islet Function at Diagnosis in Autoantibody-Positive T1D Individuals: Based on a Two-Stage Genome-Wide Association Study. Diabetes Care. 2019;42:1414-1421.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 43]  [Cited by in F6Publishing: 61]  [Article Influence: 10.2]  [Reference Citation Analysis (0)]
16.  Garyu JW, Meffre E, Cotsapas C, Herold KC. Progress and challenges for treating Type 1 diabetes. J Autoimmun. 2016;71:1-9.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 23]  [Cited by in F6Publishing: 19]  [Article Influence: 2.1]  [Reference Citation Analysis (0)]
17.  Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661-678.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 7131]  [Cited by in F6Publishing: 7047]  [Article Influence: 391.5]  [Reference Citation Analysis (0)]
18.  Onengut-Gumuscu S, Chen WM, Burren O, Cooper NJ, Quinlan AR, Mychaleckyj JC, Farber E, Bonnie JK, Szpak M, Schofield E, Achuthan P, Guo H, Fortune MD, Stevens H, Walker NM, Ward LD, Kundaje A, Kellis M, Daly MJ, Barrett JC, Cooper JD, Deloukas P; Type 1 Diabetes Genetics Consortium, Todd JA, Wallace C, Concannon P, Rich SS. Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers. Nat Genet. 2015;47:381-386.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 562]  [Cited by in F6Publishing: 500]  [Article Influence: 50.0]  [Reference Citation Analysis (0)]
19.  Rudant J, Clavel J, Infante-Rivard C. Selection bias in case-control studies on household exposure to pesticides and childhood acute leukemia. J Expo Sci Environ Epidemiol. 2010;20:299-309.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 9]  [Cited by in F6Publishing: 8]  [Article Influence: 0.5]  [Reference Citation Analysis (0)]
20.  Nishimoto IN, Pintos J, Schlecht NF, Torloni H, Carvalho AL, Kowalski LP, Franco EL. Assessment of control selection bias in a hospital-based case-control study of upper aero-digestive tract cancers. J Cancer Epidemiol Prev. 2002;7:131-141.  [PubMed]  [DOI]  [Cited in This Article: ]
21.  Park Y, Wintergerst KA, Zhou Z. Clinical heterogeneity of type 1 diabetes (T1D) found in Asia. Diabetes Metab Res Rev. 2017;33.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 21]  [Cited by in F6Publishing: 25]  [Article Influence: 3.1]  [Reference Citation Analysis (0)]
22.  Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5:e1000529.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 2978]  [Cited by in F6Publishing: 3083]  [Article Influence: 192.7]  [Reference Citation Analysis (0)]
23.  Meng W, Adams MJ, Palmer CNA; 23andMe Research Team, Shi J, Auton A, Ryan KA, Jordan JM, Mitchell BD, Jackson RD, Yau MS, McIntosh AM, Smith BH. Author Correction: Genome-wide association study of knee pain identifies associations with GDF5 and COL27A1 in UK Biobank. Commun Biol. 2020;3:149.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 1]  [Cited by in F6Publishing: 2]  [Article Influence: 0.4]  [Reference Citation Analysis (0)]
24.  Lie BA, Todd JA, Pociot F, Nerup J, Akselsen HE, Joner G, Dahl-Jørgensen K, Rønningen KS, Thorsby E, Undlien DE. The predisposition to type 1 diabetes linked to the human leukocyte antigen complex includes at least one non-class II gene. Am J Hum Genet. 1999;64:793-800.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 129]  [Cited by in F6Publishing: 136]  [Article Influence: 5.2]  [Reference Citation Analysis (0)]
25.  Duan Y, Prasad R, Feng D, Beli E, Li Calzi S, Longhini ALF, Lamendella R, Floyd JL, Dupont M, Noothi SK, Sreejit G, Athmanathan B, Wright J, Jensen AR, Oudit GY, Markel TA, Nagareddy PR, Obukhov AG, Grant MB. Bone Marrow-Derived Cells Restore Functional Integrity of the Gut Epithelial and Vascular Barriers in a Model of Diabetes and ACE2 Deficiency. Circ Res. 2019;125:969-988.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 48]  [Cited by in F6Publishing: 63]  [Article Influence: 10.5]  [Reference Citation Analysis (0)]
26.  Hu J, Yang Q, Chen Z, Liang W, Feng J, Ding G. Small GTPase Arf6 regulates diabetes-induced cholesterol accumulation in podocytes. J Cell Physiol. 2019;234:23559-23570.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 9]  [Cited by in F6Publishing: 16]  [Article Influence: 2.7]  [Reference Citation Analysis (0)]
27.  Kowluru A. Role of G-proteins in islet function in health and diabetes. Diabetes Obes Metab. 2017;19 Suppl 1:63-75.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 24]  [Cited by in F6Publishing: 22]  [Article Influence: 2.8]  [Reference Citation Analysis (0)]
28.  Nerup J, Platz P, Andersen OO, Christy M, Lyngsoe J, Poulsen JE, Ryder LP, Nielsen LS, Thomsen M, Svejgaard A. HL-A antigens and diabetes mellitus. Lancet. 1974;2:864-866.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 549]  [Cited by in F6Publishing: 501]  [Article Influence: 9.8]  [Reference Citation Analysis (0)]
29.  Noble JA, Valdes AM, Cook M, Klitz W, Thomson G, Erlich HA. The role of HLA class II genes in insulin-dependent diabetes mellitus: molecular analysis of 180 Caucasian, multiplex families. Am J Hum Genet. 1996;59:1134-1148.  [PubMed]  [DOI]  [Cited in This Article: ]
30.  Chen J, Ou Y, Yang Y, Li W, Xu Y, Xie Y, Liu Y. KLHL22 activates amino-acid-dependent mTORC1 signalling to promote tumorigenesis and ageing. Nature. 2018;557:585-589.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 56]  [Cited by in F6Publishing: 59]  [Article Influence: 8.4]  [Reference Citation Analysis (0)]
31.  Metzger T, Kleiss C, Sumara I. CUL3 and protein kinases: insights from PLK1/KLHL22 interaction. Cell Cycle. 2013;12:2291-2296.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 16]  [Cited by in F6Publishing: 17]  [Article Influence: 1.4]  [Reference Citation Analysis (0)]
32.  Beck J, Peter M. Regulating PLK1 dynamics by Cullin3/KLHL22-mediated ubiquitylation. Cell Cycle. 2013;12:2528-2529.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 8]  [Cited by in F6Publishing: 8]  [Article Influence: 0.7]  [Reference Citation Analysis (0)]
33.  Laplante M, Sabatini DM. mTOR signaling in growth control and disease. Cell. 2012;149:274-293.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 5794]  [Cited by in F6Publishing: 6352]  [Article Influence: 488.6]  [Reference Citation Analysis (1)]
34.  Zoncu R, Efeyan A, Sabatini DM. mTOR: from growth signal integration to cancer, diabetes and ageing. Nat Rev Mol Cell Biol. 2011;12:21-35.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 2855]  [Cited by in F6Publishing: 3101]  [Article Influence: 206.7]  [Reference Citation Analysis (0)]
35.  Ikegami H, Ogihara T. Genetics of insulin-dependent diabetes mellitus. Endocr J. 1996;43:605-613.  [PubMed]  [DOI]  [Cited in This Article: ]  [Cited by in Crossref: 52]  [Cited by in F6Publishing: 54]  [Article Influence: 1.9]  [Reference Citation Analysis (0)]