Selection of Tagging Single Nucleotide Polymorphisms and Association With Body Fat, Insulin Sensitivity, and the Metabolic Syndrome in a Nor
http://www.100md.com
《糖尿病学杂志》
1 Department of Clinical Developmental Sciences, St. George’s, University of London, London, U.K
2 Georgia Prevention Institute, Department of Pediatrics, Medical College of Georgia, Augusta, Georgia
3 Twin Research and Genetic Epidemiology Unit, St. Thomas’ Hospital, London, U.K
DBP, diastolic blood pressure; GEE, generalized estimating equation; MAF, minor allele frequency; PTP-1B, protein tyrosine phosphatase-1B; SBP, systolic blood pressure; SiM, Avignon’s insulin sensitivity index based on both fasting and 2-h insulin and glucose values; SNP, single nucleotide polymorphism; TDT, transmission-disequilibrium test; tSNP, tagging SNP
ABSTRACT
Protein tyrosine phosphatase-1B negatively regulates leptin and insulin signaling, potentially contributing to hormonal resistance. We selected six tagging single nucleotide polymorphisms (SNPs) representing 18 common variants in the protein tyrosine phosphatase-1B gene (PTPN1) and tested their effect on serum leptin, body fat, and measures of insulin sensitivity and the metabolic syndrome in a large sample of normal female Caucasian twins (n = 2,777; mean age, 47.4 ± 12.5 years) from the St. Thomas’ U.K. Adult Twin Registry. SNP rs718049 was significantly associated with waist circumference (P = 0.008) and central fat (P = 0.035) and also with Avignon’s insulin sensitivity index (SiM) (P = 0.007), fasting insulin (P = 0.004), fasting glucose (P = 0.022), triglyceride (P = 0.023), and systolic blood pressure (P = 0.046). SNPs rs2282146 and rs1885177 were associated with SiM (P = 0.049 and P = 0.013, respectively), and 1484insG was associated with triglyceride (P = 0.029). A risk haplotype (7.3%) was associated with lower SiM (P = 0.036) and a protective haplotype (5.2%) with higher SiM (P = 0.057), with mean values in homozygotes differing by >1 SD (P = 0.003). The protective haplotype also showed lower triglyceride (P = 0.045) and lower systolic blood pressure (P = 0.006). Fine mapping analyses predicted significant associations with SiM and fasting insulin for several ungenotyped SNPs. PTPN1 variants appear to contribute to central fat and metabolic syndrome traits, secondary to their effect on insulin sensitivity.
Obesity poses one of the most pressing current public health problems and is a major risk factor for type 2 diabetes. Protein tyrosine phosphatase-1B (PTP-1B) negatively regulates the signaling pathways of insulin and leptin, two hormones involved in the central regulation of energy balance (1). Activated insulin receptor and receptor substrate-1 are dephosphorylated by PTP-1B (2), and phosphorylated JAK2 and STAT3 have been identified as its targets in the leptin pathway (3,4). PTP-1BeCdeficient mice are hypersensitive to insulin (5) and leptin (4,6) and are protected from diet-induced obesity (4eC6). Hence, increased activity of PTP-1B potentially contributes to leptin resistance associated with overweight (7) and resistance to insulin, which predisposes to type 2 diabetes. In humans, PTPN1, the gene coding for PTP-1B, maps to a region in which a major human quantitative trait locus for obesity and type 2 diabetes has been reported (8eC10). Several rare PTPN1 single nucleotide polymorphisms (SNPs) have been shown to be associated with insulin resistance and diabetes in different populations (11eC14). Recently, Palmer et al. (15) mounted an extensive study of 35 SNPs spanning a 161-kb region including the PTPN1 gene in >800 Hispanic Americans. Twenty SNPs showed significant association with fasting glucose and the insulin sensitivity index Si. Through its effect on insulin sensitivity, changes in PTP-1B activity may also be reflected in secondary development of the metabolic syndrome, characterized by increases in central obesity, triglycerides, fasting glucose, and systolic blood pressure (SBP) and lower levels of HDL cholesterol (16). In Caucasians, PTPN1 SNPs have been associated with serum triglyceride levels in obese subjects (14) and normal individuals (15), and a strong association of common risk haplotypes with hypertension has been found in Chinese and Japanese subjects (17). To the best of our knowledge, no human studies have focused on impaired leptin signal transduction as the underlying pathway by specifically investigating the association of PTPN1 variants with serum leptin and measures of body fat.
Increased availability of SNPs has recently transformed candidate gene association studies. Examination on a gene-wide level is now recommended with all variants within a candidate gene considered jointly (18). This is best achieved through selection of a minimal set of tagging SNPs (tSNPs) effectively capturing information of all common variants by taking into account patterns of linkage disequilibrium across the gene (19,20). We therefore used a multistep design to select a set of tSNPs representing all considered common variants in PTPN1 and tested the effect of these tSNPs, individually and as haplotypes on a range of variables characterizing body composition, insulin sensitivity and the metabolic syndrome in a sample of Caucasian female twins (n = 2,777; mean age, 47.4 ± 12.5 years) that is almost three times the size of the Hispanic population studied by Palmer et al. (15) and 50% larger than the combined Chinese and Japanese cohorts of Olivier et al. (17). Where possible, we compared any specific single SNP and haplotype associations in our larger sample with those found in previous studies. Finally, we performed a fine mapping analysis that uses the tSNPs to predict potential associations between the remaining (ungenotyped) SNPs and the outcome variables, in an attempt to locate potential functional sites (19).
RESEARCH DESIGN AND METHODS
The St. Thomas’ U.K. Adult Twin Registry (Twins U.K.) comprises unselected, mostly female volunteers ascertained from the general population through national media campaigns in the U.K. (21). Means and ranges of quantitative phenotypes in Twins U.K. are normally distributed and similar to the age-matched general population in the U.K. (22). General characteristics of the subjects are given in Table 1.
Zygosity, body composition, and biochemical analyses.
Zygosity was determined by standardized questionnaire and confirmed by DNA fingerprinting. Serum leptin concentration was determined after an overnight fast using a radioimmunoassay (Linco Research, St. Louis, MO). The majority of these twins also had measures of total and central body fat obtained by dual energy X-ray absorptiometry body composition scans (Hologic QDR-2000; Vertec, Waltham, MA) (23). Blood sample collection for determination of fasting serum insulin and glucose was as described by de Lange et al. (24). Fasting insulin was measured by immunoassay (Abbott Laboratories, Maidenhead, U.K.) and glucose on an Ektachem 700 multichannel analyzer using an enzymatic colorimetric slide assay (Johnson and Johnson Clinical Diagnostic Systems, Amersham, U.K.). A subsample of twins underwent an oral glucose tolerance test for which glucose and insulin levels were measured before and 2 h after a 75-g oral glucose load. Levels of HDL cholesterol and triglycerides were measured using a Cobas Fara machine (Roche Diagnostics) (25). SBP and diastolic blood pressure (DBP) were measured twice using an automated cuff sphygmomanometer (OMRON HEM713C) and averaged over the two readings. Informed consent was obtained from participants before they entered the study. The protocol was approved by the local research ethics committee.
Genotyping for SNP validation and tSNP selection.
Nineteen validated SNPs from the National Center for Biotechnology Information database (http://www.ncbi.nlm.nih.gov/SNP) and one SNP, 1484insG, reported by Di Paola et al. (13) were selected for validation in the Twins U.K. cohort by genotyping eight random unrelated subjects. Among these 20 SNPs, only rs718050 was not polymorphic. The other 19 SNPs were further genotyped in a sample of 94 unrelated subjects for tSNP selection, using PCR and restriction. This sample size is three times that suggested by one permutation study (26), which indicates that genotyping 25eC32 unphased individuals is sufficient to select tSNPs. SNP rs3787339 (minor allele frequency [MAF] of 1%) turned out to be uninformative and was excluded from all further analyses. Primers and PCR conditions for SNP validation and tSNP selection are given in online appendix 1 (available at http://diabetes.diabetesjournals.org). The rs number and relative position of these 19 SNPs are shown in Fig. 1.
Genotyping in cohort.
Six tSNPs (rs6067484, rs1885177, rs2282146, rs718049, rs3787348, and 1484insG) were selected and genotyped in the complete cohort by Pyrosequencing (Biotage, Uppsala, Sweden). Genotyping accuracy was assessed by inclusion of duplicates (pairs of monozygotic twins) in the arrays and negative controls (water blanks) on each plate. Primers and PCR conditions for SNP genotyping in the full cohort by Pyrosequencing are given in online appendix 2.
Analytical approach.
The main purposes of our analyses were to select a set of tSNPs representing the common variants in PTPN1 and to test the effect of these tSNPs, individually and/or as haplotypes, on a range of variables including serum leptin, measures of body fat, insulin sensitivity, and components of the metabolic syndrome.
tSNP selection.
We used two approaches to identify an optimal subset of tSNPs. The first one, devised by Stram et al. (20), selects a subset of SNPs as tSNPs by optimizing their prediction of common haplotypes of all SNPs genotyped in the 94 unrelated individuals. It considers a measure of association (RH2) between the true and predicted number of copies (0, 1, or 2) of each possible common haplotype carried by a randomly sampled subject, in which the prediction is based on knowledge of the tSNPs. The program tagsnps was used with the following parameters: common haplotypes were defined as "the minimal set of haplotypes that covers 90% of existing haplotypes" and sets of tSNPs resolving the common haplotypes were selected at a RH2 threshold of 0.85 because the number of selected tSNPs at this threshold was equal to the number of tSNPs selected using the method of Chapman et al. (19), at an RL2 threshold of 0.85 (see below). Chapman et al. selected an optimal set of tSNPs in such a way that the allele frequencies of the remaining (non-tSNPs) can be predicted well. A series of regression equations are calculated for which the predictive efficiency is assessed in terms of RL2, which measures the proportion of variance of each remaining SNP "explained" by regression on the tSNP alleles (locus-based scoring). These regression equations can also be used to predict which of the ungenotyped SNPs might also show association with the trait. The package htSNP2 was used to select a tSNP set that predicts remaining SNPs with a minimum RL2 of 0.85, as suggested by Chapman et al. (19). The program mlpop implemented in the package genassoc predicts the association between unmeasured SNPs and the trait in unrelated subjects. We adapted mlpop for the analysis of related subjects such as twins by replacing the linear regression with a generalized estimating equations (GEEs) procedure, which yields unbiased standard errors and P values (27). The main difference between the approaches of Stram et al. (20) and Chapman et al. (19) is that the former is based on prediction of extended haplotypes (in this case, based on 18 SNPs) from the marker haplotypes (in this case, based on 6 tSNPs), whereas the latter is based on prediction of single SNP loci.
Association analysis.
Regular association analyses were performed using GEEs. Analyses were done separately for each of the SNPs and followed up by haplotype analyses. For individual SNP association analyses, we first tested the 2-df (degrees of freedom) codominant model. In the presence of a significant association, a dominant and a recessive model were further tested to find the best mode of inheritance. Age and menopausal status were included as covariates in the models. Details of our approach to test the association of statistically inferred haplotypes with continuous traits have been described previously (28). In short, we used haplotype trend regression as outlined by Zaykin et al. (29), with the probabilities of haplotype pairs estimated by PHASE 2.0 software (30). Obesity-related variables included leptin, weight, BMI, waist circumference, total fat mass, percent total fat, central fat mass, and percent central fat. Factor analysis was used to combine strongly correlated indexes of obesity into two measures: one for general obesity (serum leptin, BMI, weight, total fat mass, and percent total fat) and one for central obesity (waist circumference, central fat mass, and percent central fat). To reduce the likelihood of identifying false-positive associations, results of single variables characterizing obesity were confirmed by these two combined scores. We used two indexes of insulin sensitivity: fasting insulin and Avignon’s insulin sensitivity index (SiM). SiM is calculated based on both fasting and 2-h insulin and glucose data (31) and was calculated according to the following formulae: SiM = (0.137 x SIB + SIH2)/2, where SIB = 108/(fasting insulin x fasting glucose x VD); SIH2 = 108/(2-h insulin x 2-h glucose x VD) and VD = 150 ml/kg x body wt. Both fasting insulin (r = 0.68) and SiM (r = 0.92) are highly correlated with insulin sensitivity (Si) in the normal population. SiM is also a good predictor of diabetes, especially in Caucasians (32). For the metabolic syndrome measures, we included five variables: triglycerides, HDL cholesterol, waist circumference, fasting glucose, and SBP, based on the clinical definition of the insulin receptor substrate 9 (metabolic syndrome) as specified in the third report of the Adult Treatment Panel (ATPIII) of high blood cholesterol (16). To control for population stratification, dizygotic twin pairs discordant for genotype were also used in sibling transmission-disequilibrium test (TDT) association analysis to confirm the results of regular association tests as described elsewhere (27). Preliminary analyses were performed using STATA 8 (StataCorp, College Station, TX). Phenotypes significantly (P < 0.05) deviating from normal were log transformed to obtain normal distributions before analysis. Hardy-Weinberg equilibrium was tested by a 2 test with 1 df in one twin of each pair chosen at random to prevent inflated significance. Pairwise linkage disequilibrium coefficients were calculated using GOLD and reported as D' and r2 (33).
RESULTS
Among 20 SNPs tested in the eight subjects, 19 SNPs were polymorphic and genotyped in the 94 subjects. Figure 1 shows the positions and MAFs of these 19 SNPs. All of the SNPs except rs2282146 (P303P in exon 8) were located in noncoding regions, and their genotype frequencies were consistent with Hardy-Weinberg proportions. The uninformative rs3787339 (MAF of 1%) was removed from further analyses (34). The other 18 SNPs had MAFs >5% and showed strong pairwise linkage disequilibrium (online appendixes 3 and 4). The presence of strong linkage disequilibrium throughout the gene suggested the feasibility of tSNP selection, facilitating great savings in time and costs.
Table 2 shows the inferred haplotypes of the 18 SNPs (n = 94). Of the seven haplotypes together comprising >90% of the total, three were common (>5%), one had a frequency of 4.2%, and three showed a frequency of 1.9%. The same set of six tSNPs, rs6067484, rs1885177, rs2282146, rs718049, rs3787348, and 1484insG, were selected by tagsnps (20) and htSNP2 (19) programs. This set of tSNPs accurately predicted both common haplotypes and unmeasured loci. Minimum values of RH2 and RL2 were 0.884 for haplotype 3 and 0.882 for rs10485614, respectively.
Table 3 shows the genotype and allele frequencies of the six tSNPs in the whole cohort. The total number of subjects genotyped for each polymorphism varied slightly and was somewhat lower than 2,356, the total number of twins genotyped (i.e., one monozygotic and both dizygotic twins of each pair). This was due to unsuccessful amplification of the target sequences for some samples. None of the loci showed deviation from Hardy-Weinberg equilibrium. The inferred haplotype frequencies of the six tSNPs in the whole cohort are also shown in Table 3. The same seven most common haplotypes observed in the 94 subjects (Table 2) were also identified by the six tSNPs, although the actual estimates of haplotype frequencies and corresponding order varied somewhat between the full cohort and the tSNP selection sample (n = 94).
Table 4 presents the results of individual SNP analyses. For the SNPs of lowest frequency, rs2282146 and 1484insG, only dominant models were tested, whereas for the four more common SNPs, the 2-df codominant model was first tested, followed by a dominant and a recessive model in the presence of a significant association. With the exception of the effect of rs718049 on waist circumference, none of the main effects of the SNPs on leptin and other obesity-related variables reached statistical significance. We therefore did not include these variables in Table 4. Regular association (P = 0.074) and sibling TDT tests (P = 0.025) for the combined central obesity score confirmed the significant associations with waist circumference (data not shown). No significant effects of haplotypes on any of the obesity-related variables were found (data not shown). Carriers of the minor allele of rs718049 had significantly higher waist circumference, lower SiM, and higher fasting insulin and glucose. Excepting glucose, these associations were maintained in sibling TDT. SNP rs2282146 was significantly associated with SiM (P = 0.049), and this association changed to borderline significance in sibling TDT (P = 0.062), with carriers of the minor allele having lower SiM. Under the recessive model, carriers of the minor allele of rs1885177 had significantly lower SiM (P = 0.013) and maintained significance in sibling TDT (P = 0.039). Two SNPs showed significant associations with other components of the metabolic syndrome. Again, the strongest relationship was shown with rs718049, associated with serum triglyceride and SBP, with all except triglyceride remaining significant in sibling TDT. SNP 1484insG was associated with triglyceride (P = 0.029) only in the regular test.
Table 5 shows haplotype frequencies and results of haplotype association tests. Although the overall test did not reach significance (P = 0.085), haplotype 4 was significantly associated with lower SiM (P = 0.036), i.e., confers risk of insulin resistance, and haplotype 5 was borderline-significantly associated with higher SiM (P = 0.057), i.e., is protective. The mean levels of SiM were 0.46 SD lower for haplotype 4 homozygotes and 0.56 SD higher for haplotype 5 homozygotes compared with those homozygous for the most common haplotype 1, a difference of >1 SD (P = 0.003). The explained variance of SiM by the PTPN1 haplotypes was 1.92%. The risk and protective haplotypes are discriminated by rs2282146 and rs718049, reflecting the strongest individual significant associations with SiM in the regular and sibling TDT analyses. The protective effects of haplotype 5 were also observed for SBP (P = 0.006) and triglyceride (P = 0.045) levels. The -coefficient for haplotype 5 on SBP was eC7.2, and on triglycerides was eC0.17, i.e., SBP levels were 7.2 mmHg lower and triglyceride levels 0.17 mmol/l lower for protective haplotype 5 homozygotes compared with common haplotype 1 homozygotes. The explained percentages of variance in SBP and triglyceride levels by the PTPN1 haplotypes were 0.33% and 0.63%, respectively.
The approach we used to select an optimal set of tSNPs (19) can also predict which of those SNPs not typed in the full cohort might also show association with the trait. Figure 2 shows the fine mapping results (including the ungenotyped SNPs) based on prediction of single SNP loci from the six tSNPs. Associations of both tSNPs and predicted values for unmeasured SNPs with SiM, fasting insulin, and fasting glucose under the additive genetic model are indicated by eC10Log(P). Predicted associations for lipids, SBP, and waist circumference were found to be uninformative and are not shown.
DISCUSSION
Our previous work (21) and previous work by others (35) have shown that variation in leptin levels, total adiposity, and central abdominal fat mass is under strong genetic influence. However, in this study, neither the single SNP analyses nor the haplotype analyses of PTPN1 showed any significant effect on leptin, weight, BMI, or total fat variables. That is, our results do not support a major role for PTPN1 variants in body weight regulation, because downstream effects of the hypothesized impairment in leptin signal transduction such as increases in serum leptin and measures of total body fat were not observed. This conclusion is especially important because the design of our study allows us to exclude the most likely alternative explanations for this largely negative result. First, our study did not suffer from a lack of statistical power to detect SNPs with small effects. The current study had 80% ( = 0.05) power to detect a biallelic quantitative trait locus explaining as little as 0.5% of the variance (36). Second, strong linkage disequilibrium across PTPN1 allowed us to effectively capture common variation in this gene by the selected set of tSNPs, making it unlikely that we may have missed any major variants that are either causal or in strong linkage disequilibrium with a causal locus. The third advantage of our study is the comprehensive and accurate measurement of the phenotypes of interest: general and central obesity. In addition to serum leptin levels and anthropometric variables, central and total body fat was assessed by dual energy X-ray absorptiometry, which is a more objective and reliable method of assessing adiposity than, for example, BMI because it allows discrimination between fat, muscle mass, bone, and vital organs (23). Information on all phenotypes was subsequently used to generate one combined score for general obesity and one for central obesity.
Although we observed a small but significant effect of one PTPN1 variant on central obesity, we suggest this effect is mediated through the inhibition of insulin signaling by PTP-1B, because central fat is a major determinant of insulin sensitivity whereas total fat is the major determinant of leptin levels (23). We found that carriers of the minor alleles of rs1885177, rs2282146, and rs718049 had significantly lower SiM and higher fasting insulin and glucose, associations maintained in sibling TDT. Effects of PTPN1 on insulin sensitivity and type 2 diabetes risk have been reported by two recent studies (15,37). Palmer et al. (15) found that 20 SNPs with MAFs >0.1 in a single haplotype block covering the PTPN1 genomic sequence showed significant association with the insulin sensitivity index Si in Hispanic Americans (P = 0.003eC0.044, n = 811, based on codominant models). Although the study of Palmer et al. is a family study of 55 pedigrees, unlike us, no TDT results were reported to exclude the possibility of false-positive associations due to stratification or admixture. Four of our tSNPs were among those genotyped by Palmer et al.: rs1885177, rs718049, rs3787348, and 1484insG. Our results and those of Palmer et al. are consistent in that carriage of the A allele of rs1885177 [rare allele (2) in our cohort and marginally the more frequent allele (1) in that of Palmer et al.] was associated with lower SiM and Si, as was the allele (2) of rs718049. SNP rs2282146, associated with SiM by us (P = 0.049), had an MAF of 4% in the Hispanic population and did not show a significant influence on Si. Palmer et al. (15) found that their 20 SNPs were all significantly associated with fasting glucose (P = <0.001eC0.029), however, we only found an association with rs718049 (P = 0.045).
Palmer et al. (15) analyzed eight SNPs tagging haplotypes with frequency >10%, having completed genotyping of all 35 SNPs in their 811 subjects. We reduced effort considerably by first genotyping candidate tSNPs in a small subsample, which confirmed the strong linkage disequilibrium in the region that emerged from the larger dataset of Palmer et al. and enabled tSNP selection. The eight tSNPs of Palmer et al. included three of our tSNPs, rs718049, rs3787348, and 1484insG, and another three that we had already ascertained were tagged by rs718049: rs3787345, rs745118, and rs2282147. We had not attempted to validate their remaining two SNPs. Palmer et al. (15) defined one risk, one protective, and one neutral haplotype with respect to Si. We can only directly compare the haplotypes constructed by common SNPs rs718049, rs3787348, and 1484insG in the Hispanic sample and our sample with respect to Si and SiM. In fact, rs718049 and rs3787348 alone discriminate the Hispanic haplotypes. Haplotype rs718049-rs3787348-1484insG 1-1-1 is protective in the Hispanics and in our twin population. The Hispanic neutral haplotype is also identical to our neutral (most common) haplotype: 1-2-1. The Hispanic risk haplotype 2-2-1 is slightly different; in the twins it is 2-1-1. Therefore, overall, our haplotype results are consistent with those of Palmer et al. with respect to insulin sensitivity. However, unlike them, we found no haplotypes associated with fasting glucose. The same haplotypes associated with Si and fasting glucose by Palmer et al. (15) have independently been shown to be associated with type 2 diabetes risk and protection in Caucasian type 2 diabetic subjects (37).
Olivier et al. (17) found that of six SNPs in strong linkage disequilibrium spanning PTPN1, one SNP (rs16995294, not genotyped by us) was significantly associated with hypertension in 1,553 Chinese and Japanese subjects from 672 families. In a TDT, they identified two 6-SNP haplotypes significantly overtransmitted in hypertensive subjects (P < 0.0001). Again, although direct comparisons are not possible, our finding of a protective haplotype for SBP in Caucasians provides further evidence in support of an influence of PTPN1 on development of hypertension. In the absence of an obvious etiological role, this effect is likely to be secondary to the influence of the gene on insulin sensitivity.
Finally, comparing our study of the 1484insG SNP with that of Di Paola et al. (13) in 477 normoglycemic women from two Italian regions, we found no significant differences in quantitative phenotypes between wild-type and 1484insG carriers, including those for which they found positive associations: SBP and DBP (higher in one group only) and elevated 1-h and 2-h insulin levels (measured in the other group). They found that 1484insG was associated with higher serum triglycerides in Italian men but not in women, however, we found an association in women, which remained significant after adjustment for BMI (P = 0.029).
Variation in the coding region of PTPN1 is relatively rare (only two nonsynonymous changes listed on the National Center for Biotechnology Information database), and associations of the synonymous coding or intronic SNPs studied by us are likely to result from linkage disequilibrium with functional variants, rather than be causal themselves. We can use the prediction rules computed during the choice of the marker subset to target specific loci for further study (19). With regard to SiM and fasting insulin, we predict additional significant associations with rs3787345 and rs2038526 in intron 4, rs754118 in intron 5, and rs2282147 in intron 7. Although it remains difficult to predict where the causal locus or loci might be due to the strong linkage disequilibrium across this gene, Fig. 2 indicates that certain regions of the gene can be excluded and others need to be considered when looking for causal loci and, as such, provides useful direction. The large introns in PTPN1 and the flanking regions may contain unknown regulatory elements; for example, the 3'-untranslated region 1484insG variant was shown to stabilize PTP1-B mRNA in vitro (13). We are now validating SNPs in the PTPN1 5'-flanking region and will explore whether they are in strong linkage disequilibrium with the existing tSNPs (especially rs2282146), which could be the causal locus or loci. Our findings and those of others relating PTPN1 variation to insulin resistance, lipid profile, central obesity, and hypertension suggest that this gene has a significant influence on progression of the metabolic syndrome. Demonstration of an etiological PTPN1 variant is eagerly awaited and would confirm PTP1-B as an influential factor.
ACKNOWLEDGMENTS
This study has received funding from British Heart Foundation Project Grant PG/04/028. The Twin Research and Genetic Epidemiology Unit has received support from the Wellcome Trust, Arthritis Research Campaign, the Chronic Disease Research Foundation, and the European Union 5th Framework Programme Genom EU twin no. QLG2-CT-2002-01254.
This research was conducted within the network of the London IDEAS Genetic Knowledge Park, and used the St. George’s, University of London Medical Biomics Centre.
FOOTNOTES
N.J.S.-J. and X.W. share joint first authorship.
Additional information for this article can be found in an online appendix at http://diabetes.diabetesjournals.org.
REFERENCES
Schwartz MW, Woods SC, Porte D, Seeley RJ, Baskin DG: Central nervous system control of food intake. Nature404 :661 eC671,2000
Byon JC, Kusari AB, Kusari J: Protein-tyrosine phosphatase-1B acts as a negative regulator of insulin signal transduction. Mol Cell Biochem182 :101 eC108,1998
Lund IK, Hansen JA, Andersen HS, Moller NP, Billestrup N: Mechanism of protein tyrosine phosphatase 1B-mediated inhibition of leptin signalling. J Mol Endocrinol34 :339 eC351,2005
Zabolotny JM, Bence-Hanulec KK, Stricker-Krongrad A, Haj F, Wang Y, Minokoshi Y, Kim YB, Elmquist JK, Tartaglia LA, Kahn BB, Neel BG: PTP-1B regulates leptin signal transduction in vivo. Dev Cell2 :489 eC495,2002
Elchebly M, Payette P, Michaliszyn E, Cromlish W, Collins S, Loy AL, Normandin D, Cheng A, Himms-Hagen J, Chan CC, Ramachandran C, Gresser MJ, Tremblay ML, Kennedy BP: Increased insulin sensitivity and obesity resistance in mice lacking the protein tyrosine phosphatase-1B gene. Science283 :1544 eC1548,1999
Cheng A, Uetani N, Simoncic PD, Chaubey VP, Lee-Loy A, McGlade CJ, Kennedy BP, Tremblay ML: Attenuation of leptin action and regulation of obesity by protein tyrosine phosphatase 1B. Dev Cell2 :497 eC503,2002
Maffei M, Halaas J, Ravussin E, Pratley RE, Lee GH, Zhang Y, Fei H, Kim S, Lallone R, Ranganathan S, et al.: Leptin levels in human and rodent: measurement of plasma leptin and ob RNA in obese and weight-reduced subjects. Nat Med1 :1155 eC1161,1995
Lembertas AV, Perusse L, Chagnon YC, Fisler JS, Warden CH, Purcell-Huynh DA, Dionne FT, Gagnon J, Nadeau A, Lusis AJ, Bouchard C: Identification of an obesity quantitative trait locus on mouse chromosome 2 and evidence of linkage to body fat and insulin on the human homologous region 20q. J Clin Invest100 :1240 eC1247,1997
Ji L, Malecki M, Warram JH, Yang Y, Rich SS, Krolewski AS: New susceptibility locus for NIDDM is localized to human chromosome 20q. Diabetes46 :876 eC881,1997
Lee JH, Reed DR, Li WD, Xu W, Joo EJ, Kilker RL, Nanthakumar E, North M, Sakul H, Bell C, Price RA: Genome scan for human obesity and linkage to markers in 20q13. Am J Hum Genet64 :196 eC209,1999
Echwald SM, Bach H, Vestergaard H, Richelsen B, Kristensen K, Drivsholm T, Borch-Johnsen K, Hansen T, Pedersen O: A P387L variant in protein tyrosine phosphatase-1B (PTP-1B) is associated with type 2 diabetes and impaired serine phosphorylation of PTP-1B in vitro. Diabetes51 :1 eC6,2002
Mok A, Cao H, Zinman B, Hanley AJ, Harris SB, Kennedy BP, Hegele RA: A single nucleotide polymorphism in protein tyrosine phosphatase PTP-1B is associated with protection from diabetes or impaired glucose tolerance in Oji-Cree. J Clin Endocrinol Metab87 :724 eC727,2002
Di Paola R, Frittitta L, Miscio G, Bozzali M, Baratta R, Centra M, Spampinato D, Santagati MG, Ercolino T, Cisternino C, Soccio T, Mastroianno S, Tassi V, Almgren P, Pizzuti A, Vigneri R, Trischitta V: A variation in 3' UTR of hPTP1-B increases specific gene expression and associates with insulin resistance. Am J Hum Genet70 :806 eC812,2002
Kipfer-Coudreau S, Eberle D, Sahbatou M, Bonhomme A, Guy-Grand B, Froguel P, Galan P, Basdevant A, Clement K: Single nucleotide polymorphisms of protein tyrosine phosphatase 1B gene are associated with obesity in morbidly obese French subjects. Diabetologia47 :1278 eC1284,2004
Palmer ND, Bento JL, Mychaleckyj JC, Langefeld CD, Campbell JK, Norris JM, Haffner SM, Bergman RN, Bowden DW; Insulin Resistance Atherosclerosis Study (IRAS) Family Study: Association of protein tyrosine phosphatase 1B gene polymorphisms with measures of glucose homeostasis in Hispanic Americans: the Insulin Resistance Atherosclerosis Study (IRAS) family study. Diabetes53 :3013 eC3019,2004
de Ferranti SD, Gauvreau K, Ludwig DS, Neufeld EJ, Newburger JW, Rifai N: Prevalence of the metabolic syndrome in American adolescents: findings from the Third National Health and Nutrition Examination Survey. Circulation110 :2494 eC2497,2004
Olivier M, Hsiung CA, Chuang LM, Ho LT, Ting CT, Bustos VI, Lee TM, De Witte A, Chen YD, Olshen R, Rodriguez B, Wen CC, Cox DR: Single nucleotide polymorphisms in protein tyrosine phosphatase 1 (PTPN1) are associated with essential hypertension and obesity. Hum Mol Genet13 :1885 eC1892,2004
Neale BM, Sham PC: The future of association studies: gene-based analysis and replication. Am J Hum Genet75 :353 eC362,2004
Chapman JM, Cooper JD, Todd JA, Clayton DG: Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Hered56 :18 eC31,2003
Stram DO, Haiman CA, Hirschhorn JN, Altshuler D, Kolonel LN, Henderson BE, Pike MC: Choosing haplotype-tagging SNPs based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the Multiethnic Cohort Study. Hum Hered55 :27 eC36,2003
Spector TD, MacGregor AJ: The St. Thomas’ U.K. Adult Twin Registry. Twin Res5 :440 eC443,2002
Andrew T, Hart D, Snieder H, de Lange M, Spector TD, MacGregor AJ: Are twins and singletons comparable A study of disease-related and lifestyle characteristics in adult women. Twin Res4 :464 eC477,2001
Samaras K, Spector TD, Nguyen TV, Baan K, Campbell LV, Kelly PJ: Independent genetic factors determine the amount and distribution of fat in women after the menopause. J Clin Endocrinol Metab82 :781 eC785,1997
de Lange M, Snieder H, Ariens RA, Andrew T, Grant PJ, Spector TD: The relation between insulin resistance and hemostasis: pleiotropic genes and common environment. Twin Res6 :152 eC161,2003
Middelberg RP, Spector TD, Swaminathan R, Snieder H: Genetic and environmental influences on lipids, lipoproteins, and apolipoproteins: effects of menopause. Arterioscler Thromb Vasc Biol22 :1142 eC1147,2002
Thompson D, Stram D, Goldgar D, Witte JS: Haplotype tagging single nucleotide polymorphisms and association studies. Hum Hered56 :48 eC55,2003
Treegout D-A, Ducimeteere P, Tiret L: Testing association between candidate-gene markers and phenotype in related individuals, by use of estimating equations. Am J Hum Genet61 :189 eC199,1997
Dong Y, Zhu H, Wang X, Dalageorgou C, Carter N, Spector TD, Snieder H: Obesity reveals an association between blood pressure and the G-protein beta3-subunit gene: a study of female dizygotic twins. Pharmacogenetics14 :419 eC427,2004
Zaykin DV, Westfall PH, Young SS, Karnoub MA, Wagner MJ, Ehm MG: Testing association of statistically inferred haplotypes with discrete and continuous traits in samples of unrelated individuals. Hum Hered53 :79 eC91,2002
Stephens M, Donnelly P: A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet73 :1162 eC1169,2003
Avignon A, Boegner C, Mariano-Goulart D, Colette C, Monnier L: Assessment of insulin sensitivity from plasma insulin and glucose in the fasting or post oral glucose-load state. Int J Obes Relat Metab Disord23 :512 eC517,1999
Hanley AJ, Williams K, Gonzalez C, D’Agostino RB Jr, Wagenknecht LE, Stern MP, Haffner SM: Prediction of type 2 diabetes using simple measures of insulin resistance: combined results from the San Antonio Heart Study, the Mexico City Diabetes Study, and the Insulin Resistance Atherosclerosis Study. Diabetes52 :463 eC469,2003
Abecasis GR, Cookson WO: GOLD: graphical overview of linkage disequilibrium. Bioinformatics16 :182 eC183,2000
Tabor HK, Risch NJ, Myers RM: Candidate-gene approaches for studying complex genetic traits: practical considerations (Opinion). Nat Rev Genet3 :391 eC397,2002
Comuzzie AG, Hixson JE, Almasy L, Mitchell BD, Mahaney MC, Dyer TD, Stern MP, MacCluer JW, Blangero J: A major quantitative trait locus determining serum leptin levels and fat mass is located on human chromosome 2. Nat Genet15 :273 eC276,1997
Sham PC, Cherny SS, Purcell S, Hewitt JK: Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. Am J Hum Genet66 :1616 eC1630,2000
Bento JL, Palmer ND, Mychaleckyj JC, Lange LA, Langefeld CD, Rich SS, Freedman BI, Bowden DW: Association of protein tyrosine phosphatase 1B gene polymorphisms with type 2 diabetes. Diabetes53 :3007 eC3012,2004(Nicola J. Spencer-Jones, )
2 Georgia Prevention Institute, Department of Pediatrics, Medical College of Georgia, Augusta, Georgia
3 Twin Research and Genetic Epidemiology Unit, St. Thomas’ Hospital, London, U.K
DBP, diastolic blood pressure; GEE, generalized estimating equation; MAF, minor allele frequency; PTP-1B, protein tyrosine phosphatase-1B; SBP, systolic blood pressure; SiM, Avignon’s insulin sensitivity index based on both fasting and 2-h insulin and glucose values; SNP, single nucleotide polymorphism; TDT, transmission-disequilibrium test; tSNP, tagging SNP
ABSTRACT
Protein tyrosine phosphatase-1B negatively regulates leptin and insulin signaling, potentially contributing to hormonal resistance. We selected six tagging single nucleotide polymorphisms (SNPs) representing 18 common variants in the protein tyrosine phosphatase-1B gene (PTPN1) and tested their effect on serum leptin, body fat, and measures of insulin sensitivity and the metabolic syndrome in a large sample of normal female Caucasian twins (n = 2,777; mean age, 47.4 ± 12.5 years) from the St. Thomas’ U.K. Adult Twin Registry. SNP rs718049 was significantly associated with waist circumference (P = 0.008) and central fat (P = 0.035) and also with Avignon’s insulin sensitivity index (SiM) (P = 0.007), fasting insulin (P = 0.004), fasting glucose (P = 0.022), triglyceride (P = 0.023), and systolic blood pressure (P = 0.046). SNPs rs2282146 and rs1885177 were associated with SiM (P = 0.049 and P = 0.013, respectively), and 1484insG was associated with triglyceride (P = 0.029). A risk haplotype (7.3%) was associated with lower SiM (P = 0.036) and a protective haplotype (5.2%) with higher SiM (P = 0.057), with mean values in homozygotes differing by >1 SD (P = 0.003). The protective haplotype also showed lower triglyceride (P = 0.045) and lower systolic blood pressure (P = 0.006). Fine mapping analyses predicted significant associations with SiM and fasting insulin for several ungenotyped SNPs. PTPN1 variants appear to contribute to central fat and metabolic syndrome traits, secondary to their effect on insulin sensitivity.
Obesity poses one of the most pressing current public health problems and is a major risk factor for type 2 diabetes. Protein tyrosine phosphatase-1B (PTP-1B) negatively regulates the signaling pathways of insulin and leptin, two hormones involved in the central regulation of energy balance (1). Activated insulin receptor and receptor substrate-1 are dephosphorylated by PTP-1B (2), and phosphorylated JAK2 and STAT3 have been identified as its targets in the leptin pathway (3,4). PTP-1BeCdeficient mice are hypersensitive to insulin (5) and leptin (4,6) and are protected from diet-induced obesity (4eC6). Hence, increased activity of PTP-1B potentially contributes to leptin resistance associated with overweight (7) and resistance to insulin, which predisposes to type 2 diabetes. In humans, PTPN1, the gene coding for PTP-1B, maps to a region in which a major human quantitative trait locus for obesity and type 2 diabetes has been reported (8eC10). Several rare PTPN1 single nucleotide polymorphisms (SNPs) have been shown to be associated with insulin resistance and diabetes in different populations (11eC14). Recently, Palmer et al. (15) mounted an extensive study of 35 SNPs spanning a 161-kb region including the PTPN1 gene in >800 Hispanic Americans. Twenty SNPs showed significant association with fasting glucose and the insulin sensitivity index Si. Through its effect on insulin sensitivity, changes in PTP-1B activity may also be reflected in secondary development of the metabolic syndrome, characterized by increases in central obesity, triglycerides, fasting glucose, and systolic blood pressure (SBP) and lower levels of HDL cholesterol (16). In Caucasians, PTPN1 SNPs have been associated with serum triglyceride levels in obese subjects (14) and normal individuals (15), and a strong association of common risk haplotypes with hypertension has been found in Chinese and Japanese subjects (17). To the best of our knowledge, no human studies have focused on impaired leptin signal transduction as the underlying pathway by specifically investigating the association of PTPN1 variants with serum leptin and measures of body fat.
Increased availability of SNPs has recently transformed candidate gene association studies. Examination on a gene-wide level is now recommended with all variants within a candidate gene considered jointly (18). This is best achieved through selection of a minimal set of tagging SNPs (tSNPs) effectively capturing information of all common variants by taking into account patterns of linkage disequilibrium across the gene (19,20). We therefore used a multistep design to select a set of tSNPs representing all considered common variants in PTPN1 and tested the effect of these tSNPs, individually and as haplotypes on a range of variables characterizing body composition, insulin sensitivity and the metabolic syndrome in a sample of Caucasian female twins (n = 2,777; mean age, 47.4 ± 12.5 years) that is almost three times the size of the Hispanic population studied by Palmer et al. (15) and 50% larger than the combined Chinese and Japanese cohorts of Olivier et al. (17). Where possible, we compared any specific single SNP and haplotype associations in our larger sample with those found in previous studies. Finally, we performed a fine mapping analysis that uses the tSNPs to predict potential associations between the remaining (ungenotyped) SNPs and the outcome variables, in an attempt to locate potential functional sites (19).
RESEARCH DESIGN AND METHODS
The St. Thomas’ U.K. Adult Twin Registry (Twins U.K.) comprises unselected, mostly female volunteers ascertained from the general population through national media campaigns in the U.K. (21). Means and ranges of quantitative phenotypes in Twins U.K. are normally distributed and similar to the age-matched general population in the U.K. (22). General characteristics of the subjects are given in Table 1.
Zygosity, body composition, and biochemical analyses.
Zygosity was determined by standardized questionnaire and confirmed by DNA fingerprinting. Serum leptin concentration was determined after an overnight fast using a radioimmunoassay (Linco Research, St. Louis, MO). The majority of these twins also had measures of total and central body fat obtained by dual energy X-ray absorptiometry body composition scans (Hologic QDR-2000; Vertec, Waltham, MA) (23). Blood sample collection for determination of fasting serum insulin and glucose was as described by de Lange et al. (24). Fasting insulin was measured by immunoassay (Abbott Laboratories, Maidenhead, U.K.) and glucose on an Ektachem 700 multichannel analyzer using an enzymatic colorimetric slide assay (Johnson and Johnson Clinical Diagnostic Systems, Amersham, U.K.). A subsample of twins underwent an oral glucose tolerance test for which glucose and insulin levels were measured before and 2 h after a 75-g oral glucose load. Levels of HDL cholesterol and triglycerides were measured using a Cobas Fara machine (Roche Diagnostics) (25). SBP and diastolic blood pressure (DBP) were measured twice using an automated cuff sphygmomanometer (OMRON HEM713C) and averaged over the two readings. Informed consent was obtained from participants before they entered the study. The protocol was approved by the local research ethics committee.
Genotyping for SNP validation and tSNP selection.
Nineteen validated SNPs from the National Center for Biotechnology Information database (http://www.ncbi.nlm.nih.gov/SNP) and one SNP, 1484insG, reported by Di Paola et al. (13) were selected for validation in the Twins U.K. cohort by genotyping eight random unrelated subjects. Among these 20 SNPs, only rs718050 was not polymorphic. The other 19 SNPs were further genotyped in a sample of 94 unrelated subjects for tSNP selection, using PCR and restriction. This sample size is three times that suggested by one permutation study (26), which indicates that genotyping 25eC32 unphased individuals is sufficient to select tSNPs. SNP rs3787339 (minor allele frequency [MAF] of 1%) turned out to be uninformative and was excluded from all further analyses. Primers and PCR conditions for SNP validation and tSNP selection are given in online appendix 1 (available at http://diabetes.diabetesjournals.org). The rs number and relative position of these 19 SNPs are shown in Fig. 1.
Genotyping in cohort.
Six tSNPs (rs6067484, rs1885177, rs2282146, rs718049, rs3787348, and 1484insG) were selected and genotyped in the complete cohort by Pyrosequencing (Biotage, Uppsala, Sweden). Genotyping accuracy was assessed by inclusion of duplicates (pairs of monozygotic twins) in the arrays and negative controls (water blanks) on each plate. Primers and PCR conditions for SNP genotyping in the full cohort by Pyrosequencing are given in online appendix 2.
Analytical approach.
The main purposes of our analyses were to select a set of tSNPs representing the common variants in PTPN1 and to test the effect of these tSNPs, individually and/or as haplotypes, on a range of variables including serum leptin, measures of body fat, insulin sensitivity, and components of the metabolic syndrome.
tSNP selection.
We used two approaches to identify an optimal subset of tSNPs. The first one, devised by Stram et al. (20), selects a subset of SNPs as tSNPs by optimizing their prediction of common haplotypes of all SNPs genotyped in the 94 unrelated individuals. It considers a measure of association (RH2) between the true and predicted number of copies (0, 1, or 2) of each possible common haplotype carried by a randomly sampled subject, in which the prediction is based on knowledge of the tSNPs. The program tagsnps was used with the following parameters: common haplotypes were defined as "the minimal set of haplotypes that covers 90% of existing haplotypes" and sets of tSNPs resolving the common haplotypes were selected at a RH2 threshold of 0.85 because the number of selected tSNPs at this threshold was equal to the number of tSNPs selected using the method of Chapman et al. (19), at an RL2 threshold of 0.85 (see below). Chapman et al. selected an optimal set of tSNPs in such a way that the allele frequencies of the remaining (non-tSNPs) can be predicted well. A series of regression equations are calculated for which the predictive efficiency is assessed in terms of RL2, which measures the proportion of variance of each remaining SNP "explained" by regression on the tSNP alleles (locus-based scoring). These regression equations can also be used to predict which of the ungenotyped SNPs might also show association with the trait. The package htSNP2 was used to select a tSNP set that predicts remaining SNPs with a minimum RL2 of 0.85, as suggested by Chapman et al. (19). The program mlpop implemented in the package genassoc predicts the association between unmeasured SNPs and the trait in unrelated subjects. We adapted mlpop for the analysis of related subjects such as twins by replacing the linear regression with a generalized estimating equations (GEEs) procedure, which yields unbiased standard errors and P values (27). The main difference between the approaches of Stram et al. (20) and Chapman et al. (19) is that the former is based on prediction of extended haplotypes (in this case, based on 18 SNPs) from the marker haplotypes (in this case, based on 6 tSNPs), whereas the latter is based on prediction of single SNP loci.
Association analysis.
Regular association analyses were performed using GEEs. Analyses were done separately for each of the SNPs and followed up by haplotype analyses. For individual SNP association analyses, we first tested the 2-df (degrees of freedom) codominant model. In the presence of a significant association, a dominant and a recessive model were further tested to find the best mode of inheritance. Age and menopausal status were included as covariates in the models. Details of our approach to test the association of statistically inferred haplotypes with continuous traits have been described previously (28). In short, we used haplotype trend regression as outlined by Zaykin et al. (29), with the probabilities of haplotype pairs estimated by PHASE 2.0 software (30). Obesity-related variables included leptin, weight, BMI, waist circumference, total fat mass, percent total fat, central fat mass, and percent central fat. Factor analysis was used to combine strongly correlated indexes of obesity into two measures: one for general obesity (serum leptin, BMI, weight, total fat mass, and percent total fat) and one for central obesity (waist circumference, central fat mass, and percent central fat). To reduce the likelihood of identifying false-positive associations, results of single variables characterizing obesity were confirmed by these two combined scores. We used two indexes of insulin sensitivity: fasting insulin and Avignon’s insulin sensitivity index (SiM). SiM is calculated based on both fasting and 2-h insulin and glucose data (31) and was calculated according to the following formulae: SiM = (0.137 x SIB + SIH2)/2, where SIB = 108/(fasting insulin x fasting glucose x VD); SIH2 = 108/(2-h insulin x 2-h glucose x VD) and VD = 150 ml/kg x body wt. Both fasting insulin (r = 0.68) and SiM (r = 0.92) are highly correlated with insulin sensitivity (Si) in the normal population. SiM is also a good predictor of diabetes, especially in Caucasians (32). For the metabolic syndrome measures, we included five variables: triglycerides, HDL cholesterol, waist circumference, fasting glucose, and SBP, based on the clinical definition of the insulin receptor substrate 9 (metabolic syndrome) as specified in the third report of the Adult Treatment Panel (ATPIII) of high blood cholesterol (16). To control for population stratification, dizygotic twin pairs discordant for genotype were also used in sibling transmission-disequilibrium test (TDT) association analysis to confirm the results of regular association tests as described elsewhere (27). Preliminary analyses were performed using STATA 8 (StataCorp, College Station, TX). Phenotypes significantly (P < 0.05) deviating from normal were log transformed to obtain normal distributions before analysis. Hardy-Weinberg equilibrium was tested by a 2 test with 1 df in one twin of each pair chosen at random to prevent inflated significance. Pairwise linkage disequilibrium coefficients were calculated using GOLD and reported as D' and r2 (33).
RESULTS
Among 20 SNPs tested in the eight subjects, 19 SNPs were polymorphic and genotyped in the 94 subjects. Figure 1 shows the positions and MAFs of these 19 SNPs. All of the SNPs except rs2282146 (P303P in exon 8) were located in noncoding regions, and their genotype frequencies were consistent with Hardy-Weinberg proportions. The uninformative rs3787339 (MAF of 1%) was removed from further analyses (34). The other 18 SNPs had MAFs >5% and showed strong pairwise linkage disequilibrium (online appendixes 3 and 4). The presence of strong linkage disequilibrium throughout the gene suggested the feasibility of tSNP selection, facilitating great savings in time and costs.
Table 2 shows the inferred haplotypes of the 18 SNPs (n = 94). Of the seven haplotypes together comprising >90% of the total, three were common (>5%), one had a frequency of 4.2%, and three showed a frequency of 1.9%. The same set of six tSNPs, rs6067484, rs1885177, rs2282146, rs718049, rs3787348, and 1484insG, were selected by tagsnps (20) and htSNP2 (19) programs. This set of tSNPs accurately predicted both common haplotypes and unmeasured loci. Minimum values of RH2 and RL2 were 0.884 for haplotype 3 and 0.882 for rs10485614, respectively.
Table 3 shows the genotype and allele frequencies of the six tSNPs in the whole cohort. The total number of subjects genotyped for each polymorphism varied slightly and was somewhat lower than 2,356, the total number of twins genotyped (i.e., one monozygotic and both dizygotic twins of each pair). This was due to unsuccessful amplification of the target sequences for some samples. None of the loci showed deviation from Hardy-Weinberg equilibrium. The inferred haplotype frequencies of the six tSNPs in the whole cohort are also shown in Table 3. The same seven most common haplotypes observed in the 94 subjects (Table 2) were also identified by the six tSNPs, although the actual estimates of haplotype frequencies and corresponding order varied somewhat between the full cohort and the tSNP selection sample (n = 94).
Table 4 presents the results of individual SNP analyses. For the SNPs of lowest frequency, rs2282146 and 1484insG, only dominant models were tested, whereas for the four more common SNPs, the 2-df codominant model was first tested, followed by a dominant and a recessive model in the presence of a significant association. With the exception of the effect of rs718049 on waist circumference, none of the main effects of the SNPs on leptin and other obesity-related variables reached statistical significance. We therefore did not include these variables in Table 4. Regular association (P = 0.074) and sibling TDT tests (P = 0.025) for the combined central obesity score confirmed the significant associations with waist circumference (data not shown). No significant effects of haplotypes on any of the obesity-related variables were found (data not shown). Carriers of the minor allele of rs718049 had significantly higher waist circumference, lower SiM, and higher fasting insulin and glucose. Excepting glucose, these associations were maintained in sibling TDT. SNP rs2282146 was significantly associated with SiM (P = 0.049), and this association changed to borderline significance in sibling TDT (P = 0.062), with carriers of the minor allele having lower SiM. Under the recessive model, carriers of the minor allele of rs1885177 had significantly lower SiM (P = 0.013) and maintained significance in sibling TDT (P = 0.039). Two SNPs showed significant associations with other components of the metabolic syndrome. Again, the strongest relationship was shown with rs718049, associated with serum triglyceride and SBP, with all except triglyceride remaining significant in sibling TDT. SNP 1484insG was associated with triglyceride (P = 0.029) only in the regular test.
Table 5 shows haplotype frequencies and results of haplotype association tests. Although the overall test did not reach significance (P = 0.085), haplotype 4 was significantly associated with lower SiM (P = 0.036), i.e., confers risk of insulin resistance, and haplotype 5 was borderline-significantly associated with higher SiM (P = 0.057), i.e., is protective. The mean levels of SiM were 0.46 SD lower for haplotype 4 homozygotes and 0.56 SD higher for haplotype 5 homozygotes compared with those homozygous for the most common haplotype 1, a difference of >1 SD (P = 0.003). The explained variance of SiM by the PTPN1 haplotypes was 1.92%. The risk and protective haplotypes are discriminated by rs2282146 and rs718049, reflecting the strongest individual significant associations with SiM in the regular and sibling TDT analyses. The protective effects of haplotype 5 were also observed for SBP (P = 0.006) and triglyceride (P = 0.045) levels. The -coefficient for haplotype 5 on SBP was eC7.2, and on triglycerides was eC0.17, i.e., SBP levels were 7.2 mmHg lower and triglyceride levels 0.17 mmol/l lower for protective haplotype 5 homozygotes compared with common haplotype 1 homozygotes. The explained percentages of variance in SBP and triglyceride levels by the PTPN1 haplotypes were 0.33% and 0.63%, respectively.
The approach we used to select an optimal set of tSNPs (19) can also predict which of those SNPs not typed in the full cohort might also show association with the trait. Figure 2 shows the fine mapping results (including the ungenotyped SNPs) based on prediction of single SNP loci from the six tSNPs. Associations of both tSNPs and predicted values for unmeasured SNPs with SiM, fasting insulin, and fasting glucose under the additive genetic model are indicated by eC10Log(P). Predicted associations for lipids, SBP, and waist circumference were found to be uninformative and are not shown.
DISCUSSION
Our previous work (21) and previous work by others (35) have shown that variation in leptin levels, total adiposity, and central abdominal fat mass is under strong genetic influence. However, in this study, neither the single SNP analyses nor the haplotype analyses of PTPN1 showed any significant effect on leptin, weight, BMI, or total fat variables. That is, our results do not support a major role for PTPN1 variants in body weight regulation, because downstream effects of the hypothesized impairment in leptin signal transduction such as increases in serum leptin and measures of total body fat were not observed. This conclusion is especially important because the design of our study allows us to exclude the most likely alternative explanations for this largely negative result. First, our study did not suffer from a lack of statistical power to detect SNPs with small effects. The current study had 80% ( = 0.05) power to detect a biallelic quantitative trait locus explaining as little as 0.5% of the variance (36). Second, strong linkage disequilibrium across PTPN1 allowed us to effectively capture common variation in this gene by the selected set of tSNPs, making it unlikely that we may have missed any major variants that are either causal or in strong linkage disequilibrium with a causal locus. The third advantage of our study is the comprehensive and accurate measurement of the phenotypes of interest: general and central obesity. In addition to serum leptin levels and anthropometric variables, central and total body fat was assessed by dual energy X-ray absorptiometry, which is a more objective and reliable method of assessing adiposity than, for example, BMI because it allows discrimination between fat, muscle mass, bone, and vital organs (23). Information on all phenotypes was subsequently used to generate one combined score for general obesity and one for central obesity.
Although we observed a small but significant effect of one PTPN1 variant on central obesity, we suggest this effect is mediated through the inhibition of insulin signaling by PTP-1B, because central fat is a major determinant of insulin sensitivity whereas total fat is the major determinant of leptin levels (23). We found that carriers of the minor alleles of rs1885177, rs2282146, and rs718049 had significantly lower SiM and higher fasting insulin and glucose, associations maintained in sibling TDT. Effects of PTPN1 on insulin sensitivity and type 2 diabetes risk have been reported by two recent studies (15,37). Palmer et al. (15) found that 20 SNPs with MAFs >0.1 in a single haplotype block covering the PTPN1 genomic sequence showed significant association with the insulin sensitivity index Si in Hispanic Americans (P = 0.003eC0.044, n = 811, based on codominant models). Although the study of Palmer et al. is a family study of 55 pedigrees, unlike us, no TDT results were reported to exclude the possibility of false-positive associations due to stratification or admixture. Four of our tSNPs were among those genotyped by Palmer et al.: rs1885177, rs718049, rs3787348, and 1484insG. Our results and those of Palmer et al. are consistent in that carriage of the A allele of rs1885177 [rare allele (2) in our cohort and marginally the more frequent allele (1) in that of Palmer et al.] was associated with lower SiM and Si, as was the allele (2) of rs718049. SNP rs2282146, associated with SiM by us (P = 0.049), had an MAF of 4% in the Hispanic population and did not show a significant influence on Si. Palmer et al. (15) found that their 20 SNPs were all significantly associated with fasting glucose (P = <0.001eC0.029), however, we only found an association with rs718049 (P = 0.045).
Palmer et al. (15) analyzed eight SNPs tagging haplotypes with frequency >10%, having completed genotyping of all 35 SNPs in their 811 subjects. We reduced effort considerably by first genotyping candidate tSNPs in a small subsample, which confirmed the strong linkage disequilibrium in the region that emerged from the larger dataset of Palmer et al. and enabled tSNP selection. The eight tSNPs of Palmer et al. included three of our tSNPs, rs718049, rs3787348, and 1484insG, and another three that we had already ascertained were tagged by rs718049: rs3787345, rs745118, and rs2282147. We had not attempted to validate their remaining two SNPs. Palmer et al. (15) defined one risk, one protective, and one neutral haplotype with respect to Si. We can only directly compare the haplotypes constructed by common SNPs rs718049, rs3787348, and 1484insG in the Hispanic sample and our sample with respect to Si and SiM. In fact, rs718049 and rs3787348 alone discriminate the Hispanic haplotypes. Haplotype rs718049-rs3787348-1484insG 1-1-1 is protective in the Hispanics and in our twin population. The Hispanic neutral haplotype is also identical to our neutral (most common) haplotype: 1-2-1. The Hispanic risk haplotype 2-2-1 is slightly different; in the twins it is 2-1-1. Therefore, overall, our haplotype results are consistent with those of Palmer et al. with respect to insulin sensitivity. However, unlike them, we found no haplotypes associated with fasting glucose. The same haplotypes associated with Si and fasting glucose by Palmer et al. (15) have independently been shown to be associated with type 2 diabetes risk and protection in Caucasian type 2 diabetic subjects (37).
Olivier et al. (17) found that of six SNPs in strong linkage disequilibrium spanning PTPN1, one SNP (rs16995294, not genotyped by us) was significantly associated with hypertension in 1,553 Chinese and Japanese subjects from 672 families. In a TDT, they identified two 6-SNP haplotypes significantly overtransmitted in hypertensive subjects (P < 0.0001). Again, although direct comparisons are not possible, our finding of a protective haplotype for SBP in Caucasians provides further evidence in support of an influence of PTPN1 on development of hypertension. In the absence of an obvious etiological role, this effect is likely to be secondary to the influence of the gene on insulin sensitivity.
Finally, comparing our study of the 1484insG SNP with that of Di Paola et al. (13) in 477 normoglycemic women from two Italian regions, we found no significant differences in quantitative phenotypes between wild-type and 1484insG carriers, including those for which they found positive associations: SBP and DBP (higher in one group only) and elevated 1-h and 2-h insulin levels (measured in the other group). They found that 1484insG was associated with higher serum triglycerides in Italian men but not in women, however, we found an association in women, which remained significant after adjustment for BMI (P = 0.029).
Variation in the coding region of PTPN1 is relatively rare (only two nonsynonymous changes listed on the National Center for Biotechnology Information database), and associations of the synonymous coding or intronic SNPs studied by us are likely to result from linkage disequilibrium with functional variants, rather than be causal themselves. We can use the prediction rules computed during the choice of the marker subset to target specific loci for further study (19). With regard to SiM and fasting insulin, we predict additional significant associations with rs3787345 and rs2038526 in intron 4, rs754118 in intron 5, and rs2282147 in intron 7. Although it remains difficult to predict where the causal locus or loci might be due to the strong linkage disequilibrium across this gene, Fig. 2 indicates that certain regions of the gene can be excluded and others need to be considered when looking for causal loci and, as such, provides useful direction. The large introns in PTPN1 and the flanking regions may contain unknown regulatory elements; for example, the 3'-untranslated region 1484insG variant was shown to stabilize PTP1-B mRNA in vitro (13). We are now validating SNPs in the PTPN1 5'-flanking region and will explore whether they are in strong linkage disequilibrium with the existing tSNPs (especially rs2282146), which could be the causal locus or loci. Our findings and those of others relating PTPN1 variation to insulin resistance, lipid profile, central obesity, and hypertension suggest that this gene has a significant influence on progression of the metabolic syndrome. Demonstration of an etiological PTPN1 variant is eagerly awaited and would confirm PTP1-B as an influential factor.
ACKNOWLEDGMENTS
This study has received funding from British Heart Foundation Project Grant PG/04/028. The Twin Research and Genetic Epidemiology Unit has received support from the Wellcome Trust, Arthritis Research Campaign, the Chronic Disease Research Foundation, and the European Union 5th Framework Programme Genom EU twin no. QLG2-CT-2002-01254.
This research was conducted within the network of the London IDEAS Genetic Knowledge Park, and used the St. George’s, University of London Medical Biomics Centre.
FOOTNOTES
N.J.S.-J. and X.W. share joint first authorship.
Additional information for this article can be found in an online appendix at http://diabetes.diabetesjournals.org.
REFERENCES
Schwartz MW, Woods SC, Porte D, Seeley RJ, Baskin DG: Central nervous system control of food intake. Nature404 :661 eC671,2000
Byon JC, Kusari AB, Kusari J: Protein-tyrosine phosphatase-1B acts as a negative regulator of insulin signal transduction. Mol Cell Biochem182 :101 eC108,1998
Lund IK, Hansen JA, Andersen HS, Moller NP, Billestrup N: Mechanism of protein tyrosine phosphatase 1B-mediated inhibition of leptin signalling. J Mol Endocrinol34 :339 eC351,2005
Zabolotny JM, Bence-Hanulec KK, Stricker-Krongrad A, Haj F, Wang Y, Minokoshi Y, Kim YB, Elmquist JK, Tartaglia LA, Kahn BB, Neel BG: PTP-1B regulates leptin signal transduction in vivo. Dev Cell2 :489 eC495,2002
Elchebly M, Payette P, Michaliszyn E, Cromlish W, Collins S, Loy AL, Normandin D, Cheng A, Himms-Hagen J, Chan CC, Ramachandran C, Gresser MJ, Tremblay ML, Kennedy BP: Increased insulin sensitivity and obesity resistance in mice lacking the protein tyrosine phosphatase-1B gene. Science283 :1544 eC1548,1999
Cheng A, Uetani N, Simoncic PD, Chaubey VP, Lee-Loy A, McGlade CJ, Kennedy BP, Tremblay ML: Attenuation of leptin action and regulation of obesity by protein tyrosine phosphatase 1B. Dev Cell2 :497 eC503,2002
Maffei M, Halaas J, Ravussin E, Pratley RE, Lee GH, Zhang Y, Fei H, Kim S, Lallone R, Ranganathan S, et al.: Leptin levels in human and rodent: measurement of plasma leptin and ob RNA in obese and weight-reduced subjects. Nat Med1 :1155 eC1161,1995
Lembertas AV, Perusse L, Chagnon YC, Fisler JS, Warden CH, Purcell-Huynh DA, Dionne FT, Gagnon J, Nadeau A, Lusis AJ, Bouchard C: Identification of an obesity quantitative trait locus on mouse chromosome 2 and evidence of linkage to body fat and insulin on the human homologous region 20q. J Clin Invest100 :1240 eC1247,1997
Ji L, Malecki M, Warram JH, Yang Y, Rich SS, Krolewski AS: New susceptibility locus for NIDDM is localized to human chromosome 20q. Diabetes46 :876 eC881,1997
Lee JH, Reed DR, Li WD, Xu W, Joo EJ, Kilker RL, Nanthakumar E, North M, Sakul H, Bell C, Price RA: Genome scan for human obesity and linkage to markers in 20q13. Am J Hum Genet64 :196 eC209,1999
Echwald SM, Bach H, Vestergaard H, Richelsen B, Kristensen K, Drivsholm T, Borch-Johnsen K, Hansen T, Pedersen O: A P387L variant in protein tyrosine phosphatase-1B (PTP-1B) is associated with type 2 diabetes and impaired serine phosphorylation of PTP-1B in vitro. Diabetes51 :1 eC6,2002
Mok A, Cao H, Zinman B, Hanley AJ, Harris SB, Kennedy BP, Hegele RA: A single nucleotide polymorphism in protein tyrosine phosphatase PTP-1B is associated with protection from diabetes or impaired glucose tolerance in Oji-Cree. J Clin Endocrinol Metab87 :724 eC727,2002
Di Paola R, Frittitta L, Miscio G, Bozzali M, Baratta R, Centra M, Spampinato D, Santagati MG, Ercolino T, Cisternino C, Soccio T, Mastroianno S, Tassi V, Almgren P, Pizzuti A, Vigneri R, Trischitta V: A variation in 3' UTR of hPTP1-B increases specific gene expression and associates with insulin resistance. Am J Hum Genet70 :806 eC812,2002
Kipfer-Coudreau S, Eberle D, Sahbatou M, Bonhomme A, Guy-Grand B, Froguel P, Galan P, Basdevant A, Clement K: Single nucleotide polymorphisms of protein tyrosine phosphatase 1B gene are associated with obesity in morbidly obese French subjects. Diabetologia47 :1278 eC1284,2004
Palmer ND, Bento JL, Mychaleckyj JC, Langefeld CD, Campbell JK, Norris JM, Haffner SM, Bergman RN, Bowden DW; Insulin Resistance Atherosclerosis Study (IRAS) Family Study: Association of protein tyrosine phosphatase 1B gene polymorphisms with measures of glucose homeostasis in Hispanic Americans: the Insulin Resistance Atherosclerosis Study (IRAS) family study. Diabetes53 :3013 eC3019,2004
de Ferranti SD, Gauvreau K, Ludwig DS, Neufeld EJ, Newburger JW, Rifai N: Prevalence of the metabolic syndrome in American adolescents: findings from the Third National Health and Nutrition Examination Survey. Circulation110 :2494 eC2497,2004
Olivier M, Hsiung CA, Chuang LM, Ho LT, Ting CT, Bustos VI, Lee TM, De Witte A, Chen YD, Olshen R, Rodriguez B, Wen CC, Cox DR: Single nucleotide polymorphisms in protein tyrosine phosphatase 1 (PTPN1) are associated with essential hypertension and obesity. Hum Mol Genet13 :1885 eC1892,2004
Neale BM, Sham PC: The future of association studies: gene-based analysis and replication. Am J Hum Genet75 :353 eC362,2004
Chapman JM, Cooper JD, Todd JA, Clayton DG: Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Hered56 :18 eC31,2003
Stram DO, Haiman CA, Hirschhorn JN, Altshuler D, Kolonel LN, Henderson BE, Pike MC: Choosing haplotype-tagging SNPs based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the Multiethnic Cohort Study. Hum Hered55 :27 eC36,2003
Spector TD, MacGregor AJ: The St. Thomas’ U.K. Adult Twin Registry. Twin Res5 :440 eC443,2002
Andrew T, Hart D, Snieder H, de Lange M, Spector TD, MacGregor AJ: Are twins and singletons comparable A study of disease-related and lifestyle characteristics in adult women. Twin Res4 :464 eC477,2001
Samaras K, Spector TD, Nguyen TV, Baan K, Campbell LV, Kelly PJ: Independent genetic factors determine the amount and distribution of fat in women after the menopause. J Clin Endocrinol Metab82 :781 eC785,1997
de Lange M, Snieder H, Ariens RA, Andrew T, Grant PJ, Spector TD: The relation between insulin resistance and hemostasis: pleiotropic genes and common environment. Twin Res6 :152 eC161,2003
Middelberg RP, Spector TD, Swaminathan R, Snieder H: Genetic and environmental influences on lipids, lipoproteins, and apolipoproteins: effects of menopause. Arterioscler Thromb Vasc Biol22 :1142 eC1147,2002
Thompson D, Stram D, Goldgar D, Witte JS: Haplotype tagging single nucleotide polymorphisms and association studies. Hum Hered56 :48 eC55,2003
Treegout D-A, Ducimeteere P, Tiret L: Testing association between candidate-gene markers and phenotype in related individuals, by use of estimating equations. Am J Hum Genet61 :189 eC199,1997
Dong Y, Zhu H, Wang X, Dalageorgou C, Carter N, Spector TD, Snieder H: Obesity reveals an association between blood pressure and the G-protein beta3-subunit gene: a study of female dizygotic twins. Pharmacogenetics14 :419 eC427,2004
Zaykin DV, Westfall PH, Young SS, Karnoub MA, Wagner MJ, Ehm MG: Testing association of statistically inferred haplotypes with discrete and continuous traits in samples of unrelated individuals. Hum Hered53 :79 eC91,2002
Stephens M, Donnelly P: A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet73 :1162 eC1169,2003
Avignon A, Boegner C, Mariano-Goulart D, Colette C, Monnier L: Assessment of insulin sensitivity from plasma insulin and glucose in the fasting or post oral glucose-load state. Int J Obes Relat Metab Disord23 :512 eC517,1999
Hanley AJ, Williams K, Gonzalez C, D’Agostino RB Jr, Wagenknecht LE, Stern MP, Haffner SM: Prediction of type 2 diabetes using simple measures of insulin resistance: combined results from the San Antonio Heart Study, the Mexico City Diabetes Study, and the Insulin Resistance Atherosclerosis Study. Diabetes52 :463 eC469,2003
Abecasis GR, Cookson WO: GOLD: graphical overview of linkage disequilibrium. Bioinformatics16 :182 eC183,2000
Tabor HK, Risch NJ, Myers RM: Candidate-gene approaches for studying complex genetic traits: practical considerations (Opinion). Nat Rev Genet3 :391 eC397,2002
Comuzzie AG, Hixson JE, Almasy L, Mitchell BD, Mahaney MC, Dyer TD, Stern MP, MacCluer JW, Blangero J: A major quantitative trait locus determining serum leptin levels and fat mass is located on human chromosome 2. Nat Genet15 :273 eC276,1997
Sham PC, Cherny SS, Purcell S, Hewitt JK: Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. Am J Hum Genet66 :1616 eC1630,2000
Bento JL, Palmer ND, Mychaleckyj JC, Lange LA, Langefeld CD, Rich SS, Freedman BI, Bowden DW: Association of protein tyrosine phosphatase 1B gene polymorphisms with type 2 diabetes. Diabetes53 :3007 eC3012,2004(Nicola J. Spencer-Jones, )