Trans-ethnic predicted expression genome-wide association analysis identifies a gene for estrogen receptor-negative breast cancer.
ABSTRACT: Genome-wide association studies (GWAS) have identified more than 90 susceptibility loci for breast cancer, but the underlying biology of those associations needs to be further elucidated. More genetic factors for breast cancer are yet to be identified but sample size constraints preclude the identification of individual genetic variants with weak effects using traditional GWAS methods. To address this challenge, we utilized a gene-level expression-based method, implemented in the MetaXcan software, to predict gene expression levels for 11,536 genes using expression quantitative trait loci and examine the genetically-predicted expression of specific genes for association with overall breast cancer risk and estrogen receptor (ER)-negative breast cancer risk. Using GWAS datasets from a Challenge launched by National Cancer Institute, we identified TP53INP2 (tumor protein p53-inducible nuclear protein 2) at 20q11.22 to be significantly associated with ER-negative breast cancer (Z = -5.013, p = 5.35×10-7, Bonferroni threshold = 4.33×10-6). The association was consistent across four GWAS datasets, representing European, African and Asian ancestry populations. There are 6 single nucleotide polymorphisms (SNPs) included in the prediction of TP53INP2 expression and five of them were associated with estrogen-receptor negative breast cancer, although none of the SNP-level associations reached genome-wide significance. We conducted a replication study using a dataset outside of the Challenge, and found the association between TP53INP2 and ER-negative breast cancer was significant (p = 5.07x10-3). Expression of HP (16q22.2) showed a suggestive association with ER-negative breast cancer in the discovery phase (Z = 4.30, p = 1.70x10-5) although the association was not significant after Bonferroni adjustment. Of the 249 genes that are 250 kb within known breast cancer susceptibility loci identified from previous GWAS, 20 genes (8.0%) were statistically significant associated with ER-negative breast cancer (p<0.05), compared to 582 (5.2%) of 11,287 genes that are not close to previous GWAS loci. This study demonstrated that expression-based gene mapping is a promising approach for identifying cancer susceptibility genes.
Project description:Genome-wide association studies (GWAS) have identified more than 170 breast cancer susceptibility loci. Here we hypothesize that some risk-associated variants might act in non-breast tissues, specifically adipose tissue and immune cells from blood and spleen. Using expression quantitative trait loci (eQTL) reported in these tissues, we identify 26 previously unreported, likely target genes of overall breast cancer risk variants, and 17 for estrogen receptor (ER)-negative breast cancer, several with a known immune function. We determine the directional effect of gene expression on disease risk measured based on single and multiple eQTL. In addition, using a gene-based test of association that considers eQTL from multiple tissues, we identify seven (and four) regions with variants associated with overall (and ER-negative) breast cancer risk, which were not reported in previous GWAS. Further investigation of the function of the implicated genes in breast and immune cells may provide insights into the etiology of breast cancer.
Project description:Genome-wide association studies (GWAS) of breast cancer defined by hormone receptor status have revealed loci contributing to susceptibility of estrogen receptor (ER)-negative subtypes. To identify additional genetic variants for ER-negative breast cancer, we conducted the largest meta-analysis of ER-negative disease to date, comprising 4754 ER-negative cases and 31 663 controls from three GWAS: NCI Breast and Prostate Cancer Cohort Consortium (BPC3) (2188 ER-negative cases; 25 519 controls of European ancestry), Triple Negative Breast Cancer Consortium (TNBCC) (1562 triple negative cases; 3399 controls of European ancestry) and African American Breast Cancer Consortium (AABC) (1004 ER-negative cases; 2745 controls). We performed in silico replication of 86 SNPs at P ≤ 1 × 10(-5) in an additional 11 209 breast cancer cases (946 with ER-negative disease) and 16 057 controls of Japanese, Latino and European ancestry. We identified two novel loci for breast cancer at 20q11 and 6q14. SNP rs2284378 at 20q11 was associated with ER-negative breast cancer (combined two-stage OR = 1.16; P = 1.1 × 10(-8)) but showed a weaker association with overall breast cancer (OR = 1.08, P = 1.3 × 10(-6)) based on 17 869 cases and 43 745 controls and no association with ER-positive disease (OR = 1.01, P = 0.67) based on 9965 cases and 22 902 controls. Similarly, rs17530068 at 6q14 was associated with breast cancer (OR = 1.12; P = 1.1 × 10(-9)), and with both ER-positive (OR = 1.09; P = 1.5 × 10(-5)) and ER-negative (OR = 1.16, P = 2.5 × 10(-7)) disease. We also confirmed three known loci associated with ER-negative (19p13) and both ER-negative and ER-positive breast cancer (6q25 and 12p11). Our results highlight the value of large-scale collaborative studies to identify novel breast cancer risk loci.
Project description:Breast cancer is the most common cancer among women. To date, 22 common breast cancer susceptibility loci have been identified accounting for ?8% of the heritability of the disease. We attempted to replicate 72 promising associations from two independent genome-wide association studies (GWAS) in ?70,000 cases and ?68,000 controls from 41 case-control studies and 9 breast cancer GWAS. We identified three new breast cancer risk loci at 12p11 (rs10771399; P = 2.7 × 10(-35)), 12q24 (rs1292011; P = 4.3 × 10(-19)) and 21q21 (rs2823093; P = 1.1 × 10(-12)). rs10771399 was associated with similar relative risks for both estrogen receptor (ER)-negative and ER-positive breast cancer, whereas the other two loci were associated only with ER-positive disease. Two of the loci lie in regions that contain strong plausible candidate genes: PTHLH (12p11) has a crucial role in mammary gland development and the establishment of bone metastasis in breast cancer, and NRIP1 (21q21) encodes an ER cofactor and has a role in the regulation of breast cancer cell growth.
Project description:Common variants in 94 loci have been associated with breast cancer including 15 loci with genome-wide significant associations (P<5 × 10(-8)) with oestrogen receptor (ER)-negative breast cancer and BRCA1-associated breast cancer risk. In this study, to identify new ER-negative susceptibility loci, we performed a meta-analysis of 11 genome-wide association studies (GWAS) consisting of 4,939 ER-negative cases and 14,352 controls, combined with 7,333 ER-negative cases and 42,468 controls and 15,252 BRCA1 mutation carriers genotyped on the iCOGS array. We identify four previously unidentified loci including two loci at 13q22 near KLF5, a 2p23.2 locus near WDR43 and a 2q33 locus near PPIL3 that display genome-wide significant associations with ER-negative breast cancer. In addition, 19 known breast cancer risk loci have genome-wide significant associations and 40 had moderate associations (P<0.05) with ER-negative disease. Using functional and eQTL studies we implicate TRMT61B and WDR43 at 2p23.2 and PPIL3 at 2q33 in ER-negative breast cancer aetiology. All ER-negative loci combined account for ?11% of familial relative risk for ER-negative disease and may contribute to improved ER-negative and BRCA1 breast cancer risk prediction.
Project description:Most common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10-8 with ten variants at nine new loci. At P < 0.05, we replicated associations with 10 of 11 variants previously reported in ER-negative disease or BRCA1 mutation carrier GWAS and observed consistent associations with ER-negative disease for 105 susceptibility variants identified by other studies. These 125 variants explain approximately 16% of the familial risk of this breast cancer subtype. There was high genetic correlation (0.72) between risk of ER-negative breast cancer and breast cancer risk for BRCA1 mutation carriers. These findings may lead to improved risk prediction and inform further fine-mapping and functional work to better understand the biological basis of ER-negative breast cancer.
Project description:Approximately 15-30% of all breast cancer tumors are estrogen receptor negative (ER-). Compared with ER-positive (ER+) disease they have an earlier age at onset and worse prognosis. Despite the vast number of risk variants identified for numerous cancer types, only seven loci have been unambiguously identified for ER-negative breast cancer. With the aim of identifying new susceptibility SNPs for this disease we performed a pleiotropic genome-wide association study (GWAS). We selected 3079 SNPs associated with a human complex trait or disease at genome-wide significance level (P<5 × 10(-8)) to perform a secondary analysis of an ER-negative GWAS from the National Cancer Institute's Breast and Prostate Cancer Cohort Consortium (BPC3), including 1998 cases and 2305 controls from prospective studies. We then tested the top ten associations (i.e. with the lowest P-values) using three additional populations with a total sample size of 3509 ER+ cases, 2543 ER- cases and 7031 healthy controls. None of the 3079 selected variants in the BPC3 ER-GWAS were significant at the adjusted threshold. 186 variants were associated with ER- breast cancer risk at a conventional threshold of P<0.05, with P-values ranging from 0.049 to 2.3 × 10(-4). None of the variants reached statistical significance in the replication phase. In conclusion, this study did not identify any novel susceptibility loci for ER-breast cancer using a "pleiotropic approach".
Project description:Breast cancers demonstrate substantial biological, clinical and etiological heterogeneity. We investigated breast cancer risk associations of eight susceptibility loci identified in GWAS and two putative susceptibility loci in candidate genes in relation to specific breast tumor subtypes. Subtypes were defined by five markers (ER, PR, HER2, CK5/6, EGFR) and other pathological and clinical features. Analyses included up to 30 040 invasive breast cancer cases and 53 692 controls from 31 studies within the Breast Cancer Association Consortium. We confirmed previous reports of stronger associations with ER+ than ER- tumors for six of the eight loci identified in GWAS: rs2981582 (10q26) (P-heterogeneity = 6.1 × 10(-18)), rs3803662 (16q12) (P = 3.7 × 10(-5)), rs13281615 (8q24) (P = 0.002), rs13387042 (2q35) (P = 0.006), rs4973768 (3p24) (P = 0.003) and rs6504950 (17q23) (P = 0.002). The two candidate loci, CASP8 (rs1045485, rs17468277) and TGFB1 (rs1982073), were most strongly related with the risk of PR negative tumors (P = 5.1 × 10(-6) and P = 4.1 × 10(-4), respectively), as previously suggested. Four of the eight loci identified in GWAS were associated with triple negative tumors (P ? 0.016): rs3803662 (16q12), rs889312 (5q11), rs3817198 (11p15) and rs13387042 (2q35); however, only two of them (16q12 and 2q35) were associated with tumors with the core basal phenotype (P ? 0.002). These analyses are consistent with different biological origins of breast cancers, and indicate that tumor stratification might help in the identification and characterization of novel risk factors for breast cancer subtypes. This may eventually result in further improvements in prevention, early detection and treatment.
Project description:A genome-wide association study (GWAS) identified single-nucleotide polymorphisms (SNPs) at 1p11.2 and 14q24.1 (RAD51L1) as breast cancer susceptibility loci. The initial GWAS suggested stronger effects for both loci for estrogen receptor (ER)-positive tumors. Using data from the Breast Cancer Association Consortium (BCAC), we sought to determine whether risks differ by ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2), grade, node status, tumor size, and ductal or lobular morphology. We genotyped rs11249433 at 1p.11.2, and two highly correlated SNPs rs999737 and rs10483813 (r(2)= 0.98) at 14q24.1 (RAD51L1), for up to 46 036 invasive breast cancer cases and 46 930 controls from 39 studies. Analyses by tumor characteristics focused on subjects reporting to be white women of European ancestry and were based on 25 458 cases, of which 87% had ER data. The SNP at 1p11.2 showed significantly stronger associations with ER-positive tumors [per-allele odds ratio (OR) for ER-positive tumors was 1.13, 95% CI = 1.10-1.16 and, for ER-negative tumors, OR was 1.03, 95% CI = 0.98-1.07, case-only P-heterogeneity = 7.6 × 10(-5)]. The association with ER-positive tumors was stronger for tumors of lower grade (case-only P= 6.7 × 10(-3)) and lobular histology (case-only P= 0.01). SNPs at 14q24.1 were associated with risk for most tumor subtypes evaluated, including triple-negative breast cancers, which has not been described previously. Our results underscore the need for large pooling efforts with tumor pathology data to help refine risk estimates for SNP associations with susceptibility to different subtypes of breast cancer.
Project description:Most genome-wide association studies (GWAS) have been carried out in European ancestry populations; no risk variants for breast cancer have been identified solely from African ancestry GWAS data. Few GWAS hits have replicated in African ancestry populations.In a nested case-control study of breast cancer in the Black Women's Health Study (1,199 cases/1,948 controls), we evaluated index single-nucleotide polymorphisms (SNP) in 21 loci from GWAS of European or Asian ancestry populations, overall, in subtypes defined by estrogen receptor (ER) and progesterone receptor (PR) status (ER+/PR+, n = 336; ER-/PR-, n = 229), and in triple-negative breast cancer (TNBC, N = 81). To evaluate the contribution of genetic factors to population differences in breast cancer subtype, we also examined global percent African ancestry.Index SNPs in five loci were replicated, including three associated with ER-/PR- breast cancer (TERT rs10069690 in 5p15.33, rs704010 in 10q22.3, and rs8170 in 19p13.11): per allele ORs were 1.29 [95% confidence interval (CI) 1.04-1.59], P = 0.02, 1.52 (95% CI 1.12-2.08), P = 0.01, and 1.30 (95% CI 1.01-1.68), P = 0.04, respectively. Stronger associations were observed for TNBC. Furthermore, cases in the highest quintile of percent African ancestry were three times more likely to have TNBC than ER+/PR+ cancer.These findings provide the first confirmation of the TNBC SNP rs8170 in an African ancestry population, and independent confirmation of the TERT ER- SNP. Furthermore, the risk of developing ER- breast cancer, particularly TNBC, increased with increasing proportion of global African ancestry.The findings illustrate the importance of genetic factors in the disproportionately high occurrence of TNBC in African American women.
Project description:Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ER?-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ER? binding and monoallelic-repression of IGFBP5 following oestrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR?=?0.68 95%CI 0.55-0.83, P?=?0.0002; replication OR?=?0.77 95% CI 0.73-0.82, P?=?2.1 × 10-19) and identify 13 additional linked variants (r2?>?0.8) in the 20Kb linkage block containing the enCNV (P?=?3.2?×?10-15 - 5.6?×?10-17). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5.