Project description:<p>The genome-wide association study (GWAS) includes participants enrolled into two different studies. The first study, the San Francisco Bay Area Cancer Study (SFBCS) is a population-based case-control study of breast cancer conducted in the San Francisco Bay Area and included women ages 35-79 years from three racial/ethnic groups: Non-Hispanic whites, African Americans, and Hispanics/Latinas. For the GWAS, only Hispanic/Latina women were included. Women diagnosed with invasive breast cancer between 1995 and 2002 were identified through the Greater Bay Area Cancer Registry. Controls were identified by random digit dialing and were frequency-matched to cases by age in 5 year increments and by race/ethnicity. Hispanic/Latina ethnicity was assessed by self-report. 175 Hispanic/Latina cases and 307 Hispanic/Latina controls from the SFBCS had given adequate consent and provided biospecimens that were used in the GWAS to be included in this data submission.</p> <p>The second study is the Northern California site of the Breast Cancer Family Registry (NC-BCFR). This population-based family study recruited breast cancer cases ages 18-64 years diagnosed from 1995-2009 that were identified through the Greater Bay Area Cancer Registry. Cases included all women at increased genetic susceptibility for breast cancer who met one or more of the following criteria: (a) being diagnosed with breast cancer at age <35 years; b) having a personal history of ovarian cancer or childhood cancer; (c) being diagnosed with two different breast cancers (bilateral breast cancers), with the first one diagnosed at age <50 years; and d) having one or more first-degree relatives with breast cancer, ovarian cancer or childhood cancer. Cases not meeting these criteria were randomly sampled and racial/ethnic minorities were oversampled. Controls were recruited by random digit dialing and were matched by 5-year age increments and by race/ethnicity. For the current GWAS only Latina/Hispanic cases and controls were included. Latina/Hispanic ethnicity was assessed by self-report. 631 Hispanic/Latina cases and 61 Hispanic/Latina controls from the NC-BCFR had given adequate consent and provided biospecimens that were used in the GWAS to be included in this data submission.</p>
| phs000912 | dbGaP
Project description:Pelagic estuarine microbial communities from San Francisco Bay, California, US
Project description:Environmental extracts and fractions obtained with passive sampling (HP-20, SMIRC) from Mission Bay, Point Loma, Scripps Pier, Salton Sea (San Diego and Imperial Counties)
Project description:To gain improved temporal, spatial and phylogenetic resolution of marine microbial communities, in this study we expanded the original prototype genome proxy array (an oligonucleotide microarray targeting marine microbial genome fragments and genomes), evaluated it against metagenomic sequencing, and applied it to time series samples from the Monterey Bay long term ecological research site. The expanded array targeted 268 microbial genotypes (vs. 14 in the original prototype) across much of the known diversity of cultured and uncultured marine microbes. The target abundances measured by the genome proxy array were highly correlated to pyrosequence-based abundances (linear regression R2 = 0.85-0.91, p<0.0001). Fifty-seven samples from ~4-years in Monterey Bay were examined with the array, spanning the photic zone (0m), the base of the surface mixed layer (30m), and the subphotic zone (200m). A significant portion of the expanded genome proxy array’s targets showed signal (95 out of 268 targets present in ≥ 1 sample). The multi-year community survey showed the consistent presence of a core group of common and abundant targeted taxa at each depth in Monterey Bay, higher variability among shallow than deep samples, and episodic occurrences of more transient marine genotypes. The abundance of the most dominant genotypes peaked after strong episodic upwelling events. The genome-proxy array’s ability to track populations of closely-related genotypes indicated population shifts within several abundant target taxa, with specific populations in some cases clustering by depth or oceanographic season. Although 51 cultivated organisms were targeted (representing 19% of the array) the majority of targets detected and of total target signal (85% and ~92%, respectively) were from uncultivated lineages, often those derived from Monterey Bay. The array provided cost-effective (~$15 per array, for construction and use) insights into the natural history of uncultivated lineages in the wild. To gain improved temporal, spatial and phylogenetic resolution of marine microbial communities, in this study we expanded the original prototype genome proxy array (an oligonucleotide microarray targeting marine microbial genome fragments and genomes), evaluated it against metagenomic sequencing, and applied it to time series samples from the Monterey Bay long term ecological research site. The expanded array targeted 268 microbial genotypes (vs. 14 in the original prototype) across much of the known diversity of cultured and uncultured marine microbes. The target abundances measured by the genome proxy array were highly correlated to pyrosequence-based abundances (linear regression R2 = 0.85-0.91, p<0.0001). Fifty-seven samples from ~4-years in Monterey Bay were examined with the array, spanning the photic zone (0m), the base of the surface mixed layer (30m), and the subphotic zone (200m). A significant portion of the expanded genome proxy array’s targets showed signal (95 out of 268 targets present in ≥ 1 sample). The multi-year community survey showed the consistent presence of a core group of common and abundant targeted taxa at each depth in Monterey Bay, higher variability among shallow than deep samples, and episodic occurrences of more transient marine genotypes. The abundance of the most dominant genotypes peaked after strong episodic upwelling events. The genome-proxy array’s ability to track populations of closely-related genotypes indicated population shifts within several abundant target taxa, with specific populations in some cases clustering by depth or oceanographic season. Although 51 cultivated organisms were targeted (representing 19% of the array) the majority of targets detected and of total target signal (85% and ~92%, respectively) were from uncultivated lineages, often those derived from Monterey Bay. The array provided cost-effective (~$15 per array, for construction and use) insights into the natural history of uncultivated lineages in the wild.