Project description:Hepatitis C virus subtype 3a is a highly prevalent and globally distributed strain that is often associated with infection via injection drug use. This subtype exhibits particular phenotypic characteristics. In spite of this, detailed genetic analysis of this subtype has rarely been performed. We performed full-length viral sequence analysis in 18 patients with chronic HCV subtype 3a infection and assessed genomic viral variability in comparison to other HCV subtypes. Two novel regions of intragenotypic hypervariability within the envelope protein E2, of HCV genotype 3a, were identified. We named these regions HVR495 and HVR575. They consisted of flanking conserved hydrophobic amino acids and central variable residues. A 5-amino-acid insertion found only in genotype 3a and a putative glycosylation site is contained within HVR575. Evolutionary analysis of E2 showed that positively selected sites within genotype 3a infection were largely restricted to HVR1, HVR495, and HVR575. Further analysis of clonal viral populations within single hosts showed that viral variation within HVR495 and HVR575 were subject to intrahost positive selecting forces. Longitudinal analysis of four patients with acute HCV subtype 3a infection sampled at multiple time points showed that positively selected mutations within HVR495 and HVR575 arose early during primary infection. HVR495 and HVR575 were not present in HCV subtypes 1a, 1b, 2a, or 6a. Some variability that was not subject to positive selection was present in subtype 4a HVR575. Further defining the functional significance of these regions may have important implications for genotype 3a E2 virus-receptor interactions and for vaccine studies that aim to induce cross-reactive anti-E2 antibodies.
Project description:Goldfish Carassius auratus is an ideal model for exploring fish morphology evolution. Although genes underlying several ornamental traits have been identified, little is known about the effects of artificial selection on embryo gene expression. In the present study, hybrid transcriptome sequencing was conducted to reveal gene expression profiles of Celestial-Eye (CE) and Ryukin (RK) goldfish embryos. Full-length transcriptome sequencing on the PacBio platform identified 54,218 and 54,106 transcript isoforms in CE and RK goldfish, respectively. Of particular note was that thousands of alternative splicing (AS) and alternative polyadenylation (APA) events were identified in both goldfish breeds, and most of them were inter-breed specific. RT-PCR and Sanger sequencing showed that most of the predicted AS and APA were correct. Moreover, abundant long non-coding RNA and fusion genes were detected, and again most of them were inter-breed specific. Through RNA-seq, we detected thousands of differentially expressed genes (DEGs) in each embryonic stage between the two goldfish breeds. KEGG enrichment analysis on DEGs showed extensive differences between CE and RK goldfish in gene expression. Taken together, our results demonstrated that artificial selection has led to far-reaching influences on goldfish gene expression, which probably laid the genetic basis for hundreds of goldfish variations.
Project description:Dyslipidemia and obesity are especially prevalent in populations with Amerindian backgrounds, such as Mexican-Americans, which predispose these populations to cardiovascular disease. Here we design an approach, known as the cross-population allele screen (CPAS), which we conduct prior to a genome-wide association study (GWAS) in 19,273 Europeans and Mexicans, in order to identify Amerindian risk genes in Mexicans. Utilizing CPAS to restrict the GWAS input variants to only those differing in frequency between the two populations, we identify novel Amerindian lipid genes, receptor-related orphan receptor alpha (RORA) and salt-inducible kinase 3 (SIK3), and three loci previously unassociated with dyslipidemia or obesity. We also detect lipoprotein lipase (LPL) and apolipoprotein A5 (APOA5) harbouring specific Amerindian signatures of risk variants and haplotypes. Notably, we observe that SIK3 and one novel lipid locus underwent positive selection in Mexicans. Furthermore, after a high-fat meal, the SIK3 risk variant carriers display high triglyceride levels. These findings suggest that Amerindian-specific genetic architecture leads to a higher incidence of dyslipidemia and obesity in modern Mexicans.
Project description:Litsea is a group of evergreen trees or shrubs in the laurel family, Lauraceae. Species of the genus are widely used for a wide range of medicinal and industrial aspects. At present, most studies related to the gene resources of Litsea are restricted to morphological analyses or features of individual genomes, and currently available studies of select molecular markers are insufficient. In this study, we assembled and annotated the complete chloroplast genomes of nine species in Litsea, carried out a series of comparative analyses, and reconstructed phylogenetic relationships within the genus. The genome length ranged from 152,051 to 152,747 bp and a total of 128 genes were identified. High consistency patterns of codon bias, repeats, divergent analysis, single nucleotide polymorphisms (SNP) and insertions and deletions (InDels) were discovered across the genus. Variations in gene length and the presence of the pseudogene ycf1Ψ, resulting from IR contraction and expansion, are reported. The hyper-variable gene rpl16 was identified for its exceptionally high Ka/Ks and Pi values, implying that those frequent mutations occurred as a result of positive selection. Phylogenetic relationships were recovered for the genus based on analyses of full chloroplast genomes and protein-coding genes. Overall, both genome sequences and potential molecular markers provided in this study enrich the available genomic resources for species of Litsea. Valuable genomic resources and divergent analysis are also provided for further research of the evolutionary patterns, molecular markers, and deeper phylogenetic relationships of Litsea.
Project description:The peculiar position of Sardinia in the Mediterranean sea has rendered its population an interesting biogeographical isolate. The aim of this study was to investigate the genetic population structure, as well as to estimate Runs of Homozygosity and regions under positive selection, using about 1.2 million single nucleotide polymorphisms genotyped in 1077 Sardinian individuals. Using four different methods--fixation index, inflation factor, principal component analysis and ancestry estimation--we were able to highlight, as expected for a genetic isolate, the high internal homogeneity of the island. Sardinians showed a higher percentage of genome covered by RoHs>0.5 Mb (F(RoH%0.5)) when compared to peninsular Italians, with the only exception of the area surrounding Alghero. We furthermore identified 9 genomic regions showing signs of positive selection and, we re-captured many previously inferred signals. Other regions harbor novel candidate genes for positive selection, like TMEM252, or regions containing long non coding RNA. With the present study we confirmed the high genetic homogeneity of Sardinia that may be explained by the shared ancestry combined with the action of evolutionary forces.
Project description:Research into antiviral agents directed at hepatitis C virus (HCV) proteins is commonly based and tested on a single genotype, namely, genotype 1. This is despite the high level of variability of the RNA virus and the frequency of infection with genotypes other than genotype 1. The systematic evolution of ligands by exponential enrichment (SELEX) is a novel in vitro approach used in this study that allows rapid screening of vast nucleic acid libraries to isolate sequences (termed aptamers) that bind to target proteins with high affinity. The SELEX approach was used in the present study to isolate DNA aptamers to the RNA-dependent RNA polymerase (RdRp) (nonstructural protein 5B [NS5B]) of HCV subtype 3a, with the aim of inhibiting polymerase activity. Ten rounds of selection were performed using a Biacore 2000 as the partitioning system. Two aptamers, r10/43 and r10/47, were chosen for further studies on the basis of their abilities to bind the HCV RdRp and inhibit polymerase activity. The affinities (equilibrium dissociation constants) of these aptamers for the HCV subtype 3a polymerase were estimated to be 1.3 +/- 0.3 nM (r10/43) and 23.5 +/- 6.7 nM (r10/47). The inhibition constants of r10/43 and r10/47 were estimated to be 1.4 +/- 2.4 nM and 6.0 +/- 2.3 nM, respectively. Inhibition of HCV 3a polymerase was specific for r10/47, while r10/43 also demonstrated some inhibitory effect on norovirus and phi6 polymerase activity. Neither r10/43 nor r10/47 was able to inhibit the RdRp activity of HCV genotype 1a and 1b polymerases. This study is the first description of an inhibitor specific to the HCV subtype 3a polymerase.
Project description:GB virus C (GBV-C or hepatitis G virus) is a recently described flavivirus which frequently leads to chronic viremia in humans. Although GBV-C is associated with acute posttransfusion hepatitis, it is not clear if the virus is pathogenic for humans. We constructed a full-length cDNA from the plasma of a person with chronic GBV-C viremia. Peripheral blood mononuclear cells (PBMCs) transfected with full-length RNA transcripts from this GBV-C clone resulted in viral replication. This was demonstrated by serial passage of virus from cell culture supernatants, detection of increasing concentrations of positive- and negative-sense GBV-C RNA over time, and the detection of the GBV-C E2 antigen by confocal microscopy. In addition, two types of GBV-C particles were identified in cell lysates; these particles had buoyant densities of 1.06 and 1.12 to 1.17 g/ml in sucrose gradients. PBMCs sorted for expression of CD4 contained 100-fold-more GBV-C RNA than CD4-negative cells. Taken together, these data demonstrate that RNA transcripts from GBV-C full-length cDNA are infectious in primary CD4-positive T cells. In contrast, RNA transcripts from an infectious hepatitis C virus clone did not replicate in the same cell culture system. Infectious RNA transcripts from GBV-C cDNA should prove useful for studying viral replication and may allow identification of differences between GBV-C and hepatitis C virus cultivation in vitro.
Project description:Copy number variants (CNVs), defined as losses and gains of segments of genomic DNA, are a major source of genomic variation.In this study, we identified over 2,000 human CNVs that overlap with orthologous chimpanzee or orthologous macaque CNVs. Of these, 170 CNVs overlap with both chimpanzee and macaque CNVs, and these were collapsed into 34 hotspot regions of CNV formation. Many of these hotspot regions of CNV formation are functionally relevant, with a bias toward genes involved in immune function, some of which were previously shown to evolve under balancing selection in humans. The genes in these primate CNV formation hotspots have significant differential expression levels between species and show evidence for positive selection, indicating that they have evolved under species-specific, directional selection.These hotspots of primate CNV formation provide a novel perspective on divergence and selective pressures acting on these genomic regions.
Project description:Eight genotypes of the hepatitis E virus (Orthohepevirus A; HEV) designated HEV-1 to HEV-8 have been reported from various mammalian hosts. Notably, domestic pigs and wild boars are the natural reservoirs of HEV-3 and HEV-4 genotypes with zoonotic propensity. Since HEV infection in domestic pigs is usually subclinical, it may remain undetected, facilitating zoonotic spillover of HEV to the exposed human populations. A previous study from our group in 2021, using deep sequencing of a pooled saliva sample, generated various swine enteric virus genomes, including a near full-length swine HEV genome (7040 nt; 97.7% genome coverage) from five-month-old grower pigs at a backyard pig farm in the uMgungundlovu District, KwaZulu-Natal, South Africa. In the present study, we describe the further characterization, including genotyping and subtyping of the swine HEV isolate using phylogenetics and ‘HEVnet Typing Tool’. Our analyses confirmed that the South African swine HEV genome characterized in this study belonged to HEV genotype 3 subtype 3c (HEV-3c). While HEV-3c infections in domestic pigs have been previously reported from Brazil, Germany, Italy, and the Netherlands, they only generated partial genome sequences of open reading frame 1 (ORF1) and/or ORF2. To our knowledge, this is the first near full-length swine HEV-3c genome generated from naturally infected domestic pigs (Sus scrofa domesticus) in South Africa. However, due to the gap in the information on the HEV-3c genome sequences in various geographical locations worldwide, including South Africa, the epidemiology of the South African swine HEV genome characterized in this study remains inconclusive. Molecular and genomic surveillance of HEV in domestic pig populations in South Africa would be useful to determine their prevalence, circulating subtypes, and zoonosis risk.
Project description:Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV) have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence from a genome wide, by comparisons of evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one genomic region at the 3'-terminal of the papain-like cysteine protease domain, were detected to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed the identical performances between these regions and the full-length genome in genotyping, in which the HEV isolates involved could be divided into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.