refine.bio
  • Search
      • Normalized Compendia
      • RNA-seq Sample Compendia
  • Docs
  • About
  • My Dataset
github link
Showing
of 252 results
Sort by

Filters

Technology

Platform

accession-icon SRP061522
Truncation of LOC100288798 (SLC38A4-AS) lncRNA in human haploid KBM7 cell line
  • organism-icon Homo sapiens
  • sample-icon 16 Downloadable Samples
  • Technology Badge IconIlluminaHiSeq2000

Description

Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3'' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator. Overall design: We cultured and processed 8 KBM7 cell lines in one batch. These cell lines were: two wild type KBM7 cells (WT2 and WT3), two monoclonal KBM7 cell lines with gene trap cassette insertions outside of the body of LOC100288798 (C1 and C2), two independently obtained KBM7 clones with gene trap cassette insertion 3kb downstream LOC100288798 transcriptional start site (TSS) (3kb1 and 3kb2), one independently obtained KBM7 clone with gene trap cassette insertion 100kb downstream LOC100288798 TSS replicated twice at the thawing step (100kb1 and 100kb2). We isolated total RNA from all th 8 cell lines, applied DNAseI treatment and ribosomal RNA depletion, and thhen prepared strand-specific RNA-seq libraries, which were pooled in equal molarities and sequenced using Illumina HiSeq 2000 (8 pooled samples were sequence on 2 lanes). We performed 50bp single-end RNA-seq. We used these 8 samples (4 untreated: WT2, WT3, C1, C2 and 4 treated:3kb1, 3kb2, 100kbk1, 100kb2) to analyze genome-wide gene deregulation associated with LOC100288798 lncRNA truncation

Publication Title

A human haploid gene trap collection to study lncRNAs with unusual RNA biology.

Sample Metadata Fields

No sample metadata fields

View Samples
accession-icon SRP059959
Long non-coding RNAs display higher natural expression variation than protein-coding genes in healthy humans
  • organism-icon Homo sapiens
  • sample-icon 57 Downloadable Samples
  • Technology Badge IconIllumina HiSeq 2000

Description

Background: Long non-coding RNAs (lncRNAs) are increasingly implicated as gene regulators and may ultimately be more numerous than protein-coding genes in the human genome. Despite large numbers of reported lncRNAs, reference annotations are likely incomplete due to their lower and tighter tissue-specific expression compared to mRNAs. An unexplored factor potentially confounding lncRNA identification is inter-individual expression variability. Here, we characterize lncRNA natural expression variability in human primary granulocytes. Results: We annotate granulocyte lncRNAs and mRNAs in RNA-seq data from ten healthy individuals, identifying multiple lncRNAs absent from reference annotations, and use this to investigate three known features (higher tissue-specificity, lower expression, and reduced splicing efficiency) of lncRNAs relative to mRNAs. Expression variability was examined in seven individuals sampled three times at one or more than one month intervals. We show that lncRNAs display significantly more inter-individual expression variability compared to mRNAs. We confirm this finding in 2 independent human datasets by analyzing multiple tissues from the GTEx project and lymphoblastoid cell lines from the GEUVADIS project. Using the latter dataset we also show that including more human donors into the transcriptome annotation pipeline allows identification of an increasing number of lncRNAs, but minimally affects mRNA gene number. Conclusions: A comprehensive annotation of lncRNAs is known to require an approach that is sensitive to low and tight tissue-specific expression. Here we show that increased inter-individual expression variability is an additional general lncRNA feature to consider when creating a comprehensive annotation of human lncRNAs or proposing their use as prognostic or disease markers. Overall design: We used PolyA+ RNA-seq data from human primary granulocytes of 10 healthy individuals to de novo annotate lncRNAs and mRNAs in this cell type and ribosomal depleted (total) RNA-seq data from seven of these individuals sampled three times to analyze lncRNA amd mRNA expression variability

Publication Title

Long non-coding RNAs display higher natural expression variation than protein-coding genes in healthy humans.

Sample Metadata Fields

No sample metadata fields

View Samples
accession-icon GSE50786
Comparison of histone deacetylase 9-1 mutant (SALK_001723) dry seed transcriptome with Col wild-type
  • organism-icon Arabidopsis thaliana
  • sample-icon 6 Downloadable Samples
  • Technology Badge Icon Affymetrix Arabidopsis ATH1 Genome Array (ath1121501)

Description

Analysis of the transcriptome of dry hda9-1 mutant seeds with those of Col wild-type seeds, using Affymetrix GeneChip Arabidopsis ATH1 Genome Array.

Publication Title

HISTONE DEACETYLASE 9 represses seedling traits in Arabidopsis thaliana dry seeds.

Sample Metadata Fields

Specimen part

View Samples
accession-icon GSE41802
Isocitrata Dehydrogenase (IDH) Mutations Promote a Reversible ZEB1/mir-200-Dependent Epithelial Mesenchymal Transition (EMT)
  • organism-icon Homo sapiens
  • sample-icon 18 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Genome U133 Plus 2.0 Array (hgu133plus2)

Description

Mutations in the genes encoding isocitrate dehydrogenase 1 and 2 (IDH1/2) occur in a variety of tumor types, resulting in production of the proposed oncometabolite, 2-hydroxyglutarate (2-HG). How mutant IDH and 2-HG alter signaling pathways to promote cancer, though, remains unclear. Additionally, there exist relatively few cell lines with IDH mutations. To examine the effect of endogenous IDH mutations and 2-HG, we created a panel of isogenic epithelial cell lines with either wild-type IDH1/2 or clinically relevant IDH1/2 mutations. Differences were noted in the ability of IDH mutations to cause robust 2-HG accumulation. IDH1/2 mutants that produce high levels of 2-HG cause an epithelial-mesenchymal transition (EMT)-like phenotype, characterized by changes in EMT-related gene expression and cellular morphology. 2-HG is sufficient to recapitulate aspects of this phenotype in the absence of an IDH mutation. In the cells types examined, mutant IDH-induced EMT is dependent on upregulation of the transcription factor ZEB1 and downregulation of the mir-200 family of microRNAs. Furthermore, sustained knockdown of IDH1 in IDH1 R132H mutant cells is sufficient to reverse many characteristics of EMT, demonstrating that continued expression of mutant IDH is required to maintain this phenotype. These results suggest mutant IDH proteins can reversibly deregulate discrete signaling pathways that contribute to tumorigenesis

Publication Title

Isocitrate dehydrogenase (IDH) mutations promote a reversible ZEB1/microRNA (miR)-200-dependent epithelial-mesenchymal transition (EMT).

Sample Metadata Fields

Cell line

View Samples
accession-icon GSE36036
Niche modulated versus niche modulating genes in multiple myeloma
  • organism-icon Homo sapiens
  • sample-icon 31 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

Background. Multiple myeloma (MM) cells depend on the bone marrow (BM) niche for growth and survival. However, the tumor genes regulated by the niche are largely unknown.

Publication Title

Niche-modulated and niche-modulating genes in bone marrow cells.

Sample Metadata Fields

Disease, Disease stage, Time

View Samples
accession-icon GSE39184
Contact versus contactless signatures in leukemia
  • organism-icon Homo sapiens
  • sample-icon 17 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

Gene expression profile (GEP) was analyzed in bone marrow (BM) samples from patients with leukemia or leukemic phase of lymphoma at different time points following aspiration. Among numerous changes in GEP evolved over time a discrete subset of > 60 genes exhibited prompt and sustained switch in expression consistently. Similar results were discovered recently in BM samples from patients with multiple myeloma (GSE36036). GEP was also examined in peripheral blood as well as in BM samples depleted of red blood cells (=WBC) and in cultured cells from some of the patients.

Publication Title

Niche-modulated and niche-modulating genes in bone marrow cells.

Sample Metadata Fields

Specimen part, Disease

View Samples
accession-icon GSE60771
Testing gene expression changes in VCaP upon depletion of the mutated ETS transcription factor ERG
  • organism-icon Homo sapiens
  • sample-icon 16 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Genome U133 Plus 2.0 Array (hgu133plus2)

Description

VCaP cells expressing inducible shRNAs for either ERG or a non-targeting control were treated with Doxycycline for 1, 3, 7 and 10 days prior to collection

Publication Title

TMPRSS2:ERG blocks neuroendocrine and luminal cell differentiation to maintain prostate cancer proliferation.

Sample Metadata Fields

No sample metadata fields

View Samples
accession-icon SRP046010
Investigating gene expression changes in wildtype and TMPRSS2-ERG homozygous mouse prostate tissue
  • organism-icon Mus musculus
  • sample-icon 4 Downloadable Samples
  • Technology Badge IconIllumina HiSeq 2000

Description

A transgenic TMPRSS2:ERG mouse model was engineered in FVB background and compared to its wildtype counterpart in the absence of any treatment This experiment is designed to look at ERG-dependent changes in phenotype and gene expression Overall design: A loxP-GFP-loxP-hERG exon 4-11 cassette was inserted into a BAC clone containing the TMPRSS2 locus using a recombineering kit. This modified BAC was used for pronuclear injection and generation of germline-transmitting mice. One line expressing high GFP was used for pronuclear injection of Cre protein and one sub-line that transmitted the TMPRSS2:ERG transgene into the germline was subsequently bred to homozygosity.

Publication Title

TMPRSS2:ERG blocks neuroendocrine and luminal cell differentiation to maintain prostate cancer proliferation.

Sample Metadata Fields

No sample metadata fields

View Samples
accession-icon GSE1460
Gene Expression Profile during human CD4+ T cell differentiation
  • organism-icon Homo sapiens
  • sample-icon 30 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Genome U133A Array (hgu133a)

Description

Subpopulations of human fetal thymocyte and circulating nave T cells were obtained through FACS sorting, including CD3-CD4+CD8- intrathymic T progenitor cells (ITTP), CD3intCD4+CD8+ "double positive" thymocytes (DP), CD3highCD4+CD8- "single positive" thymocytes (SP4), CD3+CD4+CD8-CD45RA+CD62L+ nave T cells from cord blood (CB4+), and CD3+CD4+CD8-CD45RA+CD62L+ nave T cells from adult blood (AB4+).

Publication Title

Gene expression profiles during human CD4+ T cell differentiation.

Sample Metadata Fields

No sample metadata fields

View Samples
accession-icon SRP105534
UHRF2 regulates local 5-methylcytosine and suppresses spontaneous seizures [RNA-seq]
  • organism-icon Mus musculus
  • sample-icon 8 Downloadable Samples
  • Technology Badge IconIllumina HiSeq 2500

Description

5-methylcytosine (5mC) regulates multiple cellular processes and is faithfully maintained following DNA replication. Ubiquitin-like PHD and ring finger domain-containing protein 1 (UHRF1) plays an important role in the maintenance of 5mC levels. Interestingly, UHRF1 has a paralog UHRF2 that has similar sequence and domain architecture, but the biological function of UHRF2 is not clear. Here, we have generated Uhrf2 knockout mice and characterized the role of UHRF2 in vivo. Uhrf2 knockout mice are viable, but the adult mice develop frequent spontaneous seizures and display abnormal electrical activities in brain. To explore possible mechanism beyond these phenomenon, we utilized high-throughput sequencing to identify global expression changes in Uhrf2 knockout mice brains. In addition, we explored genome-wide 5mc profiles in these samples to examine if UHRF2 regulates 5mc levels in specific genome loci. Overall design: Examination of global gene expressions in brains of wild type and Uhrf2 knockout mice by RNA-seq.

Publication Title

UHRF2 regulates local 5-methylcytosine and suppresses spontaneous seizures.

Sample Metadata Fields

Specimen part, Subject

View Samples
...

refine.bio is a repository of uniformly processed and normalized, ready-to-use transcriptome data from publicly available sources. refine.bio is a project of the Childhood Cancer Data Lab (CCDL)

fund-icon Fund the CCDL

Developed by the Childhood Cancer Data Lab

Powered by Alex's Lemonade Stand Foundation

Cite refine.bio

Casey S. Greene, Dongbo Hu, Richard W. W. Jones, Stephanie Liu, David S. Mejia, Rob Patro, Stephen R. Piccolo, Ariel Rodriguez Romero, Hirak Sarkar, Candace L. Savonen, Jaclyn N. Taroni, William E. Vauclain, Deepashree Venkatesh Prasad, Kurt G. Wheeler. refine.bio: a resource of uniformly processed publicly available gene expression datasets.
URL: https://www.refine.bio

Note that the contributor list is in alphabetical order as we prepare a manuscript for submission.

BSD 3-Clause LicensePrivacyTerms of UseContact