refine.bio
  • Search
      • Normalized Compendia
      • RNA-seq Sample Compendia
  • Docs
  • About
  • My Dataset
github link
Showing
of 33 results
Sort by

Filters

Technology

Platform

accession-icon GSE119416
Expression data from cytokine producing human CD4+ T cells
  • organism-icon Homo sapiens
  • sample-icon 36 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Genome U133 Plus 2.0 Array (hgu133plus2)

Description

Immune system homeostasis depends on signals that drive effector (like secretion of pro-inflammatory cytokines like IFNg) and regulatory (like secretion of the anti-inflammatory cytokine IL-10) functions.

Publication Title

The cholesterol biosynthesis pathway regulates IL-10 expression in human Th1 cells.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon GSE49355
Specific extracellular matrix remodeling signature of colon hepatic metastases [HG-U133A]
  • organism-icon Homo sapiens
  • sample-icon 56 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Genome U133A Array (hgu133a)

Description

To identify genes implicated in metastatic colonization of the liver in colorectal cancer, we collected pairs of primary tumors and hepatic metastases before chemotherapy in 13 patients. We compared mRNA expression in the pairs of patients to identify genes deregulated during metastatic evolution. We then validated the identified genes using data obtained by different groups. The 33-gene signature was able to classify 87% of hepatic metastases, 98% of primary tumors, 97% of normal colon mucosa, and 95% of normal liver tissues in six datasets obtained using five different microarray platforms. The identified genes are specific to colon cancer and hepatic metastases since other metastatic locations and hepatic metastases originating from breast cancer were not classified by the signature. Gene Ontology term analysis showed that 50% of the genes are implicated in extracellular matrix remodeling, and more precisely in cell adhesion, extracellular matrix organization and angiogenesis. Because of the high efficiency of the signature to classify colon hepatic metastases, the identified genes represent promising targets to develop new therapies that will specifically affect hepatic metastasis microenvironment.

Publication Title

Specific extracellular matrix remodeling signature of colon hepatic metastases.

Sample Metadata Fields

Sex, Age, Specimen part, Subject

View Samples
accession-icon GSE36398
Transcriptional profiling in facioscapulohumeral muscular dystrophy to identify candidate biomarkers
  • organism-icon Homo sapiens
  • sample-icon 50 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

Facioscapulohumeral muscular dystrophy (FSHD) is a progressive neuromuscular disorder caused by contractions of repetitive elements within the macrosatellite D4Z4 on chromosome 4q35. In order to develop mRNA-based biomarkers of affected muscles, we used GeneChip Gene 1.0 ST arrays for global analysis of gene expression in muscle biopsy specimens obtained from FSHD subjects and their unaffected first degree relatives.

Publication Title

Transcriptional profiling in facioscapulohumeral muscular dystrophy to identify candidate biomarkers.

Sample Metadata Fields

Sex, Specimen part, Disease, Disease stage, Subject

View Samples
accession-icon SRP154576
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (9 cell mixture dataset).
  • organism-icon Homo sapiens
  • sample-icon 5 Downloadable Samples
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the 9 cell mixture dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP186516
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods [5 Cell Lines Cel-seq]
  • organism-icon Homo sapiens
  • sample-icon 3 Downloadable Samples
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 5 human lung adenocarcinoma cell lines H2228, H1975, A549, H838 and HCC827. For the single cell designs, the five cell lines were mixed equally and processed by 10X chromium and CEL-seq2, referred to as sc_10X_5cl, and sc_CEL-seq2_5cl respectively in analysis that follows. For CEL-seq2, three plates were sorted and processed.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Subject

View Samples
accession-icon SRP155038
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (RNAmix_CEL-seq2 )
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the RNAmix_CEL-seq2 dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP158266
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (Drop-Seq)
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the RNAmix_CEL-seq2 dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP155039
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (RNAmix_Sort-seq)
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP074596
RNAseq of microglia from Rab7 Mutants & Control and Wild-Type mice
  • organism-icon Mus musculus
  • sample-icon 10 Downloadable Samples
  • Technology Badge IconIon Torrent Proton

Description

We purified by magnet assisted cell sorting microglial cells from brains of adult Rab7 null mutant, aged mice and respective controls, isolated total RNA and performed RNAseq to determine the transciptome profiles. Overall design: Examination of transcriptomes of Rab7 null mutants and control (2 replicates each) and aged mice and young controls (3 replicates each)

Publication Title

Age-related myelin degradation burdens the clearance function of microglia during aging.

Sample Metadata Fields

Age, Specimen part, Cell line, Subject

View Samples
accession-icon GSE32624
Functional and epigenetic studies reveal multistep differentiation and plasticity of in vitro and in vivo follicular T helper cells.
  • organism-icon Mus musculus
  • sample-icon 6 Downloadable Samples
  • Technology Badge Icon Affymetrix Mouse Genome 430 2.0 Array (mouse4302)

Description

Follicular T helper cells (Tfh) are critical for providing help to B cells for germinal center (GC) formation. Mutations affecting SAP prevent GC formation due to defective T:B cell interactions, yet effects on Tfh cell differentiation remain unclear. We describe the in vitro differentiation of functionally competent Tfh-like cells that expressed IL-21, Tfh markers, and Bcl6, and rescued GC formation in SAP-deficient hosts substantially better than other T helper (Th) cells. SAP-deficient Tfh-like cells appeared virtually indistinguishable from wildtype, yet failed to support GCs in vivo. Interestingly, both Tfh-like and in vivo-derived Tfh cells could produce effector cytokines in response to polarizing conditions. Moreover, other Th cell populations could be reprogrammed to obtain Tfh characteristics. ChIP-Seq analyses revealed positive epigenetic markings on Tbx21, Gata3 and Rorc in Tfh-like and ex vivo Tfh cells, and Bcl6 in other Th cells, supporting the concept of plasticity between Tfh and other Th populations.

Publication Title

Functional and epigenetic studies reveal multistep differentiation and plasticity of in vitro-generated and in vivo-derived follicular T helper cells.

Sample Metadata Fields

Specimen part

View Samples

refine.bio is a repository of uniformly processed and normalized, ready-to-use transcriptome data from publicly available sources. refine.bio is a project of the Childhood Cancer Data Lab (CCDL)

fund-icon Fund the CCDL

Developed by the Childhood Cancer Data Lab

Powered by Alex's Lemonade Stand Foundation

Cite refine.bio

Casey S. Greene, Dongbo Hu, Richard W. W. Jones, Stephanie Liu, David S. Mejia, Rob Patro, Stephen R. Piccolo, Ariel Rodriguez Romero, Hirak Sarkar, Candace L. Savonen, Jaclyn N. Taroni, William E. Vauclain, Deepashree Venkatesh Prasad, Kurt G. Wheeler. refine.bio: a resource of uniformly processed publicly available gene expression datasets.
URL: https://www.refine.bio

Note that the contributor list is in alphabetical order as we prepare a manuscript for submission.

BSD 3-Clause LicensePrivacyTerms of UseContact