refine.bio
  • Search
      • Normalized Compendia
      • RNA-seq Sample Compendia
  • Docs
  • About
  • My Dataset
github link
Showing
of 167 results
Sort by

Filters

Technology

Platform

accession-icon SRP154576
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (9 cell mixture dataset).
  • organism-icon Homo sapiens
  • sample-icon 5 Downloadable Samples
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the 9 cell mixture dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP186516
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods [5 Cell Lines Cel-seq]
  • organism-icon Homo sapiens
  • sample-icon 3 Downloadable Samples
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 5 human lung adenocarcinoma cell lines H2228, H1975, A549, H838 and HCC827. For the single cell designs, the five cell lines were mixed equally and processed by 10X chromium and CEL-seq2, referred to as sc_10X_5cl, and sc_CEL-seq2_5cl respectively in analysis that follows. For CEL-seq2, three plates were sorted and processed.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Subject

View Samples
accession-icon SRP155038
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (RNAmix_CEL-seq2 )
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the RNAmix_CEL-seq2 dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP158266
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (Drop-Seq)
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq. This is the RNAmix_CEL-seq2 dataset.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon SRP155039
Designing a single cell RNA sequencing benchmark dataset to compare protocols and analysis methods (RNAmix_Sort-seq)
  • organism-icon Homo sapiens
  • sample-icon 1 Downloadable Sample
  • Technology Badge IconNextSeq 500

Description

Single cell RNA sequencing (scRNA-seq) technology has undergone rapid development in recent years and brings new challenges in data processing and analysis. This has led to an explosion of tailored analysis methods for scRNA-seq to address various biological questions. However, the current lack of gold-standard benchmarking datasets makes it difficult for researchers to evaluate the performance of the many methods available in a systematic manner. Here, we designed and generated a cross-platform benchmark dataset that has in-built truth in various forms and varying levels of biological noise. We used this dataset to compare different protocols and data analysis methods. We found that different protocols have different data quality and ERCC spike-in works independently to endogenous RNA. We found significant differences in the results from the methods compared and we associated the results with data characteristics to identify methods that perform well in different situations. Our dataset and analysis provide a valuable resource for algorithm selection in different biological settings. Overall design: our experiment utilized the 3 human lung adenocarcinoma cell lines H2228, H1975 and HCC827. The experiment included mixtures of RNA and single cells from these cell lines. For the single cell designs, the three cell lines were mixed equally and processed by 10X chromium, Drop-seq and CEL-seq2, referred to as sc_10X, sc_Drop-seq and sc_CEL-seq2 respectively in analysis that follows. For the mixture designs, we used plate-based protocols to mix and dilute samples in 2 different ways. 9 cell mixtures from the 3 cell lines were sorted in different combinations in the cell mixture experiment and data were generated by CEL-seq2, the material after pooling from 384 wells were subsampled in either 1/9 or 1/3 to simulate cells of different sizes, with different PCR product clean up ratios ranging from 0.7 to 0.9, referred to as cellmix1 to cellmix4. For the cell mixture experiment, we also sorted wells with 10 times more cells (90 cells) to provide a pseudo bulk reference for each mixture (referred to as cellmix5). Distinct RNA mixtures which were diluted down to create single cell equivalents (ranging from 3.75, 7.5, 15 to 30 pg per well) were generated using CEL-seq2 and SORT-seq (referred to as RNAmix_CEL-seq2 and RNAmix_Sort-seq.

Publication Title

scPipe: A flexible R/Bioconductor preprocessing pipeline for single-cell RNA-sequencing data.

Sample Metadata Fields

Specimen part, Subject

View Samples
accession-icon GSE76880
Expression data from human 3D skin models in response to IL-31 treatment
  • organism-icon Homo sapiens
  • sample-icon 8 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

Atopic dermatitis, a chronic inflammatory skin disease with increasing prevalance, is closely associated with skin barrier defects. A cytokine related to disease severity and inhibition of keratinocyte differentiation is IL-31. To identify its molecular targets, IL-31-dependent gene expression was determined in 3-dimensional organotypic skin models.

Publication Title

Control of the Physical and Antimicrobial Skin Barrier by an IL-31-IL-1 Signaling Network.

Sample Metadata Fields

Sex, Specimen part

View Samples
accession-icon SRP071611
Analysis of the expression profile of skin macrophages FACS-sorted from mice overexpressing activin and/or oncogenes of human papilloma virus 8 in keratinocytes.
  • organism-icon Mus musculus
  • sample-icon 12 Downloadable Samples
  • Technology Badge IconIllumina HiSeq 2500

Description

We have shown that activin promoted skin tumorigenesis in mice induced by the human papilloma virus 8 oncogenes. Activin attracted blood monocytes to the skin as revealed by depletion of CCR2-positive monocytes. To determine if activin also altered the gene expression profile of these cells, we performed RNA-Sequencing of macrophages FACS-sorted from the pre-cancerous ear skin. We have found that activin induces a pro-migratiory, pro-angiogenic and pro-tumorigenic genes in skin macrophages in vivo. This largely contributes to the pro-tumorigenic function of activin, since macrophage depletion delayed spontaneous tumorigenesis in HPV8-transgenic mice by reducing keratinocyte proliferation and angiogenesis. Overall design: F4/80+CD11b+CD45+ cells were FACS-sorted from the pre-cancerous ear skin of wt/wt, HPV8/wt, wt/Act and HPV8/Act mice and their expression profile was analysed by RNA-Sequencing. Experiment was performed in triplicates, for each replicate ear skin of 3-6 mice of corresponding genotype was pooled.

Publication Title

Activin promotes skin carcinogenesis by attraction and reprogramming of macrophages.

Sample Metadata Fields

Specimen part, Cell line, Subject

View Samples
accession-icon GSE30873
Effects of caspase-8 deletion in the intestinal epithelium
  • organism-icon Mus musculus
  • sample-icon 6 Downloadable Samples
  • Technology Badge Icon Affymetrix Mouse Genome 430 2.0 Array (mouse4302)

Description

Caspase-8 is a cystein protease involved in regulating apoptosis. The function of caspase-8 was studied in the intestinal epithelium, using mice with an intestinal epithelial cell specific deletion of caspase-8.

Publication Title

Caspase-8 regulates TNF-α-induced epithelial necroptosis and terminal ileitis.

Sample Metadata Fields

Specimen part

View Samples
accession-icon GSE78210
3D cultivation of NSCLC cell lines alters gene expression of key cancer-associated signalling pathways
  • organism-icon Homo sapiens
  • sample-icon 15 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

Background: The main focus of the work was the evaluation of gene expression differences between our established NSCLC 3D cell culture model and the 2D cell culture in regard to the use of our model for drug screening applications.

Publication Title

3D-cultivation of NSCLC cell lines induce gene expression alterations of key cancer-associated pathways and mimic <i>in-vivo</i> conditions.

Sample Metadata Fields

Cell line

View Samples
accession-icon GSE62455
Gene expression of paired samples of hepatic stellate cells (HSC) and hepatocyte cell culture (HCC) treated with conditioned media of HSC cells
  • organism-icon Homo sapiens
  • sample-icon 34 Downloadable Samples
  • Technology Badge Icon Affymetrix Human Gene 1.0 ST Array (hugene10st)

Description

All living cells rely on the communication with other cells to ensure their function and survival. Molecular signals are sent among cells of the same cell type and from cells of one cell type to another. In cancer, not only the cancer cells themselves are responsible for the malignancy, but also stromal (non-cancerous) cells and the molecular signals they send to cancer cells are important factors that determine the severity and outcome of the disease. Therefore, the identification of stromal signals and their influence on cancer cells is important when looking for novel treatment strategies.

Publication Title

Causal Modeling of Cancer-Stromal Communication Identifies PAPPA as a Novel Stroma-Secreted Factor Activating NFκB Signaling in Hepatocellular Carcinoma.

Sample Metadata Fields

Specimen part

View Samples
...

refine.bio is a repository of uniformly processed and normalized, ready-to-use transcriptome data from publicly available sources. refine.bio is a project of the Childhood Cancer Data Lab (CCDL)

fund-icon Fund the CCDL

Developed by the Childhood Cancer Data Lab

Powered by Alex's Lemonade Stand Foundation

Cite refine.bio

Casey S. Greene, Dongbo Hu, Richard W. W. Jones, Stephanie Liu, David S. Mejia, Rob Patro, Stephen R. Piccolo, Ariel Rodriguez Romero, Hirak Sarkar, Candace L. Savonen, Jaclyn N. Taroni, William E. Vauclain, Deepashree Venkatesh Prasad, Kurt G. Wheeler. refine.bio: a resource of uniformly processed publicly available gene expression datasets.
URL: https://www.refine.bio

Note that the contributor list is in alphabetical order as we prepare a manuscript for submission.

BSD 3-Clause LicensePrivacyTerms of UseContact