Mouse Pancreatic Islet Atlas

Mouse pancreatic islet scRNA-seq atlas across sexes, ages, and stress conditions including diabetes

cellxgene_census

Info

cellxgene_census/mouse_pancreas_atlas
Hrovatin et al. (2023)
13.07 GiB
02-02-2024
301796 × 24923
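
The dimensions above allow a rough memory estimate before loading anything. The sketch below is plain arithmetic, not part of any loader: it assumes 4-byte float32 values (the data type the Reference table reports for the layers) and computes what one fully dense matrix of a given shape would occupy. The helper name `dense_gib` is made up for illustration.

```python
# Back-of-the-envelope memory estimate from the dimensions listed above.
N_OBS = 301_796   # cells
N_VARS = 24_923   # genes
BYTES_PER_FLOAT32 = 4
GIB = 1024 ** 3


def dense_gib(n_rows, n_cols, bytes_per_value=BYTES_PER_FLOAT32):
    """Size in GiB of a fully dense n_rows x n_cols matrix."""
    return n_rows * n_cols * bytes_per_value / GIB


# One dense float32 expression layer (cells x genes).
print(f"dense layer: {dense_gib(N_OBS, N_VARS):.2f} GiB")

# The 50-component PCA embedding (cells x 50) is small by comparison.
print(f"X_pca:       {dense_gib(N_OBS, 50) * 1024:.1f} MiB")
```

A single dense layer comes out at roughly 28 GiB, more than twice the 13.07 GiB file, so converting a sparse layer to a dense array is probably best avoided on an ordinary workstation.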

Used in

No related benchmarks found.

Description

To better understand pancreatic β-cell heterogeneity, we generated a mouse pancreatic islet atlas capturing a wide range of biological conditions. The atlas contains scRNA-seq data for over 300,000 mouse pancreatic islet cells, of which more than 100,000 are β-cells, drawn from nine datasets with 56 samples, including two previously unpublished datasets. The samples vary in sex, age (ranging from embryonic to aged), chemical stress, and disease status (including T1D NOD model development and two T2D models, mSTZ and db/db), together with different diabetes treatments. Additional information about data fields is available in the AnnData uns field ‘field_descriptions’ and at https://github.com/theislab/mm_pancreas_atlas_rep/blob/main/resources/cellxgene.md.
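
The description points to a ‘field_descriptions’ entry in uns. A minimal sketch of how it might be read follows. It assumes the dataset has been saved locally as an .h5ad file and that the anndata package is installed; `load_atlas`, `field_descriptions`, and the demo values are illustrative names and placeholders, not part of the dataset. The uns entries listed in the Reference table do not include ‘field_descriptions’, so this copy may lack it, and the helper falls back to an empty dict.

```python
from types import SimpleNamespace


def load_atlas(path):
    """Read an .h5ad file; `path` is a placeholder supplied by the caller."""
    import anndata as ad  # imported lazily so the helper below runs without it

    return ad.read_h5ad(path)


def field_descriptions(adata):
    """Return the field descriptions stored in `uns`, or {} if the key is absent."""
    return dict(adata.uns.get("field_descriptions", {}))


# Stand-in objects (invented values) so the helper can be tried
# without downloading the full file.
with_key = SimpleNamespace(uns={"field_descriptions": {"sex": "placeholder text"}})
without_key = SimpleNamespace(uns={})

print(field_descriptions(with_key))
print(field_descriptions(without_key))
```

With the real file, the call would look like `field_descriptions(load_atlas("path/to/file.h5ad"))`, where the path is whatever location the file was saved to.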

Preview

The dataset is an AnnData object with n_obs × n_vars = 301796 × 24923 and the following slots: obs, var, obsp, obsm, varm, layers, uns.

Reference

Name | Description | Type | Data type | Size
obs
assay | Type of assay used to generate the cell data, indicating the methodology or technique employed. | vector | category | 301796
assay_ontology_term_id | Experimental Factor Ontology (EFO:) term identifier for the assay, providing a standardized reference to the assay type. | vector | category | 301796
batch | A batch identifier. This label is very context-dependent and may be a combination of the tissue, assay, donor, etc. | vector | category | 301796
cell_type | Classification of the cell type based on its characteristics and function within the tissue or organism. | vector | category | 301796
cell_type_ontology_term_id | Cell Ontology (CL:) term identifier for the cell type, offering a standardized reference to the specific cell classification. | vector | category | 301796
dataset_id | Identifier for the dataset from which the cell data is derived, useful for tracking and referencing purposes. | vector | category | 301796
development_stage | Stage of development of the organism or tissue from which the cell is derived, indicating its maturity or developmental phase. | vector | category | 301796
development_stage_ontology_term_id | Ontology term identifier for the developmental stage, providing a standardized reference to the organism’s developmental phase. If the organism is human (organism_ontology_term_id == 'NCBITaxon:9606'), then the Human Developmental Stages (HsapDv:) ontology is used. If the organism is mouse (organism_ontology_term_id == 'NCBITaxon:10090'), then the Mouse Developmental Stages (MmusDv:) ontology is used. Otherwise, the Uberon (UBERON:) ontology is used. | vector | category | 301796
disease | Information on any disease or pathological condition associated with the cell or donor. | vector | category | 301796
disease_ontology_term_id | Ontology term identifier for the disease, enabling standardized disease classification and referencing. Must be a term from the Mondo Disease Ontology (MONDO:), or PATO:0000461 from the Phenotype And Trait Ontology (PATO:). | vector | category | 301796
donor_id | Identifier for the donor from whom the cell sample is obtained. | vector | category | 301796
is_primary_data | Indicates whether the data is primary (directly obtained from experiments) or has been computationally derived from other primary data. | vector | bool | 301796
self_reported_ethnicity | Ethnicity of the donor as self-reported, relevant for studies considering genetic diversity and population-specific traits. | vector | category | 301796
self_reported_ethnicity_ontology_term_id | Ontology term identifier for the self-reported ethnicity, providing a standardized reference for ethnic classifications. If the organism is human (organism_ontology_term_id == 'NCBITaxon:9606'), then the Human Ancestry Ontology (HANCESTRO:) is used. | vector | category | 301796
sex | Biological sex of the donor or source organism, crucial for studies involving sex-specific traits or conditions. | vector | category | 301796
sex_ontology_term_id | Ontology term identifier for the biological sex, ensuring standardized classification of sex. Only PATO:0000383, PATO:0000384 and PATO:0001340 are allowed. | vector | category | 301796
size_factors | The size factors created by the normalisation method, if any. | vector | float32 | 301796
soma_joinid | If the dataset was retrieved from CELLxGENE census, this is a unique identifier for the cell. | vector | int64 | 301796
suspension_type | Type of suspension or medium in which the cells were stored or processed, important for understanding cell handling and conditions. | vector | category | 301796
tissue | Specific tissue from which the cells were derived, key for context and specificity in cell studies. | vector | category | 301796
tissue_general | General category or classification of the tissue, useful for broader grouping and comparison of cell data. | vector | category | 301796
tissue_general_ontology_term_id | Ontology term identifier for the general tissue category, aiding in standardizing and grouping tissue types. For organoid or tissue samples, the Uber-anatomy ontology (UBERON:) is used. The term ids must be a child term of UBERON:0001062 (anatomical entity). For cell cultures, the Cell Ontology (CL:) is used. The term ids cannot be CL:0000255, CL:0000257 or CL:0000548. | vector | category | 301796
tissue_ontology_term_id | Ontology term identifier for the tissue, providing a standardized reference for the tissue type. For organoid or tissue samples, the Uber-anatomy ontology (UBERON:) is used. The term ids must be a child term of UBERON:0001062 (anatomical entity). For cell cultures, the Cell Ontology (CL:) is used. The term ids cannot be CL:0000255, CL:0000257 or CL:0000548. | vector | category | 301796
var
feature_id | Unique identifier for the feature, usually an Ensembl gene ID. | vector | object | 24923
feature_name | A human-readable name for the feature, usually a gene symbol. | vector | object | 24923
hvg | Whether or not the feature is considered to be a ‘highly variable gene’. | vector | bool | 24923
hvg_score | A ranking of the features by hvg. | vector | float64 | 24923
soma_joinid | If the dataset was retrieved from CELLxGENE census, this is a unique identifier for the feature. | vector | int64 | 24923
obsp
knn_connectivities | K nearest neighbors connectivities matrix. | sparsematrix | float32 | 301796 × 301796
knn_distances | K nearest neighbors distance matrix. | sparsematrix | float64 | 301796 × 301796
obsm
X_pca | The resulting PCA embedding. | densematrix | float32 | 301796 × 50
varm
pca_loadings | The PCA loadings matrix. | densematrix | float32 | 24923 × 50
layers
counts | Raw counts. | sparsematrix | float32 | 301796 × 24923
normalized | Normalised expression values. | sparsematrix | float32 | 301796 × 24923
uns
dataset_description | Long description of the dataset. | atomic | str | 1
dataset_id | A unique identifier for the dataset. This is different from the obs.dataset_id field, which is the identifier for the dataset from which the cell data is derived. | atomic | str | 1
dataset_name | A human-readable name for the dataset. | atomic | str | 1
dataset_organism | The organism of the sample in the dataset. | atomic | str | 1
dataset_reference | BibTeX reference of the paper in which the dataset was published. | atomic | str | 1
dataset_summary | Short description of the dataset. | atomic | str | 1
dataset_url | Link to the original source of the dataset. | atomic | str | 1
knn | Supplementary K nearest neighbors data. | dict | | 3
normalization_id | Which normalization was used. | atomic | str | 1
pca_variance | The PCA variance objects. | dict | | 2
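
Several obs descriptions above state rules for ontology term identifiers. The sketch below restates those rules as small checks. It is an illustration only, not the validator used by CELLxGENE or by this dataset's pipeline, and the function names are invented. It tests prefixes and listed identifiers only; it does not verify that a term exists or that an UBERON term descends from UBERON:0001062.

```python
# Identifier lists copied from the field descriptions above.
ALLOWED_SEX_TERMS = {"PATO:0000383", "PATO:0000384", "PATO:0001340"}
FORBIDDEN_CL_TISSUE_TERMS = {"CL:0000255", "CL:0000257", "CL:0000548"}


def development_stage_prefix(organism_term_id):
    """Ontology prefix expected for development_stage_ontology_term_id."""
    if organism_term_id == "NCBITaxon:9606":   # human
        return "HsapDv:"
    if organism_term_id == "NCBITaxon:10090":  # mouse
        return "MmusDv:"
    return "UBERON:"                           # any other organism


def is_allowed_sex_term(term_id):
    """Only the three listed PATO terms are accepted."""
    return term_id in ALLOWED_SEX_TERMS


def is_allowed_disease_term(term_id):
    """A MONDO term, or the single PATO term PATO:0000461."""
    return term_id.startswith("MONDO:") or term_id == "PATO:0000461"


def is_allowed_tissue_term(term_id):
    """UBERON terms (ancestry not checked) or CL terms outside the forbidden set."""
    if term_id.startswith("UBERON:"):
        return True
    return term_id.startswith("CL:") and term_id not in FORBIDDEN_CL_TISSUE_TERMS


# This atlas is a mouse dataset, so MmusDv terms are expected.
print(development_stage_prefix("NCBITaxon:10090"))
print(is_allowed_sex_term("PATO:0000383"))
print(is_allowed_tissue_term("CL:0000255"))
```

Checks of this kind could be applied to the unique categories of each obs column as a quick sanity test after loading.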

References

Hrovatin, Karin, Aimée Bastidas-Ponce, Mostafa Bakhti, Luke Zappia, Maren Büttner, Ciro Sallino, Michael Sterr, et al. 2023. “Delineating Mouse β-Cell Identity During Lifetime and in Diabetes with a Single Cell Atlas.” bioRxiv. https://doi.org/10.1101/2022.12.22.521557.