Mouse Pancreatic Islet Atlas
Mouse pancreatic islet scRNA-seq atlas across sexes, ages, and stress conditions including diabetes
cellxgene_census
Info
Quick links
Description
To better understand pancreatic β-cell heterogeneity we generated a mouse pancreatic islet atlas capturing a wide range of biological conditions. The atlas contains scRNA-seq datasets of over 300,000 mouse pancreatic islet cells, of which more than 100,000 are β-cells, from nine datasets with 56 samples, including two previously unpublished datasets. The samples vary in sex, age (ranging from embryonic to aged), chemical stress, and disease status (including T1D NOD model development and two T2D models, mSTZ and db/db) together with different diabetes treatments. Additional information about data fields is available in anndata uns field 'field_descriptions' and on https://github.com/theislab/mm_pancreas_atlas_rep/blob/main/resources/cellxgene.md.
Preview
An AnnData object with n_obs × n_vars = 301796 × 24923 with slots:
Name | Description | Type | Data type | Size |
---|---|---|---|---|
obs | ||||
soma_joinid | If the dataset was retrieved from CELLxGENE census, this is a unique identifier for the cell. | vector | int64 | 301796 |
dataset_id | Identifier for the dataset from which the cell data is derived, useful for tracking and referencing purposes. | vector | category | 301796 |
assay | Type of assay used to generate the cell data, indicating the methodology or technique employed. | vector | category | 301796 |
assay_ontology_term_id | Experimental Factor Ontology (`EFO:`) term identifier for the assay, providing a standardized reference to the assay type. | vector | category | 301796 |
cell_type | Classification of the cell type based on its characteristics and function within the tissue or organism. | vector | category | 301796 |
cell_type_ontology_term_id | Cell Ontology (`CL:`) term identifier for the cell type, offering a standardized reference to the specific cell classification. | vector | category | 301796 |
development_stage | Stage of development of the organism or tissue from which the cell is derived, indicating its maturity or developmental phase. | vector | category | 301796 |
development_stage_ontology_term_id | Ontology term identifier for the developmental stage, providing a standardized reference to the organism's developmental phase. If the organism is human (`organism_ontology_term_id == 'NCBITaxon:9606'`), then the Human Developmental Stages (`HsapDv:`) ontology is used. If the organism is mouse (`organism_ontology_term_id == 'NCBITaxon:10090'`), then the Mouse Developmental Stages (`MmusDv:`) ontology is used. Otherwise, the Uberon (`UBERON:`) ontology is used. | vector | category | 301796 |
disease | Information on any disease or pathological condition associated with the cell or donor. | vector | category | 301796 |
disease_ontology_term_id | Ontology term identifier for the disease, enabling standardized disease classification and referencing. Must be a term from the Mondo Disease Ontology (`MONDO:`) ontology term, or `PATO:0000461` from the Phenotype And Trait Ontology (`PATO:`). | vector | category | 301796 |
donor_id | Identifier for the donor from whom the cell sample is obtained. | vector | category | 301796 |
is_primary_data | Indicates whether the data is primary (directly obtained from experiments) or has been computationally derived from other primary data. | vector | bool | 301796 |
self_reported_ethnicity | Ethnicity of the donor as self-reported, relevant for studies considering genetic diversity and population-specific traits. | vector | category | 301796 |
self_reported_ethnicity_ontology_term_id | Ontology term identifier for the self-reported ethnicity, providing a standardized reference for ethnic classifications. If the organism is human (`organism_ontology_term_id == 'NCBITaxon:9606'`), then the Human Ancestry Ontology (`HANCESTRO:`) is used. | vector | category | 301796 |
sex | Biological sex of the donor or source organism, crucial for studies involving sex-specific traits or conditions. | vector | category | 301796 |
sex_ontology_term_id | Ontology term identifier for the biological sex, ensuring standardized classification of sex. Only `PATO:0000383`, `PATO:0000384` and `PATO:0001340` are allowed. | vector | category | 301796 |
suspension_type | Type of suspension or medium in which the cells were stored or processed, important for understanding cell handling and conditions. | vector | category | 301796 |
tissue | Specific tissue from which the cells were derived, key for context and specificity in cell studies. | vector | category | 301796 |
tissue_ontology_term_id | Ontology term identifier for the tissue, providing a standardized reference for the tissue type. For organoid or tissue samples, the Uber-anatomy ontology (`UBERON:`) is used. The term ids must be a child term of `UBERON:0001062` (anatomical entity). For cell cultures, the Cell Ontology (`CL:`) is used. The term ids cannot be `CL:0000255`, `CL:0000257` or `CL:0000548`. | vector | category | 301796 |
tissue_general | General category or classification of the tissue, useful for broader grouping and comparison of cell data. | vector | category | 301796 |
tissue_general_ontology_term_id | Ontology term identifier for the general tissue category, aiding in standardizing and grouping tissue types. For organoid or tissue samples, the Uber-anatomy ontology (`UBERON:`) is used. The term ids must be a child term of `UBERON:0001062` (anatomical entity). For cell cultures, the Cell Ontology (`CL:`) is used. The term ids cannot be `CL:0000255`, `CL:0000257` or `CL:0000548`. | vector | category | 301796 |
batch | A batch identifier. This label is very context-dependent and may be a combination of the tissue, assay, donor, etc. | vector | category | 301796 |
size_factors | The size factors created by the normalisation method, if any. | vector | float32 | 301796 |
var | ||||
soma_joinid | If the dataset was retrieved from CELLxGENE census, this is a unique identifier for the feature. | vector | int64 | 24923 |
feature_id | Unique identifier for the feature, usually a ENSEMBL gene id. | vector | object | 24923 |
feature_name | A human-readable name for the feature, usually a gene symbol. | vector | object | 24923 |
hvg | Whether or not the feature is considered to be a 'highly variable gene' | vector | bool | 24923 |
hvg_score | A ranking of the features by hvg. | vector | float64 | 24923 |
obsp | ||||
knn_connectivities | K nearest neighbors connectivities matrix. | sparsematrix | float32 | 301796 × 301796 |
knn_distances | K nearest neighbors distance matrix. | sparsematrix | float64 | 301796 × 301796 |
obsm | ||||
X_pca | The resulting PCA embedding. | densematrix | float32 | 301796 × 50 |
varm | ||||
pca_loadings | The PCA loadings matrix. | densematrix | float32 | 24923 × 50 |
layers | ||||
counts | Raw counts | sparsematrix | float32 | 301796 × 24923 |
normalized | Normalised expression values | sparsematrix | float32 | 301796 × 24923 |
uns | ||||
dataset_description | Long description of the dataset. | atomic | str | 1 |
dataset_id | A unique identifier for the dataset. This is different from the `obs.dataset_id` field, which is the identifier for the dataset from which the cell data is derived. | atomic | str | 1 |
dataset_name | A human-readable name for the dataset. | atomic | str | 1 |
dataset_organism | The organism of the sample in the dataset. | atomic | str | 1 |
dataset_reference | Bibtex reference of the paper in which the dataset was published. | atomic | str | 1 |
dataset_summary | Short description of the dataset. | atomic | str | 1 |
dataset_url | Link to the original source of the dataset. | atomic | str | 1 |
knn | Supplementary K nearest neighbors data. | dict | 3 | |
normalization_id | Which normalization was used | atomic | str | 1 |
pca_variance | The PCA variance objects. | dict | 2 |
Related Resources and Citations