Query gene across published studies

Summary of the data curation process

1. Retrieval

Published data is downloaded from its respective repository.

2. Noise reduction

Background noise is removed to improve data accuracy.

3. Standardization

Author annotations, such as cell type and tissue, are mapped to a consistent, controlled vocabulary.
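
As a rough illustration of this mapping step, the sketch below normalizes an author-provided label and looks it up in a small table. The table entries, the normalization rule, and the name `standardize_label` are assumptions made for this example; the actual vocabulary and matching logic are not described here.

```python
from typing import Optional

# Hypothetical lookup table from author labels to controlled terms.
# These entries are illustrative only and are not taken from the real vocabulary.
CONTROLLED_VOCABULARY = {
    "t cell": "T cell",
    "t-cell": "T cell",
    "t lymphocyte": "T cell",
    "hepatocytes": "hepatocyte",
}


def standardize_label(author_label: str) -> Optional[str]:
    """Return the controlled term for an author label, or None if it is unmapped."""
    # Lowercase and collapse whitespace so trivial variants share one key.
    key = " ".join(author_label.lower().split())
    return CONTROLLED_VOCABULARY.get(key)


print(standardize_label("  T-Cell "))
print(standardize_label("unlabeled cluster 7"))
```

Returning `None` for unmapped labels keeps unknown annotations visible so they can be reviewed rather than silently guessed.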

4. Expression threshold

A gene is considered expressed in a cell type or tissue if it has a non-zero value in at least 100 cells and in at least 15% of the total cells.
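
The rule can be sketched as a small check. This is a minimal example under two assumptions that the text does not confirm: both conditions must hold together, and "total cells" means all cells of that cell type or tissue within one study. The function name `is_expressed` and its input format are hypothetical.

```python
MIN_CELLS = 100    # minimum number of cells with a non-zero value
MIN_PERCENT = 15   # minimum percentage of total cells with a non-zero value


def is_expressed(values):
    """Apply the expression threshold to one gene in one cell type or tissue.

    `values` holds the gene's expression value for each cell in the group
    (assumed here to be all cells of that cell type or tissue in one study).
    """
    total = len(values)
    if total == 0:
        return False
    nonzero = sum(1 for v in values if v != 0)
    # Integer arithmetic avoids floating-point edge cases at exactly 15%.
    return nonzero >= MIN_CELLS and nonzero * 100 >= MIN_PERCENT * total


# 120 of 500 cells are non-zero (24%), so both conditions hold.
print(is_expressed([1.0] * 120 + [0.0] * 380))
# 100 of 1000 cells are non-zero (10%), so the percentage condition fails.
print(is_expressed([1.0] * 100 + [0.0] * 900))
```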

5. Study count

The number of studies reporting the gene as expressed is counted for each cell type or tissue.
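
One way to picture the counting step is shown below. It assumes the per-study expression calls for a single gene are available as `(study_id, label, expressed)` records, and that a study is counted at most once per cell type or tissue; the record layout and the name `count_studies` are hypothetical.

```python
from collections import Counter


def count_studies(calls):
    """Count the studies reporting the gene as expressed, per cell type or tissue.

    `calls` is an iterable of (study_id, label, expressed) records, where
    `label` is a standardized cell type or tissue name.
    """
    # A set removes duplicate (study, label) pairs so each study counts once.
    expressed_pairs = {
        (study_id, label) for study_id, label, expressed in calls if expressed
    }
    return Counter(label for _, label in expressed_pairs)


calls = [
    ("study_A", "T cell", True),
    ("study_B", "T cell", True),
    ("study_A", "T cell", True),   # duplicate record, counted once
    ("study_C", "T cell", False),  # not expressed, ignored
    ("study_C", "liver", True),
]
counts = count_studies(calls)
print(counts["T cell"], counts["liver"])
```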