panelScope

Multi-view gene panel characterization for spatially resolved omics

We present panelScope, a framework that uses a diverse collection of metrics to characterize a gene panel, allowing researchers to determine whether a panel is well suited to their study’s objectives. We demonstrate the utility of panelScope by generating multiple views of gene panels that describe their ability to capture cell types of interest, their enrichment for biological pathways, and the amount of redundant information they contain. In parallel, we use these metrics as loss functions in a genetic algorithm for panel design, where users can choose how to weight each characterization category. We have also implemented this framework in an interactive web platform, which includes a library of pre-existing gene panels that users can compare with their own. By quantitatively summarizing a panel from multiple views, panelScope enables the design of panels that capture diverse information relevant to one’s specific research questions.

Installation

To run panelScope on your local machine, we suggest first creating a conda environment:

conda create -n <env> python=3.9
conda activate <env>

After activating the new virtual environment, the recommended way to install the dependencies is:

conda install pandas
conda install scanpy

Usage

panelScope has two main parts: the metrics-computation part and the optimization part.

The metrics-computation part is written in R. It describes numerical properties of a selected gene panel based on a provided single-cell dataset.

The optimization part is written in Python and uses an efficient variant of an evolutionary algorithm. As shown in the following figure, it is an iterative optimization process with 4 steps. At the initialization step, a number of gene panels (50 by default) are randomly selected from the input dataset as the initial population. An evaluation-selection-generation loop is then repeated a number of times (5000 by default). We use scoring functions related to feature diversity, pathway diversity, panel entropy, spatial specificity and variation recovery as our evaluation metrics. The algorithm randomly picks better-performing gene panels and uses them to generate new panels, which replace the old, poorly performing ones.
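
The loop described above can be sketched in a few lines of Python. This is only an illustration of the initialization, evaluation, selection and generation steps, not the code in evo.py: the function names, the crossover and mutation scheme, and the toy scoring function are all assumptions made for the example, and the real evaluation metrics are computed by the R part.

```python
import random


def toy_score(panel, weights):
    """Stand-in fitness: sum of per-gene weights (NOT a panelScope metric)."""
    return sum(weights[gene] for gene in panel)


def evolve_panel(candidates, panel_size, score_fn, pop_size=50, n_iter=5000, seed=0):
    """Hypothetical evaluation-selection-generation loop over gene panels."""
    if pop_size < 4:
        raise ValueError("pop_size must be at least 4")
    rng = random.Random(seed)

    # 1. Initialization: random panels drawn from the candidate genes.
    population = [rng.sample(candidates, panel_size) for _ in range(pop_size)]

    for _ in range(n_iter):
        # 2. Evaluation: rank panels from best to worst
        #    (a real implementation would cache scores instead of recomputing).
        population.sort(key=score_fn, reverse=True)

        # 3. Selection: pick two distinct parents at random from the better half.
        parent_a, parent_b = rng.sample(population[: pop_size // 2], 2)

        # 4. Generation: draw a child from the parents' combined genes,
        #    then swap one gene for a candidate not yet in the child.
        pool = sorted(set(parent_a) | set(parent_b))
        child = rng.sample(pool, panel_size)
        unused = [gene for gene in candidates if gene not in child]
        if unused:
            child[rng.randrange(panel_size)] = rng.choice(unused)

        # The child replaces the worst panel; the best panel is never removed.
        population[-1] = child

    return max(population, key=score_fn)


# Toy usage: 30 made-up genes, where higher-numbered genes score higher.
genes = [f"G{i}" for i in range(30)]
gene_weights = {gene: i for i, gene in enumerate(genes)}
best_panel = evolve_panel(genes, 5, lambda p: toy_score(p, gene_weights),
                          pop_size=10, n_iter=200)
print(sorted(best_panel))
```

Because only the worst panel is replaced in each iteration, the best score in the population never decreases in this sketch.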

The optimization part takes 4 arguments:

--dataset_path: a str. The absolute path of the reference single-cell dataset in .rds format.

--panel_num: an int. The number of genes in the final panel.

--search_space_path: a str. The absolute path of a JSON file that contains all candidate genes.

--objmode: a str. Must be one of ["overall","cts","corr","pathway","spatial","tv"].
          These represent, respectively, the objective functions "Overall", "Panel entropy score",
          "Feature diversity score", "Pathway diversity score", "Moran’s I" and
          "transcriptional-variability-based functions". Users can optimize the gene panel with any one
          of them. Going one step further, users can also define their own functions and replace the
          R code for more specialized panel selection.
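
As an illustration of how these four arguments fit together, the following is a minimal argparse sketch. It is not the parser in main.py: whether each argument is required, and whether --objmode has a default, are assumptions made here, as are the help strings and the example paths.

```python
import argparse

# The six documented choices for --objmode.
OBJ_MODES = ["overall", "cts", "corr", "pathway", "spatial", "tv"]


def build_parser():
    """Hypothetical parser mirroring the four documented arguments."""
    parser = argparse.ArgumentParser(description="Gene panel optimization (sketch)")
    parser.add_argument("--dataset_path", type=str, required=True,
                        help="absolute path of the reference dataset (.rds)")
    parser.add_argument("--panel_num", type=int, required=True,
                        help="number of genes in the final panel")
    parser.add_argument("--search_space_path", type=str, required=True,
                        help="absolute path of the JSON file of candidate genes")
    # Assumption: 'overall' as the default; the real default is not documented.
    parser.add_argument("--objmode", type=str, choices=OBJ_MODES, default="overall",
                        help="objective function to optimize")
    return parser


# Parse an explicit argument list so the example does not depend on sys.argv.
example_args = build_parser().parse_args([
    "--dataset_path", "/data/reference.rds",
    "--panel_num", "200",
    "--search_space_path", "/data/search.json",
    "--objmode", "pathway",
])
print(example_args.panel_num, example_args.objmode)
```

With `choices` set, argparse rejects any --objmode value outside the six documented options before the optimization starts.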

Demo

We provide an example using subsampled data from the Human Cell Atlas (HCA).
In this demo we automatically design a 200-gene panel with panelScope-OA. The algorithm chooses genes by maximising multiple criteria, which can be customised with the --objmode argument. When panelScope-OA finishes, the selected genes are written to a .txt file—one gene per line.
After successful installation (see the Installation section above), run the algorithm with the following command:

python main.py --dataset_path ./demo_hca_10x2.rds --panel_num 200 --search_space_path ./search.json --objmode overall

Input files (all included in this repository):

--dataset_path: demo_hca_10x2.rds is a random subsample of 1,000 cells × 1,000 genes from the HCA, provided for computational efficiency. This should be a Seurat object saved in .rds format.
--panel_num: Number of genes to select (here 200). An integer.
--search_space_path: search.json lists the 1,000 candidate genes from which the panel will be chosen. A JSON file.
--objmode: Optimisation mode. We use overall, which applies all characterisation criteria defined in our framework. Other modes can be specified to emphasise specific objectives.

Feel free to adjust --panel_num, edit the search-space JSON, or choose a different --objmode setting to tailor the panel design to your own data and priorities.
Output file (included in this repository):

demo_result.txt: the selected genes are written to a .txt file—one gene per line.
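
Since the output is a plain text file with one gene per line, it can be loaded for downstream use with a few lines of Python. The helper names below are hypothetical and not part of panelScope; the sketch only assumes the one-gene-per-line layout described above.

```python
from collections import Counter
from pathlib import Path


def read_panel(path):
    """Read a one-gene-per-line text file into a list, skipping blank lines."""
    lines = Path(path).read_text().splitlines()
    return [line.strip() for line in lines if line.strip()]


def check_panel(genes, expected_size):
    """Return True if the panel has the expected size and no duplicate genes."""
    duplicates = [gene for gene, count in Counter(genes).items() if count > 1]
    return len(genes) == expected_size and not duplicates
```

For the demo run, `check_panel(read_panel("demo_result.txt"), 200)` would be a quick sanity check that the file holds the 200 distinct genes requested with --panel_num.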

Citation

If you find our code useful, please consider citing our work:

@article{panelScope,
  title={Multi-view gene panel characterization for spatially resolved omics},
  author={Daniel Kim+, Wenze Ding+, Akira Nguyen Shaw, Marni Torkel, Cameron J Turtle, Pengyi Yang, Jean Yang*},
  journal={},
  year={2025},
}
