Cell-Cell Communication Inference (Source-Target)

Detect interactions between source and target cell types

Description

The growing availability of single-cell data has sparked an increased interest in the inference of cell-cell communication (CCC), with an ever-growing number of computational tools developed for this purpose.

Different tools propose distinct preprocessing steps and diverse scoring functions that are challenging to compare and evaluate. Furthermore, each tool typically comes with its own set of prior knowledge. To harmonize these, Dimitrov et al. (2022) developed the LIANA framework, which was used as a foundation for this task.

The challenges in evaluating the tools are further exacerbated by the lack of a gold standard for benchmarking CCC methods. In an attempt to address this, Dimitrov et al. use alternative data modalities, including the spatial proximity of cell types and downstream cytokine activities, to generate an inferred ground truth. However, these modalities are only approximations of biological reality and come with their own assumptions and limitations. In time, more datasets with known ground-truth interactions will become available, from which the limitations and advantages of the different CCC methods will be better understood.

This subtask evaluates methods on their ability to predict interactions between spatially adjacent source and target cell types. It focuses on the prediction of interactions from steady-state, or single-context, single-cell data.

Summary

Figure 1: Overview of the results per method. This figure shows the mean of the scaled scores (group Overall), the mean scores per dataset (group Dataset) and the mean scores per metric (group Metric).

Metrics

Precision-recall AUC¹: Area under the precision-recall curve for the binary classification task predicting interactions.
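
As a minimal sketch of this metric, the snippet below computes average precision, one common step-wise estimator of the area under the precision-recall curve, in plain Python. The function name `average_precision` and the toy labels and scores are hypothetical, and the benchmark's own implementation may use a different estimator (for example, trapezoidal integration of the curve).

```python
def average_precision(labels, scores):
    """Average precision: a step-wise estimate of the area under the PR curve.

    labels: 0/1 ground-truth values (1 = interaction in the positive class)
    scores: predicted scores (higher = more confident)
    Ties between scores are broken by input order (an assumption of this sketch).
    """
    labels = list(labels)
    scores = list(scores)
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("at least one positive label is required")
    # Rank interactions from highest to lowest score (stable sort).
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    true_pos = 0
    ap = 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            true_pos += 1
            # Precision at this rank, weighted by the recall step 1 / n_pos.
            ap += (true_pos / rank) / n_pos
    return ap


# Toy example (hypothetical data): 2 positives among 5 candidate interactions.
toy_labels = [1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.2, 0.1]
toy_ap = average_precision(toy_labels, toy_scores)
```

A ranking that places every positive interaction ahead of every negative one yields a value of 1.0, while the toy ranking above is penalized for the negative interaction ranked second.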

Odds Ratio²: The odds ratio compares the ratio of true to false positives within a set of prioritized interactions (the top-ranked hits) with the same ratio for the remainder of the interactions. In this scenario, it quantifies the strength of the association between an interaction being prioritized by a method and that interaction belonging to the positive class.
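
The definition above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the benchmark's implementation: the function name `odds_ratio`, the `top_fraction` parameter and its default, and the handling of a zero denominator are all hypothetical choices, and the function returns the raw ratio rather than the scaled value reported in the results table.

```python
def odds_ratio(labels, scores, top_fraction=0.05):
    """Raw odds ratio of positives in the top-ranked hits versus the remainder.

    labels: 0/1 ground-truth values (1 = interaction in the positive class)
    scores: predicted scores (higher = more strongly prioritized)
    top_fraction: share of interactions treated as prioritized (assumed value)
    """
    labels = list(labels)
    scores = list(scores)
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    top_n = max(1, int(len(order) * top_fraction))
    top = [labels[i] for i in order[:top_n]]
    rest = [labels[i] for i in order[top_n:]]

    tp = sum(top)          # positives among the prioritized interactions
    fp = len(top) - tp     # negatives among the prioritized interactions
    fn = sum(rest)         # positives left in the remainder
    tn = len(rest) - fn    # negatives left in the remainder

    # (tp / fp) / (fn / tn) rewritten as a single fraction.
    numerator = tp * tn
    denominator = fp * fn
    if denominator == 0:
        # Assumed convention for this sketch: undefined ratios become inf or nan.
        return float("inf") if numerator > 0 else float("nan")
    return numerator / denominator


# Toy example (hypothetical data): 10 interactions, top 20% prioritized.
toy_labels = [1, 0, 1, 0, 0, 0, 0, 0, 0, 0]
toy_scores = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
toy_or = odds_ratio(toy_labels, toy_scores, top_fraction=0.2)
```

In the toy data, the two prioritized interactions contain one true and one false positive (ratio 1), while the remainder contains one positive and seven negatives (ratio 1/7), so positives are enriched among the top hits.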

Results

Results table of the scores per method, dataset and metric (after scaling). Use the filters to make a custom subselection of methods and datasets. The “Overall mean” dataset is the mean value across all datasets.

Method	Dataset	Mean score	Precision-recall AUC	Odds Ratio	Runtime (s)	CPU (%)	Memory (GB)
CellPhoneDB (max) ³	Overall mean	0.39	0.07	0.70	8,101	100	114.16
Magnitude Rank Aggregate (max) ⁴	Overall mean	0.38	0.05	0.70	10,470	100	118.75
NATMI (max) ⁵	Overall mean	0.34	0.05	0.63	2,099	85	20.90
Log2FC (sum) ⁴	Overall mean	0.32	0.10	0.54	1,368	88	20.90
NATMI (sum) ⁵	Overall mean	0.31	0.08	0.54	1,058	92	20.90
Specificity Rank Aggregate (max) ⁴	Overall mean	0.30	0.05	0.54	8,320	100	143.75
Connectome (sum) ⁶	Overall mean	0.28	0.03	0.54	1,239	87	20.41
Log2FC (max) ⁴	Overall mean	0.24	0.07	0.41	1,609	94	20.41
SingleCellSignalR (max) ⁷	Overall mean	0.23	0.05	0.41	1,300	95	20.41
Connectome (max) ⁶	Overall mean	0.14	0.03	0.25	1,480	96	20.41
CellPhoneDB (sum) ³	Overall mean	0.01	0.02	0.00	7,550	100	113.38
Magnitude Rank Aggregate (sum) ⁴	Overall mean	0.00	0.00	0.00	8,280	100	118.95
Specificity Rank Aggregate (sum) ⁴	Overall mean	-0.00	-0.00	0.00	8,649	100	138.18
SingleCellSignalR (sum) ⁷	Overall mean	-0.00	-0.00	0.00	1,608	86	20.90

Details

Methods

CellPhoneDB (max)³: CellPhoneDBv2 calculates a mean of ligand-receptor expression as a measure of interaction magnitude, along with a permutation-based p-value as a measure of specificity. Here, we use the former to prioritize interactions, after filtering for interactions with a p-value below 0.05. Links: Docs.

CellPhoneDB (sum)³: CellPhoneDBv2 calculates a mean of ligand-receptor expression as a measure of interaction magnitude, along with a permutation-based p-value as a measure of specificity. Here, we use the former to prioritize interactions, after filtering for interactions with a p-value below 0.05. Links: Docs.

Connectome (max)⁶: Connectome uses the product of ligand-receptor expression as a measure of magnitude, and the average of the z-transformed expression of ligand and receptor as a measure of specificity. Links: Docs.

Connectome (sum)⁶: Connectome uses the product of ligand-receptor expression as a measure of magnitude, and the average of the z-transformed expression of ligand and receptor as a measure of specificity. Links: Docs.

Log2FC (max)⁴: logFC (implemented in LIANA and inspired by iTALK) combines both expression and magnitude, and represents the average of one-versus-the-rest log2-fold change of ligand and receptor expression per cell type. Links: Docs.

Log2FC (sum)⁴: logFC (implemented in LIANA and inspired by iTALK) combines both expression and magnitude, and represents the average of one-versus-the-rest log2-fold change of ligand and receptor expression per cell type. Links: Docs.

Magnitude Rank Aggregate (max)⁴: RobustRankAggregate generates a consensus rank of all methods implemented in LIANA providing either specificity or magnitude scores. Links: Docs.

Magnitude Rank Aggregate (sum)⁴: RobustRankAggregate generates a consensus rank of all methods implemented in LIANA providing either specificity or magnitude scores. Links: Docs.

NATMI (max)⁵: NATMI uses the product of ligand-receptor expression as a measure of magnitude. As a measure of specificity, NATMI proposes $\text{specificity.edge} = \frac{l}{l_{s}} \cdot \frac{r}{r_{s}}$, where $l$ and $r$ represent the average expression of ligand and receptor per cell type, and $l_{s}$ and $r_{s}$ represent the sums of the average ligand and receptor expression across all cell types. We use its specificity measure, as recommended by the authors for single-context predictions. Links: Docs.

NATMI (sum)⁵: NATMI uses the product of ligand-receptor expression as a measure of magnitude. As a measure of specificity, NATMI proposes $\text{specificity.edge} = \frac{l}{l_{s}} \cdot \frac{r}{r_{s}}$, where $l$ and $r$ represent the average expression of ligand and receptor per cell type, and $l_{s}$ and $r_{s}$ represent the sums of the average ligand and receptor expression across all cell types. We use its specificity measure, as recommended by the authors for single-context predictions. Links: Docs.

Random Events⁹: Random generation of cell-cell communication events by random selection of ligand, receptor, source, target, and score. Links: Docs.

SingleCellSignalR (max)⁷: SingleCellSignalR provides a magnitude score as $\text{LRscore} = \frac{\sqrt{l r}}{\mu + \sqrt{l r}}$, where $l$ and $r$ are the average ligand and receptor expression per cell type, and $\mu$ is the mean of the expression matrix. Links: Docs.

SingleCellSignalR (sum)⁷: SingleCellSignalR provides a magnitude score as $\text{LRscore} = \frac{\sqrt{l r}}{\mu + \sqrt{l r}}$, where $l$ and $r$ are the average ligand and receptor expression per cell type, and $\mu$ is the mean of the expression matrix. Links: Docs.

Specificity Rank Aggregate (max)⁴: RobustRankAggregate generates a consensus rank of all methods implemented in LIANA providing either specificity or magnitude scores. Links: Docs.

Specificity Rank Aggregate (sum)⁴: RobustRankAggregate generates a consensus rank of all methods implemented in LIANA providing either specificity or magnitude scores. Links: Docs.

True Events⁹: Perfect prediction of cell-cell communication events from target data. Links: Docs.
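
Several of the scores described above reduce to simple arithmetic on per-cell-type averages. The following sketch illustrates them in plain Python under stated assumptions: the function names are hypothetical, the inputs are assumed to be precomputed averages, and steps such as permutation-based p-values, z-transformation and rank aggregation are omitted. It is not the implementation used by the original tools or by LIANA.

```python
import math


def cellphonedb_magnitude(l, r):
    """Mean of average ligand (l) and receptor (r) expression."""
    return (l + r) / 2


def expression_product(l, r):
    """Product of ligand and receptor expression (Connectome and NATMI magnitude)."""
    return l * r


def natmi_specificity(l, r, l_s, r_s):
    """NATMI specificity edge: (l / l_s) * (r / r_s).

    l_s and r_s are the sums of the average ligand and receptor
    expression across all cell types.
    """
    return (l / l_s) * (r / r_s)


def singlecellsignalr_lrscore(l, r, mu):
    """SingleCellSignalR LRscore: sqrt(l*r) / (mu + sqrt(l*r)).

    mu is the mean of the expression matrix.
    """
    root = math.sqrt(l * r)
    return root / (mu + root)


# Toy values (hypothetical): one ligand-receptor pair for one source-target pair.
l, r = 2.0, 8.0        # average ligand and receptor expression
l_s, r_s = 4.0, 16.0   # sums of the averages across all cell types
mu = 4.0               # mean of the expression matrix

toy_scores = {
    "cellphonedb_magnitude": cellphonedb_magnitude(l, r),
    "expression_product": expression_product(l, r),
    "natmi_specificity": natmi_specificity(l, r, l_s, r_s),
    "singlecellsignalr_lrscore": singlecellsignalr_lrscore(l, r, mu),
}
```

Note that the NATMI specificity and the LRscore are bounded between 0 and 1 for non-negative expression values, whereas the mean and product scores grow with expression level.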

Baseline methods

Random Events: Random generation of cell-cell communication events by random selection of ligand, receptor, source, target, and score.

True Events: Perfect prediction of cell-cell communication events from target data.
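
A random baseline of this kind could look roughly like the sketch below. The function name `random_events`, its parameters, the output layout and the placeholder gene and cell type names are all assumptions made for illustration; the benchmark's own baseline may sample from different pools or score distributions.

```python
import random


def random_events(ligands, receptors, cell_types, n_events, seed=0):
    """Draw n_events random communication events with uniform random scores."""
    rng = random.Random(seed)  # seeded so the baseline is reproducible
    events = []
    for _ in range(n_events):
        events.append(
            {
                "ligand": rng.choice(ligands),
                "receptor": rng.choice(receptors),
                "source": rng.choice(cell_types),
                "target": rng.choice(cell_types),
                "score": rng.random(),  # uniform in [0, 1)
            }
        )
    return events


# Placeholder identifiers (hypothetical, not taken from the dataset).
toy_events = random_events(
    ligands=["LigandA", "LigandB"],
    receptors=["ReceptorX", "ReceptorY"],
    cell_types=["TypeA", "TypeB", "TypeC"],
    n_events=5,
)
```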

Datasets

Mouse brain atlas⁸: A murine brain atlas with adjacent cell types as assumed benchmark truth, inferred from deconvolution proportion correlations using matching 10x Visium slides (see Dimitrov et al., 2022). 14,249 cells × 34,617 features with 23 cell type labels.

Quality control results

✓ All checks succeeded!

Visualization of raw results