Plotting#

The DRVI plotting module provides visualization tools for analyzing latent representations and interpretability. These functions help researchers understand the disentangled representations learned by the DRVI model and their biological implications.

Overview#

The plotting module is organized into several categories:

  • Latent Visualization: Functions for exploring and visualizing latent dimensions

  • Interpretability Analysis: Tools for understanding how latent dimensions affect gene expression

  • Utility Functions: Additional utility functions.

Latent Dimension Analysis#

The core functions are:

plotting.plot_latent_dimension_stats

Plot the statistics of latent dimensions.

plotting.plot_latent_dims_in_umap

Plot the latent dimensions on a UMAP embedding.

plotting.plot_latent_dims_in_heatmap

Plot the latent dimensions in a heatmap.

plot_latent_dimension_stats#

Analyzes and visualizes statistics of latent dimensions to understand their properties and importance.

  • Plots multiple statistics (reconstruction effect, max value, mean, std) across dimension ranking

  • Distinguishes between vanished and non-vanished dimensions

Use Cases:

  • Identify which latent dimensions are most important for reconstruction

  • Understand the distribution of activation values across dimensions

  • Detect vanished dimensions that contribute little to the model

plot_latent_dims_in_umap#

Visualizes latent dimensions as continuous variables on UMAP embeddings to understand their spatial distribution. For each latent dimension, one UMAP plot wil be generated.

Use Cases:

  • Understand how latent dimensions relate to cell clustering

  • Identify spatial patterns in latent dimension activation

plot_latent_dims_in_heatmap#

Creates heatmap visualizations of latent dimensions across different cell groups or conditions.

  • Groups cells by categorical variables (e.g., cell types, conditions)

  • Supports balanced sampling for better visualization

  • Configurable ordering and filtering of dimensions

Use Cases:

  • Compare latent dimension activation across cell types

  • Identify condition-specific latent patterns

Interpretability and Differential Effects#

The core functions are:

plotting.show_top_differential_vars

Show top differential variables in a bar plot.

plotting.plot_relevant_genes_on_umap

Plot relevant genes on UMAP embedding.

plotting.show_differential_vars_scatter_plot

Show a scatter plot of differential variables considering multiple criteria.

plotting.differential_vars_heatmap

Generate a heatmap of differential variables based on traverse data.

show_top_differential_vars#

Displays bar plots of the top relevant expressed genes for each latent dimension.

  • Shows top N genes with highest score per dimension

  • Support for gene symbol mapping

Use Cases:

  • Identify the most important genes for each biological process

  • Compare gene effects across different latent dimensions

plot_relevant_genes_on_umap#

Visualizes the expression of top relevant genes for each dimension on UMAP embeddings.

  • Shows genes most affected by selected latent dimensions

  • Automatic title generation and layout

Use Cases:

  • Understand expression patterns of key genes

  • Validate biological interpretation of latent dimensions

show_differential_vars_scatter_plot#

Creates scatter plots, allowing users to understand the scoring function under the hood.

  • Compares two main effect values (min_possible and max_possible)

  • Colors genes by the combined effect

Use Cases:

  • Visualize max_possible and min_possible effects

  • Understand the scoring function

differential_vars_heatmap#

Generates comprehensive heatmaps showing how genes respond to latent dimension traversals.

  • Shows stepwise effects across all latent dimensions and genes

  • Groups genes by their maximum effect dimension

Use Cases:

  • Observe the sparsity of the identified modules

Utility Functions#

The core functions are:

plotting.make_balanced_subsample

Create a balanced subsample of AnnData based on a categorical column.

plotting.cmap

make_balanced_subsample#

Creates balanced subsamples of AnnData objects with respect to a categorical variable.

  • Equal sampling from each category

  • Configurable minimum sample size per category

Use Cases:

  • Create balanced samples for heatmap visualization

Custom Colormaps#

The module provides specialized colormaps designed for biological data visualization:

  • cmap.saturated_red_blue_cmap: Enhanced red-blue diverging colormap for differential effects

  • cmap.saturated_just_sky_cmap: Sky-blue colormap for positive-only effects

  • cmap.saturated_sky_cmap: Sky-blue colormap on the positive side and gray colormap for the negative side