Cross-Sample Analysis in Spatial Transcriptomics Made Simple with MUSTANG

Cross-Sample Analysis in Spatial Transcriptomics Made Simple with MUSTANG

Table of Contents

Introduction

Recent advances in single-cell RNA sequencing (scRNA-seq) have expanded our understanding of cellular development and heterogeneity. However, scRNA-seq lacks spatial context due to tissue dissociation, limiting insights into tissue organization. Spatial transcriptomics (ST) addresses this by preserving spatial information, mapping gene expression to specific tissue locations. Yet, most ST methods, like Visium and Slide-seq, lack single-cell resolution, capturing multiple cells per “spot” and creating mixed signals.

Spot deconvolution methods help untangle these signals, distinguishing cell types within each spot to enable cell-type-specific analysis. Existing methods often rely on reference cell profiles, introducing potential bias. Although reference-free methods reduce this bias, they are limited to single-sample analysis, restricting cross-condition comparisons. While multi-sample ST data offers potential for such comparisons, no current tools provide reference-free, multi-sample deconvolution, highlighting a critical gap in ST analysis.

To address this gap, Niyakan S. et al., in a recent publication, developed MUSTANG, a novel multi-sample deconvolution tool that leverages spatial and transcriptional similarities across samples to improve the accuracy and scope of spatial transcriptomics analysis.

MUSTANG stands out as a reference-free, multi-sample analysis tool that does not require pre-defined cell-type expression profiles. By integrating transcriptional similarities across samples, MUSTANG enables researchers to decode complex cell compositions across different tissue sections. This advancement not only expands the scope of spatial transcriptomics but also strengthens data robustness by accounting for biological variance across multiple samples.

Behind MUSTANG: How It Achieves Multi-Sample Accuracy

The MUSTANG framework combines advanced data processing techniques with a robust multi-sample deconvolution strategy, setting it apart from traditional single-sample approaches. This process begins by constructing transcriptional and spatial adjacency matrices across all spots in each sample. These matrices capture gene expression and spatial position, respectively, allowing MUSTANG to form a comprehensive similarity graph that aligns both spatial and transcriptional features across samples.

Once the similarity graph is established, MUSTANG implements a joint Bayesian deconvolution model to infer cell-type compositions within each spot. This model integrates batch correction for cross-sample consistency, ensuring that technical variations do not obscure biological signals. The Bayesian approach further incorporates a Poisson model to calculate gene expression counts per cell type within each spot, refining accuracy through weighted transcriptional and spatial information sharing.

By combining transcriptional and spatial data from all samples, MUSTANG not only deconvolves complex multi-cellular spots but also maintains spatial continuity within and across samples. This methodological design enables a more cohesive and reliable analysis, making MUSTANG a powerful tool for multi-sample spatial transcriptomics.

Real-World Applications and Results: Validating MUSTANG with Diverse Datasets

To demonstrate its efficacy, MUSTANG was tested on three types of datasets: a semi-synthetic mouse brain dataset, a real-world human brain dataset, and a mouse bone marrow dataset. In each case, MUSTANG’s deconvolution performance was compared with that of established single-sample deconvolution tools, such as BayesTME and STdeconvolve. The results consistently showcased MUSTANG’s superiority in accurately identifying cell-type proportions and maintaining spatial coherence.

In the semi-synthetic mouse brain dataset, which simulated varying spot sizes, MUSTANG outperformed other methods in several evaluation metrics, including Pearson correlation coefficient (PCC), structural similarity, and root-mean-square error. MUSTANG’s cell-type deconvolution in this dataset closely matched the ground-truth cell compositions, reinforcing its potential for accurate cell-type identification in real-world data.

For the human brain and mouse bone marrow datasets, MUSTANG successfully mapped cell-type distributions that aligned with known anatomical regions, validating its real-world applicability. In the human brain dataset, MUSTANG’s deconvolution accuracy was reflected in high PCC values, affirming its advantage in multi-sample spatial analysis. The mouse bone marrow dataset further confirmed MUSTANG’s ability to accurately distinguish tumor and non-tumor cells in the tissue microenvironment, highlighting its relevance in disease-related research.

Implications and Future Directions: The Impact of MUSTANG on Spatial Transcriptomics

MUSTANG’s development addresses a crucial need in spatial transcriptomics: the ability to analyze multiple tissue samples simultaneously. By integrating cross-sample transcriptional and spatial data, MUSTANG provides a robust framework for multi-sample analysis, opening doors to more comprehensive tissue-level insights. This multi-sample approach allows researchers to uncover gene expression patterns that are consistent across samples while capturing region-specific variations.

In clinical and research settings, MUSTANG has the potential to enhance studies that rely on spatial mapping of gene expression, such as tumor microenvironment characterization and brain tissue organization. Its capacity to accurately deconvolve cell types across samples without reference profiles reduces bias, making it an ideal tool for studies in diverse biological contexts.

Looking forward, MUSTANG paves the way for further advances in multi-sample spatial analysis, such as joint cell-cell interaction studies across samples. Its ability to integrate spatial and transcriptional similarities marks a step toward more precise, large-scale spatial transcriptomics, making it a valuable addition to the spatial genomics toolkit.

Unlock the Potential of Spatial Transcriptomics with MUSTANG and Bridge Informatics

As a bioinformatics service provider (BSP), Bridge Informatics empowers researchers with cutting-edge computational solutions to drive discoveries in spatial transcriptomics and beyond. Our expertise spans diverse data types and analysis challenges, enabling us to provide tailored bioinformatics support for advanced tools like MUSTANG.

Our team of bioinformaticians, with deep biological knowledge and technical proficiency, helps bridge the gap between computational innovation and biological insight. Whether you need assistance with data processing pipelines, multi-sample analysis strategies, or integrating novel tools into your workflow, we’re here to guide you every step of the way.

The emergence of tools like MUSTANG exemplifies the exciting advancements shaping the field of spatial transcriptomics. If you’re ready to explore how multi-sample deconvolution can enhance your research, we’re here to help. Click here to schedule a free consultation with a member of our team today and unlock new possibilities for your spatial genomics projects.

Are you interested in reading more about Spatial Transcriptomics (ST)? Check out our other ST-related articles:

Share this article with a friend