Feature extraction and spatial analysis

Feature extraction and spatial analysis#

This notebook demonstrates a powerful, annotation-free workflow for analyzing the morphological landscape of a Whole Slide Image (WSI). The core idea is to leverage a pre-trained deep learning model to automatically extract meaningful features from image tiles and then use standard clustering tools to cluster and visualize these features.

The process involves three main steps:

Tiling.
Feature Extraction: A pre-trained vision model is used to process each tile. The model converts the visual information of each patch into a high-dimensional feature vector—a numerical signature that represents its content.
Downstream Analysis: These feature vectors are then analyzed using a pipeline common in single-cell genomics. We will perform dimensionality reduction with PCA, build a neighborhood graph, and then use UMAP for visualization and the Leiden algorithm for clustering.

By the end of this workflow, we will have generated a UMAP embedding that visually organizes the tiles by similarity and a clustered map of the tiles of the WSI, effectively segmenting the tissue into distinct morphological regions without any manual intervention.

First, we will load a GTEx Small Intestine slide as example.

from huggingface_hub import hf_hub_download

slide = hf_hub_download(
    "rendeirolab/lazyslide-data",
    "GTEX-11DXX-1626.svs",
    repo_type="dataset",
)

from wsidata import open_wsi
import lazyslide as zs

Let’s open the wsi!

wsi = open_wsi(slide)
wsi

WSI: /home/runner/work/lazyslide-tutorials/lazyslide-tutorials/.cache/huggingface/hub/datasets--rendeirolab--lazyslide-data/snapshots/d469afd4a763ad366861e8c49d4cf424bfad902c/GTEX-11DXX-1626.svs
Reader: openslide
Dimensions: 38717×51791 (h×w), 3 Pyramids
Pixel physical size: 0.49 MPP (20X)

SpatialData object
with coordinate systems:

What does the tissue look like?

zs.pl.tissue(wsi)

../_images/d138adb982a02f3d6ac7ae16c1fe452d0d5a8a7ba9dccee60d34cd0658a2982f.png

Let’s first find and tile the tissue, we will request tiny tile size of 128*128 px, this may take a while.

zs.pp.find_tissues(wsi)
zs.pp.tile_tissues(wsi, 224)

Morphological feature extraction#

Feature extraction is to transform the image into a the numeric representation, which comprises of different morphological features. Typically, this is done by feeding the tiles into a vision model.

LazySlide supports automatic mix-precision inference which may reduce memory usuage and have faster inference spped, try amp=True. Since we are working on a big slide with tiny tile sizes, this may take 10mins to finish (MacBook M3 Max).

zs.tl.feature_extraction(wsi, "plip", amp=True)

::{note} Autocast doesn’t work well with mps backend though, you may get all nan results.

zs.tl.feature_extraction(wsi, "plip")

Features are saved as AnnData store with a convention of “{model name}_{tiles key}”. For example, h0-mini_tiles

Feature aggregation#

To perform analysis across dataset, a usual way is to pool features into a 1D vector that can represent the entire slide. By default, the mean pooling is applied. Advanced slide encoders will be introduced later.

zs.tl.feature_aggregation(wsi, feature_key="plip")

You can retrieve specific feature with the fetch accessor. This will return a copy of the anndata.

adata = wsi.fetch.features_anndata("plip")

Pre-computed results#

If you don’t want to run feature extraction, you can simply load the pre-computed one

wsi = zs.datasets.gtex_small_intestine()

wsi

Examination of feature space#

You may need to install scanpy and igraph to run the following command.

This code takes the deep-learning feature data from image tiles we have extracted from PLIP, preprocesses it, and then performs a standard analysis workflow to identify groups of similar tiles (clustering). We can visualize the tile space using UMAP. The colors represent the different clusters, as found by computing Leiden clustering on the neighborhood graph of tiles.

import scanpy as sc

adata = wsi["plip_tiles"]
sc.pp.scale(adata)
sc.pp.pca(adata)
sc.pp.neighbors(adata)
sc.tl.umap(adata)
sc.tl.leiden(adata, flavor="igraph", resolution=0.2)

sc.pl.umap(adata, color="leiden")

../_images/0dc5d25eff986d2b869858a3f938866221eac58f468ea3f9f2e3897bef7512bd.png

sc.tl.rank_genes_groups(adata, groupby="leiden")

features = set()
for i in adata.obs["leiden"].unique():
    names = sc.get.rank_genes_groups_df(adata, i).names
    features.update(list(names[0:10]) + list(names[-10:]))
features = list(features)

import marsilea as ma
import marsilea.plotter as mp
from scipy.stats import zscore

key = "leiden"
h = ma.Heatmap(zscore(adata[:, features].X.T), height=2, width=4, label="Feature")
order = sorted(adata.obs[key].unique())
h.group_cols(adata.obs[key], order=order)
h.add_top(mp.Chunk(order, fill_colors=adata.uns[f"{key}_colors"], padding=2), pad=0.05)
h.add_dendrogram("right", method="average", linewidth=0.1)
h.add_legends()
h.render()

<marsilea.heatmap.Heatmap at 0x7fcb783bd370>

../_images/066f63b387c2d53f59df34910238cfd771ffc9221ad0cd2f844b15209baf8df5.png

Identification of spatial domains#

The Leiden clustering on features from the foundational model can already recover the spatial domains of tissues pretty well. However, this clustering algorithm is based on the proximity of the morphological features of each tile, but it does not consider actual spatial information.

zs.pl.tiles(
    wsi,
    feature_key="plip",
    color="leiden",
    alpha=0.5,
    palette=adata.uns[f"{key}_colors"],
    show_contours=False,
)

../_images/50163ed593fec3314e8cdd974bee88371e343ed640f911246b3066939307bba6.png

For simplicity, you can run spatial domain analysis with zs.tl.spatial_domain. This is equivalent to the previous analysis.

zs.tl.spatial_domain(wsi, feature_key="plip", resolution=0.2)

wsi.write()

Integration of spatial information with UTAG#

In this example, you may notice the border of domain is not very smooth, this can be improved by integrating spatial information.

UTAG is a method develop to discovery spatial domain with unsupervised learning.

The basic idea is to use message passing to combine the physical position of the tiles and the features of each tile to create a spatially-informed neighborhood graph. In the following code block, we first create an adjacency matrix of the tile locations with zs.pp.tile_graph and then integrate spatial tile context with vision features using spatial feature smoothing with zs.tl.spatial_features.

zs.pp.tile_graph(wsi)
zs.tl.spatial_features(wsi, "plip")

zs.tl.spatial_domain(wsi, layer="spatial_features", feature_key="plip", resolution=0.2)

zs.pl.tiles(wsi, color="domain", alpha=0.5)

../_images/c963bd1b8d2d250be457c70fc2c8cde099435183f27d52441847a859cd28dbcd.png

Now we can observe there are 6 domains in the tissues, with smooth borders.

Text feature extraction#

Apart from deriving morphological features from vision models, you can also run multimodal to derive text features.

Currently, there are two vision-language models for pathology

PLIP, Nature Medicine, 2023
CONCH, Nature Medicine, 2024

Since we’ve extracted the plip vision features for our WSI, we only need to extract features for the texts.

terms = ["mucosa", "submucosa", "musclaris", "lymphocyte"]

embeddings = zs.tl.text_embedding(terms, model="plip")
zs.tl.text_image_similarity(wsi, embeddings, model="plip", softmax=True)

zs.pl.tiles(
    wsi,
    feature_key="plip_tiles_text_similarity",
    color=terms,
    cmap="rainbow",
    show_image=False,
    tissue_id=3,
    alpha=0.7,
)

../_images/f5e683323080b15bf39717937d0badc0bf2c81684e75c3527fa06393d3c238b0.png