Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Learning Meaningful Representations of Life (LMRL) Workshop @ ICLR 2025

Weakly Supervised Latent Variable Inference of Proximity Bias in CRISPR Gene Knockouts from Single-Cell Images

Aditya Ravuri · Kristina Ulicna · Jana Osea · Konstantin Donhauser · Jason Hartford


Abstract:

High-throughput screening enables biologists to study cell perturbations by generating large, high-dimensional datasets, such as gene expression profiles and cell microscopy images. Particularly in CRISPR-Cas9 screens, where gene knockout effects are typically represented using perturbation-specific conditional mean embeddings, these representations can be distorted by off-target effects in which the knockouts impact not only the target gene but also neighboring genes on the same chromosome arm, introducing "proximity bias". To address this, we develop a discrete latent variable inference method that leverages correlations between neighboring perturbations as a weak supervision signal to detect single cells affected by off-target effects. Removing these cells reduces spurious correlations between adjacent gene embeddings, achieving comparable correction performance without relying on additional gene expression data. Moreover, we show that the identified cells exhibit chromosome-arm specificity, reinforcing the validity of our approach and its potential for scaling into a genome-wide proximity bias correction method.

Chat is not available.