Bioinformatics Seminars

Current Bioinformatics Seminar

Time: 11 AM Tuesdays
Venue: Davis Auditorium and Online

21 April 2026

Representing and Recreating Cancer Genomics Data at Scale

Andre Kahles
ETH Zurich

Technological advances in sequencing have generated large-scale cancer genomics datasets, creating both new opportunities and substantial computational challenges. In the first part of this talk, I will introduce the MetaGraph framework, a scalable approach for representing and querying cohort-level sequencing data using graph-based indexes. I will discuss its performance across diverse cohorts and demonstrate applications to RNA-Seq and DNA-Seq data, including analyses of gene expression, alternative and trans-splicing, and structural variation. In the second part, I will focus on the complementary challenge of generating realistic sequencing data for benchmarking new computational methods. As part of the ICGC benchmarking working group, I will outline the need for high-fidelity simulation of DNA and RNA sequencing data. I will then present an overview of an emerging simulation framework that captures key biological and technical characteristics, and discuss how such data can support the robust evaluation of genomic analysis tools.

