Prediction of the 3D cancer genome from whole-genome sequencing using InfoHiC
Prediction of the 3D cancer genome from whole-genome sequencing using InfoHiC

Prediction of the 3D cancer genome from whole-genome sequencing using InfoHiC

Mol Syst Biol. 2024 Sep 25. doi: 10.1038/s44320-024-00065-2. Online ahead of print.

ABSTRACT

The 3D genome prediction in cancer is crucial for uncovering the impact of structural variations (SVs) on tumorigenesis, especially when they are present in noncoding regions. We present InfoHiC, a systemic framework for predicting the 3D cancer genome directly from whole-genome sequencing (WGS). InfoHiC utilizes contig-specific copy number encoding on the SV contig assembly, and performs a contig-to-total Hi-C conversion for the cancer Hi-C prediction from multiple SV contigs. We showed that InfoHiC can predict 3D genome folding from all types of SVs using breast cancer cell line data. We applied it to WGS data of patients with breast cancer and pediatric patients with medulloblastoma, and identified neo topologically associating domains. For breast cancer, we discovered super-enhancer hijacking events associated with oncogenic overexpression and poor survival outcomes. For medulloblastoma, we found SVs in noncoding regions that caused super-enhancer hijacking events of medulloblastoma driver genes (GFI1, GFI1B, and PRDM6). In addition, we provide trained models for cancer Hi-C prediction from WGS at https://github.com/dmcb-gist/InfoHiC , uncovering the impacts of SVs in cancer patients and revealing novel therapeutic targets.

PMID:39322849 | DOI:10.1038/s44320-024-00065-2