Code & Resources:

Code:

Resources:

Code:

Hi-C Doodle

This is a teaching tool that I created to introduce students to the concepts of 3D genome organization, along with the measurement and visualization using Hi-C and related Chromosome Conformation Capture techniques.

Example simulation

Tutorial video

Single-cell Repli-seq analysis script
(Dileep and Gilbert, 2018)

​This is an R pipeline for analyzing DNA replication in single cells using Whole Genome Amplification (WGA). The pipeline starts with read counts from single-cell WGA binned into 50kb windows. Sample data is provided along with the code. Read counts of all cells are converted to reads per million to control for variable sequencing depth. To control for amplification and mappability biases, we also sorted G1 and G2 cells, which contain a relatively uniform DNA content. Regions of low mappability and over-amplification were removed based on the G1 and G2 controls. Read counts were normalized by dividing the coverage data of each single cell by the coverage of the G1 and G2 control cells. A median filter was applied to smooth the data, producing Copy Number Variation (CNV) profiles in 50 kb bins. Next, a novel binarization strategy is used to convert the CNV profile to replicated or un-replicate. The binarization algorithm is based on iterating through multiple thresholds for binarization and calculating the difference between the CNV profile and binarized profile for each given threshold. The threshold with the least difference is chosen as the ideal binarization. 

Resources:

​​Data and computational scripts from Dileep et al. Cell, 2023

​ Data access and scripts used for Gene-fusion calling and analysis, DNA break mapping (BLISS), Mate-Pair Sequencing, and Hi-C.

​​Constitutive and Developmental replication timing domain labels​

​Constitutively replicating and developmental replication timing domains in mice (mm10) and humans (hg38) (Dileep et al. Genome Res; 2015, Dixon et al. Nat Genetics, 2018)