Dysregulation of transcription networks regulating oligodendrogenesis in age-related decline in CNS remyelination
Dimas, Morabito & Rawji et al. 2025 (bioRxiv)
In demyelinating diseases like multiple sclerosis (MS), efficient remyelination is critical for functional recovery. Remyelination efficiency declines with age, and is linked to progressive disability. The gene regulatory network underlying remyelination, and how it is altered with aging, remains unclear. Here we present a comparative single-nucleus RNA and ATAC sequencing analysis of remyelination in young and aged mice. We identified gene modules dynamically expressed throughout oligodendrocyte differentiation, revealing age-dependent changes in key processes related to myelination. Multi-omic analysis allowed us to map the regulatory network driving efficient remyelination within oligodendrocyte lineage cells in young mice. We highlight key transcription factors in the network dysregulated with age, and we describe similar dysregulations in MS lesions. Modifying the expression of these transcription factors in primary oligodendrocyte progenitor cells impacts differentiation. These findings provide a foundational understanding of this regenerative process in the context of aging and in chronic demyelinating diseases.
This repository contains the code used for the data analysis associated with Dimas, Morabito & Rawji et al. bioRxiv (2025). The sections of the README below follow the same flow as the mannuscript, linking to the relevant script(s) for each of the data analysis steps.
These scripts can be used to reproduce the results starting from the raw sequencing reads downloaded from GEO, or from the processed data objects. Please note that the file paths included in these scripts are relative to the UCI HPC3 or the CNAG cluster, and these paths must be updated in order to run these scripts on a different machine. The data will be made publicly available upon final publication of this study, and peer reviewers have been provided with a GEO access code.
Raw and processed snATAC-seq and snRNA-seq datasets have been deposited on the NCBI Gene Expression Omnibus (GEO). At this time, the data remains private and will be released upon publication of this study. A data access code has been provided to the peer reviewers of this study.
snATAC-seq and snRNA-seq were performed separately in mouse spinal cord sections using 10X Genomics Chromium. Pre-processing (i.e. mapping, quantification, etc) of the sequencing data from these assays was performed using cellranger-atac count for snATAC-seq and cellranger count for snRNA-seq.
- Pre-processing snATAC-seq data with
cellranger-atac count - Pre-processing snRNA-seq data with
cellranger count
We first performed clustering analysis for ATAC and RNA separately. This clustering analysis includes a batch correction step to correct for the effect of different sequencing batches, using harmony for snATAC-seq and Liger for snRNA-seq.
- snATAC-seq clustering analysis with ArchR + Harmony
- snRNA-seq clustering analysis with Seurat + Liger
- snRNA-seq cluster marker genes
We next integrated snATAC-seq and snRNA-seq into a joint cellular embedding using Seurat.
Here we perform a multi-omic pseudotime trajectory analysis using monocle3 to study the transcriptional and epigenomic dynamics of the oligodendrocyte lineage in remyelination.
- snATAC + snRNA oligodendrocyte trajectory analysis with monocle3
- Identifying trajectory DEGs with generalized linear models
- Modeling trajectory expression dynamics using recurrent variational autoencoders (RVAEs)
We used hdWGCNA to construct a gene co-expression network in the oligodendrocyte lineage, identifying 10 gene modules capturing key gene expression patterns across the trajectory. We also approximated epigenomic activity of these modules using the snATAC-seq dataset.
We next used cicero to identify linkages between promoter regions and non-coding cis-regulatory elements (putative enhancers), and we compared the strength of these linkages across ageing.
We leverage epigenomic and transcriptomic measurements to construct the gene-regulatory network (GRN) underlying the oligodendrocyte lineage throughout remyelination in young and aged mice. We leveraged TF motif presence in promoter elements and linked enhancer elements from co-accessibility analysis in order to define directed links between TFs and downstream target genes. These links are assembled into a full GRN, and XGBoost regression was used to prioritize the most likely TF-gene links and to define TF Regulons.
We sought to identify the processive regulatory events, or cascades of TF-TF activation that underly the cellular transitions from progenitors to mature oligodendrocytes. We propose a unique computational strategy to accomplish this goal. In the young and aged GRNs, we performed optimal pathfinding analysis, using the strongest directed TF-TF links to find the most likely path from "start" TFs (expressed in progenitors) to "end" TFs (expressed in mature).
We next sought to prioritize TFs that are altered in ageing mice, and altered in human multiple sclerosis (MS). First, we collected and re-processed two snRNA-seq datasets of human MS.
- Preprocessing the Schirmer et al. 2019 dataset
- Preprocessing the Absinta et al. 2021 dataset
- Clustering the Schirmer et al. 2019 dataset
- Clustering the Absinta et al. 2021 dataset
Next we performed differential expression analysis across ageing in the mouse dataset, and between MS tissue vs. healthy in the human datasets.
- Differential expression analysis with ageing in mice
- Differential expression analysis in the human datasets
Finally, we synthesize this information to prioritize TFs for further study with downstream experiments.
We performed bulk RNA-seq experiments in cell models where we over-expressed TFs of interest (Bach2, Bhlhe41, Elf2, Foxk2, Nr6a1, Sox8, and Stat3).
- TODO: Bulk RNA-seq data processing
- TODO: Bulk RNA-seq differential expression analysis
- Bulk RNA-seq downstream plotting and comparison with single-nucleus data
Here are scripts that were used for additional plotting, and scripts containing helper functions.