Marat Kazanov Lab

Graphical abstract: APOBEC mutagenesis in non-canonical DNA structures. *iScience* 2022.

A primary research focus of our research group is to investigate the nature of human cancer by deciphering mutational patterns caused by different mutational processes. Recent advances in DNA sequencing allowed us to obtain nucleotide sequences of whole cancer genomes for different cancer types and create a comprehensive map of mutational signatures. The generation of such mutagenesis maps allows us to examine a whole spectrum of somatic mutations, understand the underlying mutational processes, and contribute to the search of cancer-associated genes.

A major focus is on the APOBEC family enzymes — an important component of the human innate immune system that contributes strongly to the mutational load in cancer. Using motif-centered analysis of whole-genome mutation catalogs, we revealed previously unknown epigenomic features of APOBEC mutagenesis and have extended this approach to other kinds of mutagenesis in human cancers.

Selected publications

Ponomarev GV, Fatykhov B, Nazarov VA, Abasov R, Shvarov E, Landik N-V, Denisova AA, Chervova AA, Gelfand MS, Kazanov MD. APOBEC mutagenesis is low in most types of non-B DNA structures, unlike other types of cancer mutagenesis. iScience 25(7):104535, 2022.
Chervova A, Fatykhov B, Koblov A, Shvarov E, Preobrazhenskaya J, Vinogradov D, Ponomarev GV, Gelfand MS, Kazanov MD. Analysis of gene expression and mutation data points on contribution of transcription to the mutagenesis by APOBEC enzymes. NAR Cancer 3(3):zcab025, 2021.
Kazanov MD, Roberts SA, Polak P, Stamatoyannopoulos J, Klimczak LJ, Gordenin DA, Sunyaev SR. APOBEC-induced cancer mutations are uniquely enriched in early-replicating, gene-dense, and active chromatin regions. Cell Reports 13(6):1103–1109, 2015.

RNA secondary structure elements in viral genomes — RNA secondary structure elements in RNA viruses. *Sci Rep.* 2024.

A growing area of our research focuses on host–virus interactions, combining transcriptomic and genomic analyses with structural and sequence-based computational approaches. Our recent studies are centered on mpox virus (MPXV), where we investigate how the human innate immune system — in particular APOBEC3 enzymes — shapes the viral genome and transcriptome. We also develop computational tools for the systematic analysis of RNA secondary structure in viral genomes.

Selected publications

Lyskova AO, Abasov RKh, Pavlova A, Matveev EV, Madorskaya AV, Kazanov FM, Garshina DV, Smolnikova AE, Ponomarev GV, Sharova EI, Ivankov DN, Adebali O, Gelfand MS, Kazanov MD. MPXV RNA-seq Data Provide Evidence for Protection of Viral Transcripts from APOBEC3 Editing. Journal of Virology, 2026.
Madorskaya AV, Kazanov FM, Ponomarev GV, Smolnikova A, Garshina D, Matveev EV, Abasov RKh, Ivankov DN, Gelfand MS, Kazanov MD. Comment on "Inverted repeats in the monkeypox virus genome are hot spots for mutation". bioRxiv 2025.08.19.670320, 2025.
Kazanov FM, Matveev EV, Ponomarev GV, Ivankov DN, Kazanov MD. Analysis of the abundance and diversity of RNA secondary structure elements in RNA viruses using the RNAsselem Python package. Sci Rep. 14(1):28587, 2024.

Structural susceptibility of proteins to proteolytic processing — Structural features and susceptibility scores for proteolytic processing. *Int. J. Mol. Sci.* 2023.

A major research direction of the lab is the investigation of mechanisms of recognition between proteases and their substrates and the development of bioinformatics methods for predicting protease cleavage. This work has two complementary biological focuses.

In cancer, different tumor types show increased activity of various proteases and decreased activity of their natural inhibitors. Identification and study of these altered interactions provides a handle for the design of agents for blocking tumor invasion and metastasis. Development of bioinformatics methods for predicting protease substrates can substantially reduce the amount of required experimental efforts.

In the context of viral infections, host proteases play a critical role in activating viral surface proteins required for cell entry. Cleavage of viral fusion proteins — such as the spike protein of coronaviruses or the hemagglutinin of influenza — by host proteases like TMPRSS2, furin, or cathepsins is often a prerequisite for infectivity. We apply computational approaches to characterize protease cleavage specificity and to analyze how protease recognition sequences in viral proteins shape host tropism and pathogenicity.

Selected publications

Matveev EV, Ponomarev GV, Kazanov MD. Genome-wide bioinformatics analysis of human protease capacity for proteolytic cleavage of the SARS-CoV-2 spike glycoprotein. Microbiol Spectr. 12(2):e0353023, 2024.
Matveev EV, Safronov VV, Ponomarev GV, Kazanov MD. Predicting Structural Susceptibility of Proteins to Proteolytic Processing. Int. J. Mol. Sci. 24, 10761, 2023.
Belushkin AA, Vinogradov DV, Gelfand MS, Osterman AL, Cieplak P, Kazanov MD. Sequence-derived structural features driving proteolytic processing. Proteomics 14(1):42–50, 2014.
Kazanov MD, Igarashi Y, Eroshkin AM, Cieplak P, Ratnikov B, Zhang Y, Li Z, Godzik A, Osterman AL, Smith JW. Structural determinants of limited proteolysis. Journal of Proteome Research 10(8):3642–3651, 2011.

Integrative genomic reconstruction of carbohydrate utilization in bifidobacteria — Phylogeny and effector specificities of ROK-family transcriptional regulators in Thermotogae. *Nucleic Acids Res.* 2013.

Our research group applies computational methods to study the genomics of bacteria comprising the human microbiome and multi-drug resistant pathogenic bacteria. In several projects devoted to these topics in collaboration with Prof. Andrei Osterman and Prof. Dmitry Rodionov (Sanford-Burnham Medical Research Institute, USA), we studied bacterial metabolism by reconstructing the regulatory networks. In projects in collaboration with the research groups from Dmitry Rogachev National Medical Research Center of Pediatric Hematology, Oncology, and Immunology, we performed genomic analysis of the antimicrobial resistance and virulence determinants of clinical isolates from immunocompromised pediatric patients.

Selected publications

Arzamasov AA, Rodionov DA, Hibberd MC, Guruge JL, Kent JE, Kazanov MD, Leyn SA et al. Integrative genomic reconstruction reveals heterogeneity in carbohydrate utilization across human gut bifidobacteria. Nat Microbiol. 10(8):2031–2047, 2025.
Kazanov MD, Li X, Gelfand MS, Osterman AL, Rodionov DA. Functional diversification of ROK-family transcriptional regulators of sugar catabolism in the Thermotogae phylum. Nucleic Acids Research 41(2):790–803, 2013.
Sun EI, Leyn SA, Kazanov MD, Saier MH Jr, Novichkov PS, Rodionov DA. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria. BMC Genomics 14:597, 2013.
Rodionov DA, Novichkov PS, Stavrovskaya ED, Rodionova IA, Li X, Kazanov MD et al. Comparative genomic reconstruction of transcriptional networks controlling central metabolism in the Shewanella genus. BMC Genomics 12 Suppl 1:S3, 2011.

A recurring theme across several projects is the application of machine learning methods to predict biologically relevant properties directly from protein sequences and structures. Working with large experimental datasets of protease cleavage events, we developed supervised learning models that capture structural and physico-chemical determinants of limited proteolysis and of structural susceptibility to proteolytic processing. These studies demonstrate how statistical learning can extract generalizable rules from high-throughput biochemical data and translate them into predictive tools applicable to genome-scale sequence analysis.

Selected publications

Matveev EV, Safronov VV, Ponomarev GV, Kazanov MD. Predicting Structural Susceptibility of Proteins to Proteolytic Processing. Int. J. Mol. Sci. 24, 10761, 2023.
Kazanov MD, Igarashi Y, Eroshkin AM, Cieplak P, Ratnikov B, Zhang Y, Li Z, Godzik A, Osterman AL, Smith JW. Structural determinants of limited proteolysis. Journal of Proteome Research 10(8):3642–3651, 2011.
Hibberd MC, Webber DM, Rodionov DA, Henrissat S, Chen RY, Zhou C, Lynn HM, Wang Y, Chang HW, Lee EM, Lelwala-Guruge J, Kazanov MD, Arzamasov AA, Leyn SA, Lombard V, Terrapon N, Henrissat B, Castillo JJ, Couture G, Bacalzo NP Jr, Chen Y, Lebrilla CB, Mostafa I, Das S, Mahfuz M, Barratt MJ, Osterman AL, Ahmed T, Gordon JI. Bioactive glycans in a microbiome-directed food for children with malnutrition. Nature 625(7993):157–165, 2024.

Kazanov Lab

Mutational Processes in Human Cancer

Selected publications

Host–Virus Interactions

Selected publications

Proteases in Cancer and Virus Entry: Bioinformatics Methods for the Identification of Protease Substrates

Selected publications

Genomics of Commensal and Pathogenic Bacteria

Selected publications

Machine Learning Applied to Biological Sequence Analysis

Selected publications