Logo

Publikacije (75)

Nazad
S. Malikić, Simone Ciccolella, F. Mehrabadi, Camir Ricketts, Khaledur Rahman, Ehsan Haghshenas, Daniel N Seidman, Faraz Hach et al.

Recent technological advances in single cell sequencing (SCS) provide high resolution data for studying intra-tumor heterogeneity and tumor evolution. Available computational methods for tumor phylogeny inference via SCS typically aim to identify the most likely perfect phylogeny tree satisfying infinite sites assumption (ISA). However limitations of SCS technologies such as frequent allele dropout or highly variable sequence coverage, commonly result in mutational call errors and prohibit a perfect phylogeny. In addition, ISA violations are commonly observed in tumor phylogenies due to the loss of heterozygosity, deletions and convergent evolution. In order to address such limitations, we, for the first time, introduce a new combinatorial formulation that integrates single cell sequencing data with matching bulk sequencing data, with the objective of minimizing a linear combination of (i) potential false negatives (due to e.g. allele dropout or variance in sequence coverage) and (ii) potential false positives (due to e.g. read errors) among mutation calls, as well as (iii) the number of mutations that violate ISA - to define the optimal sub-perfect phylogeny. Our formulation ensures that several lineage constraints imposed by the use of variant allele frequencies (VAFs, derived from bulk sequence data) are satisfied. We express our formulation both in the form of an integer linear program (ILP) and - for the first time in the context of tumor phylogeny reconstruction - a boolean constraint satisfaction problem (CSP) and solve them by leveraging state-of-the-art ILP/CSP solvers. The resulting method, which we name PhISCS, is the first to integrate SCS and bulk sequencing data under the finite sites model. Using several simulated and real SCS data sets, we demonstrate that PhISCS is not only more general but also more accurate than the alternative tumor phylogeny inference tools. PhISCS is very fast especially when its CSP based variant is used returns the optimal solution, except in rare instances for which it provides an optimality gap. PhISCS is available at https://github.com/haghshenas/PhISCS.

Clemency Jolly, M. Gerstung, I. Leshchiner, S. Dentro, Santiago Gonzalez, T. Mitchell, Yulia Rubanova, Pavana Anur et al.

Cancer develops through a continuous process of somatic evolution. Whole genome sequencing provides a snapshot of the tumor genome at the point of sampling, however, the data can contain information that permits the reconstruction of a tumor9s evolutionary past. Here, we apply such life history analyses on an unprecedented scale, to a set of 2,658 tumors spanning 39 cancer types. We estimated the timing of large chromosomal gains during tumor evolution, by comparing the rates of doubled to non-doubled point mutations within gained regions. Although we find that such events typically occur in the second half of clonal evolution, we also observe distinctive and early chromosomal gains in some cancer types, such as gains of chromosomes 7, 19 and 20 in glioblastoma, and isochromosome 17q in medulloblastoma. By integrating these results with the qualitative timing of individual driver mutations, we obtained an overall ranking, from early to late, of frequent somatic events per cancer type, which both identified novel patterns of tumor evolution, and incorporated additional detail into known models, such as the progression of APC-KRAS-TP53 in colorectal cancer proposed by Vogelstein and Fearon. To estimate how mutational processes acting on the tumor genome change over time, we classified mutations in each sample according to three broad time periods (early clonal, late clonal, and subclonal), and quantified the activity of mutational signatures in each period. Most mutational processes appear to remain remarkably constant, however, certain signatures show clear and consistent changes during clonal evolution. Particularly, mutational signatures associated with exposure to carcinogens, such as smoking and UV light, tend to decrease over time. In contrast, signatures associated with defective endogenous processes, such as APOBEC mutagenesis and defective double strand break repair, show an increase between early and late phases of tumor evolution. Making use of clock-like mutational signatures, we converted mutational time estimates for large events, such as whole genome duplication (WGD), and the emergence of the most recent common ancestor (MRCA), into real time estimates, which allowed us to combine our analyses into overall timelines of cancer evolution, per tumor type. For example, the typical timeline of ovarian adenocarcinoma development shows that early tumor evolution is characterized by mutations in TP53, and widespread genome instability, with WGD events taking place on average 8 years prior to diagnosis. In later stages of evolution, signatures of defective repair processes increase, and the MRCA emerges on average 1 year before diagnosis. Taken together, these data reveal the common and divergent evolutionary trajectories available to a cancer, which might be crucial in understanding specific tumor biology, and in providing new opportunities for early detection and cancer prevention. Citation Format: Clemency Jolly, Moritz Gerstung, Ignaty Leshchiner, Stefan C. Dentro, Santiago Gonzalez, Thomas J. Mitchell, Yulia Rubanova, Pavana Anur, Daniel Rosebrock, Kaixian Yu, Maxime Tarabichi, Amit Deshwar, Jeff Wintersinger, Kortine Kleinheinz, Ignacio Vasquez-Garcia, Kerstin Haase, Subhajit Sengupta, Geoff Macintyre, Salem Malikic, Nilgun Donmez, Dimitri G. Livitz, Mark Cmero, Jonas Demeulemeester, Steve Schumacher, Yu Fan, Xiaotong Yao, Juhee Lee, Matthias Schlesner, Paul C. Boutros, David D. Bowtell, Hongtu Zhu, Gad Getz, Marcin Imielinski, Rameen Beroukhim, S Cenk Sahinalp, Yuan Ji, Martin Peifer, Florian Markowetz, Ville Mustonen, Ke Juan, Wenyi Wang, Quaid D. Morris, Paul T. Spellman, David C. Wedge, Peter Van Loo, PCAWG Evolution and Heterogeneity Working Group. The evolutionary history of 2,658 cancers [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 218.

S. Dentro, I. Leshchiner, K. Haase, J. Wintersinger, A. Deshwar, M. Tarabichi, Yulia Rubanova, K. Yu et al.

We have characterised intra-tumour heterogeneity (ITH) across 2,778 whole genome sequences of tumours in the International Cancer Genome Consortium Pan-Cancer Analysis of Whole Genomes project, representing 36 distinct cancer types. We applied 6 copy number (CNA) callers and 11 subclonal reconstruction algorithms and developed approaches to integrate the results in robust, high-confidence CNA calls and subclonal architectures. The analysis reveals widespread ITH. We find at least one subclone in nearly all (96.7%) tumours with sufficient sequencing depth. Analysis using dN/dS ratios yields clear signs of positive selection in clonal and subclonal mutations and we find subclonal driver mutations in known driver genes. However, only 24% of subclones contain a driver mutation in a known driver gene, suggesting that a multitude of undiscovered late drivers exist and that tumours continue to undergo selection after tumourigenesis, at least until diagnosis. Consistent with other studies, we find that in 9% of tumours all clinically actionable mutations are subclonal, while 20% of tumours contain at least one subclonal actionable driver. These findings emphasise the relevance of ITH in treatment decision making. Distinct patterns of ITH emerge; for example, prostate, uterus and esophageal adenocarcinomas show high proportions of both subclonal single nucleotide variants (SNVs) and CNAs. Kidney chromophobe and pancreatic endocrine tumours also contain high proportions of subclonal SNVs, but few subclonal CNAs. On the other hand, hepatocellular carcinomas and head-and-neck and lung SCCs contain low proportions of subclonal SNVs and high proportions of subclonal CNAs. Mutational signature analysis reveals changes in signature activity. Exposures to UV light in melanomas and acid reflux in stomach and oesophageal cancers contribute more clonal mutations. While APOBEC and DNA damage repair response related signatures show increased activity in subclones. These findings highlight distinct evolutionary narratives between and within histologically distinct tumour types. Citation Format: Stefan Dentro, Ignaty Leshchiner, Kerstin Haase, Jeff Wintersinger, Amit Deshwar, Maxime Tarabichi, Yulia Rubanova, Kaixian Yu, Ignacio Vazquez Garcia, Geoff Macintyre, Kortine Kleinheinz, Dimitri Livitz, Salem Malikic, Nilgun Donmez, Subhajit Sengupta, Yuan Ji, Jonas Demeulemeester, Pavana Anur, Clemency Jolly, Marek Cmero, Daniel Rosebrock, Steve Schumacher, Yu Fan, Matthew Fittall, Xiaotong Yao, Juhee Lee, Matthias Schlesner, Hongtu Zhu, David Adams, Gad Getz, Paul Boutros, Marcin Imielinski, Rameen Beroukhim, Cenk Sahinalp, Martin Peifer, Inigo Martincorena, Florian Markowetz, Ville Mustonen, Ke Yuan, Moritz Gerstung, Wenyi Wang, Paul Spellman, Quaid Morris, David Wedge, Peter Van Loo. Pervasive intra-tumour heterogeneity and subclonal selection across cancer types [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 3000.

S. Dentro, I. Leshchiner, K. Haase, M. Tarabichi, J. Wintersinger, A. Deshwar, K. Yu, Yulia Rubanova et al.

Ongoing cancer evolution gives rise to intra-tumour heterogeneity (ITH), which is a major mechanism of therapeutic resistance and therefore an important clinical challenge. However, the extent, origin and drivers of ITH across cancer types are poorly understood. Here, we extensively characterise ITH across 2,778 cancer whole genome sequences from 36 cancer types. We demonstrate that nearly all tumours (94.7%) with sufficient sequencing depth contain evidence of recent subclonal expansions, and that most cancer types show clear signs of positive selection in both clonal and subclonal protein coding variants. We find distinctive subclonal patterns of driver gene mutations, fusions, structural variation and copy-number alterations across cancer types. Dynamic, tumour type-specific changes of mutational processes between subclonal expansions shape differences between clonal and subclonal events. Our results underline the importance of ITH and its drivers in tumour evolution, and provide an unprecedented pan-cancer resource of extensively annotated subclonal events, laying a foundation for future cancer genomic studies.

Ibrahim Numanagić, Ibrahim Numanagić, S. Malikić, Michael Ford, X. Qin, L. Toji, Milan Radovich, T. Skaar et al.

High-throughput sequencing provides the means to determine the allelic decomposition for any gene of interest—the number of copies and the exact sequence content of each copy of a gene. Although many clinically and functionally important genes are highly polymorphic and have undergone structural alterations, no high-throughput sequencing data analysis tool has yet been designed to effectively solve the full allelic decomposition problem. Here we introduce a combinatorial optimization framework that successfully resolves this challenging problem, including for genes with structural alterations. We provide an associated computational tool Aldy that performs allelic decomposition of highly polymorphic, multi-copy genes through using whole or targeted genome sequencing data. For a large diverse sequencing data set, Aldy identifies multiple rare and novel alleles for several important pharmacogenes, significantly improving upon the accuracy and utility of current genotyping assays. As more data sets become available, we expect Aldy to become an essential component of genotyping toolkits. Many genes of functional and clinical significance are highly polymorphic and experience structural alterations. Here, Numanagić et al. develop Aldy, a computational tool for resolving the copy number and the sequence content of each copy of a gene by analyzing whole or targeted genome sequencing data.

S. Malikić, S. Malikić, Katharina Jahn, Katharina Jahn, J. Kuipers, J. Kuipers, S. C. Sahinalp, N. Beerenwinkel et al.

Understanding the clonal architecture and evolutionary history of a tumour poses one of the key challenges to overcome treatment failure due to resistant cell populations. Previously, studies on subclonal tumour evolution have been primarily based on bulk sequencing and in some recent cases on single-cell sequencing data. Either data type alone has shortcomings with regard to this task, but methods integrating both data types have been lacking. Here, we present B-SCITE, the first computational approach that infers tumour phylogenies from combined single-cell and bulk sequencing data. Using a comprehensive set of simulated data, we show that B-SCITE systematically outperforms existing methods with respect to tree reconstruction accuracy and subclone identification. B-SCITE provides high-fidelity reconstructions even with a modest number of single cells and in cases where bulk allele frequencies are affected by copy number changes. On real tumour data, B-SCITE generated mutation histories show high concordance with expert generated trees. Intra-tumour heterogeneity provides important information about subclonal tumour evolution. Here, the authors develop B-SCITE, a computational method for inferring tumour phylogenies from combined single-cell and bulk sequencing data.

M. Gerstung, Clemency Jolly, I. Leshchiner, S. Dentro, Santiago Gonzalez, T. Mitchell, Yulia Rubanova, Pavana Anur et al.

Cancer develops through a process of somatic evolution1,2. Sequencing data from a single biopsy represent a snapshot of this process that can reveal the timing of specific genomic aberrations and the changing influence of mutational processes3. Here, by whole-genome sequencing analysis of 2,658 cancers as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)4, we reconstruct the life history and evolution of mutational processes and driver mutation sequences of 38 types of cancer. Early oncogenesis is characterized by mutations in a constrained set of driver genes, and specific copy number gains, such as trisomy 7 in glioblastoma and isochromosome 17q in medulloblastoma. The mutational spectrum changes significantly throughout tumour evolution in 40% of samples. A nearly fourfold diversification of driver genes and increased genomic instability are features of later stages. Copy number alterations often occur in mitotic crises, and lead to simultaneous gains of chromosomal segments. Timing analyses suggest that driver mutations often precede diagnosis by many years, if not decades. Together, these results determine the evolutionary trajectories of cancer, and highlight opportunities for early cancer detection. Whole-genome sequencing data for 2,778 cancer samples from 2,658 unique donors across 38 cancer types is used to reconstruct the evolutionary history of cancer, revealing that driver mutations can precede diagnosis by several years to decades.

Yuan Yuan, Young Seok Ju, Youngwook Kim, Jun Li, Yumeng Wang, Christopher J Yoon, Yang Yang, I. Martincorena et al.

30. 6. 2017.
62
M. Tarabichi, I. Martincorena, M. Gerstung, A. Leroi, Florian Markowetz, Stefan C. Ignaty Moritz Clemency Kerstin Maxime Jeff Amit Dentro Leshchiner Gerstung Jolly Haase Tarabichi W, S. Dentro, I. Leshchiner et al.

Williams et al. (Nat. Genet. 48:238-224, 2016) recently reported neutral tumor evolution in one third of 904 samples from The Cancer Genome Atlas. Here, we assess the reproducibility and validity of their method and the extent of positive selection in subclonal mutations across cancer types. Our results do not support observable neutral tumor evolution and uncover strong positive selection within subclonal mutations across cancers.

Ibrahim Numanagić, S. Malikić, V. Pratt, T. Skaar, D. Flockhart, S. C. Sahinalp

Motivation: CYP2D6 is highly polymorphic gene which encodes the (CYP2D6) enzyme, involved in the metabolism of 20–25% of all clinically prescribed drugs and other xenobiotics in the human body. CYP2D6 genotyping is recommended prior to treatment decisions involving one or more of the numerous drugs sensitive to CYP2D6 allelic composition. In this context, high-throughput sequencing (HTS) technologies provide a promising time-efficient and cost-effective alternative to currently used genotyping techniques. To achieve accurate interpretation of HTS data, however, one needs to overcome several obstacles such as high sequence similarity and genetic recombinations between CYP2D6 and evolutionarily related pseudogenes CYP2D7 and CYP2D8, high copy number variation among individuals and short read lengths generated by HTS technologies. Results: In this work, we present the first algorithm to computationally infer CYP2D6 genotype at basepair resolution from HTS data. Our algorithm is able to resolve complex genotypes, including alleles that are the products of duplication, deletion and fusion events involving CYP2D6 and its evolutionarily related cousin CYP2D7. Through extensive experiments using simulated and real datasets, we show that our algorithm accurately solves this important problem with potential clinical implications. Availability and implementation: Cypiripi is available at http://sfu-compbio.github.io/cypiripi. Contact: cenk@sfu.ca.

Nema pronađenih rezultata, molimo da izmjenite uslove pretrage i pokušate ponovo!

Pretplatite se na novosti o BH Akademskom Imeniku

Ova stranica koristi kolačiće da bi vam pružila najbolje iskustvo

Saznaj više