Publication type: Article in scientific journal
Type of review: Peer review (publication)
Title: Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines
Authors: Smith, Ellery
Paloots, Rahel
Giagkos, Dimitris
Baudis, Michael
Stockinger, Kurt
et. al: No
DOI: 10.1093/bioadv/vbae045
Published in: Bioinformatics Advances
Issue Date: Mar-2024
Publisher / Ed. Institution: Oxford University Press
ISSN: 2635-0041
Language: English
Subjects: Cancer cell line; Copy number variant; Information extraction; Natural language processing
Subject (DDC): 005: Computer programming, programs and data
006: Special computer methods
Abstract: With the proliferation of research means and computational methodologies, published biomedical literature is growing exponentially in numbers and volume. Cancer cell lines are frequently-used models in biological and medical research that are currently applied for a wide range of purposes, from studies of cellular mechanisms to drug development, which has led to a wealth of related data and publications. Sifting through large quantities of text to gather relevant information on cell lines of interest is tedious and extremely slow when performed by humans. Hence, novel computational information extraction and correlation mechanisms are required to boost meaningful knowledge extraction. In this work, we present the design, implementation and application of a novel data extraction and exploration system. This system extracts deep semantic relations between textual entities from scientific literature to enrich existing structured clinical data concerning cancer cell lines. We introduce a new public data exploration portal, which enables automatic linking of genomic copy number variants plots with ranked, related entities such as affected genes. Each relation is accompanied by literature-derived evidences, allowing for deep, yet rapid, literature search, using existing structured data as a springboard.
URI: https://digitalcollection.zhaw.ch/handle/11475/30347
Related research data: https://github.com/progenetix/cancercelllines-web
Fulltext version: Accepted version
License (according to publishing contract): Licence according to publishing contract
Departement: School of Engineering
Organisational Unit: Institute of Computer Science (InIT)
Published as part of the ZHAW project: INODE – Intelligent Open Data Exploration (EU Horizon 2020)
Appears in collections:Publikationen School of Engineering

Files in This Item:
There are no files associated with this item.
Show full item record
Smith, E., Paloots, R., Giagkos, D., Baudis, M., & Stockinger, K. (2024). Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines. Bioinformatics Advances. https://doi.org/10.1093/bioadv/vbae045
Smith, E. et al. (2024) ‘Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines’, Bioinformatics Advances [Preprint]. Available at: https://doi.org/10.1093/bioadv/vbae045.
E. Smith, R. Paloots, D. Giagkos, M. Baudis, and K. Stockinger, “Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines,” Bioinformatics Advances, Mar. 2024, doi: 10.1093/bioadv/vbae045.
SMITH, Ellery, Rahel PALOOTS, Dimitris GIAGKOS, Michael BAUDIS und Kurt STOCKINGER, 2024. Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines. Bioinformatics Advances. März 2024. DOI 10.1093/bioadv/vbae045
Smith, Ellery, Rahel Paloots, Dimitris Giagkos, Michael Baudis, and Kurt Stockinger. 2024. “Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines.” Bioinformatics Advances, March. https://doi.org/10.1093/bioadv/vbae045.
Smith, Ellery, et al. “Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines.” Bioinformatics Advances, Mar. 2024, https://doi.org/10.1093/bioadv/vbae045.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.