Associating biological context with protein-protein interactions through text mining at PubMed scale.

Resource Type: Academic Journal
Authors: Sosa DN (Stanford University, Department of Biomedical Data Science, Stanford, CA, USA); Hintzen R (BenevolentAI, London, UK); Xiong B (Stanford University, Department of Biomedical Data Science, Stanford, CA, USA); de Giorgio A (BenevolentAI, London, UK); Fauqueur J (BenevolentAI, London, UK); Davies M (BenevolentAI, London, UK); Lever J (University of Glasgow, Glasgow, UK); Altman RB (Stanford University, Department of Bioengineering, Stanford, CA, USA; Stanford University, Department of Genetics, Stanford, CA, USA. Electronic address: russ.altman@stanford.edu).
Source: Publisher: Elsevier; Country of Publication: United States; NLM ID: 100970413; Publication Model: Print-Electronic; Cited Medium: Internet; ISSN: 1532-0480 (Electronic); Linking ISSN: 15320464; NLM ISO Abbreviation: J Biomed Inform; Subsets: MEDLINE
Language: English

Abstract

Inferring knowledge from known relationships between drugs, proteins, genes, and diseases has great potential for clinical impact, such as predicting which existing drugs could be repurposed to treat rare diseases. Incorporating key biological context such as cell type or tissue of action into representations of extracted biomedical knowledge is essential for principled pharmacological discovery. Existing global, literature-derived knowledge graphs of interactions between drugs, proteins, genes, and diseases lack this essential information. In this study, we frame the task of associating biological context with protein-protein interactions extracted from text as a classification task using syntactic, semantic, and novel meta-discourse features. We introduce the Insider corpora, which are automatically generated PubMed-scale corpora for training classifiers for the context association task. These corpora are created by searching for precise syntactic cues of cell type and tissue relevancy to extracted regulatory relations. We report F1 scores of 0.955 and 0.862 for identifying relevant cell types and tissues, respectively, for our identified relations. By classifying with this framework, we demonstrate that the problem of context association can be addressed using intuitive, interpretable features. We demonstrate the potential of this approach to enrich text-derived knowledge bases with biological detail by incorporating cell type context into a protein-protein network for dengue fever.
Competing Interests: The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Russ B. Altman reports a relationship with BenevolentAI that includes: consulting or advisory. We declare no conflicts of interest. This research was supported by a grant to Stanford University from BenevolentAI. RBA is an advisor to BenevolentAI.
(Copyright © 2023 Elsevier Inc. All rights reserved.)
