Difference between revisions of "Obvio"

From Knoesis wiki
Jump to: navigation, search
Line 76: Line 76:
 
=Automatic Subgraph Creation=
 
=Automatic Subgraph Creation=
 
In an evaluation of the context-driven subgraph model for LBD, several existing discoveries were rediscovered  using my approach. A manuscript detailing the approach is under preparation
 
In an evaluation of the context-driven subgraph model for LBD, several existing discoveries were rediscovered  using my approach. A manuscript detailing the approach is under preparation
<ref>'''[http://knoesis.wright.edu/students/delroy D. Cameron]''', [http://www.cs.uky.edu/people/faculty/rkavuluru Ramakanth Kavuluru], [http://www.lhncbc.nlm.nih.gov/personnel/thomas-rindflesch T. C. Rindflesch], [http://knoesis.wright.edu/amit A. P. Sheth], [http://knoesis.wright.edu/tkprasad/ K. Thirunarayan], [http://mor.nlm.nih.gov/ O. Bodenreider] Context-Driven Automatic Subgraph Creation for Literature-Based Discovery (<font color="red">(under review)</font>)</ref>  
+
<ref>'''[http://knoesis.wright.edu/students/delroy D. Cameron]''', [http://www.cs.uky.edu/people/faculty/rkavuluru Ramakanth Kavuluru], [http://www.lhncbc.nlm.nih.gov/personnel/thomas-rindflesch T. C. Rindflesch], [http://knoesis.wright.edu/amit A. P. Sheth], [http://knoesis.wright.edu/tkprasad/ K. Thirunarayan], [http://mor.nlm.nih.gov/ O. Bodenreider] Context-Driven Automatic Subgraph Creation for Literature-Based Discovery (<font color="red">under review</font>)</ref>  
  
 
=Publications=
 
=Publications=

Revision as of 00:59, 11 February 2014

Obvio (spanish for obvious) is the name of the project on semantics-based techniques for Literature-Based Discovery (LBD) using Biomedical Literature. The goal of Obvio is to uncover hidden connections between concepts in biomedical texts, thereby leading to hypothesis generation from publicly available scientific literature.

Overview

Obvio is driven by assertions extracted from structured text (called semantic predications) as well as assertions obtained from structured knowledge sources (such as the UMLS). The fundamental notion is that LBD could be greatly facilitated by the Semantic Integration of assertions extracted from scientific literature and well curated background knowledge from heterogeneous data sources.

People

Graduate Students: Delroy Cameron, Swapnil Soni, Nishita Jaykumar, Vishnu Bompally
External Collaborators: Olivier Bodenreider, Thomas C. Rindflesch, Ramakanth Kavuluru, Drashti Dave
Faculty: Krishnaprasad Thirunarayan, Amit P. Sheth (Advisor)
Past Members: Pablo N. Mendes, Tu Danh, Sreeram Vallabhaneni, Hima Yalamanchili

Applications

There are four notable research applications of semantic predications. These include:

  1. Information Retrieval (IR)
  2. Question Answering (QA)
  3. Automatic Summarization
  4. Literature-based Discovery (LBD)

Question Answering

We applied semantic predications to biomedical QA using data from the 2006 TREC Challenge using the notion of reachability to determine whether documents that answer complex biomedical questions could be connected through semantic predications, together with background knowledge<ref>D. Cameron, R. Kavuluru, O. Bodenreider, P. N. Mendes, A. P. Sheth, K. Thirunarayan, Semantic Predications for Complex Information Needs in Biomedical Literature, 5th International Conference on Bioinformatics and Biomedicine BIBM2011, Atlanta GA, November 12-15, 2011 (acceptance rate=19.4%)</ref>.

Reachability

The QA tasks put forth by the Text REtrieval Conferences (TREC) offer an opportunity to determine whether semantic predications and background knowledge can produce relevant information for complex biomedical questions and information needs. Below is our presentation from BIBM 2011 on our findings for the application of semantic predications and background knowledge to QA.

Literature-based Discovery (LBD)

A second application of semantic predications is to the field of Literature-based Discovery (LBD). LBD refers to uncovering conclusions that have never been made explicit before, but are implicit in publicly available literature. Semantic predications have been demonstrated to be important constructs in facilitating LBD by providing context among associated concepts.

RS-DFO Hypothesis

Much of the early research aimed at rediscovering Swanson's Hypotheses focused almost entirely on Information Retrieval (IR) techniques, such as term and concept co-occurrence. Only recently has significant attention been devoted to semantics-based techniques that exploit the meaning of associations between concepts. While generally more intuitive, the feasibility of such semantics-based approaches has not been fully established. It is reasonable to expect that if semantics-based techniques are adequate for discovering new knowledge, they ought to be sufficient for recovering existing knowledge.

In this work<ref>D. Cameron, O. Bodenreider, H. Yalamanchili, T. Danh, S. Vallabhaneni, K. Thirunarayan, A. P. Sheth, T. C. Rindflesch, A Graph-Based Recovery and Decomposition of Swanson’s Hypothesis using Semantic Predications, Journal of Biomedical Informatics 46(2) 238-251, (2013). ScienceDirect, PMID </ref> we investigate the applicability of semantics-based techniques for recovering and decomposing Swanson's Raynaud Syndrome--Fish Oil hypothesis using semantic predications, background knowledge and graph-based algorithms for path extraction and subgraph creation. Below is a presentation, and various datasets and experimental results for download and consumption based on our investigation.


Datasets and Experimental Results
  1. Dataset
    1. Baseline (B1)
      1. Original PDFs of the 65 articles cited by Swanson's RS-DFO paper (30.5MB)
      2. ASCII text with end-of-line text wrapping fixed
      3. Text in Medline format for parsing by SemRep
      4. SemRep Relations Output
      5. SemRep Relations Output (vascular reactivity)
      6. SemRep Extracted Predications
      7. Manually Identified Predications (vascular reactivity)
    2. Baseline (B2)
      1. Titles and abstracts of the 65 articles cited by Swanson's RS-DFO paper in Medline format for parsing by SemRep
      2. SemRep Relations Output
      3. SemRep Relations Output (vascular reactivity)
      4. SemRep Extracted Predications
      5. Manually Identified Predications (vascular reactivity)
  2. Experimental Results
    1. Association-Subgraph Comparisons (Experiment I)
    2. Association-Subgraph Comparisons (Experiment II)
    3. All Generated Subgraphs (Experiments 1 & 2)

Testosterone-Sleep

Preliminary results on recovering the Testosterone-Cortisol-Sleep Discovery using our approach for automatic subgraph creation support the validity of our technique.

Automatic Summarization

There is vibrant research on automatic summarization of scientific literature using semantic predications at the National Library of Medicine. Much of this work is embodied in Semantic Medline which is a tool for LBD developed by Thomas C. Rindflesch.

Information Retrieval

Another application of semantic predications is to the field of Information Retrieval. By modeling documents as a collection of predications (i.e., a subgraph), and modeling a search query as a subgraph as well, documents can be ranked based on their semantic similarity to the search query using subgraph-to-subraph query analysis.

Automatic Subgraph Creation

In an evaluation of the context-driven subgraph model for LBD, several existing discoveries were rediscovered using my approach. A manuscript detailing the approach is under preparation <ref>D. Cameron, Ramakanth Kavuluru, T. C. Rindflesch, A. P. Sheth, K. Thirunarayan, O. Bodenreider Context-Driven Automatic Subgraph Creation for Literature-Based Discovery (under review)</ref>

Publications

<references/>

SWLBD Workshop Series

Kno.e.sis and the National Library of Medicine (NLM) organized The First International Workshop on the role of Semantic Web in Literature-Based Discovery (SWLBD2012) in conjunction with The IEEE Conference on Bioinformatics and Biomedicine (BIBM2012) in Philadelphia PA, USA.

  • Due date for full workshop papers submission: Aug 6, 2012
  • Notification of paper acceptance to authors: August 28, 2012
  • Camera-ready version of accepted papers: September 4, 2012
  • Workshop: October 4, 2012

Internal

Obvio Web App
Automatic Subgraph Creation
Recovery and Decomposition
Reachability

Contact: Delroy Cameron