Difference between revisions of "PREDOSE"

From Knoesis wiki
Jump to: navigation, search
(updated project overview)
Line 23: Line 23:
  
 
{{block
 
{{block
   |title= Project Description
+
   |title= Project Overview
 
   |title_background=#77bbbb
 
   |title_background=#77bbbb
 
   |content=  
 
   |content=  
This research project aims to develop a mechanism to automate 'qualitative coding' in social research by automatically extracting triples from web data, particularly web forum posts. The goal of such triple extraction is to provide a framework that can be exploited to study user knowledge, attitudes and behaviors as it relates to non-medical use of pharmaceutical opiods (e.g. OxyContin, buprenorphine etc). Interesting areas include 1) Social Network analysis, intended to determine information diffusion patterns and 2) Spatial-Temporal-Thematic analysis, intended to determine trends within the community regarding usage, distribution, of method of administration of pharmaceutical opioids (including Suboxone and Subutex, which are buprenorphine products).
+
'''Problem:''' Historically, qualitative research has been characterized by manual data collection, initiated by interactive interview sessions with individual or a group of individual addicts. The audio-to-text transcribed interviews obtained from this process are then typically annotated by researchers/experts with themes or topics that surfaced during interview sessions. Various tools, such as ... have been developed to facilitate this annotation process, and provide additional service such as search, retrieval and data analysis. However, the intensive manual effort required to make the interactive approach scalable is enormous. Furthermore, to effectively process the large volume and complexity of the Web-based data, the field certainly needs a highly automated way of accessing and processing Web data.  
 +
 
 +
'''Proposed Solution:''' Researchers at the Kno.e.sis Center at Wright State University have successfully applied Semantic Web, Machine Learning and Natural Language Processing techniques to <u>automatically extract knowledge</u> from biomedical text. Substantial progress has also been made in using these and other techniques to <u>understand the content and identify social perceptions</u> through metadata extraction and spatio-temporal and thematic analysis (broadly termed semantic analysis) of <u>informal text</u> on MySpace, Facebook, and Twitter. These cutting-edge information processing techniques, with appropriate adaptations can now be exploited to fit the needs of public health and drug abuse research.
 
}}
 
}}
  
 
{{block
 
{{block
   |title= Architecture and Research Plan
+
   |title= Research Plan
 
   |title_background=#77bbbb
 
   |title_background=#77bbbb
   |content=  
+
   |content= The overall research plan has three(3) distinct stages 1) Data Collection 2) Automatic Qualitative Coding and 3) Data Analysis & Interpretation
  
 
}}
 
}}
 +
====Stage 1: Data Collection====
 +
 +
====Stage 2: Automatic Qualitative Coding====
 +
 
[[Image:Citar-research-plan-071811.png | center | 600px | thumb | Fig1: Research Plan]]
 
[[Image:Citar-research-plan-071811.png | center | 600px | thumb | Fig1: Research Plan]]
  
 +
====Stage 3: Data Analysis & Interpretation====
 +
<!--
 
{{block
 
{{block
 
   |title= Publications
 
   |title= Publications
Line 43: Line 51:
 
N/A
 
N/A
 
}}
 
}}
 +
-->
  
 
=Funding=
 
=Funding=

Revision as of 02:47, 19 July 2011

Introduction

The non-medical use of pharmaceutical opioids has been identified as one of the fastest growing forms of drug abuse in the U.S. Furthermore, significant increases in the illicit use of pharmaceutical opioids have expanded the pathways to heroin addiction and resulted in escalating rates of accidental overdose deaths. To design effective and responsive prevention and policy measures, public health professionals require timely and reliable information on new and emerging drug trends. Although existing epidemiological data systems provide critically important information about drug abuse trends, they are often time-lagged. There is therefore a need for epidemiological sources that could complement existing drug trend monitoring systems and enhance their capacity for early identification of new and emerging trends. The World Wide Web (Web) has been identified as one of the leading data sources for detecting patterns and changes in the non-medical use of pharmaceutical and other illicit drugs. Many Web 2.0 empowered social platforms, including Web forums, provide venues for individuals to freely share their experiences, post questions, and offer comments about different drugs.


This project aims to address this critical need for relevant and timely information by pursuing two(2) specific goals:

Goals
  1. To determine user knowledge attitudes and behavior related to the non-medical use of pharmaceutical opioids (namely buprenorphine) as discussed on Web-based forums
  2. To determine spatio-temporal trends and patterns in pharmaceutical opioid abuse as discussed on Web-based forums
Project Team

Principal Investigators: Raminta Daniulaityte, Amit P. Sheth
Co-Investigators: Robert Carlson, Russel Falck
Graduate Students: Delroy Cameron, Sujan Udayanga

Project Overview

Problem: Historically, qualitative research has been characterized by manual data collection, initiated by interactive interview sessions with individual or a group of individual addicts. The audio-to-text transcribed interviews obtained from this process are then typically annotated by researchers/experts with themes or topics that surfaced during interview sessions. Various tools, such as ... have been developed to facilitate this annotation process, and provide additional service such as search, retrieval and data analysis. However, the intensive manual effort required to make the interactive approach scalable is enormous. Furthermore, to effectively process the large volume and complexity of the Web-based data, the field certainly needs a highly automated way of accessing and processing Web data.

Proposed Solution: Researchers at the Kno.e.sis Center at Wright State University have successfully applied Semantic Web, Machine Learning and Natural Language Processing techniques to automatically extract knowledge from biomedical text. Substantial progress has also been made in using these and other techniques to understand the content and identify social perceptions through metadata extraction and spatio-temporal and thematic analysis (broadly termed semantic analysis) of informal text on MySpace, Facebook, and Twitter. These cutting-edge information processing techniques, with appropriate adaptations can now be exploited to fit the needs of public health and drug abuse research.

Research Plan

The overall research plan has three(3) distinct stages 1) Data Collection 2) Automatic Qualitative Coding and 3) Data Analysis & Interpretation

Stage 1: Data Collection

Stage 2: Automatic Qualitative Coding

Fig1: Research Plan

Stage 3: Data Analysis & Interpretation

Funding

This project is sponsored by the National Institutes of Health (NIH) Grant Award No. R21 DA030571-01A1 to the Ohio Center for Excellence in Knowledge-enabled Computing (Kno.e.sis) and the Center for Treatment, Interventions and Addictions Research (CITAR) titled “A Study of Social Web Data on Buprenorphine Abuse using Semantic Web Technology.” Any opinions, findings, conclusions or recommendations expressed in this material are those of the investigator(s) and do not necessarily reflect the views of the National Institutes of Health.

Contact: Delroy Cameron