Difference between revisions of "Satya"

From Knoesis wiki
Line 1: Line 1:
 
'''Satya S. Sahoo''' <br/>
 
'''Satya S. Sahoo''' <br/>
Ph.D candidate, Computer Science and Engineering<br/>
+
Ph.D candidate, CSE<br/>
 +
Kno.e.sis Center, Wright State University
  
 
==Updates==
 
==Updates==
Co-organizing a full-day Workshop on "Role of Semantic Web in Provenance Management" [http://wiki.knoesis.org/index.php/SWPM-2009 SWPM'09], October 25 at ISWC 2009!
+
* Co-organizing a full-day Workshop on "Role of Semantic Web in Provenance Management" [http://wiki.knoesis.org/index.php/SWPM-2009 SWPM'09], October 25 at ISWC 2009!
 +
* Nominated as member to [http://www.w3.org/2005/Incubator/prov/charter W3C Provenance Incubator Group]. The focus of the XG is "in the area of provenance for Semantic Web technologies, development, and possible standardization".
  
 
==Research Interests==
 
==Research Interests==
Line 13: Line 15:
 
* '''Provenance Representation''': An upper-level ontology for representation of provenance information called [http://wiki.knoesis.org/index.php/Provenir_Ontology '''provenir'''].  
 
* '''Provenance Representation''': An upper-level ontology for representation of provenance information called [http://wiki.knoesis.org/index.php/Provenir_Ontology '''provenir'''].  
 
* '''Provenance Analysis''': A set of provenance query operators has been defined based on a systematic classification of provenance queries. The provenance classification scheme has been proposed for the first time in provenance research.
 
* '''Provenance Analysis''': A set of provenance query operators has been defined based on a systematic classification of provenance queries. The provenance classification scheme has been proposed for the first time in provenance research.
* '''Provenance Management Infrastructure''': A provenance query engine has been implemented to support the provenance query operators over a RDF data store. The query engine uses a new class of materialized views called ''Materialized Provenance View'' (MPV), for optimizing performance of complex queries over large datasets (~308 million RDF triple).
+
* '''Provenance Management Infrastructure''': A provenance query engine has been implemented to support the provenance query operators over a RDF data store. The query engine uses a new class of materialized views called ''Materialized Provenance View'' (MPV), for optimizing performance of complex queries over large datasets.
 
===Semantics and Services enabled Problem Solving Environment for ''T.cruzi''===
 
===Semantics and Services enabled Problem Solving Environment for ''T.cruzi''===
Aim: develop and deploy a novel ontology-driven problem-solving environment for T.cruzi.
+
Aim: develop and deploy a novel ontology-driven problem-solving environment for ''T.cruzi''.
 
* Ontology Development: Development of two ontologies (a) [http://bioportal.bioontology.org/ontologies/39544 Parasite Life cycle ontology], (b) [http://bioportal.bioontology.org/ontologies/40425 Parasite Experiment ontology]. Both have been released to the National Center for Biomedical Ontologies (NCBO)<br/>
 
* Ontology Development: Development of two ontologies (a) [http://bioportal.bioontology.org/ontologies/39544 Parasite Life cycle ontology], (b) [http://bioportal.bioontology.org/ontologies/40425 Parasite Experiment ontology]. Both have been released to the National Center for Biomedical Ontologies (NCBO)<br/>
 
Collaborators: ''Tarleton Research Group, University of Georgia'' and ''The Wellcome Trust Sanger Institute, Cambridge, UK''
 
Collaborators: ''Tarleton Research Group, University of Georgia'' and ''The Wellcome Trust Sanger Institute, Cambridge, UK''
 
* Parasite Knowledge Repository: Integrated repository in RDF format of gene knockout, strains, proteomics, pathway, and microarray data for query answering in parasite research
 
* Parasite Knowledge Repository: Integrated repository in RDF format of gene knockout, strains, proteomics, pathway, and microarray data for query answering in parasite research
 
* Trykipedia: Explore use of Wiki-based platform for collaborative ontology development [http://wiki.knoesis.org/index.php/Trykipedia Trykipedia]<br/>
 
* Trykipedia: Explore use of Wiki-based platform for collaborative ontology development [http://wiki.knoesis.org/index.php/Trykipedia Trykipedia]<br/>
 +
==Research Internship==
 +
* Microsoft Research, Redmond (Summer 2008) - eScience Team
 +
* Lister Hill National Center for Biomedical Communications (NIH) (Summer 2007, 2006) - Medical Ontology Research<br/>
 
==Select Publications==
 
==Select Publications==
 
===Provenance Management Framework===
 
===Provenance Management Framework===
Line 28: Line 33:
 
* S.S. Sahoo, O. Bodenreider, J.L. Rutter, K.J. Skinner, A.P. Sheth, “An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence,” Journal of Biomedical Informatics (Special Issue: Semantic Mashup of Biomedical Data), 41(5), 752-765, Oct. 2008 [http://dx.doi.org/10.1016/j.jbi.2008.02.006 pdf]
 
* S.S. Sahoo, O. Bodenreider, J.L. Rutter, K.J. Skinner, A.P. Sheth, “An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence,” Journal of Biomedical Informatics (Special Issue: Semantic Mashup of Biomedical Data), 41(5), 752-765, Oct. 2008 [http://dx.doi.org/10.1016/j.jbi.2008.02.006 pdf]
 
* S.S. Sahoo, K. Zeng, O. Bodenreider, A.P. Sheth, “From ‘glycosyltransferase’ to ‘congenital muscular dystrophy’: Integrating knowledge from NCBI Entrez Gene and the Gene Ontology,”  Medinfo 2007, Brisbane, Australia, 20-24 August, 2007, IOS Press, 2007, pp. 1260–64. [http://mor.nlm.nih.gov:8000/pubs/pdf/2007-medinfo-ss.pdf pdf]<br/>
 
* S.S. Sahoo, K. Zeng, O. Bodenreider, A.P. Sheth, “From ‘glycosyltransferase’ to ‘congenital muscular dystrophy’: Integrating knowledge from NCBI Entrez Gene and the Gene Ontology,”  Medinfo 2007, Brisbane, Australia, 20-24 August, 2007, IOS Press, 2007, pp. 1260–64. [http://mor.nlm.nih.gov:8000/pubs/pdf/2007-medinfo-ss.pdf pdf]<br/>
==Research Internships==
 
* Microsoft Research, Redmond (Summer 2008) - eScience Team
 
* Lister Hill National Center for Biomedical Communications (NIH) (Summer 2007, 2006) - Medical Ontology Research<br/>
 
 
==CV==
 
==CV==
 
[http://knoesis.wright.edu/students/satya/resume/CV.pdf pdf]
 
[http://knoesis.wright.edu/students/satya/resume/CV.pdf pdf]

Revision as of 18:49, 25 September 2009

Satya S. Sahoo
Ph.D. candidate, CSE
Kno.e.sis Center, Wright State University

Updates

  • Co-organizing a full-day Workshop on "Role of Semantic Web in Provenance Management" SWPM'09, October 25 at ISWC 2009!
  • Nominated as a member of the W3C Provenance Incubator Group. The focus of the XG is "in the area of provenance for Semantic Web technologies, development, and possible standardization".

Research Interests

I am interested in scientific metadata and data management, involving knowledge representation (ontology), data integration, and query performance optimization.
My special interests are in provenance metadata management for scientific applications, and I have developed an algebra for provenance management. I have also implemented a provenance query engine and defined a new class of materialized views for query performance optimization.

Current Projects

Provenance Management Framework

Provenance is critical metadata to interpret scientific results, validate experimental processes, and associate trust values.
We have defined an end-to-end framework, underpinned by a novel provenance algebra, addressing three important aspects of provenance management:

  • Provenance Representation: An upper-level ontology for representation of provenance information called provenir.
  • Provenance Analysis: A set of provenance query operators has been defined based on a systematic classification of provenance queries. This classification scheme is the first of its kind proposed in provenance research.
  • Provenance Management Infrastructure: A provenance query engine has been implemented to support the provenance query operators over an RDF data store. The query engine uses a new class of materialized views, called Materialized Provenance View (MPV), to optimize the performance of complex queries over large datasets.

Semantics and Services enabled Problem Solving Environment for T.cruzi

Aim: develop and deploy a novel ontology-driven problem-solving environment for T.cruzi.

Collaborators: Tarleton Research Group, University of Georgia and The Wellcome Trust Sanger Institute, Cambridge, UK

  • Parasite Knowledge Repository: An integrated repository, in RDF format, of gene knockout, strain, proteomics, pathway, and microarray data for query answering in parasite research
  • Trykipedia: Exploring the use of a wiki-based platform for collaborative ontology development (Trykipedia)

Research Internships

  • Microsoft Research, Redmond (Summer 2008) - eScience Team
  • Lister Hill National Center for Biomedical Communications (NIH) (Summer 2007, 2006) - Medical Ontology Research

Select Publications

Provenance Management Framework

  • S.S. Sahoo, D.B. Weatherly, R. Mutharaju, P. Anantharam, A. Sheth, R.L. Tarleton, “Ontology-driven Provenance Management in eScience: an Application in Parasite Research”, The 8th International Conference on Ontologies, DataBases, and Applications of Semantics, (ODBASE 2009), Vilamoura, Algarve-Portugal, Nov 02 - 04, 2009 (to appear) pdf
  • S.S. Sahoo, A. Sheth, C. Henson, “Semantic Provenance for eScience: ‘Meaningful’ Metadata to Manage the Deluge of Scientific Data,” IEEE Internet Computing, Web-Scale Workflow Track, M.B. Blake and M. Huhns (Eds.), 12(4), pp.46-54, July-Aug. 2008 pdf
  • S.S. Sahoo, C. Thomas, A. Sheth, W.S. York and S. Tartir, “Knowledge Modeling and Its Application in Life Sciences: A Tale of Two Ontologies.” 15th International WWW2006 Conference, pp. 317-326, Scotland, May 23–26, 2006. (Acceptance Rate: 11%) pdf

Scientific Data Integration

  • S.S. Sahoo, O. Bodenreider, J.L. Rutter, K.J. Skinner, A.P. Sheth, “An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence,” Journal of Biomedical Informatics (Special Issue: Semantic Mashup of Biomedical Data), 41(5), 752-765, Oct. 2008 pdf
  • S.S. Sahoo, K. Zeng, O. Bodenreider, A.P. Sheth, “From ‘glycosyltransferase’ to ‘congenital muscular dystrophy’: Integrating knowledge from NCBI Entrez Gene and the Gene Ontology,” Medinfo 2007, Brisbane, Australia, 20-24 August, 2007, IOS Press, 2007, pp. 1260–64. pdf

CV

pdf