CyberInfrastructure Proposal For EarthCube Community
This proposal proposes a cyberInfrastructure for long tails scientists to share and discover their data. Proposed system mainly supports three scenarios which can be identified in separate work flows.
Contents
Publishing Legacy(change) Data
Publishing Data in Digital Format
Publishing Data in Linked Data
Architecture
- Data Registry
Data publishers will register their data through the data registry and important provenance information such as author, location and etc will be collected.
- Annotator
Registered data will be annotated using standard vocabularies stored such as (GCMD and AGI index) using the annotation tools. These tools will suggest possible matches for the user and user will have the ability to further refine the suggestions given by the system. Annotations will be stored in the Meta Data Store.
- Indexer
Collected data and its associated data will be indexed using the indexer to facilitate Searching.
- Simple Search
Simple Search facilitates key word based queries.
- Faceted Search
In addition to the Simple Search functionality system will provide the Faceted Search where user can provide the key value pairs to search/discover data
- Mapping to RDF
As defined in the more advanced work flow given data can be transformed to RDF using existing tools and this allows data publishers to publish the data in a standard form
- Data Publisher
- Semantic Browsing
Form of Data
Table
Image
Unstructured Data
Links
Annotator - Kino http://wiki.knoesis.org/index.php/Kino
Semantic Browsing - iExplore http://knoesis.wright.edu/iExplore/