NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Data discovery enables secondary users to find the types of data available in the ecosystem as well as summary-level information about available data sets.

Data Discovery has architectural implications on the Semantic Infrastructure:

Models describe the structural, behavioral, and semantic aspects of data. This requires the following capabilities:

  • enable the models to be visible, with unique identifier for the model and each of its elements as well as access to a meta-model representation of the meaning of terms used to describe the model, its functions, and its effects;
  • one or more discovery mechanisms that enable searching for models and model fragments that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual model descriptions, possibly through some repository mechanism;
  • accessible storage of models, so service participants can access, examine, and use the models as defined.

Descriptions include references to metrics which describe the operational constraints of the data being described. This requires the following capabilities:

  • the infrastructure monitoring and reporting information on service data;
  • possible interface requirements to make accessible metrics information generated or most easily accessed by the related services;
  • mechanisms to catalog and enable discovery of which metrics are available for a modeled data element and information on how these metrics can be accessed;
  • mechanisms to catalog and enable discovery of compliance records associated with data elements that are based on these metrics.

Models provide up-to-date information on what a data element is, the conditions for interacting with the data element, and the results of such interactions. As such, the model is the source of vital information in establishing compliance with relevant conditions of use. This requires the existence of:

  • one or more discovery mechanisms that enable searching for data elements that best meet the criteria specified by a service participant, where the discovery mechanism will have access to individual modeled descriptions, possibly through some repository mechanism;
  • tools to appropriately track users of the modeled descriptions and notify them when a new version of the modeled description is available.

This Functional Profile includes, but is not limited to, the following capability elaborations:

Derived From Requirements

  • Semantic Infrastructure Requirements::caGRID 2.0 Platform and Terminology Integration::Service Discovery and Utilization This group of requirements focuses on enabling developers of composite services and applications to discover, compose, and invoke services. This includes the discovery of published services based on service metadata and the generation of client APIs in multiple languages to provide cross-platform access to existing services. The platform will use the semantic infrastructure service metadata to address all the service discovery requirements. The semantic infrastructure relies on metadata about services and artifacts. Link to use case satisfied from caGRID 2.0 Roadmap: As institutions share de-identified glioblastoma data sets, they are available to others via data discovery. The treatment recommendation service used by the oncologist is able to discover these new data sets and their corresponding information models, and include that data for subsequent use in recommendation of treatment. Link to use case satisfied from caGRID 2.0 Roadmap: all of the data management and access services in the use case are utilized by application developers to build the user interfaces that the clinicians use during the course of patient care.

Anchor
_16_5_1_24a0131_1283167155540_936130_3058
_16_5_1_24a0131_1283167155540_936130_3058

...

dataDiscovery

Data Discovery

Data discovery capabilities include:

  • visibility of models and all model elements
  • unique identification for the models and each of its elements
  • access to a meta-model representation of the meaning of terms used to describe the model, its functions, and its effects;
  • one or more discovery mechanisms that enable searching for models and model fragments that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual model descriptions, possibly through some repository mechanism;
  • accessible storage of models, so service participants can access, examine, and use the models as defined.
  • access to the infrastructure monitoring and reporting information on service data;
  • access to metrics information generated or available through related services;
  • mechanisms to catalog and enable discovery of which metrics are available for a modeled data element and information on how these metrics can be accessed;
  • mechanisms to catalog and enable discovery of compliance records associated with data elements that are based on these metrics.
  • tools to appropriately track users of the modeled descriptions and notify them when a new version of the modeled description is available.

...

Scrollbar