NIH | National Cancer Institute | NCI Wiki  

Error rendering macro 'rw-search'

null

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Summary

Description of the profile

Search, using different criteria.

The wealth of data must be accessible, resulting in the need for exploration of available datasets. This includes the ability to view seamlessly across independent data sets, allowing a secondary user to integrate data from multiple sources.

The semantic infrastructure will provide metadata for discovery of these datasets.

Provide an application developer with the ability to define application-specific attributes (for example, defined using ISO 21090 healthcare datatypes) and an information model that defines the relationships between these attributes and other attributes in the broader ecosystem. In particular, the last requirement suggests linked datasets, where application developers can connect data in disparate repositories as if the repositories are part of a larger federated data ecosystem.
An artifact is a managed resource within the Semantic Infrastructure.

An artifact is associated with the following capabilities:

  • descriptions to enable the artifact to be visible, where the description includes a unique identifier for the artifact and a sufficient, and preferably a machine processible, representation of the meaning of terms used to describe the artifact, its functions, and its effects;
  • one or more discovery mechanisms that enable searching for artifacts that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual artifact descriptions, possibly through some repository mechanism;
  • accessible storage of artifacts and artifact descriptions, so service participants can access, examine, and use the artifacts as defined.
    Discovery is the set of capabilities which enable searching for artifacts that best meet the search criteria specified by the service participant.

Aspects of discovery include:

  • Search for services, policies, and other artifact descriptions accessible via some repository mechanism
  • Search for operational characteristics of artifacts, which are metrics defined in artifact descriptions. The information is accessible via infrastructure monitoring capabilties or directly from services.
  • Tracking and notification mechanisms related to artifact usage, service availability, operational conformance

Capabilities associated with discovery are delineated in the functional profiles inherited, and specialized, by the discovery profile, namely:

  • Artifact
  • Metrics
  • Interoperability
    Descriptions provide up-to-date information on what a resource is, the conditions for interacting with the resource, and the results of such interactions. As such, the description is the source of vital information in establishing willingness to interact with a resource, reachability to make interaction possible, and compliance with relevant conditions of use.

Architectural implications of interoperability on the Semantic Infrastructure are reflected in the following capabilities:

  • one or more discovery mechanisms that enable searching for described resources that best meet the criteria specified by a service participant, where the discovery mechanism will have access to individual descriptions, possibly through some repository mechanism;
  • tools to appropriately track users of the descriptions and notify them when a new version of the description is available.
    Artifact Descriptions include references to metrics which describe the operational characteristics of the subjects being described

Architectural implications of metrics on the Semantic Infrastructure are reflected in the following capabilities:

  • access to platform infrastructure monitoring and reporting capabilities
  • access to metrics information generated or accessible by related services
  • mechanisms to catalog and enable discovery of which metrics are available for a described artifact and information on how these metrics can be accessed;
  • mechanisms to catalog and enable discovery of compliance records associated with policies, contracts, and constraints that are based on these metrics.
Capabilities

Requirements traceability

Requirement

Source

Capability

The major goal is to be able to query across data sets, get to the instance data, aggregate it and perform a wide range of operations on this data.  By multiple data sets CDISC means different clinical trials, different EHR systems, and obviously queries across clinical trials and EHR systems. The KR should be able to identify those data sets that are constructed using its information models and model elements.  This includes any alignments between a local information models and (for example) a specific CDISC standard.  This alignment would be an information model in its own right, part of the KR, and the basis for subsequent Searches (i.e., transforms are based on information models).  The KR would point to local data sets that were instances of its approved information models.  It would also store any official transforms (alignments) between  these local data sets; these transforms each have an information model and point to the local date sets that these transforms were applicable to.

Gap Analysis::CDISC::CDISC-18 -  Use CDISC standards to query and aggregate data across organizations, data sets, time, and geographies

queryDataSetsTimeAndGeography,

The wealth of data must be accessible, resulting in the need for exploration of available datasets. This includes the ability to view seamlessly across independent data sets, allowing a secondary user to integrate data from multiple sources. In addition, the query capability must support sophisticated queries such as temporal queries and spatial queries. The semantic infrastructure will provide metadata for discovery of these datasets. Comples temporal and spatial queries will be informed by the metadata but will be formulated and executed by the platform. Link to use case satisfied from caGRID 2.0 Roadmap: The oncologist must be able to quickly find glioblastoma data sets, indicating the fields that he is interested in comparing from his clinical data in order to find similar disease conditions and associated treatment plans. Temporal queries allow clinicians to identify changes in patient condition and treatment over time.

Semantic Infrastructure Requirements::caGRID 2.0 Platform and Terminology Integration::Data Exploration and Query

queryDataSetsTimeAndGeography, queryDataSetsTimeAndGeography,

Data management includes linking of disparate data sets and updates of data across the ecosystem. Data updates may include updates to multiple data sources, necessitating the need for transactions. Linkages between the different disparate data sets will be managed by the semantic infrastructure. Data updates that trigger transactions are captured by the platform and are propagated upstream to the semantic infrastructure. An example would be the platform monitoring events to identify changes to data. Link to use case satisfied from caGRID 2.0 Roadmap: the patient has an electronic medical record that spans multiple institutions. The clinical workup data (for example, genomics and proteomics data) is linked to the clinical care record; similarly pathology and radiology findings must be attached to the patient's electronic medical record.

Semantic Infrastructure Requirements::caGRID 2.0 Platform and Terminology Integration::Data Management

queryDataSetsTimeAndGeography,

This set of requirements includes providing an application developer with the ability to define application-specific attributes (for example, defined using ISO 21090 healthcare datatypes) and an information model that defines the relationships between these attributes and other attributes in the broader ecosystem. In particular, the last requirement suggests linked datasets, where application developers can connect data in disparate repositories as if the repositories are part of a larger federated data ecosystem. Additional requirements include the ability to publish and discover information models. Support is needed for forms data and common clinical document standards, such as HL7 CDA. To support the use of binary data throughout the system, the binary data must be typed and semantically annotated. All Information models, their representation and binding to data-types and terminologies will be managed by the semantic infrastructure. The ability to publish and discover information models will be supported by the semantic infrastructure, and the platform will leverage these capabilities. Link to use case satisfied from caGRID 2.0 Roadmap: The pathology, radiology and other data have various data formats which must be described, and the information model for the patient record must link between these various datatypes. The complete information model includes semantic links between datasets to build a comprehensive electronic medical record. Annotations on data are defined and included in the information model.

Semantic Infrastructure Requirements::caGRID 2.0 Platform and Terminology Integration::Data Representation and Information Models

linkedDataSetManagement,

 

Semantic Profile::OASIS SOA::Governance Model

monitor from inherited abstract profile Metrics, metrics from inherited abstract profile Metrics, discovery from inherited abstract profile Artifact, identity from inherited abstract profile Artifact, metadata from inherited abstract profile Artifact, store from inherited abstract profile Artifact,

 

Semantic Profile::OASIS SOA::Service Description Model

discovery from inherited abstract profile Artifact, identity from inherited abstract profile Artifact, metadata from inherited abstract profile Artifact, store from inherited abstract profile Artifact, monitor from inherited abstract profile Metrics, metrics from inherited abstract profile Metrics, metricsDiscovery from inherited abstract profile Metrics, complianceDiscovery from inherited abstract profile Metrics, interoperabilityDiscovery from inherited abstract profile Interoperability, serviceChangeNotification from inherited abstract profile Interoperability,

complianceDiscovery

Description

Mechanisms to catalog and enable discovery of compliance records associated with policies, contracts, and constraints that are based on these metrics.

Requirements addressed
Overview of possible operations

discovery

Description

One or more discovery mechanisms that enable searching for artifacts that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual artifact descriptions, possibly through some repository mechanism.

Requirements addressed
Overview of possible operations

identity

Description

Descriptions which include a unique identifier for the artifact.

Requirements addressed
Overview of possible operations

interoperabilityDiscovery

Description

One or more discovery mechanisms that enable searching for described resources that best meet the criteria specified by a service participant, where the discovery mechanism will have access to individual descriptions, possibly through some repository mechanism.

Requirements addressed
Overview of possible operations

linkedDataSetManagement

Description

Manage linkages between data in disparate repositories.

Requirements addressed
Overview of possible operations

metadata

Description

A representation of the meaning of terms used to describe the artifact, its functions, and its effects.

Requirements addressed
Overview of possible operations

metrics

Description

Access to metrics information generated or accessible by related services

Requirements addressed
Overview of possible operations

metricsDiscovery

Description

Mechanisms to catalog and enable discovery of which metrics are available for a described artifact and information on how these metrics can be accessed.

Requirements addressed
Overview of possible operations

monitor

Description

Access to platform infrastructure monitoring and reporting capabilities.

Requirements addressed
Overview of possible operations

provenance

Description

While the Resource identity provides the means to know which subject and subject description are being considered, Provenance as related to the Description class provides information that reflects on the quality or usability of the subject. Provenance specifically identifies the entity (human, defined role, organization, ...) that assumes responsibility for the resource being described and tracks historic information that establishes a context for understanding what the resource provides and how it has changed over time. Responsibilities may be directly assumed by the Stakeholder who owns a Resource or the Owner may designate Responsible Parties for the various aspects of maintaining the resource and provisioning it for use by others. There may be more than one entity identified under Responsible Parties; for example, one entity may be responsible for code maintenance while another is responsible for provisioning of the executable code. The historical aspects may also have multiple entries, such as when and how data was collected and when and how it was subsequently processed, and as with other elements of description, may provide links to other assets maintained by the Resource owner.

Requirements addressed
Overview of possible operations

queryDataSetsTimeAndGeography

Description

Query and aggregate data across organizations, data sets, time, and geographies.

Data management includes linking of disparate data sets and updates of data across the ecosystem. Data updates may include updates to multiple data sources, necessitating the need for transactions.

Linkages between the different disparate data sets will be managed by the semantic infrastructure. Data updates that trigger transactions are captured by the platform and are propagated upstream to the semantic infrastructure. An example would be the platform monitoring events to identify changes to data.

Link to use case satisfied from caGRID 2.0 Roadmap: the patient has an electronic medical record that spans multiple institutions. The clinical workup data (for example, genomics and proteomics data) is linked to the clinical care record; similarly pathology and radiology findings must be attached to the patient's electronic medical record.

Requirements addressed
Overview of possible operations

serviceChangeNotification

Description

Tools to appropriately track users of the descriptions and notify them when a new version of the description is available.

Requirements addressed
Overview of possible operations

store

Description

Accessible storage of artifacts and artifact descriptions, so service participants can access, examine, and use the artifacts as defined.

Requirements addressed
Overview of possible operations
  • No labels