NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The Semantic Infrastructure has to support seamless integration with the caGRID 2.0 platform. The following are some high-level platform and terminology requirements that are either supported or addressed by the semantic infrastructure:Semantic Infrastructure.

Service Generation

Service generation is the ability to generate services from user defined service metadata, the . The semantic infrastructure provides this metadata and the platform leverages this metadata for service generation. The constraints and policies specified in the semantic infrastructure are inherited by the platform and are enforced at as runtime policies.

Additional platform specific and runtime information is provided by the developer at the time of service generation.

Service Discovery

...

and Utilization

This group of requirements focuses on enabling developers of composite services and applications to discover, compose, and invoke services. This includes the discovery of published services based on service metadata and the generation of client APIs in multiple languages to provide cross-platform access to existing services.

...

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: As institutions share de-identified glioblastoma data sets, they are available to others via data discovery. The treatment recommendation service used by the oncologist is able to discover these new data sets and their corresponding information models, and include that data for subsequent use in recommendation of treatment.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: all of the data management and access services in the use case are utilized by application developers to build the user interfaces that the clinicians use during the course of patient care.

...

Service orchestration and choreography allows both application developers and non-developers to discover service "building blocks" that can be composed dynamically to provide business capabilities. Special cases include the orchestration of multiple services for a distributed query, or for a transactional workflow. Service orchestration and choreography will leverage static and behavioral semantics from the Semantic Infrastructure v22.0.

The semantic infrastructure Semantic Infrastructure provides the behavioral semantics required for dynamic composibility of services or generation of distributed queries. This includes runtime contract discovery /and negotiation to determine composibility of services based on service capabilities and constraints.

...

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: Federated query over the TCGA data and other data sets is performed using a service orchestration.

...

Policy and Rules Management allow non-developer secondary users to create policies and rules and apply them to services. The scope of policies includes, but is not limited to, definition and configuration of business processing policy and related rules, compliance policies, quality of service policies, and security policies. Some key functional requirements to manage for managing policies include capabilities to author policies and store policies, and for approval, validation, and run-time execution of policiesand to approve and validate policies and execute policies at runtime.

The semantic infrastructure Semantic Infrastructure will provide a mechanism to specify policies, including business processing policies and related rules, compliance policies, and quality of service policies. Tools and services for creating security specific policies will be provided by the caGRID 2.0 platform and will be used by the semantic infrastructure. All other policies specified in the semantic infrastructure Semantic Infrastructure will be enforced by the platform at runtime.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: Each institution has different data sharing needs, access control needs, and business rules for processing that are defined and customized. For example, policy at the pathologist's institution may state that the patient is scheduled for a visit when the review is complete.

...

The semantic infrastructure will provide a placeholder to specify events and triggering conditions for data and services, the . The platform monitors these events at runtime and acts on these events.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: As patient care proceeds, the system notifies the designated clinicians that data (for example, images) are ready for review. Similarly, when notifications are received, event processing logic allows the appropriate parties to assign clinicians for care. In order to facilitate better treatment (a learning healthcare system), as new de-identified glioblastoma data is made available, notifications are sent that could indicate a recommended change in the treatment plan.

...

This set of requirements includes providing an application developer with the ability to define application-specific attributes (for example., defined using ISO 21090 healthcare datatypes) and an information model that defines the relationships between these attributes and other attributes in the broader ecosystem. In particular, the last requirement suggests linked datasets, where application developers can connect data in disparate repositories as if the repositories are part of a larger federated data ecosystem. Additional requirements include the ability to publish and discover information models. Support is needed for forms data and common clinical document standards, such as HL7 CDA. To support the use of binary data throughout the system, the binary data must be typed and semantically annotated.

All Information models, their representation and binding to data-types /and terminologies will be managed by the semantic infrastructure. The ability to publish and discover information models will be supported by the semantic infrastructure, and the platform will leverage these capabilities.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: The pathology, radiology and other data have various data formats which must be described, and the information model for the patient record must link between these various datatypes. The complete information model includes semantic links between datasets to build a comprehensive electronic medical record. Annotations on data are defined and included in the information model.

...

Linkages between the different disparate data sets will be managed by the semantic infrastructure. Data updates that trigger transactions are captured by the platform and are propagated upstream to the semantic infrastructure. An example would be the platform monitoring events to identify changes to data,.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: the patient has an electronic medical record that spans multiple institutions. The clinical workup data (for example, genomics and proteomics data) is linked to the clinical care record; similarly pathology and radiology findings must be attached to the patient's electronic medical record.

...

The semantic infrastructure will provide metadata for discovery of these datasets, complex . Comples temporal and spatial queries will be informed by the metadata but will be formulated and executed by the platform.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: The oncologist must be able to quickly find glioblastoma data sets, indicating the fields that he is interested in comparing from his clinical data in order to find similar disease conditions and associated treatment plans. Temporal queries allow clinicians to identify changes in patient condition and treatment over time.

...

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: The origin of data is tied to the data creator, allowing the oncologist performing the match against TCGA data and other datasets to include and exclude data sets based on their origin.

...

In a diverse information environment, semantics must be used to clearly indicate the meaning of data. This requirement is expected to be addressed by the Semantics Semantic Infrastructure, although there will be a touchpoint between the caGrid 2.0 and the semantics infrastructure Semantic Infrastructure to annotate data with semantics. Integration with the semantics infrastructure Semantic Infrastructure will enable reasoning, semantic query, data mediation (for example, ad hoc data transformation) and other powerful capabilities.

Data Semantic semantics are captured in the semantic infrastructure Semantic Infrastructure and the platform will leverage the semantic infrastructure Semantic Infrastructure interfaces for reasoning and analysis.

Link to use case satisfied from caGRID 2.0 roadmapRoadmap: The oncologist accesses the TCGA database to search for de-identified glioblastoma tumor data that is similar to the patient data exported from the hospital medical record. During this search, the semantics of the data fields are leveraged to indicate matches between TCGA data fields and the hospital medical record data fields.

...

There are numerous data repositories on the web today. These data repositories contain essential information that must be accessible to services in the ecosystem. As a result, caGrid 2.0 must provide capabilities to integrate these external repositories into the Grid grid with the assumption that the remote service cannot be changed.

...