NIH | National Cancer Institute | NCI Wiki  

Contents of this Page
Summary
Description of the profile

Load new artifacts, in a broad variety of formats (including binary data, tabular data, models, machine interpretable data formats, serializations, XML documents, XMI documents) through multiple interfaces (including web services, REST-ful interfaces, open-source/off the shelf APIs).

Provide an application developer with the ability to define application-specific attributes (for example, defined using ISO 21090 healthcare datatypes) and an information model that defines the relationships between these attributes and other attributes in the broader ecosystem.

To support the use of binary data throughout the system, the binary data must be typed and semantically annotated.

All Information models, their representation and binding to data-types and terminologies will be managed by the semantic infrastructure.

KR information models, and discrete model elements, should be available in a number of broadly used machine interpretable data formats, or serializations. The stakeholders most interested in this publishing capability were caDSR, HL7 CIC, and CDISC. In particular, CDISC wants to support the export and import of schema for tabular data. This implies the capability to perform on-demand transformations between models and formats for the representation of model elements. NB: In dicussions with CDISC users, they indicated that this export feature should enable re-use of metadata and, in some sense, federation of repositories.

NB: THIS IS NOT AN EXHAUSTIVE ENUMERATION OF EXPORT FORMATS

The KR should be able to import information models, and their model elements, from a range of formats. Users would like it to be as easy as possible to register models into the KR. Once in the KR the data elements from these models would be immediately available for use. Optionally, they may then by refined and passed through the appropriate governance process to become curated models.

NB: THIS IS NOT AN EXHAUSTIVE ENUMERATION OF IMPORT FORMATS

Images and other representations of static content

Load specializes capabilities architecturally implied by its associated concepts of Artifact . The implied architectural capabilities are described in the following paragraphs.

Artifact An artifact is a managed resource within the Semantic Infrastructure.

An artifact is associated with the following capabilities:

  • descriptions to enable the artifact to be visible, where the description includes a unique identifier for the artifact and a sufficient, and preferably a machine processible, representation of the meaning of terms used to describe the artifact, its functions, and its effects;
  • one or more discovery mechanisms that enable searching for artifacts that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual artifact descriptions, possibly through some repository mechanism;
  • accessible storage of artifacts and artifact descriptions, so service participants can access, examine, and use the artifacts as defined.
Capabilities
Requirements traceability

Requirement

Source

Capability

Import local value domains into the modeling tool

Gap Analysis::Import::022 - Import Localized Value Domains

  • localizeValueDomains

  • Support providing data elements in multiple formats and standards schema

    Gap Analysis::Export::052 - Multiple Formats

  • dataElementDefinitionFormats

  • Express ValueDomains as either XSD, RDF or OWL

    Gap Analysis::Export::076 - Value Domain Expressions

  • dataElementDefinitionFormats

  • Provide alternative consistent data / information model representations to best suit stakeholders' implementation needs

    Gap Analysis::Export::160 - Multiple Information Model Representations

  • dataElementDefinitionFormats

  • The Clinical Trials industry makes extensive use of tabular formats.  They do a lot of work in Excel and in other 2-dimensional tabular formats in general (e.g., SAS, Oracle).  This format is also used to analyze clinical trial data, it is generally how clinical trials capture, store, and present data.  Thus the tabular formats have to be supported.  However, the CDISC intent is to support other formats and perhaps move away from the tabular formats. *Source:  * * 5/20/2010 Interview, David Iberson-Hurst

    Gap Analysis::CDISC::CDISC-12 -  Support tabular formats and emerging formats

  • tabularFormats

  • CDISC metadata, which is the focus of CDISC standards, should be computer-tractable.  This means that CDISC standards should support computer semantic interoperability (CSI) and be available through services.  Per Model Driven Architecture (MDA), metadata are context-defining data about data.  This includes the standard data element names, data types, and lengths are well as provenance models, models of executable business processes, data about interfaces/APIs, configuration information, data transformation rules, etc. *Source:  * * 5/20/2010 Interview, David Iberson-Hurst

    Gap Analysis::CDISC::CDISC-14 -  Provide CDISC metadata in machine-interpretable formats

  • dataElementDefinitionFormats

  • There is a need for the appropriate users to be able to review and retrieve (import) through various medium applicable data elements for file formats such as  xls spreadsheets,  xml, csv. text, pdf.

    Gap Analysis::HL7 CIC::CIC-5 -  Provide data element definitions in multiple formats

  • dataElementDefinitionFormats

  • Content includes all unstructured text and other forms of content that make up a service specification. Examples include storyboards, and scope. Content is an integral part of service specification, and content is leveraged across the enterprise for documentation and communicaitons.

    Semantic Infrastructure Requirements::Artifact Management::Content

  • imageManagement

  • This set of requirements includes providing an application developer with the ability to define application-specific attributes (for example, defined using ISO 21090 healthcare datatypes) and an information model that defines the relationships between these attributes and other attributes in the broader ecosystem. In particular, the last requirement suggests linked datasets, where application developers can connect data in disparate repositories as if the repositories are part of a larger federated data ecosystem. Additional requirements include the ability to publish and discover information models. Support is needed for forms data and common clinical document standards, such as HL7 CDA. To support the use of binary data throughout the system, the binary data must be typed and semantically annotated. All Information models, their representation and binding to data-types and terminologies will be managed by the semantic infrastructure. The ability to publish and discover information models will be supported by the semantic infrastructure, and the platform will leverage these capabilities. Link to use case satisfied from caGRID 2.0 Roadmap: The pathology, radiology and other data have various data formats which must be described, and the information model for the patient record must link between these various datatypes. The complete information model includes semantic links between datasets to build a comprehensive electronic medical record. Annotations on data are defined and included in the information model.

    Semantic Infrastructure Requirements::caGRID 2.0 Platform and Terminology Integration::Data Representation and Information Models

  • dataElementDefinitionFormats

  • localizeValueDomains

  • tabularFormats

  • Service Oriented Architecture is an architectural paradigm for organizing and utilizing distributed capabilities that may be under the control of different ownership domains. Consequently, it is important that organizations that plan to engage in service interactions adopt governance policies and procedures sufficient to ensure that there is standardization across both internal and external organizational boundaries to promote the effective creation and use of SOA-based services. SOA governance requires numerous architectural capabilities on the Semantic Infrastructure: Governance is expressed through policies and assumes multiple use of focused policy modules that can be employed across many common circumstances This is elaborated in the inherited Policy profile. Governance requires that the participants understand the intent of governance, the structures created to define and implement governance, and the processes to be followed to make governance operational. This is provided by capabilities specialized from the inherited Management Profile. Governance policies are made operational through rules and regulations. This is provided by the following capabilities, most of which are specializations of the inherited Artifact Profile: * descriptions to enable the rules and regulations to be visible, where the description includes a unique identifier and a sufficient, and preferably a machine process-able, representation of the meaning of terms used to describe the rules and regulations; * one or more discovery mechanisms that enable searching for rules and regulations that may apply to situations corresponding to the search criteria specified by the service participant; where the discovery mechanism will have access to the individual descriptions of rules and regulations, possibly through some repository mechanism; * accessible storage of rules and regulations and their respective descriptions, so service participants can understand and prepare for compliance, as defined. * SOA services to access automated implementations of the Governance Processes. Governance implies management to define and enforce rules and regulations.. This is elaborated in the inherited Management profile. Governance relies on metrics to define and measure compliance. This is elaborated in the inherited Metric profile.

    Semantic Profile::OASIS SOA::Governance Model

  • discovery from inherited abstract profile Artifact
  • identity from inherited abstract profile Artifact
  • metadata from inherited abstract profile Artifact
  • store from inherited abstract profile Artifact
  • A service description is an artifact, usually document-based, that defines or references the information needed to use, deploy, manage and otherwise control a service. This includes not only the information and behavior models associated with a service to define the service interface but also includes information needed to decide whether the service is appropriate for the current needs of the service consumer. Thus, the service description will also include information such as service reachability, service functionality, and the policies and contracts associated with a service. A service description artifact may be a single document or it may be an interlinked set of documents. Architectural implications of service description on the Semantic Infrastructure are reflected in the following functional decomposition: * Description will change over time and its contents will reflect changing needs and context. This is elaborated in the inherited Change profile. * Description makes use of defined semantics, where the semantics may be used for categorization or providing other property and value information for description classes. This is elaborated in the inherited Semantic Model profile. * Descriptions include reference to policies defining conditions of use and optionally contracts representing agreement on policies and other conditions. This is elaborated in the inherited Policy profile. * Descriptions include references to metrics which describe the operational characteristics of the subjects being described. This is elaborated in the inherited Metrics profile. * Descriptions of the interactions are important for enabling auditability and repeatability, thereby establishing a context for results and support for understanding observed change in performance or results. This is elaborated in the inherited Interaction profile. * Descriptions may capture very focused information subsets or can be an aggregate of numerous component descriptions. Service description is an example of a likely aggregate for which manual maintenance of all aspects would not be feasible. This is elaborated in the inherited Composition profile. * Descriptions provide up-to-date information on what a resource is, the conditions for interacting with the resource, and the results of such interactions. As such, the description is the source of vital information in establishing willingness to interact with a resource, reachability to make interaction possible, and compliance with relevant conditions of use. This is elaborated in the inherited Interoperability profile. Policy capabilities are specialization of Artifact capabilities.

    Semantic Profile::OASIS SOA::Service Description Model

  • discovery from inherited abstract profile Artifact
  • identity from inherited abstract profile Artifact
  • metadata from inherited abstract profile Artifact
  • store from inherited abstract profile Artifact
  • One of the key requirements for participants interacting with each other in the context of a SOA is achieving visibility: before services can interoperate, the participants have to be visible to each other using whatever means are appropriate. The Reference Model analyzes visibility in terms of awareness, willingness, and reachability. Visibility in a SOA ecosystem has the following architectural implications on mechanisms providing support for awareness, willingness, and reachability: Mechanisms providing support for awareness will likely have the following minimum capabilities: * creation of Description, preferably conforming to a standard Description format and structure; * publishing of Description directly to a consumer or through a third party mediator; * discovery of Description, preferably conforming to a standard for Description discovery; * notification of Description updates or notification of the addition of new and relevant Descriptions; * classification of Description elements according to standardized classification schemes. In a SOA ecosystem with complex social structures, awareness may be provided for specific communities of interest. The architectural mechanisms for providing awareness to communities of interest will require support for: * policies that allow dynamic formation of communities of interest; * trust that awareness can be provided for and only for specific communities of interest, the bases of which is typically built on keying and encryption technology. The architectural mechanisms for determining willingness to interact will require support for: * verification of identity and credentials of the provider and/or consumer; * access to and understanding of description; * inspection of functionality and capabilities; * inspection of policies and/or contracts. The architectural mechanisms for establishing reachability will require support for: * the location or address of an endpoint; * verification and use of a service interface by means of a communication protocol; * determination of presence with an endpoint which may only be determined at the point interaction but may be further aided by the use of a presence protocol for which the endpoints actively participate.

    Semantic Profile::OASIS SOA::Service Visibility Model

  • discovery from inherited abstract profile Artifact
  • dataElementDefinitionFormats
    Description

    Provide data element definitions in multiple formats

    Support providing data elements in multiple formats and standards schema.

    Provide metadata in machineinterpretable formats

    Provide alternative consistent data / information model representations to best suit stakeholders' implementation needs

    Express ValueDomains as either XSD, RDF or OWL

    Requirements addressed
    Overview of possible operations
    discovery
    Description

    One or more discovery mechanisms that enable searching for artifacts that best meet the search criteria specified by the service participant; where the discovery mechanism will have access to the individual artifact descriptions, possibly through some repository mechanism.

    Requirements addressed
    Overview of possible operations
    identity
    Description

    Descriptions which include a unique identifier for the artifact.

    Requirements addressed
    Overview of possible operations
    imageManagement
    Description

    Management of image and other non-text, unstructured resources

    Requirements addressed
    Overview of possible operations
    localizeValueDomains
    Description

    Import local value domains into the modeling tool

    Requirements addressed
    Overview of possible operations
    metadata
    Description

    A representation of the meaning of terms used to describe the artifact, its functions, and its effects.

    Requirements addressed
    Overview of possible operations
    provenance
    Description

    While the Resource identity provides the means to know which subject and subject description are being considered, Provenance as related to the Description class provides information that reflects on the quality or usability of the subject. Provenance specifically identifies the entity (human, defined role, organization, ...) that assumes responsibility for the resource being described and tracks historic information that establishes a context for understanding what the resource provides and how it has changed over time. Responsibilities may be directly assumed by the Stakeholder who owns a Resource or the Owner may designate Responsible Parties for the various aspects of maintaining the resource and provisioning it for use by others. There may be more than one entity identified under Responsible Parties; for example, one entity may be responsible for code maintenance while another is responsible for provisioning of the executable code. The historical aspects may also have multiple entries, such as when and how data was collected and when and how it was subsequently processed, and as with other elements of description, may provide links to other assets maintained by the Resource owner.

    Requirements addressed
    Overview of possible operations
    store
    Description

    Accessible storage of artifacts and artifact descriptions, so service participants can access, examine, and use the artifacts as defined.

    Requirements addressed
    Overview of possible operations
    tabularFormats
    Description

    Support tabular formats and emerging formats.

    Requirements addressed
    Overview of possible operations

    To be provided.

    • No labels