NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Jason Lucas, Scott Bauer, Larry Wright, Cory Endle, Kim Ong, Tracy Safran, Rob Wynn, Gilberto Fragoso, Margaret Haber, Kumar, Sherri De Coronado, John Campbell, Bron, Luba, ShamineSana Din, Craig Stancl

 

Discussion Points:

  • House keeping Items
    • Reviewed agenda and approved - unless there are changes along the way.
    • WebEx will be live all day.
    • Goal is to record key tasks and wikis
  • Goal is to set context for the rest of the meetings this week and to start identifying the issues to be addressed.  
  • Larry would like to start to complete the complete EVS Project Architecture (including LexEVS)
  • General workflow for architecture:
    • Gather terminology content → protege and meme → loaded into LexEVS terminology service → accessed via java api, rest service api, browsers
    • Architecture now needs to include the triple store database and usage (REST API and native REST API).  This service is to support clinical trials (CTRP).  Ability to make changes into the production service was a driving factor in going to TripleStore architecture.  Loading into triplestore can be done nightly if needed.  Currently the loads have been nightly.
    • OBO is currently being looked as as a third delivery channel.  
    • Expected to have expanded use of services and downloads
      • Adverse events were the most downloads  - and then used and built in other systems (CDISC and FDA, etc).
    • Report Writer extracts value sets from LexEVS
      • Current work happening to create a SPARQL based report writer. Planned for early February.
        • This would be only on QA team.
        • External use would require a security layer (doesn't exist today)
        • Gilberto noted that report writer cannot currently take a search and return result set (with preferred names).  Noted this is "simple search" and could also be part of the term browser.
        • Existing templates should be able to be run without authentication.
  • The TripleStore still cannot provide all the terminology data needed for EVS and is stored in LexEVS. 
  • Mappings - need to determine how to capture and allow access to mappings.  The LexEVS model and triple store model do not provide the needed flexibility today.
  • Synchronization of data sources and coordination of distribution of data is an open issue.  Consideration is needed to provide an umbrella API (Federated) that serves both LexEVS and TripleStore content.
  • Gilberto noted that CTS2 services should be revisited and he'd like to review missing functionality and complexity (noted by CTRP).
  • LexEVS historically has been based on standards since the early inception of the tooling.  
    • As a terminology service, all the content has been loaded into Lexgrid data model.
    • Focus from standards has shifted to providing usable services to end users.
    • There are possibilities for enhancing the service today that still provides interoperability.  
    • Thesaurus based use cases should be considered when determining goals.
    • CTRP usage of TripleStore was speed in loading content.  LexEVS loading of transitive table is long process.  Tracy noted that LexEVS could start to use TripleStore technology to remove that bottleneck when loading content.
    • Noted that existing applications may not be ready to transition to new serivces.
    • Primary goal of EVS is to proviede terminolgoy content to NCI customers and users to support the sciences.  
      • Noted that no interest in re-desigining 
    • LexEVS provides a consistent model. 
  • Mappings
    • Currently default to the LexGrid XML format.   The other is RRF loader mappings.
    • No plans to load mappings into SPARQL.
    • ICDO3 Map from Meta - Kim looked at performace - not the best results.
    • Mappings in LexEVS in coding schemes.  
    • FTP will be main distribution for Maps. (Tab delimited to text)
    • Review of CTS2 mapping support will help decide if additional functionality can be added.
    • No support for contextual mapping currently exists.
  • Systems Priorities
    • LexEVS has spent time this last year and utilizing Docker.  This will provide efficient deployments.  
      • Current deployments are completed by using a documented deployment document.
      • Containers will reduce the middle man needed for deployments.  
      • Systems team has been trying to create an "approved" NCI container so the LexEVS team can use.  
      • Will still need to continue running tests.
      • Need to talk with systems about environment.
      • Docker is operational at NCI.
      • Docker usage for data
        • Gilberto suggested promotion of data using docker.
          • Provide the "database" container to the systems team.  
          • This would help as it is deployed up the tiers.  
      • Docker distributions to be used for end users.
        • ASU was working on this type of container.  
        • Current scripts do 90% of what is needed.  Need to change configuration so it doesn't remove the services.  
  • QA
    • Currently there are test scripts created by Kim and Shamime(?)Sana
      • Covers the majority of the usage.
      • Test scripts still being developed.  
      • Tin is still available for reference.
      • Tests should be avaiable in Jenkins. (LexEVS team is currently doing this)
      • Docker shouldn't cause concern for QA.
    • Tech stack support 
      • Confirm that NODEJS and other technologies are supported.
  • Security Scans
    • 508 needs to be addressed.
      • Heroku and SWAGGER pages may need to be reviewed. 
  • Lucene, SOLR and elastic search
    • Jason noted that there might be need for discussion during the API focused discussion on Thursday.

...

  • Action Items
    • caDSR team to provide a list of what is used from the Java API to determine what would need to be exposed in a REST API.  Phillipa could meet with the team Wednesday at 3PM.
    • caDSR to update to Java 8 jar in 2018Q1.
    • Send retirement notice regarding the 6.4 retirement and caCORE REST.

 

TimeLocationTopicsParticipantsResources
2:00 PM - 3:00 PM  5-W-032
User Group Discussion - FDA and CDISC

User Teams to share how they are using EVS and  discuss requirements/priorities for the future.

  • APIs: Java, REST (CTS2 or 3-store), SPARQL, FTP
  • Backwards compatibility of server/client/data releases
  • Incl: Java/jar file issues and future
  • Incl: New terminology server API/content/other needs.

Editors

Liz, Erin, Brenda

 

...

  • Action Items
    • Request for a table of contents on the CDISC Terminology page. (Cancer.gov)
    • Request to update the CDISC term suggestion request form.

...

  • Action Items:
    • Follow up on the use of REST Services (CTS, LexEVS REST)
    • Investigate the issue when finding clinical trials (search) the same stage could be listed several times in the dropdown. Investigate the issue - Drugs and Drug family is problematic when determing what should come to the top of the list.
    • Determine mapping for accrual coding.

...

  • Action Items:
    • Update Architecture to had include value sets and mappings.
    • Update Architecture to include all the ways that reports are created. (content channels - as separate slide)
    • Update Architecture to incude additional sources being loaded into SPARQL.
    • Update Architecture by adding channels from triplestore to LexEVS.
    • Update Architecture to add detail for Browser and assiciated dependencies
    • Update Architecture to change from SPARQL to TripleStore.  

...

Decision Points:

  • Action Items:
    • Further investigation of VS Hierarchy Display needed.
    • Further investigation of what FDA needs (NDF-RT/MED-RT)
    • Small group to meet to discuss ICD-O-3.1

...

  • Action Items:
    • Cory to complete testing of containers using CentOS.
    • Systems team to configure Dev system to be available for Docker and non-docker applications.
    • Investigate usage of MariaDB for JSON and graphing capabilities.
    • Jacob to provide versions of elastic search and SOLR.

 

TimeLocationTopicsParticipantsResources

3:00 PM - 4:00 PM

(added meeting)

3-W-030
User Group Discussion - caDSR 

Continued discussion of current API

  • APIs: Java, REST (CTS2 or 3-store), SPARQL

caDSR - Philippa,

Vikram, Natalia, Rui

EVS CTRP REST API

CTS2 REST API

caDSR LexEVS API calls

AttendeesAttendees

Jason Lucas, Kim Ong, Rob Wynn, Natalia, Tracy Safran, Craig Stancl, Cory Endle, Scott Bauer, Larry Wright, Luba, Sana, Philippa, Vikram, Natalia

...

  • Action Items:
    • Natalia to send spreadsheet of class usage.
    • LexEVS team to send Migration link to caDSR team.
    • Determine gaps between LexEVS REST services and caDSR LexEVS API usage.
    • Provide assistance to caDSR team to parse results coming from CTS2.

 

...

Thursday, December 7th, 2017

TimeLocationTopicsParticipantsResources
 9:00 AM - 12:00 PM

1-E-030

(9:00-1112:00)

(gap 11-12)


API Services

Proposed topics for discussion:

  • Overview of EVS REST API and future direction
    • Discuss what exists today.
    • Discuss future plans.
  • Overview of LexEVS provided API services and future direction
    • LexEVS API
    • LexEVS Remote API
    • LexEVS REST API (CTS2-based)
  • Discuss a new REST service
    • Discuss user requirements for a new REST API
      • Document the gaps from CTS2 REST API and user requirements.
      • Explore how simple searches and searches with an identifier and how this could replace most of the remote API.
    • Consider a new REST API to supplement CTS2 REST Services
      • Propose examination of current CTS2 REST API
      • Capture what doesn't work for the end users.
      • Review gap between Remote API and REST services.
    • Discuss surfacing terminology history in a REST service (NCIt and ULMS)
      • Determine what history information is required by the user to be returned.
        • Discuss surfacing history of value sets in a REST service
      • Discuss if this fits into LexEVS or is part of a new REST API

Kim Ong

Tracy Safran

Rob Wynne

End users such as CTRP, others

Larry Wright

Margaret Haber

Sherri De Coronado

Gilberto Fragoso

DiffCTS2_LexEVS.pptx

Attendees: 

Discussion Points:

Decision Points:

 

...

Prioritization and debrief

Determine strategic direction and priorities

Kim Ong

Tracy Safran

Rob Wynne

Larry Wright

Margaret Haber

Sherri De Coronado

Gilberto Fragoso

...

Scott Bauer, Craig Stancl, Cory Endle, Gilberto Fragoso, Larry Wright, Lyubov, Tracy Safran, Rob Wynne, Kim Ong, Jason Lucas, Liz, Margaret, Bron Kiesler, Sherri De Coronado, Sana Din

Discussion Points:

  • EVS REST API
    • CTRP and GDC are currently using the API.
    • https://evsrestapi.nci.nih.gov/evsrestapi/swagger-ui.html#/evs-controller
    • Working with other groups for API usage.
    • CTRP in the URL identifies the data being searched.  Kim noted that in future it would be coding system and version.
    • Modeled specific for the CTRP usecases.  Data in the response may have specific CTRP.
    • Larry would perfer to have a common API that could then be customized for specific users.  
    • Suggested to perform a gap analysis to look at what caDSR current uses and what is available in the EVS CTRP API.
      • Will need to get parameter instances so we can know exactly what is used. (restrict to code/properties, etc)
      • This should be reviewed by NG, Tracy, Scott, etc and the caDSR team (Natalia, Vikram, etc.)
    • Agreement to have a joint working group to help build a combined REST service to get content from both LexEVS and TripleStore environments
    • Minimal testing has been completed (10 concurrent users).
  • REST Service Future Direction
    • The REST service should not have application specific content.
    • There will be a need to consider input from users -  caDSR, GDC, CDISC, CTRP 
    • Sherri suggested that it may be worthwhile to generalize the existing CTRP REST service and then get user feedback.
    • Considerations for REST services:
      • Keep the CTRP API separate and build a more common API
      • Create a common API and provide specific convience APIs on top of the common API for specific customers.
      • Fill in the gaps (to be identified) with CTS2 based REST service. Create local extensions and look at opportunities to update the specification.
    • Customer needs will come first.
  • LexEVS REST (CTS2) and LexEVS Remote Service gap analysis
    • Service Discovery - could be created
    • Paging and Iteration - no gap
    • Entity Search - gaps in search type (LexEVS 6.4 Search Algorithm Implementation Details)
    • Result Sorting - gaps exist
    • Custom Result Filtering - gap exists
    • CTS2 URI Read of Entiy - could be created
    • Code System Operations - gap exists
    • Entity Count - gap exists
    • Relational Operations - gap exists
    • Query Parameters - gap exists
    • Result Parameters - gap exists
    • Operation Functions - gap exists
    • Value Set Operations - gap exists
    • Value Set Search - gap exists

Decision Points:

  • Action Items:
    • Perform a gap analysis to look at what caDSR current uses and what is available in the EVS CTRP API.
    • Assemble a joint working group to investigate a combined/coordinated REST service to get content from both LexEVS and TripleStore environments.
    • Identify additions to the existing EVS API to make it more useful to users.

 

TimeLocationTopicsParticipantsResources
 1:00 PM - 3:00 PM5-E-030
Prioritization and debrief

Determine strategic direction and priorities

Kim Ong

Tracy Safran

Rob Wynne

Larry Wright

Margaret Haber

Sherri De Coronado

Gilberto Fragoso


 

Attendees: 

Tracy Safran, Jason Lucas, Rob Wynne, Larry Wright, Gilberto Fragoso, Kumar, Cory Endle, Scott Bauer, Craig Stancl, Lyubov, Margaret Haber, Bron, Sherri de Coronado, Sana Din

Discussion Points:

  • Overall Impessions and themes
    • Mappings
      • Extract mapping from Meta efficiently
        • This would save effort of creating custom maps.
      • Extend the model of mappings (is it supported in CTS2) to support different types of maps.
        • ie ICD-O3, Meta and the logic (OR, AND..)
    • Diversity of paths through the architecture
      • Addition of triple store
      • Multiple APIs to address user communities.
      • flow described - input of data through the tooling and delivery
      • 2 views for documentation were identified:
        • Focus on what users needed
        • Focus on overall architecture (technical)
    • Remote API Roadmap
      • Determine the replacement for what of the API is needed. (based on gap analysis)
      • Determine current users and identify what is required for those users.  
    • REST API 
      • Federation using SPARQL or other tooling
      • Big Data will require that performance be addressed (caching, etc.)
        • Will support the annotation pipelines
      • LexEVS will need to provide REST services for content not available in TripleStore
      • Provide documentation to better help users
    • Report Writer
      • Support for other terminologies. LexEVS REST services/EVS REST Services
    • SWAGGER documentation
      • Differentiate from the general API and CTRP specific API
    • Microservices
      • integration of triple store to support/enhance LexEVS functionality
      • Hierarchy/Transitive Table support.
    • User needs to create a unified service
      • Discuss with stakeholders to gather requirements
      • Determine how to move forward based on the requirements (best practice)
        • Separate APIs
        • Combined APIs
      • Ensure the service simpilfies what the user needs to know about the technical implementation.
      • This could be several months of effort (across teams)
    • User Education - Enable users to use the services
      • Provide better documentation for end users.
      • Provide mapping of source into LexEVS or REST models so users can understand how to query the service in LexEVS
      • Review and update Wiki Organization
      • Provide documentation to aid in building applications that will utilize the services (REST, Java API, TripleStore/SPARQL/ftp)
      • Architecture diagrams to describe the 1) flow of data and 2) technical specifics.
      • Provide timeline for enhancement (REST Services), dates for deprecation, system deployments
      • LexEVS REST Code Migration Guide
    • Build and Deploy (Docker)
      • Continued development of Docker containers with the systems team.
      • Use of Node.js to be discussed with systems team.
      • Investigate use of Docker for data deployments. 
      • Migrate CTS2 API from Heroku to NCI.

 

Decision Points:

  • Action Items

Attendees: 

Discussion Points:

...

  • :