NIH | National Cancer Institute | NCI Wiki  

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Meeting date: July 20, 2021

WebEx recording of 7/20/2021 meeting

  • Introduction: Medical Image De-Identification Initiative (MIDI)
  • Task Group goals
  • Steering Committee
  • Timeline
  • Discussion

Meeting date: August 10, 2021

WebEx recording of 8/10/2021 meeting

  • Instructions to access the MIDI Task Group wiki page
  • Accept Mendeley invitation to access private group for literature review/annotated bibliography
  • Outline of approach
    • metadata vs. pixel data
    • metadata
    • structured (strongly typed) vs. text
    • pixel data
    • burned-in text ("printed" and hand-written)
    • identifiable features (e.g., faces, iris, retina)
    • with or without "public" data to compare with
  • Challenging topics
    • evaluation of success of de-identification
    • quantitative comparison of performance
    • quantifying re-identification risk
    • creating test data sets
    • faces (etc.) reconstructed from cross-sections
    • burned-in text - detection, removal, cleaning
    • cleaning text descriptors (metadata or burned in)
    • buried metadata (e.g., EXIF, geotags in JPEG inside DICOM)
    • dates (incl. preserving temporal relationships)
    • pseudonym consistency across separate submissions
    • risks of hashing to create pseudonymous identifiers
    • uniqueness of images limits statistical approaches
    • loss allowable during de-identification (e.g., age fuzzing, pixels)
    • private data element preservation to retain utility
    • ultrasound - still frames and cine loops, lossy compressed
    • photographs
    • video
    • gross pathology and whole slide images (incl. labels)
    • IRB/ethics committee messaging wrt. de-identification decisions
    • IT security approval/audits of de-identification
    • regulatory requirements: HIPAA Privacy Rule, GDPR, CCPA, others?
    • sufficiency of standards, e.g., DICOM PS3.15 Annex E
    • risk of not following a standard (home-grown decisions)
    • threat of image "signatures", private set intersection methods
    • policy versus the technical details of recompression/decompression artifacts for JPEG
    • data minimization
  • Inventory of tools
    • user interface vs. scripted (bulk, service)
    • configurable - user vs. installer vs. hard-coded
    • platform, language
    • open source, free, commercial, service
    • on-site vs. outside (e.g., [IP]II needs to leave walls for AI on cloud)
  • Roadmap and deliverables
    • interim report
      • full report
      • "primer" on medical image de-identification for newbies/execs
      • confirm what is out of scope (non-goals) - consent, data use agreements, ...
  • Tasking: Members will think about which task they would like to contribute to.

Meeting date: September 14, 2021

WebEx recording of 9/14/2021 meeting

  • Role of AI in de-identification - demand for data, opportunities, threats