Meeting date: July 20, 2021
WebEx recording of 7/20/2021 meeting
- Introduction: Medical Image De-Identification Initiative (MIDI)
- Task Group goals
- Steering Committee
- Timeline
- Discussion
Meeting date: August 10, 2021
WebEx recording of 8/10/2021 meeting
- Instructions to access the MIDI Task Group wiki page
- Accept Mendeley invitation to access private group for literature review/annotated bibliography
- Outline of approach
- metadata vs. pixel data
- metadata
- structured (strongly typed) vs. text
- pixel data
- burned-in text ("printed" and hand-written)
- identifiable features (e.g., faces, iris, retina)
- with or without "public" data to compare with
- Challenging topics
- evaluation of success of de-identification
- quantitative comparison of performance
- quantifying re-identification risk
- creating test data sets
- faces (etc.) reconstructed from cross-sections
- burned-in text - detection, removal, cleaning
- cleaning text descriptors (metadata or burned in)
- buried metadata (e.g., EXIF, geotags in JPEG inside DICOM)
- dates (incl. preserving temporal relationships)
- pseudonym consistency across separate submissions
- risks of hashing to create pseudonymous identifiers
- uniqueness of images limits statistical approaches
- loss allowable during de-identification (e.g., age fuzzing, pixels)
- private data element preservation to retain utility
- ultrasound - still frames and cine loops, lossy compressed
- photographs
- video
- gross pathology and whole slide images (incl. labels)
- IRB/ethics committee messaging wrt. de-identification decisions
- IT security approval/audits of de-identification
- regulatory requirements: HIPAA Privacy Rule, GDPR, CCPA, others?
- sufficiency of standards, e.g., DICOM PS3.15 Annex E
- risk of not following a standard (home-grown decisions)
- threat of image "signatures", private set intersection methods
- policy versus the technical details of recompression/decompression artifacts for JPEG
- data minimization
- Inventory of tools
- user interface vs. scripted (bulk, service)
- configurable - user vs. installer vs. hard-coded
- platform, language
- open source, free, commercial, service
- on-site vs. outside (e.g., [IP]II needs to leave walls for AI on cloud)
- Roadmap and deliverables
- interim report
- full report
- "primer" on medical image de-identification for newbies/execs
- confirm what is out of scope (non-goals) - consent, data use agreements, ...
- interim report
- Tasking: Members will think about which task they would like to contribute to.
Meeting date: September 14, 2021
WebEx recording of 9/14/2021 meeting
- Role of AI in de-identification - demand for data, opportunities, threats