Dprov

Yuan Tian at CICI Presentation Series

September 4, 2025

PI Yuan Tian discusses the Dprov project: A Data Provenance Framework for Medical Machine Learning. In this talk Tian reviews the needs for data integrity, provence and authenticity. Integrity involves detecting public ML models trained on corrupt data, Provence involves establishing a Standardize efficient, reproducible dataset-model tracking. Authenticity involves removing patient or corrupt data from models effectively. Tian further discusses data set requirements.

Data

Associated Links