TY - GEN
T1 - Derm-T2IM
T2 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024
AU - Farooq, Muhammad Ali
AU - Yao, Wang
AU - Schukat, Michael
AU - Little, Mark A.
AU - Corcoran, Peter
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training and fine-tuning. In this context, we aim to incorporate enhanced data transformation techniques by extending the recent success of few-shot learning in text-to-image latent diffusion models. The optimally tuned model is further used for rendering high-quality skin lesion synthetic data with diverse and realistic characteristics, providing a valuable supplement and diversity to the existing training data. We investigate the impact of incorporating newly generated synthetic data into the training pipeline of state-of-the-art machine learning models, assessing its effectiveness in enhancing model performance and generalization to unseen real-world data. Experimental results demonstrate the efficacy of the synthetic data rendered through stable diffusion models helps in improving the robustness and adaptability of CNN and vision transformer (ViT) models on different real-world skin cancer datasets. The dataset along with the trained model are open-sourced on our GitHub https://github.com/MAli-Farooq/Derm-T2IM.
AB - This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training and fine-tuning. In this context, we aim to incorporate enhanced data transformation techniques by extending the recent success of few-shot learning in text-to-image latent diffusion models. The optimally tuned model is further used for rendering high-quality skin lesion synthetic data with diverse and realistic characteristics, providing a valuable supplement and diversity to the existing training data. We investigate the impact of incorporating newly generated synthetic data into the training pipeline of state-of-the-art machine learning models, assessing its effectiveness in enhancing model performance and generalization to unseen real-world data. Experimental results demonstrate the efficacy of the synthetic data rendered through stable diffusion models helps in improving the robustness and adaptability of CNN and vision transformer (ViT) models on different real-world skin cancer datasets. The dataset along with the trained model are open-sourced on our GitHub https://github.com/MAli-Farooq/Derm-T2IM.
UR - http://www.scopus.com/inward/record.url?scp=85214993217&partnerID=8YFLogxK
U2 - 10.1109/EMBC53108.2024.10781852
DO - 10.1109/EMBC53108.2024.10781852
M3 - Conference Publication
AN - SCOPUS:85214993217
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
BT - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 July 2024 through 19 July 2024
ER -