
Intelligent Clothing Design Interaction System Based on Multimodal Learning
The rapid development of Artificial Intelligence (AI) in fashion design has enhanced the efficiency of clothing production and customization. However, conventional design systems often fail to effectively combine text and image information, such as sketches and product details, when creating innovative and user-centric designs. This paper proposes an Intelligent Clothing Design Interaction System (ICDIS) based on multimodal learning to improve AI-aided fashion design. The ICDIS system combines computer vision and Natural Language Processing (NLP) synergistically to provide an optimized, more efficient fashion design process. The proposed ICDIS utilizes multimodal learning by integrating transformer-based NLP for text description and computer vision for image processing. With the integration, real-time design modification is enabled, allowing designers to iteratively enhance outputs based on text input and visual feedback, thereby improving the AI-aided fashion design process to be more interactive, efficient, and creative. Empirical tests demonstrate that ICDIS enhances design accuracy by 27% (in terms of adherence to professional designer critiques) and accelerates design iterations by 35% compared to conventional AI-based design environments. Users’ satisfaction and engagement rates also rose by 22%, underscoring the efficacy of multimodal interaction for creative tasks. This work has extensive potential for fashion design, e-shopping, and online personalization, where AI-based programs can increase creativity and efficiency. In its ability to bridge the gap between text- based inspiration and image-based design, ICDIS represents a groundbreaking step toward a more informed, interactive apparel design paradigm.
[1] Altenburg T., Chen X., Lutkenhorst W., Staritz C., and Whitfield L., “Exporting out of China or out of Africa? Automation Versus Relocation in the Global Clothing Industry,” Discussion Paper, no. 1, pp. 1-35, 2020. https://hdl.handle.net/10419/213279
[2] Chen L., Fang Q., and Chen Y., “Intelligent Clothing Interaction Design and Evaluation System based on DCGAN Image Generation Module and Kansei Engineering,” in Proceedings of the Sustainability Innovation and Fashion Technology International Conference, Shanghai, pp. 1-6, 2021. DOI:10.1088/1742- 6596/1790/1/012025
[3] Chowdhury S., Faisal M., Rahman Z., and Ahmed M., “Transforming Operational Efficiency: The Impact of AI-Driven Sustainable Management and Lean Manufacturing Practices in the US Fashion Industry,” Journal of Next-Gen Engineering Systems, vol. 2, no. 1, pp. 1-10, 2025.
[4] Deng H. and Liu M., “Personalized Smart Clothing Design Based on Multimodal Visual Data Detection,” Computational Intelligence and Neuroscience, vol. 2022, pp. 1-12, 2022. https://doi.org/10.1155/2022/4440652
[5] Deng M., Liu Y., and Chen L., “AI-Driven Innovation in Ethnic Clothing Design: An Intersection of Machine Learning and Cultural Heritage,” Electronic Research Archive, vol. 31, no. 9, pp. 5793-5814, 2023. https://aimspress.com/article/doi/10.3934/era.202 3295
[6] Dong M., Zeng X., Koehl L., and Zhang J., “An Interactive Knowledge-Based Recommender System for Fashion Product Design in the Big Data Environment,” Information Sciences, vol. 540, pp. 469-488, 2020. https://doi.org/10.1016/j.ins.2020.05.094
[7] Huang B., Lu Q., Huang S., Wang X., and Yang H., “Multi-Modal Clothing Recommendation Model Based on Large Model and VAE Enhancement,” arXiv Preprint, vol. arXiv:2410.02219, pp. 1-11, 2024. https://doi.org/10.48550/arXiv.2410.02219
[8] Jiang M., Tian Z., Yu C., Shi Y., and et al., “Intelligent 3D Garment System of the Human Body Based on Deep Spiking Neural Network,” Virtual Reality and Intelligent Hardware, vol. 6, no. 1, pp. 43-55, 2024. https://doi.org/10.1016/j.vrih.2023.07.002
[9] Jiang Y., DeepFashion-MultiModal, https://github.com/yumingj/DeepFashion- MultiModal, Last Visited 2025.
[10] Kampakaki E. and Papahristou E., “Digital Intelligence as Prerequisite of Artificial Intelligence’s Integration in the Clothing Industry 4.0,” SETN Workshops, vol. 2844, pp. 36-41, 2020. https://ceur-ws.org/Vol-2844/fashion1.pdf
[11] Kang H., “Accelerating Innovation Through AI- Powered Conceptual Abstraction and Interaction Design,” Doctoral Dissertation, Carnegie Mellon University, 2024. https://doi.org/10.1184/R1/26863186
[12] Kim J., Laine T., and Ahlund C., “Multimodal Interaction Systems Based on Internet of Things and Augmented Reality: A Systematic Literature Review,” Applied Sciences, vol. 11, no. 4, pp. 1738-1756, 2021. https://doi.org/10.3390/app11041738
[13] Krishnamoorthy R. and Manojkumar C., “IoT- Based Robotic Systems Integrating Artificial Intelligence for Sustainable Crop Protection in Agriculture,” PatternIQ Mining, vol. 1, no. 3, pp. 1-11, 2024. https://doi.org/10.70023/piqm24301
[14] Mohiuddin Babu M., Akter S., Rahman M., Billah M., and Hack-Polay D., “The Role of Artificial Intelligence in Shaping the Future of the Agile Intelligent Clothing Design Interaction System Based on Multimodal Learning 581 Fashion Industry,” Production Planning and Control, vol. 35, no. 15, pp. 2084-2098, 2024. https://doi.org/10.1080/09537287.2022.2060858
[15] Pang K., Zou X., and Wong W., “FashionM3: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1234-1245, 2025. https://doi.org/10.48550/arXiv.2504.17826
[16] Ruckdashel R., Venkataraman D., and Park J., “Smart Textiles: A Toolkit to Fashion the Future,” Journal Of Applied Physics, vol. 129, no. 13, pp. 130901-130915, 2021. https://doi.org/10.1063/5.0024006
[17] Shakeel P., Aboobaider B., and Salahuddin L., “Detecting Lung Cancer Region from CT Image Using Meta-Heuristic Optimized Segmentation Approach,” International Journal of Pattern Recognition and Artificial Intelligence, vol. 36, no. 16, pp. 2240001, 2022. https://doi.org/10.1142/S0218001422400018
[18] Sharma S., Koehl L., Bruniaux P., Zeng X., and Wang Z., “Development of an Intelligent Data- Driven System to Recommend Personalized Fashion Design Solutions,” Sensors, vol. 21, no. 12, pp. 4239-4255, 2021. https://doi.org/10.3390/s21124239
[19] Shimizu R., Saito Y., Matsutani M., and Goto M., “Fashion Intelligence System: An Outfit Interpretation Utilizing Images and Rich Abstract Tags,” Expert Systems with Applications, vol. 213, pp. 119167, 2023. https://doi.org/10.1016/j.eswa.2022.119167
[20] Singh A. and Patras I., “Fashionsd-X: Multimodal Fashion Garment Synthesis Using Latent Diffusion,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5678-5689, 2024. https://doi.org/10.48550/arXiv.2404.18591
[21] Wang T. and Ye M., “TexFit: Text-Driven Fashion Image Editing with Diffusion Models,” in Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, pp. 10198-10206, 2024. https://doi.org/10.1609/aaai.v38i9.28885
[22] Wang Z., Tao X., Zeng X., Xing Y., and et al., “An Interactive Personalized Garment Design Recommendation System Using Intelligent Techniques,” Applied Sciences, vol. 12, no. 9, pp. 4654-4668, 2022. https://doi.org/10.3390/app12094654