Song, S. (2025) “Research on Cross-Modal Interaction Techniques between Natural Language Processing and Computer Vision”, International Journal of Computer Science and Information Technology, 7(2), pp. 31–36. doi:10.62051/ijcsit.v7n2.03.