Song, Shuo. “Research on Cross-Modal Interaction Techniques Between Natural Language Processing and Computer Vision”. International Journal of Computer Science and Information Technology, vol. 7, no. 2, Sept. 2025, pp. 31-36, https://doi.org/10.62051/ijcsit.v7n2.03.