Song, S. (2025). Research on Cross-Modal Interaction Techniques between Natural Language Processing and Computer Vision. International Journal of Computer Science and Information Technology, 7(2), 31-36. https://doi.org/10.62051/ijcsit.v7n2.03