SONG, Shuo. Research on Cross-Modal Interaction Techniques between Natural Language Processing and Computer Vision. International Journal of Computer Science and Information Technology, U.K., v. 7, n. 2, p. 31–36, 2025. DOI: 10.62051/ijcsit.v7n2.03. Available at: https://wepub.org/index.php/IJCSIT/article/view/5770. Accessed: 18 Jun. 2026.