[1] S. Song, “Research on Cross-Modal Interaction Techniques between Natural Language Processing and Computer Vision”, IJCSIT, vol. 7, no. 2, pp. 31–36, Sep. 2025, doi: 10.62051/ijcsit.v7n2.03.