Patents Assigned to Sichuan Institute of Artificial Intelligence, Yibin, Sichuan, China
  • Publication number: 20240086643
    Abstract: A visual dialogue method and system are provided. The method includes obtaining original input data, where the original input data includes current image data and a new question, and the new question is related to the current image data; preprocessing text data and image data in the original input data to obtain a text feature sequence and a visual feature sequence, respectively; using a VisDial dataset to construct a text corpus; obtaining text sequence knowledge by using a potential knowledge searcher based on the visual feature sequence and the text corpus; constructing a sparse scene graph based on the visual feature sequence; performing data fusion on the text feature sequence, the visual feature sequence, the text sequence knowledge, and the sparse scene graph to obtain a data fusion result; and obtaining dialogue content of the new question by using a decoder based on the data fusion result.
    Type: Application
    Filed: October 27, 2022
    Publication date: March 14, 2024
    Applicant: Sichuan Institute of Artificial Intelligence, Yibin, Sichuan, China
    Inventors: Lei ZHAO, Junlin LI, Jie SHAO, Lianli GAO, Jingkuan SONG
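
The data flow described in the abstract of publication 20240086643 can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the knowledge searcher, the scene graph construction, the fusion step, or the decoder work internally. Everything below is an assumption made for illustration: features are taken as precomputed lists of floats, the searcher is replaced by dot-product ranking against a mean-pooled visual query, the sparse scene graph keeps only the most similar neighbours of each region, fusion is a plain dictionary, and the decoder is a caller-supplied function. All function names (`search_knowledge`, `build_sparse_scene_graph`, `answer`) are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_pool(seq: List[Vector]) -> Vector:
    """Average a sequence of vectors into one vector."""
    return [sum(col) / len(seq) for col in zip(*seq)]


def search_knowledge(
    visual_seq: List[Vector],
    corpus: List[Tuple[str, Vector]],
    top_k: int,
) -> List[str]:
    """Assumed stand-in for the knowledge searcher: rank corpus sentences
    by dot product with the mean-pooled visual features, keep the top_k."""
    query = mean_pool(visual_seq)
    ranked = sorted(corpus, key=lambda item: dot(query, item[1]), reverse=True)
    return [sentence for sentence, _ in ranked[:top_k]]


def build_sparse_scene_graph(
    visual_seq: List[Vector], keep_per_node: int
) -> Dict[int, List[int]]:
    """Assumed stand-in for the sparse scene graph: each region keeps edges
    only to its keep_per_node most similar regions."""
    graph: Dict[int, List[int]] = {}
    for i, vi in enumerate(visual_seq):
        scores = [(dot(vi, vj), j) for j, vj in enumerate(visual_seq) if j != i]
        scores.sort(reverse=True)
        graph[i] = sorted(j for _, j in scores[:keep_per_node])
    return graph


def answer(
    text_seq: List[str],
    visual_seq: List[Vector],
    corpus: List[Tuple[str, Vector]],
    decoder: Callable[[dict], str],
    top_k: int = 2,
    keep_per_node: int = 1,
) -> str:
    """Collect the four inputs named in the abstract and pass them to a decoder.
    The dictionary is a placeholder for the fusion step, not a real fusion model."""
    fused = {
        "text": text_seq,
        "visual": visual_seq,
        "knowledge": search_knowledge(visual_seq, corpus, top_k),
        "scene_graph": build_sparse_scene_graph(visual_seq, keep_per_node),
    }
    return decoder(fused)


# Toy data, invented for this example only.
toy_visual = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
toy_corpus = [
    ("a dog on grass", [1.0, 0.0]),
    ("a red car", [0.0, 1.0]),
    ("two dogs playing", [0.8, 0.2]),
]
# A trivial decoder that just echoes the best-ranked knowledge sentence.
print(answer(["what", "animal", "?"], toy_visual, toy_corpus,
             decoder=lambda fused: fused["knowledge"][0]))
```

In this sketch the decoder is passed in as a function so that the retrieval and graph steps can be inspected on their own; a real system would presumably use learned models for each stage.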