Patents by Inventor Sirjan Kafle

Sirjan Kafle has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12008331
    Abstract: Described herein are systems and methods for generating an embedding—a learned representation—for an image. The embedding for the image is derived to capture visual aspects, as well as textual aspects, of the image. An encoder-decoder is trained to generate the visual representation of the image. An optical character recognition (OCR) algorithm is used to identify text/words in the image. From these words, an embedding is derived by performing an average pooling operation on pre-trained embeddings that map to the identified words. Finally, the embedding representing the visual aspects of the image is combined with the embedding representing the textual aspects of the image to generate a final embedding for the image.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: June 11, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xun Luan, Aman Gupta, Sirjan Kafle, Ananth Sankar, Di Wen, Saurabh Kataria, Ying Xuan, Sakshi Verma, Bharat Kumar Jain, Xue Xia, Bhargavkumar Kanubhai Patel, Vipin Gupta, Nikita Gupta
  • Publication number: 20230206010
    Abstract: Described herein are systems and methods for generating an embedding—a learned representation—for an image. The embedding for the image is derived to capture visual aspects, as well as textual aspects, of the image. An encoder-decoder is trained to generate the visual representation of the image. An optical character recognition (OCR) algorithm is used to identify text/words in the image. From these words, an embedding is derived by performing an average pooling operation on pre-trained embeddings that map to the identified words. Finally, the embedding representing the visual aspects of the image is combined with the embedding representing the textual aspects of the image to generate a final embedding for the image.
    Type: Application
    Filed: December 23, 2021
    Publication date: June 29, 2023
    Inventors: Xun Luan, Aman Gupta, Sirjan Kafle, Ananth Sankar, Di Wen, Saurabh Kataria, Ying Xuan, Sakshi Verma, Bharat Kumar Jain, Xue Xia, Bhargavkumar Kanubhai Patel, Vipin Gupta, Nikita Gupta