Patents by Inventor Baoguang SHI

Baoguang SHI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220391647
    Abstract: A method for customizing an optical character recognition system is disclosed. The optical character recognition system includes a general-purpose decoder configured to convert character images, recognized in a digital image, into text based on a general-purpose text structure. An application-specific customization is received. The application-specific customization includes an application-specific text structure that differs from the general-purpose text structure. A customized model is generated based on the application-specific customization. An enhanced application-specific decoder is generated by modifying the general-purpose decoder to, during run-time execution of the optical character recognition system, leverage the customized model to convert character images demonstrating the application-specific text structure into text.
    Type: Application
    Filed: June 3, 2021
    Publication date: December 8, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Baoguang SHI, Dinei Afonso Ferreira FLORENCIO
  • Patent number: 10607120
    Abstract: Disclosed are a training method and apparatus for a CNN model, which belong to the field of image recognition. The method comprises: performing a convolution operation, maximal pooling operation and horizontal pooling operation on training images, respectively, to obtain second feature images; determining feature vectors according to the second feature images; processing the feature vectors to obtain category probability vectors; according to the category probability vectors and an initial category, calculating a category error; based on the category error, adjusting model parameters; based on the adjusted model parameters, continuing the model parameters adjusting process, and using the model parameters when the number of iteration times reaches a pre-set number of times as the model parameters for the well-trained CNN model. After the convolution operation and maximal pooling operation on the training images on each level of convolution layer, a horizontal pooling operation is performed.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: March 31, 2020
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Xiang Bai, Feiyue Huang, Xiaowei Guo, Cong Yao, Baoguang Shi
  • Patent number: 10262241
    Abstract: Embodiments of this disclosure belong to the field of computer technologies and disclose a method and an apparatus for recognizing a character string in an image.
    Type: Grant
    Filed: September 30, 2017
    Date of Patent: April 16, 2019
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Xiang Bai, Juhong Wang, Tingting Liu, Wei Chen, Baoguang Shi, Cong Yao, Pengyuan Lv
  • Publication number: 20180225552
    Abstract: Disclosed are a training method and apparatus for a CNN model, which belong to the field of image recognition. The method comprises: performing a convolution operation, maximal pooling operation and horizontal pooling operation on training images, respectively, to obtain second feature images; determining feature vectors according to the second feature images; processing the feature vectors to obtain category probability vectors; according to the category probability vectors and an initial category, calculating a category error; based on the category error, adjusting model parameters; based on the adjusted model parameters, continuing the model parameters adjusting process, and using the model parameters when the number of iteration times reaches a pre-set number of times as the model parameters for the well-trained CNN model. After the convolution operation and maximal pooling operation on the training images on each level of convolution layer, a horizontal pooling operation is performed.
    Type: Application
    Filed: March 30, 2018
    Publication date: August 9, 2018
    Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Xiang Bai, Feiyue Huang, Xiaowei Guo, Cong Yao, Baoguang Shi
  • Patent number: 9977997
    Abstract: Disclosed are a training method and apparatus for a CNN model, which belong to the field of image recognition. The method comprises: performing a convolution operation, maximal pooling operation and horizontal pooling operation on training images, respectively, to obtain second feature images; determining feature vectors according to the second feature images; processing the feature vectors to obtain category probability vectors; according to the category probability vectors and an initial category, calculating a category error; based on the category error, adjusting model parameters; based on the adjusted model parameters, continuing the model parameters adjusting process, and using the model parameters when the number of iteration times reaches a pre-set number of times as the model parameters for the well-trained CNN model. After the convolution operation and maximal pooling operation on the training images on each level of convolution layer, a horizontal pooling operation is performed.
    Type: Grant
    Filed: April 12, 2017
    Date of Patent: May 22, 2018
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Xiang Bai, Feiyue Huang, Xiaowei Guo, Cong Yao, Baoguang Shi
  • Publication number: 20180025256
    Abstract: Embodiments of this disclosure belong to the field of computer technologies and disclose a method and an apparatus for recognizing a character string in an image.
    Type: Application
    Filed: September 30, 2017
    Publication date: January 25, 2018
    Inventors: Xiang BAI, Juhong WANG, Tingting LIU, Wei CHEN, Baoguang SHI, Cong YAO, Pengyuan LV
  • Publication number: 20170220904
    Abstract: Disclosed are a training method and apparatus for a CNN model, which belong to the field of image recognition. The method comprises: performing a convolution operation, maximal pooling operation and horizontal pooling operation on training images, respectively, to obtain second feature images; determining feature vectors according to the second feature images; processing the feature vectors to obtain category probability vectors; according to the category probability vectors and an initial category, calculating a category error; based on the category error, adjusting model parameters; based on the adjusted model parameters, continuing the model parameters adjusting process, and using the model parameters when the number of iteration times reaches a pre-set number of times as the model parameters for the well-trained CNN model. After the convolution operation and maximal pooling operation on the training images on each level of convolution layer, a horizontal pooling operation is performed.
    Type: Application
    Filed: April 12, 2017
    Publication date: August 3, 2017
    Inventors: Xiang BAI, Feiyue HUANG, Xiaowei GUO, Cong YAO, Baoguang SHI