Patents by Inventor Lifeng Shang

Lifeng Shang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND SYSTEM FOR TRAINING LARGE-SCALE LANGUAGE MODELS

Publication number: 20240127000

Abstract: A computer-implemented method is provided for model training performed by a processing system. The method comprises determining a set of first weights based on a first matrix associated with a source model, determining a set of second weights based on the set of first weights, forming a second matrix associated with a target model based on the set of first weights and the set of second weights, initializing the target model based on the second matrix, and training the target model.

Type: Application

Filed: September 30, 2022

Publication date: April 18, 2024

Inventors: Yichun Yin, Lifeng Shang, Cheng Chen, Xin Jiang, Xiao Chen, Qun Liu
DATA PROCESSING METHOD AND RELATED DEVICE

Publication number: 20240119268

Abstract: This disclosure relates to the field of artificial intelligence, and discloses a data processing method. The method includes: obtaining a transformer model including a target network layer and a target module; and processing to-be-processed data by using the transformer model, to obtain a data processing result. The target module is configured to: perform a target operation on a feature map output at the target network layer, to obtain an operation result, and fuse the operation result and the feature map output, to obtain an updated feature map output. In this disclosure, the target module is inserted into the transformer model, and the operation result generated by the target module and an input are fused, so that information carried in a feature map output by the target network layer of the transformer model is increased.

Type: Application

Filed: November 30, 2023

Publication date: April 11, 2024

Inventors: Lu HOU, Lifeng SHANG, Xin JIANG, Li QIAN
METHOD AND DEVICE FOR COMPRESSING GENERATIVE PRE-TRAINED LANGUAGE MODELS VIA QUANTIZATION

Publication number: 20240104346

Abstract: A method is provided for quantizing a neural network model performed by a processing system. The method comprises determining a scaling factor based on a distribution of weights associated with the neural network model, determining quantized weights based on the scaling factor and the weights associated with the distribution, determining a training loss of the neural network model based on the quantized weights during training of the neural network model, and determining an updated scaling factor for the neural network model based on a gradient of the training loss.

Type: Application

Filed: September 15, 2022

Publication date: March 28, 2024

Inventors: Lu HOU, Chaofan TAO, Wei ZHANG, Lifeng SHANG, Xin JIANG, Qun LIU, Li QIAN
MODEL COMPRESSION METHOD AND APPARATUS

Publication number: 20230229912

Abstract: A model compression method is provided, which can be applied to the field of artificial intelligence. The method includes: obtaining a first neural network model, a second neural network model, and a third neural network model; processing first to-be-processed data using the first neural network model, to obtain a first output; processing the first to-be-processed data using the third neural network model, to obtain a second output; determining a first target loss based on the first output and the second output, and updating the second neural network model based on the first target loss, to obtain an updated second neural network model; and compressing the updated second neural network model to obtain a target neural network model. The model generated based on the method has higher processing precision.

Type: Application

Filed: March 20, 2023

Publication date: July 20, 2023

Inventors: Wei ZHANG, Lu HOU, Yichun YIN, Lifeng SHANG
Paraphrase sentence generation method and apparatus

Patent number: 11586814

Abstract: A paraphrase sentence generation method and apparatus relating to the research field of natural language processing include generating m second sentences based on a first sentence and a paraphrase generation model, determining a matching degree between each of the m second sentences and the first sentence based on a paraphrase matching model, and determining n second sentences from the m second sentences based on matching degrees among the m second sentences and the first sentence, where the paraphrase generation model is obtained through reinforcement learning-based training based on a reward of the paraphrase matching model.

Type: Grant

Filed: April 23, 2020

Date of Patent: February 21, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Xin Jiang, Lifeng Shang, Hang Li, Zichao Li
DATA PROCESSING METHOD AND RELATED DEVICE

Publication number: 20220383078

Abstract: In a data processing method, a processing device obtains a first neural network model and an available resource state of a terminal device, and determines a second neural network model based on the first neural network model and the available resource state. An appropriate model size is determined based on the available resource state, and a part of the first neural network model is selected, based on the determined model size, as the second neural network model on which data processing is to be performed.

Type: Application

Filed: August 8, 2022

Publication date: December 1, 2022

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Lu HOU, Lifeng SHANG, Xin JIANG
COMMAND EXECUTION METHOD, APPARATUS, AND DEVICE

Publication number: 20220214894

Abstract: Embodiments of the present disclosure disclose a command execution method and apparatus, a terminal, and a server related speech recognition and natural language processing. In the command execution method, during an interaction between a terminal and a user, a server configured to execute a user command or the terminal may store slots and GUI information corresponding to the slots. When the filling information of the slots configured for the user command is missing, the server configured to execute the user command may obtain the missing filling information of the slots from the stored GUI information, to avoid multiple interactions between the user and the terminal. The interaction between the user and the terminal is made more intelligent, thus improving command execution efficiency.

Type: Application

Filed: March 22, 2022

Publication date: July 7, 2022

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Tao CAI, Lifeng SHANG, Xiaoguang LI, Yuyang ZHANG, Wei ZHANG, Li QIAN
TEXT PROCESSING MODEL TRAINING METHOD, AND TEXT PROCESSING METHOD AND APPARATUS

Publication number: 20220180202

Abstract: A text processing model training method, and a text processing method and apparatus in the natural language processing field in the artificial intelligence field are disclosed. The training method includes: obtaining training text; separately inputting the training text into a teacher model and a student model to obtain sample data output by the teacher model and prediction data output by the student model; the sample data includes a sample semantic feature and a sample label; the prediction data includes a prediction semantic feature and a prediction label; and the teacher model is a pre-trained language model used for text classification; and training a model parameter of the student model based on the sample data and the prediction data, to obtain a target student model. The method enables the student model to effectively perform knowledge transfer, thereby improving accuracy of a text processing result of the student model.

Type: Application

Filed: February 28, 2022

Publication date: June 9, 2022

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen
Human-Computer Dialogue Method and Apparatus

Publication number: 20220147848

Abstract: A method includes: obtaining a text entered by a user; determining at least one topic related to the text; determining a target dialogue robot from the plurality of dialogue robots based on the at least one topic related to the text and a predefined mapping relationship between a dialogue robot and a topic, where a target topic corresponding to the target dialogue robot is some or all of the at least one topic related to the text; allocating the text to the target dialogue robot; and obtaining a reply for the text from the target dialogue robot, where the reply is generated by the target dialogue robot based on at least one semantic understanding of the text.

Type: Application

Filed: January 18, 2022

Publication date: May 12, 2022

Inventors: Lifeng Shang, Zhengdong Lu, Hang LI
Human-computer dialogue method and apparatus

Patent number: 11308405

Abstract: An apparatus is pre-equipped with a plurality of dialogue robots, and each dialogue robot is configured to conduct a human-computer dialogue based on at least one topic. The method includes: obtaining a text entered by a user; determining at least one topic related to the text, and determining a target dialogue robot from the plurality of dialogue robots based on the at least one topic related to the text and a predefined mapping relationship between a dialogue robot and a topic, where a target topic corresponding to the target dialogue robot is some or all of the at least one topic related to the text; and allocating the text to the target dialogue robot and obtaining a reply for the text from the target dialogue robot, where the reply is generated by the target dialogue robot based on at least one semantic understanding of the text.

Type: Grant

Filed: July 17, 2019

Date of Patent: April 19, 2022

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Lifeng Shang, Zhengdong Lu, Hang Li
Sequence translation probability adjustment

Patent number: 11132516

Abstract: A sequence conversion method includes receiving a source sequence, converting the source sequence into a source vector representation sequence, obtaining at least two candidate target sequences and a translation probability value of each of the at least two candidate target sequences according to the source vector representation sequence, adjusting the translation probability value of each candidate target sequence, selecting an output target sequence from the at least two candidate target sequences according to an adjusted translation probability value of each candidate target sequence, and outputting the output target sequence. Hence, loyalty of a target sequence to a source sequence can be improved during sequence conversion.

Type: Grant

Filed: April 26, 2019

Date of Patent: September 28, 2021

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Zhaopeng Tu, Lifeng Shang, Xiaohua Liu, Hang Li
Information Processing Method and Apparatus

Publication number: 20200264923

Abstract: An information processing method includes receiving first request information entered by a user, determining a first task engine for the first request information, where a first slot is set in the first task engine, extracting key information from the first request information based on the first slot, and if the key information fails to be extracted from the first request information based on the first slot, or if the key information is extracted from the first request information based on the first slot, but the extracted key information does not meet a condition, obtaining target key information from a shared parameter list of the user.

Type: Application

Filed: May 7, 2020

Publication date: August 20, 2020

Inventors: Zhefeng Yan, Lifeng Shang, Tao Cai, Li Qian
Paraphrase Sentence Generation Method and Apparatus

Publication number: 20200250377

Abstract: A paraphrase sentence generation method and apparatus relating to the research field of natural language processing include generating m second sentences based on a first sentence and a paraphrase generation model, determining a matching degree between each of the m second sentences and the first sentence based on a paraphrase matching model, and determining n second sentences from the m second sentences based on matching degrees among the m second sentences and the first sentence, where the paraphrase generation model is obtained through reinforcement learning-based training based on a reward of the paraphrase matching model.

Type: Application

Filed: April 23, 2020

Publication date: August 6, 2020

Inventors: Xin Jiang, Lifeng Shang, Hang Li, Zichao Li
Human-Computer Dialogue Method and Apparatus

Publication number: 20190341021

Abstract: An apparatus is pre-equipped with a plurality of dialogue robots, and each dialogue robot is configured to conduct a human-computer dialogue based on at least one topic. The method includes: obtaining a text entered by a user; determining at least one topic related to the text, and determining a target dialogue robot from the plurality of dialogue robots based on the at least one topic related to the text and a predefined mapping relationship between a dialogue robot and a topic, where a target topic corresponding to the target dialogue robot is some or all of the at least one topic related to the text; and allocating the text to the target dialogue robot and obtaining a reply for the text from the target dialogue robot, where the reply is generated by the target dialogue robot based on at least one semantic understanding of the text.

Type: Application

Filed: July 17, 2019

Publication date: November 7, 2019

Inventors: Lifeng Shang, Zhengdong Lu, Hang Li
Sequence Conversion Method and Apparatus

Publication number: 20190251178

Abstract: A sequence conversion method includes receiving a source sequence, converting the source sequence into a source vector representation sequence, obtaining at least two candidate target sequences and a translation probability value of each of the at least two candidate target sequences according to the source vector representation sequence, adjusting the translation probability value of each candidate target sequence, selecting an output target sequence from the at least two candidate target sequences according to an adjusted translation probability value of each candidate target sequence, and outputting the output target sequence. Hence, loyalty of a target sequence to a source sequence can be improved during sequence conversion.

Type: Application

Filed: April 26, 2019

Publication date: August 15, 2019

Inventors: Zhaopeng Tu, Lifeng Shang, Xiaohua Liu, Hang Li
Method and Apparatus for Determining Key Social Information

Publication number: 20170034111

Abstract: A method and apparatus for determining key social information, comprises acquiring directly-retransmitted social information and indirectly-retransmitted social information of original social information, and establishing a social information retransmitting tree; acquiring an information characteristic of each piece of retransmitted social information in the social information retransmitting tree; determining a characteristic vector of each piece of retransmitted social information according to the information characteristic of the retransmitted social information; inputting the obtained characteristic vector into a preset filtering model, and acquiring candidate key social information; and selecting final key social information from all candidate key social information according to a criticality evaluation value of each piece of candidate key social information.

Type: Application

Filed: July 29, 2016

Publication date: February 2, 2017

Inventors: Lifeng Shang, Jing Li, KamFai Wong
NEAR-DUPLICATE VIDEO RETRIEVAL

Publication number: 20150332124

Abstract: A similarity of a first video to a second video may be identified automatically. Images are received from the videos, and divided into sub-images. The sub-images are evaluated based on a feature common to each of the sub-images. Binary representations of the images may be created based on the evaluation of the sub-images. A similarity of the first video to the second video may be determined based on a number of occurrences of a binary representation in the first video and the second video.

Type: Application

Filed: July 27, 2015

Publication date: November 19, 2015

Inventors: Linjun Yang, Lifeng Shang, Xian-Sheng Hua, Fei Wang
Near-duplicate video retrieval

Patent number: 9092520

Abstract: A similarity of a first video to a second video may be identified automatically. Images are received from the videos, and divided into sub-images. The sub-images are evaluated based on a feature common to each of the sub-images. Binary representations of the images may be created based on the evaluation of the sub-images. A similarity of the first video to the second video may be determined based on a number of occurrences of a binary representation in the first video and the second video.

Type: Grant

Filed: June 20, 2011

Date of Patent: July 28, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Linjun Yang, Lifeng Shang, Xian-Sheng Hua, Fei Wang
NEAR-DUPLICATE VIDEO RETRIEVAL

Publication number: 20120321181

Abstract: A similarity of a first video to a second video may be identified automatically. Images are received from the videos, and divided into sub-images. The sub-images are evaluated based on a feature common to each of the sub-images. Binary representations of the images may be created based on the evaluation of the sub-images. A similarity of the first video to the second video may be determined based on a number of occurrences of a binary representation in the first video and the second video.

Type: Application

Filed: June 20, 2011

Publication date: December 20, 2012

Applicant: Microsoft Corporation

Inventors: Linjun Yang, Lifeng Shang, Xian-Sheng Hua, Fei Wang