Patents by Inventor Yushu Cao

Yushu Cao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Voice awakening method and device

Patent number: 11322138

Abstract: A voice awakening method and device are provided. According to an embodiment, the method includes: receiving voice information of a user; obtaining an awakening confidence level corresponding to the voice information based on the voice information; determining, on the basis of the awakening confidence level, whether the voice information is suspected wake-up voice information; and, in response to determining that the voice information is suspected wake-up voice information, performing a secondary determination on the voice information to obtain a secondary determination result, and determining whether to perform a wake-up operation on the basis of the secondary determination result. The embodiment performs a secondary verification of the voice information, thereby reducing the probability that the smart device is mistakenly awakened.
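
The two-stage check described in this abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the names `decide_wake`, `score_confidence`, and `secondary_check`, and the single `suspect_threshold` value, are hypothetical, and the abstract does not say how the confidence level or the secondary determination is computed.

```python
from typing import Callable


def decide_wake(
    voice: bytes,
    score_confidence: Callable[[bytes], float],  # hypothetical first-pass scorer
    secondary_check: Callable[[bytes], bool],    # hypothetical second-pass verifier
    suspect_threshold: float = 0.5,              # assumed value, not from the patent
) -> bool:
    """Return True only if both stages agree the audio is a wake-up request."""
    # Stage 1: obtain an awakening confidence level for the voice information.
    confidence = score_confidence(voice)

    # Audio below the threshold is not even suspected wake-up audio.
    if confidence < suspect_threshold:
        return False

    # Stage 2: suspected wake-up audio gets a secondary determination,
    # and the wake-up decision follows that result.
    return secondary_check(voice)
```

The second stage runs only for audio that passes the first, so a more expensive verifier would be invoked only on suspected wake-ups.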

Type: Grant

Filed: February 6, 2019

Date of Patent: May 3, 2022

Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co., Ltd.

Inventors: Jun Li, Rui Yang, Lifeng Zhao, Xiaojian Chen, Yushu Cao

Voice interaction method, device and computer readable storage medium

Patent number: 11282519

Abstract: Embodiments of the present disclosure provide a voice interaction method, device, and a computer readable storage medium. Through a conversion of a terminal device from a near-field voice interaction mode to a far-field voice interaction mode, the terminal device is configured to perform the following operations when in the far-field voice interaction mode: obtaining voice information of a user; obtaining, according to the voice information, target information required by the user from a server; and playing the target information in a voice manner, so that the terminal device can be turned into a smart speaker with a screen in the far-field voice interaction mode, and into a common mobile phone or tablet computer in the near-field voice interaction mode. The terminal device provides the user with a flexible and convenient voice service.

Type: Grant

Filed: July 11, 2019

Date of Patent: March 22, 2022

Inventors: Yushu Cao, Qing Si, Qinglong He, Xiangdong Xue

Method for voice controlling, terminal device, cloud server and system

Patent number: 11127398

Abstract: Embodiments of the disclosure provide a method for voice controlling, a terminal device, a cloud server and a system. The method includes: receiving voice information with which a user performs voice control on a terminal device; transmitting the voice information to the cloud server, so that the cloud server determines, according to the voice information, a voice control and a control instruction in the current interface that match the voice information, and generates a corresponding voice control instruction; receiving the voice control instruction transmitted by the cloud server; and controlling, according to the voice control instruction, the corresponding voice control of the terminal device to perform an operation. The method allows the controls in the interface to be operated by voice, which extends the degree of voice control over the terminal device and improves the user experience.

Type: Grant

Filed: December 28, 2018

Date of Patent: September 21, 2021

Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.

Inventors: Lichao Xu, Yushu Cao, Lishang Xiao, Lifeng Zhao, Xiangdong Xue, Ji Zhou

Method and apparatus for identifying information

Patent number: 10803861

Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying information. One embodiment of the method includes: collecting to-be-processed audio in real time; performing voice recognition on the to-be-processed audio; and performing data processing on the to-be-processed audio when the audio is recognized as a wake-up word, where the wake-up word instructs that data processing be performed on the to-be-processed audio. The embodiment can identify keywords from the to-be-processed audio obtained in real time and then perform data processing on the to-be-processed audio, which improves completeness in obtaining the to-be-processed audio and accuracy in performing data processing on the to-be-processed audio.

Type: Grant

Filed: December 28, 2017

Date of Patent: October 13, 2020

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Xiaojian Chen, Lifeng Zhao, Jun Li, Yushu Cao

VOICE INTERACTION METHOD, DEVICE AND COMPUTER READABLE STORAGE MEDIUM

Publication number: 20190333513

Abstract: Embodiments of the present disclosure provide a voice interaction method, device, and a computer readable storage medium. Through a conversion of a terminal device from a near-field voice interaction mode to a far-field voice interaction mode, the terminal device is configured to perform the following operations when in the far-field voice interaction mode: obtaining voice information of a user; obtaining, according to the voice information, target information required by the user from a server; and playing the target information in a voice manner, so that the terminal device can be turned into a smart speaker with a screen in the far-field voice interaction mode, and into a common mobile phone or tablet computer in the near-field voice interaction mode. The terminal device provides the user with a flexible and convenient voice service.

Type: Application

Filed: July 11, 2019

Publication date: October 31, 2019

Inventors: Yushu CAO, Qing SI, Qinglong HE, Xiangdong XUE

METHOD FOR VOICE CONTROLLING, TERMINAL DEVICE, CLOUD SERVER AND SYSTEM

Publication number: 20190318736

Abstract: Embodiments of the disclosure provide a method for voice controlling, a terminal device, a cloud server and a system. The method includes: receiving voice information with which a user performs voice control on a terminal device; transmitting the voice information to the cloud server, so that the cloud server determines, according to the voice information, a voice control and a control instruction in the current interface that match the voice information, and generates a corresponding voice control instruction; receiving the voice control instruction transmitted by the cloud server; and controlling, according to the voice control instruction, the corresponding voice control of the terminal device to perform an operation. The method allows the controls in the interface to be operated by voice, which extends the degree of voice control over the terminal device and improves the user experience.

Type: Application

Filed: December 28, 2018

Publication date: October 17, 2019

Inventors: LICHAO XU, YUSHU CAO, LISHANG XIAO, LIFENG ZHAO, XIANGDONG XUE, JI ZHOU

VOICE AWAKENING METHOD AND DEVICE

Publication number: 20190251963

Abstract: A voice awakening method and device are provided. According to an embodiment, the method includes: receiving voice information of a user; obtaining an awakening confidence level corresponding to the voice information based on the voice information; determining, on the basis of the awakening confidence level, whether the voice information is suspected wake-up voice information; and, in response to determining that the voice information is suspected wake-up voice information, performing a secondary determination on the voice information to obtain a secondary determination result, and determining whether to perform a wake-up operation on the basis of the secondary determination result. The embodiment performs a secondary verification of the voice information, thereby reducing the probability that the smart device is mistakenly awakened.

Type: Application

Filed: February 6, 2019

Publication date: August 15, 2019

Inventors: Jun Li, Rui Yang, Lifeng Zhao, Xiaojian Chen, Yushu Cao

METHOD AND APPARATUS FOR IDENTIFYING INFORMATION

Publication number: 20190147860

Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying information. One embodiment of the method includes: collecting to-be-processed audio in real time; performing voice recognition on the to-be-processed audio; and performing data processing on the to-be-processed audio when the audio is recognized as a wake-up word, where the wake-up word instructs that data processing be performed on the to-be-processed audio. The embodiment can identify keywords from the to-be-processed audio obtained in real time and then perform data processing on the to-be-processed audio, which improves completeness in obtaining the to-be-processed audio and accuracy in performing data processing on the to-be-processed audio.

Type: Application

Filed: December 28, 2017

Publication date: May 16, 2019

Inventors: Xiaojian CHEN, Lifeng ZHAO, Jun LI, Yushu CAO

Dynamic kerning pair reduction for digital font rendering

Patent number: 10275426

Abstract: Systems, methods, and computer-readable media are disclosed for dynamic kerning pair reduction for digital font rendering. Example methods may include receiving a first font file comprising glyph data and a first set of kerning pairs, determining a first kerning pair of the first set of kerning pairs that comprises a kerning adjustment value below a kerning adjustment threshold, removing the first kerning pair from the first set of kerning pairs to generate a second set of kerning pairs, and generating a second font file comprising the glyph data and the second set of kerning pairs.
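
The filtering step described in this abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the patented method: the `FontFile` structure and the `reduce_kerning_pairs` name are hypothetical, and comparing the magnitude of the adjustment (rather than its signed value) against the threshold is an assumption the abstract does not confirm.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

KerningPairs = Dict[Tuple[str, str], int]  # (left glyph, right glyph) -> adjustment


@dataclass(frozen=True)
class FontFile:
    """Hypothetical stand-in for a font file: glyph data plus kerning pairs."""
    glyphs: Dict[str, str]
    kerning: KerningPairs


def reduce_kerning_pairs(font: FontFile, threshold: int) -> FontFile:
    """Build a second font file that keeps only significant kerning pairs."""
    # Assumption: a pair is dropped when the magnitude of its adjustment
    # is below the threshold, since kerning values are often negative.
    kept = {
        pair: adjustment
        for pair, adjustment in font.kerning.items()
        if abs(adjustment) >= threshold
    }
    # The glyph data is carried over unchanged; only the pair set shrinks.
    return FontFile(glyphs=font.glyphs, kerning=kept)
```

The first font file is left untouched, which matches the abstract's description of generating a second font file rather than modifying the first.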

Type: Grant

Filed: September 22, 2015

Date of Patent: April 30, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Yushu Cao, Sivarangini Ragavan, Michael Patrick Bacus

Dynamic contrast adjustments for glyph rendering

Patent number: 9620086

Abstract: Systems, methods, and computer-readable media are disclosed for dynamic contrast adjustment for glyph rendering. Example methods may include rendering a first glyph associated with a font in a first font size, increasing a first contrast of the first glyph in the first font size by adjusting a first grayscale value associated with the first glyph in the first font size to generate an adjusted first grayscale value, and storing the adjusted first grayscale value in a grayscale mapping table associated with the font, the grayscale mapping table comprising a default grayscale value for the first glyph in a second font size. Example methods may include generating a font file comprising the first glyph and the grayscale mapping table.
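
The grayscale mapping table described in this abstract can be sketched roughly as follows. This is an illustrative sketch only: the table layout keyed by glyph and font size, the function names, the 0 to 255 value range, and the multiplicative `gain` are all assumptions, since the abstract does not specify how the grayscale value is adjusted.

```python
from typing import Dict, Tuple

GrayTable = Dict[Tuple[str, int], int]  # (glyph, font size) -> grayscale value


def boost_contrast(value: int, gain: float = 1.25) -> int:
    """Scale a grayscale value up, clamped to an assumed 0-255 range."""
    return min(255, round(value * gain))


def build_mapping_table(defaults: GrayTable, adjusted_size: int,
                        gain: float = 1.25) -> GrayTable:
    """Copy the default table, adjusting only entries at `adjusted_size`."""
    table = dict(defaults)
    for (glyph, size), value in defaults.items():
        if size == adjusted_size:
            # Store the adjusted value for the first font size; entries for
            # other sizes keep their default grayscale values.
            table[(glyph, size)] = boost_contrast(value, gain)
    return table
```

For example, with defaults of 120 for the glyph "a" at sizes 8 and 16, `build_mapping_table(defaults, 8)` stores 150 for size 8 and leaves size 16 at 120.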

Type: Grant

Filed: June 26, 2015

Date of Patent: April 11, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Lokesh Joshi, Yushu Cao, Hao Hu