Patents by Inventor Fuliang Weng

Fuliang Weng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods for clear call under noisy conditions

Patent number: 12277950

Abstract: This invention provides a new and improved voice communication system with high quality noise cancellation method and devices to overcome the limitations and difficulties encountered in conventional technologies. This invention discloses a noise cancellation apparatus that includes a vibration sensor and a microphone for receiving and transmitting voice signals as incoming speeches. The vibration sensor is applied to receive vibration signals corresponding to the voice signals for applying the vibration signals as reference signals for removing noise signals generated from environmental noises by converting vibration signals to intermediate PDL representation together with the speaker characteristics, mapping them into full band high quality clean acoustic representation, and synthesizing clear personal speech with characteristics identical to the original microphone speech without noises.

Type: Grant

Filed: December 7, 2021

Date of Patent: April 15, 2025

Inventor: Fuliang Weng
Speech recognition method, apparatus, and device, and computer-readable storage medium

Patent number: 12087289

Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.

Type: Grant

Filed: November 30, 2021

Date of Patent: September 10, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Weiran Nie, Fuliang Weng, Youjia Huang, Hai Yu, Shumang Hu
Methods and systems for confusion reduction for compressed acoustic models

Patent number: 12067978

Abstract: Methods and systems are disclosed herein for improvements relating to compressed automatic speech recognition (ASR) systems. The ASR system may comprise a compressed acoustic engine and an adaptive decoder. The adaptive decoder may be dynamically compiled based on characteristics of the compressed acoustic engine and a current state of the application device. In some embodiments, a dynamic command list is used to manage context-specific commands. Two or more commands recognized by the adaptive decoder may be confusable due to compression of the ASR system. Alternate commands may be determined that are semantically equivalent but phonetically different than the confusable commands to reduce classification error of the adaptive decoder. An alternate command may replace one or more of the confusable commands in the adaptive decoder. In some embodiments, a user interface is displayed to a user of the ASR system to select the alternate command for replacement in the decoder.

Type: Grant

Filed: June 1, 2021

Date of Patent: August 20, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Fuliang Weng, Alexei Ivanov, Stephen Cradock
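
As a rough illustration of what "confusable" commands might look like in practice, the sketch below flags command pairs whose phoneme sequences are within a small edit distance of each other. This is not the patented method: the lexicon, the phoneme transcriptions, the `max_distance` threshold, and the function names are all hypothetical, and edit distance is swapped in as a simple stand-in for whatever confusion measure a compressed acoustic model would actually induce.

```python
from itertools import combinations


def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or phoneme lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (x != y)))  # substitution or match
        prev = cur
    return prev[-1]


def confusable_pairs(lexicon, max_distance=1):
    """Return command pairs whose phoneme sequences differ by at most max_distance edits.

    `lexicon` maps a command string to a hypothetical phoneme list.
    """
    items = sorted(lexicon.items())
    return [(c1, c2)
            for (c1, p1), (c2, p2) in combinations(items, 2)
            if edit_distance(p1, p2) <= max_distance]


# Hypothetical command list with made-up phoneme transcriptions.
lexicon = {
    "lights on": ["L", "AY", "T", "S", "AA", "N"],
    "lights off": ["L", "AY", "T", "S", "AO", "F"],
    "play": ["P", "L", "EY"],
}
pairs = confusable_pairs(lexicon, max_distance=2)
```

A flagged pair would then be a candidate for replacement by a semantically equivalent but phonetically different command, as the abstract describes.
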
METHODS FOR SYNTHESIS-BASED CLEAR HEARING UNDER NOISY CONDITIONS

Publication number: 20230217194

Abstract: This invention provides a new and improved hearing aid system with high quality noise cancellation method and devices to overcome the limitations and difficulties encountered in conventional technologies. The technical limitations of the noise uncertainty and speech distortion in the hearing aid field are resolved by restoration of the high-quality speech by converting the speech content into an intermediate linguistic representation and by synthesizing the speech of the same speaker using pre-trained artificial intelligence (AI) modules. In this invention, the noise uncertainties are circumvented by focusing on the target speaker or picking up the dominant speech by choosing the corresponding setting assuming the speech from the target speaker is the dominant speech based on the Lombard effect.

Type: Application

Filed: December 9, 2022

Publication date: July 6, 2023

Inventor: Fuliang Weng
Active activity planning system and method for supporting temporal constraints

Patent number: 11514338

Abstract: An activity planning system comprises a knowledge base, a query processor, and a temporal reasoner. A query including temporal constraints is input into the query processor. The query processor converts the query into a formal representation. The formal representation is a formal graphical semantic representation grounded on an ontology defined in the knowledge base. The temporal reasoner processes the query representation output by the query processor against the knowledge base, which defines a set of objects. For each object, the temporal reasoner produces a normalized score from 0 to 1 to indicate the degree of how likely the object satisfies the temporal constraints imposed by the query.

Type: Grant

Filed: June 27, 2017

Date of Patent: November 29, 2022

Assignee: Robert Bosch GmbH

Inventors: Doo Soon Kim, Fuliang Weng
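
One simple way to picture a normalized score from 0 to 1 for a temporal constraint is the fraction of the requested time window that an object's availability covers. The sketch below is an assumption made for illustration only: the interval representation, the overlap-ratio formula, and the name `overlap_score` do not come from the patent, which does not specify its scoring in this abstract.

```python
def overlap_score(query, available):
    """Fraction of the query window covered by the object's availability window.

    Both arguments are (start, end) pairs in the same unit (hours here).
    Returns a value between 0.0 (no overlap) and 1.0 (fully covered).
    """
    q_start, q_end = query
    a_start, a_end = available
    if q_end <= q_start:
        raise ValueError("query window must have positive length")
    overlap = max(0.0, min(q_end, a_end) - max(q_start, a_start))
    return overlap / (q_end - q_start)


# Hypothetical query: "somewhere open between 18:00 and 20:00".
full = overlap_score((18, 20), (17, 22))     # open for the whole window
partial = overlap_score((18, 20), (19, 23))  # open for the second hour only
none = overlap_score((18, 20), (9, 12))      # closed during the window
```
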
METHODS FOR CLEAR CALL UNDER NOISY CONDITIONS

Publication number: 20220180886

Abstract: This invention provides a new and improved voice communication system with high quality noise cancellation method and devices to overcome the limitations and difficulties encountered in conventional technologies. This invention discloses a noise cancellation apparatus that includes a vibration sensor and a microphone for receiving and transmitting voice signals as incoming speeches. The vibration sensor is applied to receive vibration signals corresponding to the voice signals for applying the vibration signals as reference signals for removing noise signals generated from environmental noises by converting vibration signals to intermediate PDL representation together with the speaker characteristics, mapping them into full band high quality clean acoustic representation, and synthesizing clear personal speech with characteristics identical to the original microphone speech without noises.

Type: Application

Filed: December 7, 2021

Publication date: June 9, 2022

Inventor: Fuliang Weng
Speaker identification with ultra-short speech segments for far and near field voice assistance applications

Patent number: 11295748

Abstract: A speaker recognition device includes a memory, and a processor. The memory stores enrolled key phrase data corresponding to utterances of a key phrase by enrolled users, and text-dependent and text-independent acoustic speaker models of the enrolled users. The processor is operatively connected to the memory, and executes instructions to authenticate a speaker as an enrolled user, which includes detecting input key phrase data corresponding to a key phrase uttered by the speaker, computing text-dependent and text-independent scores for the speaker using speech models of the enrolled user, computing a confidence score, and authenticating or rejecting the speaker as the enrolled user based on whether the confidence score indicates that the input key phrase data corresponds to the speech from the enrolled user.

Type: Grant

Filed: December 14, 2018

Date of Patent: April 5, 2022

Assignee: Robert Bosch GmbH

Inventors: Zhongnan Shen, Fuliang Weng, Gengyan Bei, Pongtep Angkititrakul
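
The accept-or-reject step in the abstract can be pictured as fusing the two scores into one confidence value and comparing it with a threshold. The sketch below assumes a weighted average for the fusion; the weight, the threshold, the 0-to-1 score range, and all names are hypothetical, since the abstract does not say how the confidence score is computed.

```python
from dataclasses import dataclass


@dataclass
class SpeakerScores:
    text_dependent: float    # score from the key-phrase (text-dependent) model
    text_independent: float  # score from the text-independent model


def confidence(scores, weight_td=0.6):
    """Weighted average of the two scores (an assumed fusion rule)."""
    return (weight_td * scores.text_dependent
            + (1.0 - weight_td) * scores.text_independent)


def authenticate(scores, threshold=0.5, weight_td=0.6):
    """Accept the speaker as the enrolled user if the fused score reaches the threshold."""
    return confidence(scores, weight_td) >= threshold


accepted = authenticate(SpeakerScores(text_dependent=0.9, text_independent=0.8))
rejected = authenticate(SpeakerScores(text_dependent=0.2, text_independent=0.3))
```
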
SPEECH RECOGNITION METHOD, APPARATUS, AND DEVICE, AND COMPUTER-READABLE STORAGE MEDIUM

Publication number: 20220093087

Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.

Type: Application

Filed: November 30, 2021

Publication date: March 24, 2022

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Weiran Nie, Fuliang Weng, Youjia Huang, Hai Yu, Shumang Hu
METHODS AND SYSTEMS FOR CONFUSION REDUCTION FOR COMPRESSED ACOUSTIC MODELS

Publication number: 20210375270

Abstract: Methods and systems are disclosed herein for improvements relating to compressed automatic speech recognition (ASR) systems. The ASR system may comprise a compressed acoustic engine and an adaptive decoder. The adaptive decoder may be dynamically compiled based on characteristics of the compressed acoustic engine and a current state of the application device. In some embodiments, a dynamic command list is used to manage context-specific commands. Two or more commands recognized by the adaptive decoder may be confusable due to compression of the ASR system. Alternate commands may be determined that are semantically equivalent but phonetically different than the confusable commands to reduce classification error of the adaptive decoder. An alternate command may replace one or more of the confusable commands in the adaptive decoder. In some embodiments, a user interface is displayed to a user of the ASR system to select the alternate command for replacement in the decoder.

Type: Application

Filed: June 1, 2021

Publication date: December 2, 2021

Applicant: Knowles Electronics, LLC

Inventors: Fuliang Weng, Alexei Ivanov, Stephen Cradock
VOICE RECOGNITION FOR IMPOSTER REJECTION IN WEARABLE DEVICES

Publication number: 20210287674

Abstract: Various methods, systems, and apparatus are disclosed with improved imposter rejection for keyword recognition systems in a wearable device. Speech signals are measured by a microphone and a vibration sensor, the vibration sensor configured to measure vibrations in the body of a wearer of the device. An audio signal from the microphone and a vibration signal from the vibration sensor are input into a classifier to determine whether the wearer of the device spoke the keyword. In some embodiments, high-frequency components of a signal from the microphone may be combined with low-frequency components of a signal from the vibration sensor to generate a combined speech signal. The classifier may use a classification model trained with positive training data of the wearer speaking the keyword and negative training data of a non-wearer speaking the keyword.

Type: Application

Filed: March 15, 2021

Publication date: September 16, 2021

Applicant: Knowles Electronics, LLC

Inventors: Andy Unruh, Wenjing Yang, Bin Jiang, Stephen Cradock, Alexei Ivanov, Fuliang Weng, Scott Choi
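
The idea of combining low-frequency content from the vibration sensor with high-frequency content from the microphone can be sketched with a simple crossover. The code below uses a one-pole low-pass filter as an assumed stand-in for whatever band-splitting the application actually uses; the smoothing factor `alpha` and the function names are hypothetical, and both signals are assumed to be time-aligned lists of samples at the same rate.

```python
def one_pole_lowpass(signal, alpha):
    """Exponential smoothing: a basic low-pass filter with smoothing factor alpha."""
    out = []
    prev = 0.0
    for x in signal:
        prev = prev + alpha * (x - prev)
        out.append(prev)
    return out


def combine_bands(mic, vibration, alpha=0.2):
    """Low band from the vibration sensor plus high band from the microphone.

    The microphone's high band is taken as the signal minus its own low-pass output.
    """
    if len(mic) != len(vibration):
        raise ValueError("signals must have the same length")
    vib_low = one_pole_lowpass(vibration, alpha)
    mic_low = one_pole_lowpass(mic, alpha)
    return [v + (m - ml) for m, ml, v in zip(mic, mic_low, vib_low)]


# A silent microphone with a constant vibration signal: only the low band passes.
combined = combine_bands([0.0, 0.0, 0.0], [1.0, 1.0, 1.0])
```

When the two inputs are identical, the two bands sum back to the original signal, which is a quick sanity check for the crossover.
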
ADAPTIVE DECODER FOR HIGHLY COMPRESSED GRAPHEME MODEL

Publication number: 20210210109

Abstract: Systems, methods, and apparatuses are disclosed herein for automatic speech recognition (ASR) in devices with limited memory or power constraints. An ASR system may have an acoustic engine and a decoder to identify a spoken command from an input audio stream. A dynamic command list may be used to reduce the size of an adapted lexicon used by the decoder, where the dynamic command list is associated with a state of the system. The decoder may be expanded based on labelled speech samples input into a compressed acoustic model of the ASR system. Speech samples may be collected and integrated to be user-specific.

Type: Application

Filed: December 26, 2020

Publication date: July 8, 2021

Inventors: Fuliang Weng, Alexei Ivanov
Speaker Identification with Ultra-Short Speech Segments for Far and Near Field Voice Assistance Applications

Publication number: 20200152206

Abstract: A speaker recognition device includes a memory, and a processor. The memory stores enrolled key phrase data corresponding to utterances of a key phrase by enrolled users, and text-dependent and text-independent acoustic speaker models of the enrolled users. The processor is operatively connected to the memory, and executes instructions to authenticate a speaker as an enrolled user, which includes detecting input key phrase data corresponding to a key phrase uttered by the speaker, computing text-dependent and text-independent scores for the speaker using speech models of the enrolled user, computing a confidence score, and authenticating or rejecting the speaker as the enrolled user based on whether the confidence score indicates that the input key phrase data corresponds to the speech from the enrolled user.

Type: Application

Filed: December 14, 2018

Publication date: May 14, 2020

Inventors: Zhongnan Shen, Fuliang Weng, Gengyan Bei, Pongtep Angkititrakul
Student Learning Guidance Platform- eGPS

Publication number: 20200104960

Abstract: A student learning guidance platform that includes a functional module for understanding a problem and formulating solutions for the given problem, thus formulating and providing possible paths from the given problem to the solutions; a functional module for monitoring how a student solves the problem when the student processes from a first state to a succeeding state in a problem space; a functional module for figuring out whether there is a gap between the student and a required and finding a reason why; a functional module for providing assistance where necessary and further providing a process for a competent teacher to participate in assistance when necessary; and a functional module for building a student model and recommending necessary steps for the student to improve.

Type: Application

Filed: October 1, 2019

Publication date: April 2, 2020

Inventor: Fuliang Weng
System and method for speech-enabled personalized operation of devices and services in multiple operating environments

Patent number: 10410630

Abstract: A system provides multi-modal user interaction. The system is configured to detect acoustic events to perform context-sensitive personalized conversations with the speaker. Conversation or communication among the speakers or devices is categorized into different classes as confidential, partially anonymous, or public. When exchange with cloud infrastructure is needed, a clear indicator is presented to the speaker via one or more modalities. Furthermore, different dialog strategies are employed in situations where conversation failures, such as misunderstanding, wrong expectation, emotional stress, or memory deficiencies, occur.

Type: Grant

Filed: June 19, 2015

Date of Patent: September 10, 2019

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Katrin Schulze, Zhongnan Shen, Pongtep Angkititrakul, Gengyan Bei, Xiao Xiong
An Active Activity Planning System and Method for Supporting Temporal Constraints

Publication number: 20190197417

Abstract: An activity planning system comprises a knowledge base, a query processor, and a temporal reasoner. A query including temporal constraints is input into the query processor. The query processor converts the query into a formal representation. The formal representation is a formal graphical semantic representation grounded on an ontology defined in the knowledge base. The temporal reasoner processes the query representation output by the query processor against the knowledge base, which defines a set of objects. For each object, the temporal reasoner produces a normalized score from 0 to 1 to indicate the degree of how likely the object satisfies the temporal constraints imposed by the query.

Type: Application

Filed: June 27, 2017

Publication date: June 27, 2019

Applicant: Robert Bosch GmbH

Inventors: Doo Soon Kim, Fuliang Weng
Method and system for automation of response selection and composition in dialog systems

Patent number: 10311869

Abstract: A dialog system includes a processor. The system can further include a dialog manager. The dialog manager can be configured to receive input from a user using the processor. The system can further include a user category classification and detection module, which is configured to identify categories for the user from the received input. The system can further include a user mood detection and tracking module configured to identify a mood of the user. The system can further include a user physical and mind state and energy level detection module configured to identify a mental status of the user. The system can further include a user acquaintance module configured to identify an acquaintance status of the user. The system can further include user personality detection and tracking module configured to identify a personality status of the user. The system can further include a conversational context detection and response generation module.

Type: Grant

Filed: October 21, 2015

Date of Patent: June 4, 2019

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Zhongnan Shen
System and method for event summarization using observer social media messages

Patent number: 10224025

Abstract: A method for processing messages pertaining to an event includes receiving a plurality of messages pertaining to the event from electronic communication devices associated with a plurality of observers of the event, generating a first message stream that includes only a portion of the plurality of messages corresponding to a first participant in the event, identifying a first sub-event in the first message stream with reference to a time distribution of messages and content distribution of messages in the first message stream, generating a sub-event summary with reference to a portion of the plurality of messages in the first message stream that are associated with the first sub-event, and transmitting the sub-event summary to a plurality of electronic communication devices associated with a plurality of users who are not observers of the event.

Type: Grant

Filed: December 13, 2013

Date of Patent: March 5, 2019

Assignee: Robert Bosch GmbH

Inventors: Fei Liu, Fuliang Weng, Chao Shen, Lin Zhao
System and method for dialog-enabled context-dependent and user-centric content presentation

Patent number: 10209853

Abstract: An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.

Type: Grant

Filed: December 11, 2014

Date of Patent: February 19, 2019

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Kui Xu, Fei Liu, Lin Zhao, Zhe Feng, Zhongnan Shen
METHOD AND SYSTEM FOR AUTOMATION OF RESPONSE SELECTION AND COMPOSITION IN DIALOG SYSTEMS

Publication number: 20180240459

Abstract: A dialog system includes a processor. The system can further include a dialog manager. The dialog manager can be configured to receive input from a user using the processor. The system can further include a user category classification and detection module, which is configured to identify categories for the user from the received input. The system can further include a user mood detection and tracking module configured to identify a mood of the user. The system can further include a user physical and mind state and energy level detection module configured to identify a mental status of the user. The system can further include a user acquaintance module configured to identify an acquaintance status of the user. The system can further include user personality detection and tracking module configured to identify a personality status of the user. The system can further include a conversational context detection and response generation module.

Type: Application

Filed: October 21, 2015

Publication date: August 23, 2018

Inventors: Fuliang Weng, Zhongnan Shen
System and method of conversational assistance in an interactive information system

Patent number: 9667742

Abstract: A method of providing information assistance services includes generating a plurality of service requests for a plurality of request elements that are generated from a single client request received by a processor. The service requests are sent to both software application service providers that are executed by the processor and remote service providers that are connected to the local processor through a data network. The processor receives a plurality of service responses from the service providers, generating at least one output message element corresponding to the service responses, and sending the output message data to at least one output device that is operatively connected to the processor to produce a response to the client request.

Type: Grant

Filed: July 12, 2013

Date of Patent: May 30, 2017

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Zhongnan Shen, Zhe Feng, Kui Xu
