Patents by Inventor Fuliang Weng
Fuliang Weng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12087289
Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.
Type: Grant
Filed: November 30, 2021
Date of Patent: September 10, 2024
Assignee: HUAWEI TECHNOLOGIES CO., LTD.
Inventors: Weiran Nie, Fuliang Weng, Youjia Huang, Hai Yu, Shumang Hu
-
Patent number: 12067978
Abstract: Methods and systems are disclosed herein for improvements relating to compressed automatic speech recognition (ASR) systems. The ASR system may comprise a compressed acoustic engine and an adaptive decoder. The adaptive decoder may be dynamically compiled based on characteristics of the compressed acoustic engine and a current state of the application device. In some embodiments, a dynamic command list is used to manage context-specific commands. Two or more commands recognized by the adaptive decoder may be confusable due to compression of the ASR system. Alternate commands may be determined that are semantically equivalent but phonetically different than the confusable commands to reduce classification error of the adaptive decoder. An alternate command may replace one or more of the confusable commands in the adaptive decoder. In some embodiments, a user interface is displayed to a user of the ASR system to select the alternate command for replacement in the decoder.
Type: Grant
Filed: June 1, 2021
Date of Patent: August 20, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: Fuliang Weng, Alexei Ivanov, Stephen Cradock
-
Publication number: 20230217194
Abstract: This invention provides a new and improved hearing aid system with high quality noise cancellation method and devices to overcome the limitations and difficulties encountered in conventional technologies. The technical limitations of the noise uncertainty and speech distortion in the hearing aid field are resolved by restoration of the high-quality speech by converting the speech content into an intermediate linguistic representation and by synthesizing the speech of the same speaker with pre-trained using artificial intelligence (AI) modules. In this invention, the noise uncertainties are circumvented by focusing on the target speaker or picking up the dominant speech by choosing the corresponding setting assuming the speech from the target speaker is the dominant speech based on the Lombard effect.
Type: Application
Filed: December 9, 2022
Publication date: July 6, 2023
Inventor: Fuliang Weng
-
Patent number: 11514338
Abstract: An activity planning system comprises a knowledge base, a query processor, and a temporal reasoner. A query including temporal constraints is input into the query processor. The query processor converts the query into a formal representation. The formal representation is a formal graphical semantic representation grounded on an ontology defined in the knowledge base. The temporal reasoner processes the query representation output by the query processor against the knowledge base which defines a set of object. For each object, the temporal reasoner produces a normalized score from 0 to 1 to indicate the degree of how likely the object satisfies the temporal constraints imposed by the query.
Type: Grant
Filed: June 27, 2017
Date of Patent: November 29, 2022
Assignee: Robert Bosch GmbH
Inventors: Doo Soon Kim, Fuliang Weng
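Illustration (not taken from the patent): the abstract above says the temporal reasoner gives each object a normalized score from 0 to 1, but it does not say how the score is computed. One simple way such a score could be defined, assuming the query names a time window and each object has a single availability window, is the fraction of the query window that the object covers. The function name, the minutes-since-midnight encoding, and the overlap rule are all assumptions made for this sketch.

```python
def temporal_score(query_start: int, query_end: int,
                   open_start: int, open_end: int) -> float:
    """Return the fraction of the query window covered by the object's window.

    All times are minutes since midnight (a hypothetical encoding).
    The result is 0.0 for no overlap and 1.0 for full coverage.
    """
    if query_end <= query_start:
        raise ValueError("query window must have positive length")
    # Length of the intersection of the two windows, clamped at zero.
    overlap = max(0, min(query_end, open_end) - max(query_start, open_start))
    return overlap / (query_end - query_start)


# A query for 10:00-12:00 against an object available 11:00-15:00.
score = temporal_score(600, 720, 660, 900)
```

A graded score of this kind lets a planner rank partially matching objects instead of discarding them outright.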
-
Publication number: 20220180886
Abstract: This invention provides a new and improved voice communication system with high quality noise cancellation method and devices to overcome the limitations and difficulties encountered in conventional technologies. This invention discloses a noise cancellation apparatus that includes a vibration sensor and a microphone for receiving and transmitting voice signals as incoming speeches. The vibration sensor is applied to receive vibration signals corresponding to the voice signals for applying the vibration signals as reference signals for removing noise signals generated from environmental noises by converting vibration signals to intermediate PDL representation together with the speaker characteristics, mapping them into full band high quality clean acoustic representation, and synthesizing clear personal speech with characteristics identical to the original microphone speech without noises.
Type: Application
Filed: December 7, 2021
Publication date: June 9, 2022
Inventor: Fuliang Weng
-
Patent number: 11295748
Abstract: A speaker recognition device includes a memory, and a processor. The memory stores enrolled key phrase data corresponding to utterances of a key phrase by enrolled users, and text-dependent and text-independent acoustic speaker models of the enrolled users. The processor is operatively connected to the memory, and executes instructions to authenticate a speaker as an enrolled user, which includes detecting input key phrase data corresponding to a key phrase uttered by the speaker, computing text-dependent and text-independent scores for the speaker using speech models of the enrolled user, computing a confidence score, and authenticating or rejecting the speaker as the enrolled user based on whether the confidence score indicates that the input key phrase data corresponds to the speech from the enrolled user.
Type: Grant
Filed: December 14, 2018
Date of Patent: April 5, 2022
Assignee: Robert Bosch GmbH
Inventors: Zhongnan Shen, Fuliang Weng, Gengyan Bei, Pongtep Angkititrakul
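Illustration (not taken from the patent): the abstract above describes computing text-dependent and text-independent scores, deriving a confidence score, and then accepting or rejecting the speaker. The abstract does not state how the two scores are combined. The sketch below assumes one common choice, a weighted sum passed through a logistic function, with a hypothetical weight and threshold.

```python
import math


def fuse_scores(td_score: float, ti_score: float,
                w_td: float = 0.6, bias: float = 0.0) -> float:
    """Map two verification scores to a confidence in (0, 1).

    td_score and ti_score are assumed to be log-likelihood-ratio style
    scores (positive favors the enrolled user). The weight w_td and the
    bias are hypothetical tuning parameters.
    """
    z = w_td * td_score + (1.0 - w_td) * ti_score + bias
    return 1.0 / (1.0 + math.exp(-z))


def authenticate(td_score: float, ti_score: float,
                 threshold: float = 0.5) -> bool:
    """Accept the speaker when the fused confidence reaches the threshold."""
    return fuse_scores(td_score, ti_score) >= threshold


accepted = authenticate(2.0, 1.0)    # both scores favor the enrolled user
rejected = authenticate(-2.0, -1.0)  # both scores argue against
```

In practice the weight and threshold would be tuned on held-out enrollment data to balance false accepts against false rejects.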
-
Publication number: 20220093087
Abstract: A speech recognition method, apparatus, and device, and a computer-readable storage medium provided pertain to the field of artificial intelligence technologies. The method includes: obtaining or generating a dynamic target language model based on reply information of a first intent, where the dynamic target language model includes a front-end part and a core part; obtaining a speech signal, parsing the speech signal to generate a key word; and invoking the dynamic target language model to determine a second intent and a service content. The front-end part of the dynamic target language model parses out the second intent based on the key word, and the core part of the dynamic target language model parses out the service content based on the key word. The speech recognition method prevents a provided service content from deviating from a user requirement and achieves a good recognition effect.
Type: Application
Filed: November 30, 2021
Publication date: March 24, 2022
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventors: Weiran Nie, Fuliang Weng, Youjia Huang, Hai Yu, Shumang Hu
-
Publication number: 20210375270
Abstract: Methods and systems are disclosed herein for improvements relating to compressed automatic speech recognition (ASR) systems. The ASR system may comprise a compressed acoustic engine and an adaptive decoder. The adaptive decoder may be dynamically compiled based on characteristics of the compressed acoustic engine and a current state of the application device. In some embodiments, a dynamic command list is used to manage context-specific commands. Two or more commands recognized by the adaptive decoder may be confusable due to compression of the ASR system. Alternate commands may be determined that are semantically equivalent but phonetically different than the confusable commands to reduce classification error of the adaptive decoder. An alternate command may replace one or more of the confusable commands in the adaptive decoder. In some embodiments, a user interface is displayed to a user of the ASR system to select the alternate command for replacement in the decoder.
Type: Application
Filed: June 1, 2021
Publication date: December 2, 2021
Applicant: Knowles Electronics, LLC
Inventors: Fuliang Weng, Alexei Ivanov, Stephen Cradock
-
Publication number: 20210287674
Abstract: Various methods, systems, and apparatus are disclosed with improved imposter rejection for keyword recognition systems in a wearable device. Speech signals are measured by a microphone and a vibration sensor, the vibration sensor configured to measure vibrations in the body of a wearer of the device. An audio signal from the microphone and a vibration signal from the vibration sensor are input into a classifier to determine whether the wearer of the device spoke the keyword. In some embodiments, high-frequency components of a signal from the microphone may be combined with low-frequency components of a signal from the vibration sensor to generate a combined speech signal. The classifier may use a classification model trained with positive training data of the wearer speaking the keyword and negative training data of a non-wearer speaking the keyword.
Type: Application
Filed: March 15, 2021
Publication date: September 16, 2021
Applicant: Knowles Electronics, LLC
Inventors: Andy Unruh, Wenjing Yang, Bin Jiang, Stephen Cradock, Alexei Ivanov, Fuliang Weng, Scott Choi
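Illustration (not taken from the application): the abstract above mentions combining high-frequency components of the microphone signal with low-frequency components of the vibration-sensor signal. The filters used are not stated. The sketch below assumes a simple complementary pair: a one-pole low-pass filter for the vibration signal, and the matching high-pass (signal minus its own low-pass) for the microphone signal. The function names and the smoothing factor `alpha` are hypothetical.

```python
from typing import List, Sequence


def one_pole_lowpass(x: Sequence[float], alpha: float) -> List[float]:
    """Exponential smoothing: y[n] = y[n-1] + alpha * (x[n] - y[n-1])."""
    y: List[float] = []
    prev = 0.0
    for sample in x:
        prev = prev + alpha * (sample - prev)
        y.append(prev)
    return y


def combine_signals(mic: Sequence[float], vib: Sequence[float],
                    alpha: float = 0.2) -> List[float]:
    """Add the low band of the vibration signal to the high band of the mic.

    Both inputs are assumed to be time-aligned and sampled at the same rate.
    """
    if len(mic) != len(vib):
        raise ValueError("signals must have the same length")
    vib_low = one_pole_lowpass(vib, alpha)
    mic_low = one_pole_lowpass(mic, alpha)
    # High-pass the microphone by removing its own low band.
    mic_high = [m - l for m, l in zip(mic, mic_low)]
    return [l + h for l, h in zip(vib_low, mic_high)]


combined = combine_signals([0.0, 1.0, 0.0, -1.0], [0.5, 0.5, 0.5, 0.5])
```

Because the two filters are complementary, feeding the same signal to both inputs returns that signal unchanged, which is a convenient sanity check for the crossover.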
-
Publication number: 20210210109
Abstract: Systems, methods, and apparatuses are disclosed herein for automatic speech recognition (ASR) in devices with limited memory or power constraints. An ASR system may have an acoustic engine and a decoder to identify a spoken command from an input audio stream. A dynamic command list may be used to reduce the size of an adapted lexicon used by the decoder, where the dynamic command list is associated with a state of the system. The decoder may be expanded based on labelled speech samples input into a compressed acoustic model of the ASR system. Speech samples may be collected and integrated to be user-specific.
Type: Application
Filed: December 26, 2020
Publication date: July 8, 2021
Inventors: Fuliang Weng, Alexei Ivanov
-
Publication number: 20200152206
Abstract: A speaker recognition device includes a memory, and a processor. The memory stores enrolled key phrase data corresponding to utterances of a key phrase by enrolled users, and text-dependent and text-independent acoustic speaker models of the enrolled users. The processor is operatively connected to the memory, and executes instructions to authenticate a speaker as an enrolled user, which includes detecting input key phrase data corresponding to a key phrase uttered by the speaker, computing text-dependent and text-independent scores for the speaker using speech models of the enrolled user, computing a confidence score, and authenticating or rejecting the speaker as the enrolled user based on whether the confidence score indicates that the input key phrase data corresponds to the speech from the enrolled user.
Type: Application
Filed: December 14, 2018
Publication date: May 14, 2020
Inventors: Zhongnan Shen, Fuliang Weng, Gengyan Bei, Pongtep Angkititrakul
-
Publication number: 20200104960
Abstract: A student learning guidance platform that includes a functional module for understanding a problem and formulating solutions for the given problem thus formulating and providing possible paths from the given problem to the solutions; a functional module for monitoring how a student solves the problem when the student processes from a first state to a succeeding state in a problem space; a functional module for figuring out whether there is a gap between the student and a required and finding a reason why; a functional module for providing assistance where necessary and further providing a process for a competent teacher to participate assistance when necessary; and a functional module for building a student model and recommending necessary steps for the student to improve.
Type: Application
Filed: October 1, 2019
Publication date: April 2, 2020
Inventor: Fuliang Weng
-
Patent number: 10410630
Abstract: A system provides multi-modal user interaction. The system is configured to detect acoustic events to perform context-sensitive personalized conversations with the speaker. Conversation or communication among the speakers or devices is categorized into different classes as confidential, partially anonymous, or public. When exchange with cloud infrastructure is needed, a clear indicator is presented to the speaker via one or more modalities. Furthermore, different dialog strategies are employed in situations where conversation failures, such as misunderstanding, wrong expectation, emotional stress, or memory deficiencies, occur.
Type: Grant
Filed: June 19, 2015
Date of Patent: September 10, 2019
Assignee: Robert Bosch GmbH
Inventors: Fuliang Weng, Katrin Schulze, Zhongnan Shen, Pongtep Angkititrakul, Gengyan Bei, Xiao Xiong
-
Publication number: 20190197417
Abstract: An activity planning system comprises a knowledge base, a query processor, and a temporal reasoner. A query including temporal constraints is input into the query processor. The query processor converts the query into a formal representation. The formal representation is a formal graphical semantic representation grounded on an ontology defined in the knowledge base. The temporal reasoner processes the query representation output by the query processor against the knowledge base which defines a set of object. For each object, the temporal reasoner produces a normalized score from 0 to 1 to indicate the degree of how likely the object satisfies the temporal constraints imposed by the query.
Type: Application
Filed: June 27, 2017
Publication date: June 27, 2019
Applicant: Robert Bosch GmbH
Inventors: Doo Soon Kim, Fuliang Weng
-
Patent number: 10311869
Abstract: A dialog system includes a processor. The system can further include a dialog manager. The dialog manager can be configured to receive input from a user using the processor. The system can further include a user category classification and detection module, which is configured to identify categories for the user from the received input. The system can further include a user mood detection and tracking module configured to identify a mood of the user. The system can further include a user physical and mind state and energy level detection module configured to identify a mental status of the user. The system can further include a user acquaintance module configured to identify an acquaintance status of the user. The system can further include user personality detection and tracking module configured to identify a personality status of the user. The system can further include a conversational context detection and response generation module.
Type: Grant
Filed: October 21, 2015
Date of Patent: June 4, 2019
Assignee: Robert Bosch GmbH
Inventors: Fuliang Weng, Zhongnan Shen
-
Patent number: 10224025
Abstract: A method for processing messages pertaining to an event includes receiving a plurality of messages pertaining to the event from electronic communication devices associated with a plurality of observers of the event, generating a first message stream that includes only a portion of the plurality of messages corresponding to a first participant in the event, identifying a first sub-event in the first message stream with reference to a time distribution of messages and content distribution of messages in the first message stream, generating a sub-event summary with reference to a portion of the plurality of messages in the first message stream that are associated with the first sub-event, and transmitting the sub-event summary to a plurality of electronic communication devices associated with a plurality of users who are not observers of the event.
Type: Grant
Filed: December 13, 2013
Date of Patent: March 5, 2019
Assignee: Robert Bosch GmbH
Inventors: Fei Liu, Fuliang Weng, Chao Shen, Lin Zhao
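Illustration (not taken from the patent): the abstract above identifies sub-events from the time distribution and the content distribution of messages in a stream. The sketch below covers only the time-distribution half, under the assumption that a sub-event shows up as a burst: a time bin whose message count is well above the stream's average. The bin size, the threshold rule (mean plus `k` standard deviations), and the function name are hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev
from typing import List, Sequence


def detect_bursts(timestamps: Sequence[float], bin_seconds: int = 60,
                  k: float = 2.0) -> List[int]:
    """Return indices of time bins whose message count is unusually high.

    timestamps are message arrival times in seconds. A bin is flagged when
    its count exceeds mean + k * (population standard deviation), computed
    over all bins between the first and last message.
    """
    if not timestamps:
        return []
    counts = Counter(int(t // bin_seconds) for t in timestamps)
    first, last = min(counts), max(counts)
    # Include empty bins so quiet periods pull the average down.
    series = [counts.get(b, 0) for b in range(first, last + 1)]
    threshold = mean(series) + k * pstdev(series)
    return [first + i for i, c in enumerate(series) if c > threshold]


# One message per minute for ten minutes, plus a burst in minute 5.
stream = [b * 60 + 1 for b in range(10)] + [300 + i for i in range(2, 21)]
bursts = detect_bursts(stream)
```

A fuller treatment would also compare the wording of messages inside each flagged bin, since the abstract names content distribution as a second signal.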
-
Patent number: 10209853
Abstract: An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
Type: Grant
Filed: December 11, 2014
Date of Patent: February 19, 2019
Assignee: Robert Bosch GmbH
Inventors: Fuliang Weng, Kui Xu, Fei Liu, Lin Zhao, Zhe Feng, Zhongnan Shen
-
Publication number: 20180240459
Abstract: A dialog system includes a processor. The system can further include a dialog manager. The dialog manager can be configured to receive input from a user using the processor. The system can further include a user category classification and detection module, which is configured to identify categories for the user from the received input. The system can further include a user mood detection and tracking module configured to identify a mood of the user. The system can further include a user physical and mind state and energy level detection module configured to identify a mental status of the user. The system can further include a user acquaintance module configured to identify an acquaintance status of the user. The system can further include user personality detection and tracking module configured to identify a personality status of the user. The system can further include a conversational context detection and response generation module.
Type: Application
Filed: October 21, 2015
Publication date: August 23, 2018
Inventors: Fuliang Weng, Zhongnan Shen
-
Patent number: 9667742
Abstract: A method of providing information assistance services includes generating a plurality of service requests for a plurality of request elements that are generated from a single client request received by a processor. The service requests are sent to both software application service providers that are executed by the processor and remote service providers that are connected to the local processor through a data network. The processor receives a plurality of service responses from the service providers, generating at least one output message element corresponding to the service responses, and sending the output message data to at least one output device that is operatively connected to the processor to produce a response to the client request.
Type: Grant
Filed: July 12, 2013
Date of Patent: May 30, 2017
Assignee: Robert Bosch GmbH
Inventors: Fuliang Weng, Zhongnan Shen, Zhe Feng, Kui Xu
-
Patent number: 9656690
Abstract: A method of providing parking assistance in a vehicle includes identifying with a controller in a vehicle a plurality of available parking spaces for the vehicle, generating with a video output device operatively connected to the controller an interface with a graphical depiction of the vehicle and the plurality of available parking spaces, receiving a first input gesture with a gesture input device to select one parking space from the plurality of available parking spaces, and operating the vehicle to park the vehicle in the one parking space using the controller configured with a parking assistance service in the vehicle.
Type: Grant
Filed: October 29, 2013
Date of Patent: May 23, 2017
Assignee: Robert Bosch GmbH
Inventors: Zhongnan Shen, Fuliang Weng, Benno Albrecht