Patents Examined by Michael Colucci
  • Patent number: 11532307
    Abstract: The present disclosure discloses an image processing device including: a receiving module configured to receive a voice signal and an image to be processed; a conversion module configured to convert the voice signal into an image processing instruction and determine a target area according to a target voice instruction conversion model, in which the target area is a processing area of the image to be processed; and a processing module configured to process the target area according to the image processing instruction and a target image processing model. The examples may realize the functionality of using voice commands to control image processing, which may save users' time spent in learning image processing software prior to image processing, and improve user experience.
    Type: Grant
    Filed: September 29, 2018
    Date of Patent: December 20, 2022
    Assignee: SHANGHAI CAMBRICON INFORMATION TECHNOLOGY CO., LTD
    Inventors: Tianshi Chen, Shuai Hu, Xiaobing Chen
  • Patent number: 11528568
    Abstract: A device and method for improving hearing devices by using computer recognition of words and substituting either computer generated words or pre-recorded words in streaming conversation received from a distant speaker. The system may operate in multiple modes such as a first mode being amplification and conditioning of the voice sounds; a second mode having said microphone pick up the voice sounds from a speaker, a processor configured to convert voice sounds to discrete words corresponding to words spoken by said speaker, generating a synthesized voice speaking said words and outputting said synthesized voice to said sound reproducing element, which is hearable by the user. Other modes include translation of foreign languages into a user's ear and using a heads up display to project the text version of words which the computer had deciphered or translated. The system may be triggered by eye movement, spoken command, hand movement or similar.
    Type: Grant
    Filed: August 28, 2020
    Date of Patent: December 13, 2022
    Assignee: GN HEARING A/S
    Inventor: Michael B. Lasky
  • Patent number: 11520982
    Abstract: A method may include generating, based on a context-free grammar, a sample forming a corpus. The context-free grammar may include production rules for replacing a first nonterminal symbol with a second nonterminal symbol and/or a terminal symbol. The sample may be generated by rewriting recursively a first text string to form a second text string associated with the sample. The first text string may be rewritten by applying the production rules to replace nonterminal symbols included in the first text string until no nonterminal symbols remain in the first text string. A machine learning model may be trained, based on the corpus, to process a natural language. Related methods and articles of manufacture are also disclosed.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: December 6, 2022
    Assignee: SAP SE
    Inventors: Keguo Zhou, Jiyuan Zhan, Liangqi Xiong
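The rewriting procedure described in the abstract above (patent 11520982) can be illustrated with a minimal sketch. The toy grammar, the angle-bracket notation for nonterminals, and the names `GRAMMAR`, `generate_sample`, and `generate_corpus` are all hypothetical and not taken from the patent; the loop below rewrites iteratively rather than recursively, and the model-training step is not shown.

```python
import random

# Hypothetical toy grammar (an assumption for illustration only).
# Keys are nonterminal symbols; each value lists the alternative
# right-hand sides of that nonterminal's production rules.
GRAMMAR = {
    "<request>": [["<verb>", "the", "<object>"]],
    "<verb>": [["open"], ["close"]],
    "<object>": [["report"], ["<adjective>", "report"]],
    "<adjective>": [["monthly"], ["annual"]],
}


def generate_sample(start, grammar, rng):
    """Rewrite the symbol list until no nonterminal symbols remain."""
    symbols = [start]
    while any(symbol in grammar for symbol in symbols):
        rewritten = []
        for symbol in symbols:
            if symbol in grammar:
                # Replace the nonterminal with one randomly chosen production.
                rewritten.extend(rng.choice(grammar[symbol]))
            else:
                # Terminal symbols are copied through unchanged.
                rewritten.append(symbol)
        symbols = rewritten
    return " ".join(symbols)


def generate_corpus(size, seed=0):
    """Build a list of generated samples that could serve as training text."""
    rng = random.Random(seed)
    return [generate_sample("<request>", GRAMMAR, rng) for _ in range(size)]


if __name__ == "__main__":
    for sentence in generate_corpus(5):
        print(sentence)
```

The sketch terminates only because this toy grammar has no recursive productions; a grammar with recursion would need a depth limit or weighted rule selection.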
  • Patent number: 11514889
    Abstract: A device and a method for clarifying dysarthria voices are disclosed. First, a dysarthria voice signal is received and framed to generate dysarthria frames. Then, the dysarthria frames are received to retrieve dysarthria features. Finally, the dysarthria features are received and, without receiving the phases corresponding to the dysarthria features, are converted into an intelligent voice signal based on an intelligent voice conversion model. The intelligent voice conversion model is not trained using dynamic time warping (DTW). The present invention avoids the phase distortion of the voice signal and provides more natural and clarified voices with low noise.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: November 29, 2022
    Assignee: NATIONAL CHUNG CHENG UNIVERSITY
    Inventors: Tay-Jyi Lin, Che Chia Pai, Hsi Che Wang, Ching-Wei Yeh
  • Patent number: 11514330
    Abstract: Methods and systems are provided for a natural language processing system comprising a chatbot adapted for dialog generation. In one example, the system may include a combination of a variational autoencoder (VAE) and a generative adversarial network (GAN) for generating natural responses to input queries. The VAE may convert queries into vector embeddings that may then be used by the GAN to continuously update and improve responses provided by the chatbot.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: November 29, 2022
    Assignee: Cambia Health Solutions, Inc.
    Inventors: Weicheng Ma, Kai Cao, Bei Pan, Lin Chen, Xiang Li
  • Patent number: 11507759
    Abstract: A speech translation device, for conversation between a first speaker making an utterance in a first language and a second speaker making an utterance in a second language different from the first language, includes: a speech detector that detects, from sounds that are input, a speech segment in which the first speaker or the second speaker made an utterance; a display that, after speech recognition is performed on the utterance, displays a translation result obtained by translating the utterance from the first language to the second language or from the second language to the first language; and an utterance instructor that outputs, in the second language via the display, a message prompting the second speaker to make an utterance after a first speaker's utterance or outputs, in the first language via the display, a message prompting the first speaker to make an utterance after a second speaker's utterance.
    Type: Grant
    Filed: March 19, 2020
    Date of Patent: November 22, 2022
    Assignee: PANASONIC HOLDINGS CORPORATION
    Inventors: Hiroki Furukawa, Atsushi Sakaguchi, Tsuyoki Nishikawa
  • Patent number: 11501753
    Abstract: A method includes receiving, from an electronic device, information defining a user utterance associated with a skill to be performed, where the skill is not recognized by a natural language understanding (NLU) engine. The method also includes receiving, from the electronic device, information defining one or more actions for performing the skill. The method further includes identifying, using at least one processor, one or more known skills having one or more slots that map to at least one word or phrase in the user utterance. The method also includes creating, using the at least one processor, a plurality of additional utterances based on the one or more mapped slots. In addition, the method includes training, using the at least one processor, the NLU engine using the plurality of additional utterances.
    Type: Grant
    Filed: December 27, 2019
    Date of Patent: November 15, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Yilin Shen, Avik Ray, Hongxia Jin
  • Patent number: 11488580
    Abstract: It is an aspect of the present disclosure to provide a dialogue system capable of providing an extended function to the user by registering a new vocabulary that matches the user's preference and by changing the pre-stored conversation pattern.
    Type: Grant
    Filed: November 13, 2019
    Date of Patent: November 1, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA CORPORATION
    Inventors: Seona Kim, Jeong-Eom Lee, Dongsoo Shin
  • Patent number: 11488587
    Abstract: Disclosed is a regional-features-based speech recognition method, including learning speech features by region using speech data classified by region category, and recognizing input speech using an acoustic model and a language model generated through classification of a region category for the input speech and the learning. A user may use a dialect recognition service that is improved using learning based on artificial intelligence (AI) and enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine-type communications (mMTC) techniques of 5G mobile communication.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: November 1, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Seonyeong Park
  • Patent number: 11488581
    Abstract: A new approach to automatic speech recognition is disclosed. An example method includes receiving a first text representing speech recognition of a phrase spoken by a user, isolating a candidate named entity from within the phrase, receiving a first phonetic representation of the candidate named entity, comparing the first phonetic representation to phonetic representations in a mapping database which map the phonetic representations to words to yield a comparison, based on the comparison, identifying a second phonetic representation in the mapping database that matches a second text in the mapping database to the second phonetic representation and replacing the candidate named entity with the second text. The approach can be used for new brands for which automatic speech recognition error rates are high.
    Type: Grant
    Filed: December 6, 2019
    Date of Patent: November 1, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Shlomi Chovel, Adriano Devillaine, Omer Shabtai Jakobinsky, Colin Zhen De Kho, Kawshik Karur Rangaraju, Ajay Soni, Yochai Zvik, Yunqiang Zhu
  • Patent number: 11475877
    Abstract: Disclosed are an end-to-end system for speech recognition and speech translation and an electronic device. The system comprises an acoustic encoder, a multi-task decoder, and a semantic invariance constraint module, and performs two tasks: speech recognition and speech translation. In addition, based on the semantic consistency of texts between the different tasks, semantic constraints are imposed on the model so that it learns high-level semantic information, which can effectively improve the performance of speech recognition and speech translation. The application has the following advantages: the error accumulation problem of serial systems is avoided, the calculation cost of the model is low, and the real-time performance is very high.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: October 18, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Shuai Zhang, Jiangyan Yi
  • Patent number: 11450319
    Abstract: The present disclosure discloses an image processing device including: a receiving module configured to receive a voice signal and an image to be processed; a conversion module configured to convert the voice signal into an image processing instruction and determine a target area according to a target voice instruction conversion model, in which the target area is a processing area of the image to be processed; and a processing module configured to process the target area according to the image processing instruction and a target image processing model. The examples may realize the functionality of using voice commands to control image processing, which may save users' time spent in learning image processing software prior to image processing, and improve user experience.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: September 20, 2022
    Inventors: Tianshi Chen, Shuai Hu, Xiaobing Chen
  • Patent number: 11437045
    Abstract: Systems, methods, and computer-readable media can be used to create a virtual assistant. One of the methods includes receiving audio from a conversation between two parties while the conversation is occurring. The method includes generating a partial transcript of the conversation. The method includes identifying topics based on the partial transcript. The method includes presenting a user interface element based on the identified topics.
    Type: Grant
    Filed: October 18, 2018
    Date of Patent: September 6, 2022
    Assignee: United Services Automobile Association (USAA)
    Inventors: Scott Evan Daly, Robert Hugh Newman, II, Kori Rochelle Newman
  • Patent number: 11437032
    Abstract: The present disclosure discloses an image processing device including: a receiving module configured to receive a voice signal and an image to be processed; a conversion module configured to convert the voice signal into an image processing instruction and determine a target area according to a target voice instruction conversion model, in which the target area is a processing area of the image to be processed; and a processing module configured to process the target area according to the image processing instruction and a target image processing model. The examples may realize the functionality of using voice commands to control image processing, which may save users' time spent in learning image processing software prior to image processing, and improve user experience.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: September 6, 2022
    Assignee: SHANGHAI CAMBRICON INFORMATION TECHNOLOGY CO., LTD
    Inventors: Tianshi Chen, Shuai Hu, Xiaobing Chen
  • Patent number: 11430439
    Abstract: Method for providing assistance in conversation including recognizing, by recognition module, conversation between primary user and at least one secondary user, identifying, by recognition module, first and second context data for primary user and at least one secondary user based on conversation; generating, by response generation module, at least one response on behalf of primary user based on at least one of second context data derived from at least one secondary user, and first context data; analyzing, by determining module, at least one action of primary user in at least one response on second context data; determining, by determining module, intervening situation in conversation based on at least one action; selecting, by intervening response module, intervening response from at least one response for determined intervening situation based on at least one action; and delivering, by response delivery module, intervening response to at least one secondary user during determined intervening situation.
    Type: Grant
    Filed: July 22, 2020
    Date of Patent: August 30, 2022
    Inventors: Ritesh Shreeshreemal, Gaurav Chaurasia
  • Patent number: 11423916
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: June 14, 2020
    Date of Patent: August 23, 2022
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Kristofer Kjoerling, Lars Villemoes
  • Patent number: 11423897
    Abstract: Systems and methods are described herein for generating an adaptive response to a user request. Input indicative of a user request may be received and utilized to identify an item in an electronic catalog. Title segments may be identified from the item's title. Significant segments of the user request may be determined. In response to the user request, a shortened title may be generated from the identified title segments and provided as output at the user device (e.g., via audible output provided at a speaker of the user device, via textual output, or the like). At least one of the title segments provided in the shortened title may correlate to the significant segment identified from the user request. In some embodiments, the length and content of the shortened title may vary based at least in part on the contextual intent of the user's request.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: August 23, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Ran Levy, Ori Rozen, Leon Portman, Knaan Ratosh, Ido Arad, Hadar Neumann
  • Patent number: 11417327
    Abstract: An electronic apparatus is provided. The electronic apparatus includes: a storage configured to store recognition related information and misrecognition related information of a trigger word for entering a speech recognition mode; and a processor configured to identify whether or not the speech recognition mode is activated on the basis of characteristic information of a received uttered speech and the recognition related information, identify a similarity between text information of the received uttered speech and text information of the trigger word, and update at least one of the recognition related information or the misrecognition related information on the basis of whether or not the speech recognition mode is activated and the similarity.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: August 16, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Chanhee Choi
  • Patent number: 11417322
    Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for transliteration for speech recognition training and scoring. In some implementations, language examples are accessed, some of which include words in a first script and words in one or more other scripts. At least portions of some of the language examples are transliterated to the first script to generate a training data set. A language model is generated based on occurrences of the different sequences of words in the training data set in the first script. The language model is used to perform speech recognition for an utterance.
    Type: Grant
    Filed: December 12, 2019
    Date of Patent: August 16, 2022
    Assignee: Google LLC
    Inventors: Bhuvana Ramabhadran, Min Ma, Pedro J. Moreno Mengibar, Jesse Emond, Brian E. Roark
  • Patent number: 11417353
    Abstract: A method for detecting an audio signal and an apparatus, where the method includes determining a segmental signal-to-noise ratio (SSNR) of an audio signal in response to the audio signal being an unvoiced signal, reducing a reference voice activity detection (VAD) decision threshold to obtain a reduced VAD decision threshold, and comparing the SSNR with the reduced VAD decision threshold to determine whether the audio signal is an active signal.
    Type: Grant
    Filed: June 15, 2020
    Date of Patent: August 16, 2022
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
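
The decision logic in the last abstract above (patent 11417353) can be sketched roughly as follows. The SSNR formula (mean per-sub-band SNR in dB), the threshold values, and the names `segmental_snr_db` and `is_active_signal` are assumptions made for illustration; the abstract does not specify how the SSNR is computed or by how much the threshold is reduced.

```python
import math


def segmental_snr_db(subband_energies, noise_energies):
    """Assumed SSNR definition: mean of the per-sub-band SNRs in dB."""
    floor = 1e-12  # guards against division by zero and log of zero
    ratios_db = [
        10.0 * math.log10(max(signal, floor) / max(noise, floor))
        for signal, noise in zip(subband_energies, noise_energies)
    ]
    return sum(ratios_db) / len(ratios_db)


def is_active_signal(ssnr_db, is_unvoiced,
                     reference_threshold_db=5.0,
                     unvoiced_reduction_db=2.0):
    """Compare the SSNR with the VAD decision threshold.

    Both default values are hypothetical. For an unvoiced signal the
    reference threshold is reduced before the comparison, so weaker
    unvoiced sounds can still be classified as active.
    """
    threshold_db = reference_threshold_db
    if is_unvoiced:
        threshold_db -= unvoiced_reduction_db
    return ssnr_db > threshold_db


if __name__ == "__main__":
    ssnr = segmental_snr_db([2.5, 2.5], [1.0, 1.0])
    print(is_active_signal(ssnr, is_unvoiced=False))
    print(is_active_signal(ssnr, is_unvoiced=True))
```

Keeping the reduction as a separate parameter makes it easy to see that the same SSNR can fall below the reference threshold yet still exceed the reduced one for unvoiced frames.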