Patents Examined by Michael Colucci
  • Patent number: 11514889
    Abstract: A device and a method for clarifying dysarthria voices is disclosed. Firstly, a dysarthria voice signal is received and framed to generate dysarthria frames. Then, the dysarthria frames are received to retrieve dysarthria features. Finally, the dysarthria features are received. Without receiving phases corresponding to the dysarthria features, the dysarthria features are converted into an intelligent voice signal based on an intelligent voice conversion model. The intelligent voice conversion model is not trained by the dynamic time warping (DTW). The present invention avoids the phase distortion of the voice signal and provides more natural and clarified voices with low noise.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: November 29, 2022
    Assignee: NATIONAL CHUNG CHENG UNIVERSITY
    Inventors: Tay-Jyi Lin, Che Chia Pai, Hsi Che Wang, Ching-Wei Yeh
  • Patent number: 11514330
    Abstract: Methods and systems are provided for a natural language processing system comprising a chatbot adapted for dialog generation. In one example, the system may include a combination of a variational autoencoder (VAE) and a generative adversarial network (GAN) for generating natural responses to input queries. The VAE may convert queries into vector embeddings that may then be used by the GAN to continuously update and improve responses provided by the chatbot.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: November 29, 2022
    Assignee: Cambia Health Solutions, Inc.
    Inventors: Weicheng Ma, Kai Cao, Bei Pan, Lin Chen, Xiang Li
  • Patent number: 11507759
    Abstract: A speech translation device, for conversation between a first speaker making an utterance in a first language and a second speaker making an utterance in a second language different from the first language, includes: a speech detector that detects, from sounds that are input, a speech segment in which the first speaker or the second speaker made an utterance; a display that, after speech recognition is performed on the utterance, displays a translation result obtained by translating the utterance from the first language to the second language or from the second language to the first language; and an utterance instructor that outputs, in the second language via the display, a message prompting the second speaker to make an utterance after a first speaker's utterance or outputs, in the first language via the display, a message prompting the first speaker to make an utterance after a second speaker's utterance.
    Type: Grant
    Filed: March 19, 2020
    Date of Patent: November 22, 2022
    Assignee: PANASONIC HOLDINGS CORPORATION
    Inventors: Hiroki Furukawa, Atsushi Sakaguchi, Tsuyoki Nishikawa
  • Patent number: 11501753
    Abstract: A method includes receiving, from an electronic device, information defining a user utterance associated with a skill to be performed, where the skill is not recognized by a natural language understanding (NLU) engine. The method also includes receiving, from the electronic device, information defining one or more actions for performing the skill. The method further includes identifying, using at least one processor, one or more known skills having one or more slots that map to at least one word or phrase in the user utterance. The method also includes creating, using the at least one processor, a plurality of additional utterances based on the one or more mapped slots. In addition, the method includes training, using the at least one processor, the NLU engine using the plurality of additional utterances.
    Type: Grant
    Filed: December 27, 2019
    Date of Patent: November 15, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Yilin Shen, Avik Ray, Hongxia Jin
  • Patent number: 11488580
    Abstract: It is an aspect of the present disclosure to provide a dialogue system capable of providing an extended function to the user by registering a new vocabulary that matches the user's preference and by changing the pre-stored conversation pattern.
    Type: Grant
    Filed: November 13, 2019
    Date of Patent: November 1, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA CORPORATION
    Inventors: Seona Kim, Jeong-Eom Lee, Dongsoo Shin
  • Patent number: 11488587
    Abstract: Disclosed is a regional-features-based speech recognition method, including learning speech features by region using speech data classified by region category, and recognizing input speech using an acoustic model and a language model generated through classification of a region category for the input speech and the learning. A user may use a dialect recognition service that is improved using learning based on artificial intelligence (AI) and enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine-type communications (mMTC) techniques of 5G mobile communication.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: November 1, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Seonyeong Park
  • Patent number: 11488581
    Abstract: A new approach to automatic speech recognition is disclosed. An example method include receiving a first text representing speech recognition of a phrase spoken by a user, isolating a candidate named entity from within the phrase, receiving a first phonetic representation of the candidate named entity, comparing the first phonetic representation to phonetic representations in a mapping database which map the phonetic representations to words to yield a comparison, based on the comparison, identifying a second phonetic representation in the mapping database that matches a second text in the mapping database to the second phonetic representation and replacing the candidate named entity with the second text. The approach can be used for new brands for which automatic speech recognition error rates are high.
    Type: Grant
    Filed: December 6, 2019
    Date of Patent: November 1, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Shlomi Chovel, Adriano Devillaine, Omer Shabtai Jakobinsky, Colin Zhen De Kho, Kawshik Karur Rangaraju, Ajay Soni, Yochai Zvik, Yunqiang Zhu
  • Patent number: 11475877
    Abstract: Disclosed are an end-to-end system for speech recognition and speech translation and an electronic device. The system comprises an acoustic encoder and a multi-task decoder and a semantic invariance constraint module, and completes two tasks for speech recognition and speech translation. In addition, according to the characteristic of the semantic consistency of texts between different tasks, semantic constraints are imposed on the model to learn high-level semantic information, and the semantic information can effectively improve the performance of speech recognition and speech translation. The application has the following advantages that the error accumulation problem of serial system is avoided, and the calculation cost of the model is low and the real-time performance is very high.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: October 18, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Shuai Zhang, Jiangyan Yi
  • Patent number: 11450319
    Abstract: The present disclosure discloses an image processing device including: a receiving module configured to receive a voice signal and an image to be processed; a conversion module configured to convert the voice signal into an image processing instruction and determine a target area according to a target voice instruction conversion model, in which the target area is a processing area of the image to be processed; and a processing module configured to process the target area according to the image processing instruction and a target image processing model. The examples may realize the functionality of using voice commands to control image processing, which may save users' time spent in learning image processing software prior to image processing, and improve user experience.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: September 20, 2022
    Inventors: Tianshi Chen, Shuai Hu, Xiaobing Chen
  • Patent number: 11437045
    Abstract: System, methods, and computer readable media can be used to create a virtual assistant. One of the methods includes receiving audio from a conversation between two parties while the conversation is occurring. The method includes generating a partial transcript of the conversation. The method includes identifying topics based on the partial transcript. The method includes presenting a user interface element based on the identified topic.
    Type: Grant
    Filed: October 18, 2018
    Date of Patent: September 6, 2022
    Assignee: United Services Automobile Association (USAA)
    Inventors: Scott Evan Daly, Robert Hugh Newman, II, Kori Rochelle Newman
  • Patent number: 11437032
    Abstract: The present disclosure discloses an image processing device including: a receiving module configured to receive a voice signal and an image to be processed; a conversion module configured to convert the voice signal into an image processing instruction and determine a target area according to a target voice instruction conversion model, in which the target area is a processing area of the image to be processed; and a processing module configured to process the target area according to the image processing instruction and a target image processing model. The examples may realize the functionality of using voice commands to control image processing, which may save users' time spent in learning image processing software prior to image processing, and improve user experience.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: September 6, 2022
    Assignee: SHANGHAI CAMBRICON INFORMATION TECHNOLOGY CO., LTD
    Inventors: Tianshi Chen, Shuai Hu, Xiaobing Chen
  • Patent number: 11430439
    Abstract: Method for providing assistance in conversation including recognizing, by recognition module, conversation between primary user and at least one secondary user, identifying, by recognition module, first and second context data for primary user and at least one secondary user based on conversation; generating, by response generation module, at least one response on behalf of primary user based on at least one of second context data derived from at least one secondary user, and first context data; analyzing, by determining module, at least one action of primary user in at least one response on second context data; determining, by determining module, intervening situation in conversation based on at least one action; selecting, by intervening response module, intervening response from at least one response for determined intervening situation based on at least one action; and delivering, by response delivery module, intervening response to at least one secondary user during determined intervening situation.
    Type: Grant
    Filed: July 22, 2020
    Date of Patent: August 30, 2022
    Inventors: Ritesh Shreeshreemal, Gaurav Chaurasia
  • Patent number: 11423897
    Abstract: Systems and methods are described herein for generating an adaptive response to a user request. Input indicative of a user request may be received and utilized to identify an item in an electronic catalog. Title segments may be identified from the item's title. Significant segments of the user request may be determined. In response to the user request, a shortened title may be generated from the identified title segments and provided as output at the user device (e.g., via audible output provided at a speaker of the user device, via textual output, or the like). At least one of the title segments provided in the shortened title may correlate to the significant segment identified from the user request. In some embodiments, the length and content of the shortened title may vary based at least in part on the contextual intent of the user's request.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: August 23, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Ran Levy, Ori Rozen, Leon Portman, Knaan Ratosh, Ido Arad, Hadar Neumann
  • Patent number: 11423916
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: June 14, 2020
    Date of Patent: August 23, 2022
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Kristofer Kjoerling, Lars Villemoes
  • Patent number: 11417322
    Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for transliteration for speech recognition training and scoring. In some implementations, language examples are accessed, some of which include words in a first script and words in one or more other scripts. At least portions of some of the language examples are transliterated to the first script to generate a training data set. A language model is generated based on occurrences of the different sequences of words in the training data set in the first script. The language model is used to perform speech recognition for an utterance.
    Type: Grant
    Filed: December 12, 2019
    Date of Patent: August 16, 2022
    Assignee: Google LLC
    Inventors: Bhuvana Ramabhadran, Min Ma, Pedro J. Moreno Mengibar, Jesse Emond, Brian E. Roark
  • Patent number: 11417327
    Abstract: An electronic apparatus is provided. The electronic device includes: a storage configured to store recognition related information and misrecognition related information of a trigger word for entering a speech recognition mode; and a processor configured to identify whether or not the speech recognition mode is activated on the basis of characteristic information of a received uttered speech and the recognition related information, identify a similarity between text information of the received uttered speech and text information of the trigger word, and update at least one of the recognition related information or the misrecognition related information on the basis of whether or not the speech recognition mode is activated and the similarity.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: August 16, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Chanhee Choi
  • Patent number: 11417353
    Abstract: A method for detecting an audio signal and an apparatus, where the method includes determining a segmental signal-to-noise ratio (SSNR) of an audio signal in response to the audio signal being an unvoiced signal, reducing a reference voice activity detection (VAD) decision threshold to obtain a reduced VAD decision threshold, and comparing the SSNR with the reduced VAD decision threshold to determine whether the audio signal is an active signal.
    Type: Grant
    Filed: June 15, 2020
    Date of Patent: August 16, 2022
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Zhe Wang
  • Patent number: 11410661
    Abstract: A system for analyzing audio content is disclosed. In general, the system includes a transcription module, a correlation module, and a database. The transcription module is configured to receive a plurality of audio (and video) files generated by a plurality of different sources, execute speech-to-text transcriptions in real-time based on portions of audio content included within the audio files, and generate written transcripts of such transcriptions. The correlation module is configured to receive metadata associated with each of such audio files, derive correlations between such written transcripts and metadata, and report such correlations to a user of the system (and/or conclusions and classifications based on such correlations). The database is configured to receive, record, and make accessible for searching and review the correlations generated by the correlation module.
    Type: Grant
    Filed: July 13, 2020
    Date of Patent: August 9, 2022
    Inventor: Walter Bachtiger
  • Patent number: 11404067
    Abstract: A method of operating an electronic device and an electronic device thereof are provided. The method includes receiving a first voice signal of a first user, authenticating whether the first user has authority to control the electronic device, based on the first voice signal, and determining an instruction corresponding to the first voice signal based on an authentication result and controlling the electronic device according to the instruction. The electronic device includes a receiver configured to receive a first voice signal of a first user and at least one processor configured to authenticate whether the first user has authority to control the electronic device based on the first voice signal, determine an instruction corresponding to the first voice signal, and control the electronic device according to the instruction.
    Type: Grant
    Filed: June 15, 2020
    Date of Patent: August 2, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Anas Toma, Ahmad Abu Shariah, Hadi Jadallah
  • Patent number: 11392646
    Abstract: There is provided an information processing device, an information processing terminal, and an information processing method which are capable of presenting a choice which is easily recognized by voice. The information processing device according to one aspect of the present technology acquires a plurality of pieces of presentation information to be presented as choices and causes a part which is not similar to the other presentation information among the respective pieces of presentation information to be presented in a form different from a similar part. The present technology can be applied to devices having a voice assistant function of assisting a behavior of a user.
    Type: Grant
    Filed: November 1, 2018
    Date of Patent: July 19, 2022
    Assignee: SONY CORPORATION
    Inventor: Mari Saito