Patents Examined by Edgar X Guerra-Erazo

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

Patent number: 11810545

Abstract: A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.

Type: Grant

Filed: May 7, 2020

Date of Patent: November 7, 2023

Assignee: VOCOLLECT, INC.

Inventors: James Hendrickson, Debra Drylie Stiffey, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
Local and cloud speech recognition

Patent number: 11804227

Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for distributing the performance of speech recognition among a remote control device and a voice platform in the cloud. In some embodiments, the remote control device operates to receive a voice input from a user. The remote control device detects a trigger word in the voice input. The remote control device then processes the voice input. The remote control device then transmits the voice input to a voice platform based on the detecting in order to determine an intent associated with the voice input.

Type: Grant

Filed: May 21, 2021

Date of Patent: October 31, 2023

Assignee: Roku, Inc.

Inventors: Anthony John Wood, David Stern, Gregory Mack Garner
Scalable dynamic class language modeling

Patent number: 11804218

Abstract: This document generally describes systems and methods for dynamically adapting speech recognition for individual voice queries of a user using class-based language models. The method may include receiving a voice query from a user that includes audio data corresponding to an utterance of the user, and context data associated with the user. One or more class models are then generated that collectively identify a first set of terms determined based on the context data, and a respective class to which the respective term is assigned for each respective term in the first set of terms. A language model that includes a residual unigram may then be accessed and processed for each respective class to insert a respective class symbol at each instance of the residual unigram that occurs within the language model. A transcription of the utterance of the user is then generated using the modified language model.

Type: Grant

Filed: February 10, 2021

Date of Patent: October 31, 2023

Assignee: Google LLC

Inventors: Justin Max Scheiner, Petar Aleksic
Optimization method for implementation of mel-frequency cepstral coefficients

Patent number: 11804238

Abstract: An optimization method for an implementation of mel-frequency cepstral coefficients is provided. The optimization method includes the following steps: performing a framing step, including using a 400×16 static random access memory to temporarily store a plurality of sampling points of a sound signal with overlap, and decomposing the sound signal into a plurality of frames. Each of the plurality of frames is 400 of the sampling points, there is an overlapping region between adjacent two of the plurality of frames, and the overlapping region includes 240 of the sampling points. The optimization method further includes performing a windowing step, which includes multiplying each of the plurality of frames by a window function in a bit-level design, and the optimization method includes performing a fast Fourier transform (FFT) step, which includes applying a 512 point FFT on a frame signal to obtain a corresponding frequency spectrum.

Type: Grant

Filed: October 29, 2021

Date of Patent: October 31, 2023

Assignee: REALTEK SEMICONDUCTOR CORP.

Inventors: Li-Li Tan, Zhi-Lin Wang, Xiao-Feng Cao, Xiao-Huan Li
Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Patent number: 11798541

Abstract: Determining a language for speech recognition of a spoken utterance received via an automated assistant interface for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Implementations determine a user profile that corresponds to audio data that captures a spoken utterance, and utilize language(s), and optionally corresponding probabilities, assigned to the user profile in determining a language for speech recognition of the spoken utterance. Some implementations select only a subset of languages, assigned to the user profile, to utilize in speech recognition of a given spoken utterance of the user.

Type: Grant

Filed: November 16, 2020

Date of Patent: October 24, 2023

Assignee: GOOGLE LLC

Inventors: Pu-sen Chao, Diego Melendo Casado, Ignacio Lopez Moreno
Optimization of workload scheduling in a distributed shared resource environment

Patent number: 11789774

Abstract: An artificial intelligence (AI) platform to support optimization of workload scheduling in a distributed computing environment. Unstructured data corresponding to one or more application artifacts related to a workload in the distributed computing environment is leveraged. NLP is applied to the unstructured data to identify one or more host requirements corresponding to the application artifacts. One or more hosts in the computing environment compatible with the identified host requirements are selectively identified and compatibility between the application artifacts and the identified hosts is assessed. The workload is selectively scheduled responsive to the selective host identification based on the assessed compatibility. The scheduled workload is selectively executed on at least one of the selectively identified hosts responsive to the assessment workload compatibility.

Type: Grant

Filed: February 22, 2021

Date of Patent: October 17, 2023

Assignee: International Business Machines Corporation

Inventors: Abhishek Malvankar, John M. Ganci, Jr., Ashok Pon Kumar Sree Prakash, Umamaheswari Devi
Voice application platform

Patent number: 11790904

Abstract: Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.

Type: Grant

Filed: November 25, 2020

Date of Patent: October 17, 2023

Assignee: Voicify, LLC

Inventors: Robert T. Naughton, Nicholas G. Laidlaw, Alexander M. Dunn, Jeffrey K. McMahon
Determining state of automated assistant dialog

Patent number: 11790899

Abstract: Determining a dialog state of an electronic dialog that includes an automated assistant and at least one user, and performing action(s) based on the determined dialog state. The dialog state can be represented as one or more slots and, for each of the slots, one or more candidate values for the slot and a corresponding score (e.g., a probability) for each of the candidate values. Candidate values for a slot can be determined based on language processing of user utterance(s) and/or system utterance(s) during the dialog. In generating scores for candidate value(s) of a given slot at a given turn of an electronic dialog, various features are determined based on processing of the user utterance and the system utterance using a memory network. The various generated features can be processed using a scoring model to generate scores for candidate value(s) of the given slot at the given turn.

Type: Grant

Filed: November 19, 2020

Date of Patent: October 17, 2023

Assignee: GOOGLE LLC

Inventors: Abhinav Rastogi, Larry Paul Heck, Dilek Hakkani-Tur
Asset user discovery data classification and risk evaluation

Patent number: 11782912

Abstract: Methods, systems, and devices for asset discovery, user discovery, data classification, risk evaluation, and data/device security are described. The method includes retrieving data stored at one or more remote locations, summarizing the retrieved data at the one or more remote locations, transferring the summarized data from the one or more remote locations to the at least one computing device, processing the transferred data by the at least one computing device, discovering assets in technology environments, classifying data that resides on each asset of the discovered assets into a respective confidentiality group of multiple confidentiality groups, calculating one or more risk scores for the discovered assets or users of the discovered assets, or both, and performing a security action to protect data that resides on an asset of the discovered assets.

Type: Grant

Filed: August 17, 2020

Date of Patent: October 10, 2023

Assignee: Lucidum, Inc.

Inventors: Shuning Wu, Wangyan Feng, Joel M. Fulton
Ambiguity resolution for application integration

Patent number: 11756548

Abstract: Systems and processes for operating an intelligent automated assistant are provided. An intelligent automated assistant receives a user input and generates a set of one or more token sequences based on the user input. The set of one or more token sequences are interpreted to generate a plurality of candidate interpretations each including a corresponding action and metadata associated with the action. A top candidate interpretation is selected from the plurality of candidate interpretations, and the corresponding action is performed based on the associated metadata.

Type: Grant

Filed: September 21, 2022

Date of Patent: September 12, 2023

Assignee: Apple Inc.

Inventors: Lewis N. Perkins, Peter E. Boothroyd, Antonio M. Cancio, Thorvaldur Helgason, Antoine R. Raux, Gayathri Sairamkrishnan
Method and apparatus for displaying candidate word, and graphical user interface

Patent number: 11755835

Abstract: Embodiments of the present invention relate to the field of terminal technologies, and provide a method and an apparatus for displaying a candidate word, and a graphical user interface to improve efficiency of a user in entering information by using an input method. The method is applied to a scenario in which a user enters information by using an input method. The method includes: determining a type of an application that invokes the input method; determining, according to the type, dimension information corresponding to the type; determining, according to the dimension information, a lexicon corresponding to the dimension information; and displaying, in a default candidate option area of the input method, at least one candidate word that is in the lexicon and meets a preset condition.

Type: Grant

Filed: January 8, 2021

Date of Patent: September 12, 2023

Assignee: HONOR DEVICE CO., LTD.

Inventors: Weibin Zheng, Yue Zhang
Search and knowledge base question answering for a voice user interface

Patent number: 11740863

Abstract: A voice-controlled question answering system that is capable of answering questions using both a knowledge base and a search engine. The knowledge base is used to answer questions when answers to those questions are contained in the knowledge base. If an answer using the knowledge base is unavailable, and if the question is suitable for answering using an unstructured search approach, the system may obtain an answer using a search engine. The search engine results may be processed to obtain an answer to the question suitable for output using a voice user interface.

Type: Grant

Filed: May 1, 2020

Date of Patent: August 29, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Daniel Lewis Spector, Fergus O'Donoghue, Chase Wesley Brown, Jr., Shayne Leon Snow, Brandon Gerald Li Horst, William Folwell Barton
Systems and methods for slot relation extraction for machine learning task-oriented dialogue systems

Patent number: 11734519

Abstract: A system and method for implementing slot-relation extraction for a task-oriented dialogue system that includes implementing dialogue intent classification machine learning models that predict a category of dialogue of a single utterance based on an input of utterance data relating to the single utterance, wherein the category of dialogue informs a selection of slot-filling machine learning models; implementing the slot-filling machine learning models that predict slot classification labels for each of a plurality of slots within the utterance based on the input of the utterance data; implementing a slot relation extraction machine learning model that predicts semantic relationship classifications between two or more distinct slots of tokens of the utterance; and generating a response to the single utterance or performing actions in response to the single utterance based on the semantic relationship classifications between the distinct pairings of the two or more distinct slots of the single utterance.

Type: Grant

Filed: February 10, 2021

Date of Patent: August 22, 2023

Assignee: Clinc, Inc.

Inventors: Andrew Lee, Zhenguo Chen, Jonathan K. Kummerfeld
Automated translation of subject matter specific documents

Patent number: 11734514

Abstract: Documents in source natural languages are translated into target natural languages using a computer-implemented translation that is configured to operate within the domain of the subject matter of the documents that imposes specialized requirements for translation and readability. Subject matter specific documents typically include domain-specific terminology, are subject to various regulatory guidelines, and have different readability requirements depending on the intended reader. The computer-implemented translation applies machine-learning techniques that deconstruct elements of the subject matter specific document into a standard data structure and perform pre-processing steps to tokenize digitized document text to identify the correct sentence structure and syntax for the target natural language to optimize translation by, e.g., a neural machine translation engine.

Type: Grant

Filed: November 16, 2020

Date of Patent: August 22, 2023

Assignee: IQVIA INC.

Inventors: Gary Shorter, Naouel Baili Ben Abdallah, Barry Ahrens
Method and system for expansion to everyday language by using word vectorization technique based on social network content

Patent number: 11734508

Abstract: Provided is a method and system for expanding to an everyday language using a word vectorization technique based on social network content. A content providing method includes collecting social network content on the Internet; expanding corresponding content information to a word set of words included in the social network content with respect to target content that is to be serviced to a client; and providing the target content to the client with respect to user information associated with the client using the word set.

Type: Grant

Filed: September 9, 2020

Date of Patent: August 22, 2023

Assignee: LINE Corporation

Inventor: Hyukjae Jang
Controllable style-based text transformation

Patent number: 11734509

Abstract: Methods, systems and computer program products for multi-style text transformation are provided herein. A computer-implemented method includes selecting at least one set of style specifications for transforming at least a portion of input text. The at least one set of style specifications include one or more target writing style domains selected from a plurality of writing style domains, weights for each of the target writing style domains representing relative impact of the target writing style domains for transformation of at least a portion of the input text, and weights for each of a set of linguistic aspects for transformation of at least a portion of the input text. The computer-implemented method also includes generating one or more style-transformed output texts based at least in part on the at least one set of style specifications utilizing at least one unsupervised neural network.

Type: Grant

Filed: December 29, 2020

Date of Patent: August 22, 2023

Assignee: International Business Machines Corporation

Inventors: Abhijit Mishra, Parag Jain, Amar P. Azad, Karthik Sankaranarayanan
Proximity aware voice agent

Patent number: 11721337

Abstract: A personal assistant device configured to control companion devices may include a memory configured to maintain a companion device library including a plurality of companion device each associated with at least one long-name, short-cut name and companion device room location, and a processor. The processor may be configured to receive a user command from a microphone, extract a companion device name and action from the user command, determine whether the companion device name includes a unique name, and command a companion device associated with the unique name to perform the action from the user command in response to the user command including the unique name.

Type: Grant

Filed: July 20, 2020

Date of Patent: August 8, 2023

Assignee: Harman International Industries, Incorporated

Inventor: Craig Gunther
Multimodal transmission of packetized data

Patent number: 11705121

Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.

Type: Grant

Filed: July 23, 2020

Date of Patent: July 18, 2023

Assignee: GOOGLE LLC

Inventors: Gaurav Bhaya, Robert Stets, Umesh Patil
Event-based speech interactive media player

Patent number: 11699436

Abstract: Interactive content containing audio or video may be provided in conjunction with non-interactive content containing audio or video to enhance user engagement and interest with the contents and to increase the effectiveness of the distributed information. Interactive content may be directly inserted into the existing, non-interactive content. Additionally or alternatively, interactive content may be streamed in parallel to the existing content, with minimal modification to the existing content. For example, the server may monitor content from a content provider; detect an event (e.g., a marker embedded in the content stream, or in a data source external to the content stream); upon detecting the event, play interactive content at a designated time while silencing the content stream of the content provider (e.g., by muting, pausing, playing silence.) The marker may be a sub-audible tone or metadata associated with the content stream. The user may respond to the interactive content by voice.

Type: Grant

Filed: July 2, 2020

Date of Patent: July 11, 2023

Assignee: XAPPMEDIA, INC.

Inventors: Patrick B. Higbie, John P. Kelvie, Michael M. Myers, Franklin D. Raines
Adaptive multichannel dereverberation for automatic speech recognition

Patent number: 11699453

Abstract: Utilizing an adaptive multichannel technique to mitigate reverberation present in received audio signals, prior to providing corresponding audio data to one or more additional component(s), such as automatic speech recognition (ASR) components. Implementations disclosed herein are “adaptive”, in that they utilize a filter, in the reverberation mitigation, that is online, causal and varies depending on characteristics of the input. Implementations disclosed herein are “multichannel”, in that a corresponding audio signal is received from each of multiple audio transducers (also referred to herein as “microphones”) of a client device, and the multiple audio signals (e.g., frequency domain representations thereof) are utilized in updating of the filter—and dereverberation occurs for audio data corresponding to each of the audio signals (e.g., frequency domain representations thereof) prior to the audio data being provided to ASR component(s) and/or other component(s).

Type: Grant

Filed: August 28, 2020

Date of Patent: July 11, 2023

Assignee: GOOGLE LLC

Inventors: Joseph Caroselli, Arun Narayanan, Izhak Shafran, Richard Rose

prev 1 2 3 4 5 6 … next