Patents Examined by Brian L. Albertalli
  • Patent number: 11984125
    Abstract: Presented herein are techniques for augmenting a speech recognition engine. According to the disclosed techniques, audio data is obtained as part of an automatic speech recognition session. Speech hints are also obtained as part of the automatic speech recognition session. A dynamic language model is generated from the speech hints for use during the automatic speech recognition session. A combined language model is then generated from the dynamic language model and a static language model. Finally, the audio data is converted to text using the combined language model as part of the automatic speech recognition session.
    Type: Grant
    Filed: June 29, 2021
    Date of Patent: May 14, 2024
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Rishabh Gupta Yadav, Kareem Nassar, Sylvain Le Groux, Matthew James Ceravolo
  • Patent number: 11978468
    Abstract: An audio signal processing method includes measuring a voice signal, wherein the measurement performed by an audio system including first through third sensors. Measuring the voice signal produces first through third audio signals by the first through third sensors, respectively. The audio signal processing method further includes: producing an output signal by using the first audio signal, the second audio signal and the third audio signal, wherein the output signal corresponds to: the first audio signal below a first crossing frequency, the second audio signal between the first crossing frequency and a second crossing frequency, the third audio signal above the second crossing frequency. The first crossing frequency is lower than or equal to the second crossing frequency, wherein the first crossing frequency and the second crossing frequency are different for at least some operating conditions of the audio system.
    Type: Grant
    Filed: April 6, 2022
    Date of Patent: May 7, 2024
    Assignee: Analog Devices International Unlimited Company
    Inventors: Stijn Robben, Abdel Yussef Hussenbocus, Jean-Marc Luneau
  • Patent number: 11967325
    Abstract: Disclosed are an electronic device capable of efficiently performing speech recognition and natural language understanding and a method for controlling thereof. The electronic device includes: a microphone; a non-volatile memory configured to store virtual assistant model data comprising data that is classified according to a plurality of domains and data that is commonly used for the plurality of domains; a volatile memory; and a processor configured to: based on receiving, through the microphone, a trigger input to perform speech recognition for a user speech, initiate loading the virtual assistant model data from the non-volatile memory into the volatile memory, load, into the volatile memory, first data from among the data classified according to the plurality of domains and, while loading the first data into the volatile memory, load at least a part of the data commonly used for the plurality of domains into the volatile memory.
    Type: Grant
    Filed: December 30, 2022
    Date of Patent: April 23, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Saebom Jang, Hyeonmok Ko, Kyenghun Lee, Kunal Sharma, Raghavendra Hanumantasetty Ramasetty
  • Patent number: 11967330
    Abstract: Described herein is a method for generating a modified bitstream on a source device, wherein the method includes the steps of: a) receiving, by a receiver, a bitstream including coded media data; b) generating, by an embedder, payload of additional media data and embedding the payload in the bitstream for obtaining, as an output from the embedder, a modified bitstream including the coded media data and the payload of the additional media data; and c) outputting the modified bitstream to a sink device. Described is further a method for processing said modified bitstream on a sink device. Described are moreover a respective source device and sink device as well as a system of a source device and a sink device and respective computer program products.
    Type: Grant
    Filed: August 13, 2020
    Date of Patent: April 23, 2024
    Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Christof Fersch, Daniel Fischer, Leon Terentiv, Gregory John McGarry
  • Patent number: 11967338
    Abstract: Systems and methods for a computerized interactive voice companion include functionality that receives audio of a user's voice as the user is speaking; detects a tone and/or other relevant aspects associated with the content of the user's voice based on the audio of the user's voice as the user is speaking and determines, as the user is speaking, a response to the user speaking based on the detected tone and/or other relevant aspects associated with the content of the user's voice of the user's voice. The computerized interactive voice companion system, then orally or visually provides the response to the user automatically in real-time as a reply to the user speaking. The system may then continue the conversation based on continuing to detect the mood of the user as they speak and basing responses on this, as well as other recent user behavior detected to be relevant to the conversation.
    Type: Grant
    Filed: October 27, 2020
    Date of Patent: April 23, 2024
    Assignee: DISH NETWORK TECHNOLOGIES INDIA PRIVATE LIMITED
    Inventor: Rangu Kr
  • Patent number: 11960791
    Abstract: A method for controlling a motion tracking system, including the steps of: digitally processing sound waves detected by a plurality of microphones so as to detect a voice of a user and estimate a first direction of the user; digitally processing electromagnetic waves captured by antennas so as to detect data packets transmitted to a computing apparatus by sensors and estimate second directions of each sensor; digitally averaging the second directions so as to provide an average direction for the sensors; digitally computing a difference between the first direction and the average direction; and starting to digitally track motion of the user based on measurements of each sensor when the computed difference does not exceed a predetermined difference threshold.
    Type: Grant
    Filed: October 29, 2020
    Date of Patent: April 16, 2024
    Assignee: SWORD HEALTH, S.A.
    Inventors: Márcio Filipe Moutinho Colunas, José Carlos Coelho Alves, Luís António Correia de Oliveira, Luís Ungaro Pinto Coelho, Virgílio António Ferro Bento
  • Patent number: 11941345
    Abstract: A computer-implemented process is programmed to process a source input, determine text enhancements, and present the text enhancements to apply to the sentences dictated from the source input. A text processor may use machine-learning models to process an audio input to generate sentences in a presentable format. An audio input can be processed by an automatic speech recognition model to generate electronic text. The electronic text may be used to generate sentence structures using a normalization model. A comprehension model may be used to identify instructions associated with the sentence structures and generate sentences based on the instructions and the sentence structures. An enhancement model may be used to identify enhancements to apply to the sentences. The enhancements may be presented alongside sentences generated by the comprehension model to provide the user an option to select either the enhancements or the sentences.
    Type: Grant
    Filed: October 26, 2021
    Date of Patent: March 26, 2024
    Assignee: Grammarly, Inc.
    Inventors: Timo Mertens, Vipul Raheja, Chad Mills, Ihor Skliarevskyi, Ignat Blazhko, Robyn Perry, Nicholas Bern, Dhruv Kumar, Melissa Lopez
  • Patent number: 11935529
    Abstract: Techniques for virtual assistant execution of ambiguous commands is provided. A voice instruction from a user may be received at a virtual assistant. The voice instruction may request the virtual assistant to perform a command. The command that is most likely being requested by the voice instruction from the user is identified. An ordered set of actions to execute when performing the command may be retrieved. Each action of the ordered set of actions may indicate if the action is reversible. Each action of the ordered set of actions may be executed in order until a not reversible action is reached or no further actions are in the ordered set of actions.
    Type: Grant
    Filed: June 15, 2021
    Date of Patent: March 19, 2024
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventors: Ying Bin Tan, Chew How Lim, Yih Farn Ghee, Joe Yie Chong
  • Patent number: 11929059
    Abstract: The present disclosure relates to a text-to-speech synthesis method using machine learning based on a sequential prosody feature. The text-to-speech synthesis method includes receiving input text, receiving a sequential prosody feature, and generating output speech data for the input text reflecting the received sequential prosody feature by inputting the input text and the received sequential prosody feature to an artificial neural network text-to-speech synthesis model.
    Type: Grant
    Filed: August 27, 2020
    Date of Patent: March 12, 2024
    Assignee: NEOSAPIENCE, INC.
    Inventors: Taesu Kim, Younggun Lee
  • Patent number: 11915694
    Abstract: A voice control interactive system and method to provide a hands-free operation for the operator to monitor and control multiple conveyors in a warehouse. The system comprises a first computing device and a second computing device. The first computing device receives an audio signal generated by a second computing device and generates a control signal and a response signal in response to the audio signal. The audio signal comprises information relating to a verbal command spoken by an operator associated with the second computing device. The response signal comprises information relating to a response for the verbal command, wherein the information is generated based on a location of the second computing device. The control signal comprises information to control a conveyor.
    Type: Grant
    Filed: February 25, 2021
    Date of Patent: February 27, 2024
    Assignee: Intelligrated Headquarters, LLC
    Inventors: Jason-David Nitzberg, Timothy R. Williams, Zachary Reott, Sang Pheng, Lori A. Pike, Jason A. Johnson, Jeffrey P. Pike
  • Patent number: 11907666
    Abstract: Various embodiments of a system and associated method for anonymization of text without losing semantic utility of text by extracting a latent embedding representation of content with respect to a given task and by learning an optimal strategy for text embedding manipulation to satisfy both privacy and utility requirements are disclosed herein. In particular, the system balances private attribute obfuscation with retained semantic utility.
    Type: Grant
    Filed: November 16, 2021
    Date of Patent: February 20, 2024
    Assignee: Arizona Board of Regents on Behalf of Arizona State University
    Inventors: Ahmadreza Mosallanezhad, Ghazaleh Beigi, Huan Liu
  • Patent number: 11902766
    Abstract: An illustrative collaboration space provider system provides a virtual collaboration session that allows for audio communication between a user and one or more other users virtually located within a virtual collaboration space. The user is represented by an avatar located at an avatar location within the virtual collaboration space. The collaboration space provider system receives user input from the user, the user input representative of a voice origination location that is within the virtual collaboration space and is distinct from the avatar location. During the virtual collaboration session, the collaboration space provider system simulates propagation within the virtual collaboration space of a voice communication spoken by the user. The propagation of the voice communication is simulated to originate from the voice origination location and not from the avatar location. Corresponding methods and systems are also disclosed.
    Type: Grant
    Filed: July 30, 2021
    Date of Patent: February 13, 2024
    Assignee: Verizon Patent and Licensing Inc.
    Inventors: Samuel Charles Mindlin, Kunal Jathal, Shan Anis, David Skuratowicz
  • Patent number: 11900937
    Abstract: Example techniques involve suppressing a wake word response to a local wake word. An example implementation involves a playback device receiving audio content for playback by the playback device and providing a sound data stream representing the received audio content to a voice assistant service (VAS) wake-word engine and a local keyword engine. The playback device plays back a first portion of the audio content and detects, via the local keyword engine, that a second portion of the received audio content includes sound data matching one or more particular local keywords. Before the second portion of the received audio content is played back, the playback device disables a local keyword response of the local keyword engine to the one or more particular local keywords and then plays back the second portion of the audio content via one or more speakers.
    Type: Grant
    Filed: July 1, 2022
    Date of Patent: February 13, 2024
    Assignee: Sonos, Inc.
    Inventor: Jonathan P. Lang
  • Patent number: 11893309
    Abstract: In response to a user interacting with a tangible peripheral assistant control device (e.g., depressing a button of the device), causing an automated assistant to perform one or more actions. The action(s) performed can be based on input previously provided by the user in configuring the peripheral assistant control device. The action(s) performed in response to interaction with the peripheral assistant control device can vary based on one or more conditions, such as which user is currently active, where the peripheral assistant control device is currently located (which can optionally be inferred based on which of multiple assistant computing devices the button is paired with), and/or the current state of one or more smart devices and/or other devices (e.g., as determined based on a device topology). A utility of the peripheral assistant control device can be automatically extended beyond what was specifically requested by a user during configuration.
    Type: Grant
    Filed: March 10, 2022
    Date of Patent: February 6, 2024
    Assignee: GOOGLE LLC
    Inventors: Tomer Amarilio, Yuzhao Ni, Bryan Allen, Norbert Tydingco, Will Donnelly, Feng Yuan, Nathaniel Nesiba, Anurag Jain, Jacky Cheung, Ronghui Zhu, Chunya Hua, Gregory Kielian
  • Patent number: 11887612
    Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
    Type: Grant
    Filed: August 25, 2022
    Date of Patent: January 30, 2024
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jin Woo Hong, Jeongil Seo, Chieteuk Ahn, Hochong Park, Young-Cheol Park
  • Patent number: 11886820
    Abstract: A method and system are provided for training a machine-learning (ML) system/module and to provide an ML model. In one embodiment, a method includes using a labeled entities set to train a machine learning (ML) system, to obtain an ML model, and using the trained ML model to predict labels for entities in an unlabeled entities set, yielding a machine-labeled entities set. One or more individual ML models may be trained and used in this way, where each individual ML model corresponds to a respective document source. The document sources can be identified via classification of a corpus of documents. The prediction of labels provides a respective confidence score for each machine-labeled entity. The method also includes selecting from the machine-labeled entities set, a subset of machine-labeled entities having a respective confidence score at least equal to a threshold confidence score; and updating the labeled entities set by adding thereto the selected subset of machine-labeled entities.
    Type: Grant
    Filed: October 6, 2020
    Date of Patent: January 30, 2024
    Assignee: Genpact Luxembourg S.à r.l. II
    Inventors: Sreekanth Menon, Prakash Selvakumar, Sudheesh Sudevan
  • Patent number: 11881218
    Abstract: Prevention of voice misappropriation in voice interaction/response systems. The system relies on telemetry data, including thermal data of components to determine whether a received voice command was made by actual voice. If the voice command is determined to have been made by an actual voice, a response to the command is generated and transmitted, otherwise if the voice command is determined to have likely not been made by an actual voice (e.g., artificial means replicating a voice, such as a laser or the like), no response to the command is transmitted or action taken with respect to the command.
    Type: Grant
    Filed: July 12, 2021
    Date of Patent: January 23, 2024
    Assignee: BANK OF AMERICA CORPORATION
    Inventor: Steven Mark DiMaria
  • Patent number: 11881225
    Abstract: Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.
    Type: Grant
    Filed: January 13, 2022
    Date of Patent: January 23, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Guillaume Fuchs, Emmanuel Ravelli, Christian Neukam, Konstantin Schmidt, Conrad Benndorf, Andreas Niedermeier, Benjamin Schubert, Ralf Geiger
  • Patent number: 11868724
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating author vectors. One of the methods includes obtaining a set of sequences of words, the set of sequences of words comprising a plurality of first sequences of words and, for each first sequence of words, a respective second sequence of words that follows the first sequence of words, wherein each first sequence of words and each second sequence of words has been classified as being authored by a first author; and training a neural network system on the first sequences and the second sequences to determine an author vector for the first author, wherein the author vector characterizes the first author.
    Type: Grant
    Filed: March 14, 2022
    Date of Patent: January 9, 2024
    Assignee: GOOGLE LLC
    Inventors: Quoc V. Le, Brian Patrick Strope
  • Patent number: 11853706
    Abstract: Sentiment analysis is a task in natural language processing. The embodiments are directed to using a generative language model to extract an aspect term, aspect category and their corresponding polarities. The generative language model may be trained as a single, joint, and multi-task model. The single-task generative language model determines a term polarity from the aspect term in the sentence or a category polarity from an aspect category in the sentence. The joint-task generative language model determines both the aspect term and the term polarity or the aspect category and the category polarity. The multi-task generative language model determines the aspect term, term polarity, aspect category and category polarity of the sentence.
    Type: Grant
    Filed: September 8, 2021
    Date of Patent: December 26, 2023
    Assignee: salesforce.com, inc.
    Inventors: Ehsan Hosseini-Asl, Wenhao Liu