Patents Examined by Yogeshkumar Patel
  • Patent number: 11900942
    Abstract: A software-based system and method that provides a generalized scheme to voice-enable text-oriented chatbots. The system can be configured to adapt to a plurality of different types of chatbots, a plurality of different speech-to-text and text-to-speech services, a plurality of different grammars, and even a plurality of different languages. The system can further be configured to handle “HTTP complex” situations such as electronic calendars by automatically analyzing these HTTP complex situations into various sub-dialogs, which the system can then automatically use to communicate with users, and then present the final results back to the chatbot. These methods enable organizations to preserve their extensive investment in legacy chatbots while rapidly and relatively inexpensively providing voice functionality to a broader range of users.
    Type: Grant
    Filed: December 1, 2021
    Date of Patent: February 13, 2024
    Assignee: Interactive Media S.p.A.
    Inventors: Livio Pugliese, Roberto Marega, Alberto Navatta
  • Patent number: 11900918
    Abstract: The present disclosure provides a method for training a linguistic model, related to the fields of speech, natural language processing, and deep learning technologies. A method includes: obtaining grammars corresponding to a plurality of sample texts and a slot value of a slot in each grammar by using semantic analysis; generating a grammar graph corresponding to each grammar based on the corresponding grammar and the slot value of the slot in the corresponding grammar; obtaining a weight of each grammar, a weight of each slot, and a weight of each slot value in each grammar graph based on the sample texts; determining at least one grammar frequency of each order based on the weight of each grammar, the weight of each slot, and the weight of each slot value in each grammar graph; and training the linguistic model based on the at least one grammar frequency of each order.
    Type: Grant
    Filed: October 19, 2021
    Date of Patent: February 13, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: Liao Zhang, Zhengxiang Jiang, Xiaoyin Fu
  • Patent number: 11900934
    Abstract: A method and apparatus for generating a new function of a voice agent, wherein usage logs of users of the voice agent may be analyzed to extract a set of utterances of the users with respect to a new function of the voice agent, and proto capsules for the set of utterances are provided. Based on the set of utterances, ranks of importance of the proto capsules may be determined, a vocabulary of a proto capsule having a higher rank than a preset criterion may be identified, and a source code stub for a new function of the voice agent corresponding to the proto capsule having the higher rank may be generated based on the identified vocabulary.
    Type: Grant
    Filed: October 5, 2021
    Date of Patent: February 13, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Marcin Walas, Pawel Kubiak, Wojciech Szmyd, Bozena Lukasiak
  • Patent number: 11887605
    Abstract: A method including searching, on the basis of a voiceprint feature of a speaker, for an identifier of the speaker in a speaker registry, the voiceprint feature of the speaker being a parameter obtained according to a voice signal of the speaker captured by a microphone array; if position information corresponding to the identifier of the speaker in the speaker registry is different from position information of the speaker, updating the speaker registry, the position information of the speaker being a parameter obtained according to the voice signal of the speaker captured by the microphone array; and labeling the voice signal of the speaker with the identifier of the speaker, so as to track the speaker. The present disclosure enables voice tracking of multiple persons.
    Type: Grant
    Filed: February 26, 2021
    Date of Patent: January 30, 2024
    Assignee: Alibaba Group Holding Limited
    Inventors: Gang Liu, Yunfeng Xu, Tao Yu, Zhang Liu
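    The registry update described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the cosine-similarity matching, the 0.9 threshold, the single-angle position, the `SpeakerRegistry` class, and the choice to register an unmatched voiceprint as a new speaker are all assumptions made for the example.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class SpeakerRegistry:
    """Hypothetical registry mapping speaker identifiers to voiceprint and position."""

    def __init__(self, match_threshold=0.9):
        self.match_threshold = match_threshold  # assumed similarity cutoff
        self.entries = {}  # identifier -> {"voiceprint": [...], "position": float}
        self._next_id = 1

    def find(self, voiceprint):
        """Return the identifier of the best-matching registered speaker, or None."""
        best_id, best_score = None, self.match_threshold
        for speaker_id, entry in self.entries.items():
            score = cosine(voiceprint, entry["voiceprint"])
            if score >= best_score:
                best_id, best_score = speaker_id, score
        return best_id

    def track(self, voiceprint, position):
        """Look up the speaker, update a stale position, and return the label."""
        speaker_id = self.find(voiceprint)
        if speaker_id is None:
            # Assumption: an unknown voiceprint is registered as a new speaker.
            speaker_id = f"spk{self._next_id}"
            self._next_id += 1
            self.entries[speaker_id] = {
                "voiceprint": list(voiceprint),
                "position": position,
            }
        elif self.entries[speaker_id]["position"] != position:
            # The registered position differs from the observed one: update it.
            self.entries[speaker_id]["position"] = position
        return speaker_id
```

    Calling `track` on each captured voice segment returns the identifier used to label that segment, so a speaker who moves keeps the same label while the stored position follows them.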
  • Patent number: 11886817
    Abstract: An electronic apparatus is disclosed.
    Type: Grant
    Filed: July 27, 2021
    Date of Patent: January 30, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Hyojung Han, Sathish Indurthi, Beomseok Lee, Mohd Abbas Zaidi, Nikhil Kumar
  • Patent number: 11882385
    Abstract: A method including: establishing connections, at a server, to at least two client devices using a call control protocol, the call control protocol negotiating video formats and connection information for sending and receiving media streams; receiving information from a first client at the server, the information comprising meta-data describing different media streams the first client is configured to transmit; transmitting the information received from the first client to the at least one other client; receiving a subscribe message from the at least one other client at the server, subscribing to at least one available media stream from the first client; in response to receiving at least one subscribe message from the at least one other client, transmitting, by the server, a message instructing the first client to start transmitting media streams subscribed to by the at least one other client.
    Type: Grant
    Filed: February 16, 2023
    Date of Patent: January 23, 2024
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Espen Berger, Pascal Bühler, Jan Asle Kroknes
  • Patent number: 11881206
    Abstract: Systems and methods for spatially emulating a sound source. An apparatus includes a microphone array including microphones; and a sound profiler communicatively connected to the microphone array, the sound profiler including a processing circuitry and a memory which contains instructions that, when executed by the processing circuitry, configure the apparatus to: generate synthesized audio based on sound beam metadata, a sound profile, and target listener location data, wherein the sound beam metadata includes timed sound beams defining a directional dependence of a spatial sound wave, wherein the sound profile includes timed sound coefficients determined based on audio signals captured in a space, wherein the target listener location data includes a position and an orientation, wherein the synthesized audio emulates sound that would be heard by a listener at the position and orientation of the target listener location data; and provide the synthesized audio for projection.
    Type: Grant
    Filed: May 6, 2022
    Date of Patent: January 23, 2024
    Assignee: INSOUNDZ LTD.
    Inventors: Ron Ziv, Tomer Goshen, Emil Winebrand, Yadin Aharoni
  • Patent number: 11875762
    Abstract: An effect imparting device, control method and non-transitory computer readable medium are provided. This effect imparting device has: a plurality of effect units that provide effects to a sound that has been input; a storage part for storing a plurality of patches each including a collection of parameters to be applied to the effect units; an input part for receiving a designation of a patch; an application part for applying the parameters included in the designated patch to the effect units; an output part for outputting the sound to which effects have been provided in accordance with the parameters applied to the effect units; and a muting means for temporarily muting the effect-provided sound to be outputted when the effect units include an effect unit in which the type of an effect is changed through changing of the designated patch.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: January 16, 2024
    Assignee: Roland Corporation
    Inventor: Yukio Shigeno
  • Patent number: 11868386
    Abstract: One aspect of the present disclosure relates to a method of sentiment analysis based on ambiguity analysis, which includes analyzing information with the sentiment analysis models and the ambiguity analysis models. Another aspect of the present disclosure relates to a method of training the sentiment analysis models and ambiguity analysis models, which includes acquiring information, constructing lexicons, conducting sentiment analysis and ambiguity analysis with said lexicons, acquiring a corpus, and training models, etc. Another aspect of the present disclosure relates to a system of sentiment analysis, which includes input and output modules, acquisition modules, processing modules and a database.
    Type: Grant
    Filed: September 28, 2022
    Date of Patent: January 9, 2024
    Assignee: HITHINK ROYALFLUSH INFORMATION NETWORK CO., LTD.
    Inventors: Zheng Yi, Wei Xia
  • Patent number: 11863954
    Abstract: The present disclosure generally relates to user interfaces for electronic audio devices. In some examples, the operating mode of the device changes to various states of sound transparency.
    Type: Grant
    Filed: October 15, 2021
    Date of Patent: January 2, 2024
    Assignee: Apple Inc.
    Inventors: David Chance Graham, Patrick L Coffman, Thomas S. Hulbert, Cyrus Daniel Irani, Daniel Max Strongwater
  • Patent number: 11862192
    Abstract: The disclosure provides technology for enhancing the ability of a computing device to detect when a user has discontinued reading a text source. An example method includes receiving audio data comprising a spoken word associated with a text source, wherein the audio data comprises a first duration and a second duration; comparing the audio data with data of the text source, wherein the first duration of the audio data corresponds with the data of the text source; calculating, by a processing device, a correspondence measure between the second duration of the audio data and the data of the text source; and responsive to determining the correspondence measure satisfies a threshold, transmitting a signal to cease comparing audio data with the data of the text source.
    Type: Grant
    Filed: August 27, 2018
    Date of Patent: January 2, 2024
    Assignee: Google LLC
    Inventors: Chaitanya Gharpure, Evan Fisher, Eric Liu, Peng Yang, Emily Hou, Victoria Fang
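    One way to picture the threshold test in this abstract is shown below. It is only a sketch under stated assumptions: the audio is taken to be already transcribed into words, the correspondence measure is a simple word-overlap fraction, and "satisfies a threshold" is read as "falls below 0.5". The function names `correspondence` and `should_stop_following` are hypothetical and do not come from the patent.

```python
def correspondence(spoken_words, text_words):
    """Fraction of spoken words that also occur in the text source (0.0 to 1.0)."""
    if not spoken_words:
        return 0.0
    vocabulary = {word.lower() for word in text_words}
    hits = sum(1 for word in spoken_words if word.lower() in vocabulary)
    return hits / len(spoken_words)


def should_stop_following(spoken_words, text_words, threshold=0.5):
    """Signal that comparison should cease when correspondence drops below threshold.

    Assumption: low correspondence means the user has stopped reading the text.
    """
    return correspondence(spoken_words, text_words) < threshold


text = "the quick brown fox jumps over the lazy dog".split()
first_duration = "the quick brown fox".split()     # user is reading the text
second_duration = "can you pass the salt".split()  # user has drifted away

print(should_stop_following(first_duration, text))
print(should_stop_following(second_duration, text))
```

    A real system would compare acoustic or phonetic data rather than exact words, so the overlap fraction here stands in for whatever measure the implementation actually uses.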
  • Patent number: 11854576
    Abstract: A headset that can detect the voice activity of a user includes an inner microphone generating an inner microphone signal; an outer microphone generating an outer microphone signal, wherein the inner microphone and outer microphone are positioned such that, when the headset is worn by a user, the inner microphone is disposed nearer to the user's head; and a voice-activity detector determining a sign of a phase difference between the inner microphone signal and the outer microphone signal and generating a voice activity detection signal representing a user's voice activity when the sign of the phase difference indicates that the outer microphone received an audio signal after the inner microphone received the audio signal.
    Type: Grant
    Filed: August 25, 2021
    Date of Patent: December 26, 2023
    Assignee: Bose Corporation
    Inventor: Dale McElhone
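    The arrival-order test in this abstract can be illustrated with a small time-domain sketch. It substitutes a cross-correlation lag for the sign of the phase difference, which is an assumption made for simplicity and not the claimed detector; `best_lag`, `own_voice_active`, and the `max_lag` search range are hypothetical names and values.

```python
def best_lag(inner, outer, max_lag=4):
    """Lag in samples at which outer best matches inner.

    A positive result means the outer microphone received the sound after the
    inner microphone. Returns 0 when no lag gives a positive correlation.
    """
    n = len(inner)
    best, best_score = 0, 0.0
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += inner[i] * outer[j]
        if score > best_score:
            best, best_score = lag, score
    return best


def own_voice_active(inner, outer, max_lag=4):
    """Report voice activity when the outer signal trails the inner signal."""
    return best_lag(inner, outer, max_lag) > 0


pulse = [0, 0, 0, 1, 2, 1, 0, 0, 0, 0, 0, 0]
delayed = [0, 0, 0, 0, 0, 1, 2, 1, 0, 0, 0, 0]  # same pulse, two samples later

print(own_voice_active(inner=pulse, outer=delayed))  # inner leads: wearer's voice
print(own_voice_active(inner=delayed, outer=pulse))  # outer leads: external sound
```

    The intuition is that the wearer's own voice reaches the inner microphone first, while ambient sound reaches the outer microphone first, so the sign of the delay alone separates the two cases.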
  • Patent number: 11854567
    Abstract: One example includes a digital twin of a microphone array. The digital twin acts as a digital copy of a physical microphone array. The digital array allows the microphone array to be analyzed, simulated and optimized. Further, the microphone array can be optimized for performing sound quality operations such as noise suppression and speech intelligibility.
    Type: Grant
    Filed: October 22, 2021
    Date of Patent: December 26, 2023
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Danqing Sha, Amy N. Seibel, Eric Bruno, Zhen Jia
  • Patent number: 11848030
    Abstract: Some examples include a computing device that receives media content to distribute to a plurality of electronic devices. The computing device may receive an indication of first data to relate to the media content for distribution to the plurality of electronic devices. A portion of the multimedia content may be decoded to enable a determination that the media content already has second data embedded in the media content. A psychoacoustic mask may be extracted from the media content and subtracted from the received media content to remove the embedded second data. The first data may be associated with the media content by either embedding third data in the media content, or by embedding the first data in the media content.
    Type: Grant
    Filed: November 13, 2020
    Date of Patent: December 19, 2023
    Assignee: ADORI LABS, INC.
    Inventors: Viswanathan Iyer, Kartik Parija
  • Patent number: 11842121
    Abstract: Some demonstrative embodiments include apparatuses, systems and methods of sound control. For example, an apparatus may be configured to process one or more audio inputs to be heard in one or more personal sound zones, and a plurality of monitoring inputs, wherein the plurality of monitoring inputs represent acoustic sound at a plurality of predefined monitoring sensing locations, which are defined within the one or more personal sound zones; determine a sound control pattern based on the one or more audio inputs, and the plurality of monitoring inputs, the sound control pattern comprising a plurality of sound control signals configured to drive a respective plurality of acoustic transducers such that the one or more audio inputs are to be heard in the one or more personal sound zones; and output the plurality of sound control signals to the plurality of acoustic transducers.
    Type: Grant
    Filed: May 1, 2022
    Date of Patent: December 12, 2023
    Assignee: SILENTIUM LTD.
    Inventors: Tzvi Fridman, Ziv Hermon, Yoel Naor, Yuval Serfaty
  • Patent number: 11842277
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes receiving a current observation characterizing a current state of the environment as of the time step; generating an embedding of the current observation; processing scene memory data comprising embeddings of prior observations received at prior time steps using an encoder neural network, wherein the encoder neural network is configured to apply an encoder self-attention mechanism to the scene memory data to generate an encoded representation of the scene memory data; processing the encoded representation of the scene memory data and the embedding of the current observation using a decoder neural network to generate an action selection output; and causing the agent to perform the selected action.
    Type: Grant
    Filed: September 26, 2022
    Date of Patent: December 12, 2023
    Assignee: Google LLC
    Inventors: Kuan Fang, Alexander Toshkov Toshev
  • Patent number: 11832076
    Abstract: An electronic device includes a display and one or more processors. The display displays, when a person is at a location, a virtual image at a sound localization point (SLP) in empty space. The one or more processors process sound with a room impulse response (RIR) for the location in order to generate binaural sound that externally localizes at the SLP in empty space in response to the person being at the location.
    Type: Grant
    Filed: November 5, 2021
    Date of Patent: November 28, 2023
    Inventors: Philip Scott Lyren, Glen A. Norris
  • Patent number: 11822897
    Abstract: Approaches for the translation of structured text include an embedding module for encoding and embedding source text in a first language, an encoder for encoding output of the embedding module, a decoder for iteratively decoding output of the encoder based on generated tokens in translated text from previous iterations, a beam module for constraining output of the decoder with respect to possible embedded tags to include in the translated text for a current iteration using a beam search, and a layer for selecting a token to be included in the translated text for the current iteration. The translated text is in a second language different from the first language. In some embodiments, the approach further includes scoring and pointer modules for selecting the token based on the output of the beam module or copied from the source text or reference text from a training pair best matching the source text.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: November 21, 2023
    Assignee: salesforce.com, inc.
    Inventors: Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Anna Marshall, Caiming Xiong, Richard Socher
  • Patent number: 11817107
    Abstract: Innovations in phase quantization during speech encoding and phase reconstruction during speech decoding are described. For example, to encode a set of phase values, a speech encoder omits higher-frequency phase values and/or represents at least some of the phase values as a weighted sum of basis functions. Or, as another example, to decode a set of phase values, a speech decoder reconstructs at least some of the phase values using a weighted sum of basis functions and/or reconstructs lower-frequency phase values then uses at least some of the lower-frequency phase values to synthesize higher-frequency phase values. In many cases, the innovations improve the performance of a speech codec in low bitrate scenarios, even when encoded data is delivered over a network that suffers from insufficient bandwidth or transmission quality problems.
    Type: Grant
    Filed: July 27, 2022
    Date of Patent: November 14, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Soren Skak Jensen, Sriram Srinivasan, Koen Bernard Vos
  • Patent number: 11817112
    Abstract: A method, a device, a computer readable medium and an electronic apparatus for speech signal processing are disclosed. The method comprises: acquiring sound source position information and at least two channels of sound signals from a microphone array; suppressing, according to the sound source position information, a sound signal from the sound source direction in the at least two channels of sound signals, to obtain a noise reference signal of the microphone array; acquiring, according to the sound source position information, a sound signal from the sound source direction in the at least two channels of sound signals, to obtain a speech reference signal; removing, based on the noise reference signal, a residual noise signal in the speech reference signal to obtain a desired speech signal.
    Type: Grant
    Filed: June 21, 2021
    Date of Patent: November 14, 2023
    Assignee: Beijing Horizon Robotics Technology Research and Development Co., Ltd.
    Inventor: Yuxiang Hu