Patents Examined by Brian L. Albertalli
  • Patent number: 10741188
    Abstract: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.
    Type: Grant
    Filed: April 9, 2018
    Date of Patent: August 11, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
  • Patent number: 10733999
    Abstract: An apparatus for decoding an encoded multichannel signal of a current frame to obtain three or more current audio output channels is provided. A multichannel processor is adapted to select two decoded channels from three or more decoded channels depending on first multichannel parameters. Moreover, the multichannel processor is adapted to generate a first group of two or more processed channels based on the selected channels. A noise filling module is adapted to identify for at least one of the selected channels, one or more frequency bands, within which all spectral lines are quantized to zero, and to generate a mixing channel using, depending on side information, a proper subset of three or more previous audio output channels that have been decoded, and to fill the spectral lines of frequency bands, within which all spectral lines are quantized to zero, with noise generated using spectral lines of the mixing channel.
    Type: Grant
    Filed: August 17, 2018
    Date of Patent: August 4, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Nikolaus Rettelbach, Florian Schuh, Richard Fueg, Frederik Nagel
  • Patent number: 10726852
    Abstract: Methods and apparatus to perform windowed sliding transforms are disclosed. An example apparatus includes a transformer to transform a first block of time-domain samples of an input signal into a first frequency-domain representation based on a second frequency-domain representation of a second block of time-domain samples of the input signal, and a windower to apply a third frequency-domain representation of a time-domain window function to the first frequency-domain representation.
    Type: Grant
    Filed: February 19, 2018
    Date of Patent: July 28, 2020
    Assignee: The Nielsen Company (US), LLC
    Inventor: Zafar Rafii
  • Patent number: 10719668
    Abstract: A system for translation from a first human language to a second language including one or more processors and one or more non-transitory memory units coupled to said one or more processors storing computer readable program instructions, wherein the computer readable program instructions configure the one or more processors to perform the steps of: receive an input representation of information in the first language, convert the input representation of information in the first language to one or more sets of one or more marked-lemma dependency trees (MDTs), convert said one or more sets of one or more marked-MDTs to a representation of information in said second language, and output said representation of information in said second language, wherein the MDTs are represented in a mathematically-equivalent or isomorphic memory structure using one of word embeddings, sense embeddings, tree kernels, capsules, pose vectors, embeddings, and vectorizations.
    Type: Grant
    Filed: March 5, 2018
    Date of Patent: July 21, 2020
    Inventor: Graham Morehead
  • Patent number: 10714084
    Abstract: According to an example, with respect to artificial intelligence based service implementation, a voice call may be analyzed to generate voice data. The voice data may be converted to text data, which may be analyzed to identify keywords. Based on an analysis of the identified keywords, a user of a plurality of users may be identified. A user assistance flow of a plurality of user assistance flows that corresponds to a determined intent of the identified user may be ascertained. The voice call may be transferred to a digital assistant that may provide artificial intelligence based assistance to the identified user based on the user assistance flow that corresponds to the determined intent.
    Type: Grant
    Filed: February 9, 2018
    Date of Patent: July 14, 2020
    Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED
    Inventors: Edward Engles, John D. Bolze
  • Patent number: 10699066
    Abstract: A computer system associates one or more actions with an emoji. The computer system detects a selection of the emoji within an electronic communication by a user. In response to the detecting the selection of the emoji within the electronic communication, the computer system initiates performance of at least one action of the one or more actions based on determining that one or more contextual factors associated with the electronic communication satisfy a set of conditions associated with the at least one action.
    Type: Grant
    Filed: August 20, 2018
    Date of Patent: June 30, 2020
    Assignee: PayPal, Inc.
    Inventors: Raoul Johnson, Christopher Harkness
  • Patent number: 10699700
    Abstract: Provided are a speech recognition training processing method and an apparatus including the same. The speech recognition training processing method includes acquiring multi-talker mixed speech sequence data corresponding to a plurality of speakers, encoding the multi-speaker mixed speech sequence data into an embedded sequence data, generating speaker specific context vectors at each frame based on the embedded sequence, generating senone posteriors for each of the speaker based on the speaker specific context vectors and updating an acoustic model by performing permutation invariant training (PIT) model training based on the senone posteriors.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: June 30, 2020
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Yanmin Qian, Dong Yu
  • Patent number: 10692488
    Abstract: A computer selects a test set of sentences from among sentences applied to train a whole sentence recurrent neural network language model to estimate the probability of likelihood of each whole sentence processed by natural language processing being correct. The computer generates imposter sentences from among the test set of sentences by substituting one word in each sentence of the test set of sentences. The computer generates, through the whole sentence recurrent neural network language model, a first score for each sentence of the test set of sentences and at least one additional score for each of the imposter sentences. The computer evaluates an accuracy of the natural language processing system in performing sequential classification tasks based on an accuracy value of the first score in reflecting a correct sentence and the at least one additional score in reflecting an incorrect sentence.
    Type: Grant
    Filed: August 23, 2019
    Date of Patent: June 23, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Yinghui Huang, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran
  • Patent number: 10692507
    Abstract: Methods and apparatus to perform windowed sliding transforms are disclosed. An example apparatus includes a transformer to transform a first block of time-domain samples of an input signal into a first frequency-domain representation based on a second frequency-domain representation of a second block of time-domain samples of the input signal, and a windower to apply a third frequency-domain representation of a time-domain window function to the first frequency-domain representation.
    Type: Grant
    Filed: February 19, 2018
    Date of Patent: June 23, 2020
    Assignee: The Nielsen Company (US), LLC
    Inventor: Zafar Rafii
  • Patent number: 10679614
    Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected vocabulary level or other vocal characteristics of an input utterance provided to an automated assistant. The estimated vocabulary level or other vocal characteristics may be used to influence various aspects of a data processing pipeline employed by the automated assistant. In some implementations, one or more tolerance thresholds associated with, for example, grammatical tolerances or vocabulary tolerances, may be adjusted based on the estimated vocabulary level or vocal characteristics of the input utterance.
    Type: Grant
    Filed: April 24, 2019
    Date of Patent: June 9, 2020
    Assignee: GOOGLE LLC
    Inventors: Pedro Gonnet Anders, Victor Carbune, Daniel Keysers, Thomas Deselaers, Sandro Feuz
  • Patent number: 10664507
    Abstract: Electronic natural language processing in a natural language processing (NLP) system, such as a Question-Answering (QA) system. A receives electronic text input, in question form, and determines a readability level indicator in the question. The readability level indicator includes at least a grammatical error, a slang term, and a misspelling type. The computer determines a readability level for the electronic text input based on the readability level indicator, and retrieves candidate answers based on the readability level.
    Type: Grant
    Filed: June 19, 2019
    Date of Patent: May 26, 2020
    Assignee: International Business Machines Corporation
    Inventors: Donna K. Byron, Devendra Goyal, Lakshminarayanan Krishnamurthy, Priscilla Santos Moraes, Michael C. Smith
  • Patent number: 10665230
    Abstract: In one embodiment, a domain-name based framework implemented in a digital assistant ecosystem uses domain names as unique identifiers for request types, requesting entities, responders, and target entities embedded in a natural language request. Further, the framework enables interpreting natural language requests according to domain ontologies associated with different responders. A domain ontology operates as a keyword dictionary for a given responder and defines the keywords and corresponding allowable values to be used for request types and request parameters. The domain-name based framework thus enables the digital assistant to interact with any responder that supports a domain ontology to generate precise and complete responses to natural language based requests.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: May 26, 2020
    Assignee: VERISIGN, INC.
    Inventors: Andrew Fregly, Burton S. Kaliski, Jr., Swapneel Sheth
  • Patent number: 10657332
    Abstract: Exemplary embodiments relate to techniques to classify or detect the intent of content written in a language for which a classifier does not exist. These techniques involve building a code-switching corpus via machine translation, generating a universal embedding for words in the code-switching corpus, training a classifier on the universal embeddings to generate an embedding mapping/table; accessing new content written in a language for which a specific classifier may not exist, and mapping entries in the embedding mapping/table to the universal embeddings. Using these techniques, a classifier can be applied to the universal embedding without needing to be trained on a particular language. Exemplary embodiments may be applied to recognize similarities in two content items, make recommendations, find similar documents, perform deduplication, and perform topic tagging for stories in foreign languages.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: May 19, 2020
    Assignee: FACEBOOK, INC.
    Inventors: Ying Zhang, Reshef Shilon, Jing Zheng
  • Patent number: 10657960
    Abstract: A dialog content is generated using information that is unique to a user and information that is not unique. The processing executed by a dialog system includes a step of identifying a person based on a dialog with a user, a step of acquiring personal information, a step of analyzing the dialog, a step of extracting an event, a step of searching for a local episode and a global episode based on the personal information and the event, a step of generating dialog data using the search result, a step of outputting a dialog, and a step of accepting user evaluation.
    Type: Grant
    Filed: July 22, 2016
    Date of Patent: May 19, 2020
    Assignee: SHARP KABUSHIKI KAISHA
    Inventors: Rei Tokunaga, Toru Ueda
  • Patent number: 10650812
    Abstract: A system for extracting verifiable entities from a user-utterance received on an automated calling service is provided. The system may include a receiver configured to receive a user-utterance, a processor and a non-transitory computer-readable media comprising computer-executable instructions. The processor may be configured to execute the instructions which, canonicalize the user-utterance into a plurality of tokens, determine the number of tokens of the user-utterance, and generate, using a sliding-window protocol, a comprehensive number of n-gram sequences from the user-utterance. The processor may be configured to process a plurality of threads of execution that may include a series of actions executed on the n-gram sequences to identify and extract verified entities from the user-utterance.
    Type: Grant
    Filed: August 13, 2018
    Date of Patent: May 12, 2020
    Assignee: Bank of America Corporation
    Inventors: Viju Kothuvatiparambil, Maruthi Z. Shanmugam, Donatus Asumu
  • Patent number: 10629213
    Abstract: Methods and apparatus to perform windowed sliding transforms are disclosed.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: April 21, 2020
    Assignee: The Nielsen Company (US), LLC
    Inventors: Zafar Rafii, Markus Cremer, Bongjun Kim
  • Patent number: 10628743
    Abstract: A robotic system for processing input, such as text data provided through a messaging system, spoken language data provided through a microphone, or any other such input data, which may function to process the input so as to be able to respond or reply to a user based on comprehension of the input sentences. An automated theorem prover (ATP) may operate as an underlying framework for the AI system that understands and responds to spoken or written statements translated into a proper format. An ATP formatter may be used to translate natural language processing (NLP) output from a NLP syntactical sentence parser into the proper format, such that the ATP system may be able to generate and populate an ontology from the NLP output. User queries may be mapped to this ontology in order to facilitate comprehension. If desired, the system may automatically populate the ontology through Internet searching.
    Type: Grant
    Filed: January 24, 2019
    Date of Patent: April 21, 2020
    Inventor: Andrew R. Kalukin
  • Patent number: 10623811
    Abstract: Systems and methods for determining whether a first electronic device detects a media item that is to be output by a second electronic device is described herein. In some embodiments, an individual may request, using a first electronic device, that a media item be played on a second electronic device. The backend system may send first audio data representing a first response to the first electronic device, along with instructions to delay outputting the first response, as well as to continue sending audio data of additional audio captured thereby. The backend system may also send second audio data representing a second response to the second electronic device along with the media item. Text data may be generated representing the captured audio, which may then be compared with text data representing the second response to determine whether or not they match.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: April 14, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventor: Dennis Francis Cwik
  • Patent number: 10621998
    Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
    Type: Grant
    Filed: August 4, 2017
    Date of Patent: April 14, 2020
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jin Woo Hong, Jeongil Seo, Chieteuk Ahn, Hochong Park, Young-cheol Park
  • Patent number: 10614821
    Abstract: The present invention is directed to apparatus and methods for decoding Higher Order Ambisonics (HOA) audio signals. HOA audio signals may be decompressed based on perceptual decoding to determine at least an HOA representation corresponding to the HOA audio signals. A rotated transform may be determined based on a rotation of a spherical sample grid. A rotated HOA representation may be determined based on the rotated transform and the HOA representation. The rotated HOA representation may be rendered to output to a loudspeaker setup.
    Type: Grant
    Filed: May 20, 2019
    Date of Patent: April 7, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Johannes Boehm, Sven Kordon, Alexander Krueger, Peter Jax