Patents Examined by Edgar X Guerra-Erazo
-
Patent number: 11990144Abstract: Non-voice data is embedded in a voice bit stream that includes frames of voice bits by selecting a frame of voice bits to carry the non-voice data, placing non-voice identifier bits in a first portion of the voice bits in the selected frame, and placing the non-voice data in a second portion of the voice bits in the selected frame. The non-voice identifier bits are employed to reduce a perceived effect of the non-voice data on audible speech produced from the voice bit stream.Type: GrantFiled: July 28, 2021Date of Patent: May 21, 2024Assignee: Digital Voice Systems, Inc.Inventor: John C. Hardwick
-
Patent number: 11983497Abstract: Privacy, protection, and de-anonymization are issues of societal importance that are implicitly at the core of several key information systems, from electronic health records to online reviews. The system and method herein allows for an identification of an author of anonymous writing based on the text and structured data, subject to practical constraints on the intruder's amount of training data and effort using Shapley values.Type: GrantFiled: November 20, 2020Date of Patent: May 14, 2024Assignee: Drexel UniversityInventors: Matthew John Schneider, Shawn Mankad
-
Patent number: 11979960Abstract: Disclosed herein are example techniques to provide contextual information corresponding to a voice command. An example implementation may involve receiving voice data indicating a voice command, receiving contextual information indicating a characteristic of the voice command, and determining a device operation corresponding to the voice command. Determining the device operation corresponding to the voice command may include identifying, among multiple zones of a media playback system, a zone that corresponds to the characteristic of the voice command, and determining that the voice command corresponds to one or more particular devices that are associated with the identified zone. The example implementation may further involve causing the one or more particular devices to perform the device operation.Type: GrantFiled: November 17, 2021Date of Patent: May 7, 2024Assignee: Sonos, Inc.Inventors: Jonathan P. Lang, Romi Kadri, Christopher Butts
-
Patent number: 11978472Abstract: A system for processing and presenting a conversation includes a sensor, a processor, and a presenter. The sensor is configured to capture an audio-form conversation. The processor is configured to automatically transform the audio-form conversation into a transformed conversation. The transformed conversation includes a synchronized text, wherein the synchronized text is synchronized with the audio-form conversation. The presenter is configured to present the transformed conversation including the synchronized text and the audio-form conversation. The presenter is further configured to present the transformed conversation to be navigable, searchable, assignable, editable, and shareable.Type: GrantFiled: March 23, 2021Date of Patent: May 7, 2024Assignee: Otter.ai, Inc.Inventors: Yun Fu, Simon Lau, Kaisuke Nakajima, Julius Cheng, Gelei Chen, Sam Song Liang, James Mason Altreuter, Kean Kheong Chin, Zhenhao Ge, Hitesh Anand Gupta, Xiaoke Huang, James Francis McAteer, Brian Francis Williams, Tao Xing
-
Patent number: 11935526Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.Type: GrantFiled: September 10, 2020Date of Patent: March 19, 2024Assignee: Spotify ABInventors: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
-
Patent number: 11907674Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.Type: GrantFiled: September 20, 2023Date of Patent: February 20, 2024Assignee: GOOGLE LLCInventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Ágoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
-
Patent number: 11900945Abstract: An information processing method, a system, an apparatus, an electronic device and a storage medium, where the method is applied to a client, and includes: receiving a transcript and a sentence identifier of the transcript sent by a service server; reading a local sentence identifier, and when the received sentence identifier is the same as the local sentence identifier, updating a displayed caption content corresponding to the local sentence identifier with the transcript. When the received sentence identifier of the client is the same as the local sentence identifier, the displayed caption content is replaced with the received transcript.Type: GrantFiled: March 21, 2022Date of Patent: February 13, 2024Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.Inventors: Li Zhao, Xiao Han, Kojung Chen, Jian Tong
-
Patent number: 11886825Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure generate a word embedding for each word of an input phrase, wherein the input phrase indicates a sentiment toward an aspect term, compute a gate vector based on the aspect term, identify a dependency tree representing relations between words of the input phrase, generate a representation vector based on the dependency tree and the word embedding using a graph convolution network, wherein the gate vector is applied to a layer of the graph convolution network, and generate a probability distribution over a plurality of sentiments based on the representation vector.Type: GrantFiled: March 31, 2021Date of Patent: January 30, 2024Assignee: ADOBE, INC.Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
-
Patent number: 11868432Abstract: A method for extracting a kansei adjective of a product based on principal component analysis and explanation (PCA-E) includes constructing a product kansei evaluation vector matrix through original kansei adjectives; performing dimensionality reduction through PCA; and determining, based on principal component load factors, kansei adjectives representing principal components. In this way, the kansei adjectives extracted are explanatory to help users understand the selected kansei adjectives and make accurate evaluation.Type: GrantFiled: June 2, 2023Date of Patent: January 9, 2024Assignee: SICHUAN UNIVERSITYInventors: Wu Zhao, Xin Guo, Miao Yu, Kai Zhang, Wei Jiang, Chong Jiang, Bing Lai, Yiwei Jiang, Jun Li, Bo Wu, Xingyu Chen
-
Patent number: 11869635Abstract: A system for activating a cued health assessment, which includes an audio receiver for receiving voice samples to measure one of a plurality of voice biomarkers, an audio processing module for extracting one of a plurality of biomarkers from the received voice samples, the audio processing module further classifies the received voice samples to one of plurality of predetermined health states according to the extracted biomarkers, and a voice sample scheduler for activating a cued health assessment module when the classified health state is a clinically actionable health state.Type: GrantFiled: April 12, 2021Date of Patent: January 9, 2024Assignee: Sonde Health, Inc.Inventors: James D. Harper, Michael Chen
-
Patent number: 11853702Abstract: Generate, for each of the words of a common vocabulary of first and second text corpora, a first word embedding vector in the first text corpus and a second word embedding vector in the second text corpus. Generate, for each word in a random sample of non-landmark words, an artificially shifted word embedding vector by modifying the first word embedding vector for that word. Train a machine learning classifier to predict whether an artificial shift has been injected for a given word, based on the artificially shifted word embedding vector and the second word embedding vector for the given word. Predict semantic shifts for at least a plurality of the words of the common vocabulary by providing the first word embedding vectors and the second word embedding vectors for at least the plurality of the words of the common vocabulary as input to the trained machine learning classifier.Type: GrantFiled: January 29, 2021Date of Patent: December 26, 2023Assignees: International Business Machines Corporation, RENSSELAER POLYTECHNIC INSTITUTEInventors: Pin-Yu Chen, Maurício Gruppi, Sibel Adali
-
Patent number: 11854543Abstract: A method for receiving processed information at a remote device is described. The method includes transmitting from the remote device a verbal request to a first information provider and receiving a digital message from the first information provider in response to the transmitted verbal request. The digital message includes a symbolic representation indicator associated with a symbolic representation of the verbal request and data used to control an application. The method also includes transmitting, using the application, the symbolic representation indicator to a second information provider for generating results to be displayed on the remote device.Type: GrantFiled: June 15, 2021Date of Patent: December 26, 2023Assignee: Google LLCInventors: Gudmundur Hafsteinsson, Michael J. LeBeau, Natalia Marmasse, Sumit Agarwal, Dipchand Nishar
-
Patent number: 11842723Abstract: A device may receive audio data based on a capturing of sounds associated with a structure. The device may obtain a model associated with the structure. The model may have been trained to receive the audio data as input, determine a score that identifies a likelihood that a sound is present in the audio data, and identify the sound based on the score. The device may determine at least one parameter associated with the sound. The device may generate a metric based on the at least one parameter associated with the sound, and perform an action based on the metric.Type: GrantFiled: April 12, 2021Date of Patent: December 12, 2023Assignee: Capital One Services, LLCInventors: Michael Mossoba, Joshua Edwards, Abdelkadar M'hamed Benkreira, Austen Novis, Sophie Bermudez
-
Patent number: 11836415Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative to the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Customized voice communications are generated based on receiving input, such as a user query and/or a media track change indication.Type: GrantFiled: November 20, 2020Date of Patent: December 5, 2023Assignee: Spotify ABInventors: Emma-Camelia Gosu, Johan Oskarsson, Daniel Bromand
-
Patent number: 11829727Abstract: Approaches for cross-lingual regularization for multilingual generalization include a method for training a natural language processing (NLP) deep learning module. The method includes accessing a first dataset having a first training data entry, the first training data entry including one or more natural language input text strings in a first language; translating at least one of the one or more natural language input text strings of the first training data entry from the first language to a second language; creating a second training data entry by starting with the first training data entry and substituting the at least one of the natural language input text strings in the first language with the translation of the at least one of the natural language input text strings in the second language; adding the second training data entry to a second dataset; and training the deep learning module using the second dataset.Type: GrantFiled: April 23, 2021Date of Patent: November 28, 2023Assignee: salesforce.com, inc.Inventors: Jasdeep Singh, Nitish Shirish Keskar, Bryan McCann
-
Patent number: 11817013Abstract: A display apparatus and a method for questions and answers includes a display unit includes an input unit configured to receive user's speech voice; a communication unit configured to perform data communication with an answer server; and a processor configured to create and display one or more question sentences using the speech voice in response to the speech voice being a word speech, create a question language corresponding to the question sentence selected from among the displayed one or more question sentences, transmit the created question language to the answer server via the communication unit, and, in response to one or more answer results related to the question language being received from the answer server, display the received one or more answer results. Accordingly, the display apparatus may provide an answer result appropriate to a user's question intention although a non-sentence speech is input.Type: GrantFiled: November 13, 2020Date of Patent: November 14, 2023Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Eun-sang Bak
-
Patent number: 11817104Abstract: A user device (e.g., voice assistant device, voice enabled device, smart device, computing device, etc.) may receive/detect audio content (e.g., speech, etc.) that includes a wake word and/or words similar to a wake word. The user device may require a wake word, a portion of the wake word, or words similar to the wake word to be detected prior to interacting with a user. The user device may, based on characteristics of the audio content, determine if the audio content originates from an authorized user. The user device may decrease and/or increase scrutiny applied to wake word detection based on whether audio content originates from an authorized user.Type: GrantFiled: February 26, 2021Date of Patent: November 14, 2023Assignee: Comcast Cable Communications, LLCInventors: Hans Sayyadi, Nima Bina
-
Patent number: 11817078Abstract: A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.Type: GrantFiled: June 2, 2023Date of Patent: November 14, 2023Assignee: VOCOLLECT, INC.Inventors: James Hendrickson, Debra Drylie Stiffey, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
-
Patent number: 11817092Abstract: In one example, a method includes receiving audio data generated by one or more microphones of a computing device, the audio data representing a spoken utterance; identifying, based on the audio data, a user that provided the spoken utterance; identifying, based on the audio data, an automation action associated with one or more automation devices, the automation action corresponding to the spoken utterance; determining whether the identified user is authorized to cause performance of the identified automation action; and responsive to determining that the identified user is authorized to cause performance of the identified automation action, causing the one or more automation devices to perform the identified automation action.Type: GrantFiled: December 2, 2020Date of Patent: November 14, 2023Assignee: GOOGLE LLCInventors: Yuzhao Ni, David Roy Schairer
-
Patent number: 11809826Abstract: For assertion detection from clinical text in a medical system, a model, such as a neural network, is trained to operate on multi-labeled clinical text. Using multi-task learning, both the scope and the class losses are minimized. As a result, a machine learning model can predict both the scope and class of clinical text for a patient where the clinical text is not limited to one class or a particular length.Type: GrantFiled: November 17, 2020Date of Patent: November 7, 2023Assignee: Siemens Healthcare GmbHInventors: Rajeev Bhatt Ambati, Oladimeji Farri, Ramya Vunikili