Miscellaneous Analysis Or Detection Of Speech Characteristics (epo) Patents (Class 704/E11.001)

E Subclasses

General speech analysis without concrete application (epo) (Class 704/E11.002)

Detection of presence or absence of speech signals (epo) (Class 704/E11.003)

Pitch determination of speech signals (epo) (Class 704/E11.006)

Voiced-unvoiced decision (epo) (Class 704/E11.007)

User interface for audio message

Patent number: 12265696

Abstract: The present disclosure generally relates to receiving voice input via the one or more microphones; and displaying a visual indication of the voice input, where, in accordance with a determination that a portion of the voice input corresponds to voice input that is to be transmitted to one or more devices, displaying, via the one or more display devices, the visual indication includes displaying the visual indication with a first set of one or more colors; and, in accordance with a determination that the voice input does not include an instruction to transmit any portion of the voice input to the one or more devices, displaying, via the one or more display devices, the visual indication includes displaying the visual indication with a second set of one or more colors that is different from the first set of one or more colors.

Type: Grant

Filed: October 20, 2022

Date of Patent: April 1, 2025

Assignee: Apple Inc.

Inventors: Andrew Seunghyun Kim, Patrick L. Coffman
System and method for summarizing a multimedia content item

Patent number: 12254036

Abstract: A multimedia content item is summarized based on its audio track and a desired compression budget. The audio track is extracted and processed by an automatic speech recognizer to obtain a time-aligned text transcript. The text transcript is partitioned into a plurality of segment sequences. An informativeness score based on a salience score and a diversity score is computed for each of the segments. A coherence score is also computed for the segments in the plurality of sequences. A subsequence of one of the segment sequences that optimizes for informativeness and coherence is selected for generating a new content item summarizing the multimedia content item.

Type: Grant

Filed: April 30, 2019

Date of Patent: March 18, 2025

Assignee: YAHOO ASSETS LLC

Inventor: Inderjeet Mani
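
Note on patent 12254036: the selection step in the abstract above (choose a subsequence that balances informativeness and coherence within a compression budget) can be illustrated with a small sketch. Everything here is an assumption made for illustration, not the patented method: informativeness is taken as salience plus diversity, coherence is scored only between neighbouring picks, the budget is a total duration, and the search is exhaustive, so it only suits short segment lists.

```python
from itertools import combinations


def best_summary(segments, budget_seconds, coherence):
    """Return indices of the best-scoring subsequence that fits the budget.

    segments: list of (duration_seconds, salience, diversity) tuples (hypothetical layout).
    coherence: function (i, j) -> float scoring how well segment j follows segment i.
    """
    best_score = float("-inf")
    best_pick = ()
    n = len(segments)
    for size in range(1, n + 1):
        # combinations() yields index tuples in increasing order, so the
        # original ordering of the transcript is preserved in every candidate.
        for pick in combinations(range(n), size):
            duration = sum(segments[i][0] for i in pick)
            if duration > budget_seconds:
                continue
            # Assumption: informativeness is the plain sum of salience and diversity.
            informativeness = sum(segments[i][1] + segments[i][2] for i in pick)
            # Assumption: coherence is summed over consecutive picked segments.
            coherence_total = sum(coherence(a, b) for a, b in zip(pick, pick[1:]))
            score = informativeness + coherence_total
            if score > best_score:
                best_score = score
                best_pick = pick
    return list(best_pick)
```

With three ten-second segments and a twenty-second budget, a coherence function that rewards adjacent segments tends to favour a contiguous pair over two high-salience segments that are far apart.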
Method and computing device in which semantic definitions are composed as a semantic metaset

Patent number: 12235883

Abstract: The present application discloses a method of representing semantic definitions on a computing device. Semantic definition statements are composed using operators. The semantic definition statements include semantic concept statements using semantic concept operators and semantic context statements using semantic context operators. The semantic definition statements are saved in a metaset. The metaset is converted into a digital data structure and stored in a memory storage device of a computing device. The present application further discloses a method of semantically searching for a visual using a metaset.

Type: Grant

Filed: August 12, 2022

Date of Patent: February 25, 2025

Assignee: ZRO Inc.

Inventor: Tio Seng Heng
Speech instruction recognition method, electronic device, and non-transient computer readable storage medium

Patent number: 12230275

Abstract: A speech instruction recognition method, an electronic device, and a non-transient computer readable storage medium. The speech instruction recognition method comprises: acquiring a target speech; processing the target speech to obtain a target speech vector corresponding to the target speech; performing speech recognition on the target speech to obtain a target speech text of the target speech, and processing the target speech text to obtain a target text vector corresponding to the target speech text; and inputting the target speech vector and the target text vector to a pre-trained instruction recognition model to obtain an instruction category corresponding to the target speech.

Type: Grant

Filed: January 6, 2021

Date of Patent: February 18, 2025

Assignee: BOE Technology Group Co., Ltd.

Inventor: Shaoxun Su
Speech conversion method and apparatus, storage medium, and electronic device

Patent number: 12223973

Abstract: Embodiments of the present application provide a speech conversion method and apparatus, a storage medium, and an electronic device. The method includes: acquiring a source speech to be converted and a target speech sample of a target speaker; recognizing a style category of the target speech sample, and extracting a target audio feature from the target speech sample according to the style category; extracting a source audio feature from the source speech; acquiring a first style feature of the target speech sample and determining a second style feature of the target speech sample according to the first style feature; fusing and mapping the source audio feature, the target audio feature, and the second style feature to obtain a joint encoding feature; and decoding the joint encoding feature, to obtain a target speech feature, and converting the source speech based on the target speech feature to obtain a target speech.

Type: Grant

Filed: August 9, 2024

Date of Patent: February 11, 2025

Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventors: Huapeng Sima, Ao Yao, Yiping Tang
System and method for configuring input elements of a controlling device

Patent number: 12212802

Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.

Type: Grant

Filed: November 30, 2023

Date of Patent: January 28, 2025

Assignee: Universal Electronics Inc.

Inventors: Thomas Hascher, Menno Koopmans
System and methods for neural topic modeling using topic attention networks

Patent number: 12190061

Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.

Type: Grant

Filed: December 17, 2021

Date of Patent: January 7, 2025

Assignee: ADOBE INC.

Inventors: Shashank Shailabh, Madhur Panwar, Milan Aggarwal, Pinkesh Badjatiya, Simra Shahid, Nikaash Puri, S Sejal Naidu, Sharat Chandra Racha, Balaji Krishnamurthy, Ganesh Karbhari Palwe
Method and system for controlling distributions of attributes in language models for text generation

Patent number: 12153896

Abstract: A method for generating a language model for text generation by receiving a pre-trained language model having attributes with existing probability distributions over the pre-trained language model; receiving at least one target constraint; the target constraint specifying an expectation of a target attribute over a language model that approximates the pre-trained language model; computing parameters of an energy based model by applying the target constraint to the pre-trained language model; obtaining samples from a reference policy; updating parameters of a target policy using the obtained samples and the energy based model; updating the reference policy with the target policy if the target policy is superior to the reference policy; and outputting the target policy as a target language model. The target language model is adapted to generate text with the target attribute over a probability distribution that approximates the desired probability distribution specified by the target constraint.

Type: Grant

Filed: August 2, 2021

Date of Patent: November 26, 2024

Assignee: Naver Corporation

Inventors: Marc Dymetman, Hady Elsahar, Muhammad Khalifa
Systems and methods for controlling device configuration in a networked environment

Patent number: 12149761

Abstract: A system and method uses a first device fingerprint for a set-top box (STB) installed within a home theater environment which includes an over-the-top (OTT) device to cause one of a plurality of original equipment manufacturer (OEM) remote control setup procedures to be selected for use to configure an OEM remote control for the STB, and the selected one of the plurality of OEM remote control setup procedures uses a second device fingerprint for the OTT device to cause the OEM remote control to be configured to transmit one or more commands to control functional operations of the OTT device.

Type: Grant

Filed: July 19, 2022

Date of Patent: November 19, 2024

Assignee: UNIVERSAL ELECTRONICS INC.

Inventor: Paul D. Arling
Piezo-resistive transistor based resonator with anti-ferroelectric gate dielectric

Patent number: 12125893

Abstract: Described is a resonator that uses anti-ferroelectric (AFE) materials in the gate of a transistor as a dielectric. The use of AFE increases the strain/stress generated in the gate of the FinFET. Along with the usual capacitive drive, which is boosted with the increased polarization, additional current drive is also achieved from the piezoelectric response generated due to the AFE material. In some embodiments, the acoustic mode of the resonator is isolated using phononic gratings all around the resonator using the metal line above and vias to body and dummy fins on the side. As such, a Bragg reflector is formed above or below the AFE based transistor. Increased drive signal from the AFE results in larger output signal and larger bandwidth.

Type: Grant

Filed: April 3, 2023

Date of Patent: October 22, 2024

Assignee: Intel Corporation

Inventors: Tanay Gosavi, Chia-Ching Lin, Raseong Kim, Ashish Verma Penumatcha, Uygar Avci, Ian Young
Systems and methods for modulating data objects to effect state changes

Patent number: 12120182

Abstract: Systems and methods for modulating content to effect state change are described. A state control system initiates a process for modulating output objects to effect one or more changes in a state profile associated with a user device. The system queries for historical data associated with the user device; determines whether any historical data is identified for user device and in response to determining that historical data is found predicts a current state profile associated with the user device. The system further collects real-time sensor data associated with user device; filters and normalizes the sensor data; and delivers a plurality of output objects to the user device or secondary device(s) based on real-time sensor data.

Type: Grant

Filed: July 7, 2022

Date of Patent: October 15, 2024

Assignee: Daily Rays Inc.

Inventors: Ashley Saye, Diego I. Medina-Bernal, Jonathon Nostrant
Apparatuses and methods for detecting malware

Patent number: 12099599

Abstract: Apparatuses and methods for determining if a computer program is malware and to which malware class it belongs. In the method, the behaviour of a computer program is traced by observing the activity of the program. Behaviour sequences comprising API-calls or similar activity of a computer program are then provided into a classifier for classifying the computer program. From the outcome of the classifier, a classification result and the portions relevant to the decision can be provided to a person for further confirmation.

Type: Grant

Filed: November 30, 2021

Date of Patent: September 24, 2024

Assignee: Huawei Technologies Co., Ltd.

Inventors: Moez Draief, Xiang Chen, Konstantin Kutzkov, Kevin Scaman, Milan Vojnovic
Wake word method to prolong the conversational state between human and a machine in edge devices

Patent number: 12062361

Abstract: A voice-activated system edge device cooperating with a remote command processor has a state machine defined by a listening mode state and a conversation monitoring mode state. The state machine transitions from the listening mode state to the conversation monitoring mode state in response to a wake word detection. A command accompanying the wake word is transmitted to the remote command processor for execution thereon. The conversation monitoring mode state is maintained for a conversation monitoring window time duration to receive a connection word accompanied by another command transmitted to the remote command processor for further execution thereon.

Type: Grant

Filed: November 2, 2021

Date of Patent: August 13, 2024

Assignee: AONDEVICES, INC.

Inventors: Mouna Elkhatib, Adil Benyassine, Aruna Vittal, Eli Uc, Daniel Schoch
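
Note on patent 12062361: the two-state behaviour in the abstract above can be sketched as a small state machine. This is a minimal illustration under stated assumptions, not the patented design: the class and method names are hypothetical, a list stands in for the remote command processor, the wake word is assumed to stay valid in the monitoring state, and each accepted command is assumed to restart the monitoring window.

```python
from enum import Enum


class Mode(Enum):
    LISTENING = "listening"
    CONVERSATION_MONITORING = "conversation_monitoring"


class EdgeStateMachine:
    """Toy model of the listening / conversation-monitoring states."""

    def __init__(self, wake_word, connection_words, window_seconds):
        self.wake_word = wake_word
        self.connection_words = set(connection_words)
        self.window_seconds = window_seconds
        self.mode = Mode.LISTENING
        self.window_expires_at = 0.0
        self.sent_commands = []  # stands in for the remote command processor

    def on_utterance(self, first_word, command, now):
        """Handle one utterance; return True if its command was forwarded."""
        # Fall back to listening once the monitoring window has elapsed.
        if self.mode is Mode.CONVERSATION_MONITORING and now >= self.window_expires_at:
            self.mode = Mode.LISTENING

        if self.mode is Mode.LISTENING:
            accepted = first_word == self.wake_word
        else:
            # Assumption: the wake word is still honoured while monitoring.
            accepted = first_word == self.wake_word or first_word in self.connection_words

        if accepted:
            self.sent_commands.append(command)
            self.mode = Mode.CONVERSATION_MONITORING
            # Assumption: every accepted command restarts the window.
            self.window_expires_at = now + self.window_seconds
        return accepted
```

A connection word spoken while the device is only listening is ignored; the same word spoken inside the window forwards its command without repeating the wake word.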
Speech noise reduction method and apparatus, computing device, and computer-readable storage medium

Patent number: 12057135

Abstract: This application discloses a speech noise reduction method performed by a computing device. The method includes: obtaining a noisy speech signal, the noisy speech signal including a pure speech signal and a noise signal; estimating a posteriori signal-to-noise ratio and a priori signal-to-noise ratio of the noisy speech signal; determining a speech/noise likelihood ratio in a Bark domain based on the estimated posteriori signal-to-noise ratio and the estimated priori signal-to-noise ratio; estimating a priori speech existence probability based on the determined speech/noise likelihood ratio; determining a gain based on the estimated posteriori signal-to-noise ratio, the estimated priori signal-to-noise ratio, and the estimated priori speech existence probability, the gain being a frequency domain transfer function used for converting the noisy speech signal into an estimation of the pure speech signal; and exporting the estimation of the pure speech signal from the noisy speech signal based on the gain.

Type: Grant

Filed: April 9, 2021

Date of Patent: August 6, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Xuan Ji, Meng Yu
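
Note on patent 12057135: the chain of quantities named in the abstract above (a posteriori SNR, a priori SNR, speech/noise likelihood ratio, speech existence probability, gain) can be illustrated with textbook formulas. The sketch below is not the patented method. It works per frequency bin rather than in the Bark domain, uses a maximum-likelihood a priori SNR estimate, keeps the a priori speech probability fixed instead of estimating it from the likelihood ratio, and assumes the noise power estimate is supplied by the caller. The function name and parameters are hypothetical.

```python
import math


def spectral_gain(noisy_power, noise_power, prior_speech_prob=0.5):
    """Return a per-bin gain in [0, 1] for one frame.

    noisy_power: power |Y(k)|^2 of each frequency bin of the noisy frame.
    noise_power: estimated noise power of each bin (assumed to be given).
    prior_speech_prob: fixed a priori speech presence probability, strictly
        between 0 and 1 (an assumption of this sketch).
    """
    gains = []
    for y_pow, n_pow in zip(noisy_power, noise_power):
        n_pow = max(n_pow, 1e-12)            # guard against division by zero
        post_snr = y_pow / n_pow             # a posteriori SNR
        prio_snr = max(post_snr - 1.0, 0.0)  # maximum-likelihood a priori SNR
        # Exponent of the Gaussian-model likelihood ratio, clamped to avoid overflow.
        v = min(post_snr * prio_snr / (1.0 + prio_snr), 50.0)
        likelihood_ratio = math.exp(v) / (1.0 + prio_snr)
        # Posterior speech presence probability via Bayes' rule on the odds.
        odds = likelihood_ratio * prior_speech_prob / (1.0 - prior_speech_prob)
        speech_prob = odds / (1.0 + odds)
        wiener = prio_snr / (1.0 + prio_snr)  # Wiener gain for the bin
        gains.append(speech_prob * wiener)
    return gains
```

Multiplying each bin of the noisy spectrum by the square-root-free gain above attenuates bins that look like noise and leaves high-SNR bins almost untouched; a production system would smooth these estimates over time.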
Book recommendation and flashcard generation

Patent number: 12014649

Abstract: A system and method for personalized book recommendations have been designed to align with a user's current vocabulary and facilitate sustainable vocabulary growth. By analyzing the user's vocabulary size and growth trajectory, the system can recommend books that are appropriately challenging yet comprehensible. This approach enables users to explore books of their choice more effectively, even before their vocabulary reaches a level sufficient for understanding any book in general. In addition, a vocabulary-based flashcard generator assistant system has been incorporated. This assistant system enables the user to rapidly create high-quality, personalized flashcards for optimal vocabulary acquisition. The flashcards contain content that is tailored to the individual's learning style and pace, further promoting sustainable and effective vocabulary growth.

Type: Grant

Filed: August 3, 2022

Date of Patent: June 18, 2024

Inventor: Dzmitry Kushal
Voice command handler for programming stimulation systems and methods of using

Patent number: 12002462

Abstract: A method for programming a stimulation device of a stimulation system using a programming device includes providing a set of programming commands for the programming device that include a first programming command increasing a stimulation amplitude and a second programming command decreasing the stimulation amplitude; receiving a verbal communication by a voice command handler of the programming device or in communication with the programming device; determining whether the verbal communication is a trigger word and, when the verbal communication is the trigger word, entering a triggered state, wherein, after entering the triggered state, the programming device remains in the triggered state until one of at least one stop condition is met; and, when in the triggered state, determining whether the verbal communication is one of the programming commands and, when the verbal communication is one of the programming commands, executing the one of the programming commands.

Type: Grant

Filed: October 28, 2021

Date of Patent: June 4, 2024

Assignee: Boston Scientific Neuromodulation Corporation

Inventors: Jimmy Lee Chao, Vishal Jagannathan, Eugene Mesina, Travis McCoy
Dialogue processing method and device

Patent number: 11977815

Abstract: A dialogue processing method and device are provided. The method includes: a dialogue processing device receives dialogue information from user equipment; if the dialogue information does not include slot information that corresponds to a first slot type and that can determine a service, the dialogue processing device obtains a service identifier set corresponding to the first slot type from a server, and sends the service identifier set to the user equipment; and after a target service identifier is received from the user equipment, the dialogue processing device requests a service corresponding to the target service identifier from the server, and sends execution success information to the user equipment. According to this method, a service item can be presented to a user in a timely manner, and the user can be prevented from initiating a plurality of rounds of dialogues with the dialogue processing device, thereby improving service execution efficiency and further improving use experience of the user.

Type: Grant

Filed: July 9, 2021

Date of Patent: May 7, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Hongfeng Luo, Yibo Zhang
Speech processing and multi-modal widgets

Patent number: 11966663

Abstract: Techniques for performing speech processing using multi-modal widget information are described. A system may receive input data corresponding to a user input. The system may also receive widget context data corresponding to one or more multi-modal widgets active at a device. The system may use the widget context data to perform natural language understanding (NLU) processing with respect to the user input, and for selecting a skill component for responding to the user input. The system may send a widget identifier to the skill component when invoking the skill to respond to the user input.

Type: Grant

Filed: September 29, 2021

Date of Patent: April 23, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Nhat Vu Doan, Nicholas Adam Cummings, Prashant Jayaram Thakare, Jalaj Kumar, Ganesh Prabu Ravi, Chih-Shin Wang, Narenda Gyanchandani
Information processing device, information processing method, and program

Patent number: 11947869

Abstract: Provided is an information processing device, an information processing method, and a program, the information processing device including a control unit that dynamically controls output of notification information related to a function corresponding to a gesture regarding function execution of the device based on a recognition status of an operation body that is executing the gesture in a predetermined operation region.

Type: Grant

Filed: March 23, 2020

Date of Patent: April 2, 2024

Assignee: SONY GROUP CORPORATION

Inventors: Kei Takahashi, Junichi Shimizu, Junichi Nagahara, Manabu Fujiki, Tomohiro Imura, Keiichi Kitahara
Word embedding with disentangling prior

Patent number: 11947908

Abstract: Described herein are system and method embodiments to improve word representation learning. Embodiments of a probabilistic prior may seamlessly integrate statistical disentanglement with word embedding. Different from previous deterministic methods, word embedding may be taken as a probabilistic generative model, and it enables imposing a prior that may identify independent factors generating word representation vectors. The probabilistic prior not only enhances the representation of word embedding, but also improves the model's robustness and stability. Furthermore, embodiments of the disclosed method may be flexibly plugged in various word embedding models. Extensive experimental results show that embodiments of the presented method may improve word representation on different tasks.

Type: Grant

Filed: April 7, 2021

Date of Patent: April 2, 2024

Assignee: Baidu USA LLC

Inventors: Shaogang Ren, Ping Li
Audio processing for detecting occurrences of loud sound characterized by brief audio bursts

Patent number: 11922968

Abstract: A boundary of a highlight of audiovisual content depicting an event is identified. The audiovisual content may be a broadcast, such as a television broadcast of a sporting event. The highlight may be a segment of the audiovisual content deemed to be of particular interest. Audio data for the audiovisual content is stored, and the audio data is automatically analyzed to detect one or more audio events indicative of one or more occurrences to be included in the highlight. Each audio event may be a brief, high-energy audio burst such as the sound made by a tennis serve. A time index within the audiovisual content, before or after the audio event, may be designated as the boundary, which may be the beginning or end of the highlight.

Type: Grant

Filed: February 25, 2022

Date of Patent: March 5, 2024

Assignee: STATS LLC

Inventors: Mihailo Stojancic, Warren Packard
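
Note on patent 11922968: detecting a brief, high-energy audio burst and placing a highlight boundary relative to it, as the abstract above describes, can be illustrated with short-time energy thresholding. This is a generic sketch, not the patented analysis. The function names, the median-based threshold, the default ratio, and the fixed time offset are all assumptions chosen for the example.

```python
def detect_bursts(samples, frame_size, ratio=8.0):
    """Return indices of frames whose mean energy exceeds `ratio` times the median."""
    energies = []
    # Split the signal into non-overlapping frames and measure mean energy.
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(sum(x * x for x in frame) / frame_size)
    if not energies:
        return []
    ordered = sorted(energies)
    median = ordered[len(ordered) // 2]
    floor = max(median, 1e-12)  # avoid a zero threshold on silent input
    return [i for i, energy in enumerate(energies) if energy > ratio * floor]


def highlight_boundary(burst_frame, frame_size, sample_rate, offset_seconds):
    """Place a boundary a fixed offset before (negative) or after (positive) a burst."""
    burst_time = burst_frame * frame_size / sample_rate
    return max(0.0, burst_time + offset_seconds)
```

For a quiet signal with one loud 100-sample stretch, `detect_bursts` flags only the frame containing that stretch, and `highlight_boundary` converts the frame index into a time index for the start or end of a highlight.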
Media unit retrieval and related processes

Patent number: 11921775

Abstract: Media unit retrieval methods, systems and computer program products are provided that allow a user to search for an item by iteratively presenting media units such as images representing items to the user and receiving user input consisting of selections of the presented media units (including possibly the empty selection). Features, or attributes, a user is interested in, for example semantic features, are inferred from the interaction and media units are retrieved for presentation based on similarity with user-selected media units, through sampling of a probability distribution describing the intent or interests, or combinations of approaches. Accordingly, the user-experience is akin to a conversation about what the user is looking for. Retrieval may be based on both selected and unselected media units and the selection may comprise making a selection with a single action. Further, a database of media units can capture similarity relationships for efficient media unit retrieval.

Type: Grant

Filed: December 9, 2022

Date of Patent: March 5, 2024

Assignee: DREAM IT GET IT LIMITED

Inventors: Michael Elkaim, Michael Kopp, Kristjan Korjus
Systems and methods for utility-preserving deep reinforcement learning-based text anonymization

Patent number: 11907666

Abstract: Various embodiments of a system and associated method for anonymization of text without losing semantic utility of text by extracting a latent embedding representation of content with respect to a given task and by learning an optimal strategy for text embedding manipulation to satisfy both privacy and utility requirements are disclosed herein. In particular, the system balances private attribute obfuscation with retained semantic utility.

Type: Grant

Filed: November 16, 2021

Date of Patent: February 20, 2024

Assignee: Arizona Board of Regents on Behalf of Arizona State University

Inventors: Ahmadreza Mosallanezhad, Ghazaleh Beigi, Huan Liu
System and method for configuring input elements of a controlling device

Patent number: 11889142

Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.

Type: Grant

Filed: December 9, 2022

Date of Patent: January 30, 2024

Assignee: Universal Electronics Inc.

Inventors: Thomas Hascher, Menno Koopmans
Voice enablement and disablement of speech processing functionality

Patent number: 11887590

Abstract: Methods and devices for enabling and disabling applications using voice are described herein. In some embodiments, an individual may speak an utterance to their electronic device, which may send audio data representing the utterance to a backend system. The backend system may generate text data representing the utterance, and may determine that an intent of the utterance was for an application to be enabled or disabled for their user account on the backend system. If, for instance, the intent was to enable the application, the backend system may receive one or more rules for performing functionalities of the application, as well as one or more sample templates of sample utterances and sample responses that future utterances may use when requesting the application. Furthermore, one or more invocation phrases that may be used within the future utterances to invoke the application may be received, along with slot values for the sample templates.

Type: Grant

Filed: September 24, 2020

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Shaman D'Souza, Ian Suttle, Srikanth Nori, Rajiv Reddy, Amol Kanitkar, Tina Orooji
System and method for social learning utilizing user devices

Patent number: 11830380

Abstract: Methods, systems and computer program products for automated learning are provided herein. A computer-implemented method includes authenticating a plurality of users for an automated learning session, wherein the plurality of users correspond to at least one device, and providing the automated learning session for the plurality of users. Providing the automated learning session comprises analyzing a plurality of learning models corresponding to one or more of the plurality of users, determining, based on the analysis, one or more activities to be performed by the plurality of users during the automated learning session, and executing the one or more activities on at least one device.

Type: Grant

Filed: January 10, 2019

Date of Patent: November 28, 2023

Assignee: International Business Machines Corporation

Inventors: Smitkumar Narotambhai Marvaniya, Tejas Indulal Dhamecha, Malolan Chetlur, Renuka Sindhgatta, Bikram Sengupta
Device, method, and program for analyzing speech signal

Patent number: 11798579

Abstract: A parameter included in a fundamental frequency pattern of a voice can be estimated from the fundamental frequency pattern with high accuracy and the fundamental frequency pattern of the voice can be reconstructed from the parameter included in the fundamental frequency pattern. A learning unit 30 learns a deep generation model including an encoder which regards a parameter included in a fundamental frequency pattern in a voice signal as a latent variable of the deep generation model and estimates the latent variable from the fundamental frequency pattern in the voice signal on the basis of parallel data of the fundamental frequency pattern in the voice signal and the parameter included in the fundamental frequency pattern in the voice signal, and a decoder which reconstructs the fundamental frequency pattern in the voice signal from the latent variable.

Type: Grant

Filed: February 19, 2019

Date of Patent: October 24, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ko Tanaka, Hirokazu Kameoka
Systems and methods for code-mixing adversarial training

Patent number: 11755847

Abstract: Embodiments described herein provide adversarial attacks targeting the cross-lingual generalization ability of massive multilingual representations, demonstrating their effectiveness on multilingual models for natural language inference and question answering. An efficient adversarial training scheme can thus be implemented with the adversarial attacks; it takes the same number of steps as standard supervised training and is shown to encourage language-invariance in representations, thereby improving both clean and robust accuracy.

Type: Grant

Filed: January 15, 2021

Date of Patent: September 12, 2023

Assignee: Salesforce, Inc.

Inventors: Samson Min Rong Tan, Shafiq Rayhan Joty
Method and apparatus for awakening skills by speech

Patent number: 11721328

Abstract: The present invention discloses a method and apparatus for awakening skills by speech, which are applied to an electronic device. The method for awakening skills by speech includes: recognizing awakening text information corresponding to a speech request message to be processed; invoking a service skill semantic model to determine a target service field corresponding to the awakening text information and a corresponding first confidence, and invoking a knowledge skill semantic model to determine a knowledge reply answer corresponding to the awakening text information and a corresponding second confidence; and selecting to awaken one of a knowledge skill and a target service skill corresponding to the target service field based on the first confidence and the second confidence. Accordingly, the probability of erroneously awakening a skill based on the speech message can be reduced.

Type: Grant

Filed: October 26, 2020

Date of Patent: August 8, 2023

Assignee: AI SPEECH CO., LTD.

Inventor: Chengya Zhu
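
Note on patent 11721328: the final step of the abstract above, choosing between a knowledge skill and a target service skill from two confidences, can be sketched as a simple comparison. The function name, the minimum-confidence threshold, and the tie-break in favour of the service skill are assumptions added for illustration; the abstract only states that the selection is based on the first and second confidence.

```python
def select_skill(service_field, service_confidence,
                 knowledge_answer, knowledge_confidence,
                 min_confidence=0.5):
    """Pick which skill to awaken, or None when neither model is confident.

    Returns ("service", service_field), ("knowledge", knowledge_answer), or None.
    """
    # Assumption: reject the request when both confidences are low, which is
    # one way to reduce erroneous awakenings.
    if max(service_confidence, knowledge_confidence) < min_confidence:
        return None
    # Assumption: ties go to the service skill.
    if service_confidence >= knowledge_confidence:
        return ("service", service_field)
    return ("knowledge", knowledge_answer)
```

For example, a request scored 0.9 by the service model and 0.4 by the knowledge model awakens the service skill, while scores of 0.2 and 0.3 awaken nothing.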
Systems and methods for voice assisted healthcare

Patent number: 11663415

Abstract: The following relates generally to voice assisted healthcare. In some embodiments, a digital assistant receives audio data, and determines an intent from the audio data. The digital assistant may then match the determined intent to a flow of a set of flows, where the set of flows may include at least one of: (i) submitting a prescription, (ii) refilling a prescription, (iii) changing a pickup location, (iv) requesting a status update for a prescription, or (v) initiating a pharmacy chat session. The matched flow of the set of flows may then be executed.

Type: Grant

Filed: August 31, 2020

Date of Patent: May 30, 2023

Assignee: WALGREEN CO.

Inventors: Julija Alegra Petkus, Andrew David Schweinfurth, Stephen Elijah Zambo
Method and apparatus for adaptive audio signal alteration

Patent number: 11638086

Abstract: A method and an apparatus for enabling adaptive audio signal alteration are described. When an input audio signal is received, a determination of whether the user of an audio device hears the input audio signal is performed based upon brain activity of the user. A determination of whether the user is distracted by the audio signal is performed based upon sensor measurements indicating a physical state of the user. In response to determining that the user hears the input audio signal and that the input audio signal causes the user to be distracted, a determination of configuration parameter(s) is performed. An alteration of audio signal(s) is caused based upon the configuration parameter(s) to obtain modified version(s) of the audio signal(s) that are intended to address the distraction caused by the input audio signal, and output audio signals are output, where the output audio signals include the modified versions.

Type: Grant

Filed: June 29, 2022

Date of Patent: April 25, 2023

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Matthew John Lawrenson, Jan Jasper Van Den Berg, Jacob Ström, Lars Andersson
Performing an action based on secondary user authorization

Patent number: 11627189

Abstract: Techniques for implementing a “sticky” user ID are described. A system receives first input audio data and determines first speech processing results therefrom. The system also determines a first user ID of a user that spoke an utterance represented in the first input audio data and associates the first user ID with a device, which originated the first input audio data, for a predetermined length of time. The system determines first output data responsive to the first speech processing data and causes the device to present first output content corresponding thereto. The system then receives second input audio data and determines second speech processing results therefrom. The system also determines that a time of receipt of the second input audio data is within the predetermined length of time. Based at least in part thereon, the system determines second output data responsive to the second speech processing data using the first user ID.
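
The time-limited association between a device and a user ID described in this abstract can be sketched as a small expiring map. This is an illustration under stated assumptions, not the patented system: the class name `StickyUserIds`, the method names, and the use of caller-supplied timestamps in seconds are all hypothetical.

```python
from typing import Dict, Optional, Tuple


class StickyUserIds:
    """Remember which user last spoke to a device, for a limited time."""

    def __init__(self, ttl_seconds: float) -> None:
        self.ttl_seconds = ttl_seconds  # the "predetermined length of time"
        self._by_device: Dict[str, Tuple[str, float]] = {}  # device -> (user, expiry)

    def associate(self, device_id: str, user_id: str, now: float) -> None:
        # Called after the first utterance has been attributed to a user.
        self._by_device[device_id] = (user_id, now + self.ttl_seconds)

    def lookup(self, device_id: str, now: float) -> Optional[str]:
        # Called when later audio arrives; returns the sticky user ID only
        # if the audio was received inside the time window.
        entry = self._by_device.get(device_id)
        if entry is None:
            return None
        user_id, expires_at = entry
        if now <= expires_at:
            return user_id
        del self._by_device[device_id]  # window elapsed, forget the association
        return None
```

Passing `now` explicitly rather than reading the clock inside the class keeps the sketch deterministic and easy to test.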

Type: Grant

Filed: June 23, 2020

Date of Patent: April 11, 2023

Assignee: Amazon Technologies, Inc.

Inventor: Yu Bao
Method and system of building hospital-scale chest X-ray database for entity extraction and weakly-supervised classification and localization of common thorax diseases

Patent number: 11583239

Abstract: A new chest X-ray database, referred to as “ChestX-ray8”, is disclosed herein, which comprises over 100,000 frontal view X-ray images of over 32,000 unique patients with the text-mined eight disease image labels (where each image can have multi-labels), from the associated radiological reports using natural language processing. We demonstrate that these commonly occurring thoracic diseases can be detected and spatially-located via a unified weakly supervised multi-label image classification and disease localization framework, which is validated using our disclosed dataset.

Type: Grant

Filed: March 26, 2018

Date of Patent: February 21, 2023

Assignee: The United States of America, as represented by the Secretary, Department of Health and Human Services

Inventors: Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Ronald M. Summers
Audio identification during performance

Patent number: 11574008

Abstract: Methods and apparatus for audio identification during a performance are disclosed herein. An example apparatus includes at least one memory and at least one processor to transform a segment of audio into a log-frequency spectrogram based on a constant Q transform using a logarithmic frequency resolution, transform the log-frequency spectrogram into a binary image, each pixel of the binary image corresponding to a time frame and frequency channel pair, each frequency channel representing a corresponding quarter tone frequency channel in a range from C3-C8, generate a matrix product of the binary image and a plurality of reference fingerprints, normalize the matrix product to form a similarity matrix, select an alignment of a line in the similarity matrix that intersects one or more bins in the similarity matrix with the largest calculated Hamming similarities, and select a reference fingerprint based on the alignment.
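
The matching step in this abstract (matrix product of binary images, normalization into Hamming similarities, then choosing a line through the similarity matrix) can be illustrated in pure Python. This is a simplified sketch: it assumes each frame is a list of 0/1 values, it only considers slope-1 diagonals as candidate lines, and the function names are hypothetical. The constant-Q spectrogram and binarization stages are not shown.

```python
from typing import List, Tuple

Frame = List[int]  # one time frame: a 0/1 value per frequency channel


def hamming_similarity_matrix(query: List[Frame],
                              reference: List[Frame]) -> List[List[float]]:
    """sim[i][j] = fraction of channels on which query frame i and
    reference frame j agree (both 1 or both 0)."""
    n_bins = len(query[0])
    sim = []
    for q in query:
        row = []
        for r in reference:
            both_on = sum(a * b for a, b in zip(q, r))               # q . r
            both_off = sum((1 - a) * (1 - b) for a, b in zip(q, r))  # (1-q) . (1-r)
            row.append((both_on + both_off) / n_bins)                # normalize
        sim.append(row)
    return sim


def best_diagonal(sim: List[List[float]]) -> Tuple[int, float]:
    """Return (offset, mean similarity) for the best slope-1 line, where
    offset is the reference frame aligned with query frame 0.

    Assumes the reference has at least as many frames as the query;
    otherwise no diagonal fits and (0, -1.0) is returned.
    """
    n_q, n_r = len(sim), len(sim[0])
    best = (0, -1.0)
    for offset in range(n_r - n_q + 1):
        score = sum(sim[i][offset + i] for i in range(n_q)) / n_q
        if score > best[1]:
            best = (offset, score)
    return best
```

Running `best_diagonal` against each reference fingerprint and keeping the highest score would give a crude version of the final "select a reference fingerprint based on the alignment" step.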

Type: Grant

Filed: November 23, 2020

Date of Patent: February 7, 2023

Assignee: Gracenote, Inc.

Inventors: Dale T. Roberts, Bob Coover, Nicola Marcantonio, Markus K. Cremer
System and method for configuring input elements of a controlling device

Patent number: 11570504

Abstract: A configurable input element of a controlling device is configured by using a data representative of an over-the-top (OTT) media app determined to be installed on an OTT device and a data representative of the OTT device to identify at least one command that is required to be transmitted to cause the OTT device to launch the OTT media app. The at least one command is provisioned to the controlling device and assigned to the configurable input element. When the input element is subsequently activated, the controlling device will transmit the at least one command to cause the OTT device to launch the OTT media app.

Type: Grant

Filed: November 6, 2020

Date of Patent: January 31, 2023

Assignee: Universal Electronics Inc.

Inventors: Thomas Hascher, Menno Koopmans
Information processing system, information processing apparatus including circuitry to store position information of users present in a space and control environment effect production, information processing method, and room

Patent number: 11556308

Abstract: An information processing system includes: an image display apparatus provided in a space and configured to display an image; a sensor apparatus carried by a user who is present in the space and configured to output a signal for detecting position information of the user in the space; and an information processing apparatus. The information processing apparatus includes circuitry configured to store a plurality of pieces of position information of a plurality of users including the user, who are present in the space, in association with the plurality of users, the plurality of users being detected based on signals output from a plurality of sensor apparatuses including the sensor apparatus, and control environment effect production that supports communication between the plurality of users by the image displayed by the image display apparatus, based on each of the plurality of pieces of position information of the plurality of users.

Type: Grant

Filed: February 12, 2021

Date of Patent: January 17, 2023

Assignee: RICOH COMPANY, LTD.

Inventor: Haruki Murata
System to determine sentiment from audio data

Patent number: 11532300

Abstract: A device with a microphone acquires audio data of a user's speech. A neural network accepts audio data as input and provides sentiment data as output. The neural network is trained using training data based on input from raters who provide votes as to which sentiment descriptors they think are associated with a sample of speech. A vote by a rater assessing the sample for a particular semantic descriptor is distributed to a plurality of semantically similar semantic descriptors. Semantic descriptor similarity data indicates relative similarity between possible semantic descriptors in the semantic space. The distributed partial votes may be aggregated to produce training data comprising samples of speech and weights of corresponding semantic descriptors. The training data is then used to train the neural network. For example, the neural network may be trained with the training data using per-instance cosine similarity loss or correlational loss.
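
The vote-distribution idea in this abstract can be sketched as follows. The abstract does not specify how a vote is split, so this example assumes a proportional split by similarity. The similarity table, the descriptor names, and the function names are hypothetical illustrations.

```python
from typing import Dict, Iterable

# Hypothetical similarity data: similarity[a][b] is how close descriptor b
# is to descriptor a in the semantic space (1.0 = identical).
Similarity = Dict[str, Dict[str, float]]


def distribute_vote(voted: str, similarity: Similarity) -> Dict[str, float]:
    """Split a single rater vote across descriptors in proportion to their
    similarity to the voted descriptor; the shares sum to 1."""
    weights = similarity[voted]
    total = sum(weights.values())
    return {descriptor: w / total for descriptor, w in weights.items()}


def aggregate(votes: Iterable[str], similarity: Similarity) -> Dict[str, float]:
    """Sum the partial votes from all raters for one speech sample, giving
    a per-descriptor weight that could serve as a soft training label."""
    totals: Dict[str, float] = {}
    for voted in votes:
        for descriptor, share in distribute_vote(voted, similarity).items():
            totals[descriptor] = totals.get(descriptor, 0.0) + share
    return totals
```

With a row such as `{"happy": 1.0, "cheerful": 0.5, "sad": 0.0}`, one vote for "happy" contributes two thirds to "happy" and one third to "cheerful".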

Type: Grant

Filed: June 26, 2020

Date of Patent: December 20, 2022

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Daniel Kenneth Bone, Viktor Rozgic, Chao Wang
Method and device for detecting speech patterns and errors when practicing fluency shaping techniques

Patent number: 11517254

Abstract: A method and system for detecting errors when practicing fluency shaping exercises. The method includes setting each threshold of a set of thresholds to a respective predetermined initial value; analyzing a voice production to compute a set of first energy levels composing the voice production, wherein the voice production is of a user practicing a fluency shaping exercise; detecting at least one speech-related error based on the computed set of first energy levels, a set of second energy levels, and the set of thresholds, wherein the detection of the at least one speech-related error is with respect to the fluency shaping exercise being practiced by the user, wherein the set of second energy levels is determined based on a calibration process; and generating feedback indicating the detected at least one speech-related error.

Type: Grant

Filed: January 18, 2019

Date of Patent: December 6, 2022

Assignee: Novotalk, Ltd.

Inventors: Moshe Rot, Lilach Rothschild, Smadar Lerner
Pressure sensing guidewire assemblies and systems

Patent number: 11517209

Abstract: Pressure sensing guidewire assemblies are described herein where the guidewire assembly may be comprised of an elongate guidewire body and multiple pressure sensors secured near or at a distal end of the guidewire body. The signals obtained from the guidewire connectors and aortic sensor modules may be synchronized to minimize signal acquisition delays. The signals may be further processed to equalize the pressure waveforms by shifting the connector waveform to align correctly with the aortic module waveform and improve output signals.

Type: Grant

Filed: January 9, 2019

Date of Patent: December 6, 2022

Assignee: PATHWAYS MEDICAL CORPORATION

Inventors: Goutam Dutta, Nitin Patil
Low power mode for speech capture devices

Patent number: 11514926

Abstract: A system configured to enable a Wi-Fi processor to enter a low power mode (LPM) for short periods of time without compromising functionality is provided. A device reduces power consumption by enabling the Wi-Fi processor to enter LPM with scheduled wakeup events to enable specific functionality. In some examples, the Wi-Fi processor toggles between LPM and an active mode based on a first duty cycle to enable new device provisioning. The first duty cycle corresponds to a time required to scan a plurality of wireless channels, waking the Wi-Fi processor at a first frequency to monitor for incoming probe requests. In other examples, the Wi-Fi processor uses a second duty cycle chosen to maintain time synchronicity between a time master device and time follower devices. The device sets the second duty cycle to wake the Wi-Fi processor at a second frequency to exchange data packets with synchronized devices.

Type: Grant

Filed: November 6, 2020

Date of Patent: November 29, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Dibyendu Nandy, Om Prakash Gangwal
Tracking camera network

Patent number: 11412171

Abstract: Existence of instrumentation for automatic video recording creates an excess capacity of video recording for those who own automatic video recorders. Others may want to utilize this excess capacity to record their activities; thus there is a need for a system that helps match those who would like to utilize the excess capacity with those who have such capacity. Such excess capacity is matched with demand to use such excess capacity by creating a network of automatic video recording units and tags that are associated with people who want to be recorded.

Type: Grant

Filed: February 16, 2021

Date of Patent: August 9, 2022

Assignee: H4 Engineering, Inc.

Inventors: Christopher T. Boyle, Konstantin Othmer, Gordon Jason Glover, Alexander G. Sammons
Speech and behavior control device, robot, storage medium storing control program, and control method for speech and behavior control device

Patent number: 11400601

Abstract: The present invention allows a robot to carry out communication with excellent affectiveness. A speech and behavior control device (1) includes an utterance content selecting section (16) which selects utterance content of a robot (100) from among a plurality of utterances, a movement control section (17) which controls a movable part (13) to move based on a kind of feeling corresponding to the utterance content, and an audio control section (18) which controls the robot (100) to output the utterance content as audio after movement of the movable part (13) has been started.

Type: Grant

Filed: December 27, 2017

Date of Patent: August 2, 2022

Assignee: SHARP KABUSHIKI KAISHA

Inventor: Takuya Oyaizu
Human-machine interface (HMI) auto-steer based upon-likelihood to exceed eye glance guidelines

Patent number: 8994522

Abstract: The described method and system provide for HMI steering for a telematics-equipped vehicle based on likelihood to exceed eye glance guidelines. By determining whether a task is likely to cause the user to exceed eye glance guidelines, alternative HMI processes may be presented to a user to reduce ASGT and EORT and increase compliance with eye glance guidelines. By allowing a user to navigate through long lists of items through vocal input, T9 text input, or heuristic processing rather than through conventional presentation of the full list, a user is much more likely to comply with the eye glance guidelines. This invention is particularly useful in contexts where users may be searching for one item out of a plurality of potential items, for example, within the context of hands-free calling contacts, playing back audio files, or finding points of interest during GPS navigation.

Type: Grant

Filed: May 26, 2011

Date of Patent: March 31, 2015

Assignees: General Motors LLC, GM Global Technology Operations LLC

Inventors: Steven C. Tengler, Bijaya Aryal, Scott P. Geisler, Michael A. Wuergler
Method and System for Suggesting Phrase Completions with Phrase Segments

Publication number: 20140253458

Abstract: A method is provided for managing phrase completion suggestions in response to text input. The method includes receiving text entered into the computing system, and identifying a first plurality of phrases that each begins with the received text and that each includes a respective phrase segment immediately following the received text. The method further includes displaying a first list of the respective phrase segments of the identified first plurality of phrases without displaying the received text, and receiving input defining a selection of one of the respective phrase segments of the displayed first list.
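
The suggestion step in this abstract (find phrases that begin with the entered text, then show only the segment that follows it) can be sketched briefly. The abstract does not define a "phrase segment"; this example assumes it is the next whitespace-delimited word, and the function name is hypothetical.

```python
from typing import List


def suggest_segments(text: str, phrases: List[str]) -> List[str]:
    """Return the distinct next-word segments of phrases that begin with
    `text`, in first-seen order, without repeating `text` itself."""
    prefix = text.rstrip() + " "  # require a word boundary after the text
    segments: List[str] = []
    for phrase in phrases:
        if not phrase.startswith(prefix):
            continue
        remainder = phrase[len(prefix):].split()
        if remainder and remainder[0] not in segments:
            segments.append(remainder[0])
    return segments
```

For the text "how to" and the phrases "how to cook rice", "how to cook pasta", "how to tie a tie", and "how are you", this yields the segments "cook" and "tie". Selecting one would extend the text and the lookup could repeat.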

Type: Application

Filed: July 20, 2011

Publication date: September 11, 2014

Applicant: GOOGLE INC.

Inventor: Nirmal J. Patel
Mobile device and method and computer-readable medium controlling same for using with sound localization

Patent number: 8731715

Abstract: A mobile device moves by calculating a distance between a sound source and the mobile device using a sound source direction estimation technique. The mobile device moves by a reference distance in a direction perpendicular to a direction in which the mobile device faces the sound source when a call sound of the sound source is generated, outputs voice to instruct the sound source to generate a recall sound, checks a directional angle of the mobile device when the recall sound is generated by the sound source, calculates the distance between the sound source and the mobile device according to the reference distance and the directional angle of the mobile device, and moves to the vicinity of the sound source.
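
The geometry in this abstract amounts to a right triangle: the sideways move is one leg and the original line of sight to the source is the other. As a sketch, assume the directional angle is measured between the device's original heading and the direction of the recall sound heard from the new position. That reading of the angle and the function name are assumptions, not details taken from the patent.

```python
import math
from typing import Tuple


def distance_to_source(reference_distance: float,
                       turn_angle_deg: float) -> Tuple[float, float]:
    """Return (distance from the starting point, distance from the new
    position) to the sound source.

    reference_distance: length of the perpendicular move.
    turn_angle_deg: angle between the original heading and the direction
        of the recall sound as heard from the new position (assumed
        convention; must lie strictly between 0 and 90 degrees).
    """
    theta = math.radians(turn_angle_deg)
    if not 0.0 < theta < math.pi / 2:
        raise ValueError("angle must be strictly between 0 and 90 degrees")
    from_start = reference_distance / math.tan(theta)  # adjacent leg
    from_new = reference_distance / math.sin(theta)    # hypotenuse
    return from_start, from_new
```

For example, a 1 m sideways move followed by a 45 degree bearing change puts the source about 1 m from the starting point and about 1.41 m from the new position.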

Type: Grant

Filed: November 24, 2010

Date of Patent: May 20, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Won Jun Ko, Yong Jae Kim, Woo Sup Han, Ki Cheol Park
CHATBOT SYSTEM AND METHOD WITH CONTEXTUAL INPUT AND OUTPUT MESSAGES

Publication number: 20140122083

Abstract: A chatbot system and method with contextual input/output messages. A chatbot includes a processor, an interactive dialog interface and a knowledge database. The system uses a script file to display input and output messages in a tree format. An initial input or output message is stored. An identifier is assigned to the initial input or output message that is then used as context for the subsequent input/output messages by associating and storing the identifier with the subsequent input/output messages. The relationship between the first input or output message and subsequent input/output messages define a parent-child relationship that is displayable via the script file.

Type: Application

Filed: October 26, 2012

Publication date: May 1, 2014

Inventor: Duan Xiaojiang
PICTURES FROM SKETCHES

Publication number: 20140108016

Abstract: A graphical sketch can be received, the sketch including one or more representations of text. A query can be automatically generated from the sketch. The generation of the query can include automatically recognizing the text and automatically representing the text in the query. The query can be run to identify a picture in response to the query, with the text describing one or more non-textual features of the picture. The picture can be returned, such as in response to the receipt of the graphical sketch.

Type: Application

Filed: October 15, 2012

Publication date: April 17, 2014

Applicant: MICROSOFT CORPORATION

Inventor: Brian Albrecht
METHODS AND SYSTEMS FOR NAME PRONUNCIATION

Publication number: 20140086395

Abstract: In an embodiment, a system maintains a database of a plurality of persons. The database includes an audio clip of a pronunciation of a name of a first person in the database. The system determines from a calendar database that a second person has an event in common with the first person, and transmits to a device associated with the second person an indication that the database includes the pronunciation of the name of the first person.

Type: Application

Filed: September 25, 2012

Publication date: March 27, 2014

Applicant: LinkedIn Corporation

Inventors: Jonathan Redfern, Manish Mohan Sharma, Seth McLaughlin
VOICE STAMP-DRIVEN IN-VEHICLE FUNCTIONS

Publication number: 20140074480

Abstract: In-vehicle functions are implemented using a plurality of microphones disposed in a vehicle. Each of the microphones is disposed in a portion of the vehicle defined by a zone. The in-vehicle functions are also implemented via a central controller of the vehicle. The central controller includes a computer processor executing logic. The logic receives a voice communication from an individual via one of the microphones, identifies the zone in the vehicle occupied by the individual, identifies the individual by comparing a voice stamp from the voice communication to a database of voice stamps, and implements at least one vehicle electronic component in the zone based on user preferences associated with the voice stamp.

Type: Application

Filed: September 11, 2012

Publication date: March 13, 2014

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Jesse T. Gratke, Bassam S. Shahmurad
CONTROL METHOD AND VIDEO-AUDIO PLAYING SYSTEM

Publication number: 20140046668

Abstract: A control method for a video-audio playing system receiving a video-audio streaming signal is provided. The video-audio streaming signal includes at least one piece of channel-program information. The control method comprises receiving a speech signal and analyzing the speech signal to obtain an acoustic feature of the speech signal. According to the acoustic feature, speech recognition is performed to determine which piece of the channel-program information corresponds to the acoustic feature. According to the determined channel-program information, the video-audio playing system executes an operation corresponding to the channel-program information.

Type: Application

Filed: September 10, 2012

Publication date: February 13, 2014

Applicant: WISTRON CORPORATION

Inventor: Chih-Wen Huang
