Application Patents (Class 704/270)

Speech assisted network (Class 704/270.1)

Handicap aid (Class 704/271)

Novelty item (Class 704/272)

Security system (Class 704/273)

Warning/alarm system (Class 704/274)

Speech controlled system (Class 704/275)

Pattern display (Class 704/276)

Translation (Class 704/277)

Sound editing (Class 704/278)

Systems and methods for processing and displaying messages in digital communications

Patent number: 11556696

Abstract: Systems and methods include receiving, with a processor, two or more messages from a first user device participating in a communication session, processing, with the processor, the two or more messages, generating, with the processor, a processed message, and displaying, with the processor, the processed message on a second user device participating in the communication session.

Type: Grant

Filed: March 15, 2021

Date of Patent: January 17, 2023

Assignee: Avaya Management L.P.

Inventors: Sandesh Chopdekar, Pushkar Deole, Navin Daga
Methods and apparatus to determine an audience composition based on voice recognition

Patent number: 11558663

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed. An example apparatus includes a controller to cause a people meter to emit a prompt for input of audience identification information at a first time and determine a first audience count based on the input, an audio detector to determine a second audience count based on signatures generated from audio data captured in the media environment, and a comparator to cause the people meter to not emit the prompt for at least a first time period after the first time when the first audience count is equal to the second audience count.

Type: Grant

Filed: August 20, 2020

Date of Patent: January 17, 2023

Assignee: THE NIELSEN COMPANY (US), LLC

Inventors: John T. LiVoti, Stanley Wellington Woodruff, Rajakumar Madhanganesh, Khushboo Agarwal
Dynamic system response configuration

Patent number: 11551663

Abstract: A natural language processing system may use system response configuration data to determine customized output data forms when outputting data for a user. The system response configuration data may represent various output attributes the system may use when creating output data. The system may have a certain number of existing profiles where a profile is associated with certain settings for the system response configuration data/attributes. The system may also use various data such as context data, sentiment data, or the like to customize system response configuration data during a dialog. Other components, such as natural language generation (NLG), text-to-speech (TTS), or the like, may use the customized system response configuration data to determine the form, timing, etc. of output data to be presented to a user.

Type: Grant

Filed: December 10, 2020

Date of Patent: January 10, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Anthony Bissell, Janet Slifka
Multimedia keepsake system and method

Patent number: 11544685

Abstract: A multimedia keepsake is created containing multimedia content created by a customer and stored online as content information. After the customer selects the type of keepsake, the content information is converted to keepsake information having a format appropriate for storage in the selected type of keepsake. The keepsake information is stored online so as to be accessible via an access code, and it is downloaded to a vendor providing the access code.

Type: Grant

Filed: August 12, 2014

Date of Patent: January 3, 2023

Inventor: Geoffrey S. Stern
Systems and methods for implementing user-responsive reactive advertising via voice interactive input/output devices

Patent number: 11532007

Abstract: A system and method are provided for employing voice-activated user interfaces to determine user attention to particularly-presented advertising content by collecting user contact/consumer information, presenting content to the user/consumer, and proposing at least one question, inquiry or query to the user regarding the presented content, the at least one inquiry or query calling for a user/consumer response to be collected, at least one of (a) the user/consumer contact information and (b) the user/consumer response to the question, inquiry or query being collected by the system via a voice-activated user interface and evaluated to assess a level of engagement of the user/consumer with the advertising content. The disclosed systems and methods uniquely provide voice-activated user interface coupled with display of certain advertising content in a manner that promotes user/consumer attention to the advertising content and ease of interaction with the presentation system.

Type: Grant

Filed: August 16, 2019

Date of Patent: December 20, 2022

Inventor: Frank S. Maggio
Visualization of training dialogs for a conversational bot

Patent number: 11521114

Abstract: This document relates to creating and/or updating a chatbot using a graphical user interface. For example, training dialogs for a chatbot can be displayed in a tree form on a graphical user interface. Based at least on interactions between a developer and the graphical user interface, the training dialogs can be modified in the tree form, and training dialogs can be updated based on the modifications provided on the tree form via the graphical user interface.

Type: Grant

Filed: April 18, 2019

Date of Patent: December 6, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lars H. Liden, Swadheen K. Shukla, Shahin Shayandeh, Matthew D. Mazzola
Collaborative voice controlled devices

Patent number: 11521618

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.

Type: Grant

Filed: December 17, 2019

Date of Patent: December 6, 2022

Assignee: GOOGLE LLC

Inventors: Victor Carbune, Pedro Gonnet Anders, Thomas Deselaers, Sandro Feuz
Three-way calling terminal for mobile human-machine coordination calling robot

Patent number: 11516346

Abstract: A three-way calling terminal for a mobile human-machine coordination calling robot. Technical solutions include: a first speech interface, configured to transfer call audio between a call object and a back-end processing module; a CODEC1 module, configured to encode and decode the call audio between the call object and the back-end processing module; a second speech interface, configured to transfer call audio between the human agent and the call object; a CODEC2 module, configured to encode and decode the call audio between the human agent and the call object; a call control module, configured to process a control signal, and automatically make, answer, and hang up a call; a data processing submodule, configured to process speech data and perform data transfer between the data processing submodule and the back-end processing module; and a networking submodule, configured to be connected to the back-end processing module.

Type: Grant

Filed: July 8, 2021

Date of Patent: November 29, 2022

Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventor: Huapeng Sima
Reception apparatus, reception system, reception method, and storage medium

Patent number: 11514663

Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a face matching unit that matches, against face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.

Type: Grant

Filed: February 13, 2019

Date of Patent: November 29, 2022

Assignee: NEC CORPORATION

Inventors: Nobuaki Kawase, Makoto Igarashi
Whispering voice recovery method, apparatus and device, and readable storage medium

Patent number: 11508366

Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.

Type: Grant

Filed: June 15, 2018

Date of Patent: November 22, 2022

Assignee: IFLYTEK CO., LTD.

Inventors: Jia Pan, Cong Liu, Haikun Wang, Zhiguo Wang, Guoping Hu
Extended impersonated utterance set generation apparatus, dialogue apparatus, method thereof, and program

Patent number: 11508357

Abstract: An extended role play-based utterance set generation apparatus includes a first data store storing role play-based utterance sets and a second data store storing non-role-played utterance sets. The role play-based utterance sets include a first query and a role play-based response to the query. The non-role-played utterance sets include a second query and a non-role-played response to the query. The disclosed technology determines similarity between the role play-based response and the non-role-played response. Upon determining that the role play-based response is the same or similar to the non-role-played response, the disclosed technology generates an association between the role play-based response and the second query and extends the role play-based utterance sets in the first data store with the second query.

Type: Grant

Filed: April 5, 2019

Date of Patent: November 22, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Masahiro Mizukami, Ryuichiro Higashinaka
Apparatus and method for providing voice assistant service

Patent number: 11501755

Abstract: Provided are an electronic device and method for providing a voice assistant service. The method, performed by the electronic device, of providing the voice assistant service includes: obtaining a voice of a user; obtaining voice analysis information of the voice of the user by inputting the voice of the user to a natural language understanding model; determining whether a response operation with respect to the voice of the user is performable, according to a preset criterion, based on the obtained voice analysis information; and based on the determining that the response operation is not performable, outputting a series of guide messages for learning the response operation related to the voice of the user.

Type: Grant

Filed: September 1, 2020

Date of Patent: November 15, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Inchul Hwang
Method, device, and system for providing privacy for communications

Patent number: 11483425

Abstract: A communication system, method and communication terminal are configured to facilitate private outputting of content of a message or communication session. A communication terminal can be configured via data included in a message or via a privacy setting to output content of data from a communication session or message in accordance with a pre-selected privacy setting or one or more privacy rules. For instance, a communication terminal may be configured to suppress a text to speech function for certain text messages, email messages, instant messages, or social networking messages that it receives having the privacy parameter set therein. As another example, a user may set the privacy parameter in his or her terminal so that any such message received by that terminal is output in accordance with the privacy setting or rules. A detection of nearby people can affect how certain content may be output via a terminal.

Type: Grant

Filed: October 23, 2018

Date of Patent: October 25, 2022

Assignee: RINGCENTRAL, INC.

Inventors: Christian Garbin, Johannes Ruetschi
Joint endpointing and automatic speech recognition

Patent number: 11475880

Abstract: A method includes receiving audio data of an utterance and processing the audio data to obtain, as output from a speech recognition model configured to jointly perform speech decoding and endpointing of utterances: partial speech recognition results for the utterance; and an endpoint indication indicating when the utterance has ended. While processing the audio data, the method also includes detecting, based on the endpoint indication, the end of the utterance. In response to detecting the end of the utterance, the method also includes terminating the processing of any subsequent audio data received after the end of the utterance was detected.

Type: Grant

Filed: March 4, 2020

Date of Patent: October 18, 2022

Assignee: Google LLC

Inventors: Shuo-yiin Chang, Rohit Prakash Prabhavalkar, Gabor Simko, Tara N. Sainath, Bo Li, Yangzhang He
Method, apparatus and computer program for detecting voice uttered from a particular position

Patent number: 11468884

Abstract: An information processing apparatus includes a voice acquisition section, a reliability generation section, and a processing execution section. The voice acquisition section acquires an ambient voice. The reliability generation section generates reliability indicating a degree in which the acquired voice is uttered from the particular position on the basis of a predetermined transfer characteristic. As the predetermined transfer characteristic, a phase difference or acoustic characteristic of the voice can be assumed. The processing execution section executes a process according to the generated reliability. As the process according to the reliability, a notification according to the reliability or a predetermined command can be assumed to be executed.

Type: Grant

Filed: March 13, 2018

Date of Patent: October 11, 2022

Assignee: Sony Corporation

Inventors: Ryosuke Sawata, Yuichiro Koyama
Method, apparatus, electronic device and storage medium for broadcasting voice

Patent number: 11470027

Abstract: A method, an apparatus, an electronic device and a storage medium for broadcasting a voice are provided. The method may include: sending a voice broadcast request to a server, where the voice broadcast request includes at least one of scenario information, user information or voice packet setting information; receiving a voice broadcast instruction corresponding to the voice broadcast request returned by the server; and acquiring a personalized voice packet corresponding to the voice broadcast instruction in a local database and broadcasting the personalized voice packet.

Type: Grant

Filed: March 25, 2021

Date of Patent: October 11, 2022

Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Shiqiang Ding, Jinyi Lei
Auralization for multi-microphone devices

Patent number: 11470419

Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.

Type: Grant

Filed: August 29, 2019

Date of Patent: October 11, 2022

Assignee: Google LLC

Inventors: Rajeev Conrad Nongpiur, Ananya Misra, Chanwoo Kim
Adaptive vocabulary improvement

Patent number: 11455907

Abstract: A computer-implemented method includes recognizing, by a computer device, a word as a new learned word for a user; registering, by the computer device, the new leaned word in a user's new learned word list as a registered new learned word; associating, by the computer device, the registered new learned word with related known words in a user's known word library, the known word library including words known to the user; tracking, by the computer device, uses of the related known words by the user; identifying, by the computer device, a used sentence used by the user that contains one of the related known words; and suggesting, by the computer device, to the user a new sentence that replaces the one of the related known words in the used sentence with the new learned word.

Type: Grant

Filed: November 27, 2018

Date of Patent: September 27, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Su Liu, Manjunath Ravi, Zhichao Li, Kai Liu
Audio-visual speech separation

Patent number: 11456005

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Grant

Filed: November 21, 2018

Date of Patent: September 27, 2022

Assignee: Google LLC

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
Apparatus control device, apparatus control system, apparatus control method, and apparatus control program

Patent number: 11443745

Abstract: Included are: an apparatus function information acquiring unit for acquiring apparatus function information in which a target apparatus and one or more target functions to be executed by the target apparatus, which are determined on the basis of uttered speech, are associated with each other; a procedure determining unit for determining one or more manual operations for executing the one or more target functions and an order of the one or more manual operations on the basis of the apparatus function information acquired by the apparatus function information acquiring unit; and an operation command transmission controlling unit for sequentially transmitting, to the target apparatus, operation commands for outputting operation response output control information corresponding to each of the one or more manual operations in accordance with the order of the one or more manual operations determined by the procedure determining unit.

Type: Grant

Filed: October 21, 2020

Date of Patent: September 13, 2022

Assignee: MITSUBISHI ELECTRIC CORPORATION

Inventors: Masato Hirai, Kenshiro Kitamura, Miho Ishikawa, Daisuke Iizawa
Information processing apparatus, information processing method, and storage medium

Patent number: 11443759

Abstract: An information processing apparatus includes a memory storing instructions. The instructions cause the apparatus to extract a plurality of local features from data indicating a speech, the characteristics of feature extraction being formed through learning; and to encode a series of chronological features of the data based on the plurality of local features, characteristics of encoding the series of chronological features being formed through learning. The instructions also cause the apparatus to generate information obtained by weighting features at a specific point in time associated with emotion classification, of the series of chronological features encoded, characteristics of weighting the features at the specific point in time being formed through learning; and to classify emotion corresponding to the data using the information obtained by weighting the features at the specific point in time, characteristics of classification being formed through learning.

Type: Grant

Filed: July 27, 2020

Date of Patent: September 13, 2022

Assignee: HONDA MOTOR CO., LTD.

Inventor: Yuanchao Li
Initiating synthesized speech output from a voice-controlled device

Patent number: 11443730

Abstract: A system, a computer program product, and method for controlling synthesized speech output on a voice-controlled device. A sensor is used to capture an image of a face of a person. A database of previously stored images of facial features is accessed. In response to i) not recognizing the at least one person the voice-controlled device selects a first set of conversational starters; ii) recognizing the person and recognizing previous communications with the person, the voice-controlled device selects a second set of conversational starters; iii) recognizing the person and not recognizing previous communications with the person, the voice-controlled device selects a third set of conversational starters; or iv) recognizing the at least one person and recognizing previous communications with the person selecting but do not know the person's name selecting a fourth set of conversational starters. The voice controlled device outputs the selected set of conversational starters.

Type: Grant

Filed: January 28, 2020

Date of Patent: September 13, 2022

Assignee: International Business Machines Corporation

Inventors: Shang Qing Guo, Jonathan Lenchner
Sentiments based transaction systems and methods

Patent number: 11430014

Abstract: Systems and methods of facilitating transactions related to targeted or customized commercial offerings based on derived sentiment states are provided. The sentiment states are derived from digital representations such as images, videos and sound recordings.

Type: Grant

Filed: October 1, 2020

Date of Patent: August 30, 2022

Assignee: Nant Holdings IP, LLC

Inventor: Patrick Soon-Shiong
Reception apparatus, reception system, reception method and storage medium

Patent number: 11430207

Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a conversation processing unit that acquires reception information including a content of conversation with the user; a face matching unit that matches, against the face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.

Type: Grant

Filed: June 8, 2017

Date of Patent: August 30, 2022

Assignee: NEC CORPORATION

Inventors: Nobuaki Kawase, Makoto Igarashi
Verbal cues for high-speed control of a voice-enabled device

Patent number: 11423879

Abstract: A technique for controlling a voice-enabled device using voice commands includes receiving an audio signal that is generated in response to a verbal utterance, generating a verbal utterance indicator for the verbal utterance based on the audio signal, selecting a first command for a voice-controlled application residing within the voice-enabled device based on the verbal utterance indicator, and transmitting the first command to the voice-controlled application as an input.

Type: Grant

Filed: July 18, 2017

Date of Patent: August 23, 2022

Assignee: Disney Enterprises, Inc.

Inventor: William Valentine Zajac, III
Method and apparatus for updating real-time voice recognition model using moving agent

Patent number: 11423881

Abstract: According to an embodiment of the present disclosure, a method of updating a speech recognition model using a mobile agent in real-time comprises obtaining, in real-time, space type information for a particular space where the mobile agent is located, varying, in real-time, parameters of a speech recognition model used in the particular space based on the space type information, and performing a speech recognition service based on the speech recognition model including the varied parameters. Embodiments of the present disclosure may be related to artificial intelligence (AI) devices, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.

Type: Grant

Filed: September 17, 2019

Date of Patent: August 23, 2022

Assignee: LG ELECTRONICS INC.

Inventor: Jonghoon Chae
Multi-platform integrated display

Patent number: 11422764

Abstract: Augmented reality display systems, apparatuses, and methods are disclosed for enabling a wearer of an augmented reality optical display to continue wearing the same optical display while moving between different platforms or vehicles. Example embodiments include optical displays that use a wired connection to connect with each platform to minimize the electromagnetic signature of the system. Embodiments include changing the information displayed to the user depending on the type of vehicle to which the optical display is connected. Additional embodiment display information about weapon systems associated with the platform to which the optical display is connected.

Type: Grant

Filed: May 31, 2019

Date of Patent: August 23, 2022

Assignee: EPIC OPTIX, INC.

Inventor: Ray Kwong
Machine-control of a device based on machine-detected transitions

Patent number: 11417302

Abstract: Apparatus, methods, and systems that operate to provide interactive streaming content identification and processing are disclosed. An example apparatus includes a classifier to determine an audio characteristic value representative of an audio characteristic in audio; a transition detector to detect a transition between a first category and a second category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the first category and the second category; and a context manager to control a device to switch from a first fingerprinting algorithm to a second fingerprinting algorithm different than the first fingerprinting algorithm, responsive to the detected transition between the first category and the second category.

Type: Grant

Filed: September 10, 2020

Date of Patent: August 16, 2022

Assignee: Gracenote, Inc.

Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
System for performing a magnetic resonance tomography and method for controlling an MR scanner

Patent number: 11417329

Abstract: A system for performing magnetic resonance tomography is disclosed. A control system creates a speech data stream from an acquired linguistic expression and generates a command library, which contains a selection of speech commands, to each of which one or more linguistic expressions are assigned. The selection of speech commands is loaded from a command database depending on a current system status of a magnetic resonance (MR) scanner. The control system applies a speech recognition algorithm to the speech data stream to determine whether a linguistic expression contained in the command library can be assigned to the speech data stream. If so, the acquired linguistic expression is recognized, a speech command from the command library assigned to the recognized linguistic expression is established, and a control command for controlling the MR scanner in accordance with the speech command is created.

Type: Grant

Filed: January 30, 2020

Date of Patent: August 16, 2022

Assignee: Siemens Healthcare GmbH

Inventors: Rainer Schneider, Dirk Franger
Teacher and student learning for constructing mixed-domain model

Patent number: 11416741

Abstract: A technique for constructing a model supporting a plurality of domains is disclosed. In the technique, a plurality of teacher models, each of which is specialized for different one of the plurality of the domains, is prepared. A plurality of training data collections, each of which is collected for different one of the plurality of the domains, is obtained. A plurality of soft label sets is generated by inputting each training data in the plurality of the training data collections into corresponding one of the plurality of the teacher models. A student model is trained using the plurality of the soft label sets.

Type: Grant

Filed: June 8, 2018

Date of Patent: August 16, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Takashi Fukuda, Osamu Ichikawa, Samuel Thomas, Bhuvana Ramabhadran
Beautification techniques for 3D data in a messaging system

Patent number: 11410401

Abstract: The subject technology receives a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect including at least one beautification operation. The subject technology captures image data and depth data using a camera. The subject technology applies, to the image data and the depth data, the 3D effect including the at least one beautification operation based at least in part on the augmented reality content generator, the beautification operation being performed as part of applying the 3D effect. The subject technology generates a 3D message based at least in part on the applied 3D effect including the at least one beautification operation. The subject technology renders a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation.

Type: Grant

Filed: August 28, 2020

Date of Patent: August 9, 2022

Assignee: Snap Inc.

Inventors: Kyle Goodrich, Samuel Edward Hare, Maxim Maximov Lazarov, Tony Mathew, Andrew James McPhee, Daniel Moreno, Dhritiman Sagar, Wentao Shang
Processing complex utterances for natural language understanding

Patent number: 11410646

Abstract: A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.

Type: Grant

Filed: March 28, 2019

Date of Patent: August 9, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Cengiz Erbas, Thomas Kollar, Avnish Sikka, Spyridon Matsoukas, Simon Peter Reavely
Intelligent e-book reader incorporating augmented reality or virtual reality

Patent number: 11410144

Abstract: Embodiments disclosed herein describe intelligent e-book readers which provide a significant improvement over the conventional e-books that simply render static content. The intelligent e-book readers may customize a rendered e-book based on, for example, the reading level and preferences of the user, the user's social media profile and activity, and current events. Furthermore, the intelligent e-book reader may provide additional augmented reality (AR)/virtual reality (VR) content associated with one or more portions of the rendered e-book. The intelligent e-book reader may also facilitate virtual, real time communication between multiple users and experts. The intelligent e-book reader may also facilitate one or more users to provide feedback and suggestions to authors and future movie-makers. The intelligent e-book reader may automatically determine difficult portions of an e-book based on the virtual communications and/or real time eye-tracking of a user.

Type: Grant

Filed: January 25, 2021

Date of Patent: August 9, 2022

Assignee: MASSACHUSETTS MUTUAL LIFE INSURANCE COMPANY

Inventors: Michal Knas, Payton A. Shubrick, Damon Ryan Depaolo, Jiby John
System and method for interview training with time-matched feedback

Patent number: 11403598

Abstract: The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation.

Type: Grant

Filed: October 18, 2021

Date of Patent: August 2, 2022

Assignee: Korn Ferry

Inventors: Thom Steinhoff, Panos S. Stamus, Bryan Ackermann, John Deyto
Method and apparatus for controlling device

Patent number: 11398225

Abstract: A method and apparatus for controlling a device are disclosed. The method includes: performing voice recognition on a received sound signal to obtain a voice recognition result; determining keywords using the voice recognition result; determining a target intelligent device having attribute information matched with the keywords from intelligent devices, where relationships between the intelligent devices and attribute information of the intelligent devices are constructed in advance, and the attribute information characterizes a device operation provided by the intelligent device corresponding to the attribute information; and controlling the target intelligent device to perform an operation indicated by the voice recognition result.

Type: Grant

Filed: September 25, 2019

Date of Patent: July 26, 2022

Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventor: Fuxin Li
Artificial intelligence apparatus and method for recognizing speech of user in consideration of user's application usage log

Patent number: 11398222

Abstract: Provided is an artificial intelligence (AI) device for recognizing speech of user. The AI apparatus includes: a microphone; and a processor configured to: receive, via the microphone, a sound signal corresponding to speech of the user, recognize the speech from the sound signal using a language model, determine an intention of the user based on the recognition result, determine whether the determination of the intention is successful, obtain a user's application usage log if the determination of the intention is not successful, and update the language model using the obtained user's application usage log.

Type: Grant

Filed: August 13, 2019

Date of Patent: July 26, 2022

Assignee: LG ELECTRONICS INC.

Inventors: Jaehong Kim, Boseop Kim
System to characterize vocal presentation

Patent number: 11393462

Abstract: A device with a microphone acquires audio data of a user's speech. That speech comprises utterances, that together comprise a session. The audio data is processed to determine sentiment data indicative of perceived emotional content of the speech as conveyed by individual utterances of the user. That information is then used to determine the emotional content of the session. For example, the information may include several words describing the overall and outlying emotions of the session. Numeric metrics may also be determined, such as activation and valence. A user interface may present the words and metrics to the user. The user may use this information to assess their state of mind, facilitate interactions with others, and so forth.

Type: Grant

Filed: May 13, 2020

Date of Patent: July 19, 2022

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Narendra Gyanchandani, Bilyana Slavova, Daniel Kenneth Bone, Hanhan Wang, Njenga Kariuki
Method for decoding and encoding a downmix matrix, method for presenting audio content, encoder and decoder for a downmix matrix, audio encoder and audio decoder

Patent number: 11393481

Abstract: A method is described which decodes a downmix matrix for mapping a plurality of input channels of audio content to a plurality of output channels, the input and output channels being associated with respective speakers at predetermined positions relative to a listener position, wherein the downmix matrix is encoded by exploiting the symmetry of speaker pairs of the plurality of input channels and the symmetry of speaker pairs of the plurality of output channels. Encoded information representing the encoded downmix matrix is received and decoded for obtaining the decoded downmix matrix.

Type: Grant

Filed: September 23, 2019

Date of Patent: July 19, 2022

Inventors: Florin Ghido, Achim Kuntz, Bernhard Grill
Audio confirmation system, audio confirmation method, and program via speech and text comparison

Patent number: 11386901

Abstract: An audio confirmation system includes a voice acquiring section configured to acquire a voice contained in a motion picture; a voice text producing section configured to produce a voice text based on the acquired voice; a determining section configured to determine whether or not the produced voice text and a caption text that is embedded in an image contained in the motion picture correspond to each other; and an outputting section configured to output a result of the determination of the determining section.

Type: Grant

Filed: March 18, 2020

Date of Patent: July 12, 2022

Assignee: Sony Interactive Entertainment Inc.

Inventors: Masaomi Nishidate, Isamu Terasaka, Norihiro Nagai
Contextual tagging and biasing of grammars inside word lattices

Patent number: 11386889

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.

Type: Grant

Filed: November 27, 2019

Date of Patent: July 12, 2022

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
Information processing apparatus and information processing method

Patent number: 11381670

Abstract: An electronic device including circuitry configured to perform control in a manner that a Physical Layer Convergence Protocol (PLCP) header format is selected from a plurality of PLCP header formats; and append the selected PLCP header to a physical layer packet for transmission.

Type: Grant

Filed: August 17, 2020

Date of Patent: July 5, 2022

Assignee: SONY CORPORATION

Inventors: Takeshi Itagaki, Tomoya Yamaura, Kazuyuki Sakoda, Masanori Sato
Intelligent interactive method and apparatus, computer device and computer readable storage medium

Patent number: 11373641

Abstract: Embodiments of the present invention provide an intelligent interactive method and apparatus, a computer device and a computer readable storage medium, which solves problems that a deep intention of a user message cannot be analyzed in an intelligent interactive manner in the prior art and humanized interactive experiences cannot be provided. The intelligent interactive method includes: obtaining an emotion recognition result according to a user message, where the user message includes at least a user voice message; performing an intention analysis according to a text content of the user voice message to obtain corresponding basic intention information; and determining a corresponding interactive instruction according to the emotion recognition result and the basic intention information.

Type: Grant

Filed: May 16, 2019

Date of Patent: June 28, 2022

Assignee: Shanghai Xiaoi Robot Technology Co., Ltd.

Inventors: Hui Wang, Shijing Yu, Pinpin Zhu
Voice analysis training system

Patent number: 11373651

Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script, storing desired attributes associated with the simulation file, retrieving the simulation file from the database and providing a user interface to conduct the voice analysis using the simulation file from the database, receiving one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.

Type: Grant

Filed: February 21, 2020

Date of Patent: June 28, 2022

Assignee: SALESBOOST, LLC

Inventor: Margaret L Brooks
Output method and electronic device for reply information and supplemental information

Patent number: 11373643

Abstract: An output method includes obtaining voice information, determining whether the voice information is a voice request, in response to the voice information being the voice request, obtaining reply information for replying to the voice request, and supplemental information, and transmitting the reply information and the supplementary information to an output device for outputting. The supplemental information is information that needs to be outputted in association with the reply information.

Type: Grant

Filed: March 28, 2019

Date of Patent: June 28, 2022

Assignee: LENOVO (BEIJING) CO., LTD.

Inventors: Wenlin Yan, Shifeng Peng
Karaoke query processing system

Patent number: 11366851

Abstract: Computer systems and methods are provided for processing audio queries. An electronic device receives an audio clip and performs a matching process on the audio clip. The matching process includes comparing at least a portion of the audio clip to a plurality of reference audio tracks and identifying, based on the comparing, a first portion of a particular reference track that corresponds to the audio sample. Upon identifying the matching portion, the electronic device provides a backing track for playback which corresponds to the particular reference track, and an initial playback position of the backing track.

Type: Grant

Filed: December 18, 2019

Date of Patent: June 21, 2022

Assignee: Spotify AB

Inventors: Marco Marchini, Nicola Montecchio
Device naming-indicator generation

Patent number: 11361764

Abstract: Systems and methods for device naming-indicator generation are disclosed. Friendly names for accessory devices, such as smart-home devices, may be utilized to generate formatted text data that includes capitalization and/or punctuation for the friendly names. The formatted text data may be utilized to generate tag data indicating attributes of the friendly name. The tag data and/or contextual data indicating historical usage of the accessory device may be utilized to generate naming indicator(s) for the accessory device. The naming indicator(s) may be utilized, for example, during target inference and/or for communicating with a user about the accessory device.

Type: Grant

Filed: January 3, 2019

Date of Patent: June 14, 2022

Assignee: Amazon Technologies, Inc.

Inventors: David Y Zhao, Akshay Kumar, William Evan Welbourne
Managing streamed audio communication sessions

Patent number: 11363083

Abstract: Methods and apparatus are disclosed for managing streamed audio communication sessions between user devices (50) configured to send streamed data indicative of received audio contributions from respective participants in a multiple-participant audio communication session via a communications network to one or more other user devices (50) for conversion to audio representations of said received audio contributions for other participants.

Type: Grant

Filed: December 21, 2018

Date of Patent: June 14, 2022

Assignee: BRITISH TELECOMMUNICATIONS public limited company

Inventors: Ian Kegel, Karis Bailey, Martin Reed, Peter Hughes
System and method for dialog interaction in distributed automation systems

Patent number: 11354089

Abstract: A method for generating a user interface with a user interface device in a distributed automation system includes receiving a service message from a home automation device in the distributed automation system, identifying a state of a dialog manager of the user interface device in response to receiving the service message, and generating a natural language output message based at least in part on a device identifier parameter in the service message and a plurality of natural language templates stored in the memory in response to the dialog manager being in an idle state. The method further includes storing the service message in a priority queue in the memory based on a priority level parameter corresponding to the service message in response to the dialog manager being in an active state.

Type: Grant

Filed: December 9, 2016

Date of Patent: June 7, 2022

Assignee: Robert Bosch GmbH

Inventors: Leah Nicolich-Henkin, Cory Henson, Joao P. Sousa
Verification of user identity for voice enabled devices

Patent number: 11355126

Abstract: Provided are methods and systems to verify user identity for voice enabled devices. A voice input can instruct a voice enabled device to perform a plurality of functions/services that, depending on the function/service, may require additional user verification. Primary user verification can be performed by associating voice characteristics of the voice input to a profile associated with a user/user device. A signal (e.g., a BLE beacon) can be sent to the user device that causes the user device to perform secondary user verification. The secondary user verification can be based on a biometric input, passcode verification, authenticated message reply, for example. Based on the secondary user verification, an operational command associated with the voice input can be executed.

Type: Grant

Filed: January 24, 2018

Date of Patent: June 7, 2022

Assignee: Comcast Cable Communications, LLC

Inventor: Franklyn Athias
Personal directory service

Patent number: 11341970

Abstract: A method of providing navigation directions includes receiving, at a user terminal, a query spoken by a user, wherein the query spoken by the user includes a speech utterance indicating (i) a category of business, (ii) a name of the business, and (iii) a location at which or near which the business is disposed; identifying, by processing hardware, the business based on the speech utterance; and providing navigation directions to the business via the user terminal.

Type: Grant

Filed: June 8, 2020

Date of Patent: May 24, 2022

Assignee: GOOGLE LLC

Inventors: Brian Strope, Francoise Beaufays, William J. Byrne

prev 1 2 3 4 5 6 … next