Application Patents (Class 704/270)
  • Patent number: 11556696
    Abstract: Systems and methods include receiving, with a processor, two or more messages from a first user device participating in a communication session, processing, with the processor, the two or more messages, generating, with the processor, a processed message, and displaying, with the processor, the processed message on a second user device participating in the communication session.
    Type: Grant
    Filed: March 15, 2021
    Date of Patent: January 17, 2023
    Assignee: Avaya Management L.P.
    Inventors: Sandesh Chopdekar, Pushkar Deole, Navin Daga
  • Patent number: 11558663
    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed. An example apparatus includes a controller to cause a people meter to emit a prompt for input of audience identification information at a first time and determine a first audience count based on the input, an audio detector to determine a second audience count based on signatures generated from audio data captured in the media environment, and a comparator to cause the people meter to not emit the prompt for at least a first time period after the first time when the first audience count is equal to the second audience count.
    Type: Grant
    Filed: August 20, 2020
    Date of Patent: January 17, 2023
    Assignee: THE NIELSEN COMPANY (US), LLC
    Inventors: John T. LiVoti, Stanley Wellington Woodruff, Rajakumar Madhanganesh, Khushboo Agarwal
  • Patent number: 11551663
    Abstract: A natural language processing system may use system response configuration data to determine customized output data forms when outputting data for a user. The system response configuration data may represent various output attributes the system may use when creating output data. The system may have a certain number of existing profiles where a profile is associated with certain settings for the system response configuration data/attributes. The system may also use various data such as context data, sentiment data, or the like to customize system response configuration data during a dialog. Other components, such as natural language generation (NLG), text-to-speech (TTS), or the like, may use the customized system response configuration data to determine the form, timing, etc. of output data to be presented to a user.
    Type: Grant
    Filed: December 10, 2020
    Date of Patent: January 10, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Anthony Bissell, Janet Slifka
  • Patent number: 11544685
    Abstract: A multimedia keepsake is created containing multimedia content created by a customer and stored online as content information. After the customer selects the type of keepsake, the content information is converted to keepsake information having a format appropriate for storage in the selected type of keepsake. The keepsake information is stored online so as to be accessible via an access code, and it is downloaded to a vendor providing the access code.
    Type: Grant
    Filed: August 12, 2014
    Date of Patent: January 3, 2023
    Inventor: Geoffrey S. Stern
  • Patent number: 11532007
    Abstract: A system and method are provided for employing voice-activated user interfaces to determine user attention to particularly-presented advertising content by collecting user contact/consumer information, presenting content to the user/consumer, and proposing at least one question, inquiry or query to the user regarding the presented content, the at least one inquiry or query calling for a user/consumer response to be collected, at least one of (a) the user/consumer contact information and (b) the user/consumer response to the question, inquiry or query being collected by the system via a voice-activated user interface and evaluated to assess a level of engagement of the user/consumer with the advertising content. The disclosed systems and methods uniquely provide voice-activated user interface coupled with display of certain advertising content in a manner that promotes user/consumer attention to the advertising content and ease of interaction with the presentation system.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: December 20, 2022
    Inventor: Frank S. Maggio
  • Patent number: 11521114
    Abstract: This document relates to creating and/or updating a chatbot using a graphical user interface. For example, training dialogs for a chatbot can be displayed in a tree form on a graphical user interface. Based at least on interactions between a developer and the graphical user interface, the training dialogs can be modified in the tree form, and training dialogs can be updated based on the modifications provided on the tree form via the graphical user interface.
    Type: Grant
    Filed: April 18, 2019
    Date of Patent: December 6, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lars H. Liden, Swadheen K. Shukla, Shahin Shayandeh, Matthew D. Mazzola
  • Patent number: 11521618
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: December 6, 2022
    Assignee: GOOGLE LLC
    Inventors: Victor Carbune, Pedro Gonnet Anders, Thomas Deselaers, Sandro Feuz
  • Patent number: 11516346
    Abstract: A three-way calling terminal for a mobile human-machine coordination calling robot. Technical solutions include: a first speech interface, configured to transfer call audio between a call object and a back-end processing module; a CODEC1 module, configured to encode and decode the call audio between the call object and the back-end processing module; a second speech interface, configured to transfer call audio between the human agent and the call object; a CODEC2 module, configured to encode and decode the call audio between the human agent and the call object; a call control module, configured to process a control signal, and automatically make, answer, and hang up a call; a data processing submodule, configured to process speech data and perform data transfer between the data processing submodule and the back-end processing module; and a networking submodule, configured to be connected to the back-end processing module.
    Type: Grant
    Filed: July 8, 2021
    Date of Patent: November 29, 2022
    Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.
    Inventor: Huapeng Sima
  • Patent number: 11514663
    Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a face matching unit that matches, against face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.
    Type: Grant
    Filed: February 13, 2019
    Date of Patent: November 29, 2022
    Assignee: NEC CORPORATION
    Inventors: Nobuaki Kawase, Makoto Igarashi
  • Patent number: 11508366
    Abstract: A method, an apparatus and a device for converting a whispered speech, and a readable storage medium are provided. The method is implemented based on the whispered speech converting model. The whispered speech converting model is trained in advance by using recognition results and whispered speech training acoustic features of whispered speech training data as samples and using normal speech acoustic features of normal speech data parallel to the whispered speech training data as sample labels. A whispered speech acoustic feature and a preliminary recognition result of whispered speech data are acquired, then the whispered speech acoustic feature and the preliminary recognition result are inputted into a preset whispered speech converting model to acquire a normal speech acoustic feature outputted by the model. In this way, the whispered speech can be converted to a normal speech.
    Type: Grant
    Filed: June 15, 2018
    Date of Patent: November 22, 2022
    Assignee: IFLYTEK CO., LTD.
    Inventors: Jia Pan, Cong Liu, Haikun Wang, Zhiguo Wang, Guoping Hu
  • Patent number: 11508357
    Abstract: An extended role play-based utterance set generation apparatus includes a first data store storing role play-based utterance sets and a second data store storing non-role-played utterance sets. The role play-based utterance sets include a first query and a role play-based response to the query. The non-role-played utterance sets include a second query and a non-role-played response to the query. The disclosed technology determines similarity between the role play-based response and the non-role-played response. Upon determining that the role play-based response is the same or similar to the non-role-played response, the disclosed technology generates an association between the role play-based response and the second query and extends the role play-based utterance sets in the first data store with the second query.
    Type: Grant
    Filed: April 5, 2019
    Date of Patent: November 22, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Masahiro Mizukami, Ryuichiro Higashinaka
  • Patent number: 11501755
    Abstract: Provided are an electronic device and method for providing a voice assistant service. The method, performed by the electronic device, of providing the voice assistant service includes: obtaining a voice of a user; obtaining voice analysis information of the voice of the user by inputting the voice of the user to a natural language understanding model; determining whether a response operation with respect to the voice of the user is performable, according to a preset criterion, based on the obtained voice analysis information; and based on the determining that the response operation is not performable, outputting a series of guide messages for learning the response operation related to the voice of the user.
    Type: Grant
    Filed: September 1, 2020
    Date of Patent: November 15, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Inchul Hwang
  • Patent number: 11483425
    Abstract: A communication system, method and communication terminal are configured to facilitate private outputting of content of a message or communication session. A communication terminal can be configured via data included in a message or via a privacy setting to output content of data from a communication session or message in accordance with a pre-selected privacy setting or one or more privacy rules. For instance, a communication terminal may be configured to suppress a text to speech function for certain text messages, email messages, instant messages, or social networking messages that it receives having the privacy parameter set therein. As another example, a user may set the privacy parameter in his or her terminal so that any such message received by that terminal is output in accordance with the privacy setting or rules. A detection of nearby people can affect how certain content may be output via a terminal.
    Type: Grant
    Filed: October 23, 2018
    Date of Patent: October 25, 2022
    Assignee: RINGCENTRAL, INC.
    Inventors: Christian Garbin, Johannes Ruetschi
  • Patent number: 11475880
    Abstract: A method includes receiving audio data of an utterance and processing the audio data to obtain, as output from a speech recognition model configured to jointly perform speech decoding and endpointing of utterances: partial speech recognition results for the utterance; and an endpoint indication indicating when the utterance has ended. While processing the audio data, the method also includes detecting, based on the endpoint indication, the end of the utterance. In response to detecting the end of the utterance, the method also includes terminating the processing of any subsequent audio data received after the end of the utterance was detected.
    Type: Grant
    Filed: March 4, 2020
    Date of Patent: October 18, 2022
    Assignee: Google LLC
    Inventors: Shuo-yiin Chang, Rohit Prakash Prabhavalkar, Gabor Simko, Tara N. Sainath, Bo Li, Yangzhang He
  • Patent number: 11468884
    Abstract: An information processing apparatus includes a voice acquisition section, a reliability generation section, and a processing execution section. The voice acquisition section acquires an ambient voice. The reliability generation section generates reliability indicating a degree in which the acquired voice is uttered from the particular position on the basis of a predetermined transfer characteristic. As the predetermined transfer characteristic, a phase difference or acoustic characteristic of the voice can be assumed. The processing execution section executes a process according to the generated reliability. As the process according to the reliability, a notification according to the reliability or a predetermined command can be assumed to be executed.
    Type: Grant
    Filed: March 13, 2018
    Date of Patent: October 11, 2022
    Assignee: Sony Corporation
    Inventors: Ryosuke Sawata, Yuichiro Koyama
  • Patent number: 11470027
    Abstract: A method, an apparatus, an electronic device and a storage medium for broadcasting a voice are provided. The method may include: sending a voice broadcast request to a server, where the voice broadcast request includes at least one of scenario information, user information or voice packet setting information; receiving a voice broadcast instruction corresponding to the voice broadcast request returned by the server; and acquiring a personalized voice packet corresponding to the voice broadcast instruction in a local database and broadcasting the personalized voice packet.
    Type: Grant
    Filed: March 25, 2021
    Date of Patent: October 11, 2022
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Shiqiang Ding, Jinyi Lei
  • Patent number: 11470419
    Abstract: A method for auralizing a multi-microphone device. Path information for one or more sound paths using dimensions and room reflection coefficients of a simulated room for one of a plurality of microphones included in a multi-microphone device is determined. An array-related transfer functions (ARTFs) for the one of the plurality of microphones is retrieved. The auralized impulse response for the one of the plurality of microphones is generated based at least on the retrieved ARTFs and the determined path information.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: October 11, 2022
    Assignee: Google LLC
    Inventors: Rajeev Conrad Nongpiur, Ananya Misra, Chanwoo Kim
  • Patent number: 11455907
    Abstract: A computer-implemented method includes recognizing, by a computer device, a word as a new learned word for a user; registering, by the computer device, the new leaned word in a user's new learned word list as a registered new learned word; associating, by the computer device, the registered new learned word with related known words in a user's known word library, the known word library including words known to the user; tracking, by the computer device, uses of the related known words by the user; identifying, by the computer device, a used sentence used by the user that contains one of the related known words; and suggesting, by the computer device, to the user a new sentence that replaces the one of the related known words in the used sentence with the new learned word.
    Type: Grant
    Filed: November 27, 2018
    Date of Patent: September 27, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Su Liu, Manjunath Ravi, Zhichao Li, Kai Liu
  • Patent number: 11456005
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: September 27, 2022
    Assignee: Google LLC
    Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
  • Patent number: 11443745
    Abstract: Included are: an apparatus function information acquiring unit for acquiring apparatus function information in which a target apparatus and one or more target functions to be executed by the target apparatus, which are determined on the basis of uttered speech, are associated with each other; a procedure determining unit for determining one or more manual operations for executing the one or more target functions and an order of the one or more manual operations on the basis of the apparatus function information acquired by the apparatus function information acquiring unit; and an operation command transmission controlling unit for sequentially transmitting, to the target apparatus, operation commands for outputting operation response output control information corresponding to each of the one or more manual operations in accordance with the order of the one or more manual operations determined by the procedure determining unit.
    Type: Grant
    Filed: October 21, 2020
    Date of Patent: September 13, 2022
    Assignee: MITSUBISHI ELECTRIC CORPORATION
    Inventors: Masato Hirai, Kenshiro Kitamura, Miho Ishikawa, Daisuke Iizawa
  • Patent number: 11443759
    Abstract: An information processing apparatus includes a memory storing instructions. The instructions cause the apparatus to extract a plurality of local features from data indicating a speech, the characteristics of feature extraction being formed through learning; and to encode a series of chronological features of the data based on the plurality of local features, characteristics of encoding the series of chronological features being formed through learning. The instructions also cause the apparatus to generate information obtained by weighting features at a specific point in time associated with emotion classification, of the series of chronological features encoded, characteristics of weighting the features at the specific point in time being formed through learning; and to classify emotion corresponding to the data using the information obtained by weighting the features at the specific point in time, characteristics of classification being formed through learning.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: September 13, 2022
    Assignee: HONDA MOTOR CO., LTD.
    Inventor: Yuanchao Li
  • Patent number: 11443730
    Abstract: A system, a computer program product, and method for controlling synthesized speech output on a voice-controlled device. A sensor is used to capture an image of a face of a person. A database of previously stored images of facial features is accessed. In response to i) not recognizing the at least one person the voice-controlled device selects a first set of conversational starters; ii) recognizing the person and recognizing previous communications with the person, the voice-controlled device selects a second set of conversational starters; iii) recognizing the person and not recognizing previous communications with the person, the voice-controlled device selects a third set of conversational starters; or iv) recognizing the at least one person and recognizing previous communications with the person selecting but do not know the person's name selecting a fourth set of conversational starters. The voice controlled device outputs the selected set of conversational starters.
    Type: Grant
    Filed: January 28, 2020
    Date of Patent: September 13, 2022
    Assignee: International Business Machines Corporation
    Inventors: Shang Qing Guo, Jonathan Lenchner
  • Patent number: 11430014
    Abstract: Systems and methods of facilitating transactions related to targeted or customized commercial offerings based on derived sentiment states are provided. The sentiment states are derived from digital representations such as images, videos and sound recordings.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: August 30, 2022
    Assignee: Nant Holdings IP, LLC
    Inventor: Patrick Soon-Shiong
  • Patent number: 11430207
    Abstract: Provided are a reception apparatus, a reception system, a reception method, and a storage medium that can naturally provide a personal conversation in accordance with a user without requiring the user to register the personal information thereof in advance. A disclosure includes a face information acquisition unit that acquires face information of a user; a conversation processing unit that acquires reception information including a content of conversation with the user; a face matching unit that matches, against the face information of one user, the face information registered in a user information database in which user information including the face information of the user and the reception information is registered; and a user information management unit that, when a result of matching of the face information performed by the face matching unit is unmatched, registers the user information of the one user to the user information database.
    Type: Grant
    Filed: June 8, 2017
    Date of Patent: August 30, 2022
    Assignee: NEC CORPORATION
    Inventors: Nobuaki Kawase, Makoto Igarashi
  • Patent number: 11423879
    Abstract: A technique for controlling a voice-enabled device using voice commands includes receiving an audio signal that is generated in response to a verbal utterance, generating a verbal utterance indicator for the verbal utterance based on the audio signal, selecting a first command for a voice-controlled application residing within the voice-enabled device based on the verbal utterance indicator, and transmitting the first command to the voice-controlled application as an input.
    Type: Grant
    Filed: July 18, 2017
    Date of Patent: August 23, 2022
    Assignee: Disney Enterprises, Inc.
    Inventor: William Valentine Zajac, III
  • Patent number: 11423881
    Abstract: According to an embodiment of the present disclosure, a method of updating a speech recognition model using a mobile agent in real-time comprises obtaining, in real-time, space type information for a particular space where the mobile agent is located, varying, in real-time, parameters of a speech recognition model used in the particular space based on the space type information, and performing a speech recognition service based on the speech recognition model including the varied parameters. Embodiments of the present disclosure may be related to artificial intelligence (AI) devices, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.
    Type: Grant
    Filed: September 17, 2019
    Date of Patent: August 23, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Jonghoon Chae
  • Patent number: 11422764
    Abstract: Augmented reality display systems, apparatuses, and methods are disclosed for enabling a wearer of an augmented reality optical display to continue wearing the same optical display while moving between different platforms or vehicles. Example embodiments include optical displays that use a wired connection to connect with each platform to minimize the electromagnetic signature of the system. Embodiments include changing the information displayed to the user depending on the type of vehicle to which the optical display is connected. Additional embodiment display information about weapon systems associated with the platform to which the optical display is connected.
    Type: Grant
    Filed: May 31, 2019
    Date of Patent: August 23, 2022
    Assignee: EPIC OPTIX, INC.
    Inventor: Ray Kwong
  • Patent number: 11417302
    Abstract: Apparatus, methods, and systems that operate to provide interactive streaming content identification and processing are disclosed. An example apparatus includes a classifier to determine an audio characteristic value representative of an audio characteristic in audio; a transition detector to detect a transition between a first category and a second category by comparing the audio characteristic value to a threshold value among a set of threshold values, the set of threshold values corresponding to the first category and the second category; and a context manager to control a device to switch from a first fingerprinting algorithm to a second fingerprinting algorithm different than the first fingerprinting algorithm, responsive to the detected transition between the first category and the second category.
    Type: Grant
    Filed: September 10, 2020
    Date of Patent: August 16, 2022
    Assignee: Gracenote, Inc.
    Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
  • Patent number: 11417329
    Abstract: A system for performing magnetic resonance tomography is disclosed. A control system creates a speech data stream from an acquired linguistic expression and generates a command library, which contains a selection of speech commands, to each of which one or more linguistic expressions are assigned. The selection of speech commands is loaded from a command database depending on a current system status of a magnetic resonance (MR) scanner. The control system applies a speech recognition algorithm to the speech data stream to determine whether a linguistic expression contained in the command library can be assigned to the speech data stream. If so, the acquired linguistic expression is recognized, a speech command from the command library assigned to the recognized linguistic expression is established, and a control command for controlling the MR scanner in accordance with the speech command is created.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: August 16, 2022
    Assignee: Siemens Healthcare GmbH
    Inventors: Rainer Schneider, Dirk Franger
  • Patent number: 11416741
    Abstract: A technique for constructing a model supporting a plurality of domains is disclosed. In the technique, a plurality of teacher models, each of which is specialized for different one of the plurality of the domains, is prepared. A plurality of training data collections, each of which is collected for different one of the plurality of the domains, is obtained. A plurality of soft label sets is generated by inputting each training data in the plurality of the training data collections into corresponding one of the plurality of the teacher models. A student model is trained using the plurality of the soft label sets.
    Type: Grant
    Filed: June 8, 2018
    Date of Patent: August 16, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Takashi Fukuda, Osamu Ichikawa, Samuel Thomas, Bhuvana Ramabhadran
  • Patent number: 11410401
    Abstract: The subject technology receives a selection of a selectable graphical item from a plurality of selectable graphical items, the selectable graphical item comprising an augmented reality content generator for applying a 3D effect, the 3D effect including at least one beautification operation. The subject technology captures image data and depth data using a camera. The subject technology applies, to the image data and the depth data, the 3D effect including the at least one beautification operation based at least in part on the augmented reality content generator, the beautification operation being performed as part of applying the 3D effect. The subject technology generates a 3D message based at least in part on the applied 3D effect including the at least one beautification operation. The subject technology renders a view of the 3D message based at least in part on the applied 3D effect including the at least one beautification operation.
    Type: Grant
    Filed: August 28, 2020
    Date of Patent: August 9, 2022
    Assignee: Snap Inc.
    Inventors: Kyle Goodrich, Samuel Edward Hare, Maxim Maximov Lazarov, Tony Mathew, Andrew James McPhee, Daniel Moreno, Dhritiman Sagar, Wentao Shang
  • Patent number: 11410646
    Abstract: A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.
    Type: Grant
    Filed: March 28, 2019
    Date of Patent: August 9, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Cengiz Erbas, Thomas Kollar, Avnish Sikka, Spyridon Matsoukas, Simon Peter Reavely
  • Patent number: 11410144
    Abstract: Embodiments disclosed herein describe intelligent e-book readers which provide a significant improvement over the conventional e-books that simply render static content. The intelligent e-book readers may customize a rendered e-book based on, for example, the reading level and preferences of the user, the user's social media profile and activity, and current events. Furthermore, the intelligent e-book reader may provide additional augmented reality (AR)/virtual reality (VR) content associated with one or more portions of the rendered e-book. The intelligent e-book reader may also facilitate virtual, real time communication between multiple users and experts. The intelligent e-book reader may also facilitate one or more users to provide feedback and suggestions to authors and future movie-makers. The intelligent e-book reader may automatically determine difficult portions of an e-book based on the virtual communications and/or real time eye-tracking of a user.
    Type: Grant
    Filed: January 25, 2021
    Date of Patent: August 9, 2022
    Assignee: MASSACHUSETTS MUTUAL LIFE INSURANCE COMPANY
    Inventors: Michal Knas, Payton A. Shubrick, Damon Ryan Depaolo, Jiby John
  • Patent number: 11403598
    Abstract: The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation.
    Type: Grant
    Filed: October 18, 2021
    Date of Patent: August 2, 2022
    Assignee: Korn Ferry
    Inventors: Thom Steinhoff, Panos S. Stamus, Bryan Ackermann, John Deyto
  • Patent number: 11398225
    Abstract: A method and apparatus for controlling a device are disclosed. The method includes: performing voice recognition on a received sound signal to obtain a voice recognition result; determining keywords using the voice recognition result; determining a target intelligent device having attribute information matched with the keywords from intelligent devices, where relationships between the intelligent devices and attribute information of the intelligent devices are constructed in advance, and the attribute information characterizes a device operation provided by the intelligent device corresponding to the attribute information; and controlling the target intelligent device to perform an operation indicated by the voice recognition result.
    Type: Grant
    Filed: September 25, 2019
    Date of Patent: July 26, 2022
    Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.
    Inventor: Fuxin Li
  • Patent number: 11398222
    Abstract: Provided is an artificial intelligence (AI) device for recognizing speech of user. The AI apparatus includes: a microphone; and a processor configured to: receive, via the microphone, a sound signal corresponding to speech of the user, recognize the speech from the sound signal using a language model, determine an intention of the user based on the recognition result, determine whether the determination of the intention is successful, obtain a user's application usage log if the determination of the intention is not successful, and update the language model using the obtained user's application usage log.
    Type: Grant
    Filed: August 13, 2019
    Date of Patent: July 26, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Jaehong Kim, Boseop Kim
  • Patent number: 11393462
    Abstract: A device with a microphone acquires audio data of a user's speech. That speech comprises utterances, that together comprise a session. The audio data is processed to determine sentiment data indicative of perceived emotional content of the speech as conveyed by individual utterances of the user. That information is then used to determine the emotional content of the session. For example, the information may include several words describing the overall and outlying emotions of the session. Numeric metrics may also be determined, such as activation and valence. A user interface may present the words and metrics to the user. The user may use this information to assess their state of mind, facilitate interactions with others, and so forth.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: July 19, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Narendra Gyanchandani, Bilyana Slavova, Daniel Kenneth Bone, Hanhan Wang, Njenga Kariuki
  • Patent number: 11393481
    Abstract: A method is described which decodes a downmix matrix for mapping a plurality of input channels of audio content to a plurality of output channels, the input and output channels being associated with respective speakers at predetermined positions relative to a listener position, wherein the downmix matrix is encoded by exploiting the symmetry of speaker pairs of the plurality of input channels and the symmetry of speaker pairs of the plurality of output channels. Encoded information representing the encoded downmix matrix is received and decoded for obtaining the decoded downmix matrix.
    Type: Grant
    Filed: September 23, 2019
    Date of Patent: July 19, 2022
    Inventors: Florin Ghido, Achim Kuntz, Bernhard Grill
  • Patent number: 11386901
    Abstract: An audio confirmation system includes a voice acquiring section configured to acquire a voice contained in a motion picture; a voice text producing section configured to produce a voice text based on the acquired voice; a determining section configured to determine whether or not the produced voice text and a caption text that is embedded in an image contained in the motion picture correspond to each other; and an outputting section configured to output a result of the determination of the determining section.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: July 12, 2022
    Assignee: Sony Interactive Entertainment Inc.
    Inventors: Masaomi Nishidate, Isamu Terasaka, Norihiro Nagai
  • Patent number: 11386889
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: July 12, 2022
    Assignee: Google LLC
    Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
  • Patent number: 11381670
    Abstract: An electronic device including circuitry configured to perform control in a manner that a Physical Layer Convergence Protocol (PLCP) header format is selected from a plurality of PLCP header formats; and append the selected PLCP header to a physical layer packet for transmission.
    Type: Grant
    Filed: August 17, 2020
    Date of Patent: July 5, 2022
    Assignee: SONY CORPORATION
    Inventors: Takeshi Itagaki, Tomoya Yamaura, Kazuyuki Sakoda, Masanori Sato
  • Patent number: 11373641
    Abstract: Embodiments of the present invention provide an intelligent interactive method and apparatus, a computer device and a computer readable storage medium, which solves problems that a deep intention of a user message cannot be analyzed in an intelligent interactive manner in the prior art and humanized interactive experiences cannot be provided. The intelligent interactive method includes: obtaining an emotion recognition result according to a user message, where the user message includes at least a user voice message; performing an intention analysis according to a text content of the user voice message to obtain corresponding basic intention information; and determining a corresponding interactive instruction according to the emotion recognition result and the basic intention information.
    Type: Grant
    Filed: May 16, 2019
    Date of Patent: June 28, 2022
    Assignee: Shanghai Xiaoi Robot Technology Co., Ltd.
    Inventors: Hui Wang, Shijing Yu, Pinpin Zhu
  • Patent number: 11373651
    Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script, storing desired attributes associated with the simulation file, retrieving the simulation file from the database and providing a user interface to conduct the voice analysis using the simulation file from the database, receiving one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.
    Type: Grant
    Filed: February 21, 2020
    Date of Patent: June 28, 2022
    Assignee: SALESBOOST, LLC
    Inventor: Margaret L Brooks
  • Patent number: 11373643
    Abstract: An output method includes obtaining voice information, determining whether the voice information is a voice request, in response to the voice information being the voice request, obtaining reply information for replying to the voice request, and supplemental information, and transmitting the reply information and the supplementary information to an output device for outputting. The supplemental information is information that needs to be outputted in association with the reply information.
    Type: Grant
    Filed: March 28, 2019
    Date of Patent: June 28, 2022
    Assignee: LENOVO (BEIJING) CO., LTD.
    Inventors: Wenlin Yan, Shifeng Peng
  • Patent number: 11366851
    Abstract: Computer systems and methods are provided for processing audio queries. An electronic device receives an audio clip and performs a matching process on the audio clip. The matching process includes comparing at least a portion of the audio clip to a plurality of reference audio tracks and identifying, based on the comparing, a first portion of a particular reference track that corresponds to the audio sample. Upon identifying the matching portion, the electronic device provides a backing track for playback which corresponds to the particular reference track, and an initial playback position of the backing track.
    Type: Grant
    Filed: December 18, 2019
    Date of Patent: June 21, 2022
    Assignee: Spotify AB
    Inventors: Marco Marchini, Nicola Montecchio
  • Patent number: 11361764
    Abstract: Systems and methods for device naming-indicator generation are disclosed. Friendly names for accessory devices, such as smart-home devices, may be utilized to generate formatted text data that includes capitalization and/or punctuation for the friendly names. The formatted text data may be utilized to generate tag data indicating attributes of the friendly name. The tag data and/or contextual data indicating historical usage of the accessory device may be utilized to generate naming indicator(s) for the accessory device. The naming indicator(s) may be utilized, for example, during target inference and/or for communicating with a user about the accessory device.
    Type: Grant
    Filed: January 3, 2019
    Date of Patent: June 14, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: David Y Zhao, Akshay Kumar, William Evan Welbourne
  • Patent number: 11363083
    Abstract: Methods and apparatus are disclosed for managing streamed audio communication sessions between user devices (50) configured to send streamed data indicative of received audio contributions from respective participants in a multiple-participant audio communication session via a communications network to one or more other user devices (50) for conversion to audio representations of said received audio contributions for other participants.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: June 14, 2022
    Assignee: BRITISH TELECOMMUNICATIONS public limited company
    Inventors: Ian Kegel, Karis Bailey, Martin Reed, Peter Hughes
  • Patent number: 11354089
    Abstract: A method for generating a user interface with a user interface device in a distributed automation system includes receiving a service message from a home automation device in the distributed automation system, identifying a state of a dialog manager of the user interface device in response to receiving the service message, and generating a natural language output message based at least in part on a device identifier parameter in the service message and a plurality of natural language templates stored in the memory in response to the dialog manager being in an idle state. The method further includes storing the service message in a priority queue in the memory based on a priority level parameter corresponding to the service message in response to the dialog manager being in an active state.
    Type: Grant
    Filed: December 9, 2016
    Date of Patent: June 7, 2022
    Assignee: Robert Bosch GmbH
    Inventors: Leah Nicolich-Henkin, Cory Henson, Joao P. Sousa
  • Patent number: 11355126
    Abstract: Provided are methods and systems to verify user identity for voice enabled devices. A voice input can instruct a voice enabled device to perform a plurality of functions/services that, depending on the function/service, may require additional user verification. Primary user verification can be performed by associating voice characteristics of the voice input to a profile associated with a user/user device. A signal (e.g., a BLE beacon) can be sent to the user device that causes the user device to perform secondary user verification. The secondary user verification can be based on a biometric input, passcode verification, authenticated message reply, for example. Based on the secondary user verification, an operational command associated with the voice input can be executed.
    Type: Grant
    Filed: January 24, 2018
    Date of Patent: June 7, 2022
    Assignee: Comcast Cable Communications, LLC
    Inventor: Franklyn Athias
  • Patent number: 11341970
    Abstract: A method of providing navigation directions includes receiving, at a user terminal, a query spoken by a user, wherein the query spoken by the user includes a speech utterance indicating (i) a category of business, (ii) a name of the business, and (iii) a location at which or near which the business is disposed; identifying, by processing hardware, the business based on the speech utterance; and providing navigation directions to the business via the user terminal.
    Type: Grant
    Filed: June 8, 2020
    Date of Patent: May 24, 2022
    Assignee: GOOGLE LLC
    Inventors: Brian Strope, Francoise Beaufays, William J. Byrne