Patents Examined by Leonard Saint-Cyr
  • Patent number: 11869485
    Abstract: The present disclosure discloses a method for generating a styled sentence by a computer device. The method includes: obtaining a to-be-converted natural sentence, classifying, by inputting the natural sentence into a first encoding model, having a classification capability of classifying the natural sentence into a target content vector and a style vector of the natural sentence, the target content vector indicating a meaning of the natural sentence, and the style vector of the natural sentence indicating a language style of the natural sentence. The method also include determining, from at least one style vector according to a set target language style, a target style vector corresponding to the target language style; and inputting the target content vector and the target style vector into a first decoding model, and generating a styled sentence corresponding to the natural sentence.
    Type: Grant
    Filed: May 5, 2022
    Date of Patent: January 9, 2024
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Xiaojiang Liu
  • Patent number: 11863806
    Abstract: Systems and methods are described to address shortcomings in conventional systems by correcting an erroneous term in on-screen caption text for a media asset. In some aspects, the systems and methods identify the erroneous term in a text segment of the on-screen caption text, and identify one or more video frames of the media asset corresponding to the text segment. The systems and methods further identify a contextual term related to the erroneous term from the one or more video frames. By accessing a knowledge graph, the systems and methods identify a candidate correction based on the contextual term and a portion of the text segment. Lastly, the systems and methods replaces the erroneous term with the candidate correction.
    Type: Grant
    Filed: October 5, 2020
    Date of Patent: January 2, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: Ajay Kumar Gupta, Abhijit Satchidanand Savarkar
  • Patent number: 11861314
    Abstract: Medical records may be analyzed to identify important items in the text of the medical record. Actionable content may be identified and may be emphasized or extracted from the medical record. Actionable content may be categorized into one or more categories. Identification may include processing using trained models that use contextual information and position information to determine sentence labels.
    Type: Grant
    Filed: April 2, 2021
    Date of Patent: January 2, 2024
    Assignee: ASAPP, INC.
    Inventors: Yada Pruksachatkun, Sean Adler, Thomas Gregory McKelvey, Jr., Jordan Louis Swartz, Hui Dai, Yi Yang, David Sontag, Jennifer Marie Seale
  • Patent number: 11854547
    Abstract: In one aspect, a playback device includes a voice assistant service (VAS) wake-word engine and a command keyword engine. The playback device detects, via the command keyword engine, a first command keyword of in voice input of sound detected by one or more microphones of the playback device. The playback device determines an intent based on at least one keyword in the voice input via a local natural language unit (NLU). After detecting the first command keyword event and determining the intent, the playback device performs a first playback command corresponding to the first command keyword and according to the determined intent. When the playback device detects, via the wake-word engine, a wake-word in voice input, the playback device streams sound data corresponding to at least a portion of the voice input to one or more remote servers associated with the VAS.
    Type: Grant
    Filed: December 13, 2021
    Date of Patent: December 26, 2023
    Assignee: Sonos, Inc.
    Inventors: Connor Smith, John Tolomei, Kurt Soto
  • Patent number: 11848011
    Abstract: Disclosed herein are embodiments of systems and methods for facilitating interpretation services by continuously generating a real time speed of speech for a set of spoken words within an electronic communication session. A processor continuously generates the real time speed of speech by iteratively generating the real time speed of speech upon parsing each spoken word from audio input signals of the electronic communication session. Iteratively generating the real time speed of speech may be based on a frequency of a subset of the set of spoken words within a preceding time interval of each spoken word. The systems and method may further determine a threshold based on an attribute of the set of spoken words. The method may display on a graphical user interface a graphical indicator having a visual feature that changes when the continuously generated real time speed of speech achieves the threshold.
    Type: Grant
    Filed: June 2, 2021
    Date of Patent: December 19, 2023
    Assignee: Kudo, Inc.
    Inventor: Claudio Fantinuoli
  • Patent number: 11842149
    Abstract: A method for maintenance of a machine among a fleet of machines includes receiving a service request corresponding to the machine. The method also includes obtaining a service architecture corresponding to the fleet of machines. The service architecture includes a service dictionary and a plurality of classification schemes organized in a tree data structure. The method also includes processing the service request based on the service dictionary and a text parsing technique to generate a list of descriptive words. The method includes generating a recommendation based on the list of descriptive words and the service architecture. The recommendation includes at least one of an on-line repair activity, an on-site repair activity and a part replacement activity. The method also includes servicing the fault condition of the machine based on the recommendation.
    Type: Grant
    Filed: February 28, 2019
    Date of Patent: December 12, 2023
    Assignee: General Electric Company
    Inventors: Tapan Shah, Karthika Ravigopal Nair, Mathews Matson Chavarukattil, Sridhar Venkataraman Dasaratha, Shailendra Singh, Siva Sateesh Irinki
  • Patent number: 11820394
    Abstract: A device control apparatus includes a detector which detects sound; and a controller which controls a plurality of devices. The plurality of devices includes a first device which is not related to operation of a vehicle and a second device which is related to the operation of the vehicle. The controller is configured to: recognize voice of a user by using data of the sound detected by the detector; identify a type of an operation target device from a plurality of the devices and operation content of the operation target device, based on the voice recognized; notify a user of start of activating the first device by the operation content identified and activate the first device by the operation content identified when the operation target identified is the first device; and notify the user to request a response before the controller activates the second device by the operation content identified when the operation target identified is the second device.
    Type: Grant
    Filed: April 20, 2018
    Date of Patent: November 21, 2023
    Assignee: Nissan Motor Co., Ltd.
    Inventors: Shota Ohkubo, Hirofumi Inoue, Jo Nishiyama, Takehito Teraguchi, Yu Shikoda
  • Patent number: 11817080
    Abstract: Processor(s) of a client device can: receive audio data that captures a spoken utterance of a user of the client device; process, using an on-device speech recognition model, the audio data to generate a predicted textual segment that is a prediction of the spoken utterance; cause at least part of the predicted textual segment to be rendered (e.g., visually and/or audibly); receive further user interface input that is a correction of the predicted textual segment to an alternate textual segment; and generate a gradient based on comparing at least part of the predicted output to ground truth output that corresponds to the alternate textual segment. The gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model and/or is transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
    Type: Grant
    Filed: October 11, 2019
    Date of Patent: November 14, 2023
    Assignee: GOOGLE LLC
    Inventors: Françoise Beaufays, Johan Schalkwyk, Giovanni Motta
  • Patent number: 11817097
    Abstract: An electronic apparatus is provided. The electronic apparatus includes a communicator, a memory, and a processor connected to the communicator and the memory and configured to control the electronic apparatus. The processor is configured to, by executing at least one command stored in the memory, based on a user input for executing an assistant service being received, transmit information on a user voice acquired by the electronic apparatus to a plurality of servers providing different assistant services through the communicator, and based on a plurality of response information being received from the plurality of servers, provide a response on the user voice based on at least one of the plurality of response information. The plurality of servers provide the assistant service using an artificial intelligence agent.
    Type: Grant
    Filed: March 3, 2022
    Date of Patent: November 14, 2023
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Wonnam Jang, Sooyeon Kim, Sungrae Jo
  • Patent number: 11809992
    Abstract: Neural networks with similar architectures may be compressed using shared compression profiles. A request to compress a trained neural network may be received and an architecture of the neural network identified. The identified architecture may be compared with the different network architectures mapped to compression profiles to select a compression profile for the neural network. The compression profile may be applied to remove features of the neural network to generate a compressed version of the neural network.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: November 7, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Gurumurthy Swaminathan, Ragav Venkatesan, Xiong Zhou, Runfei Luo, Vineet Khare
  • Patent number: 11810586
    Abstract: A noise cancellation method including generating a first voice signal by canceling a first portion of noise included in an input voice signal using a first network, the first network being a trained u-net structure, and the first portion of the noise being in a time domain, applying a first window to the first voice signal, performing a fast Fourier transform on the first windowed voice signal to acquire a magnitude signal and a phase signal, acquiring a mask using a second network based on the magnitude signal, the second network being another trained u-net structure, applying the mask to the magnitude signal, generating a second voice signal by canceling a second portion of the noise by performing an inverse fast Fourier transform on the first windowed voice signal based on the masked magnitude signal and the phase signal, and applying a second window to the second voice signal.
    Type: Grant
    Filed: August 5, 2021
    Date of Patent: November 7, 2023
    Assignee: LINE PLUS CORPORATION
    Inventors: Ki Jun Kim, JongHewk Park
  • Patent number: 11797780
    Abstract: A method includes receiving a set of text documents. The method also includes generating a summary of the set of text documents by a set of large language machine learning models. The method further includes generating a set of keywords from the summary by the set of large language machine learning models. The method additionally includes generating an image prompt from the set of keywords by the set of large language machine learning models. The method also includes generating a set of images from the image prompt by a text-to-image machine learning model. The method further includes generating a video clip from the set of images. The method additionally includes presenting the video clip.
    Type: Grant
    Filed: October 31, 2022
    Date of Patent: October 24, 2023
    Assignee: Intuit Inc.
    Inventors: Corinne Finegan, Richard Becker, Sanuree Gomes
  • Patent number: 11798528
    Abstract: Systems and methods for providing notifications without breaking media immersion. A notification delivery application receives notification data while a media device provides a media asset. In response to receiving the notification data while the media device provides the media asset, the notification delivery application generates a voice model based on a voice detected in the media asset. The notification delivery application converts the notification data to synthesized speech using the voice model and generates, by the media device, the synthesized speech for output at an appropriate point in the media asset based on contextual features of the media asset.
    Type: Grant
    Filed: October 8, 2021
    Date of Patent: October 24, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Vikram Makam Gupta, Prateek Varshney, Madhusudhan Seetharam, Ashish Kumar Srivastava, Harshith Kumar Gejjegondanahally Sreekanth
  • Patent number: 11783820
    Abstract: A system includes one or more field terminals, which can accept information inputs in voice and, optionally, other formats. The field terminals are equipped to translate voice inputs and to populate files according to specified requirements. Artificial intelligence algorithms augment speech to text translation thereby reducing translation errors and decreasing computing time. The field terminal completes the translation, populates the required document according to specified protocols and optionally attaches or associates information such as photos, location, video, maps or any other information that may be useful for the intended resource recipient(s). The field terminal comprises a transceiver, and transmits the file to the intended recipients.
    Type: Grant
    Filed: September 15, 2020
    Date of Patent: October 10, 2023
    Assignee: Squire Solutions, Inc.
    Inventors: Kyle Jeffrey Nehman, Dennis Alan Underwood, Jr., Jeremy Brett Whitsitt
  • Patent number: 11783824
    Abstract: A speech-processing system may provide access to one or more virtual assistants via an audio-controlled device. A user may leverage a first virtual assistant to translate a natural language command from a first language into a second language, which the device can send to a second virtual assistant for processing. The device may receive a command from a user and send input data representing the command to a first speech-processing system representing the first virtual assistant. The device may receive a response in the form of a first natural language output from the first speech-processing system along with an indication that the first natural language output should be directed to a second speech-processing system representing the second virtual assistant. For example, the command may be in the first language, and the first natural language output may be in the second language, which is understandable by the second speech-processing system.
    Type: Grant
    Filed: February 5, 2021
    Date of Patent: October 10, 2023
    Assignee: Amazon Technologies, Inc.
    Inventor: Robert John Mars
  • Patent number: 11783830
    Abstract: Various embodiments contemplate systems and methods for performing automatic speech recognition (ASR) and natural language understanding (NLU) that enable high accuracy recognition and understanding of freely spoken utterances which may contain proper names and similar entities. The proper name entities may contain or be comprised wholly of words that are not present in the vocabularies of these systems as normally constituted. Recognition of the other words in the utterances in question, e.g. words that are not part of the proper name entities, may occur at regular, high recognition accuracy. Various embodiments provide as output not only accurately transcribed running text of the complete utterance, but also a symbolic representation of the meaning of the input, including appropriate symbolic representations of proper name entities, adequate to allow a computer system to respond appropriately to the spoken request without further analysis of the user's input.
    Type: Grant
    Filed: May 26, 2021
    Date of Patent: October 10, 2023
    Assignee: Promptu Systems Corporation
    Inventor: Harry William Printz
  • Patent number: 11783845
    Abstract: A method for processing sound that includes, generating one or more noise component estimates relating to an electrical representation of the sound and generating an associated confidence measure for the one or more noise component estimates. The method further comprises processing, based on the confidence measure, the sound.
    Type: Grant
    Filed: August 18, 2021
    Date of Patent: October 10, 2023
    Assignee: Cochlear Limited
    Inventors: Stefan J. Mauger, Adam A. Hersbach, Pam W. Dawson, John M. Heasman
  • Patent number: 11778303
    Abstract: A voice-activated camera system for a computing device. The voice-activated camera system includes a processor, a camera module, a speech recognition module and a microphone for accepting user voice input. The voice-activated camera system includes authorized for only a specific user's voice, so that a camera function may be performed when the authorized user speaks the keyword, but the camera function is not performed when an unauthorized user speaks the keyword.
    Type: Grant
    Filed: June 14, 2021
    Date of Patent: October 3, 2023
    Inventor: Jesse L. Wobrock
  • Patent number: 11776536
    Abstract: Systems and methods of the present technical solution enable a multi-modal interface for voice-based devices, such as digital assistants. The solution can enable a user to interact with video and other content through a touch interface and through voice commands. In addition to inputs such as stop and play, the present solution can also automatically generate annotations for displayed video files. From the annotations, the solution can identify one or more break points that are associated with different scenes, video portions, or how-to steps in the video. The digital assistant can receive input audio signal and parse the input audio signal to identify semantic entities within the input audio signal. The digital assistant can map the identified semantic entities to the annotations to select a portion of the video that corresponds to the users request in the input audio signal.
    Type: Grant
    Filed: July 8, 2020
    Date of Patent: October 3, 2023
    Assignee: GOOGLE LLC
    Inventors: Masoud Loghmani, Anshul Kothari, Ananth Devulapalli
  • Patent number: 11769510
    Abstract: This application relates to microphone authentication apparatus for verifying whether or not an audio signal originated at a microphone. The microphone authentication apparatus has a comparison block configured to receive a first signal indicative of one or more spectral parameters of at least part of an audio signal to be verified, and compare the one or more spectral parameters to one or more predetermined characteristic microphone parameters relating to a characteristic resonance associated with an acoustic port of a microphone. The first signal may be an audio signal and the microphone authentication apparatus may have a feature extract module for determining the spectral parameters. Based on the comparison determination block may whether the audio signal originated from a microphone and may send a verification signal to a voice biometric module.
    Type: Grant
    Filed: May 21, 2020
    Date of Patent: September 26, 2023
    Assignee: Cirrus Logic Inc.
    Inventor: John Paul Lesso