Patents Examined by Daniel A Abebe
  • Patent number: 11676591
    Abstract: In some examples, a method is disclosed. The method includes detecting, by a smart device, an audible utterance of a trigger word. The method also includes, responsive to the detection of the audible utterance of the trigger word, recording audio via the smart device. The method also includes processing, via the smart device, the recorded audio to determine whether the recorded audio contains a command for the smart device or a different smart device to perform an action. The method also includes, responsive to determining that the recorded audio includes a command for the smart device or a different smart device to perform the action, determining whether the command is serviceable by the smart device without involvement of the different smart device. The method also includes, responsive to determining whether the command is serviceable by the smart device without involvement of the different smart device, taking action regarding the command.
    Type: Grant
    Filed: November 20, 2020
    Date of Patent: June 13, 2023
    Assignee: T-MOBITE INNOVATIONS LLC
    Inventors: Christopher Callender, Brian Kuntz, Lyle W. Paczkowski, Michael D. Svoren, Jr.
  • Patent number: 11676622
    Abstract: An audio processing system (100) accepts an audio bitstream having one of a plurality of predefined audio frame rates. The system comprises a front-end component (110), which receives a variable number of quantized spectral components, corresponding to one audio frame in any of the predefined audio frame rates, and performs an inverse quantization according to predetermined, frequency-dependent quantization levels. The front-end component may be agnostic of the audio frame rate. The audio processing system further comprises a frequency-domain processing stage (120) and a sample rate converter (130), which provide a reconstructed audio signal sampled at a target sampling frequency independent of the audio frame rate. By its frame-rate adaptability, the system can be configured to operate frame-synchronously in parallel with a video processing system that accepts plural video frame rates.
    Type: Grant
    Filed: June 10, 2021
    Date of Patent: June 13, 2023
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Heiko Purnhagen, Kristofer Kjoerling, Alexander Stahlmann, Jens Popp, Karl Jonas Roeden
  • Patent number: 11670312
    Abstract: A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.
    Type: Grant
    Filed: July 2, 2021
    Date of Patent: June 6, 2023
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Schnell, Manfred Lutzky, Eleni Fotopoulou, Konstantin Schmidt, Conrad Benndorf, Adrian Tomasek, Tobias Albert, Timon Seidl
  • Patent number: 11670301
    Abstract: Techniques for lip-reading session triggering events are described. A computing device is equipped with lip-reading capability that enables the device to “read the lips” (i.e., facial features) of a user. The computing device determines when a triggering event occurs to automatically cause the computing device to switch from one input type to a lip-reading session. Lip-reading is also used in conjunction with other types of inputs to improve accuracy of the input. Machine learning is used to personalize the lip-reading capability of the computing device for a particular user.
    Type: Grant
    Filed: June 14, 2021
    Date of Patent: June 6, 2023
    Assignee: eBay Inc.
    Inventor: Neeraj Gupta
  • Patent number: 11663416
    Abstract: A software agent, that is used to assist in providing a service, receives communications from a set of users that are attempting to use the software agent. The communications include communications that are interacting with the software agent, and communications that are not interacting with the software agent. The software agent performs natural language processing on all communications to identify such things as user sentiment, user concerns or other items in the content of the messages, and also to identify actions taken by the users in order to obtain a measure of user satisfaction with the software agent. One or more action signals are then generated based upon the identified user satisfaction with the software agent.
    Type: Grant
    Filed: December 9, 2020
    Date of Patent: May 30, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Benjamin Gene Cheung, Andres Monroy-Hernandez, Todd Daniel Newman, Mayerber Loureiro De Carvalho Neto, Michael Brian Palmer, Pamela Bhattacharya, Justin Brooks Cranshaw, Charles Yin-Che Lee
  • Patent number: 11657812
    Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: May 23, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Christo Frank Devaraj, Brian Oliver, Sumedha Arvind Kshirsagar, Gregory Michael Hart, Ran Mokady
  • Patent number: 11657832
    Abstract: A speech-capture device can capture audio data during wakeword monitoring and use the audio data to determine if a user is present nearby the device, even if no wakeword is spoken. Audio such as speech, human originating sounds (e.g., coughing, sneezing), or other human related noises (e.g., footsteps, doors closing) can be used to detect audio. Audio frames are individually scored as to whether a human presence is detected in the particular audio frames. The scores are then smoothed relative to nearby frames to create a decision for a particular frame. Presence information can then be sent according to a periodic schedule to a remote device to create a presence “heartbeat” that regularly identifies whether a user is detected proximate to a speech-capture device.
    Type: Grant
    Filed: September 16, 2020
    Date of Patent: May 23, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Shiva Kumar Sundaram, Chao Wang, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Arindam Mandal
  • Patent number: 11646033
    Abstract: Method starts with processing, by a processor, audio signal to generate audio caller utterance and transcribed caller utterance. Processor generates identified task based on transcribed caller utterance. Processor samples audio caller utterance to generate samples of audio caller utterance. Processor generates loudness result based on loudness values of samples using loudness neural network associated with identified task. Processor generates pitch result based on pitch values of samples using pitch neural network associated with identified task. Processor generates tone result for each word in transcribed caller utterance using tone neural network associated with identified task. Using task completion probability neural network associated with identified task, processor generates task completion probability result that is based on at least one of: loudness result, pitch result, or tone result. Other embodiments are disclosed herein.
    Type: Grant
    Filed: June 7, 2021
    Date of Patent: May 9, 2023
    Assignee: Express Scripts Strategic Development, Inc.
    Inventors: Christopher M. Myers, Danielle L. Smith
  • Patent number: 11626123
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Grant
    Filed: October 23, 2020
    Date of Patent: April 11, 2023
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Patent number: 11627221
    Abstract: A call captioning system for captioning a hearing user's (HU's) voice signal during an ongoing call with an assisted user (AU) includes: an AU communication device with a display screen and a caption service activation feature, and a first processor programmed to, during an ongoing call, receive the HU's voice signal. Prior to activating the caption service via the activation feature, the processor uses an automated speech recognition (ASR) engine to generate HU voice signal captions, detect errors in the HU voice signal captions, use the errors to train the ASR software to the HU's voice signal to increase accuracy of the HU captions generated by the ASR engine; and store the trained ASR engine for subsequent use. Upon activating the caption service during the ongoing call, the processor uses the trained ASR engine to generate HU voice signal captions and present them to the AU via the display screen.
    Type: Grant
    Filed: June 25, 2020
    Date of Patent: April 11, 2023
    Assignee: ULTRATEC, INC.
    Inventors: Robert M. Engelke, Kevin R. Colwell, Christopher R. Engelke
  • Patent number: 11620456
    Abstract: Systems and methods of the invention determine evasiveness of postings and manage chat sessions accordingly. In embodiments, a method includes accessing a real-time text-based discourse session comprised of multiple text-based posts published by participants, the posts including a question from an author and responses from at least one respondent; determining relationships between words in the text-based discourse session utilizing corpus linguistics analysis; determining a frequency of the responses of the at least one respondent over time; determining an evasiveness score for each of the responses based on natural language processing of the responses, wherein each of the evasiveness scores indicate a level of relevance of a response with respect to the question; determining rankings for each of the responses based on the determined relationships of words, the frequency of the responses, and the evasiveness scores; and determining a display order for the responses based on the rankings of the responses.
    Type: Grant
    Filed: April 27, 2020
    Date of Patent: April 4, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Zachary A. Silverstein, Trudy L. Hewitt, Jonathan D. Dunne, Liam S. Harpur
  • Patent number: 11621013
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Grant
    Filed: October 11, 2022
    Date of Patent: April 4, 2023
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Patent number: 11615778
    Abstract: A device may be configured to parse a syntax element specifying the number of available languages within a presentation associated with an audio stream. A device may be configured to parse one or more syntax elements identifying each of the available languages and parse an accessibility syntax element for each language within the presentation.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: March 28, 2023
    Assignee: SHARP KABUSHIKI KAISHA
    Inventors: Kiran Mukesh Misra, Sachin G. Deshpande, Sheau Ng, Christopher Andrew Segall
  • Patent number: 11615796
    Abstract: An information processing apparatus, an information processing system, and an information processing method. The information processing apparatus identifies a work target and work content of work, based on voice data sent from a terminal for inputting utterance about the work by a worker, updates work implementation status information indicating work implementation status of the work stored in a memory based on the work target and work content of the work that are identified, and controls to display the work implementation status of the work based on the work implementation status information on a display terminal connected through a network.
    Type: Grant
    Filed: October 23, 2020
    Date of Patent: March 28, 2023
    Assignee: Ricoh Company, Ltd.
    Inventor: Tatsuo Ito
  • Patent number: 11610585
    Abstract: Methods and systems for rendering lists of instructions and performing actions associated with those lists are described herein. In some embodiments, an individual may request that a voice activated electronic device associated with their user account assist in performing a task using a list of instructions. The list of instructions may include metadata that indicates actions capable of being performed by additional Internet of Things (“IoT”) devices. When the instructions are rendered, an instructions speechlet may recognize the metadata and may cause one or more of the IoT devices to perform a particular action. Furthermore, the metadata may also correspond to content capable of being rendered by the voice activated electronic device to assist the individual in performing a particular step of the instructions.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: March 21, 2023
    Assignee: Amazon Technologies, Inc.
    Inventor: Manoj Sindhwani
  • Patent number: 11610595
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Grant
    Filed: November 22, 2021
    Date of Patent: March 21, 2023
    Assignee: Dolby International AB
    Inventors: Barbara Resch, Kristofer Kjörling, Lars Villemoes
  • Patent number: 11605395
    Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal. Other embodiments are disclosed.
    Type: Grant
    Filed: February 6, 2020
    Date of Patent: March 14, 2023
    Assignee: Staton Techiya, LLC
    Inventors: John Usher, Dan Ellis
  • Patent number: 11605391
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Grant
    Filed: October 11, 2022
    Date of Patent: March 14, 2023
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Patent number: 11600291
    Abstract: This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: March 7, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Gang Lan, Joseph Pedro Tavares, Deepak Uttam Shah, Mckay Clawson, Vijay Shankar Tennety, Ravi Kiran Rachakonda, Venkata Snehith Cherukuri, Charles James Torbert
  • Patent number: 11586828
    Abstract: Methods, systems, and computer program product for automatically performing sentiment analysis on texts, such as telephone call transcripts and electronic written communications. Disclosed techniques include, inter alia, lexicon training, handling of negations and shifters, pruning of lexicons, confidence calculation for token orientation, supervised customization, lexicon mixing, and adaptive segmentation.
    Type: Grant
    Filed: August 25, 2020
    Date of Patent: February 21, 2023
    Inventors: Amir Lev-Tov, Avraham Faizakof, Arnon Mazza, Yochai Konig