Patents Examined by Nicole A K Schmieder

Hotword-aware speech synthesis

Patent number: 11308934

Abstract: A method includes receiving text input data for conversion into synthesized speech and determining, using a hotword-aware model trained to detect a presence of a hotword assigned to a user device, whether a pronunciation of the text input data includes the hotword. The hotword is configured to initiate a wake-up process on the user device for processing the hotword and/or one or more other terms following the hotword in the audio input data. When the pronunciation of the text input data includes the hotword, the method also includes generating an audio output signal from the text input data and providing the audio output signal to an audio output device to output the audio output signal. The audio output signal, when captured by an audio capture device of the user device, is configured to prevent initiation of the wake-up process on the user device.

Type: Grant

Filed: June 25, 2018

Date of Patent: April 19, 2022

Assignee: Google LLC

Inventors: Matthew Sharifi, Aleksandar Kracun

In-vehicle device, non-transitory computer-readable medium storing program, and control method for the control of a dialogue system based on vehicle acceleration

Patent number: 11282517

Abstract: An information processing apparatus includes an acquisition unit configured to acquire acceleration of a vehicle, a controller configured to execute a speech dialogue with a driver of the vehicle, an input unit configured to receive a speech input by the driver, and an output unit configured to execute a speech output to the driver. The controller dynamically controls a response time in the speech dialogue with the driver based on the acceleration of the vehicle.

Type: Grant

Filed: April 2, 2019

Date of Patent: March 22, 2022

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Ko Koga, Hideo Hasegawa

Computer-based systems for administering patterned passphrases

Patent number: 11227610

Abstract: This disclosure describes computer-based techniques for administering a spoken patterned passphrase. A passphrase processing unit running on an administrator computer generates passphrase data for a secure system using acoustic data and video data representing a spoken phrase by a speaker. This passphrase includes a pattern of words or speech segments that are audible and words or speech segments that are inaudible. During authentication, a passphrase administration unit on the administrator computer receives acoustic and visual data of a spoken phrase by a person attempting to access the secure system and evaluates whether the spoken phrase includes the pattern of audible and inaudible words or speech segments associated with the account. In this way, the techniques discussed herein may enable the administrator computer to administer spoken passphrases with an additional degree of protection compared to a system that is limited to using linguistic or biometric content in passwords or passphrases.

Type: Grant

Filed: April 16, 2019

Date of Patent: January 18, 2022

Assignee: Wells Fargo Bank, N.A.

Inventors: Kristine Ing Kushner, John T. Wright

Systems and methods for disambiguating a voice search query based on gestures

Patent number: 11227593

Abstract: Systems and methods are described herein for disambiguating a voice search query by determining whether the user made a gesture while speaking a quotation from a content item and whether the user mimicked or approximated a gesture made by a character in the content item when the character spoke the words quoted by the user. If so, a search result comprising an identifier of the content item is generated. A search result representing the content item from which the quotation comes may be ranked highest among other search results returned and therefore presented first in a list of search results. If the user did not mimic or approximate a gesture made by a character in the content item when the quotation is spoken in the content item, then a search result may not be generated for the content item or may be ranked lowest among other search results.

Type: Grant

Filed: June 28, 2019

Date of Patent: January 18, 2022

Assignee: ROVI GUIDES, INC.

Inventors: Ankur Aher, Nishchit Mahajan, Narendra Purushothama, Sai Durga Venkat Reddy Pulikunta

Conversation dependent volume control

Patent number: 11211080

Abstract: Techniques are described for detecting a conversation between at least two people, and for reducing noise during the conversation. In certain embodiments, at least one speech metric is generated based on spectral analysis of an audio signal and is used to determine that the audio signal represents speech from a first person. Responsive to determining that the speech is part of a conversation between the first person and a second person, an operating state of a device in a physical environment is adjusted such that a volume level of sound contributed by or associated with the device is reduced. The sound contributed by or associated with the device corresponds to noise, at least for the duration of the conversation. Therefore, reducing the volume level of sound contributed by or associated with the device reduces the overall noise level in the environment, resulting in a reduction in conversational effort.

Type: Grant

Filed: December 18, 2019

Date of Patent: December 28, 2021

Inventors: Brandon Hook, Daniel Soberal

Contextual disambiguation of an entity in a conversation management system

Patent number: 11205048

Abstract: Systems, computer-implemented methods, and computer program products that can facilitate word entity disambiguation are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a language model component that employs an artificial intelligence model to generate a profile vector of an entity based on one or more binary values representing profile data of the entity and a word vector of a word entity in a dialogue based on one or more second word entities adjacent to the word entity in the dialogue. The computer executable components can further comprise a dialogue management component that disambiguates the word entity based on the profile vector and the word vector.

Type: Grant

Filed: June 18, 2019

Date of Patent: December 21, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sunhwan Lee, Shun Jiang, Chung-hao Tan, Lei Huang, Pawan Chowdhary

Automated identification and classification of complaint-specific user interactions using a multilayer neural network

Patent number: 11194962

Abstract: Methods and apparatuses are described in which unstructured computer text is analyzed for identification and classification of complaint-specific user interactions. A data store receives unstructured computer text corresponding to current user interactions. A server filters the unstructured computer text to identify messages that comprise a potential complaint.

Type: Grant

Filed: June 5, 2019

Date of Patent: December 7, 2021

Assignee: FMR LLC

Inventors: Indraneel Biswas, Nicholas Wilcox

Method and apparatus for differentiating between human and electronic speaker for voice interface security

Patent number: 11176960

Abstract: A system for distinguishing between a human voice generated command and an electronic speaker generated command is provided. An exemplary system comprises a microphone array for receiving an audio signal collection, preprocessing circuitry configured for converting the audio signal collection into processed recorded audio signals, energy balance metric determination circuitry configured for calculating a final energy balance metric based on the processed recorded audio signals, and energy balance metric evaluation circuitry for outputting a command originator signal based at least in part on the final energy balance metric.

Type: Grant

Filed: June 18, 2019

Date of Patent: November 16, 2021

Assignee: University of Florida Research Foundation, Incorporated

Inventors: Patrick G. Traynor, Logan E. Blue, Luis Vargas

Responsive automatic gain control

Patent number: 11164592

Abstract: A system that performs automatic gain control (AGC) using different decay rates. The system may select a slow decay rate to track a loudness level within speech (e.g., within an utterance), improving audio quality and maintaining dynamic range for an individual voice, while selecting a fast decay rate to track the loudness level after a gap of silence (e.g., no voice activity detected for a duration of time) or during large level changes (e.g., actual speech loudness is lower than estimated speech loudness for a duration of time). This improves an accuracy of the loudness estimate and therefore a responsiveness of the automatic gain control, resulting in an improved user experience.

Type: Grant

Filed: May 9, 2019

Date of Patent: November 2, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Biqing Wu, Phil Hetherington, Carlo Murgia, Rong Hu
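
The two-rate idea in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `LoudnessTracker`, the helper `gain_for`, the per-frame decay multipliers, and every threshold are hypothetical values chosen only for illustration. The estimate decays slowly inside an utterance and switches to the fast rate after a gap of silence or when measured speech stays well below the estimate for a while.

```python
from dataclasses import dataclass


@dataclass
class LoudnessTracker:
    """Toy loudness estimator with a slow and a fast decay rate (all values assumed)."""
    slow_decay: float = 0.999  # per-frame multiplier used inside an utterance
    fast_decay: float = 0.9    # per-frame multiplier used after a gap or level drop
    gap_frames: int = 20       # silent frames that count as a gap of silence
    drop_frames: int = 30      # frames of low speech that count as a level change
    drop_ratio: float = 0.5    # "low" means level < drop_ratio * estimate
    estimate: float = 0.0
    silent_run: int = 0
    low_run: int = 0

    def update(self, level: float, voice_active: bool) -> float:
        # Track how long we have been silent, or speaking well below the estimate.
        self.silent_run = 0 if voice_active else self.silent_run + 1
        is_low = voice_active and level < self.drop_ratio * self.estimate
        self.low_run = self.low_run + 1 if is_low else 0

        if voice_active and level >= self.estimate:
            self.estimate = level  # follow rising speech immediately
            return self.estimate

        # Pick the decay rate: fast after a gap or a sustained drop, slow otherwise.
        use_fast = self.silent_run >= self.gap_frames or self.low_run >= self.drop_frames
        self.estimate *= self.fast_decay if use_fast else self.slow_decay
        if voice_active:
            self.estimate = max(self.estimate, level)  # never fall below current speech
        return self.estimate


def gain_for(estimate: float, target: float = 0.5, max_gain: float = 8.0) -> float:
    """Gain that moves the estimated loudness toward the target, capped at max_gain."""
    return max_gain if estimate <= 0 else min(max_gain, target / estimate)
```

Calling `update` once per audio frame with a measured level and a voice-activity flag keeps the estimate nearly flat across short pauses, while a long pause or a sustained drop in speech level pulls it down quickly so the gain can respond.
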
Embedding natural language context in structured documents using document anatomy

Patent number: 11151323

Abstract: Methods, systems and computer program products for natural language context embedding are provided herein. A computer-implemented method includes extracting a document anatomy and document elements from a given structured document, identifying semantic references in the given structured document, and generating an ontology comprising (i) a hierarchy of concepts and (ii) relations connecting the concepts, each concept comprising attributes for a document element. The computer-implemented method also includes generating natural language text context for a given document element by utilizing the ontology to combine (i) attributes of a given concept corresponding to the given document element with (ii) attributes of another concept, the other concept corresponding to another document element, the other concept being connected to the given concept by at least one relation.

Type: Grant

Filed: December 3, 2018

Date of Patent: October 19, 2021

Assignee: International Business Machines Corporation

Inventors: Sampath Dechu, Saravanan Krishnan, Neelamadhav Gantayat, Senthil Kumar Kumarasamy Mani

Encoded-sound determination method

Patent number: 11081120

Abstract: A method for encoded-sound determination performed by a computer includes: executing a first process that includes obtaining information indicating intensities of sound signals, the intensities being calculated from the sound signals and corresponding to frequencies; and executing a second process that includes determining whether or not the sound signals are signals of encoded sound, based on whether or not the intensities of the sound signals in predetermined frequency bands that are adjacent to each other in a frequency direction have a difference that is larger than or equal to a predetermined threshold.

Type: Grant

Filed: March 22, 2019

Date of Patent: August 3, 2021

Assignee: FUJITSU LIMITED

Inventors: Akira Kamano, Masanao Suzuki, Nobuyuki Washio, Yohei Kishi
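
As a rough illustration of the adjacent-band comparison described in the abstract above, the following Python sketch computes per-frequency intensities with a naive DFT and compares the mean power of two neighbouring bands. The function names, the choice of a single boundary frequency (`cutoff_hz`), the band width, the decibel scale, and the threshold are all assumptions made for this example and are not taken from the patent.

```python
import cmath
import math


def band_power_db(samples, sample_rate, lo_hz, hi_hz):
    """Mean power (in dB) of the DFT bins whose frequency lies in [lo_hz, hi_hz)."""
    n = len(samples)
    total, count = 0.0, 0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if lo_hz <= freq < hi_hz:
            # Naive DFT coefficient for bin k; fine for short illustrative frames.
            coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                        for i, x in enumerate(samples))
            total += abs(coeff) ** 2
            count += 1
    mean = total / max(count, 1)
    return 10 * math.log10(mean + 1e-12)  # small floor avoids log10(0)


def looks_encoded(samples, sample_rate, cutoff_hz, band_hz, threshold_db):
    """True if the two bands adjacent at cutoff_hz differ by at least threshold_db."""
    below = band_power_db(samples, sample_rate, cutoff_hz - band_hz, cutoff_hz)
    above = band_power_db(samples, sample_rate, cutoff_hz, cutoff_hz + band_hz)
    return abs(below - above) >= threshold_db
```

A signal with comparable energy on both sides of the boundary yields a small difference, while one whose energy stops abruptly at the boundary yields a large one. Treating such a step as a sign of encoding is the interpretation assumed here.
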
Visual and text pattern matching

Patent number: 11049204

Abstract: A method for automatically detecting block legal billing is described where the technique analyzes each billing entry in the legal bill for visual or textual aspects that indicate that a list of billing items is included in the block. The technique utilizes a combination of textual analysis for punctuation characters, count of the number of verbs, or a search for conjunctions. A visual analysis matches the image of the billing item with a predetermined image of a list. Essentially, a novel natural language processing technique is described that identifies lists in a block of text, where the block of text is in the context of a legal bill.

Type: Grant

Filed: December 7, 2018

Date of Patent: June 29, 2021

Assignee: Bottomline Technologies, Inc.

Inventor: Scot Calitri
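
The textual signals named in the abstract above (punctuation characters, verb count, conjunctions) can be sketched in a few lines of Python. This is only a hedged illustration: the word lists, the separator characters, the function names, and the rule for combining the signals are hypothetical, and a real system would presumably use a part-of-speech tagger rather than a fixed verb list.

```python
import re

# Hypothetical vocabularies, chosen only for this example.
TASK_VERBS = {"reviewed", "drafted", "called", "revised", "researched", "attended"}
CONJUNCTIONS = {"and", "also"}
SEPARATORS = ";,"


def block_billing_signals(entry: str) -> dict:
    """Count the three textual signals in one billing entry."""
    words = re.findall(r"[a-z']+", entry.lower())
    return {
        "separators": sum(entry.count(ch) for ch in SEPARATORS),
        "verbs": sum(word in TASK_VERBS for word in words),
        "conjunctions": sum(word in CONJUNCTIONS for word in words),
    }


def looks_like_block(entry: str, min_verbs: int = 2) -> bool:
    """Assumed rule: several task verbs joined by a separator or a conjunction."""
    signals = block_billing_signals(entry)
    joined = signals["separators"] > 0 or signals["conjunctions"] > 0
    return signals["verbs"] >= min_verbs and joined
```

Under these assumptions, an entry such as "Reviewed contract; drafted memo and called client." is flagged as a block, while "Reviewed the lease agreement." is not.
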
