Patents Examined by Nicole A K Schmieder
-
Patent number: 11308934Abstract: A method includes receiving text input data for conversion into synthesized speech and determining, using a hotword-aware model trained to detect a presence of a hotword assigned to a user device, whether a pronunciation of the text input data includes the hotword. The hotword is configured to initiate a wake-up process on the user device for processing the hotword and/or one or more other terms following the hotword in the audio input data. When the pronunciation of the text input data includes the hotword, the method also includes generating an audio output signal from the text input data and providing the audio output signal to an audio output device to output the audio output signal. The audio output signal when captured by an audio capture device of the user device, configured to prevent initiation of the wake-up process on the user device.Type: GrantFiled: June 25, 2018Date of Patent: April 19, 2022Assignee: Google LLCInventors: Matthew Sharifi, Aleksandar Kracun
-
Patent number: 11282517Abstract: An information processing apparatus includes an acquisition unit configured to acquire acceleration of a vehicle, a controller configured to execute a speech dialogue with a driver of the vehicle, an input unit configured to receive a speech input by the driver, and an output unit configured to execute a speech output to the driver. The controller dynamically controls a response time in the speech dialogue with the driver based on the acceleration of the vehicle.Type: GrantFiled: April 2, 2019Date of Patent: March 22, 2022Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHAInventors: Ko Koga, Hideo Hasegawa
-
Patent number: 11227610Abstract: This disclosure describes computer-based techniques for administering a spoken patterned passphrase. A passphrase processing unit running on an administrator computer generates passphrase data for a secure system using acoustic data and video data representing a spoken phrase by a speaker. This passphrase includes a pattern of words or speech segments that are audible and words or speech segments that are inaudible. During authentication, a passphrase administration unit on the administrator computer receives acoustic and visual data of a spoken phrase by a person attempting to access the secure system and evaluates whether the spoken phrase includes the pattern of audible and inaudible words or speech segments associated with the account. In this way, the techniques discussed herein may enable the administrator computer to administer spoken passphrases with an additional degree of protection than a system that is limited to using linguistic or biometric content in passwords or passphrases.Type: GrantFiled: April 16, 2019Date of Patent: January 18, 2022Assignee: Wells Fargo Bank, P.A.Inventors: Kristine Ing Kushner, John T. Wright
-
Patent number: 11227593Abstract: Systems and methods are described herein for disambiguating a voice search query by determining whether the user made a gesture while speaking a quotation from a content item and whether the user mimicked or approximated a gesture made by a character in the content item when the character spoke the words quoted by the user. If so, a search result comprising an identifier of the content item is generated. A search result representing the content item from which the quotation comes may be ranked highest among other search results returned and therefore presented first in a list of search results. If the user did not mimic or approximate a gesture made by a character in the content item when the quotation is spoken in the content item, then a search result may not be generated for the content item or may be ranked lowest among other search results.Type: GrantFiled: June 28, 2019Date of Patent: January 18, 2022Assignee: ROVI GUIDES, INC.Inventors: Ankur Aher, Nishchit Mahajan, Narendra Purushothama, Sai Durga Venkat Reddy Pulikunta
-
Patent number: 11211080Abstract: Techniques are described for detecting a conversation between at least two people, and for reducing noise during the conversation. In certain embodiments, at least one speech metric is generated based on spectral analysis of an audio signal and is used to determine that the audio signal represents speech from a first person. Responsive to determining that the speech is part of a conversation between the first person and a second person an operating state of a device in a physical environment is adjusted such that a volume level of sound contributed by or associated with the device is reduced. The sound contributed by or associated with the device corresponds to noise, at least for the duration of the conversation. Therefore, reducing the volume level of sound contributed by or associated with the device reduces the overall noise level in the environment, resulting in a reduction in conversational effort.Type: GrantFiled: December 18, 2019Date of Patent: December 28, 2021Inventors: Brandon Hook, Daniel Soberal
-
Patent number: 11205048Abstract: Systems, computer-implemented methods, and computer program products that can facilitate word entity disambiguation are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a language model component that employs an artificial intelligence model to generate a profile vector of an entity based on one or more binary values representing profile data of the entity and a word vector of a word entity in a dialogue based on one or more second word entities adjacent to the word entity in the dialogue. The computer executable components can further comprise a dialogue management component that disambiguates the word entity based on the profile vector and the word vector.Type: GrantFiled: June 18, 2019Date of Patent: December 21, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Sunhwan Lee, Shun Jiang, Chung-hao Tan, Lei Huang, Pawan Chowdhary
-
Patent number: 11194962Abstract: Methods and apparatuses are described in which unstructured computer text is analyzed for identification and classification of complaint-specific user interactions. A data store receives unstructured computer text corresponding to current user interactions. A server filters the unstructured computer text to identify messages that comprise a potential complaint.Type: GrantFiled: June 5, 2019Date of Patent: December 7, 2021Assignee: FMR LLCInventors: Indraneel Biswas, Nicholas Wilcox
-
Patent number: 11176960Abstract: A system for distinguishing between a human voice generated command and an electronic speaker generated command is provided. An exemplary system comprises a microphone array for receiving an audio signal collection, preprocessing circuitry configured for converting the audio signal collection into processed recorded audio signals, energy balance metric determination circuitry configured for calculating a final energy balance metric based on the processed recorded audio signals, and energy balance metric evaluation circuitry for outputting a command originator signal based at least in part on the final energy balance metric.Type: GrantFiled: June 18, 2019Date of Patent: November 16, 2021Assignee: University of Florida Research Foundation, IncorporatedInventors: Patrick G. Traynor, Logan E. Blue, Luis Vargas
-
Patent number: 11164592Abstract: A system that performs automatic gain control (AGC) using different decay rates. The system may select a slow decay rate to track a loudness level within speech (e.g., within an utterance), improving audio quality and maintaining dynamic range for an individual voice, while selecting a fast decay rate to track the loudness level after a gap of silence (e.g., no voice activity detected for a duration of time) or during large level changes (e.g., actual speech loudness is lower than estimated speech loudness for a duration of time). This improves an accuracy of the loudness estimate and therefore a responsiveness of the automatic gain control, resulting in an improved user experience.Type: GrantFiled: May 9, 2019Date of Patent: November 2, 2021Assignee: Amazon Technologies, Inc.Inventors: Biqing Wu, Phil Hetherington, Carlo Murgia, Rong Hu
-
Patent number: 11151323Abstract: Methods, systems and computer program products for natural language context embedding are provided herein. A computer-implemented method includes extracting a document anatomy and document elements from a given structured document, identifying semantic references in the given structured document, and generating an ontology comprising (i) a hierarchy of concepts and (ii) relations connecting the concepts, each concept comprising attributes for a document element. The computer-implemented method also includes generating natural language text context for a given document element by utilizing the ontology to combine (i) attributes of a given concept corresponding to the given document element with (ii) attributes of another concept, the other concept corresponding to another document element, the other concept being connected to the given concept by at least one relation.Type: GrantFiled: December 3, 2018Date of Patent: October 19, 2021Assignee: International Business Machines CorporationInventors: Sampath Dechu, Saravanan Krishnan, Neelamadhav Gantayat, Senthil Kumar Kumarasamy Mani
-
Patent number: 11081120Abstract: A method for encoded-sound determination performed by a computer includes: executing a first process that includes obtaining information indicating intensities of sound signals, the frequencies being calculated from the sound signals and corresponding to frequencies; and executing a second process that includes determining whether or not the sound signals are signals of encoded sound, based on whether or not the intensities of the sound signals in predetermined frequency bands that are adjacent to each other in a frequency direction have a difference that is larger than or equal to a predetermined threshold.Type: GrantFiled: March 22, 2019Date of Patent: August 3, 2021Assignee: FUJITSU LIMITEDInventors: Akira Kamano, Masanao Suzuki, Nobuyuki Washio, Yohei Kishi
-
Patent number: 11049204Abstract: A method for automatically detecting block legal billing is described where the technique analyzes each billing entry in the legal bill for visual or textual aspects that indicate that a list of billing items is included in the block. The technique utilizes a combination of textual analysis for punctuation characters, count of the number of verbs, or a search for conjunctions. A visual analysis is match the image of the billing item with a predetermined image of a list. Essentially, a novel natural language processing technique is described that identifies lists in a block of text, where the block of text is in the context of a legal bill.Type: GrantFiled: December 7, 2018Date of Patent: June 29, 2021Assignee: Bottomline Technologies, Inc.Inventor: Scot Calitri