Patents Examined by Edwin S Leland, III
  • Patent number: 11417344
    Abstract: The information processing method in the present disclosure is performed as below. At least one speech segment is detected from speech input to a speech input unit. A first feature quantity is extracted from each speech segment detected, the first feature quantity identifying a speaker whose voice is contained in the speech segment. The first feature quantity extracted is compared with each of second feature quantities stored in storage and identifying the respective voices of registered speakers who are target speakers in speaker recognition. The comparison is performed for each of consecutive speech segments, and under a predetermined condition, among the second feature quantities stored in the storage, at least one second feature quantity whose similarity with the first feature quantity is less than or equal to a threshold is deleted, thereby removing the at least one registered speaker identified by the at least one second feature quantity.
    Type: Grant
    Filed: October 21, 2019
    Date of Patent: August 16, 2022
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Misaki Doi
  • Patent number: 11386270
    Abstract: A facility for identifying multi-word expressions in a natural language sentence is described. The facility provides the sentence to each of multiple natural language processing modules including a first module, a second module, and a third module. Each natural language processing module uses a different approach to identify a multi-word expression and a type of the multi-word expression. Upon determining that the multiple identifiers of the multi-word expression differ, the facility determines the multi-word expression using a resolution process, which can involve a logical rule set or a machine learning model.
    Type: Grant
    Filed: August 27, 2021
    Date of Patent: July 12, 2022
    Assignee: Unified Compliance Framework (Network Frontiers)
    Inventors: Dorian J. Cougias, Steven Piliero, Dave Dare, Lucian Hontau, Sean Kohler, Michael Wedderburn
  • Patent number: 11380310
    Abstract: Systems and processes for operating a digital assistant are provided. In an example process, low-latency operation of a digital assistant is provided. In this example, natural language processing, task flow processing, dialogue flow processing, speech synthesis, or any combination thereof can be at least partially performed while awaiting detection of a speech end-point condition. Upon detection of a speech end-point condition, results obtained from performing the operations can be presented to the user. In another example, robust operation of a digital assistant is provided. In this example, task flow processing by the digital assistant can include selecting a candidate task flow from a plurality of candidate task flows based on determined task flow scores. The task flow scores can be based on speech recognition confidence scores, intent confidence scores, flow parameter scores, or any combination thereof. The selected candidate task flow is executed and corresponding results presented to the user.
    Type: Grant
    Filed: August 20, 2020
    Date of Patent: July 5, 2022
    Assignee: Apple Inc.
    Inventors: Alejandro Acero, Hepeng Zhang
  • Patent number: 11373038
    Abstract: The present disclosure relates to a method and a terminal for performing word segmentation on text information, and a storage medium. The method includes: acquiring the text information and configuration information, in which the configuration information includes at least two first word segmentation rules; converting the first word segmentation rules into second word segmentation rules according to a predetermined rule; in response to determining that an intersection exists between character strings of the text information matched by two of the second word segmentation rules, determining that two first word segmentation rules corresponding to the two of the second word segmentation rules associated with the intersection conflict; and processing the text information according to the configuration information, and outputting a result of the word segmentation on the text information.
    Type: Grant
    Filed: May 12, 2020
    Date of Patent: June 28, 2022
    Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.
    Inventors: Shuo Wang, Liang Shi, Yupeng Chen, Qun Guo
  • Patent number: 11373657
    Abstract: A system for identifying audio data includes a feature extraction module receiving unknown input audio data and dividing the unknown input audio data into a plurality of segments of unknown input audio data. A similarity module receives the plurality of segments of the unknown input audio data and receives known audio data from a known source, the known audio data being divided into a plurality of segments of known audio data. The similarity module performs comparisons between the segments of unknown input audio data and respective segments of known audio data and generates a respective plurality of similarity values representative of similarity between the segments of the comparisons, the comparisons being performed serially. The similarity module terminates the comparisons if the similarity values indicate insufficient similarity between the segments of the comparisons, prior to completing comparisons for all segments of the unknown input audio data.
    Type: Grant
    Filed: May 1, 2020
    Date of Patent: June 28, 2022
    Assignee: Raytheon Applied Signal Technology, Inc.
    Inventors: Jonathan C. Wintrode, Nicholas J. Hinnerschitz, Aleksandr R. Jouravlev
  • Patent number: 11361167
    Abstract: Embodiments are directed to organizing conversations. Words may be provided from a conversation stream. Each word may be mapped to a graph model based on characteristics of each word. The graph model may be partitioned based on one or more attributes of the nodes and edges included in the graph model such that nodes associated with relationship strength that exceeds a threshold value may be grouped together. Sentence models may be generated based on sentences included in the conversation stream. Combined models may be generated based on the sentence models and the graph such that each sentence model may be associated with one or more partitions of the graph model. A conversation digest may be generated based on the combined model such that the conversation digest identifies one or more dominant portions of the conversation that include key subject matter.
    Type: Grant
    Filed: July 29, 2021
    Date of Patent: June 14, 2022
    Assignee: Rammer Technologies, Inc.
    Inventors: Toshish Arun Jawale, Ansup Babu, Anthony Claudia
  • Patent number: 11355097
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an adaptive audio-generation model. One of the methods includes generating an adaptive audio-generation model including learning a plurality of embedding vectors and parameter values of a neural network using training data comprising first text and audio data representing a plurality of different individual speakers speaking portions of the first text, wherein the plurality of embedding vectors represent respective voice characteristics of the plurality of different individual speakers.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: June 7, 2022
    Assignee: DeepMind Technologies Limited
    Inventors: Yutian Chen, Scott Ellison Reed, Aaron Gerard Antonius van den Oord, Oriol Vinyals, Heiga Zen, Ioannis Alexandros Assael, Brendan Shillingford, Joao Ferdinando Gomes de Freitas
  • Patent number: 11355127
    Abstract: An electronic apparatus and a controlling method thereof are provided. The electronic apparatus includes a communication interface comprising communication circuitry, a memory, and a processor. The processor is configured to control the electronic apparatus to: receive a user voice for controlling an external device connected to the electronic apparatus from a user terminal through the communication interface, perform user authentication by comparing feature information obtained from the user voice with feature information pre-stored in the memory, obtain a control command for controlling the external device by analyzing the user voice based on the user being authenticated, and control the communication interface to transmit the control command to the external device.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: June 7, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Sungjun Lee, Seongwook Chung
  • Patent number: 11347940
    Abstract: Techniques for dialog data collection are disclosed. In an embodiment, a method comprises providing a first graphical user interface (10) configured to receive first user input data, providing a second graphical user interface (20) configured to receive second user input data, asynchronously transmitting the first user input data to the second graphical user interface (20) or the second user input data to the first graphical user interface (10), and generating training data for a natural language processing system model (60) based on the first user input data and the second user input data.
    Type: Grant
    Filed: October 16, 2018
    Date of Patent: May 31, 2022
    Assignee: SOCO, Inc.
    Inventors: Tiancheng Zhao, Kyusong Lee, Yilian Liu
  • Patent number: 11341974
    Abstract: A speech signal is received by a device comprising first and second transducers, and the first transducer comprises a microphone. A method comprises performing a first voice biometric process on speech contained in a first part of a signal received by the microphone, in order to determine whether the speech is the speech of an enrolled user. A first correlation is determined, between said first part of the signal received by the microphone and a corresponding part of the signal received by the second transducer. A second correlation is determined, between a second part of the signal received by the microphone and the corresponding part of the signal received by the second transducer. It is then determined whether the first correlation and the second correlation satisfy a predetermined condition.
    Type: Grant
    Filed: May 21, 2020
    Date of Patent: May 24, 2022
    Assignee: Cirrus Logic, Inc.
    Inventor: John P. Lesso
  • Patent number: 11321530
    Abstract: A method includes obtaining a string of words and determining whether two or more words of the string of words are in a word group. When the two or more words are in the word group, the method further includes retrieving a set of word group identigens for the word group and retrieving sets of word identigens for remaining words of the string of words. The method further includes determining whether a word group identigen of the set of word group identigens and word identigens of the sets of word identigens create an entigen group that is a valid interpretation of the string of words. When the entigen group is the valid interpretation of the string of words, the method further includes outputting the entigen group.
    Type: Grant
    Filed: April 16, 2019
    Date of Patent: May 3, 2022
    Assignee: entigenlogic LLC
    Inventors: Frank John Williams, David Ralph Lazzara, Donald Joseph Wurzel, Paige Kristen Thompson, Stephen Emerson Sundberg, Stephen Chen, Karl Olaf Knutson, Jessy Thomas, David Michael Corns, II, Andrew Chu, Eric Andrew Faurie, Theodore Mazurkiewicz, Gary W. Grube
  • Patent number: 11322156
    Abstract: With recent real-world applications of speaker and speech recognition systems, robust features for degraded speech have become a necessity. In general, degraded speech results in poor performance of any speech-based system. This poor performance can be attributed to the feature extraction functionality of the speech-based system, which takes an input speech file and converts it into a representation called a feature. Embodiments of the present disclosure provide systems and methods that compute the distance between each degraded speech feature extracted from an input speech signal and each clean speech feature stored in a memory of the system to obtain a set of matched clean speech features, wherein at least a subset of the clean speech features is dynamically selected based on a pre-defined threshold and the computed distance, thereby computing statistics for the dynamically selected clean speech feature set for use in at least one of a speech recognition system and a speaker recognition system.
    Type: Grant
    Filed: December 26, 2019
    Date of Patent: May 3, 2022
    Assignee: Tata Consultancy Services Limited
    Inventors: Ashish Panda, Sunilkumar Kopparapu, Sonal Sunil Joshi
  • Patent number: 11315575
    Abstract: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.
    Type: Grant
    Filed: October 13, 2020
    Date of Patent: April 26, 2022
    Assignee: GOOGLE LLC
    Inventors: Matthew Sharifi, Victor Carbune
  • Patent number: 11315574
    Abstract: A mobile device, a system and a method for task management based on voice intercom function are provided. A mobile device receives a voice message associated with at least one task. Semantic information of the voice message is analyzed to determine at least one message receiver of the voice message and generate a task message. Another mobile device corresponding to one of the at least one message receiver receives the task message. Task management information associated with the at least one task is updated according to the semantic information of the voice message.
    Type: Grant
    Filed: July 9, 2020
    Date of Patent: April 26, 2022
    Assignee: Wistron Corporation
    Inventors: Hui Chi Hsieh, Yu-Chen Yeh
  • Patent number: 11314938
    Abstract: A method and system of automatically interpreting documents relating to regulatory directives to automatically identify actionable items and assign each of the actionable items identified to the appropriate responsible party in a business.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: April 26, 2022
    Assignee: Accenture Global Solutions Limited
    Inventors: Prashant Wason, Sridhar Kapa, Saikat Jana, Sagar Sanjeev
  • Patent number: 11308287
    Abstract: A computing device receives a real-time chat discourse. The computing device analyzes the real-time chat discourse by consecutively applying a topic analysis technique, a corpus linguistics technique and a cosine similarity technique. The computing device derives a discourse decision forking component (DDFC) based on comparing the analyzed real-time chat discourse to a similarity threshold value and determines one or more discourse forks using the DDFC.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: April 19, 2022
    Assignee: International Business Machines Corporation
    Inventors: Nadiya Kochura, Jonathan D. Dunne, Fang Lu
  • Patent number: 11295747
    Abstract: A method and a voice processor that includes (i) an input that is configured to receive audio signals that represent audio, (ii) a wake word detection circuit, (iii) a first buffer that is configured to store at least wake word signals and prebuffer signals, and (iv) a communication module that is configured to (a) output, over an interrupt port, an interrupt request to an application processor, following a detection of the wake word signals, (b) following an acceptance of the application processor to receive content, access the first buffer and retrieve the prebuffer signals and the wake word signals; and (c) output the content, over an I2S port, to the application processor. The content includes the wake word signals, the prebuffer signals, and query or command signals.
    Type: Grant
    Filed: March 6, 2019
    Date of Patent: April 5, 2022
    Assignee: DSP GROUP LTD.
    Inventor: Avi Keren
  • Patent number: 11289100
    Abstract: Techniques are described herein for dialog-based enrollment of individual users for single- and/or multi-modal recognition by an automated assistant, as well as determining how to respond to a particular user's request based on the particular user being enrolled and/or recognized. Rather than requiring operation of a graphical user interface for individual enrollment, dialog-based enrollment enables users to enroll themselves (or others) by way of a human-to-computer dialog with the automated assistant.
    Type: Grant
    Filed: October 17, 2018
    Date of Patent: March 29, 2022
    Assignee: GOOGLE LLC
    Inventor: Diego Melendo Casado
  • Patent number: 11281865
    Abstract: The present disclosure provides methods and systems for generating linguistic rules. The system may comprise: an electronic display with a graphical user interface comprising: (i) one or more interactive elements for receiving a user input indicating one or more edits to a rule, and (ii) a result visualization region for dynamically displaying a result of the rule in response to receiving the one or more edits, wherein the result of the rule comprises an indicator indicating the validity of the rule; and one or more computer processors that are programmed to: (i) generate the result of the rule by processing the rule with the one or more edits against a set of examples; and (ii) configure the graphical user interface to display the result in a user-selected format.
    Type: Grant
    Filed: July 10, 2020
    Date of Patent: March 22, 2022
    Inventor: Michael Dudley Johnson
  • Patent number: 11281859
    Abstract: For determining structure from a language block, a processor determines phrase tags from phrase vectors for phrases of a language block. The phrase tags specify a phrase function. The processor further determines structure tags for the phrases from the language block.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: March 22, 2022
    Assignee: Lenovo (Singapore) PTE. LTD.
    Inventors: Song Wang, Roderick Echols, Ryan Charles Knudson, John Weldon Nicholson, Ming Qian
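
As an illustration of one technique named in the list above, the serial segment comparison with early termination described in the abstract of patent 11373657 might be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the abstract does not say how similarity is measured, so cosine similarity over per-segment feature vectors is assumed here, and the names `cosine_similarity`, `compare_serially`, and `min_similarity` are hypothetical.

```python
from math import sqrt
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two feature vectors (assumed metric, not from the abstract)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def compare_serially(
    unknown_segments: Sequence[Sequence[float]],
    known_segments: Sequence[Sequence[float]],
    min_similarity: float = 0.5,  # hypothetical threshold
) -> Tuple[List[float], bool]:
    """Compare segment pairs one at a time and stop at the first insufficient score.

    Returns the similarity values computed so far and a flag that is True only
    if every compared pair met the threshold.
    """
    scores: List[float] = []
    for unknown, known in zip(unknown_segments, known_segments):
        score = cosine_similarity(unknown, known)
        scores.append(score)
        if score < min_similarity:
            # Terminate before the remaining segments are compared.
            return scores, False
    return scores, True


if __name__ == "__main__":
    unknown = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    known = [[1.0, 0.0], [1.0, 0.0], [1.0, 1.0]]
    scores, matched = compare_serially(unknown, known)
    print(len(scores), matched)
```

In the usage example the second pair of segments is orthogonal, so the loop stops after two comparisons and the third pair is never evaluated, which is the saving the abstract attributes to terminating early.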