Patents Examined by Andrew C Flanders

Exploring heterogeneous characteristics of layers in ASR models for more efficient training

Patent number: 12367865

Abstract: A computer-implemented method includes obtaining a multi-domain (MD) dataset and training a neural network model using the MD dataset with short-form data withheld (MD-SF). The neural network model includes a plurality of layers each having a plurality of parameters. The method also includes resetting each respective layer in the trained neural network model one at a time. For each respective layer in the trained neural network model, and after resetting the respective layer, the method also includes determining a corresponding word error rate of the trained neural network model and identifying the respective layer as corresponding to an ambient layer when the corresponding word error rate satisfies a word error rate threshold. The method also includes transmitting an on-device neural network model to execute on one or more client devices for generating gradients based on the withheld domain (SF) of the MD dataset.

Type: Grant

Filed: October 4, 2022

Date of Patent: July 22, 2025

Assignee: Google LLC

Inventors: Dhruv Guliani, Lillian Zhou, Andreas Kabel, Giovanni Motta, Francoise Beaufays
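
The layer-by-layer reset test described in the abstract for patent 12367865 can be sketched roughly as follows. This is an illustration only, not the patented method: the function name `find_ambient_layers`, the dictionary representation of parameters, the choice to reset a layer to its initial values, and the reading of "satisfies a threshold" as "at or below the threshold" are all assumptions that the abstract does not specify.

```python
import copy
from typing import Callable, Dict, List

Params = Dict[str, List[float]]  # hypothetical: layer name -> flat parameter list


def find_ambient_layers(
    trained_params: Params,
    initial_params: Params,
    evaluate_wer: Callable[[Params], float],
    wer_threshold: float,
) -> List[str]:
    """Return the layers whose reset leaves the word error rate within the threshold."""
    ambient = []
    for name in trained_params:
        # Reset exactly one layer on a copy; every other layer keeps its trained values.
        probe = copy.deepcopy(trained_params)
        probe[name] = copy.deepcopy(initial_params[name])
        # Assumption: "satisfies" means the WER stays at or below the threshold.
        if evaluate_wer(probe) <= wer_threshold:
            ambient.append(name)
    return ambient
```

Here `evaluate_wer` stands in for whatever evaluation routine computes the word error rate on held-out data; it is passed in so the sketch stays independent of any particular ASR toolkit.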
Adaptive knowledge graph representation learning method for integrating graph structure with text information

Patent number: 12229515

Abstract: The present invention discloses an adaptive knowledge graph representation learning method for integrating a graph structure with text information, including: (1) sampling a neighbor triple of each of a head entity and a tail entity in a target triple; (2) calculating semantic representations of the target triple, and neighbor triples of its head and tail entities; (3) calculating structure representations of the head and tail entities of the target triple; (4) splicing the semantic representation of the target triple with the structure representations of its head and tail entities, inputting a spliced result into an adaptive classification layer, and calculating a classification result and a classification loss; and (5) optimizing the foregoing module based on an optimization algorithm of gradient descent, until a loss value converges, to obtain a final spliced result between the semantic representation of the target triple and the structure representations of its head and tail entities.

Type: Grant

Filed: December 3, 2021

Date of Patent: February 18, 2025

Assignee: ZHEJIANG UNIVERSITY

Inventors: Huajun Chen, Yushan Zhu, Wen Zhang
Using a natural language interface to correlate user intent with predefined data analysis templates for selected data sources

Patent number: 12141525

Abstract: A computing device receives, in a graphical user interface, a first natural language query. The computing device parses the first natural language query, thereby identifying one or more keywords in the query. The computing device identifies one or more data sources that are relevant to the query. The computing device also identifies one or more data fields and/or data values from the data sources in the query. The computing device compares the keywords to respective trigger text for each of a plurality of data analysis templates, thereby selecting a first data analysis template. The first data analysis template includes a plurality of predefined data visualizations. The computing device generates a dashboard that includes the data visualizations using the identified data fields and/or data values, and displays the dashboard in the graphical user interface.

Type: Grant

Filed: September 13, 2021

Date of Patent: November 12, 2024

Assignee: Tableau Software, LLC

Inventors: Ryan Andrew Atallah, Yukiko Ishida Añonuevo
Clause extraction using machine translation and natural language processing

Patent number: 12086559

Abstract: A computer system extracts clauses using machine translation. An input sentence in a source language is translated into a translated sentence in a target language using a trained machine translation model, wherein the trained machine translation model inserts a grammatical indicator into a position of the translated sentence that identifies a dependent clause. The input sentence and the translated sentence are aligned to determine a position in the input sentence that corresponds to the position of the grammatical indicator in the translated sentence. The dependent clause is extracted, in the source language, from the input sentence based on the determined position in the input sentence. Embodiments of the present invention further include a method and program product for clause extraction using machine translation in substantially the same manner described above.

Type: Grant

Filed: March 31, 2021

Date of Patent: September 10, 2024

Assignee: International Business Machines Corporation

Inventors: Vadim Sheinin, Octavian Popescu, Ngoc Phuoc An Vo, Irene Lizeth Manotas Gutiérrez
Electronic apparatus, system comprising electronic apparatus and server and controlling method thereof

Patent number: 12080289

Abstract: Disclosed is an electronic apparatus. The electronic apparatus includes: a communication interface, a memory, and a processor connected to the memory and the communication interface, the processor configured to control the electronic apparatus to, based on receiving a speech related to a function of the electronic apparatus, obtain text information corresponding to the received speech, control the communication interface to transmit the obtained text information to a server including a first neural network model corresponding to the function, and execute the function based on response information received from the server. Based on identifying that an update period of the first neural network model is greater than or equal to a first threshold period based on the information related to the function of the electronic apparatus, the electronic apparatus may receive the information about the first neural network model from the server and store the information in the memory.

Type: Grant

Filed: September 27, 2021

Date of Patent: September 3, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Hyeonmok Ko, Dayoung Kwon, Jonggu Kim, Seoha Song, Kyenghun Lee, Hojung Lee, Saebom Jang, Pureum Jung, Changho Paeon, Jiyeon Hong
System and method for data augmentation of feature-based voice data

Patent number: 12073818

Abstract: A method, computer program product, and computing system for receiving feature-based voice data. One or more data augmentation characteristics may be received. One or more augmentations of the feature-based voice data may be generated, via a machine learning model, based upon, at least in part, the feature-based voice data and the one or more data augmentation characteristics.

Type: Grant

Filed: March 10, 2021

Date of Patent: August 27, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Dushyant Sharma, Patrick A. Naylor, James W. Fosburgh, Do Yeong Kim
Non-lexicalized features for language identity classification using subword tokenization

Patent number: 12061872

Abstract: A natural language identity classifier system is described, which employs a supervised machine learning (ML) model to perform language identity classification on input text. The ML model takes, as input, non-lexicalized features of target text derived from subword tokenization of the text. Specifically, these non-lexicalized features are generated based on statistics determined for tokens identified for the input text. According to an embodiment, at least some of the non-lexicalized features are based on natural language-specific summary statistics that indicate how often tokens were found within a corpus for each natural language. Use of such summary statistics allows for generation of natural language specific conditional probability-based features.

Type: Grant

Filed: April 22, 2021

Date of Patent: August 13, 2024

Assignee: Oracle International Corporation

Inventor: Philip Ogren
Automated script generation and audio-visual presentations

Patent number: 12032922

Abstract: Intelligent content is automatically generated using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving an input document and parsing the input document to generate inputs for a natural language generation model using a text analysis model. The natural language generation model generates one or more candidate presentation scripts based on the inputs. A presentation script is selected from the candidate presentation scripts and displayed. A text-to-speech model may be used to generate a synthesized audio presentation of the presentation script. A final presentation may be generated that includes a visual display of the input document and the corresponding audio presentation in sync with the visual display.

Type: Grant

Filed: May 12, 2021

Date of Patent: July 9, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ji Li, Konstantin Seleskerov, Huey-Ru Tsai, Muin Barkatali Momin, Ramya Tridandapani, Sindhu Vigasini Jambunathan, Amit Srivastava, Derek Martin Johnson, Gencheng Wu, Sheng Zhao, Xinfeng Chen, Bohan Li
Information processing apparatus and information processing method

Patent number: 11996093

Abstract: An information processing apparatus and an information processing method are provided that enable suitable determination of sensing results used in estimating a user state. The information processing apparatus is provided with a determination unit that determines, on the basis of a predetermined reference, one or more second sensing results used in estimating the user state from among a plurality of first sensing results received from a plurality of devices. The information processing apparatus is further provided with an output control unit that controls an output of information on the basis of the one or more second sensing results.

Type: Grant

Filed: July 12, 2018

Date of Patent: May 28, 2024

Inventors: Shinichi Kawano, Hiro Iwase, Mari Saito, Yuhei Taki
Encoding a job posting as an embedding using a graph neural network

Patent number: 11861295

Abstract: Described herein are techniques for using a graph neural network to encode online job postings as embeddings. First, an input graph is defined by processing one or more rules to discover edges that connect nodes in an input graph, where the nodes of the input graph represent job postings or standardized job attributes, and the edges are determined based on analyzing a log of user activity directed to online job postings. Next, a graph neural network (GNN) is trained based on an edge prediction task. Finally, once trained, the GNN is used to derive node embeddings for the nodes (e.g., job postings) of the input graph, and in some instances, new online job postings not represented in the original input graph.

Type: Grant

Filed: October 26, 2021

Date of Patent: January 2, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shan Li, Baoxu Shi, Jaewon Yang
Mitigating voice frequency loss

Patent number: 11854572

Abstract: Computer-implemented methods, computer program products, and computer systems for mitigating frequency loss may include one or more processors configured for receiving first audio data corresponding to unobstructed user utterances, receiving second audio data corresponding to first obstructed user utterances, generating a frequency loss (FL) model representing frequency loss between the first audio data and the second audio data, receiving third audio data corresponding to one or more second obstructed user utterances, processing the third audio data using the FL model to generate fourth audio data corresponding to a frequency loss mitigated version of the second obstructed user utterances, and transmitting the fourth audio data to a recipient computing device. The first obstructed user utterances are obstructed by a facemask and the one or more second obstructed user utterances are obstructed by the facemask. The FL model may be executed as an audio plugin in a web conferencing program.

Type: Grant

Filed: May 18, 2021

Date of Patent: December 26, 2023

Assignee: International Business Machines Corporation

Inventors: Mary D. Swift, Irene Lizeth Manotas Gutiérrez, Kelley Anders, Jonathan D. Dunne
Method and device for detecting audio signal, and storage medium

Patent number: 11848029

Abstract: A method for detecting an audio signal comprises: obtaining a speech segment and a non-speech segment of an audio signal to be detected, extracting a first audio feature of the speech segment and a second audio feature of the non-speech segment, detecting the first audio feature using a predetermined speech segment detection model to obtain a first detection score, detecting the second audio feature using a predetermined non-speech segment detection model to obtain a second detection score, and determining whether the audio signal belongs to a target audio based on the first detection score and the second detection score.

Type: Grant

Filed: May 21, 2021

Date of Patent: December 19, 2023

Assignee: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.

Inventors: Yifeng Wang, Guodu Cai, Shuo Yang, Lihan Li, Peng Gao
Unambiguous phonics system

Patent number: 11842718

Abstract: An unambiguous phonics system (UPS) is capable of presenting text in a format with unambiguous pronunciation. The system can translate input text written in a given language (e.g., English) into a UPS representation of the text written in a UPS alphabet. A unique UPS grapheme can be used to represent each unique grapheme-phoneme combination in the input text. Thus, each letter of the input text is represented in the UPS spelling and each letter of the UPS spelling unambiguously indicates the phoneme used. For all the various grapheme-phoneme combinations for a given input grapheme, the corresponding UPS graphemes can be constructed to have visual similarity with the given input grapheme, thus easing an eventual transition from UPS spelling to traditional spelling. The UPS can include translation, complexity scoring, word/phoneme-grapheme searching, and other modules. The UPS can also include techniques to provide efficient, level-based training of the UPS alphabet.

Type: Grant

Filed: December 10, 2020

Date of Patent: December 12, 2023

Assignee: TINYIVY, INC.

Inventor: Zachary Silverzweig
Frontend capture with input stage, suppression module, and output stage

Patent number: 11837254

Abstract: Disclosed are systems and methods for a frontend capture module of a video conferencing application, which can modify an input signal received from a microphone device to match predetermined signal characteristics, such as voice signal level and expected noise floor. An input stage, a suppression module, and an output stage amplify the voice signal portion of the input signal and suppress the noise signal of the input signal to predetermined ranges. The input stage selectively applies gains defined by a gain table, based on signal level of the input signal. The suppression module selectively applies a suppression gain to the input signal based on presence or absence of voice signal in the input signal. The output stage further amplifies the input signal in portions having a voice signal and applies a gain table to maintain a consistent noise floor.

Type: Grant

Filed: October 15, 2021

Date of Patent: December 5, 2023

Assignee: Zoom Video Communications, Inc.

Inventor: Yu Rao
Detecting near matches to a hotword or phrase

Patent number: 11830486

Abstract: Techniques are described herein for identifying a failed hotword attempt. A method includes: receiving first audio data; processing the first audio data to generate a first predicted output; determining that the first predicted output satisfies a secondary threshold but does not satisfy a primary threshold; receiving second audio data; processing the second audio data to generate a second predicted output; determining that the second predicted output satisfies the secondary threshold but does not satisfy the primary threshold; in response to the first predicted output and the second predicted output satisfying the secondary threshold but not satisfying the primary threshold, and in response to the first spoken utterance and the second spoken utterance satisfying one or more temporal criteria relative to one another, identifying a failed hotword attempt; and in response to identifying the failed hotword attempt, providing a hint that is responsive to the failed hotword attempt.

Type: Grant

Filed: October 27, 2020

Date of Patent: November 28, 2023

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
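
The two-threshold check described in the abstract for patent 11830486 can be sketched as follows. This is a minimal illustration under stated assumptions: the threshold values, the `MAX_GAP_SECONDS` temporal criterion, and the names `Attempt`, `is_near_miss`, and `is_failed_hotword_attempt` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass

# Hypothetical values; the abstract gives no numbers.
PRIMARY_THRESHOLD = 0.85    # score needed to accept the hotword
SECONDARY_THRESHOLD = 0.55  # score that counts as a near match
MAX_GAP_SECONDS = 5.0       # assumed temporal criterion between utterances


@dataclass
class Attempt:
    score: float      # predicted output for one utterance
    timestamp: float  # seconds since an arbitrary reference time


def is_near_miss(score: float) -> bool:
    """True when the score satisfies the secondary threshold but not the primary one."""
    return SECONDARY_THRESHOLD <= score < PRIMARY_THRESHOLD


def is_failed_hotword_attempt(first: Attempt, second: Attempt) -> bool:
    """Two near misses that fall close together in time are treated as a failed attempt."""
    gap = second.timestamp - first.timestamp
    close_in_time = 0.0 <= gap <= MAX_GAP_SECONDS
    return is_near_miss(first.score) and is_near_miss(second.score) and close_in_time
```

A caller would show the hint only when `is_failed_hotword_attempt` returns `True`; a single near miss, or two near misses far apart in time, would not trigger it in this sketch.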
Sensitivity mode for an audio spotting system

Patent number: 11823707

Abstract: An audio spotting system configured for various operating modes including a regular mode and sensitivity mode is described. An example cascade audio spotting system may include a high-power subsystem including a high-power trigger and a transfer module. This high-power trigger includes one or more detection models used to detect whether a target sound activity is included in the one or more audio streams. The one or more detection models are associated with a first set of hyperparameters when the cascade audio spotting system is in a regular mode, and the one or more detection models are associated with a second set of hyperparameters when the cascade audio spotting system is in a sensitivity mode. The transfer module provides at least one of one or more processed audio streams for further processing in response to the high-power trigger detecting the target sound activity in the one or more audio streams.

Type: Grant

Filed: January 10, 2022

Date of Patent: November 21, 2023

Assignee: Synaptics Incorporated

Inventor: Saeed Mosayyebpour Kaskari
Determining order preferences and item suggestions

Patent number: 11810550

Abstract: A computer system may connect to various customer-facing devices and manage or automate the order process between a retail store and the customer. The computer system may perform the dialogue and receive an order for items from the retail store and may perform quality control monitoring of the dialogue between customers and employees taking orders. The ordering system may utilize the ordered items in combination with various contextual cues to determine a customer identity which may then be linked to past orders and/or various order preferences. Based on the determined customer identity, the system may provide recommendations of additional order items or order alterations to the customer before personally identifying information has been collected from the customer. The determination of the customer identity and the determination of recommendations may be performed by machine learning algorithms that were trained on customer data and the retail store products.

Type: Grant

Filed: February 24, 2021

Date of Patent: November 7, 2023

Inventors: Vinay Kumar Shukla, Rahul Aggarwal, Pranav Nirmal Mehra, Vrajesh Navinchandra Sejpal, Akshay Labh Kayastha, Yuganeshan A J
Apparatus and audio signal processor, for providing processed audio signal representation, audio decoder, audio encoder, methods and computer programs

Patent number: 11804229

Abstract: An apparatus for providing a processed audio signal representation on the basis of an input audio signal representation is configured to apply an un-windowing in order to provide the processed audio signal representation on the basis of the input audio signal representation. The apparatus is configured to adapt the un-windowing in dependence on one or more signal characteristics and/or in dependence on one or more processing parameters used for a provision of the input audio signal representation.

Type: Grant

Filed: May 5, 2021

Date of Patent: October 31, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Bayer, Pallavi Maben, Emmanuel Ravelli, Guillaume Fuchs, Eleni Fotopoulou, Markus Multrus
Generating speech in the voice of a player of a video game

Patent number: 11790884

Abstract: A computer-implemented method of generating speech audio in a video game is provided. The method includes inputting, into a synthesizer module, input data that represents speech content. Source acoustic features for the speech content in the voice of a source speaker are generated and are input, along with a speaker embedding associated with a player of the video game into an acoustic feature encoder of a voice convertor. One or more acoustic feature encodings are generated as output of the acoustic feature encoder, which are inputted into an acoustic feature decoder of the voice convertor to generate target acoustic features. The target acoustic features are processed with one or more modules, to generate speech audio in the voice of the player.

Type: Grant

Filed: October 28, 2020

Date of Patent: October 17, 2023

Assignee: ELECTRONIC ARTS INC.

Inventors: Zahra Shakeri, Jervis Pinto, Kilol Gupta, Mohsen Sardari, Harold Chaput, Navid Aghdaie, Kenneth Moss
Voice processing method based on artificial intelligence

Patent number: 11790893

Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors extracted from first and second utterances, that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.

Type: Grant

Filed: September 30, 2020

Date of Patent: October 17, 2023

Assignee: LG ELECTRONICS INC.

Inventors: Kwangyong Lee, Hyun Yu, Byeongha Kim, Yejin Kim
