Patents Examined by Fariba Sirjani

Multi-step linear interpolation of language models

Patent number: 11610581

Abstract: A computer-implemented method is provided for generating a language model for an application. The method includes estimating interpolation weights of each of a plurality of language models according to an Expectation Maximization (EM) algorithm based on a first metric. The method further includes classifying the plurality of language models into two or more sets based on characteristics of the two or more sets. The method also includes estimating a hyper interpolation weight for the two or more sets based on a second metric specific to the application. The method additionally includes interpolating the plurality of language models using the interpolation weights and the hyper interpolation weight to generate a final language model.

Type: Grant

Filed: February 5, 2021

Date of Patent: March 21, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Nobuyasu Itoh, Masayuki Suzuki, Gakuto Kurata
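
The two-stage weighting described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function names `em_weights` and `interpolate`, the held-out probability tables, and the way the hyper weights combine the per-set mixtures are all assumptions made for illustration. The abstract does not say how the hyper interpolation weight is tuned, so it is passed in here as a given value.

```python
def em_weights(probs, iters=50):
    """Estimate mixture weights by EM (illustrative sketch).

    probs[k][n] is the probability that model k assigns to held-out
    token n; all values are assumed to be strictly positive.
    """
    k = len(probs)
    n = len(probs[0])
    w = [1.0 / k] * k  # start from uniform weights
    for _ in range(iters):
        totals = [0.0] * k
        for t in range(n):
            # E-step: share of token t explained by each model
            mix = sum(w[j] * probs[j][t] for j in range(k))
            for j in range(k):
                totals[j] += w[j] * probs[j][t] / mix
        # M-step: new weight is the average responsibility
        w = [c / n for c in totals]
    return w


def interpolate(set_probs, set_weights, hyper):
    """Combine per-set mixtures for one token (assumed combination rule).

    set_probs[s][k]   : probability from model k of set s
    set_weights[s][k] : interpolation weight of that model within set s
    hyper[s]          : hyper interpolation weight of set s
    """
    return sum(
        h * sum(w * p for w, p in zip(ws, ps))
        for h, ws, ps in zip(hyper, set_weights, set_probs)
    )
```

In this sketch, `em_weights` would be run on held-out text, and `hyper` would be chosen separately against whatever application-specific metric is available (for example by a small grid search), which is outside the scope of the example.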
Privacy device for smart speakers

Patent number: 11606658

Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.

Type: Grant

Filed: February 10, 2020

Date of Patent: March 14, 2023

Inventor: Thomas Stachura
Privacy device for smart speakers

Patent number: 11606657

Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.

Type: Grant

Filed: February 10, 2020

Date of Patent: March 14, 2023

Inventor: Thomas Stachura
Airport noise classification method and system

Patent number: 11593610

Abstract: An aircraft noise monitoring system uses a set of geographically distributed noise sensors to receive data corresponding to events captured by the noise sensors. Each event corresponds to noise that exceeds a threshold level. For each event, the system will receive a classification of the event as an aircraft noise event or a non-aircraft noise event. It will then use the data corresponding to the events and the received classifications to train a convolutional neural network (CNN) in a classification process. After training, when the system receives a new noise event, it will use the CNN to classify the new noise event as an aircraft noise event or a non-aircraft noise event, and it will generate an output indicating whether the new noise event is an aircraft noise event or a non-aircraft noise event.

Type: Grant

Filed: April 17, 2019

Date of Patent: February 28, 2023

Assignee: METROPOLITAN AIRPORTS COMMISSION

Inventors: Derek Anderson, Matthew Baker, Nicholas Heller, Bradley Juffer
Automatic synthesis of translated speech using speaker-specific phonemes

Patent number: 11594226

Abstract: An embodiment includes converting an original audio signal to an original text string, the original audio signal being from a recording of the original text string spoken by a specific person in a source language. The embodiment generates a translated text string by translating the original text string from the source language to a target language, including translation of a word from the source language to a target language. The embodiment assembles a standard phoneme sequence from a set of standard phonemes, where the standard phoneme sequence includes a standard pronunciation of the translated word. The embodiment also associates a custom phoneme with a standard phoneme of the standard phoneme sequence, where the custom phoneme includes the specific person's pronunciation of a sound in the translated word. The embodiment synthesizes the translated text string to a translated audio signal including the translated word pronounced using the custom phoneme.

Type: Grant

Filed: December 22, 2020

Date of Patent: February 28, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Su Liu, Yang Liang, Debbie Anglin, Fan Yang
Jitter buffer control, audio decoder, method and computer program

Patent number: 11580997

Abstract: A jitter buffer control for controlling a provision of a decoded audio content on the basis of an input audio content is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner. An audio decoder uses such a jitter buffer control.

Type: Grant

Filed: June 11, 2020

Date of Patent: February 14, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Reuschl, Stefan Doehla, Jérémie Lecomte, Manuel Jander
Policy authoring for task state tracking during dialogue

Patent number: 11574635

Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.

Type: Grant

Filed: December 20, 2019

Date of Patent: February 7, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Paul Crook, Vasiliy Radostev, Omar Zia Khan, Vipul Agarwal, Ruhi Sarikaya, Marius Alexandru Marin, Alexandre Rochette, Jean-Philippe Robichaud
Feeling experience correlation

Patent number: 11574553

Abstract: A system including sensors configured to provide physiological markers of a developer and a controller configured to provide information indicative of a user experience to the developer while receiving signals from the sensors. The controller is configured to utilize cognitive analysis to determine developer emotion responses as the developer receives the user experience. The controller compares a developer emotion classification with a user emotion classification of a user as the user generated the user experience. The system generates a prioritized backlog to identify points where emotion responses between user and developer are in common, or where emotion responses between user and developer differ.

Type: Grant

Filed: September 18, 2019

Date of Patent: February 7, 2023

Assignee: International Business Machines Corporation

Inventors: Stan Kevin Daley, Michael Bender, Siddhartha Sood, Shawn D. Hennessy
Schema-guided response generation

Patent number: 11551159

Abstract: Generally, the present disclosure is directed to systems and methods for performing task-oriented response generation that can provide advantages for artificial intelligence systems or other computing systems that include natural language processing for interpreting user input. Example implementations can process natural language descriptions of various services that can be accessed by the system. In response to a natural language input, systems can identify relevant values for executing one of the service(s), based in part on comparing embedded representations of the natural language input and the natural language description using a machine learned model.

Type: Grant

Filed: December 23, 2019

Date of Patent: January 10, 2023

Assignee: GOOGLE LLC

Inventors: Abhinav Kumar Rastogi, Raghav Gupta, Xiaoxue Zang, Srinivas Kumar Sunkara, Pranav Khaitan
Electronic apparatus and control method thereof

Patent number: 11544469

Abstract: An electronic apparatus is disclosed. The electronic apparatus includes a display, a storage in which keyword information by product specification is stored, and a processor configured to obtain user feedback on the product by crawling a website, identify positive feedback or negative feedback among the user feedback corresponding to the keyword information by specification by performing natural language processing (NLP) to which at least two different algorithms are applied, and display a result of the identification through the display.

Type: Grant

Filed: January 2, 2019

Date of Patent: January 3, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Rajasimhan Baskar
Voice message capturing system

Patent number: 11527251

Abstract: Systems, apparatuses, and methods for capturing voice messages are provided. In one embodiment, a method can include receiving, by one or more processors of a mobile user device, a user input indicative of a voice message at a first time. The method can further include identifying contextual data indicative of one or more computing devices within proximity of the mobile user device. The method can include providing a set of data for storage in one or more memory devices of the mobile user device. The set of data can indicate the voice message and the contextual data indicative of the computing devices. The method can further include providing an output indicative of the voice message and the contextual data to one or more secure computing devices at a second time.

Type: Grant

Filed: December 1, 2020

Date of Patent: December 13, 2022

Assignee: GOOGLE LLC

Inventors: Jonathan Brandt Moeller, Jeremy Drew Payne
Speech recognition with parallel recognition tasks

Patent number: 11527248

Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meet a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.

Type: Grant

Filed: May 27, 2020

Date of Patent: December 13, 2022

Assignee: GOOGLE LLC

Inventors: Brian Strope, Francoise Beaufays, Olivier Siohan
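
The control flow in this abstract (start several recognizers, stop early once a result is confident enough) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: `recognize_parallel` is a hypothetical name, and each recognizer is assumed to be a plain callable that takes the audio and returns a `(text, confidence)` pair.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def recognize_parallel(recognizers, audio, threshold):
    """Run hypothetical recognizers concurrently and stop early.

    Returns the first (text, confidence) result whose confidence meets
    the threshold, or the most confident result if none does.
    """
    pool = ThreadPoolExecutor(max_workers=len(recognizers))
    futures = [pool.submit(r, audio) for r in recognizers]
    best = None
    try:
        for fut in as_completed(futures):
            text, conf = fut.result()
            if best is None or conf > best[1]:
                best = (text, conf)
            if conf >= threshold:
                break  # confident enough: stop waiting for the rest
    finally:
        # cancel() only stops tasks that have not started yet; a task
        # already running in a thread finishes in the background and
        # its result is simply ignored.
        for f in futures:
            f.cancel()
        pool.shutdown(wait=False)
    return best
```

Because Python threads cannot be interrupted from outside, "aborting" here means no longer waiting for the slower recognizers. A real system would need a cooperative stop signal or separate processes to reclaim their resources.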
Automated pipeline selection for synthesis of audio assets

Patent number: 11521594

Abstract: An example method of automated selection of audio asset synthesizing pipelines includes: receiving an audio stream comprising human speech; determining one or more features of the audio stream; selecting, based on the one or more features of the audio stream, an audio asset synthesizing pipeline; training, using the audio stream, one or more audio asset synthesizing models implementing respective stages of the selected audio asset synthesizing pipeline; and responsive to determining that a quality metric of the audio asset synthesizing pipeline satisfies a predetermined quality condition, synthesizing one or more audio assets by the selected audio asset synthesizing pipeline.

Type: Grant

Filed: November 10, 2020

Date of Patent: December 6, 2022

Assignee: Electronic Arts Inc.

Inventors: Kilol Gupta, Tushar Agarwal, Zahra Shakeri, Mohsen Sardari, Harold Henry Chaput, Navid Aghdaie
Machine learning tool for navigating a dialogue flow

Patent number: 11501763

Abstract: Embodiments provide systems and methods for navigating a dialogue flow using a trained intelligence bot. Upon initiation of a chat session between a user and a trained intelligence bot, one or more utterances can be received. The utterances can be processed using the trained intelligence bot to resolve an intent from among a plurality of predefined intents, where the intelligence bot is trained to resolve predefined intents based on training data associated with the predefined intents. A predefined dialogue flow associated with the resolved intent can be navigated using the intelligence bot, where the intelligence bot guides the user through the dialogue flow using context variables that are associated with the user or the chat session. The user can be provided enterprise data retrieved by the intelligence bot using a retrieval request generated based on one or more of the navigation of the dialogue flow or the context variables.

Type: Grant

Filed: April 18, 2019

Date of Patent: November 15, 2022

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Kiran V. Panchamgam, Sandhya Lonial, Sajith Vijayan
Loopback audio channels for echo cancellation in web browsers

Patent number: 11501791

Abstract: Media, methods, and systems are provided for audio rerouting to echo cancel audio in web browsers hosting video streams. Spoken audio from a presenter in a video stream may be received via a microphone on a presenter computing device using a first audio connection. Echo cancellation for the presenter may be enabled. Media audio from the presenter may be received originating from a second audio connection. In response to receiving the media audio, a loopback connection for the presenter may be created. In the loopback connection, the presenter may act as both the sender and receiver of the media audio. The loopback connection may have echo cancellation enabled and use the first audio connection. Once the loopback connection is created, the audio may be routed through the loopback connection. The audio may then be played out of an audio output device for the presenter with echo cancellation enabled.

Type: Grant

Filed: November 22, 2021

Date of Patent: November 15, 2022

Assignee: Hopin Ltd

Inventors: Dan Briggs, Geige Vandentop
Privacy device for smart speakers

Patent number: 11503418

Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.

Type: Grant

Filed: February 10, 2020

Date of Patent: November 15, 2022

Inventor: Thomas Stachura
Audio recognition method, device and server

Patent number: 11482242

Abstract: An audio recognition method, including: acquiring an audio file to be recognized (S100); extracting audio feature information of the audio file to be recognized, the audio feature information including audio fingerprints (S200); searching, in a fingerprint index database, audio attribute information matched with the audio feature information, the fingerprint index database including an audio fingerprint set in which invalid audio fingerprint removal has been performed on audio sample data (S300). As the audio fingerprint set in the fingerprint index database has been subjected to invalid audio fingerprint removal of audio sample data, the storage space of audio fingerprints in the fingerprint index database can be reduced, and the audio recognition efficiency can be improved. Further provided are an audio recognition device and a server.

Type: Grant

Filed: October 17, 2018

Date of Patent: October 25, 2022

Assignee: Beijing Dajia Internet Information Technology Co., Ltd.

Inventor: Tao Jiang
Privacy device for smart speakers

Patent number: 11477590

Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.

Type: Grant

Filed: February 10, 2020

Date of Patent: October 18, 2022

Inventor: Thomas Stachura
Electronic apparatus and control method thereof

Patent number: 11462214

Abstract: An electronic apparatus is provided. The electronic apparatus includes a communicator comprising communication circuitry configured to communicate with a voice recognition server; and a processor configured to control the communicator to establish a session with the voice recognition server, based on a voice input start signal being received from a first external apparatus, to maintain the established session based on the voice input start signal being received from a second external apparatus in a state where the session is established, and to process voice recognition on audio data received from the second external apparatus using the maintained session.

Type: Grant

Filed: December 5, 2018

Date of Patent: October 4, 2022

Assignee: Samsung Electronics Co., Ltd.

Inventor: Jangho Jin
Secure word search

Patent number: 11461551

Abstract: A method may include generating word string vectors for word strings in a document, obtaining encrypted word string vectors by encrypting the word string vectors, generating a search vector for a search query, obtaining an encrypted search vector by encrypting the search vector, calculating encrypted distances between the encrypted word string vectors and the encrypted search vector, obtaining a decrypted distance by decrypting an encrypted distance, and using the decrypted distance, determining a semantic match between the search query and the document.

Type: Grant

Filed: October 23, 2019

Date of Patent: October 4, 2022

Assignee: Private AI Inc.

Inventors: Patricia Araujo Thaine, Gerald B. Penn
