Patents by Inventor Yinyi Guo

Yinyi Guo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Complementary virtual audio generation

Patent number: 11212637

Abstract: An apparatus includes a processor configured to receive one or more media signals associated with a scene. The processor is also configured to identify a spatial location in the scene for each source of the one or more media signals. The processor is further configured to identify audio content for each media signal of the one or more media signals. The processor is also configured to determine one or more candidate spatial locations in the scene based on the identified spatial locations. The processor is further configured to generate audio to playback as virtual sounds that originate from the one or more candidate spatial locations.

Type: Grant

Filed: April 12, 2018

Date of Patent: December 28, 2021

Assignee: Qualcomm Incorporated

Inventors: Yinyi Guo, Lae-Hoon Kim, Dongmei Wang, Erik Visser

METHOD AND APPARATUS FOR TARGET SOUND DETECTION

Publication number: 20210312943

Abstract: A device to perform target sound detection includes one or more processors. The one or more processors include a buffer configured to store audio data and a target sound detector. The target sound detector includes a first stage and a second stage. The first stage includes a binary target sound classifier configured to process the audio data. The first stage is configured to activate the second stage in response to detection of a target sound. The second stage is configured to receive the audio data from the buffer in response to the detection of the target sound.

Type: Application

Filed: April 1, 2020

Publication date: October 7, 2021

Inventors: Prajakt KULKARNI, Yinyi GUO, Erik VISSER

Audio analytics for natural language processing

Patent number: 11094316

Abstract: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.

Type: Grant

Filed: May 4, 2018

Date of Patent: August 17, 2021

Assignee: QUALCOMM Incorporated

Inventors: Erik Visser, Fatemeh Saki, Yinyi Guo, Sunkuk Moon, Lae-Hoon Kim, Ravi Choudhary

SOUND EVENT DETECTION LEARNING

Publication number: 20210158837

Abstract: A device includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound classes. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound classes. A second count of classes of the second set of sound classes is greater than a first count of classes of the first set of sound classes. The processor is also configured to provide the first output to a neural adapter to generate a third output corresponding to the second set of sound classes. The processor is further configured to provide the second output and the third output to a merger adapter to generate sound event identification data based on the audio data samples.

Type: Application

Filed: November 24, 2020

Publication date: May 27, 2021

Inventors: Fatemeh SAKI, Yinyi GUO, Erik VISSER, Eunjeong KOH

ACTIVITY QUERY RESPONSE SYSTEM

Publication number: 20210011887

Abstract: A device for activity tracking includes a memory and one or more processors. The memory is configured to store an activity log. The one or more processors are configured to update the activity log based on activity data. The activity data is received from a second device. The one or more processors are also configured to, responsive to receiving a natural language query, generate a query response based on the activity log.

Type: Application

Filed: September 27, 2019

Publication date: January 14, 2021

Inventors: Erik VISSER, Rehana MAHFUZ, Ravi CHOUDHARY, Lae-Hoon KIM, Sunkuk MOON, Yinyi GUO, Fatemeh SAKI

MULTI-MODAL USER INTERFACE

Publication number: 20210012770

Abstract: A device for multi-modal user input includes a processor configured to process first data received from a first input device. The first data indicates a first input from a user based on a first input mode. The first input corresponds to a command. The processor is configured to send a feedback message to an output device based on processing the first data. The feedback message instructs the user to provide, based on a second input mode that is different from the first input mode, a second input that identifies a command associated with the first input. The processor is configured to receive second data from a second input device, the second data indicating the second input, and to update a mapping to associate the first input to the command identified by the second input.

Type: Application

Filed: November 15, 2019

Publication date: January 14, 2021

Inventors: Ravi Choudhary, Lae-Hoon Kim, Sunkuk Moon, Yinyi Guo, Fatemeh Saki, Erik Visser

Characteristic-based speech codebook selection

Patent number: 10878831

Abstract: An apparatus includes a speech processing engine configured to receive data corresponding to speech and to determine whether a first characteristic associated with the speech differs from a reference characteristic by at least a threshold amount. The apparatus further includes a selection circuit responsive to the speech processing engine. The selection circuit is configured to select a particular speech codebook from among a plurality of speech codebooks based on the first characteristic differing from the reference characteristic by at least the threshold amount. The particular speech codebook is associated with the first characteristic.

Type: Grant

Filed: January 12, 2017

Date of Patent: December 29, 2020

Assignee: QUALCOMM Incorporated

Inventors: Yinyi Guo, Erik Visser

User experience evaluation

Patent number: 10872604

Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or more subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.

Type: Grant

Filed: May 17, 2018

Date of Patent: December 22, 2020

Assignee: Qualcomm Incorporated

Inventors: Lae-Hoon Kim, Yinyi Guo, Ravi Choudhary, Sunkuk Moon, Erik Visser, Fatemeh Saki

SYSTEM AND METHOD TO VIEW OCCUPANT STATUS AND MANAGE DEVICES OF BUILDING

Publication number: 20200313923

Abstract: A device to provide information to a visual interface that is mountable to a vehicle dashboard includes a memory configured to store device information indicative of controllable devices of a building and occupant data indicative of one or more occupants of the building. The device includes a processor configured to receive, in real-time, status information associated with the one or more occupants of the building. The status information includes at least one of dynamic location information or dynamic activity information. The processor is configured to generate an output to provide, at the visual interface device, a visual representation of at least a portion of the building and the status information associated with the one or more occupants. The processor is also configured to generate an instruction to adjust an operation of one or more devices of the controllable devices based on user input.

Type: Application

Filed: March 29, 2019

Publication date: October 1, 2020

Inventors: Ravi CHOUDHARY, Yinyi GUO, Fatemeh SAKI, Erik VISSER

Enhanced speech generation

Patent number: 10783890

Abstract: In a particular aspect, a speech generator includes a signal input configured to receive a first audio signal. The speech generator also includes at least one speech signal processor configured to generate a second audio signal based on information associated with the first audio signal and based further on automatic speech recognition (ASR) data associated with the first audio signal.

Type: Grant

Filed: April 26, 2019

Date of Patent: September 22, 2020

Assignee: Moore Intellectual Property Law, PLLC

Inventors: Erik Visser, Shuhua Zhang, Lae-Hoon Kim, Yinyi Guo, Sunkuk Moon

Keyword voice authentication

Patent number: 10720165

Abstract: A method of authenticating a user based on voice recognition of a keyword includes generating, at a processor, clean speech statistics. The clean speech statistics are generated from an audio recording of the keyword spoken by the user during an enrollment phase. The method further includes separating speech data and noise data from noisy input speech using the clean speech statistics during an authentication phase. The method also includes authenticating the user by comparing the speech data to the clean speech statistics or by comparing the noisy input speech to noisy speech statistics. The noisy speech statistics are based at least in part on the noise data.

Type: Grant

Filed: January 23, 2017

Date of Patent: July 21, 2020

Assignee: QUALCOMM Incorporated

Inventors: Yinyi Guo, Erik Visser

User interface for secure access to a device using speaker verification

Patent number: 10540979

Abstract: A device includes a memory, a receiver, a processor, and a display. The memory is configured to store a speaker model. The receiver is configured to receive an input audio signal. The processor is configured to determine a first confidence level associated with a first portion of the input audio signal based on the speaker model. The processor is also configured to determine a second confidence level associated with a second portion of the input audio signal based on the speaker model. The display is configured to present a graphical user interface associated with the first confidence level or associated with the second confidence level.

Type: Grant

Filed: April 16, 2015

Date of Patent: January 21, 2020

Assignee: Qualcomm Incorporated

Inventors: Erik Visser, Lae-Hoon Kim, Minho Jin, Yinyi Guo

USER EXPERIENCE EVALUATION

Publication number: 20190355351

Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or more subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.

Type: Application

Filed: May 17, 2018

Publication date: November 21, 2019

Inventors: Lae-Hoon Kim, Yinyi Guo, Ravi Choudhary, Sunkuk Moon, Erik Visser, Fatemeh Saki

AUDIO ANALYTICS FOR NATURAL LANGUAGE PROCESSING

Publication number: 20190341026

Abstract: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.

Type: Application

Filed: May 4, 2018

Publication date: November 7, 2019

Inventors: Erik Visser, Fatemeh Saki, Yinyi Guo, Sunkuk Moon, Lae-Hoon Kim, Ravi Choudhary

COMPLEMENTARY VIRTUAL AUDIO GENERATION

Publication number: 20190320281

Abstract: An apparatus includes a processor configured to receive one or more media signals associated with a scene. The processor is also configured to identify a spatial location in the scene for each source of the one or more media signals. The processor is further configured to identify audio content for each media signal of the one or more media signals. The processor is also configured to determine one or more candidate spatial locations in the scene based on the identified spatial locations. The processor is further configured to generate audio to playback as virtual sounds that originate from the one or more candidate spatial locations.

Type: Application

Filed: April 12, 2018

Publication date: October 17, 2019

Inventors: Yinyi Guo, Lae-Hoon Kim, Dongmei Wang, Erik Visser

ENHANCED SPEECH GENERATION

Publication number: 20190251971

Abstract: In a particular aspect, a speech generator includes a signal input configured to receive a first audio signal. The speech generator also includes at least one speech signal processor configured to generate a second audio signal based on information associated with the first audio signal and based further on automatic speech recognition (ASR) data associated with the first audio signal.

Type: Application

Filed: April 26, 2019

Publication date: August 15, 2019

Inventors: Erik Visser, Shuhua Zhang, Lae-Hoon Kim, Yinyi Guo, Sunkuk Moon

Enhanced speech generation

Patent number: 10332520

Abstract: In a particular aspect, an apparatus includes an audio sensor configured to receive an input audio signal. The apparatus also includes speech generative circuitry configured to generate a synthesized audio signal based at least partly on automatic speech recognition (ASR) data associated with the input audio signal and based on one or more parameters indicative of state information associated with the input audio signal.

Type: Grant

Filed: February 13, 2017

Date of Patent: June 25, 2019

Assignee: Qualcomm Incorporated

Inventors: Erik Visser, Shuhua Zhang, Lae-Hoon Kim, Yinyi Guo, Sunkuk Moon

DESCRIPTIVE TEXT-BASED INPUT BASED ON NON-AUDIBLE SENSOR DATA

Publication number: 20190138095

Abstract: An apparatus includes one or more sensor units configured to detect non-audible sensor data associated with a user. The apparatus also includes a processor, including an action determination unit, coupled to the one or more sensor units. The processor is configured to generate a descriptive text-based input based on the non-audible sensor data. The processor is also configured to determine an action to be performed based on the descriptive text-based input.

Type: Application

Filed: November 3, 2017

Publication date: May 9, 2019

Inventors: Erik Visser, Sunkuk Moon, Yinyi Guo, Lae-Hoon Kim, Shuhua Zhang

WIRELESS CONTROL OF REMOTE DEVICES THROUGH INTENTION CODES OVER A WIRELESS CONNECTION

Publication number: 20190098070

Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.

Type: Application

Filed: September 27, 2017

Publication date: March 28, 2019

Inventors: Lae-Hoon Kim, Erik Visser, Yinyi Guo

ACOUSTIC EVENT ENABLED GEOGRAPHIC MAPPING

Publication number: 20180307753

Abstract: An electronic device includes a classifier circuit, a ranking circuit, and a data generator circuit. The classifier circuit is configured to determine, based on first data indicating samples of sounds detected at a plurality of geographic locations, a plurality of acoustic event classifications associated with the plurality of geographic locations. The ranking circuit is configured to determine a plurality of index scores associated with the plurality of geographic locations by ranking each of the plurality of geographic locations based on the plurality of acoustic event classifications. The data generator circuit is configured to generate, based on the plurality of index scores, second data indicating a geographic map corresponding to the plurality of geographic locations. The second data further indicates the plurality of index scores and a prompt to enable a search for a particular type of acoustic event.

Type: Application

Filed: April 21, 2017

Publication date: October 25, 2018

Inventors: Yinyi GUO, Erik Visser, Lae-Hoon Kim
