Patents by Inventor Jinyu Li

Jinyu Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Managed biometric identity

Patent number: 8924735

Abstract: A computing system such as a game console maintains and updates a biometric profile of a user. In one aspect, biometric data of the user is continuously obtained from a sensor such as an infrared and visible light camera, and used to update the biometric profile using a machine learning process. In another aspect, a user is prompted to confirm his or her identify when multiple users are detected at the same time and/or when the user is detected with a confidence level which is below a threshold. A real-time image of the user being identified can be displayed on a user interface with user images associated with one or more accounts. In another aspect, the biometric profile is managed by a shell on the computing system, where the shell makes the biometric profile available to any of a number of applications on the computing system.

Type: Grant

Filed: February 15, 2013

Date of Patent: December 30, 2014

Assignee: Microsoft Corporation

Inventors: Ronald Forbes, Bhaven Dedhia, Tim Keosababian, Tommer Leyvand, Jinyu Li, Timothy Gerken
RESTRUCTURING DEEP NEURAL NETWORK ACOUSTIC MODELS

Publication number: 20140372112

Abstract: A Deep Neural Network (DNN) model used in an Automatic Speech Recognition (ASR) system is restructured. A restructured DNN model may include fewer parameters compared to the original DNN model. The restructured DNN model may include a monophone state output layer in addition to the senone output layer of the original DNN model. Singular value decomposition (SVD) can be applied to one or more weight matrices of the DNN model to reduce the size of the DNN Model. The output layer of the DNN model may be restructured to include monophone states in addition to the senones (tied triphone states) which are included in the original DNN model. When the monophone states are included in the restructured DNN model, the posteriors of monophone states are used to select a small part of senones to be evaluated.

Type: Application

Filed: June 18, 2013

Publication date: December 18, 2014

Inventors: Jian Xue, Emilian Stoimenov, Jinyu Li, Yifan Gong
BIOMETRIC RECOGNITION

Publication number: 20140349747

Abstract: A method for biometric identification for use with a computing device is provided herein. The method includes capturing a temporal sequence of images of the face of a user at different locations within a three-dimensional interaction space. The method further includes extracting one or more face descriptors from the images and generating a biometric template compiling the face descriptors.

Type: Application

Filed: August 5, 2014

Publication date: November 27, 2014

Inventors: Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian, Simon Stachniak, Ryan Gunn, Alan Stuart, Russ Glaser, Eddie Mays, Toan Huynh, Richard Irving, James Waletzky, Ajay Ramjee
Visual based identitiy tracking

Patent number: 8864581

Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.

Type: Grant

Filed: January 29, 2010

Date of Patent: October 21, 2014

Assignee: Microsoft Corporation

Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian
Posterior-Based Feature with Partial Distance Elimination for Speech Recognition

Publication number: 20140257814

Abstract: A high-dimensional posterior-based feature with partial distance elimination may be utilized for speech recognition. The log likelihood values of a large number of Gaussians are needed to generate the high-dimensional posterior feature. Gaussians with very small log likelihoods are associated with zero posterior values. Log likelihoods for Gaussians for a speech frame may be evaluated with a partial distance elimination method. If the partial distance of a Gaussian is already too small, the Gaussian will have a zero posterior value. The partial distance may be calculated by sequentially adding individual dimensions in a group of dimensions. The partial distance elimination occurs when less than all of the dimensions in the group are sequentially added.

Type: Application

Filed: March 5, 2013

Publication date: September 11, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Jinyu Li, Zhijie Yan, Qiang Huo, Yifan Gong
EXPLOITING HETEROGENEOUS DATA IN DEEP NEURAL NETWORK-BASED SPEECH RECOGNITION SYSTEMS

Publication number: 20140257804

Abstract: Technologies pertaining to training a deep neural network (DNN) for use in a recognition system are described herein. The DNN is trained using heterogeneous data, the heterogeneous data including narrowband signals and wideband signals. The DNN, subsequent to being trained, receives an input signal that can be either a wideband signal or narrowband signal. The DNN estimates the class posterior probability of the input signal regardless of whether the input signal is the wideband signal or the narrowband signal.

Type: Application

Filed: March 7, 2013

Publication date: September 11, 2014

Applicant: Microsoft Corporation

Inventors: Jinyu Li, Dong Yu, Yifan Gong
MULTILINGUAL DEEP NEURAL NETWORK

Publication number: 20140257805

Abstract: Described herein are various technologies pertaining to a multilingual deep neural network (MDNN). The MDNN includes a plurality of hidden layers, wherein values for weight parameters of the plurality of hidden layers are learned during a training phase based upon training data in terms of acoustic raw features for multiple languages. The MDNN further includes softmax layers that are trained for each target language separately, making use of the hidden layer values trained jointly with multiple source languages. The MDNN is adaptable, such that a new softmax layer may be added on top of the existing hidden layers, where the new softmax layer corresponds to a new target language.

Type: Application

Filed: March 11, 2013

Publication date: September 11, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Jui-Ting Huang, Jinyu Li, Dong Yu, Li Deng, Yifan Gong
Biometric recognition

Patent number: 8824749

Abstract: A method for biometric identification for use with a computing device is provided herein. The method includes capturing a temporal sequence of images of the face of a user at different locations within a three-dimensional interaction space. The method further includes extracting one or more face descriptors from the images and generating a biometric template compiling the face descriptors.

Type: Grant

Filed: April 5, 2011

Date of Patent: September 2, 2014

Assignee: Microsoft Corporation

Inventors: Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian, Simon Stachniak, Ryan Gunn, Alan Stuart, Russ Glaser, Eddie Mays, Toan Huynh, Richard Irving, James Waletzky, Ajay Ramjee
Managed Biometric Identity

Publication number: 20140237587

Abstract: A computing system such as a game console maintains and updates a biometric profile of a user. In one aspect, biometric data of the user is continuously obtained from a sensor such as an infrared and visible light camera, and used to update the biometric profile using a machine learning process. In another aspect, a user is prompted to confirm his or her identify when multiple users are detected at the same time and/or when the user is detected with a confidence level which is below a threshold. A real-time image of the user being identified can be displayed on a user interface with user images associated with one or more accounts. In another aspect, the biometric profile is managed by a shell on the computing system, where the shell makes the biometric profile available to any of a number of applications on the computing system.

Type: Application

Filed: February 15, 2013

Publication date: August 21, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Ronald Forbes, Bhaven Dedhia, Tim Keosababian, Tommer Leyvand, Jinyu Li, Timothy Gerken
Voice-body identity correlation

Patent number: 8781156

Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.

Type: Grant

Filed: September 10, 2012

Date of Patent: July 15, 2014

Assignee: Microsoft Corporation

Inventors: Mitchell Dernis, Tommer Leyvand, Christian Klein, Jinyu Li
Online distorted speech estimation within an unscented transformation framework

Patent number: 8731916

Abstract: Noise and channel distortion parameters in the vectorized logarithmic or the cepstral domain for an utterance may be estimated, and subsequently the distorted speech parameters in the same domain may be updated using an unscented transformation framework during online automatic speech recognition. An utterance, including speech generated from a transmission source for delivery to a receiver, may be received by a computing device. The computing device may execute instructions for applying the unscented transformation framework to speech feature vectors, representative of the speech, in order to estimate, in a sequential or online manner, static noise and channel distortion parameters and dynamic noise distortion parameters in the unscented transformation framework. The static and dynamic parameters for the distorted speech in the utterance may then be updated from clean speech parameters and the noise and channel distortion parameters using non-linear mapping.

Type: Grant

Filed: November 18, 2010

Date of Patent: May 20, 2014

Assignee: Microsoft Corporation

Inventors: Deng Li, Jinyu Li, Dong Yu, Yifan Gong
VGPU: a real time GPU emulator

Patent number: 8711159

Abstract: An exemplary method for emulating a graphics processing unit (GPU) includes executing a graphics application on a host computing system to generate commands for a target GPU wherein the host computing system includes host system memory and a different, host GPU; converting the generated commands into intermediate commands; based on one or more generated commands that call for one or more shaders, caching one or more corresponding shaders in a shader cache in the host system memory; based on one or more generated commands that call for one or more resources, caching one or more corresponding resources in a resource cache in the host system memory; based on the intermediate commands, outputting commands for the host GPU; and based on the output commands for the host GPU, rendering graphics using the host GPU where output commands that call for one or more shaders access the one or more corresponding shaders in the shader cache and where output commands that call for one or more resources access the one or more

Type: Grant

Filed: February 23, 2009

Date of Patent: April 29, 2014

Assignee: Microsoft Corporation

Inventors: Jinyu Li, Chen Li, Gang Chen, Xin Tong
ILLUMINATION SENSITIVE FACE RECOGNITION

Publication number: 20140105504

Abstract: Systems and methods for face recognition are provided. In one example, a method for face recognition includes receiving a user image and detecting a user luminance of data representing the user's face. An adaptive low pass filter is selected that corresponds to the user luminance of the user's face. The filter is applied to the user image to create a filtered user image. The filtered user image is projected to create a filtered user image representation. A filtered reference image representation that has been filtered with the same low pass filter is selected from a reference image database. The method then determines whether the filtered reference image representation matches the filtered user image representation.

Type: Application

Filed: October 12, 2012

Publication date: April 17, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Eyal Krupka, Tommer Leyvand, Igor Kviatkovsky, Igor Abramovski, Tim Keosababian, Jinyu Li
Real time head pose estimation

Patent number: 8687880

Abstract: Methods are provided for generating a low dimension pose space and using the pose space to estimate one or more head rotation angles of a user head. In one example, training image frames including a test subject head are captured under a plurality of conditions. For each frame an actual head rotation angle about a rotation axis is recorded. In each frame a face image is detected and converted to an LBP feature vector. Using principal component analysis a PCA feature vector is generated. Pose classes related to rotation angles about a rotation axis are defined. The PCA feature vectors are grouped into a pose class that corresponds to the actual rotation angle associated with the PCA feature vector. Linear discriminant analysis is applied to the pose classes to generate the low dimension pose space.

Type: Grant

Filed: March 20, 2012

Date of Patent: April 1, 2014

Assignee: Microsoft Corporation

Inventors: Yichen Wei, Fang Wen, Jian Sun, Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian
Utilizing Scalar Operations for Recognizing Utterances During Automatic Speech Recognition in Noisy Environments

Publication number: 20140067387

Abstract: Scalar operations for model adaptation or feature enhancement may be utilized for recognizing an utterance during automatic speech recognition in a noisy environment. An utterance including distorted speech generated from a transmission source for delivery to a receiver, may be received by a computer. The distorted speech may be caused by the noisy environment and channel distortion. Computations using scalar operations in the form of an algorithm may then be performed for recognizing the utterance. As a result of performing all of the computations with scalar operations, computational complexity is very small in comparison to matrix and vector operations. Vector Taylor Series with diagonal Jacobian approximation may also be utilized as a distortion-model-based noise robust algorithm with scalar operations.

Type: Application

Filed: September 5, 2012

Publication date: March 6, 2014

Applicant: MICROSOFT CORPORATION

Inventors: Jinyu Li, Michael Lewis Seltzer, Yifan Gong
REAL TIME HEAD POSE ESTIMATION

Publication number: 20130251244

Abstract: Methods are provided for generating a low dimension pose space and using the pose space to estimate one or more head rotation angles of a user head. In one example, training image frames including a test subject head are captured under a plurality of conditions. For each frame an actual head rotation angle about a rotation axis is recorded. In each frame a face image is detected and converted to an LBP feature vector. Using principal component analysis a PCA feature vector is generated. Pose classes related to rotation angles about a rotation axis are defined. The PCA feature vectors are grouped into a pose class that corresponds to the actual rotation angle associated with the PCA feature vector. Linear discriminant analysis is applied to the pose classes to generate the low dimension pose space.

Type: Application

Filed: March 20, 2012

Publication date: September 26, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Yichen Wei, Fang Wen, Jian Sun, Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian
Emulating legacy hardware using IEEE 754 compliant hardware

Patent number: 8352241

Abstract: Emulating legacy hardware using IEEE 754 compliant hardware is disclosed herein. In some aspects, the emulation includes locating an instruction that includes NaN (not a number) as at least one of an operand or a resultant. The emulation adjusts the resultant of the instruction, via additional code, to produce a final resultant of non-compliant (legacy) hardware. Legacy software, which was written in anticipation of processing by legacy hardware, may then be processed using compliant hardware.

Type: Grant

Filed: February 26, 2009

Date of Patent: January 8, 2013

Assignee: Microsoft Corporation

Inventors: Jinyu Li, Ke Deng, Chen Li
VOICE-BODY IDENTITY CORRELATION

Publication number: 20120327193

Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.

Type: Application

Filed: September 10, 2012

Publication date: December 27, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Mitchell Dernis, Tommer Leyvand, Christian Klein, Jinyu Li
BIOMETRIC RECOGNITION

Publication number: 20120257797

Abstract: A method for biometric identification for use with a computing device is provided herein. The method includes capturing a temporal sequence of images of the face of a user at different locations within a three-dimensional interaction space. The method further includes extracting one or more face descriptors from the images and generating a biometric template compiling the face descriptors.

Type: Application

Filed: April 5, 2011

Publication date: October 11, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian, Simon Stachniak, Ryan Gunn, Alan Stuart, Russ Glaser, Eddie Mays, Toan Huynh, Richard Irving, James Waletzky, Ajay Ramjee
Voice-body identity correlation

Patent number: 8265341

Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.

Type: Grant

Filed: January 25, 2010

Date of Patent: September 11, 2012

Assignee: Microsoft Corporation

Inventors: Mitchell Dernis, Tommer Leyvand, Christian Klein, Jinyu Li

prev … 5 6 7 8 9 10 next