Patents by Inventor Jinyu Li

Jinyu Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20120257797
    Abstract: A method for biometric identification for use with a computing device is provided herein. The method includes capturing a temporal sequence of images of the face of a user at different locations within a three-dimensional interaction space. The method further includes extracting one or more face descriptors from the images and generating a biometric template compiling the face descriptors.
    Type: Application
    Filed: April 5, 2011
    Publication date: October 11, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Tommer Leyvand, Jinyu Li, Casey Meekhof, Tim Keosababian, Simon Stachniak, Ryan Gunn, Alan Stuart, Russ Glaser, Eddie Mays, Toan Huynh, Richard Irving, James Waletzky, Ajay Ramjee
  • Patent number: 8265341
    Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.
    Type: Grant
    Filed: January 25, 2010
    Date of Patent: September 11, 2012
    Assignee: Microsoft Corporation
    Inventors: Mitchell Dernis, Tommer Leyvand, Christian Klein, Jinyu Li
  • Patent number: 8237720
    Abstract: Embodiments for shader-based finite state machine frame detection for implementing alternative graphical processing on an animation scenario are disclosed. In accordance with one embodiment, the embodiment includes assigning an identifier to each shader used to render animation scenarios. The embodiment also includes defining a finite state machine for a key frame in each of the animation scenarios, whereby each finite state machine represents a plurality of shaders that render the key frame in each animation scenario. The embodiment further includes deriving a shader ID sequence for each finite state machine based on the identifier assigned to each shader. The embodiment additionally includes comparing an input shader ID sequence of a new frame of a new animation scenario to each of the derived shader ID sequences. Finally, the embodiment includes executing alternative graphics processing on the new animation scenario when the input shader ID sequence matches one of the derived shader ID sequences.
    Type: Grant
    Filed: February 12, 2009
    Date of Patent: August 7, 2012
    Assignee: Microsoft Corporation
    Inventors: Jinyu Li, Chen Li, Xin Tong
  • Patent number: 8239195
    Abstract: A speech recognition system includes a receiver component that receives a distorted speech utterance. The speech recognition system also includes an adaptor component that selectively adapts parameters of a compressed model used to recognize at least a portion of the distorted speech utterance, wherein the adaptor component selectively adapts the parameters of the compressed model based at least in part upon the received distorted speech utterance.
    Type: Grant
    Filed: September 23, 2008
    Date of Patent: August 7, 2012
    Assignee: Microsoft Corporation
    Inventors: Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alejandro Acero
  • Patent number: 8217962
    Abstract: Embodiments for single-pass bounding box calculation are disclosed. In accordance with one embodiment, the single-pass bounding box calculation includes rendering a first target to a 2-dimensional screen space, whereby the first target includes at least six pixels. The calculation further includes producing transformed vertices in a set of geometry primitives based on an application-specified transformation. The calculation also includes generating six new points for each transformed vertex in the set of geometry primitives. The calculation additionally includes producing an initial third coordinate value for each pixel by rendering the at least six new points generated for each pixel to each corresponding pixel. The calculation further includes producing a post-rasterization value for each pixel by rasterizing the at least six new points rendered to each pixel with each corresponding pixel.
    Type: Grant
    Filed: January 29, 2009
    Date of Patent: July 10, 2012
    Assignee: Microsoft Corporation
    Inventors: Xin Tong, Chen Li, Jinyu Li
  • Patent number: 8214215
    Abstract: A speech recognition system described herein includes a receiver component that receives a distorted speech utterance. The speech recognition system also includes an updater component that is in communication with a first model and a second model, wherein the updater component automatically updates parameters of the second model based at least in part upon joint estimates of additive and convolutive distortions output by the first model, wherein the joint estimates of additive and convolutive distortions are estimates of distortions based on a phase-sensitive model in the speech utterance received by the receiver component. Further, distortions other than additive and convolutive distortions, including other stationary and nonstationary sources, can also be estimated and used to update the parameters of the second model.
    Type: Grant
    Filed: September 24, 2008
    Date of Patent: July 3, 2012
    Assignee: Microsoft Corporation
    Inventors: Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alejandro Acero
  • Publication number: 20120165097
    Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.
    Type: Application
    Filed: March 2, 2012
    Publication date: June 28, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian
  • Publication number: 20120130710
    Abstract: Noise and channel distortion parameters in the vectorized logarithmic or the cepstral domain for an utterance may be estimated, and subsequently the distorted speech parameters in the same domain may be updated using an unscented transformation framework during online automatic speech recognition. An utterance, including speech generated from a transmission source for delivery to a receiver, may be received by a computing device. The computing device may execute instructions for applying the unscented transformation framework to speech feature vectors, representative of the speech, in order to estimate, in a sequential or online manner, static noise and channel distortion parameters and dynamic noise distortion parameters in the unscented transformation framework. The static and dynamic parameters for the distorted speech in the utterance may then be updated from clean speech parameters and the noise and channel distortion parameters using non-linear mapping.
    Type: Application
    Filed: November 18, 2010
    Publication date: May 24, 2012
    Applicant: Microsoft Corporation
    Inventors: Deng Li, Jinyu Li, Dong Yu, Yifan Gong
  • Patent number: 8180637
    Abstract: A method of compensating for additive and convolutive distortions applied to a signal indicative of an utterance is discussed. The method includes receiving a signal and initializing noise mean and channel mean vectors. Gaussian dependent matrix and Hidden Markov Model (HMM) parameters are calculated or updated to account for additive noise from the noise mean vector or convolutive distortion from the channel mean vector. The HMM parameters are adapted by decoding the utterance using the previously calculated HMM parameters and adjusting the Gaussian dependent matrix and the HMM parameters based upon data received during the decoding. The adapted HMM parameters are applied to decode the input utterance and provide a transcription of the utterance.
    Type: Grant
    Filed: December 3, 2007
    Date of Patent: May 15, 2012
    Assignee: Microsoft Corporation
    Inventors: Dong Yu, Li Deng, Alejandro Acero, Yifan Gong, Jinyu Li
  • Publication number: 20110190055
    Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.
    Type: Application
    Filed: January 29, 2010
    Publication date: August 4, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian
  • Publication number: 20110182481
    Abstract: A system and method are disclosed for tracking image and audio data over time to automatically identify a person based on a correlation of their voice with their body in a multi-user game or multimedia setting.
    Type: Application
    Filed: January 25, 2010
    Publication date: July 28, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Mitchell Dernis, Tommer Leyvand, Christian Klein, Jinyu Li
  • Publication number: 20110144986
    Abstract: Described is a calibration model for use in a speech recognition system. The calibration model adjusts the confidence scores output by a speech recognition engine to thereby provide an improved calibrated confidence score for use by an application. The calibration model is one that has been trained for a specific usage scenario, e.g., for that application, based upon a calibration training set obtained from a previous similar/corresponding usage scenario or scenarios. Different calibration models may be used with different usage scenarios, e.g., during different conditions. The calibration model may comprise a maximum entropy classifier with distribution constraints, trained with continuous raw confidence scores and multi-valued word tokens, and/or other distributions and extracted features.
    Type: Application
    Filed: December 10, 2009
    Publication date: June 16, 2011
    Applicant: Microsoft Corporation
    Inventors: Dong Yu, Li Deng, Jinyu Li
  • Publication number: 20100318355
    Abstract: Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.
    Type: Application
    Filed: June 10, 2009
    Publication date: December 16, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Jinyu Li, Yifan Gong, Chaojun Liu, Kaisheng Yao
  • Publication number: 20100214294
    Abstract: An exemplary method for tessellating a primitive of a graphical object includes receiving information for a primitive of a graphical object where the information includes vertex information and an edge factor for each edge of the primitive; based on the received information, dividing the primitive into parts where each part corresponds to at least a portion of an edge of the primitive and at least one vertex of the primitive and where each part has an association with the edge factor of the corresponding edge; for each of the parts, executing a geometry shader on a graphics processing unit (GPU) where the executing includes determining barycentric coordinates for a respective part based in part on its associated edge factor; for each of the parts, outputting the barycentric coordinates to a vertex buffer; and generating a tessellated mesh for the primitive based on the vertex information and the barycentric coordinates of the vertex buffer where the generating includes invoking a draw function of the GPU.
    Type: Application
    Filed: February 20, 2009
    Publication date: August 26, 2010
    Applicant: Microsoft Corporation
    Inventors: Chen Li, Jinyu Li, Xin Tong
  • Publication number: 20100214301
    Abstract: An exemplary method for emulating a graphics processing unit (GPU) includes executing a graphics application on a host computing system to generate commands for a target GPU wherein the host computing system includes host system memory and a different, host GPU; converting the generated commands into intermediate commands; based on one or more generated commands that call for one or more shaders, caching one or more corresponding shaders in a shader cache in the host system memory; based on one or more generated commands that call for one or more resources, caching one or more corresponding resources in a resource cache in the host system memory; based on the intermediate commands, outputting commands for the host GPU; and based on the output commands for the host GPU, rendering graphics using the host GPU where output commands that call for one or more shaders access the one or more corresponding shaders in the shader cache and where output commands that call for one or more resources access the one or more
    Type: Application
    Filed: February 23, 2009
    Publication date: August 26, 2010
    Applicant: Microsoft Corporation
    Inventors: Jinyu Li, Chen Li, Gang Chen, Xin Tong
  • Publication number: 20100217579
    Abstract: Emulating legacy hardware using IEEE 754 compliant hardware is disclosed herein. In some aspects, the emulation includes locating an instruction that includes NaN (not a number) as at least one of an operand or a resultant. The emulation adjusts the resultant of the instruction, via additional code, to produce a final resultant of non-compliant (legacy) hardware. Legacy software, which was written in anticipation of processing by legacy hardware, may then be processed using compliant hardware.
    Type: Application
    Filed: February 26, 2009
    Publication date: August 26, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Jinyu Li, Ke Deng, Chen Li
  • Publication number: 20100201691
    Abstract: Embodiments for shader-based finite state machine frame detection for implementing alternative graphical processing on an animation scenario are disclosed. In accordance with one embodiment, the embodiment includes assigning an identifier to each shader used to render animation scenarios. The embodiment also includes defining a finite state machine for a key frame in each of the animation scenarios, whereby each finite state machine represents a plurality of shaders that render the key frame in each animation scenario. The embodiment further includes deriving a shader ID sequence for each finite state machine based on the identifier assigned to each shader. The embodiment additionally includes comparing an input shader ID sequence of a new frame of a new animation scenario to each of the derived shader ID sequences. Finally, the embodiment includes executing alternative graphics processing on the new animation scenario when the input shader ID sequence matches one of the derived shader ID sequences.
    Type: Application
    Filed: February 12, 2009
    Publication date: August 12, 2010
    Applicant: Microsoft Corporation
    Inventors: Jinyu Li, Chen Li, Xin Tong
  • Publication number: 20100188412
    Abstract: Providing content based cache for graphic resource management is disclosed herein. In some aspects, a portion of a shadow copy of graphics resources is updated from an original copy of the graphics resources when a requested resource is not current. The shadow copy may be dedicated to a graphics processing unit (GPU) while the original copy may be maintained by a central processing unit (CPU). In further aspects, the requested graphics resource in the shadow copy may be compared to a corresponding graphics resource in the original copy when the GPU requests the graphics resource. The comparison may be performed by comparing hashes of each graphics resource and/or by comparing at least a portion of the graphics resources.
    Type: Application
    Filed: January 28, 2009
    Publication date: July 29, 2010
    Applicant: Microsoft Corporation
    Inventors: Chen Li, Jinyu Li, Xin Tong, Barry C. Bond, Gang Chen
  • Publication number: 20100188404
    Abstract: Embodiments for single-pass bounding box calculation are disclosed. In accordance with one embodiment, the single-pass bounding box calculation includes rendering a first target to a 2-dimensional screen space, whereby the first target includes at least six pixels. The calculation further includes producing transformed vertices in a set of geometry primitives based on an application-specified transformation. The calculation also includes generating six new points for each transformed vertex in the set of geometry primitives. The calculation additionally includes producing an initial third coordinate value for each pixel by rendering the at least six new points generated for each pixel to each corresponding pixel. The calculation further includes producing a post-rasterization value for each pixel by rasterizing the at least six new points rendered to each pixel with each corresponding pixel.
    Type: Application
    Filed: January 29, 2009
    Publication date: July 29, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Xin Tong, Chen Li, Jinyu Li
  • Publication number: 20100076757
    Abstract: A speech recognition system includes a receiver component that receives a distorted speech utterance. The speech recognition system also includes an adaptor component that selectively adapts parameters of a compressed model used to recognize at least a portion of the distorted speech utterance, wherein the adaptor component selectively adapts the parameters of the compressed model based at least in part upon the received distorted speech utterance.
    Type: Application
    Filed: September 23, 2008
    Publication date: March 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alejandro Acero
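
Illustrative sketch (not taken from any of the filings): the abstract for publication 20100188412 in the list above describes refreshing a shadow copy of a graphics resource from the original copy when a hash comparison shows that the requested resource is not current. The Python below is one way to picture that idea under stated assumptions. The class name ShadowCache, the method request, the updates counter, and the choice of SHA-256 are all hypothetical, and plain dictionaries stand in for the CPU-maintained and GPU-dedicated copies.

```python
import hashlib
from typing import Dict


def content_hash(data: bytes) -> str:
    """Hash a resource's bytes. SHA-256 is an assumption made for this sketch."""
    return hashlib.sha256(data).hexdigest()


class ShadowCache:
    """Hypothetical shadow copy of resources, refreshed only when stale."""

    def __init__(self, original: Dict[str, bytes]) -> None:
        self.original = original            # stands in for the CPU-maintained copy
        self.shadow: Dict[str, bytes] = {}  # stands in for the GPU-dedicated copy
        self.updates = 0                    # number of refreshes performed

    def request(self, name: str) -> bytes:
        """Return the shadow resource, copying from the original if hashes differ."""
        current = self.original[name]
        cached = self.shadow.get(name)
        if cached is None or content_hash(cached) != content_hash(current):
            # Only the requested resource is refreshed, not the whole shadow copy.
            self.shadow[name] = current
            self.updates += 1
        return self.shadow[name]


resources = {"texture": b"version-1"}
cache = ShadowCache(resources)
cache.request("texture")              # first request populates the shadow copy
cache.request("texture")              # unchanged content: no refresh
resources["texture"] = b"version-2"
cache.request("texture")              # hash mismatch: shadow copy is refreshed
```

In this sketch a refresh happens only on the first request and after the original content changes, which is the behavior the abstract attributes to comparing hashes rather than copying on every request.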