Patents by Inventor Jinyu Li

Jinyu Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9454958
    Abstract: Technologies pertaining to training a deep neural network (DNN) for use in a recognition system are described herein. The DNN is trained using heterogeneous data, the heterogeneous data including narrowband signals and wideband signals. The DNN, subsequent to being trained, receives an input signal that can be either a wideband signal or narrowband signal. The DNN estimates the class posterior probability of the input signal regardless of whether the input signal is the wideband signal or the narrowband signal.
    Type: Grant
    Filed: March 7, 2013
    Date of Patent: September 27, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jinyu Li, Dong Yu, Yifan Gong
  • Publication number: 20160275947
    Abstract: Systems and methods for speech recognition incorporating environmental variables are provided. The systems and methods capture speech to be recognized. The speech is then recognized utilizing a variable component deep neural network (DNN). The variable component DNN processes the captured speech by incorporating an environment variable. The environment variable may be any variable that is dependent on environmental conditions or the relation of the user, the client device, and the environment. For example, the environment variable may be based on noise of the environment and represented as a signal-to-noise ratio. The variable component DNN may incorporate the environment variable in different ways. For instance, the environment variable may be incorporated into weighting matrices and biases of the DNN, the outputs of the hidden layers of the DNN, or the activation functions of the nodes of the DNN.
    Type: Application
    Filed: September 9, 2014
    Publication date: September 22, 2016
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Jinyu LI, Rui ZHAO, Yifan GONG
  • Publication number: 20160209893
    Abstract: A centrifugal fan for an electronic device is described. The centrifugal fan includes a volute casing with an outlet; and vanes accommodated in the volute casing. The volute casing has an inner surface facing the vanes, the inner surface being formed with a plurality of air guiding channels for directing air inside the volute casing in a direction towards the outlet of the volute casing.
    Type: Application
    Filed: June 30, 2015
    Publication date: July 21, 2016
    Applicant: Lenovo (Beijing) Co., Ltd.
    Inventors: Chunfeng Yuan, Jinyu Li
  • Publication number: 20160140406
    Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.
    Type: Application
    Filed: January 27, 2016
    Publication date: May 19, 2016
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
  • Patent number: 9336775
    Abstract: A high-dimensional posterior-based feature with partial distance elimination may be utilized for speech recognition. The log likelihood values of a large number of Gaussians are needed to generate the high-dimensional posterior feature. Gaussians with very small log likelihoods are associated with zero posterior values. Log likelihoods for Gaussians for a speech frame may be evaluated with a partial distance elimination method. If the partial distance of a Gaussian is already too small, the Gaussian will have a zero posterior value. The partial distance may be calculated by sequentially adding individual dimensions in a group of dimensions. The partial distance elimination occurs when less than all of the dimensions in the group are sequentially added.
    Type: Grant
    Filed: March 5, 2013
    Date of Patent: May 10, 2016
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Jinyu Li, Zhijie Yan, Qiang Huo, Yifan Gong
  • Patent number: 9324321
    Abstract: The adaptation and personalization of a deep neural network (DNN) model for automatic speech recognition is provided. An utterance which includes speech features for one or more speakers may be received in ASR tasks such as voice search or short message dictation. A decomposition approach may then be applied to an original matrix in the DNN model. In response to applying the decomposition approach, the original matrix may be converted into multiple new matrices which are smaller than the original matrix. A square matrix may then be added to the new matrices. Speaker-specific parameters may then be stored in the square matrix. The DNN model may then be adapted by updating the square matrix. This process may be applied to all of a number of original matrices in the DNN model. The adapted DNN model may include a reduced number of parameters than those received in the original DNN model.
    Type: Grant
    Filed: March 7, 2014
    Date of Patent: April 26, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jian Xue, Jinyu Li, Dong Yu, Michael L. Seltzer, Yifan Gong
  • Publication number: 20160078339
    Abstract: Systems and methods are provided for generating a DNN classifier by “learning” a “student” DNN model from a larger more accurate “teacher” DNN model. The student DNN may be trained from un-labeled training data because its supervised signal is obtained by passing the un-labeled training data through the teacher DNN. In one embodiment, an iterative process is applied to train the student DNN by minimize the divergence of the output distributions from the teacher and student DNN models. For each iteration until convergence, the difference in the output distributions is used to update the student DNN model, and output distributions are determined again, using the unlabeled training data. The resulting trained student model may be suitable for providing accurate signal processing applications on devices having limited computational or storage resources such as mobile or wearable devices. In an embodiment, the teacher DNN model comprises an ensemble of DNN models.
    Type: Application
    Filed: September 14, 2015
    Publication date: March 17, 2016
    Inventors: Jinyu Li, Rui Zhao, Jui-Ting Huang, Yifan Gong
  • Patent number: 9280969
    Abstract: Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.
    Type: Grant
    Filed: June 10, 2009
    Date of Patent: March 8, 2016
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Jinyu Li, Yifan Gong, Chaojun Liu, Kaisheng Yao
  • Patent number: 9278287
    Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.
    Type: Grant
    Filed: October 20, 2014
    Date of Patent: March 8, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian
  • Publication number: 20160048736
    Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.
    Type: Application
    Filed: August 12, 2014
    Publication date: February 18, 2016
    Inventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
  • Patent number: 9251427
    Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.
    Type: Grant
    Filed: August 12, 2014
    Date of Patent: February 2, 2016
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
  • Publication number: 20150322954
    Abstract: The present invention discloses an airflow accelerating device and an electronic apparatus. The airflow accelerating device comprises: a housing, having a chamber formed therein; at least one vibrating plate, disposed within the chamber; at least one division plate, fixed in the housing, for dividing the chamber into at least two sub-chambers, each of the at least two sub-chambers having at least one air outlet configured to transmit airflow generated by vibration of the vibrating plate to outside of the chamber.
    Type: Application
    Filed: February 4, 2015
    Publication date: November 12, 2015
    Inventors: Ting Tian, Jinyu Li
  • Publication number: 20150318052
    Abstract: According to this disclosure, a shift register unit includes a pull-up control module, a pull-up module, a pull-down control module and a pull-down module, wherein the pull-up module is adapted to provide a transmission signal output terminal with a first clock signal inputted from a first clock signal input terminal according to the pull-up control signal, and provide a gate drive signal output terminal with a first direct current supply voltage according to the pull-up control signal and the first clock signal inputted from the first clock signal input terminal.
    Type: Application
    Filed: September 17, 2014
    Publication date: November 5, 2015
    Inventor: Jinyu LI
  • Publication number: 20150310858
    Abstract: Providing a framework for merging automatic speech recognition (ASR) systems having a shared deep neural network (DNN) feature transformation is provided. A received utterance may be evaluated to generate a DNN-derived feature from the top hidden layer of a DNN. The top hidden layer output may then be utilized to generate a network including a bottleneck layer and an output layer. Weights representing a feature dimension reduction may then be extracted between the top hidden layer and the bottleneck layer. Scores may then be generated and combined to merge the ASR systems which share the DNN feature transformation.
    Type: Application
    Filed: April 29, 2014
    Publication date: October 29, 2015
    Applicant: MICROSOFT CORPORATION
    Inventors: JINYU LI, JIAN XUE, YIFAN GONG
  • Patent number: 9165180
    Abstract: Systems and methods for face recognition are provided. In one example, a method for face recognition includes receiving a user image and detecting a user luminance of data representing the user's face. An adaptive low pass filter is selected that corresponds to the user luminance of the user's face. The filter is applied to the user image to create a filtered user image. The filtered user image is projected to create a filtered user image representation. A filtered reference image representation that has been filtered with the same low pass filter is selected from a reference image database. The method then determines whether the filtered reference image representation matches the filtered user image representation.
    Type: Grant
    Filed: October 12, 2012
    Date of Patent: October 20, 2015
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Eyal Krupka, Tommer Leyvand, Igor Kviatkovsky, Igor Abramovski, Tim Keosababian, Jinyu Li
  • Publication number: 20150255061
    Abstract: The adaptation and personalization of a deep neural network (DNN) model for automatic speech recognition is provided. An utterance which includes speech features for one or more speakers may be received in ASR tasks such as voice search or short message dictation. A decomposition approach may then be applied to an original matrix in the DNN model. In response to applying the decomposition approach, the original matrix may be converted into multiple new matrices which are smaller than the original matrix. A square matrix may then be added to the new matrices. Speaker-specific parameters may then be stored in the square matrix. The DNN model may then be adapted by updating the square matrix. This process may be applied to all of a number of original matrices in the DNN model. The adapted DNN model may include a reduced number of parameters than those received in the original DNN model.
    Type: Application
    Filed: March 7, 2014
    Publication date: September 10, 2015
    Applicant: MICROSOFT CORPORATION
    Inventors: Jian Xue, Jinyu Li, Dong Yu, Michael L. Seltzer, Yifan Gong
  • Publication number: 20150245536
    Abstract: A heat dissipating device located on a same base plate with a fan, and a first side of the heat dissipating device abutting an air outlet of the fan. The heat dissipating device includes a plurality of horizontal heat dissipating fins horizontally arranged in parallel with each other in a stacked manner such that arrangement of the plurality of horizontal heat dissipating fins is consistent with a direction of the air outlet of the fan, wherein two adjacent horizontal heat dissipating fins have a gap there between, through which air from the air outlet of the fan is discharged, wherein, a first horizontal heat dissipating fin located at an uppermost layer is coupled with a heat pipe and the heat pipe is coupled with at least one heating element. The heat dissipating device is capable of enhancing heat dissipating efficiency and improving the user's usage experience.
    Type: Application
    Filed: September 24, 2014
    Publication date: August 27, 2015
    Applicant: Lenovo (Beijing) Co., Ltd.
    Inventors: Jinyu Li, Ziran Li, Chunfeng Yuan, Yingfeng Ma
  • Patent number: 9070360
    Abstract: Described is a calibration model for use in a speech recognition system. The calibration model adjusts the confidence scores output by a speech recognition engine to thereby provide an improved calibrated confidence score for use by an application. The calibration model is one that has been trained for a specific usage scenario, e.g., for that application, based upon a calibration training set obtained from a previous similar/corresponding usage scenario or scenarios. Different calibration models may be used with different usage scenarios, e.g., during different conditions. The calibration model may comprise a maximum entropy classifier with distribution constraints, trained with continuous raw confidence scores and multi-valued word tokens, and/or other distributions and extracted features.
    Type: Grant
    Filed: December 10, 2009
    Date of Patent: June 30, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dong Yu, Li Deng, Jinyu Li
  • Publication number: 20150113636
    Abstract: A computing system such as a game console maintains and updates a biometric profile of a user. In one aspect, biometric data of the user is continuously obtained from a sensor such as an infrared and visible light camera, and used to update the biometric profile using a machine learning process. In another aspect, a user is prompted to confirm his or her identify when multiple users are detected at the same time and/or when the user is detected with a confidence level which is below a threshold. A real-time image of the user being identified can be displayed on a user interface with user images associated with one or more accounts. In another aspect, the biometric profile is managed by a shell on the computing system, where the shell makes the biometric profile available to any of a number of applications on the computing system.
    Type: Application
    Filed: December 23, 2014
    Publication date: April 23, 2015
    Inventors: Ronald Forbes, Bhaven Dedhia, Tim Keosababian, Tommer Leyvand, Jinyu Li, Timothy Gerken
  • Publication number: 20150038230
    Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.
    Type: Application
    Filed: October 20, 2014
    Publication date: February 5, 2015
    Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian