Patents by Inventor Jinyu Li
Jinyu Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9454958Abstract: Technologies pertaining to training a deep neural network (DNN) for use in a recognition system are described herein. The DNN is trained using heterogeneous data, the heterogeneous data including narrowband signals and wideband signals. The DNN, subsequent to being trained, receives an input signal that can be either a wideband signal or narrowband signal. The DNN estimates the class posterior probability of the input signal regardless of whether the input signal is the wideband signal or the narrowband signal.Type: GrantFiled: March 7, 2013Date of Patent: September 27, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Jinyu Li, Dong Yu, Yifan Gong
-
Publication number: 20160275947Abstract: Systems and methods for speech recognition incorporating environmental variables are provided. The systems and methods capture speech to be recognized. The speech is then recognized utilizing a variable component deep neural network (DNN). The variable component DNN processes the captured speech by incorporating an environment variable. The environment variable may be any variable that is dependent on environmental conditions or the relation of the user, the client device, and the environment. For example, the environment variable may be based on noise of the environment and represented as a signal-to-noise ratio. The variable component DNN may incorporate the environment variable in different ways. For instance, the environment variable may be incorporated into weighting matrices and biases of the DNN, the outputs of the hidden layers of the DNN, or the activation functions of the nodes of the DNN.Type: ApplicationFiled: September 9, 2014Publication date: September 22, 2016Applicant: Microsoft Technology Licensing, LLCInventors: Jinyu LI, Rui ZHAO, Yifan GONG
-
Publication number: 20160209893Abstract: A centrifugal fan for an electronic device is described. The centrifugal fan includes a volute casing with an outlet; and vanes accommodated in the volute casing. The volute casing has an inner surface facing the vanes, the inner surface being formed with a plurality of air guiding channels for directing air inside the volute casing in a direction towards the outlet of the volute casing.Type: ApplicationFiled: June 30, 2015Publication date: July 21, 2016Applicant: Lenovo (Beijing) Co., Ltd.Inventors: Chunfeng Yuan, Jinyu Li
-
Publication number: 20160140406Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.Type: ApplicationFiled: January 27, 2016Publication date: May 19, 2016Applicant: Microsoft Technology Licensing, LLCInventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
-
Patent number: 9336775Abstract: A high-dimensional posterior-based feature with partial distance elimination may be utilized for speech recognition. The log likelihood values of a large number of Gaussians are needed to generate the high-dimensional posterior feature. Gaussians with very small log likelihoods are associated with zero posterior values. Log likelihoods for Gaussians for a speech frame may be evaluated with a partial distance elimination method. If the partial distance of a Gaussian is already too small, the Gaussian will have a zero posterior value. The partial distance may be calculated by sequentially adding individual dimensions in a group of dimensions. The partial distance elimination occurs when less than all of the dimensions in the group are sequentially added.Type: GrantFiled: March 5, 2013Date of Patent: May 10, 2016Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Jinyu Li, Zhijie Yan, Qiang Huo, Yifan Gong
-
Patent number: 9324321Abstract: The adaptation and personalization of a deep neural network (DNN) model for automatic speech recognition is provided. An utterance which includes speech features for one or more speakers may be received in ASR tasks such as voice search or short message dictation. A decomposition approach may then be applied to an original matrix in the DNN model. In response to applying the decomposition approach, the original matrix may be converted into multiple new matrices which are smaller than the original matrix. A square matrix may then be added to the new matrices. Speaker-specific parameters may then be stored in the square matrix. The DNN model may then be adapted by updating the square matrix. This process may be applied to all of a number of original matrices in the DNN model. The adapted DNN model may include a reduced number of parameters than those received in the original DNN model.Type: GrantFiled: March 7, 2014Date of Patent: April 26, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Jian Xue, Jinyu Li, Dong Yu, Michael L. Seltzer, Yifan Gong
-
Publication number: 20160078339Abstract: Systems and methods are provided for generating a DNN classifier by “learning” a “student” DNN model from a larger more accurate “teacher” DNN model. The student DNN may be trained from un-labeled training data because its supervised signal is obtained by passing the un-labeled training data through the teacher DNN. In one embodiment, an iterative process is applied to train the student DNN by minimize the divergence of the output distributions from the teacher and student DNN models. For each iteration until convergence, the difference in the output distributions is used to update the student DNN model, and output distributions are determined again, using the unlabeled training data. The resulting trained student model may be suitable for providing accurate signal processing applications on devices having limited computational or storage resources such as mobile or wearable devices. In an embodiment, the teacher DNN model comprises an ensemble of DNN models.Type: ApplicationFiled: September 14, 2015Publication date: March 17, 2016Inventors: Jinyu Li, Rui Zhao, Jui-Ting Huang, Yifan Gong
-
Patent number: 9280969Abstract: Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.Type: GrantFiled: June 10, 2009Date of Patent: March 8, 2016Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Jinyu Li, Yifan Gong, Chaojun Liu, Kaisheng Yao
-
Patent number: 9278287Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.Type: GrantFiled: October 20, 2014Date of Patent: March 8, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian
-
Publication number: 20160048736Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.Type: ApplicationFiled: August 12, 2014Publication date: February 18, 2016Inventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
-
Patent number: 9251427Abstract: Systems and methods for identifying a false representation of a human face are provided. In one example, a method for identifying a false representation of a human face includes receiving a plurality of different data streams captured by a respective plurality of sensors of differing sensor types sensing a candidate face. In a cascading plurality of stages, one or more of the different data streams are analyzed, wherein each of the stages comprises a different analysis. In one of the cascading plurality of stages, the method determines that one or more of the different data streams corresponds to a false representation of the human face. Based on determining that one or more of the different data streams corresponds to a false representation of a human face, an indication of the false representation is outputted.Type: GrantFiled: August 12, 2014Date of Patent: February 2, 2016Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Chun-Te Chu, Michael J. Conrad, Dijia Wu, Jinyu Li
-
Publication number: 20150322954Abstract: The present invention discloses an airflow accelerating device and an electronic apparatus. The airflow accelerating device comprises: a housing, having a chamber formed therein; at least one vibrating plate, disposed within the chamber; at least one division plate, fixed in the housing, for dividing the chamber into at least two sub-chambers, each of the at least two sub-chambers having at least one air outlet configured to transmit airflow generated by vibration of the vibrating plate to outside of the chamber.Type: ApplicationFiled: February 4, 2015Publication date: November 12, 2015Inventors: Ting Tian, Jinyu Li
-
Publication number: 20150318052Abstract: According to this disclosure, a shift register unit includes a pull-up control module, a pull-up module, a pull-down control module and a pull-down module, wherein the pull-up module is adapted to provide a transmission signal output terminal with a first clock signal inputted from a first clock signal input terminal according to the pull-up control signal, and provide a gate drive signal output terminal with a first direct current supply voltage according to the pull-up control signal and the first clock signal inputted from the first clock signal input terminal.Type: ApplicationFiled: September 17, 2014Publication date: November 5, 2015Inventor: Jinyu LI
-
Publication number: 20150310858Abstract: Providing a framework for merging automatic speech recognition (ASR) systems having a shared deep neural network (DNN) feature transformation is provided. A received utterance may be evaluated to generate a DNN-derived feature from the top hidden layer of a DNN. The top hidden layer output may then be utilized to generate a network including a bottleneck layer and an output layer. Weights representing a feature dimension reduction may then be extracted between the top hidden layer and the bottleneck layer. Scores may then be generated and combined to merge the ASR systems which share the DNN feature transformation.Type: ApplicationFiled: April 29, 2014Publication date: October 29, 2015Applicant: MICROSOFT CORPORATIONInventors: JINYU LI, JIAN XUE, YIFAN GONG
-
Patent number: 9165180Abstract: Systems and methods for face recognition are provided. In one example, a method for face recognition includes receiving a user image and detecting a user luminance of data representing the user's face. An adaptive low pass filter is selected that corresponds to the user luminance of the user's face. The filter is applied to the user image to create a filtered user image. The filtered user image is projected to create a filtered user image representation. A filtered reference image representation that has been filtered with the same low pass filter is selected from a reference image database. The method then determines whether the filtered reference image representation matches the filtered user image representation.Type: GrantFiled: October 12, 2012Date of Patent: October 20, 2015Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Eyal Krupka, Tommer Leyvand, Igor Kviatkovsky, Igor Abramovski, Tim Keosababian, Jinyu Li
-
Publication number: 20150255061Abstract: The adaptation and personalization of a deep neural network (DNN) model for automatic speech recognition is provided. An utterance which includes speech features for one or more speakers may be received in ASR tasks such as voice search or short message dictation. A decomposition approach may then be applied to an original matrix in the DNN model. In response to applying the decomposition approach, the original matrix may be converted into multiple new matrices which are smaller than the original matrix. A square matrix may then be added to the new matrices. Speaker-specific parameters may then be stored in the square matrix. The DNN model may then be adapted by updating the square matrix. This process may be applied to all of a number of original matrices in the DNN model. The adapted DNN model may include a reduced number of parameters than those received in the original DNN model.Type: ApplicationFiled: March 7, 2014Publication date: September 10, 2015Applicant: MICROSOFT CORPORATIONInventors: Jian Xue, Jinyu Li, Dong Yu, Michael L. Seltzer, Yifan Gong
-
Publication number: 20150245536Abstract: A heat dissipating device located on a same base plate with a fan, and a first side of the heat dissipating device abutting an air outlet of the fan. The heat dissipating device includes a plurality of horizontal heat dissipating fins horizontally arranged in parallel with each other in a stacked manner such that arrangement of the plurality of horizontal heat dissipating fins is consistent with a direction of the air outlet of the fan, wherein two adjacent horizontal heat dissipating fins have a gap there between, through which air from the air outlet of the fan is discharged, wherein, a first horizontal heat dissipating fin located at an uppermost layer is coupled with a heat pipe and the heat pipe is coupled with at least one heating element. The heat dissipating device is capable of enhancing heat dissipating efficiency and improving the user's usage experience.Type: ApplicationFiled: September 24, 2014Publication date: August 27, 2015Applicant: Lenovo (Beijing) Co., Ltd.Inventors: Jinyu Li, Ziran Li, Chunfeng Yuan, Yingfeng Ma
-
Patent number: 9070360Abstract: Described is a calibration model for use in a speech recognition system. The calibration model adjusts the confidence scores output by a speech recognition engine to thereby provide an improved calibrated confidence score for use by an application. The calibration model is one that has been trained for a specific usage scenario, e.g., for that application, based upon a calibration training set obtained from a previous similar/corresponding usage scenario or scenarios. Different calibration models may be used with different usage scenarios, e.g., during different conditions. The calibration model may comprise a maximum entropy classifier with distribution constraints, trained with continuous raw confidence scores and multi-valued word tokens, and/or other distributions and extracted features.Type: GrantFiled: December 10, 2009Date of Patent: June 30, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Dong Yu, Li Deng, Jinyu Li
-
Publication number: 20150113636Abstract: A computing system such as a game console maintains and updates a biometric profile of a user. In one aspect, biometric data of the user is continuously obtained from a sensor such as an infrared and visible light camera, and used to update the biometric profile using a machine learning process. In another aspect, a user is prompted to confirm his or her identify when multiple users are detected at the same time and/or when the user is detected with a confidence level which is below a threshold. A real-time image of the user being identified can be displayed on a user interface with user images associated with one or more accounts. In another aspect, the biometric profile is managed by a shell on the computing system, where the shell makes the biometric profile available to any of a number of applications on the computing system.Type: ApplicationFiled: December 23, 2014Publication date: April 23, 2015Inventors: Ronald Forbes, Bhaven Dedhia, Tim Keosababian, Tommer Leyvand, Jinyu Li, Timothy Gerken
-
Publication number: 20150038230Abstract: A video game system (or other data processing system) can visually identify a person entering a field of view of the system and determine whether the person has been previously interacting with the system. In one embodiment, the system establishes thresholds, enrolls players, performs the video game (or other application) including interacting with a subset of the players based on the enrolling, determines that a person has become detectable in the field of view of the system, automatically determines whether the person is one of the enrolled players, maps the person to an enrolled player and interacts with the person based on the mapping if it is determined that the person is one of the enrolled players, and assigns a new identification to the person and interacts with the person based on the new identification if it is determined that the person is not one of the enrolled players.Type: ApplicationFiled: October 20, 2014Publication date: February 5, 2015Inventors: Tommer Leyvand, Mitchell Stephen Dernis, Jinyu Li, Yichen Wei, Jian Sun, Casey Leon Meekhof, Timothy Milton Keosababian