Patents by Inventor Zhenghao Wang
Zhenghao Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10839822Abstract: Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate out the audio signals. The spatially filtered signals are then input into a plurality of separators, so each signal is input into a corresponding signal. The separators use neural networks to separate out audio sources. The separators typically produce multiple output signals for the single input signals. A post selection processor then assesses the separator outputs to pick the signals with the highest quality output. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.Type: GrantFiled: November 6, 2017Date of Patent: November 17, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong
-
Publication number: 20190236416Abstract: In some embodiments, the disclosed subject matter involves a system and method relating to using an ambient capture device including a fisheye camera and a microphone array to capture audio and video in an environment, for use in an artificial intelligence (Al) application. The device with fisheye camera may provide approximately a 360° audio and video view, at relatively low cost. An embodiment may utilize a speech and vision fusion model component. The speech and vision fusion model may be trained using deep learning to combine features from many different sources, including available sensor data from the capture device. A long short term memory (LSTM) model may inter or identify features such as, but not limited to: audio direction; vision detection and tracking; voice signature; facial signature; gesture recognition; and object identification. The fusion processing may be performed by a cloud server, enabling the capture device to remain less complex.Type: ApplicationFiled: January 31, 2018Publication date: August 1, 2019Inventors: Zhenghao Wang, Xuedong Huang, Lijuan Qin, Kun Wu, Huaming Wang
-
Publication number: 20190139563Abstract: Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate out the audio signals. The spatially filtered signals are then input into a plurality of separators, so each signal is input into a corresponding signal. The separators use neural networks to separate out audio sources. The separators typically produce multiple output signals for the single input signals. A post selection processor then assesses the separator outputs to pick the signals with the highest quality output. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.Type: ApplicationFiled: November 6, 2017Publication date: May 9, 2019Inventors: Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong
-
Patent number: 10176168Abstract: Statistical Machine Translation (SMT) based search query spelling correction techniques are described herein. In one or more implementations, search data regarding searches performed by clients may be logged. The logged data includes query correction pairs that may be used to ascertain error patterns indicating how misspelled substrings may be translated to corrected substrings. The error patterns may be used to determine suggestions for an input query and to develop query correction models used to translate the input query to a corrected query. In one or more implementations, probabilistic features from multiple query correction models are combined to score different correction candidates. One or more top scoring correction candidates may then be exposed as suggestions for selection by a user and/or provided to a search engine to conduct a corresponding search using the corrected query version(s).Type: GrantFiled: November 15, 2011Date of Patent: January 8, 2019Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Jianfeng Gao, Mei-Yuh Hwang, Xuedong D. Huang, Christopher Brian Quirk, Zhenghao Wang
-
Patent number: 9972314Abstract: Techniques and architectures may be used to generate and perform a process using weighted finite-state transducers involving generic input search graphs. The process need not pursue theoretical optimality and instead search graphs may be optimized without an a priori optimization step. The process may result in an automatic speech recognition (ASR) decoder that is substantially faster than ASR decoders the include the optimization step.Type: GrantFiled: June 1, 2016Date of Patent: May 15, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Zhenghao Wang, Xuedong Huang, Huaming Wang
-
Publication number: 20170352348Abstract: Techniques and architectures may be used to generate and perform a process using weighted finite-state transducers involving generic input search graphs. The process need not pursue theoretical optimality and instead search graphs may be optimized without an a priori optimization step. The process may result in an automatic speech recognition (ASR) decoder that is substantially faster than ASR decoders the include the optimization step.Type: ApplicationFiled: June 1, 2016Publication date: December 7, 2017Inventors: Zhenghao Wang, Xuedong Huang, Huaming Wang
-
Patent number: 9501759Abstract: The subject disclosure is directed towards developing a translation model for mapping search query terms to document-related data. By processing user logs comprising search histories into word-aligned query-document pairs, the translation model may be trained using data, such as probabilities, corresponding to the word-aligned query-document pairs. After incorporating the translation model into model data for a search engine, the translation model is used may used as features for producing relevance scores for current search queries and ranking documents/advertisements according to relevance.Type: GrantFiled: December 16, 2011Date of Patent: November 22, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Jianfeng Gao, Xuedong Huang, Mei Li, Zhenghao Wang, Christopher John Brockett, William B. Dolan
-
Publication number: 20140222724Abstract: A log-linear model may be trained using a modified version of an original limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) algorithm. The modified version may be based on modifying the original L-BFGS algorithm using a single map-reduce implementation. In another aspect, a sparse log-linear model may be accessed. The sparse log-linear model may be trained with L1-regularization, based on data indicating past user ad selection behaviors. A probability of a user selection of an ad may be determined based on the sparse log-linear model.Type: ApplicationFiled: February 2, 2013Publication date: August 7, 2014Applicant: MICROSOFT CORPORATIONInventors: Jianfeng Gao, Xuedong Huang, Zhenghao Wang, Yunhong Zhou
-
Patent number: 8600968Abstract: Computer-readable media, computer systems, and computing methods are provided for recommending websites that are relevant to a current website to which a user has navigated. A search engine is used to track a set of websites the user has visited immediately prior to the current website, while predictive model(s) are used to generate a sequence of websites that include the current website and the tracked websites. The sequence is compared against strings of websites within a browser-history log to identify matching strings, where the matching strings include the sequence and a respective candidate website. A probability of relevance is computed from a frequency that each of the matching strings has been visited within a predefined time frame. The probability of relevance for each of the matching strings is ranked against one another to distill the highest-ranked matching strings, which are parsed to extract and present the candidate websites included therein.Type: GrantFiled: April 19, 2011Date of Patent: December 3, 2013Assignee: Microsoft CorporationInventors: Roman Holenstein, Qing Chang, Zhenghao Wang, Ravi Shahani
-
Publication number: 20130124492Abstract: Statistical Machine Translation (SMT) based search query spelling correction techniques are described herein. In one or more implementations, search data regarding searches performed by clients may be logged. The logged data includes query correction pairs that may be used to ascertain error patterns indicating how misspelled substrings may be translated to corrected substrings. The error patterns may be used to determine suggestions for an input query and to develop query correction models used to translate the input query to a corrected query. In one or more implementations, probabilistic features from multiple query correction models are combined to score different correction candidates. One or more top scoring correction candidates may then be exposed as suggestions for selection by a user and/or provided to a search engine to conduct a corresponding search using the corrected query version(s).Type: ApplicationFiled: November 15, 2011Publication date: May 16, 2013Applicant: MICROSOFT CORPORATIONInventors: Jianfeng Gao, Mei-Yuh Hwang, Xuedong D. Huang, Christopher Brian Quirk, Zhenghao Wang
-
Publication number: 20130103493Abstract: The subject disclosure is directed towards developing a translation model for mapping search query terms to document-related data. By processing user logs comprising search histories into word-aligned query-document pairs, the translation model may be trained using data, such as probabilities, corresponding to the word-aligned query-document pairs. After incorporating the translation model into model data for a search engine, the translation model is used may used as features for producing relevance scores for current search queries and ranking documents/advertisements according to relevance.Type: ApplicationFiled: December 16, 2011Publication date: April 25, 2013Applicant: MICROSOFT CORPORATIONInventors: Jianfeng Gao, Xuedong Huang, Mei Li, Zhenghao Wang, Christopher John Brockett, William B. Dolan
-
Publication number: 20120284315Abstract: Methods, computer systems and computer readable media for optimizing non-deterministic computational paths are provided. In embodiments, requests are received to generate reports derived from a plurality of series of data files whose metadata attributes form certain mathematical structures that can be used to choose the optimal path in the non-deterministic dependency model. Storage for each of the series of data files is optimized. Available data files needed for the report are processed and missing data files are identified. Based on the mathematical structure of the plurality of series of data files, an optimal transition with the missing data files available is determined. An entry into the transition is triggered and the missing data files are processed. The report is generated and the optimized storage is retained for future requests.Type: ApplicationFiled: May 4, 2011Publication date: November 8, 2012Applicant: MICROSOFT CORPORATIONInventors: ZHENGHAO WANG, SHENGQUAN YAN, AN YAN, JEFFREY ERIC LARSSON, ZIJIAN ZHENG
-
Publication number: 20120278354Abstract: Systems, methods, and computer media for efficiently processing user log data are provided. A received user log data analysis request specifies: target user log features that identify users in a target user group, analysis user log features that identify data associated with the users in the target user group, and an analysis to perform on the identified data associated with the users in the target user group. Occurrences of specified features are extracted from user logs and stored. Users associated with an occurrence of each of the extracted and stored target user log features are identified as users in the target user group. Occurrences of the analysis user log features that are associated with a user in the target user group are extracted and reformatted for the analysis specified in the analysis request.Type: ApplicationFiled: April 29, 2011Publication date: November 1, 2012Applicant: MICROSOFT CORPORATIONInventors: Shengquan Yan, Zhenghao Wang, Xiao Huang, Yu Chen, An Yan, Jeffrey Eric Larsson, Michael Kiogora Kinoti, Peng Yu, Zijian Zheng
-
Publication number: 20120271805Abstract: Computer-readable media, computer systems, and computing methods are provided for recommending websites that are relevant to a current website to which a user has navigated. A search engine is used to track a set of websites the user has visited immediately prior to the current website, while predictive model(s) are used to generate a sequence of websites that include the current website and the tracked websites. The sequence is compared against strings of websites within a browser-history log to identify matching strings, where the matching strings include the sequence and a respective candidate website. A probability of relevance is computed from a frequency that each of the matching strings has been visited within a predefined time frame. The probability of relevance for each of the matching strings is ranked against one another to distill the highest-ranked matching strings, which are parsed to extract and present the candidate websites included therein.Type: ApplicationFiled: April 19, 2011Publication date: October 25, 2012Applicant: MICROSOFT CORPORATIONInventors: Roman Holenstein, Qing (Elaine) Chang, Zhenghao Wang, Ravi Shahani
-
Publication number: 20110072740Abstract: A concrete photovoltaic structure includes an essentially vertical support member constructed, at least in part, of concrete, wherein the essentially vertical support member includes a first portion configured to be received, at least in part, within a footing. The concrete photovoltaic structure further includes a photovoltaic support member constructed, at least in part, of concrete, wherein the photovoltaic support member is configured to be affixed to a second portion of the essentially vertical support member.Type: ApplicationFiled: September 29, 2009Publication date: March 31, 2011Inventors: David B. Dieter, Zhenghao Wang
-
Patent number: 7861118Abstract: An apparatus, method and computer-readable medium provide for detecting and filtering potential race conditions. In one example, potential race conditions may be detected by determining overlap of memory accesses (e.g., read, write). Potential race conditions may be filtered as potentially benign or harmful race conditions using, for example, heuristics, comparison of output states from execution of instructions or operations in differing sequences or identifying of associated output bugs corresponding to the execution of operations in different sequences.Type: GrantFiled: March 30, 2007Date of Patent: December 28, 2010Assignee: Microsoft CorporationInventors: Andrew James Edwards, James Jordan Tigani, Zhenghao Wang, Bradley Calder, Satish Narayanasamy
-
Publication number: 20100062347Abstract: A rechargeable zinc cell with a longitudinally-folded separator comprising a zinc negative electrode, a positive electrode, an electrolyte and a separator. The separator comprises at least two wicking layers with a microporous layer in the center thereof, and the separator is folded longitudinally to wrap around a long edge of the zinc negative electrode. A method of constructing a rechargeable zinc cell with a longitudinally-folded separator comprising the steps of placing the zinc negative electrode in contact with at least one of the two wicking layers of the separator, folding the separator longitudinally around a long edge of the zinc negative electrode, placing the positive electrode on said separator and rolling the zinc negative electrode, the positive electrode and the separator into a jelly roll structure.Type: ApplicationFiled: September 9, 2008Publication date: March 11, 2010Inventors: Lin-Feng Li, Fuyuan Ma, Zhenghao Wang
-
Publication number: 20090228871Abstract: Methods, systems, and computer-readable media having computer-executable instructions embodied thereon that, when executed, perform methods in accordance with embodiments hereof, for emulating behavior of a target program to identify defects therein. Emulation includes recording a trace file upon running machine-level instructions of the target program, collecting relevant events encountered upon replaying the trace file, expressing the relevant events as symbolic constraints, and solving the symbolic constraints for variant input parameters. Injecting the variant input parameters into the trace file causes the target program to follow various control paths, allowing for systematically searching the target program for defects. The procedure for security testing above can be repeated by employing a search-strategy algorithm that selects optimal control paths of the target program to evaluate.Type: ApplicationFiled: March 10, 2008Publication date: September 10, 2009Applicant: MICROSOFT CORPORATIONInventors: ANDREW EDWARDS, MICHAEL Y. LEVIN, JORDAN TIGANI, ZHENGHAO WANG, DENNIS JEFFREY
-
Publication number: 20080244332Abstract: An apparatus, method and computer-readable medium provide for detecting and filtering potential race conditions. In one example, potential race conditions may be detected by determining overlap of memory accesses (e.g., read, write). Potential race conditions may be filtered as potentially benign or harmful race conditions using, for example, heuristics, comparison of output states from execution of instructions or operations in differing sequences or identifying of associated output bugs corresponding to the execution of operations in different sequences.Type: ApplicationFiled: March 30, 2007Publication date: October 2, 2008Applicant: Microsoft CorporationInventors: Andrew James Edwards, James Jordan Tigani, Zhenghao Wang, Bradley Calder, Satish Naravanasamy
-
Publication number: 20070250820Abstract: An execution of a software program can be analyzed to detect various conditions, such as software defects relating to pointers and the like. Analysis can include modeling software constructs such as heaps, calls, memory, threads, and the like. Additional information, such as call stacks, can be provided to assist in debugging. A graphical depiction of pointer history can be presented and used to navigate throughout the execution history of a program.Type: ApplicationFiled: April 20, 2006Publication date: October 25, 2007Applicant: Microsoft CorporationInventors: Andrew Edwards, J. Tigani, Zhenghao Wang