Patents by Inventor Sheng Zhao
Sheng Zhao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20160084659Abstract: Aspects of the disclosure relate to location sensing based at least on magnetic measurements within an environment. In certain aspects, the location sensing contemplates several environment and/or operational conditions of an electronic device that conducts the sensing, including soft iron variations, motion characteristic of the device, and/or the elevation of the device. In other aspects, magnetic mappings for the environment can be generated in accordance with one or more of such conditions, and accurate location sensing can be achieved based at least on such mappings and magnetic measurements at a location of the device within the environment.Type: ApplicationFiled: September 24, 2014Publication date: March 24, 2016Inventors: XUE YANG, SHENG ZHAO, LEI YANG
-
Publication number: 20150364128Abstract: The technology relates to converting text to speech utilizing recurrent neural networks (RNNs). The recurrent neural networks may be implemented as multiple modules for determining properties of the text. In embodiments, a part-of-speech RNN module, letter-to-sound RNN module, a linguistic prosody tagger RNN module, and a context awareness and semantic mining RNN module may all be utilized. The properties from the RNN modules are processed by a hyper-structure RNN module that determine the phonetic properties of the input text based on the outputs of the other RNN modules. The hyper-structure RNN module may generate a generation sequence that is capable of being converting to audible speech by a speech synthesizer. The generation sequence may also be optimized by a global optimization module prior to being synthesized into audible speech.Type: ApplicationFiled: June 13, 2014Publication date: December 17, 2015Applicant: MICROSOFT CORPORATIONInventors: Pei Zhao, Max Leung, Kaisheng Yao, Bo Yan, Sheng Zhao, Fileno A. Alleva
-
Publication number: 20150364127Abstract: The technology relates to performing letter-to-sound conversion utilizing recurrent neural networks (RNNs). The RNNs may be implemented as RNN modules for letter-to-sound conversion. The RNN modules receive text input and convert the text to corresponding phonemes. In determining the corresponding phonemes, the RNN modules may analyze the letters of the text and the letters surrounding the text being analyzed. The RNN modules may also analyze the letters of the text in reverse order. The RNN modules may also receive contextual information about the input text. The letter-to-sound conversion may then also be based on the contextual information that is received. The determined phonemes may be utilized to generate synthesized speech from the input text.Type: ApplicationFiled: June 13, 2014Publication date: December 17, 2015Applicant: MICROSOFT CORPORATIONInventors: Pei Zhao, Kaisheng Yao, Max Leung, Mei-Yuh Hwang, Sheng Zhao, Bo Yan, Geoffrey Zweig, Fileno A. Alleva
-
Patent number: 9037460Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.Type: GrantFiled: March 28, 2012Date of Patent: May 19, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga
-
Publication number: 20150128829Abstract: A bio-modifier for asphalt is provided that comprises non-wood bio-char. In some embodiments, the bio-char comprises pyrolyzed biomass from a bio-fuel crop and/or comprises pyrolyzed grass. The asphalt modifier can improve the performance of asphalt compositions such as asphalt binder compositions and compositions comprising asphalt binder and aggregate. For example, the bio-modifier can improve the temperature susceptibility of asphalt binder compositions and increase the rutting resistance, moisture and cracking resistance of hot mix asphalt compositions. In addition, methods of preparing the bio-modifier composition, methods of preparing modified asphalts comprising the bio-modifier, and modified asphalt compositions are provided.Type: ApplicationFiled: November 7, 2014Publication date: May 14, 2015Inventors: Baoshan Huang, Xiaofei Philip Ye, Sheng Zhao, Xiang Shu
-
Patent number: 8996377Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.Type: GrantFiled: July 12, 2012Date of Patent: March 31, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
-
Publication number: 20140358895Abstract: Embodiments relate to an eigenvalue-based data query. An aspect includes receiving a query request that includes a query statement. Another aspect includes calculating eigenvalues of key component elements in the query statement. Another aspect includes matching eigenvalues of nodes in an execution plan of a historical query statement to the eigenvalues of the key component elements. Yet another aspect includes based on determining success of matching the eigenvalues of the key component elements to the eigenvalues of the nodes in an execution plan of the historical query statement, generating an execution plan of the query statement.Type: ApplicationFiled: March 4, 2014Publication date: December 4, 2014Applicant: International Business Machines CorporationInventors: Jing Jing Liu, Lei Qiu, Chen Wang, Fu Fei Xu, Guang Zhou Zhang, Sheng Zhao, Zan Zhou
-
Patent number: 8812485Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.Type: GrantFiled: August 29, 2012Date of Patent: August 19, 2014Assignee: International Business Machines CorporationInventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao
-
Publication number: 20140033786Abstract: A method for fabricating a metallic member includes providing a pre-forging mould. The pre-forging mould comprises an upper mould and a lower mould. The lower mould defines a pre-forging chamber, and a die cavity defined in a bottom surface of the pre-forging chamber. A metallic stock is placed above the die cavity, and the upper mould is moved toward the lower mould to forge the metallic stock, thereby forming a pre-formed body comprising a forging portion and a pre-forged base; annealing the pre-formed body; providing a forging mould to forge the pre-formed body, thereby obtaining a forged-body with a forged base thinner than that of the pre-forged base. Then the forged-body is milled to a desired size, and sandblasted, thereby obtaining the metallic member.Type: ApplicationFiled: April 7, 2013Publication date: February 6, 2014Applicants: HON HAI PRECISION INDUSTRY CO., LTD., FU TAI HUA INDUSTRY (SHENZHEN) CO., LTD.Inventors: CHENG-HUNG LIN, QING-FENG HUO, YI-MING YOU, LIN-SHENG ZHAO, KE ZHOU, WEN-TAO WANG, MING ZHENG, TAO-MIN LIU
-
Publication number: 20140019134Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.Type: ApplicationFiled: July 12, 2012Publication date: January 16, 2014Applicant: Microsoft CorporationInventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
-
Publication number: 20130281438Abstract: The invention relates to a novel class of 2,4-diamino-6,7-dihydro-5H-pyrrolo[2,3]pyrimidine derivatives as a FAK and/or Pyk2 inhibitor, to a process for their preparation, and to a composition thereof, as well as to use of the compounds for the inhibiting FAK and/or Pyk2 and method for the treatment of a FAK and/or Pyk2 mediated disorder or disease.Type: ApplicationFiled: January 7, 2012Publication date: October 24, 2013Applicant: CENTAURUS BIOPHARMA CO., LTD.Inventors: Dengming Xiao, Liang Cheng, Xijie Liu, Yuandong Hu, Xinhe Xu, Zhihua Liu, Lipeng Zhang, Wei Wu, Shulong Wang, Yu Shen, Gen Li, Yin Wang, Sheng Zhao, Chonglong Li, Jia Tang, Honghao Yu
-
Publication number: 20130262105Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.Type: ApplicationFiled: March 28, 2012Publication date: October 3, 2013Applicant: MICROSOFT CORPORATIONInventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga
-
Publication number: 20130054568Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.Type: ApplicationFiled: August 29, 2012Publication date: February 28, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao
-
Patent number: 8352270Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.Type: GrantFiled: June 9, 2009Date of Patent: January 8, 2013Assignee: Microsoft CorporationInventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che
-
Patent number: 8332225Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.Type: GrantFiled: June 4, 2009Date of Patent: December 11, 2012Assignee: Microsoft CorporationInventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding
-
Publication number: 20120310285Abstract: A spontaneous-extending and anti-rotation scoliosis correcting system comprises pedicle screws and a plurality of correcting rods locked with the pedicle screws. Each correcting rod includes at least one sleeve and at least one inserting rod which can be inserted into the sleeve. The inner wall of the sleeve and the inserting rod are the same in shape and are in clearance fit. A positioning mechanism for restricting the relative rotation of the inserting rod with respect to the sleeve is arranged on a matching surface between the inserting rod and the sleeve. The scoliosis correcting system has the benefits of ensuring the lateral stability and the anti-rotation function for scoliosis correction; having the performance of spontaneous extending along the growth direction of the spine; and ensuring both the short-term operating effect and the long-term curative effect.Type: ApplicationFiled: January 28, 2011Publication date: December 6, 2012Inventors: Sheng Zhao, Xiaochun Wei, Kai Li
-
Patent number: 8122008Abstract: A method for joining tables in multiple heterogeneous distributed databases implemented by at least two data sources accessible to a federal database server over a network includes: transmitting from the federated database server a sub-command to a first of the data sources responsive to the federated database server receiving a data query; retrieving, with the federated database server, block data from the first data source related to the data query using block fetching according to the sub-command; transmitting, with the federated database server, at least a portion of the block data to a second of the data sources together with an instruction for the second data source to perform a join operation on the portion of the block data and a data table stored by the second data source related to the query; and retrieving a result of the join operation with the federated database server.Type: GrantFiled: September 23, 2009Date of Patent: February 21, 2012Assignee: International Business Machines CorporationInventors: Ming Li, Hai Feng Li, Yun Feng Sun, Sheng Zhao
-
Publication number: 20100312563Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.Type: ApplicationFiled: June 4, 2009Publication date: December 9, 2010Applicant: MICROSOFT CORPORATIONInventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding
-
Publication number: 20100312565Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.Type: ApplicationFiled: June 9, 2009Publication date: December 9, 2010Applicant: Microsoft CorporationInventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che
-
Patent number: 7693719Abstract: A method for synthesizing speech from text includes receiving one or more waveforms characteristic of a voice of a person selected by a user, generating a personalized voice font based on the one or more waveforms, and delivering the personalized voice font to the user's computer, whereby speech can be synthesized from text, the speech being in the voice of the selected person, the speech being synthesized using the personalized voice font. A system includes a text-to-speech (TTS) application operable to generate a voice font based on speech waveforms transmitted from a client computer remotely accessing the TTS application.Type: GrantFiled: October 29, 2004Date of Patent: April 6, 2010Assignee: Microsoft CorporationInventors: Min Chu, Yong Zhao, Sheng Zhao