Patents by Inventor Sheng Zhao

Sheng Zhao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20160084659
    Abstract: Aspects of the disclosure relate to location sensing based at least on magnetic measurements within an environment. In certain aspects, the location sensing contemplates several environment and/or operational conditions of an electronic device that conducts the sensing, including soft iron variations, motion characteristic of the device, and/or the elevation of the device. In other aspects, magnetic mappings for the environment can be generated in accordance with one or more of such conditions, and accurate location sensing can be achieved based at least on such mappings and magnetic measurements at a location of the device within the environment.
    Type: Application
    Filed: September 24, 2014
    Publication date: March 24, 2016
    Inventors: XUE YANG, SHENG ZHAO, LEI YANG
  • Publication number: 20150364128
    Abstract: The technology relates to converting text to speech utilizing recurrent neural networks (RNNs). The recurrent neural networks may be implemented as multiple modules for determining properties of the text. In embodiments, a part-of-speech RNN module, letter-to-sound RNN module, a linguistic prosody tagger RNN module, and a context awareness and semantic mining RNN module may all be utilized. The properties from the RNN modules are processed by a hyper-structure RNN module that determine the phonetic properties of the input text based on the outputs of the other RNN modules. The hyper-structure RNN module may generate a generation sequence that is capable of being converting to audible speech by a speech synthesizer. The generation sequence may also be optimized by a global optimization module prior to being synthesized into audible speech.
    Type: Application
    Filed: June 13, 2014
    Publication date: December 17, 2015
    Applicant: MICROSOFT CORPORATION
    Inventors: Pei Zhao, Max Leung, Kaisheng Yao, Bo Yan, Sheng Zhao, Fileno A. Alleva
  • Publication number: 20150364127
    Abstract: The technology relates to performing letter-to-sound conversion utilizing recurrent neural networks (RNNs). The RNNs may be implemented as RNN modules for letter-to-sound conversion. The RNN modules receive text input and convert the text to corresponding phonemes. In determining the corresponding phonemes, the RNN modules may analyze the letters of the text and the letters surrounding the text being analyzed. The RNN modules may also analyze the letters of the text in reverse order. The RNN modules may also receive contextual information about the input text. The letter-to-sound conversion may then also be based on the contextual information that is received. The determined phonemes may be utilized to generate synthesized speech from the input text.
    Type: Application
    Filed: June 13, 2014
    Publication date: December 17, 2015
    Applicant: MICROSOFT CORPORATION
    Inventors: Pei Zhao, Kaisheng Yao, Max Leung, Mei-Yuh Hwang, Sheng Zhao, Bo Yan, Geoffrey Zweig, Fileno A. Alleva
  • Patent number: 9037460
    Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.
    Type: Grant
    Filed: March 28, 2012
    Date of Patent: May 19, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga
  • Publication number: 20150128829
    Abstract: A bio-modifier for asphalt is provided that comprises non-wood bio-char. In some embodiments, the bio-char comprises pyrolyzed biomass from a bio-fuel crop and/or comprises pyrolyzed grass. The asphalt modifier can improve the performance of asphalt compositions such as asphalt binder compositions and compositions comprising asphalt binder and aggregate. For example, the bio-modifier can improve the temperature susceptibility of asphalt binder compositions and increase the rutting resistance, moisture and cracking resistance of hot mix asphalt compositions. In addition, methods of preparing the bio-modifier composition, methods of preparing modified asphalts comprising the bio-modifier, and modified asphalt compositions are provided.
    Type: Application
    Filed: November 7, 2014
    Publication date: May 14, 2015
    Inventors: Baoshan Huang, Xiaofei Philip Ye, Sheng Zhao, Xiang Shu
  • Patent number: 8996377
    Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.
    Type: Grant
    Filed: July 12, 2012
    Date of Patent: March 31, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
  • Publication number: 20140358895
    Abstract: Embodiments relate to an eigenvalue-based data query. An aspect includes receiving a query request that includes a query statement. Another aspect includes calculating eigenvalues of key component elements in the query statement. Another aspect includes matching eigenvalues of nodes in an execution plan of a historical query statement to the eigenvalues of the key component elements. Yet another aspect includes based on determining success of matching the eigenvalues of the key component elements to the eigenvalues of the nodes in an execution plan of the historical query statement, generating an execution plan of the query statement.
    Type: Application
    Filed: March 4, 2014
    Publication date: December 4, 2014
    Applicant: International Business Machines Corporation
    Inventors: Jing Jing Liu, Lei Qiu, Chen Wang, Fu Fei Xu, Guang Zhou Zhang, Sheng Zhao, Zan Zhou
  • Patent number: 8812485
    Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.
    Type: Grant
    Filed: August 29, 2012
    Date of Patent: August 19, 2014
    Assignee: International Business Machines Corporation
    Inventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao
  • Publication number: 20140033786
    Abstract: A method for fabricating a metallic member includes providing a pre-forging mould. The pre-forging mould comprises an upper mould and a lower mould. The lower mould defines a pre-forging chamber, and a die cavity defined in a bottom surface of the pre-forging chamber. A metallic stock is placed above the die cavity, and the upper mould is moved toward the lower mould to forge the metallic stock, thereby forming a pre-formed body comprising a forging portion and a pre-forged base; annealing the pre-formed body; providing a forging mould to forge the pre-formed body, thereby obtaining a forged-body with a forged base thinner than that of the pre-forged base. Then the forged-body is milled to a desired size, and sandblasted, thereby obtaining the metallic member.
    Type: Application
    Filed: April 7, 2013
    Publication date: February 6, 2014
    Applicants: HON HAI PRECISION INDUSTRY CO., LTD., FU TAI HUA INDUSTRY (SHENZHEN) CO., LTD.
    Inventors: CHENG-HUNG LIN, QING-FENG HUO, YI-MING YOU, LIN-SHENG ZHAO, KE ZHOU, WEN-TAO WANG, MING ZHENG, TAO-MIN LIU
  • Publication number: 20140019134
    Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.
    Type: Application
    Filed: July 12, 2012
    Publication date: January 16, 2014
    Applicant: Microsoft Corporation
    Inventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
  • Publication number: 20130281438
    Abstract: The invention relates to a novel class of 2,4-diamino-6,7-dihydro-5H-pyrrolo[2,3]pyrimidine derivatives as a FAK and/or Pyk2 inhibitor, to a process for their preparation, and to a composition thereof, as well as to use of the compounds for the inhibiting FAK and/or Pyk2 and method for the treatment of a FAK and/or Pyk2 mediated disorder or disease.
    Type: Application
    Filed: January 7, 2012
    Publication date: October 24, 2013
    Applicant: CENTAURUS BIOPHARMA CO., LTD.
    Inventors: Dengming Xiao, Liang Cheng, Xijie Liu, Yuandong Hu, Xinhe Xu, Zhihua Liu, Lipeng Zhang, Wei Wu, Shulong Wang, Yu Shen, Gen Li, Yin Wang, Sheng Zhao, Chonglong Li, Jia Tang, Honghao Yu
  • Publication number: 20130262105
    Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.
    Type: Application
    Filed: March 28, 2012
    Publication date: October 3, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga
  • Publication number: 20130054568
    Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.
    Type: Application
    Filed: August 29, 2012
    Publication date: February 28, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao
  • Patent number: 8352270
    Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.
    Type: Grant
    Filed: June 9, 2009
    Date of Patent: January 8, 2013
    Assignee: Microsoft Corporation
    Inventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che
  • Patent number: 8332225
    Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.
    Type: Grant
    Filed: June 4, 2009
    Date of Patent: December 11, 2012
    Assignee: Microsoft Corporation
    Inventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding
  • Publication number: 20120310285
    Abstract: A spontaneous-extending and anti-rotation scoliosis correcting system comprises pedicle screws and a plurality of correcting rods locked with the pedicle screws. Each correcting rod includes at least one sleeve and at least one inserting rod which can be inserted into the sleeve. The inner wall of the sleeve and the inserting rod are the same in shape and are in clearance fit. A positioning mechanism for restricting the relative rotation of the inserting rod with respect to the sleeve is arranged on a matching surface between the inserting rod and the sleeve. The scoliosis correcting system has the benefits of ensuring the lateral stability and the anti-rotation function for scoliosis correction; having the performance of spontaneous extending along the growth direction of the spine; and ensuring both the short-term operating effect and the long-term curative effect.
    Type: Application
    Filed: January 28, 2011
    Publication date: December 6, 2012
    Inventors: Sheng Zhao, Xiaochun Wei, Kai Li
  • Patent number: 8122008
    Abstract: A method for joining tables in multiple heterogeneous distributed databases implemented by at least two data sources accessible to a federal database server over a network includes: transmitting from the federated database server a sub-command to a first of the data sources responsive to the federated database server receiving a data query; retrieving, with the federated database server, block data from the first data source related to the data query using block fetching according to the sub-command; transmitting, with the federated database server, at least a portion of the block data to a second of the data sources together with an instruction for the second data source to perform a join operation on the portion of the block data and a data table stored by the second data source related to the query; and retrieving a result of the join operation with the federated database server.
    Type: Grant
    Filed: September 23, 2009
    Date of Patent: February 21, 2012
    Assignee: International Business Machines Corporation
    Inventors: Ming Li, Hai Feng Li, Yun Feng Sun, Sheng Zhao
  • Publication number: 20100312563
    Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.
    Type: Application
    Filed: June 4, 2009
    Publication date: December 9, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding
  • Publication number: 20100312565
    Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.
    Type: Application
    Filed: June 9, 2009
    Publication date: December 9, 2010
    Applicant: Microsoft Corporation
    Inventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che
  • Patent number: 7693719
    Abstract: A method for synthesizing speech from text includes receiving one or more waveforms characteristic of a voice of a person selected by a user, generating a personalized voice font based on the one or more waveforms, and delivering the personalized voice font to the user's computer, whereby speech can be synthesized from text, the speech being in the voice of the selected person, the speech being synthesized using the personalized voice font. A system includes a text-to-speech (TTS) application operable to generate a voice font based on speech waveforms transmitted from a client computer remotely accessing the TTS application.
    Type: Grant
    Filed: October 29, 2004
    Date of Patent: April 6, 2010
    Assignee: Microsoft Corporation
    Inventors: Min Chu, Yong Zhao, Sheng Zhao