Patents by Inventor Sheng Zhao

Sheng Zhao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MAGNETIC FIELD MAPPING AND INDOOR LOCATION SENSING

Publication number: 20160084659

Abstract: Aspects of the disclosure relate to location sensing based at least on magnetic measurements within an environment. In certain aspects, the location sensing contemplates several environment and/or operational conditions of an electronic device that conducts the sensing, including soft iron variations, motion characteristic of the device, and/or the elevation of the device. In other aspects, magnetic mappings for the environment can be generated in accordance with one or more of such conditions, and accurate location sensing can be achieved based at least on such mappings and magnetic measurements at a location of the device within the environment.

Type: Application

Filed: September 24, 2014

Publication date: March 24, 2016

Inventors: XUE YANG, SHENG ZHAO, LEI YANG

HYPER-STRUCTURE RECURRENT NEURAL NETWORKS FOR TEXT-TO-SPEECH

Publication number: 20150364128

Abstract: The technology relates to converting text to speech utilizing recurrent neural networks (RNNs). The recurrent neural networks may be implemented as multiple modules for determining properties of the text. In embodiments, a part-of-speech RNN module, a letter-to-sound RNN module, a linguistic prosody tagger RNN module, and a context awareness and semantic mining RNN module may all be utilized. The properties from the RNN modules are processed by a hyper-structure RNN module that determines the phonetic properties of the input text based on the outputs of the other RNN modules. The hyper-structure RNN module may generate a generation sequence that is capable of being converted to audible speech by a speech synthesizer. The generation sequence may also be optimized by a global optimization module prior to being synthesized into audible speech.

Type: Application

Filed: June 13, 2014

Publication date: December 17, 2015

Applicant: MICROSOFT CORPORATION

Inventors: Pei Zhao, Max Leung, Kaisheng Yao, Bo Yan, Sheng Zhao, Fileno A. Alleva

ADVANCED RECURRENT NEURAL NETWORK BASED LETTER-TO-SOUND

Publication number: 20150364127

Abstract: The technology relates to performing letter-to-sound conversion utilizing recurrent neural networks (RNNs). The RNNs may be implemented as RNN modules for letter-to-sound conversion. The RNN modules receive text input and convert the text to corresponding phonemes. In determining the corresponding phonemes, the RNN modules may analyze the letters of the text and the letters surrounding the text being analyzed. The RNN modules may also analyze the letters of the text in reverse order. The RNN modules may also receive contextual information about the input text. The letter-to-sound conversion may then also be based on the contextual information that is received. The determined phonemes may be utilized to generate synthesized speech from the input text.

Type: Application

Filed: June 13, 2014

Publication date: December 17, 2015

Applicant: MICROSOFT CORPORATION

Inventors: Pei Zhao, Kaisheng Yao, Max Leung, Mei-Yuh Hwang, Sheng Zhao, Bo Yan, Geoffrey Zweig, Fileno A. Alleva

Dynamic long-distance dependency with conditional random fields

Patent number: 9037460

Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.

Type: Grant

Filed: March 28, 2012

Date of Patent: May 19, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga
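
The dynamic features named in the abstract above (the number of units since the previous special label, and the sum of a basic feature over that segment) can be illustrated with a minimal sketch. This is not the patented method: the function name `dynamic_features`, the label `"BREAK"`, and the choice to count the current unit as part of the segment are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def dynamic_features(labels: Sequence[str],
                     basic: Sequence[float],
                     special: str = "BREAK") -> List[Tuple[int, float]]:
    """For each unit, return (units since the previous special label,
    sum of the basic feature over that segment).

    Assumption: the segment includes the current unit, and a unit that
    carries the special label closes its segment.
    """
    feats = []
    count, total = 0, 0.0
    for label, value in zip(labels, basic):
        count += 1
        total += value
        feats.append((count, total))
        if label == special:
            # The next unit starts a new segment.
            count, total = 0, 0.0
    return feats


# Hypothetical example: one label per word, basic feature = syllable count.
labels = ["O", "O", "BREAK", "O", "BREAK"]
syllables = [1, 2, 1, 3, 1]
print(dynamic_features(labels, syllables))
```

During decoding the earlier labels are hypotheses rather than known values, which is why the abstract describes expanding the Viterbi lattice so that nodes with different distances are kept apart.
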
DEVELOPMENT OF A RENEWABLE CARBON-BASED BIO-MODIFIER FOR ASPHALT CEMENT

Publication number: 20150128829

Abstract: A bio-modifier for asphalt is provided that comprises non-wood bio-char. In some embodiments, the bio-char comprises pyrolyzed biomass from a bio-fuel crop and/or comprises pyrolyzed grass. The asphalt modifier can improve the performance of asphalt compositions such as asphalt binder compositions and compositions comprising asphalt binder and aggregate. For example, the bio-modifier can improve the temperature susceptibility of asphalt binder compositions and increase the rutting resistance, moisture and cracking resistance of hot mix asphalt compositions. In addition, methods of preparing the bio-modifier composition, methods of preparing modified asphalts comprising the bio-modifier, and modified asphalt compositions are provided.

Type: Application

Filed: November 7, 2014

Publication date: May 14, 2015

Inventors: Baoshan Huang, Xiaofei Philip Ye, Sheng Zhao, Xiang Shu

Blending recorded speech with text-to-speech output for specific domains

Patent number: 8996377

Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.

Type: Grant

Filed: July 12, 2012

Date of Patent: March 31, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
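
The phrase-selection step described in the abstract above can be sketched roughly as follows. The sketch only splits input text into spans served by a recording and spans left for the synthesizer; it does not attempt the blending or prosody matching. The `RECORDINGS` table, the file names, and the function `plan_segments` are hypothetical.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical inventory: domain -> static phrase -> recording file name.
RECORDINGS: Dict[str, Dict[str, str]] = {
    "navigation": {
        "turn left": "nav_turn_left.wav",
        "turn right": "nav_turn_right.wav",
    },
}


def plan_segments(text: str, domain: str) -> List[Tuple[str, str]]:
    """Split text into ("recorded", file) and ("tts", text) segments."""
    phrases = RECORDINGS.get(domain, {})
    if not phrases:
        return [("tts", text)]
    # Longest phrases first so overlapping entries prefer the longer match.
    alternatives = "|".join(
        re.escape(p) for p in sorted(phrases, key=len, reverse=True))
    pattern = re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)
    segments: List[Tuple[str, str]] = []
    pos = 0
    for match in pattern.finditer(text):
        if match.start() > pos:
            segments.append(("tts", text[pos:match.start()].strip()))
        segments.append(("recorded", phrases[match.group(0).lower()]))
        pos = match.end()
    if pos < len(text):
        segments.append(("tts", text[pos:].strip()))
    # Drop spans that were only whitespace.
    return [seg for seg in segments if seg[1]]


print(plan_segments("In 200 meters turn left onto Main Street", "navigation"))
```
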
EIGENVALUE-BASED DATA QUERY

Publication number: 20140358895

Abstract: Embodiments relate to an eigenvalue-based data query. An aspect includes receiving a query request that includes a query statement. Another aspect includes calculating eigenvalues of key component elements in the query statement. Another aspect includes matching eigenvalues of nodes in an execution plan of a historical query statement to the eigenvalues of the key component elements. Yet another aspect includes generating an execution plan of the query statement based on determining that the eigenvalues of the key component elements were successfully matched to the eigenvalues of the nodes in the execution plan of the historical query statement.

Type: Application

Filed: March 4, 2014

Publication date: December 4, 2014

Applicant: International Business Machines Corporation

Inventors: Jing Jing Liu, Lei Qiu, Chen Wang, Fu Fei Xu, Guang Zhou Zhang, Sheng Zhao, Zan Zhou

Database query

Patent number: 8812485

Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained, and execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, the execution of the minimum query total time query plan is continued to output the remaining query results.

Type: Grant

Filed: August 29, 2012

Date of Patent: August 19, 2014

Assignee: International Business Machines Corporation

Inventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao
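
The two-plan idea in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented mechanism: both plans are assumed to return the same rows in the same order, the "specified point" is modeled as a fixed row count (`handoff_rows`), and the names `dual_plan_query`, `response_plan`, and `total_plan` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice
from typing import Callable, Iterable, Iterator


def dual_plan_query(response_plan: Callable[[], Iterable[int]],
                    total_plan: Callable[[], Iterable[int]],
                    handoff_rows: int) -> Iterator[int]:
    """Stream early rows from the fast-response plan, then switch to the
    plan with the lower total time for the remaining rows."""
    with ThreadPoolExecutor(max_workers=1) as pool:
        # Start the minimum-total-time plan in the background.
        background = pool.submit(lambda: list(total_plan()))
        # Meanwhile, output initial rows from the minimum-response-time plan.
        emitted = 0
        for row in islice(response_plan(), handoff_rows):
            emitted += 1
            yield row
        # Hand-off: skip the rows already output and emit the rest.
        yield from background.result()[emitted:]


data = [5, 3, 9, 1, 7]


def response_plan() -> Iterator[int]:
    # Stand-in for a plan that delivers its first rows quickly.
    return iter(sorted(data))


def total_plan() -> list:
    # Stand-in for a plan that finishes the whole query sooner.
    return sorted(data)


print(list(dual_plan_query(response_plan, total_plan, handoff_rows=2)))
# prints [1, 3, 5, 7, 9]
```
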
FABRICATING METHOD FOR FABRICATING METALLIC MEMBER

Publication number: 20140033786

Abstract: A method for fabricating a metallic member includes providing a pre-forging mould. The pre-forging mould comprises an upper mould and a lower mould. The lower mould defines a pre-forging chamber, and a die cavity is defined in a bottom surface of the pre-forging chamber. A metallic stock is placed above the die cavity, and the upper mould is moved toward the lower mould to forge the metallic stock, thereby forming a pre-formed body comprising a forging portion and a pre-forged base. The pre-formed body is annealed, and a forging mould is provided to forge the pre-formed body, thereby obtaining a forged body with a forged base thinner than the pre-forged base. The forged body is then milled to a desired size and sandblasted, thereby obtaining the metallic member.

Type: Application

Filed: April 7, 2013

Publication date: February 6, 2014

Applicants: HON HAI PRECISION INDUSTRY CO., LTD., FU TAI HUA INDUSTRY (SHENZHEN) CO., LTD.

Inventors: CHENG-HUNG LIN, QING-FENG HUO, YI-MING YOU, LIN-SHENG ZHAO, KE ZHOU, WEN-TAO WANG, MING ZHENG, TAO-MIN LIU

BLENDING RECORDED SPEECH WITH TEXT-TO-SPEECH OUTPUT FOR SPECIFIC DOMAINS

Publication number: 20140019134

Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.

Type: Application

Filed: July 12, 2012

Publication date: January 16, 2014

Applicant: Microsoft Corporation

Inventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung

2,4-DIAMINO-6,7-DIHYDRO-5H-PYRROLO[2,3]PYRIMIDINE DERIVATIVES AS FAK/Pyk2 INHIBITORS

Publication number: 20130281438

Abstract: The invention relates to a novel class of 2,4-diamino-6,7-dihydro-5H-pyrrolo[2,3]pyrimidine derivatives as FAK and/or Pyk2 inhibitors, to a process for their preparation, and to a composition thereof, as well as to use of the compounds for inhibiting FAK and/or Pyk2 and to a method for the treatment of a FAK and/or Pyk2 mediated disorder or disease.

Type: Application

Filed: January 7, 2012

Publication date: October 24, 2013

Applicant: CENTAURUS BIOPHARMA CO., LTD.

Inventors: Dengming Xiao, Liang Cheng, Xijie Liu, Yuandong Hu, Xinhe Xu, Zhihua Liu, Lipeng Zhang, Wei Wu, Shulong Wang, Yu Shen, Gen Li, Yin Wang, Sheng Zhao, Chonglong Li, Jia Tang, Honghao Yu

DYNAMIC LONG-DISTANCE DEPENDENCY WITH CONDITIONAL RANDOM FIELDS

Publication number: 20130262105

Abstract: Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.

Type: Application

Filed: March 28, 2012

Publication date: October 3, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Jian Luan, Linfang Wang, Hairong Xia, Sheng Zhao, Daniela Braga

Database Query

Publication number: 20130054568

Abstract: Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained, and execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, the execution of the minimum query total time query plan is continued to output the remaining query results.

Type: Application

Filed: August 29, 2012

Publication date: February 28, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Qi Chen, Shang Shun Lei, Yun Feng Sun, Guang Zhou Zhang, Sheng Zhao

Interactive TTS optimization tool

Patent number: 8352270

Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.

Type: Grant

Filed: June 9, 2009

Date of Patent: January 8, 2013

Assignee: Microsoft Corporation

Inventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che

Techniques to create a custom voice font

Patent number: 8332225

Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.

Type: Grant

Filed: June 4, 2009

Date of Patent: December 11, 2012

Assignee: Microsoft Corporation

Inventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding

AUTOMATIC-EXTENDING AND ANTI-ROTATION SCOLIOSIS CORRECTING SYSTEM

Publication number: 20120310285

Abstract: A spontaneous-extending and anti-rotation scoliosis correcting system comprises pedicle screws and a plurality of correcting rods locked with the pedicle screws. Each correcting rod includes at least one sleeve and at least one inserting rod which can be inserted into the sleeve. The inner wall of the sleeve and the inserting rod are the same in shape and are in clearance fit. A positioning mechanism for restricting the relative rotation of the inserting rod with respect to the sleeve is arranged on a matching surface between the inserting rod and the sleeve. The scoliosis correcting system has the benefits of ensuring the lateral stability and the anti-rotation function for scoliosis correction; having the performance of spontaneous extending along the growth direction of the spine; and ensuring both the short-term operating effect and the long-term curative effect.

Type: Application

Filed: January 28, 2011

Publication date: December 6, 2012

Inventors: Sheng Zhao, Xiaochun Wei, Kai Li

Joining tables in multiple heterogeneous distributed databases

Patent number: 8122008

Abstract: A method for joining tables in multiple heterogeneous distributed databases implemented by at least two data sources accessible to a federated database server over a network includes: transmitting from the federated database server a sub-command to a first of the data sources responsive to the federated database server receiving a data query; retrieving, with the federated database server, block data from the first data source related to the data query using block fetching according to the sub-command; transmitting, with the federated database server, at least a portion of the block data to a second of the data sources together with an instruction for the second data source to perform a join operation on the portion of the block data and a data table stored by the second data source related to the query; and retrieving a result of the join operation with the federated database server.

Type: Grant

Filed: September 23, 2009

Date of Patent: February 21, 2012

Assignee: International Business Machines Corporation

Inventors: Ming Li, Hai Feng Li, Yun Feng Sun, Sheng Zhao
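
The flow in the abstract above (block-fetch from the first source, ship each block to the second source with a join instruction, collect the result) can be sketched with in-memory stand-ins. Everything here is hypothetical: the `DataSource` class, its methods, and the assumption that the join key is the first column of both tables. Real data sources would be reached over a network with SQL sub-commands.

```python
from typing import Dict, Iterator, List


class DataSource:
    """Hypothetical stand-in for a remote database behind the federated server."""

    def __init__(self, tables: Dict[str, List[tuple]]):
        self.tables = tables

    def fetch_blocks(self, table: str, block_size: int) -> Iterator[List[tuple]]:
        # Stands in for the sub-command plus block fetching.
        rows = self.tables[table]
        for start in range(0, len(rows), block_size):
            yield rows[start:start + block_size]

    def join_with_block(self, block: List[tuple], table: str) -> List[tuple]:
        # The join runs at this source, against its locally stored table.
        index: Dict[object, List[tuple]] = {}
        for row in self.tables[table]:
            index.setdefault(row[0], []).append(row)
        return [b + t for b in block for t in index.get(b[0], [])]


def federated_join(first: DataSource, first_table: str,
                   second: DataSource, second_table: str,
                   block_size: int = 2) -> List[tuple]:
    """Federated-server side: fetch blocks, ship them, gather join results."""
    result: List[tuple] = []
    for block in first.fetch_blocks(first_table, block_size):
        result.extend(second.join_with_block(block, second_table))
    return result


orders = DataSource({"orders": [(1, "pen"), (2, "ink"), (3, "pad")]})
customers = DataSource({"customers": [(1, "Ann"), (3, "Bo")]})
print(federated_join(orders, "orders", customers, "customers"))
# prints [(1, 'pen', 1, 'Ann'), (3, 'pad', 3, 'Bo')]
```
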
TECHNIQUES TO CREATE A CUSTOM VOICE FONT

Publication number: 20100312563

Abstract: Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.

Type: Application

Filed: June 4, 2009

Publication date: December 9, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Sheng Zhao, Zhi Li, Shenghao Qin, Chiwei Che, Jingyang Xu, Binggong Ding

INTERACTIVE TTS OPTIMIZATION TOOL

Publication number: 20100312565

Abstract: An interactive prompt generation and TTS optimization tool with a user-friendly graphical user interface is provided. The tool accepts HTS abstraction or speech recognition processed input from a user to generate an enhanced initial waveform for synthesis. Acoustic features of the waveform are presented to the user with graphical visualizations enabling the user to modify various parameters of the speech synthesis process and listen to modified versions until an acceptable end product is reached.

Type: Application

Filed: June 9, 2009

Publication date: December 9, 2010

Applicant: Microsoft Corporation

Inventors: Jian-Chao Wang, Lu-Jun Yuan, Sheng Zhao, Fileno A. Alleva, Jingyang Xu, Chiwei Che

Providing personalized voice font for text-to-speech applications

Patent number: 7693719

Abstract: A method for synthesizing speech from text includes receiving one or more waveforms characteristic of a voice of a person selected by a user, generating a personalized voice font based on the one or more waveforms, and delivering the personalized voice font to the user's computer, whereby speech can be synthesized from text, the speech being in the voice of the selected person, the speech being synthesized using the personalized voice font. A system includes a text-to-speech (TTS) application operable to generate a voice font based on speech waveforms transmitted from a client computer remotely accessing the TTS application.

Type: Grant

Filed: October 29, 2004

Date of Patent: April 6, 2010

Assignee: Microsoft Corporation

Inventors: Min Chu, Yong Zhao, Sheng Zhao
