Patents by Inventor Sheng Zhao

Sheng Zhao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11600261
    Abstract: Systems are configured for generating spectrogram data characterized by a voice timbre of a target speaker and a prosody style of source speaker by converting a waveform of source speaker data to phonetic posterior gram (PPG) data, extracting additional prosody features from the source speaker data, and generating a spectrogram based on the PPG data and the extracted prosody features. The systems are configured to utilize/train a machine learning model for generating spectrogram data and for training a neural text-to-speech model with the generated spectrogram data.
    Type: Grant
    Filed: May 27, 2022
    Date of Patent: March 7, 2023
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Shifeng Pan, Lei He, Yulin Li, Sheng Zhao, Chunling Ma
  • Publication number: 20220415314
    Abstract: Novel solutions for speech recognition provide contextual spelling correction (CSC) for automatic speech recognition (ASR). Disclosed examples include receiving an audio stream; performing an ASR process on the audio stream to produce an ASR hypothesis; receiving a context list; and, based on at least the ASR hypothesis and the context list, performing spelling correction to produce an output text sequence. A contextual spelling correction (CSC) model is used on top of an ASR model, precluding the need for changing the original ASR model. This permits run-time user customization based on contextual data, even for large-size context lists. Some examples include filtering ASR hypotheses for the audio stream and, based on at least the ASR hypotheses filtering, determining whether to trigger spelling correction for the ASR hypothesis. Some examples include generating text to speech (TTS) audio using preprocessed transcriptions with context phrases to train the CSC model.
    Type: Application
    Filed: August 31, 2022
    Publication date: December 29, 2022
    Inventors: Xiaoqiang WANG, Yanqing LIU, Sheng ZHAO, Jinyu LI
  • Publication number: 20220366153
    Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving an input document and parsing the input document to generate inputs for a natural language generation model using a text analysis model. The natural language generation model generates one or more candidate presentation scripts based on the inputs. A presentation script is selected from the candidate presentation scripts and displayed. A text-to-speech model may be used to generate a synthesized audio presentation of the presentation script. A final presentation may be generated that includes a visual display of the input document and the corresponding audio presentation in sync with the visual display.
    Type: Application
    Filed: May 12, 2021
    Publication date: November 17, 2022
    Inventors: Ji LI, Konstantin SELESKEROV, Huey-Ru TSAI, Muin Barkatali MOMIN, Ramya TRIDANDAPANI, Sindhu Vigasini JAMBUNATHAN, Amit SRIVASTAVA, Derek Martin JOHNSON, Gencheng WU, Sheng ZHAO, Xinfeng CHEN, Bohan LI
  • Publication number: 20220350019
    Abstract: A radar system and a terminal device are provided. The radar system includes a controller and at least two radar modules directly or indirectly connected to the controller. The at least two radar modules include a first radar module and a second radar module, and the first radar module and the second radar module implement time division multiplexing of the controller in a digital domain. Compared with an existing radar system, the radar system in this application can provide more transmit channels, more receive channels, and a larger antenna array size when the two radar systems include a same quantity of controllers.
    Type: Application
    Filed: December 27, 2021
    Publication date: November 3, 2022
    Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
    Inventors: Baopeng Wang, Wei Jiang, Sheng Zhao, Zhenjun Ren
  • Publication number: 20220310058
    Abstract: Systems are configured for generating text-to-speech data in a personalized voice by training a neural text-to-speech machine learning model on natural speech data collected from a particular user, validating the identity of the user from which data is collected, and authorizing requests from users to use the personalized voice in generating new speech data. The systems are further configured to train a machine learning model as a neural text-to-speech model with generated personalized speech data.
    Type: Application
    Filed: November 3, 2020
    Publication date: September 29, 2022
    Inventors: Sheng ZHAO, Li JIANG, Xuedong HUANG, Lijuan QIN, Lei HE, Binggong DING, Bo YAN, Chunling MA, Raunak OBEROI
  • Publication number: 20220293091
    Abstract: Systems are configured for generating spectrogram data characterized by a voice timbre of a target speaker and a prosody style of source speaker by converting a waveform of source speaker data to phonetic posterior gram (PPG) data, extracting additional prosody features from the source speaker data, and generating a spectrogram based on the PPG data and the extracted prosody features. The systems are configured to utilize/train a machine learning model for generating spectrogram data and for training a neural text-to-speech model with the generated spectrogram data.
    Type: Application
    Filed: May 27, 2022
    Publication date: September 15, 2022
    Inventors: Shifeng PAN, Lei HE, Yulin LI, Sheng ZHAO, Chunling MA
  • Publication number: 20220235253
    Abstract: The invention provides a gravity heat pipe having a working fluid selected from the group consisting of HFO-1234ze(Z), HFO-1234ze(E), HFO-1336mzz(Z), HFO-1336mzz(E), HFO-1224yd(Z), HFO-1233zd(E), and a mixture thereof. The heat pipes of the invention are environmentally friendly, have good cooling performance and low manufacturing costs, and are suitable for cooling of communication base stations, servers, or data centers.
    Type: Application
    Filed: December 31, 2019
    Publication date: July 28, 2022
    Inventors: Hengdao QUAN, Zhikai GUO, Sheng ZHAO, Hongsheng OUYANG, Huie YANG, Huafeng SUN, Gang YANG, Xia LUO
  • Patent number: 11361753
    Abstract: Systems are configured for generating spectrogram data characterized by a voice timbre of a target speaker and a prosody style of source speaker by converting a waveform of source speaker data to phonetic posterior gram (PPG) data, extracting additional prosody features from the source speaker data, and generating a spectrogram based on the PPG data and the extracted prosody features. The systems are configured to utilize/train a machine learning model for generating spectrogram data and for training a neural text-to-speech model with the generated spectrogram data.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: June 14, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Shifeng Pan, Lei He, Yulin Li, Sheng Zhao, Chunling Ma
  • Publication number: 20220068259
    Abstract: Systems are configured for generating spectrogram data characterized by a voice timbre of a target speaker and a prosody style of source speaker by converting a waveform of source speaker data to phonetic posterior gram (PPG) data, extracting additional prosody features from the source speaker data, and generating a spectrogram based on the PPG data and the extracted prosody features. The systems are configured to utilize/train a machine learning model for generating spectrogram data and for training a neural text-to-speech model with the generated spectrogram data.
    Type: Application
    Filed: September 24, 2020
    Publication date: March 3, 2022
    Inventors: Shifeng PAN, Lei HE, Yulin LI, Sheng ZHAO, Chunling MA
  • Publication number: 20210208283
    Abstract: An improved, efficient method for mapping world points from an environment (e.g., points generated by a LIDAR sensor of an autonomous vehicle) to locations (e.g., pixels) within rolling-shutter images taken of the environment is provided. This improved method allows for accurate localization of the world point in a rolling-shutter image via an iterative process that converges in very few iterations. The method poses the localization process as an iterative process for determining the time, within the rolling-shutter exposure period of the image, at which the world point was imaged by the camera. The method reduces the number of times the world point is projected into the normalized space of the camera image, often converging in three or fewer iterations.
    Type: Application
    Filed: November 25, 2020
    Publication date: July 8, 2021
    Inventors: Sheng Zhao, Nicholas Lloyd Armstrong-Crews, Volker Grabe
  • Patent number: 11055287
    Abstract: Embodiments relate to an eigenvalue-based data query. An aspect includes receiving a query request that includes a query statement. Another aspect includes calculating eigenvalues of key component elements in the query statement. Another aspect includes matching eigenvalues of nodes in an execution plan of a historical query statement to the eigenvalues of the key component elements. Yet another aspect includes based on determining success of matching the eigenvalues of the key component elements to the eigenvalues of the nodes in an execution plan of the historical query statement, generating an execution plan of the query statement.
    Type: Grant
    Filed: September 17, 2018
    Date of Patent: July 6, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Jing Jing Liu, Lei Qiu, Chen Wang, Fu Fei Xu, Guang Zhou Zhang, Sheng Zhao, Zan Zhou
  • Patent number: 11017610
    Abstract: An electronic device detects and recovers from fault conditions while tracking its motion and building a map of its environment. A front-end motion tracking module detects fault conditions while tracking motion over time and provides mapping data to a back-end mapping module. The front-end motion tracking module indicates to the back-end mapping module when a fault condition has been detected and when the fault condition is no longer detected. The back-end mapping module generates maps from the mapping data and merges the maps to build a three-dimensional representation of the environment. The back-end mapping module buffers or discards any mapping data received from the front-end motion tracking module during the existence of a fault condition. The back-end mapping module merges the maps generated based on mapping data received before and after the fault condition and adds the merged maps to the three-dimensional representation of the environment.
    Type: Grant
    Filed: May 17, 2017
    Date of Patent: May 25, 2021
    Assignee: GOOGLE LLC
    Inventors: Esha Nerurkar, Sheng Zhao
  • Publication number: 20200331083
    Abstract: The present invention relates to a double one-track electro-discharge wire cutting method, comprising a step of forward wire running and a step of reverse wire running wherein a wire running direction of the step of forward wire running is opposite to that of the step of reverse wire running; with wire running in the step of forward wire running, one processing element is completed by discharge cutting processing, and with wire running in the step of reverse wire running, another processing element is completed by discharge cutting processing. The method can improve the processing efficiency and recycle the electrode wire while ensuring the processing precision.
    Type: Application
    Filed: January 9, 2018
    Publication date: October 22, 2020
    Inventors: Ci wen He, Jin Sheng Zhao
  • Patent number: 10802147
    Abstract: An electronic device tracks its motion in an environment while building a three-dimensional visual representation of the environment that is used to correct drift in the tracked motion. A motion tracking module estimates poses of the electronic device based on feature descriptors corresponding to the visual appearance of spatial features of objects in the environment. A mapping module builds a three-dimensional visual representation of the environment based on a stored plurality of maps, and feature descriptors and estimated device poses received from the motion tracking module. The mapping module provides the three-dimensional visual representation of the environment to a localization module, which identifies correspondences between stored and observed feature descriptors. The localization module performs a loop closure by minimizing the discrepancies between matching feature descriptors to compute a localized pose.
    Type: Grant
    Filed: May 15, 2017
    Date of Patent: October 13, 2020
    Assignee: GOOGLE LLC
    Inventors: Esha Nerurkar, Simon Lynen, Sheng Zhao
  • Publication number: 20200278449
    Abstract: An electronic device tracks its motion in an environment while building a three-dimensional visual representation of the environment that is used to correct drift in the tracked motion. A motion tracking module estimates poses of the electronic device based on feature descriptors corresponding to the visual appearance of spatial features of objects in the environment. A mapping module builds a three-dimensional visual representation of the environment based on a stored plurality of maps, and feature descriptors and estimated device poses received from the motion tracking module. The mapping module provides the three-dimensional visual representation of the environment to a localization module, which identifies correspondences between stored and observed feature descriptors. The localization module performs a loop closure by minimizing the discrepancies between matching feature descriptors to compute a localized pose.
    Type: Application
    Filed: May 15, 2020
    Publication date: September 3, 2020
    Inventors: Esha Nerurkar, Simon Lynen, Sheng Zhao
  • Patent number: 10761216
    Abstract: Various embodiments each include systems, methods, devices, or software for integer ambiguity resolution approach over a time window of GNSS/IMU data. One purpose of processing a window of data is to enhance the reliability of obtaining high-accuracy position estimation, using carrier-phase measurements, even in challenging environments.
    Type: Grant
    Filed: August 17, 2016
    Date of Patent: September 1, 2020
    Assignee: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
    Inventors: Yiming Chen, Sheng Zhao, Jay A. Farrell
  • Patent number: 10521482
    Abstract: Methods, systems, and computer programs are presented for recommending new connections based on profile similarity and existing interconnections within a social network. One method includes an operation for detecting a request for new connections for a member of the social network, where the profile of the member includes values for certain attributes. Additionally, the method includes operations for identifying members that have at least one equal attribute to the attributes of the member, and for calculating a connection score for each identified member based on the respective values of the identified members attributes. Members are selected from the identified members based on the connection scores, and a ranking score for each selected member is obtained utilizing a machine learning algorithm that utilizes similarity analysis of the attributes to calculate the ranking score. The selected members are presented to the member as the possible new connections based on the ranking scores.
    Type: Grant
    Filed: April 24, 2017
    Date of Patent: December 31, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Liang Zhang, Lin Zhu, Di Wang, Sheng Zhao, Yang Liu, Shu Chen
  • Publication number: 20190358721
    Abstract: The present invention relates to a double one-track electro-discharge wire cutting method, comprising a step of forward wire running and a step of reverse wire running wherein a wire running direction of the step of forward wire running is opposite to that of the step of reverse wire running; with wire running in the step of forward wire running, one processing element is completed by discharge cutting processing, and with wire running in the step of reverse wire running, another processing element is completed by discharge cutting processing. The method can improve the processing efficiency and recycle the electrode wire while ensuring the processing precision.
    Type: Application
    Filed: September 1, 2018
    Publication date: November 28, 2019
    Inventors: Ci wen He, Jin Sheng Zhao
  • Publication number: 20190236106
    Abstract: Methods, systems, and computer programs are presented for recommending new connections based on profile similarity and existing interconnections within a social network. One method includes an operation for detecting a request for new connections for a member of the social network, where the profile of the member includes values for certain attributes. Additionally, the method includes operations for identifying members that have at least one equal attribute to the attributes of the member, and for calculating a connection score for each identified member based on the respective values of the identified members attributes. Members are selected from the identified members based on the connection scores, and a ranking score for each selected member is obtained utilizing a machine learning algorithm that utilizes similarity analysis of the attributes to calculate the ranking score. The selected members are presented to the member as the possible new connections based on the ranking scores.
    Type: Application
    Filed: April 24, 2017
    Publication date: August 1, 2019
    Inventors: Liang Zhang, Lin Zhu, Di Wang, Sheng Zhao, Yang Liu, Shu Chen
  • Publication number: 20190018879
    Abstract: Embodiments relate to an eigenvalue-based data query. An aspect includes receiving a query request that includes a query statement. Another aspect includes calculating eigenvalues of key component elements in the query statement. Another aspect includes matching eigenvalues of nodes in an execution plan of a historical query statement to the eigenvalues of the key component elements. Yet another aspect includes based on determining success of matching the eigenvalues of the key component elements to the eigenvalues of the nodes in an execution plan of the historical query statement, generating an execution plan of the query statement.
    Type: Application
    Filed: September 17, 2018
    Publication date: January 17, 2019
    Inventors: Jing Jing Liu, Lei Qiu, Chen Wang, Fu Fei Xu, Guang Zhou Zhang, Sheng Zhao, Zan Zhou