Patents by Inventor XIONG XIAO

XIONG XIAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

EXPANDABLE AND CONTRACTIBLE STRUCTURE FOR MASSAGE DEVICE AND MASSAGE DEVICE

Publication number: 20250177240

Abstract: The present invention relates to an expandable and contractible structure for a massage device and a massage device, comprising a geared motor, a transmission mechanism, an expandable and contractible mechanism, and a casing. The expandable and contractible mechanism comprises a fan blade fixture, an expansion driver, and deformable fan blades. The transmission mechanism is disposed at the power output end of the geared motor, the expansion driver is sleeved on the transmission mechanism, and the geared motor is accommodated in the casing, which drives the expansion driver and facilitates the opening of the fan blades. When the motor reverses, the expansion driver resets, causing the blades to contract under elastic stress. This design ensures a smooth, jam-free operation and greatly enhances the user experience with its distinctive expandable and contractible functionality.

Type: Application

Filed: January 12, 2024

Publication date: June 5, 2025

Inventors: ZHI-PING TU, XIONG XIAO
TRAINING AND USING A TRANSCRIPT GENERATION MODEL ON A MULTI-SPEAKER AUDIO STREAM

Publication number: 20240257815

Abstract: The disclosure herein describes using a transcript generation model for generating a transcript from a multi-speaker audio stream. Audio data including overlapping speech of a plurality of speakers is obtained and a set of frame embeddings are generated from audio data frames of obtained audio data using an audio data encoder. A set of words and channel change (CC) symbols are generated from the set of frame embeddings using a transcript generation model. The CC symbols are included between pairs of adjacent words that are spoken by different people at the same time. The set of words and CC symbols are transformed into a plurality of transcript lines, wherein words of the set of words are sorted into transcript lines based on CC symbols, and a multi-speaker transcript is generated based on the plurality of transcript lines. The inclusion of CC symbols by the model enables efficient, accurate multi-speaker transcription.

Type: Application

Filed: April 10, 2024

Publication date: August 1, 2024

Inventors: Naoyuki KANDA, Takuya YOSHIOKA, Zhuo CHEN, Jinyu LI, Yashesh GAUR, Zhong MENG, Xiaofei WANG, Xiong XIAO
CLOUD NETWORK, MEASUREMENT SYSTEM FOR CLOUD NETWORK , AND METHOD, DEVICE AND STORAGE MEDIUM

Publication number: 20240235972

Abstract: A cloud network, a measurement system for the cloud network, and a method, a device and a storage medium. For a cloud network, by means of automatically perceiving a measurement intent of a tenant in the cloud network, generating a measurement rule that complies with the measurement intent, and delivering, on the basis of the measurement rule and by means of bypass packet injection, a measurement request message into network element devices on a path to be measured, a network quality analysis is performed with the help of measurement record information generated when the measurement request message passes through different network element devices.

Type: Application

Filed: May 5, 2022

Publication date: July 11, 2024

Applicant: HANGZHOU ALICLOUD FEITIAN INFORMATION TECHNOLOGY CO., LTD.

Inventors: Shunmin ZHU, Biao LYU, Jianyuan LU, Daxiang KANG, Xiong XIAO, Lei WANG
Training and using a transcript generation model on a multi-speaker audio stream

Patent number: 11984127

Abstract: The disclosure herein describes using a transcript generation model for generating a transcript from a multi-speaker audio stream. Audio data including overlapping speech of a plurality of speakers is obtained and a set of frame embeddings are generated from audio data frames of the obtained audio data using an audio data encoder. A set of words and channel change (CC) symbols are generated from the set of frame embeddings using a transcript generation model. The CC symbols are included between pairs of adjacent words that are spoken by different people at the same time. The set of words and CC symbols are transformed into a plurality of transcript lines, wherein words of the set of words are sorted into transcript lines based on the CC symbols, and a multi-speaker transcript is generated based on the plurality of transcript lines. The inclusion of CC symbols by the model enables efficient, accurate multi-speaker transcription.

Type: Grant

Filed: December 31, 2021

Date of Patent: May 14, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Naoyuki Kanda, Takuya Yoshioka, Zhuo Chen, Jinyu Li, Yashesh Gaur, Zhong Meng, Xiaofei Wang, Xiong Xiao
COMPUTERIZED INTELLIGENT ASSISTANT FOR CONFERENCES

Publication number: 20230402038

Abstract: A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.

Type: Application

Filed: May 15, 2023

Publication date: December 14, 2023

Inventors: Adi DIAMANT, Xuedong HUANG, Karen MASTER BEN-DOR, Eyal KRUPKA, Raz HALALY, Yoni SMOLIN, Ilya GURVICH, Aviv HURVITZ, Lijuan QIN, Wei XIONG, Shixiong ZHANG, Lingfeng WU, Xiong XIAO, Ido LEICHTER, Moshe DAVID, Amit Kumar AGARWAL
Cloud platform-based comprehensive line network monitoring method and system

Patent number: 11753056

Abstract: Provided are a cloud platform-based comprehensive line network monitoring method and system. Device state data of each line is acquired, and stored in a monitoring platform and a data platform that are built on a cloud platform. The monitoring platform can only store only the latest device state data, perform data monitoring processing on the device state data to obtain real-time monitoring data, and transmit corresponding real-time monitoring data to a client at an application layer based on a data subscription demand of the client, to allow the client to perform real-time device monitoring processing based on the real-time monitoring data. Also, the data platform can provide, in response to a historical data application request sent by the client, the device state data to the client as historical statistical analysis data, to allow the client to perform historical state monitoring processing based on the historical statistical analysis data.

Type: Grant

Filed: March 16, 2023

Date of Patent: September 12, 2023

Assignees: PCI TECHNOLOGY & SERVICE CO., LTD., GUANGHOU HUAJIA SOFTWARE CO., LTD., GUANGDONG HUAZHIYUAN TECHNOLOGY CO., LTD., GUANGZHOU PCI URBAN RAIL TRANSIT SMART OPERATION & MAINTENANCE SERVICE CO., LTD.

Inventors: Xiong Xiao, Jianping Jia, Zhaohui Chen
CLOUD PLATFORM-BASED COMPREHENSIVE LINE NETWORK MONITORING METHOD AND SYSTEM

Publication number: 20230219605

Abstract: Provided are a cloud platform-based comprehensive line network monitoring method and system. Device state data of each line is acquired, and stored in a monitoring platform and a data platform that are built on a cloud platform. The monitoring platform can only store only the latest device state data, perform data monitoring processing on the device state data to obtain real-time monitoring data, and transmit corresponding real-time monitoring data to a client at an application layer based on a data subscription demand of the client, to allow the client to perform real-time device monitoring processing based on the real-time monitoring data. Also, the data platform can provide, in response to a historical data application request sent by the client, the device state data to the client as historical statistical analysis data, to allow the client to perform historical state monitoring processing based on the historical statistical analysis data.

Type: Application

Filed: March 16, 2023

Publication date: July 13, 2023

Inventors: Xiong XIAO, Jianping JIA, Zhaohui CHEN
TRAINING AND USING A TRANSCRIPT GENERATION MODEL ON A MULTI-SPEAKER AUDIO STREAM

Publication number: 20230215439

Abstract: The disclosure herein describes using a transcript generation model for generating a transcript from a multi-speaker audio stream. Audio data including overlapping speech of a plurality of speakers is obtained and a set of frame embeddings are generated from audio data frames of the obtained audio data using an audio data encoder. A set of words and channel change (CC) symbols are generated from the set of frame embeddings using a transcript generation model. The CC symbols are included between pairs of adjacent words that are spoken by different people at the same time. The set of words and CC symbols are transformed into a plurality of transcript lines, wherein words of the set of words are sorted into transcript lines based on the CC symbols, and a multi-speaker transcript is generated based on the plurality of transcript lines. The inclusion of CC symbols by the model enables efficient, accurate multi-speaker transcription.

Type: Application

Filed: December 31, 2021

Publication date: July 6, 2023

Inventors: Naoyuki KANDA, Takuya YOSHIOKA, Zhuo CHEN, Jinyu LI, Yashesh GAUR, Zhong MENG, Xiaofei WANG, Xiong XIAO
Computerized intelligent assistant for conferences

Patent number: 11688399

Abstract: A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.

Type: Grant

Filed: December 8, 2020

Date of Patent: June 27, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Adi Diamant, Karen Master Ben-Dor, Eyal Krupka, Raz Halaly, Yoni Smolin, Ilya Gurvich, Aviv Hurvitz, Lijuan Qin, Wei Xiong, Shixiong Zhang, Lingfeng Wu, Xiong Xiao, Ido Leichter, Moshe David, Xuedong Huang, Amit Kumar Agarwal
Low-latency speech separation

Patent number: 11445295

Abstract: A system and method include reception of a first plurality of audio signals, generation of a second plurality of beamformed audio signals based on the first plurality of audio signals, each of the second plurality of beamformed audio signals associated with a respective one of a second plurality of beamformer directions, generation of a first TF mask for a first output channel based on the first plurality of audio signals, determination of a first beamformer direction associated with a first target sound source based on the first TF mask, generation of first features based on the first beamformer direction and the first plurality of audio signals, determination of a second TF mask based on the first features, and application of the second TF mask to one of the second plurality of beamformed audio signals associated with the first beamformer direction.

Type: Grant

Filed: November 17, 2020

Date of Patent: September 13, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Zhuo Chen, Changliang Liu, Takuya Yoshioka, Xiong Xiao, Hakan Erdogan, Dimitrios Basile Dimitriadis
Multi-stage sedimentation rake-free thickening device

Patent number: 11319215

Abstract: Disclosed is a multi-stage sedimentation rake-free thickening device. The device includes a central tank. A diversion sedimentation zone is arranged on the outside of the center tank. The diversion sedimentation zone includes an annular diversion sedimentation screen and a concentrated magnetic shower. The annular diversion sedimentation screen includes an annular groove spirally arranged around a central groove body. The annular groove is sequentially arranged with second spoiler baffles along the length direction. The lower bottom plate of the annular groove is also provided with second underflow discharge port. Multiple second inclined plate diversion discharge pipe is arranged under the corresponding second underflow discharge ports. The outlets of all the second inclined plate guide discharge pipes are collected to the second underflow discharge pipe, and the settled water is discharged from the second overflow discharge pipe arranged at the end of the annular groove.

Type: Grant

Filed: September 29, 2020

Date of Patent: May 3, 2022

Inventors: Chao Wang, Erning Zhao, Mengmeng Wang, Biao Hu, Chengliang Qiu, Xueqing Jiang, Xiong Xiao, Chengpeng Duan, Yang Li, Jiaqiang Zhou, Jin Zhang, Yu Zhang
Speaker recognition/location using neural network

Patent number: 11222640

Abstract: Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives an audio signal of utterances spoken by multiple persons. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location and speaker identification neural network. The neural network utilizes both the magnitude and phase information features to determine a change in the person speaking. Output comprising the determination of the change is received from the neural network. The output is then used to perform a speaker recognition function, speaker location function, or both.

Type: Grant

Filed: February 27, 2020

Date of Patent: January 11, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shixiong Zhang, Xiong Xiao
Voice identification enrollment

Patent number: 11152006

Abstract: Examples are disclosed that relate to voice identification enrollment. One example provides a method of voice identification enrollment comprising, during a meeting in which two or more human speakers speak at different times, determining whether one or more conditions of a protocol for sampling meeting audio used to establish human speaker voiceprints are satisfied, and in response to determining that the one or more conditions are satisfied, selecting a sample of meeting audio according to the protocol, the sample representing an utterance made by one of the human speakers. The method further comprises establishing, based at least on the sample, a voiceprint of the human speaker.

Type: Grant

Filed: June 27, 2018

Date of Patent: October 19, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Eyal Krupka, Shixiong Zhang, Xiong Xiao
Computerized Intelligent Assistant for Conferences

Publication number: 20210210097

Abstract: A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.

Type: Application

Filed: December 8, 2020

Publication date: July 8, 2021

Inventors: Adi DIAMANT, Karen MASTER BEN-DOR, Eyal KRUPKA, Raz HALALY, Yoni SMOLIN, Ilya GURVICH, Aviv HURVITZ, Lijuan QIN, Wei XIONG, Shixiong ZHANG, Lingfeng WU, Xiong XIAO, Ido LEICHTER, Moshe DAVID, Xuedong HUANG, Amit Kumar AGARWAL
MULTI-STAGE SEDIMENTATION RAKE-FREE THICKENING DEVICE

Publication number: 20210179444

Abstract: Disclosed is a multi-stage sedimentation rake-free thickening device. The device includes a central tank. A diversion sedimentation zone is arranged on the outside of the center tank. The diversion sedimentation zone includes an annular diversion sedimentation screen and a concentrated magnetic shower. The annular diversion sedimentation screen includes an annular groove spirally arranged around a central groove body. The annular groove is sequentially arranged with second spoiler baffles along the length direction. The lower bottom plate of the annular groove is also provided with second underflow discharge port. Multiple second inclined plate diversion discharge pipe is arranged under the corresponding second underflow discharge ports. The outlets of all the second inclined plate guide discharge pipes are collected to the second underflow discharge pipe, and the settled water is discharged from the second overflow discharge pipe arranged at the end of the annular groove.

Type: Application

Filed: September 29, 2020

Publication date: June 17, 2021

Inventors: Chao WANG, Erning ZHAO, Mengmeng WANG, Biao HU, Chengliang QIU, Xueqing JIANG, Xiong XIAO, Chengpeng DUAN, Yang LI, Jiaqiang ZHOU, Jin ZHANG, Yu ZHANG
Reverse flow type multi-stage sedimentation rake-free thickening device

Patent number: 10994228

Abstract: Disclosed is a reverse flow multi-stage sedimentation rake-free thickening device relating to the field of slime water treatment. The device includes a feed assembly, a guide assembly, and a clean coal collection assembly. The guide assembly also includes a central tank body and coal slurry flows from the upper part of the central tank body to the inner side wall of the central tank body through the feed assembly and the medicament, and then flows to the middle of the central tank body through the guide assembly. After the reaction, the bubbles carry the fine coal slime and move up to the clean coal collection assembly. The clean coal collection assembly is located above the outlet of the guide assembly, and the clean coal collection assembly is sequentially provided with a central collection area, a defoaming area, and a diversion settlement area from the middle to the outside.

Type: Grant

Filed: September 24, 2020

Date of Patent: May 4, 2021

Inventors: Chao Wang, Erning Zhao, Biao Hu, Chengliang Qiu, Mengmeng Wang, Xueqing Jiang, Xiong Xiao, Xinchun Liu, Yang Feng, Jiaqiang Zhou, Jin Zhang, Yu Zhang
Multi-microphone speech separation

Patent number: 10957337

Abstract: This document relates to separation of audio signals into speaker-specific signals. One example obtains features reflecting mixed speech signals captured by multiple microphones. The features can be input a neural network and masks can be obtained from the neural network. The masks can be applied one or more of the mixed speech signals captured by one or more of the microphones to obtain two or more separate speaker-specific speech signals, which can then be output.

Type: Grant

Filed: May 29, 2018

Date of Patent: March 23, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Zhuo Chen, Hakan Erdogan, Takuya Yoshioka, Fileno A. Alleva, Xiong Xiao
LOW-LATENCY SPEECH SEPARATION

Publication number: 20210076129

Abstract: A system and method include reception of a first plurality of audio signals, generation of a second plurality of beamformed audio signals based on the first plurality of audio signals, each of the second plurality of beamformed audio signals associated with a respective one of a second plurality of beamformer directions, generation of a first TF mask for a first output channel based on the first plurality of audio signals, determination of a first beamformer direction associated with a first target sound source based on the first TF mask, generation of first features based on the first beamformer direction and the first plurality of audio signals, determination of a second TF mask based on the first features, and application of the second TF mask to one of the second plurality of beamformed audio signals associated with the first beamformer direction.

Type: Application

Filed: November 17, 2020

Publication date: March 11, 2021

Inventors: Zhuo CHEN, Changliang LIU, Takuya YOSHIOKA, Xiong XIAO, Hakan ERDOGAN, Dimitrios Basile DIMITRIADIS
REVERSE FLOW TYPE MULTI-STAGE SEDIMENTATION RAKE-FREE THICKENING DEVICE

Publication number: 20210008467

Abstract: Disclosed is a reverse flow multi-stage sedimentation rake-free thickening device relating to the field of slime water treatment. The device includes a feed assembly, a guide assembly, and a clean coal collection assembly. The guide assembly also includes a central tank body and coal slurry flows from the upper part of the central tank body to the inner side wall of the central tank body through the feed assembly and the medicament, and then flows to the middle of the central tank body through the guide assembly. After the reaction, the bubbles carry the fine coal slime and move up to the clean coal collection assembly. The clean coal collection assembly is located above the outlet of the guide assembly, and the clean coal collection assembly is sequentially provided with a central collection area, a defoaming area, and a diversion settlement area from the middle to the outside.

Type: Application

Filed: September 24, 2020

Publication date: January 14, 2021

Inventors: Chao WANG, Erning ZHAO, Biao HU, Chengliang QIU, Mengmeng WANG, Xueqing JIANG, Xiong XIAO, Xinchun LIU, Yang FENG, Jiaqiang ZHOU, Jin ZHANG, Yu ZHANG
Computerized intelligent assistant for conferences

Patent number: 10867610

Abstract: A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.

Type: Grant

Filed: June 29, 2018

Date of Patent: December 15, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Adi Diamant, Karen Master Ben-Dor, Eyal Krupka, Raz Halaly, Yoni Smolin, Ilya Gurvich, Aviv Hurvitz, Lijuan Qin, Wei Xiong, Shixiong Zhang, Lingfeng Wu, Xiong Xiao, Ido Leichter, Moshe David, Xuedong Huang, Amit Kumar Agarwal

1 2 3 next