Patents by Inventor XIONG XIAO

XIONG XIAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200335119
    Abstract: Embodiments are associated with determination of a first plurality of multi-dimensional vectors, each of the first plurality of multi-dimensional vectors representing speech of a target speaker, determination of a multi-dimensional vector representing a speech signal of two or more speakers, determination of a weighted vector representing speech of the target speaker based on the first plurality of multi-dimensional vectors and on similarities between the multi-dimensional vector and each of the first plurality of multi-dimensional vectors, and extraction of speech of the target speaker from the speech signal based on the weighted vector and the speech signal.
    Type: Application
    Filed: June 7, 2019
    Publication date: October 22, 2020
    Inventors: Xiong XIAO, Zhuo CHEN, Takuya YOSHIOKA, Changliang LIU, Hakan ERDOGAN, Dimitrios Basile DIMITRIADIS, Yifan GONG, James Garnet Droppo, III
  • Publication number: 20200322722
    Abstract: A system and method include reception of a first plurality of audio signals, generation of a second plurality of beamformed audio signals based on the first plurality of audio signals, each of the second plurality of beamformed audio signals associated with a respective one of a second plurality of beamformer directions, generation of a first TF mask for a first output channel based on the first plurality of audio signals, determination of a first beamformer direction associated with a first target sound source based on the first TF mask, generation of first features based on the first beamformer direction and the first plurality of audio signals, determination of a second TF mask based on the first features, and application of the second TF mask to one of the second plurality of beamformed audio signals associated with the first beamformer direction.
    Type: Application
    Filed: April 5, 2019
    Publication date: October 8, 2020
    Inventors: Zhuo CHEN, Changliang LIU, Takuya YOSHIOKA, Xiong XIAO, Hakan ERDOGAN, Dimitrios Basile DIMITRIADIS
  • Publication number: 20200202867
    Abstract: Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives an audio signal of utterances spoken by multiple persons. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location and speaker identification neural network. The neural network utilizes both the magnitude and phase information features to determine a change in the person speaking. Output comprising the determination of the change is received from the neural network. The output is then used to perform a speaker recognition function, speaker location function, or both.
    Type: Application
    Filed: February 27, 2020
    Publication date: June 25, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Shixiong ZHANG, Xiong XIAO
  • Patent number: 10580414
    Abstract: Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives a multi-channel audio signal of an utterance spoken by a user. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location/speaker identification neural network that is trained via utterances from a plurality of persons. A user embedding comprising speaker identification characteristics and location characteristics is received from the neural network and compared to a plurality of enrollment embeddings extracted from the plurality of utterances that are each associated with an identity of a corresponding person. Based at least on the comparisons, the user is matched to an identity of one of the persons, and the identity of the person is outputted.
    Type: Grant
    Filed: June 12, 2018
    Date of Patent: March 3, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Shixiong Zhang, Xiong Xiao
  • Publication number: 20190341055
    Abstract: Examples are disclosed that relate to voice identification enrollment. One example provides a method of voice identification enrollment comprising, during a meeting in which two or more human speakers speak at different times, determining whether one or more conditions of a protocol for sampling meeting audio used to establish human speaker voiceprints are satisfied, and in response to determining that the one or more conditions are satisfied, selecting a sample of meeting audio according to the protocol, the sample representing an utterance made by one of the human speakers. The method further comprises establishing, based at least on the sample, a voiceprint of the human speaker.
    Type: Application
    Filed: June 27, 2018
    Publication date: November 7, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Eyal KRUPKA, Shixiong ZHANG, Xiong XIAO
  • Publication number: 20190341053
    Abstract: A computerized conference assistant includes a camera and a microphone. A face location machine of the computerized conference assistant finds a physical location of a human, based on a position of a candidate face in digital video captured by the camera. A beamforming machine of the computerized conference assistant outputs a beamformed signal isolating sounds originating from the physical location of the human. A diarization machine of the computerized conference assistant attributes information encoded in the beamformed signal to the human.
    Type: Application
    Filed: June 26, 2018
    Publication date: November 7, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Shixiong ZHANG, Lingfeng WU, Eyal KRUPKA, Xiong XIAO, Yifan GONG
  • Publication number: 20190341057
    Abstract: Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives a multi-channel audio signal of an utterance spoken by a user. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location/speaker identification neural network that is trained via utterances from a plurality of persons. A user embedding comprising speaker identification characteristics and location characteristics is received from the neural network and compared to a plurality of enrollment embeddings extracted from the plurality of utterances that are each associated with an identity of a corresponding person. Based at least on the comparisons, the user is matched to an identity of one of the persons, and the identity of the person is outputted.
    Type: Application
    Filed: June 12, 2018
    Publication date: November 7, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Shixiong ZHANG, Xiong XIAO
  • Publication number: 20190341050
    Abstract: A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.
    Type: Application
    Filed: June 29, 2018
    Publication date: November 7, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Adi DIAMANT, Karen MASTER BEN-DOR, Eyal KRUPKA, Raz HALALY, Yoni SMOLIN, Ilya GURVICH, Aviv HURVITZ, Lijuan QIN, Wei XIONG, Shixiong ZHANG, Lingfeng WU, Xiong XIAO, Ido LEICHTER, Moshe DAVID, Xuedong HUANG, Amit Kumar AGARWAL
  • Publication number: 20190341054
    Abstract: Multi-modal speech localization is achieved using image data captured by one or more cameras, and audio data captured by a microphone array. Audio data captured by each microphone of the array is transformed to obtain a frequency domain representation that is discretized in a plurality of frequency intervals. Image data captured by each camera is used to determine a positioning of each human face. Input data is provided to a previously-trained, audio source localization classifier, including: the frequency domain representation of the audio data captured by each microphone, and the positioning of each human face captured by each camera in which the positioning of each human face represents a candidate audio source. Based on the input data, the classifier indicates an identified audio source that is estimated to be the human face from which the audio data originated.
    Type: Application
    Filed: June 27, 2018
    Publication date: November 7, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Eyal KRUPKA, Xiong XIAO
  • Publication number: 20190318757
    Abstract: This document relates to separation of audio signals into speaker-specific signals. One example obtains features reflecting mixed speech signals captured by multiple microphones. The features can be input to a neural network and masks can be obtained from the neural network. The masks can be applied to one or more of the mixed speech signals captured by one or more of the microphones to obtain two or more separate speaker-specific speech signals, which can then be output.
    Type: Application
    Filed: May 29, 2018
    Publication date: October 17, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Zhuo CHEN, Hakan ERDOGAN, Takuya YOSHIOKA, Fileno A. ALLEVA, Xiong XIAO
  • Patent number: 10289634
    Abstract: A data-clustering method generates data clusters for a set of data points. A region of interest containing the data points and a center matrix for the region of interest are defined, where the center matrix includes an array of center points defining centers of overlapping circles. The data points are mapped to corresponding circles based on near center points. Pairs of overlapping circles are merged based on relative numbers of data points lying in overlap regions of the pairs of overlapping circles compared to total numbers of data points within the corresponding circles. Circles belonging to the one or more data clusters are identified based on merged pairs of overlapping circles, and data points belonging to the one or more data clusters are identified based on the corresponding circles. The method may be performed by a computer having a heterogeneous architecture with parallel processors.
    Type: Grant
    Filed: September 5, 2016
    Date of Patent: May 14, 2019
    Assignee: NXP USA, INC.
    Inventors: Xiong Xiao, Zhenyong Chen, Xianzhong Li
  • Publication number: 20190139563
    Abstract: Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate out the audio signals. The spatially filtered signals are then input into a plurality of separators, so each signal is input into a corresponding separator. The separators use neural networks to separate out audio sources. The separators typically produce multiple output signals for each single input signal. A post selection processor then assesses the separator outputs to pick the signals with the highest quality output. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.
    Type: Application
    Filed: November 6, 2017
    Publication date: May 9, 2019
    Inventors: Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong
  • Publication number: 20170132307
    Abstract: A data-clustering method generates data clusters for a set of data points. A region of interest containing the data points and a center matrix for the region of interest are defined, where the center matrix includes an array of center points defining centers of overlapping circles. The data points are mapped to corresponding circles based on near center points. Pairs of overlapping circles are merged based on relative numbers of data points lying in overlap regions of the pairs of overlapping circles compared to total numbers of data points within the corresponding circles. Circles belonging to the one or more data clusters are identified based on merged pairs of overlapping circles, and data points belonging to the one or more data clusters are identified based on the corresponding circles. The method may be performed by a computer having a heterogeneous architecture with parallel processors.
    Type: Application
    Filed: September 5, 2016
    Publication date: May 11, 2017
    Inventors: Xiong Xiao, Zhenyong Chen, Xianzhong Li
  • Patent number: 8955742
    Abstract: A self-service terminal includes a safety box, a main control module, an operating module, and a shielding member. The main control module includes a casing and a mounting frame mounted in the casing. The mounting frame is attached on the safety box. The casing includes a pair of side panels and a top panel connecting the pair of side panels. The pair of side panels and the top panel are detachably attached to the mounting frame. The operating module is attached to a front side of the main control module. The shielding member is attached to the safety box and encloses the operating module.
    Type: Grant
    Filed: November 30, 2010
    Date of Patent: February 17, 2015
    Assignees: Hong Fu Jin Precision Industry (Shenzhen) Co., Ltd., Hon Hai Precision Industry Co., Ltd.
    Inventors: Chih-Kun Shih, Yang-Jie Luo, Wan-Cheng Luo, Si-Long Li, Xiang-Xiong Xiao, Jie Peng
  • Patent number: 8651372
    Abstract: A self-service terminal includes a first cabinet, a cover rotatably secured on the first cabinet, and an adjustment device received in the first cabinet. A card slot for inputting/outputting a cash card is defined in the cover, and a positioning member is attached to the user interface cover. The adjustment device includes a positioning tray and a mounting member slidably secured to the positioning tray. A reader is attached to the positioning tray and includes an interface. The mounting member includes a slanted side plate abutting the positioning tray, wherein the positioning member is engaged with the positioning tray, for aligning with the interface of the reader and the card slot of the cover.
    Type: Grant
    Filed: November 26, 2010
    Date of Patent: February 18, 2014
    Assignees: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd., Hon Hai Precision Industry Co., Ltd.
    Inventors: Chih-Kun Shih, Si-Long Li, Yang-Jie Luo, Wan-Cheng Luo, Xiang-Xiong Xiao, Jie Peng
  • Patent number: 8452071
    Abstract: A self-service terminal for storing currency includes a detection unit, an image capturing unit, a database, and a serial number reading unit. The detection unit detects the currency. The image capturing unit is connected to the detection unit, and captures an image of a side of the currency on which a string of serial numbers is printed. The database is connected to the image capturing unit. The database includes depositor's information stored therein. The database saves the image therein. The serial number reading unit is connected to the database. The serial number reading unit captures the string of serial numbers from the image. The database saves the string of serial numbers therein, and associates the string of serial numbers with the information of the depositor.
    Type: Grant
    Filed: December 15, 2010
    Date of Patent: May 28, 2013
    Assignees: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd., Hon Hai Precision Industry Co., Ltd.
    Inventors: Chih-Kun Shih, Wan-Cheng Luo, Xiao-Mao Xie, Wei Xu, Xiang-Xiong Xiao
  • Patent number: 8376220
    Abstract: An automatic teller machine includes a chassis, a user interface module, and a pair of sliding mechanisms. The user interface module is rotatably attached to the chassis. The pair of sliding mechanisms are attached to the chassis and the user interface. Each sliding mechanism includes a first rail and a second rail. The first rail is rotatably secured to the chassis. A second blocking member is located on the first rail. The second rail is rotatably secured to the user interface module and slidable on the first rail. A latch member with a latch portion is rotatably located on the second rail. The user interface module is rotatable between a closed position, where the latch portion is located away from the second blocking member, and an open position, where the latch portion engages with the second blocking member.
    Type: Grant
    Filed: December 29, 2010
    Date of Patent: February 19, 2013
    Assignees: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd., Hon Hai Precision Industry Co., Ltd.
    Inventors: Chih-Kun Shih, Yang-Jie Luo, Wan-Cheng Luo, Xiang-Xiong Xiao, Si-Long Li, Jie Peng
  • Publication number: 20120020543
    Abstract: A self-service terminal for storing currency includes a detection unit, an image capturing unit, a database, and a serial number reading unit. The detection unit detects the currency. The image capturing unit is connected to the detection unit, and captures an image of a side of the currency on which a string of serial numbers is printed. The database is connected to the image capturing unit. The database includes depositor's information stored therein. The database saves the image therein. The serial number reading unit is connected to the database. The serial number reading unit captures the string of serial numbers from the image. The database saves the string of serial numbers therein, and associates the string of serial numbers with the information of the depositor.
    Type: Application
    Filed: December 15, 2010
    Publication date: January 26, 2012
    Applicants: HON HAI PRECISION INDUSTRY CO., LTD., HONG FU JIN PRECISION INDUSTRY (ShenZhen) CO., LTD.
    Inventors: CHIH-KUN SHIH, WAN-CHENG LUO, XIAO-MAO XIE, WEI XU, XIANG-XIONG XIAO
  • Publication number: 20120001524
    Abstract: A self-service terminal includes a safety box, a main control module, an operating module, and a shielding member. The main control module includes a casing and a mounting frame mounted in the casing. The mounting frame is attached on the safety box. The casing includes a pair of side panels and a top panel connecting the pair of side panels. The pair of side panels and the top panel are detachably attached to the mounting frame. The operating module is attached to a front side of the main control module. The shielding member is attached to the safety box and encloses the operating module.
    Type: Application
    Filed: November 30, 2010
    Publication date: January 5, 2012
    Applicants: HON HAI PRECISION INDUSTRY CO., LTD., HONG FU JIN PRECISION INDUSTRY (ShenZhen) CO., LTD.
    Inventors: CHIH-KUN SHIH, YANG-JIE LUO, WAN-CHENG LUO, SI-LONG LI, XIANG-XIONG XIAO, JIE PENG
  • Publication number: 20120002354
    Abstract: A self-service terminal includes a control portion and a slide panel. The control portion includes a user interface cover and a maintenance operation portion. The user interface cover is pivotably mounted on the control portion, and rotates on the control portion between a first position and a second position. In the first position, the user interface cover shields the maintenance operation portion to make the maintenance operation portion inaccessible. In the second position, the user interface cover does not shield the maintenance operation portion and the maintenance operation portion is accessible. The slide panel is slidably mounted in the maintenance operation portion. A keyboard and a mouse are mounted on the slide panel. The slide panel hangs over the maintenance operation portion when the user interface cover is in the second position, and is retracted in the maintenance operation portion when the user interface cover is in the first position.
    Type: Application
    Filed: December 16, 2010
    Publication date: January 5, 2012
    Applicants: HON HAI PRECISION INDUSTRY CO., LTD., HONG FU JIN PRECISION INDUSTRY (ShenZhen) CO., LTD.
    Inventors: CHIH-KUN SHIH, YANG-JIE LUO, WAN-CHENG LUO, SI-LONG LI, XIANG-XIONG XIAO, JIE PENG