Patents by Inventor Zhaoqing Ma
Zhaoqing Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11354936
Abstract: Techniques for improved image classification are provided. Face embeddings are generated for each face depicted in a collection of images, and the face embeddings are clustered based on the individual whose face is depicted. Based on these clusters, each embedding is assigned a label reflecting the cluster assignments. Some or all of the face embeddings are then used to train a classifier model to generate cluster labels for new input images. This classifier model can then be used to process new images in an efficient manner, and classify them into appropriate clusters.
Type: Grant
Filed: July 15, 2020
Date of Patent: June 7, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Dharmil Satishbhai Chandarana, Ilya Levner, Zhaoqing Ma, Prajwal Yadapadithaya, Riley James Williams, Canku Alp Calargun, Prama Anand
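The pipeline the abstract describes (cluster face embeddings, derive labels from cluster membership, then use those labels to drive a classifier) can be illustrated with a minimal sketch. This is not the patented implementation: the greedy distance-threshold clustering and the nearest-centroid "classifier" below are stand-ins for the unspecified clustering and model-training steps, and all function names and parameters are hypothetical.

```python
import math

def cluster_embeddings(embeddings, threshold=0.5):
    """Greedy clustering sketch: assign each embedding to the first cluster
    whose representative point is within `threshold`, else start a new
    cluster. The first embedding in each cluster serves as its centroid."""
    centroids, labels = [], []
    for emb in embeddings:
        for idx, c in enumerate(centroids):
            if math.dist(emb, c) < threshold:
                labels.append(idx)
                break
        else:
            centroids.append(emb)
            labels.append(len(centroids) - 1)
    return labels, centroids

def classify(embedding, centroids):
    """Nearest-centroid lookup standing in for the trained classifier model
    that maps a new face embedding to a cluster label."""
    return min(range(len(centroids)),
               key=lambda i: math.dist(embedding, centroids[i]))
```

In practice the cluster labels produced by the first stage would become training targets for a learned classifier, so new images can be routed to a cluster without re-running clustering over the whole collection.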
-
Patent number: 11232808
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
Type: Grant
Filed: April 25, 2019
Date of Patent: January 25, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Zhaoqing Ma, Tony Roy Hardie, Christo Frank Devaraj
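The abstract's core idea is combining several signals (the message's own measured speed, listener preferences, background noise) into one target speech speed, which a pitch-preserving time-stretch then realizes. A minimal sketch of the target-speed step follows; the blending weights, noise penalty, and rate bounds are all hypothetical, not values from the patent.

```python
def target_speech_speed(message_rate_wpm, preferred_rate_wpm=160,
                        background_noise=0.0, min_rate=100, max_rate=220):
    """Blend the message's measured rate with the listener's preferred rate,
    slow down as background noise (0.0-1.0) rises, and clamp the result
    to a plausible words-per-minute range."""
    blended = 0.5 * message_rate_wpm + 0.5 * preferred_rate_wpm
    blended *= 1.0 - 0.3 * min(max(background_noise, 0.0), 1.0)
    return max(min_rate, min(max_rate, blended))
```

The ratio `target_speech_speed(...) / message_rate_wpm` would then drive a time-stretching algorithm (e.g. a phase vocoder or WSOLA-style method) that changes duration without shifting pitch.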
-
Patent number: 11003959
Abstract: Categorizing images may include training a first neural network to cluster a plurality of images to obtain a first image embedding space, wherein a vector representation is determined for each of the plurality of images based on the training, determining a vector norm value corresponding to each of the plurality of images based on the vector representation for each of the plurality of images, and identifying a first subset of the images for which a corresponding vector norm value satisfies a predetermined vector norm quality threshold. Then, a second neural network may be trained using the first subset of images to obtain a second image embedding space, and the second image embedding space may be used to categorize additional images.
Type: Grant
Filed: June 13, 2019
Date of Patent: May 11, 2021
Assignee: Amazon Technologies, Inc.
Inventors: Ilya Levner, Konstantinos Boulis, Gurbinder Gill, Canku Calargun, Prajwal Yadapadithaya, Venkata Krishnan Ramamoorthy, Zhaoqing Ma
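The filtering step in this abstract (keep only images whose embedding's vector norm satisfies a quality threshold, then retrain on that subset) can be sketched in a few lines. This is an illustrative stand-in, not the patented method: the L2 norm and the ">= threshold" quality test are assumptions, and the surrounding neural-network training is omitted.

```python
import math

def filter_by_norm(vectors, quality_threshold):
    """Keep only vectors whose L2 norm meets the quality threshold.
    The survivors form the subset used to train the second network."""
    kept = []
    for v in vectors:
        norm = math.sqrt(sum(x * x for x in v))
        if norm >= quality_threshold:
            kept.append(v)
    return kept
```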
-
Patent number: 10491549
Abstract: A single communication application can display an initial modality view associated with the group communication on a canvas of a communication application user interface. The initial modality view can be one of multiple available views. The single communication application can receive a selection to display a view different from the initial modality view. Each active user in the group communication can be on a different computing device with different active instances of the single communication application, each of which is configured to switch, independently of the other active instances running on different computing devices, to modality views that are different from the initial modality view. The communication application can display the selected modality view on the canvas by transitioning from the initial modality view.
Type: Grant
Filed: June 7, 2018
Date of Patent: November 26, 2019
Assignee: Microsoft Technology Licensing, LLC
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat
-
Publication number: 20190318758
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
Type: Application
Filed: April 25, 2019
Publication date: October 17, 2019
Inventors: Zhaoqing Ma, Tony Roy Hardie, Christo Frank Devaraj
-
Patent number: 10276185
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
Type: Grant
Filed: August 15, 2017
Date of Patent: April 30, 2019
Assignee: Amazon Technologies, Inc.
Inventors: Zhaoqing Ma, Tony Roy Hardie, Christo Frank Devaraj
-
Publication number: 20180351886
Abstract: A single communication application can display an initial modality view associated with the group communication on a canvas of a communication application user interface. The initial modality view can be one of multiple available views. The single communication application can receive a selection to display a view different from the initial modality view. Each active user in the group communication can be on a different computing device with different active instances of the single communication application, each of which is configured to switch, independently of the other active instances running on different computing devices, to modality views that are different from the initial modality view. The communication application can display the selected modality view on the canvas by transitioning from the initial modality view.
Type: Application
Filed: June 7, 2018
Publication date: December 6, 2018
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat
-
Patent number: 10009298
Abstract: A communication application displays a modality view that may be one of a collaboration, a gallery, or a messaging view on a conversation canvas. The application may display an initial view according to a modality selection by the user. A user is enabled to select a next view from the set by providing a user action such as a tap, swipe action, etc. The application dynamically generates the next modality view according to the initial view. Common participants and common contexts are used to configure the next view. Subsequent to configuration, the application displays the next modality view on the conversation canvas by transitioning from the initial view. The application retains session information from the initial view to restore the initial view session if the user selects to return to the initial view.
Type: Grant
Filed: June 5, 2015
Date of Patent: June 26, 2018
Assignee: Microsoft Technology Licensing, LLC
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat
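The view-switching behavior this family of patents describes (a fixed set of modality views on a canvas, with per-view session state retained so returning to a view restores it) can be sketched as a small state holder. This is an illustrative sketch only; the class and method names are hypothetical and the patented system's actual architecture is not specified here.

```python
class ConversationCanvas:
    """One application instance's canvas: switches among modality views
    independently of other instances, and retains per-view session state
    so a previously shown view can be restored."""
    VIEWS = ("collaboration", "gallery", "messaging")

    def __init__(self, initial="messaging"):
        self.current = initial
        self.sessions = {}  # view name -> last saved session state

    def save_session(self, state):
        """Record session information (scroll position, draft text, etc.)
        for the currently displayed view."""
        self.sessions[self.current] = state

    def switch_to(self, view):
        """Transition the canvas to another modality view and return that
        view's retained session state, or None if it was never shown."""
        if view not in self.VIEWS:
            raise ValueError(f"unknown modality view: {view}")
        restored = self.sessions.get(view)
        self.current = view
        return restored
```

Because each device runs its own instance with its own `ConversationCanvas`, one participant switching to the gallery view does not change what the other participants see.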
-
Publication number: 20150319113
Abstract: A communication application displays a modality view that may be one of a collaboration, a gallery, or a messaging view on a conversation canvas. The application may display an initial view according to a modality selection by the user. A user is enabled to select a next view from the set by providing a user action such as a tap, swipe action, etc. The application dynamically generates the next modality view according to the initial view. Common participants and common contexts are used to configure the next view. Subsequent to configuration, the application displays the next modality view on the conversation canvas by transitioning from the initial view. The application retains session information from the initial view to restore the initial view session if the user selects to return to the initial view.
Type: Application
Filed: June 5, 2015
Publication date: November 5, 2015
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat
-
Patent number: 9083816
Abstract: A communication application displays a modality view that may be one of a collaboration, a gallery, or a messaging view on a conversation canvas. The application may display an initial view according to a modality selection by the user. A user is enabled to select a next view from the set by providing a user action such as a tap, swipe action, etc. The application dynamically generates the next modality view according to the initial view. Common participants and common contexts are used to configure the next view. Subsequent to configuration, the application displays the next modality view on the conversation canvas by transitioning from the initial view. The application retains session information from the initial view to restore the initial view session if the user selects to return to the initial view.
Type: Grant
Filed: September 14, 2012
Date of Patent: July 14, 2015
Assignee: Microsoft Technology Licensing, LLC
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat
-
Publication number: 20140082522
Abstract: A communication application displays a modality view that may be one of a collaboration, a gallery, or a messaging view on a conversation canvas. The application may display an initial view according to a modality selection by the user. A user is enabled to select a next view from the set by providing a user action such as a tap, swipe action, etc. The application dynamically generates the next modality view according to the initial view. Common participants and common contexts are used to configure the next view. Subsequent to configuration, the application displays the next modality view on the conversation canvas by transitioning from the initial view. The application retains session information from the initial view to restore the initial view session if the user selects to return to the initial view.
Type: Application
Filed: September 14, 2012
Publication date: March 20, 2014
Applicant: Microsoft Corporation
Inventors: Nathan Gunderson, Alexander Darrow, Zhaoqing Ma, Punit Java, Christina Marie Meyer, Steve Chang, Leslie Cindy Chen, Eric Hamilton, Marcelo Truffat