Patents by Inventor Amit Srivastava

Amit Srivastava has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Techniques for generating data for an intelligent gesture detector

Patent number: 11556183

Abstract: A method and system for generating training data for training a gesture detection machine-learning (ML) model includes receiving a request to generate training data for the gesture detection model, the training data being associated with a target gesture, retrieving data associated with an original gesture, the original gesture being a gesture made using a body part, retrieving skeleton data associated with the target gesture, the skeleton data displaying a skeleton representative of the body part and the skeleton displaying the target gesture, aligning a location of the body part in the data with a location of the skeleton in the skeleton data, providing the aligned data and the skeleton data to an ML model for generating a target data that displays the target gesture, receiving the target data as an output from the ML model, the target data preserving a visual feature of the data and displaying the target gesture, and providing the target data to the gesture detection ML model.

Type: Grant

Filed: September 30, 2021

Date of Patent: January 17, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ji Li, Mingxi Cheng, Fatima Zohra Daha, Amit Srivastava
INTEGRATED SYSTEM FOR DETECTING AND CORRECTING CONTENT

Publication number: 20220405907

Abstract: Aspects of the present disclosure relate to systems and methods for detecting and correcting undesirable content. A video feed may be segmented to distinguish background data from foreground data. It may be determined that a region of the background data includes a qualifying behavior. The qualifying behavior may be classified as belonging to a distracting category of data. An effect may be applied to the background data that includes the qualifying behavior to reduce an appearance of the qualifying behavior.

Type: Application

Filed: June 20, 2021

Publication date: December 22, 2022

Inventors: Fatima Zohra DAHA, Amit SRIVASTAVA, Nicolas Paul-Stringall HIGUERA, Robert Fernand GORDAN
Image classification modeling while maintaining data privacy compliance

Patent number: 11507677

Abstract: The present disclosure relates to processing operations that execute image classification training for domain-specific traffic, where training operations are entirely compliant with data privacy regulations and policies. Image classification model training, as described herein, is configured to classify meaningful image categories in domain-specific scenarios where there is unknown data traffic and strict data compliance requirements that result in privacy-limited image data sets. Iterative image classification training satisfies data compliance requirements through a combination of online image classification training and offline image classification training. This results in tuned image recognition classifiers that have improved accuracy and efficiency over general image recognition classifiers when working with domain-specific data traffic. One or more image recognition classifiers are independently trained and tuned to detect an image class for image classification.

Type: Grant

Filed: February 15, 2019

Date of Patent: November 22, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ji Li, Youjun Liu, Amit Srivastava
AUTOMATED SCRIPT GENERATION AND AUDIO-VISUAL PRESENTATIONS

Publication number: 20220366153

Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving an input document and parsing the input document to generate inputs for a natural language generation model using a text analysis model. The natural language generation model generates one or more candidate presentation scripts based on the inputs. A presentation script is selected from the candidate presentation scripts and displayed. A text-to-speech model may be used to generate a synthesized audio presentation of the presentation script. A final presentation may be generated that includes a visual display of the input document and the corresponding audio presentation in sync with the visual display.

Type: Application

Filed: May 12, 2021

Publication date: November 17, 2022

Inventors: Ji LI, Konstantin SELESKEROV, Huey-Ru TSAI, Muin Barkatali MOMIN, Ramya TRIDANDAPANI, Sindhu Vigasini JAMBUNATHAN, Amit SRIVASTAVA, Derek Martin JOHNSON, Gencheng WU, Sheng ZHAO, Xinfeng CHEN, Bohan LI
Automated intelligent content generation

Patent number: 11494396

Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving a user query for creating content in a content generation application and determining an action from an intent of the user query. A prompt is generated based on the action and provided to a natural language generation model. In response to the prompt, output is received from the natural language generation model. Response content is generated based on the output in a format compatible with the content generation application. At least some of the response content is displayed to the user. The user can choose to keep, edit, or discard the response content. The user can iterate with additional queries until the content document reflects the user's desired content.

Type: Grant

Filed: January 19, 2021

Date of Patent: November 8, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ji Li, Amit Srivastava, Muin Barkatali Momin, Muqi Li, Emily Lauren Tohir, SivaPriya Kalyanaraman, Derek Martin Johnson
Acoustic based speech analysis using deep learning models

Patent number: 11495210

Abstract: A method and system for detecting one or more speech features in speech audio data includes receiving speech audio data, performing preprocessing on the speech audio data to prepare the speech audio data for use as an input into one or more models that detect one or more speech features, providing the preprocessed speech audio data to a stacked machine learning model, and analyzing the preprocessed speech audio data via the stacked ML model to detect the one or more speech features. The stacked ML model includes a feature aggregation model, a sequence to sequence model, and a decision-making model.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 8, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ji Li, Amit Srivastava
Method and system of utilizing unsupervised learning to improve text to content suggestions

Patent number: 11455466

Abstract: A method and system for providing an application-specific embedding for an entire text-to-content suggestions service is disclosed. The method includes accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an unsupervised ML model to the dataset to generate a pretrained embedding; and utilizing the pretrained embedding to train the text-to-content suggestion ML model utilized by the application.

Type: Grant

Filed: May 1, 2019

Date of Patent: September 27, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Xingxing Zhang, Ji Li, Furu Wei, Ming Zhou, Amit Srivastava
Method and system of utilizing unsupervised learning to improve text to content suggestions

Patent number: 11429787

Abstract: Method and system for training a text-to-content suggestion ML model include accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an ML model to the dataset to generate a pretrained embedding, and applying a supervised ML model to a labeled dataset to train the text-to-content suggestion ML model utilized by the application by utilizing the pretrained embedding generated by the supervised ML model.

Type: Grant

Filed: May 1, 2019

Date of Patent: August 30, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ji Li, Xingxing Zhang, Furu Wei, Ming Zhou, Amit Srivastava
Process for making compounds for use in the treatment of cancer

Patent number: 11414396

Abstract: Disclosed herein is a process of making a compound of formula I The compound of formula I is an inhibitor of MEK and thus can be used to treat cancer.

Type: Grant

Filed: August 26, 2020

Date of Patent: August 16, 2022

Assignees: Exelixis, Inc., Genentech, Inc.

Inventors: Sriram Naganathan, Nathan Guz, Matthew Pfeiffer, C. Gregory Sowell, Tracy Bostick, Jason Yang, Amit Srivastava, Neel Kumar Anand
AUTOMATED INTELLIGENT CONTENT GENERATION

Publication number: 20220229832

Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving a user query for creating content in a content generation application and determining an action from an intent of the user query. A prompt is generated based on the action and provided to a natural language generation model. In response to the prompt, output is received from the natural language generation model. Response content is generated based on the output in a format compatible with the content generation application. At least some of the response content is displayed to the user. The user can choose to keep, edit, or discard the response content. The user can iterate with additional queries until the content document reflects the user's desired content.

Type: Application

Filed: January 19, 2021

Publication date: July 21, 2022

Inventors: Ji LI, Amit SRIVASTAVA, Muin Barkatali MOMIN, Muqi LI, Emily Lauren TOHIR, SivaPriya KALYANARAMAN, Derek Martin JOHNSON
Multilingual Model Training Using Parallel Corpora, Crowdsourcing, and Accurate Monolingual Models

Publication number: 20220198157

Abstract: A data processing system for generating training data for a multilingual NLP model implements obtaining a corpus including first and second content items, where the first content items are English-language textual content, and the second content items are translations of the first content items in one or more non-English target languages; selecting a first content item from the plurality of first content items; generating a plurality of candidate labels for the first content item by analyzing the first content item with a plurality of first English-language NLP models; selecting a first label from the plurality of candidate labels; generating first training data by associating the first label with the first content item; generating second training data by associating the first label with a second content item of the second content items; and training a pretrained multilingual NLP model with the first training data and the second training data.

Type: Application

Filed: December 22, 2020

Publication date: June 23, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Ji LI, Amit SRIVASTAVA
Speaking technique improvement assistant

Patent number: 11341331

Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary is presented to the user as feedback.

Type: Grant

Filed: October 4, 2019

Date of Patent: May 24, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Huakai Liao, Priyanka Vikram Sinha, Kevin Dara Khieu, Derek Martin Johnson, Siliang Kang, Huey-Ru Tsai, Amit Srivastava
AUTOMATIC GENERATION OF TRANSFORMATIONS OF FORMATTED TEMPLATES USING DEEP LEARNING MODELING

Publication number: 20220147702

Abstract: The present disclosure applies trained artificial intelligence (AI) processing adapted to automatically generating transformations of formatted templates. Pre-existing formatted templates (e.g., slide-based presentation templates) are leveraged by the trained AI processing to automatically generate a plurality of high-quality template transformations. In transforming a formatted template, the trained AI processing not only generates feature transformation of objects thereof but may also provide style transformations where attributes associated with a presentation theme may be modified for a formatted template or set of formatted templates. The trained AI processing is novel in that it is tailored for analysis of feature data of a specific type of formatted template.

Type: Application

Filed: November 11, 2020

Publication date: May 12, 2022

Inventors: Ji LI, Amit SRIVASTAVA, Mingxi CHENG
TECHNIQUES FOR RICH INTERACTION IN REMOTE LIVE PRESENTATION AND ACCURATE SUGGESTION FOR REHEARSAL THROUGH AUDIENCE VIDEO ANALYSIS

Publication number: 20220141532

Abstract: Techniques performed by a data processing system for facilitating an online presentation session include establishing the session for a first computing device of a presenter and a plurality of second computing devices of a plurality of participants, receiving a set of first media streams comprising presentation content from the first computing device, sending a set of second media streams to the plurality of second computing devices, receiving a set of third media streams from the computing devices of a first subset of the plurality of participants including video content of first subset of the participants captured by the respective computing devices of the first subset of participants, analyzing the set of third media streams to identify a set of first reactions by the first subset participants to obtain first reaction information, determining first graphical representation information representing the first reaction information, and sending a fourth media stream to cause the first computing device to displ

Type: Application

Filed: October 30, 2020

Publication date: May 5, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Ji LI, Robert Fernand GORDAN, Nicolas HIGUERA, Amit SRIVASTAVA
Techniques for Presentation Analysis Based on Audience Feedback, Reactions, and Gestures

Publication number: 20220138470

Abstract: Techniques performed by a data processing system for facilitating an online presentation session include establishing an online presentation session for conducting an online presentation for a first computing device of a presenter and a plurality of second computing devices of a plurality of participants, receiving a set of first media streams comprising presentation content from the first computing device, receiving a set of second media streams from the second computing devices of a first subset of the plurality of participants, the set of second media streams including audio content, video content, or both of first subset of the plurality of participants, analyzing the set of first media streams using one or more first machine learning models n to generate a set of first feedback results, analyzing the set of second media streams using one or more second machine learning models to identify a set of first reactions by the participants to obtain first reaction information, automatically analyzing the set of

Type: Application

Filed: October 30, 2020

Publication date: May 5, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Konstantin SELESKEROV, Amit SRIVASTAVA, Derek Martin JOHNSON, Priyanka Vikram SINHA, Gencheng WU, Brittany Elizabeth MEDEROS
POLYSORBATE MIXTURES HAVING MODIFIED FATTY ACID ESTER DISTRIBUTION

Publication number: 20220111054

Abstract: The present disclosure provides polysorbate 20 compositions with particular fatty acid ester concentrations. In some embodiments, they may be used in pharmaceutical formulations, for example, to improve stability.

Type: Application

Filed: September 23, 2021

Publication date: April 14, 2022

Applicant: Genentech, Inc.

Inventors: Sandeep Yadav, Nidhi Doshi, Tamanna Shobha, Anthony Tomlinson, Amit Srivastava
Contextual voice-based presentation assistance

Patent number: 11289091

Abstract: Examples are disclosed that relate to methods and computing devices for providing voice-based assistance during a presentation. In one example, a method comprises receiving content of a slide deck, processing the content of the slide deck, and populating a contextual knowledge graph based on the content of the slide deck. A voice input is received from a presenter. Using the knowledge graph, the voice input is analyzed to determine an action to be performed by a presentation program during the presentation. The action is translated into one or more commands executable by the presentation program to perform the action, and the one or more commands are sent to a client device executing the presentation program.

Type: Grant

Filed: August 22, 2019

Date of Patent: March 29, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Amit Srivastava, Dachuan Zhang
Machine learning model-based content processing framework

Patent number: 11270059

Abstract: A textual user input is received and a plurality of different text-to-content models are run on the textual user input. A selection system attempts to identify a suggested content item, based upon the outputs of the text-to-content models. The selection system first attempts to generate a completed suggestion based on outputs from a single text-to-content model. It then attempts to mix the outputs of the text-to-content models to obtain a completed content suggestion.

Type: Grant

Filed: August 27, 2019

Date of Patent: March 8, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ji Li, Xiaozhi Yu, Gregory Alexander DePaul, Youjun Liu, Amit Srivastava
AUTOMATIC REACTION-TRIGGERING FOR LIVE PRESENTATIONS

Publication number: 20220038580

Abstract: The present disclosure relates to processing operations configured to provide processing that automatically analyzes acoustic signals from attendees of a live presentation and automatically triggers corresponding reaction indications from results of analysis thereof. Exemplary reaction indications provide feedback for live presentations that can be presented in real-time (or near real-time) without requiring a user to manually take action to provide any feedback. As a non-limiting example, reaction indications may be presented in a form that is easy to visualize and understand such as emojis or icons. Another example of a reaction indication is a graphical user interface (GUI) notification that provides a predictive indication of user intent derived from analysis of acoustic signals.

Type: Application

Filed: August 3, 2020

Publication date: February 3, 2022

Inventors: Ji Li, Amit Srivastava, Derek Martin Johnson, Priyanka Vikram Sinha, Konstantin Seleskerov, Gencheng Wu
DETECTION OF MISSION CHANGE IN CONVERSATION

Publication number: 20220020375

Abstract: Methods, systems, and computer programs are presented for detecting a mission changes in a conversation. A user utterance from a user device is received. The user utterance is part of a conversation with an intelligent assistant. The conversation includes preceding user utterances in pursuit of a first mission. It is determined that the user utterance indicates a mission change from the first mission to a second mission based on an application of a machine-learned model to the user utterance and the preceding user utterances. The machine-learned model has been trained repeatedly with past utterances of other users over a time period, the determining based on a certainty of the indication satisfying a certainty threshold. Responsive to the determining that the user utterance indicates the mission change from the first mission to a second mission, a reply to the user utterance is generated to further the second mission rather than the first mission.

Type: Application

Filed: September 30, 2021

Publication date: January 20, 2022

Inventors: Stefan Schoenmackers, Amit Srivastava, Lawrence William Colagiovanni, Sanjika Hewavitharana, Ajinkya Gorakhnath Kale, Vinh Khuc

prev 1 2 3 4 5 6 next