Patents by Inventor Amit Srivastava

Amit Srivastava has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11556183
    Abstract: A method and system for generating training data for training a gesture detection machine-learning (ML) model includes receiving a request to generate training data for the gesture detection model, the training data being associated with a target gesture, retrieving data associated with an original gesture, the original gesture being a gesture made using a body part, retrieving skeleton data associated with the target gesture, the skeleton data displaying a skeleton representative of the body part and the skeleton displaying the target gesture, aligning a location of the body part in the data with a location of the skeleton in the skeleton data, providing the aligned data and the skeleton data to an ML model for generating a target data that displays the target gesture, receiving the target data as an output from the ML model, the target data preserving a visual feature of the data and displaying the target gesture, and providing the target data to the gesture detection ML model.
    Type: Grant
    Filed: September 30, 2021
    Date of Patent: January 17, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Mingxi Cheng, Fatima Zohra Daha, Amit Srivastava
  • Publication number: 20220405907
    Abstract: Aspects of the present disclosure relate to systems and methods for detecting and correcting undesirable content. A video feed may be segmented to distinguish background data from foreground data. It may be determined that a region of the background data includes a qualifying behavior. The qualifying behavior may be classified as belonging to a distracting category of data. An effect may be applied to the background data that includes the qualifying behavior to reduce an appearance of the qualifying behavior.
    Type: Application
    Filed: June 20, 2021
    Publication date: December 22, 2022
    Inventors: Fatima Zohra DAHA, Amit SRIVASTAVA, Nicolas Paul-Stringall HIGUERA, Robert Fernand GORDAN
  • Patent number: 11507677
    Abstract: The present disclosure relates to processing operations that execute image classification training for domain-specific traffic, where training operations are entirely compliant with data privacy regulations and policies. Image classification model training, as described herein, is configured to classify meaningful image categories in domain-specific scenarios where there is unknown data traffic and strict data compliance requirements that result in privacy-limited image data sets. Iterative image classification training satisfies data compliance requirements through a combination of online image classification training and offline image classification training. This results in tuned image recognition classifiers that have improved accuracy and efficiency over general image recognition classifiers when working with domain-specific data traffic. One or more image recognition classifiers are independently trained and tuned to detect an image class for image classification.
    Type: Grant
    Filed: February 15, 2019
    Date of Patent: November 22, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ji Li, Youjun Liu, Amit Srivastava
  • Publication number: 20220366153
    Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving an input document and parsing the input document to generate inputs for a natural language generation model using a text analysis model. The natural language generation model generates one or more candidate presentation scripts based on the inputs. A presentation script is selected from the candidate presentation scripts and displayed. A text-to-speech model may be used to generate a synthesized audio presentation of the presentation script. A final presentation may be generated that includes a visual display of the input document and the corresponding audio presentation in sync with the visual display.
    Type: Application
    Filed: May 12, 2021
    Publication date: November 17, 2022
    Inventors: Ji LI, Konstantin SELESKEROV, Huey-Ru TSAI, Muin Barkatali MOMIN, Ramya TRIDANDAPANI, Sindhu Vigasini JAMBUNATHAN, Amit SRIVASTAVA, Derek Martin JOHNSON, Gencheng WU, Sheng ZHAO, Xinfeng CHEN, Bohan LI
  • Patent number: 11494396
    Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving a user query for creating content in a content generation application and determining an action from an intent of the user query. A prompt is generated based on the action and provided to a natural language generation model. In response to the prompt, output is received from the natural language generation model. Response content is generated based on the output in a format compatible with the content generation application. At least some of the response content is displayed to the user. The user can choose to keep, edit, or discard the response content. The user can iterate with additional queries until the content document reflects the user's desired content.
    Type: Grant
    Filed: January 19, 2021
    Date of Patent: November 8, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ji Li, Amit Srivastava, Muin Barkatali Momin, Muqi Li, Emily Lauren Tohir, SivaPriya Kalyanaraman, Derek Martin Johnson
  • Patent number: 11495210
    Abstract: A method and system for detecting one or more speech features in speech audio data includes receiving speech audio data, performing preprocessing on the speech audio data to prepare the speech audio data for use as an input into one or more models that detect one or more speech features, providing the preprocessed speech audio data to a stacked machine learning model, and analyzing the preprocessed speech audio data via the stacked ML model to detect the one or more speech features. The stacked ML model includes a feature aggregation model, a sequence to sequence model, and a decision-making model.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: November 8, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Amit Srivastava
  • Patent number: 11455466
    Abstract: A method and system for providing an application-specific embedding for an entire text-to-content suggestions service is disclosed. The method includes accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an unsupervised ML model to the dataset to generate a pretrained embedding; and utilizing the pretrained embedding to train the text-to-content suggestion ML model utilized by the application.
    Type: Grant
    Filed: May 1, 2019
    Date of Patent: September 27, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xingxing Zhang, Ji Li, Furu Wei, Ming Zhou, Amit Srivastava
  • Patent number: 11429787
    Abstract: Method and system for training a text-to-content suggestion ML model include accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an ML model to the dataset to generate a pretrained embedding, and applying a supervised ML model to a labeled dataset to train the text-to-content suggestion ML model utilized by the application by utilizing the pretrained embedding generated by the supervised ML model.
    Type: Grant
    Filed: May 1, 2019
    Date of Patent: August 30, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Xingxing Zhang, Furu Wei, Ming Zhou, Amit Srivastava
  • Patent number: 11414396
    Abstract: Disclosed herein is a process of making a compound of formula I The compound of formula I is an inhibitor of MEK and thus can be used to treat cancer.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: August 16, 2022
    Assignees: Exelixis, Inc., Genentech, Inc.
    Inventors: Sriram Naganathan, Nathan Guz, Matthew Pfeiffer, C. Gregory Sowell, Tracy Bostick, Jason Yang, Amit Srivastava, Neel Kumar Anand
  • Publication number: 20220229832
    Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving a user query for creating content in a content generation application and determining an action from an intent of the user query. A prompt is generated based on the action and provided to a natural language generation model. In response to the prompt, output is received from the natural language generation model. Response content is generated based on the output in a format compatible with the content generation application. At least some of the response content is displayed to the user. The user can choose to keep, edit, or discard the response content. The user can iterate with additional queries until the content document reflects the user's desired content.
    Type: Application
    Filed: January 19, 2021
    Publication date: July 21, 2022
    Inventors: Ji LI, Amit SRIVASTAVA, Muin Barkatali MOMIN, Muqi LI, Emily Lauren TOHIR, SivaPriya KALYANARAMAN, Derek Martin JOHNSON
  • Publication number: 20220198157
    Abstract: A data processing system for generating training data for a multilingual NLP model implements obtaining a corpus including first and second content items, where the first content items are English-language textual content, and the second content items are translations of the first content items in one or more non-English target languages; selecting a first content item from the plurality of first content items; generating a plurality of candidate labels for the first content item by analyzing the first content item with a plurality of first English-language NLP models; selecting a first label from the plurality of candidate labels; generating first training data by associating the first label with the first content item; generating second training data by associating the first label with a second content item of the second content items; and training a pretrained multilingual NLP model with the first training data and the second training data.
    Type: Application
    Filed: December 22, 2020
    Publication date: June 23, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Ji LI, Amit SRIVASTAVA
  • Patent number: 11341331
    Abstract: An intelligent speech assistant receives information collected while a user is speaking. The information can comprise speech data, vision data, or both, where the speech data is from the user speaking and the vision data is of the user while speaking. The assistant evaluates the speech data against a script which can contain information that the user should speak, information that the user should not speak, or both. The assistant collects instances where the user utters phrases that match the script or instances where the user utters phrases that do not match the script, depending on whether phases should or should not be spoken. The assistant evaluates vision data to identify gestures, facial expressions, and/or emotions of the user. Instances where the gestures, facial expressions, and/or emotions are not appropriate to the context are flagged. Real-time prompts and/or a summary is presented to the user as feedback.
    Type: Grant
    Filed: October 4, 2019
    Date of Patent: May 24, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Huakai Liao, Priyanka Vikram Sinha, Kevin Dara Khieu, Derek Martin Johnson, Siliang Kang, Huey-Ru Tsai, Amit Srivastava
  • Publication number: 20220147702
    Abstract: The present disclosure applies trained artificial intelligence (AI) processing adapted to automatically generating transformations of formatted templates. Pre-existing formatted templates (e.g., slide-based presentation templates) are leveraged by the trained AI processing to automatically generate a plurality of high-quality template transformations. In transforming a formatted template, the trained AI processing not only generates feature transformation of objects thereof but may also provide style transformations where attributes associated with a presentation theme may be modified for a formatted template or set of formatted templates. The trained AI processing is novel in that it is tailored for analysis of feature data of a specific type of formatted template.
    Type: Application
    Filed: November 11, 2020
    Publication date: May 12, 2022
    Inventors: Ji LI, Amit SRIVASTAVA, Mingxi CHENG
  • Publication number: 20220141532
    Abstract: Techniques performed by a data processing system for facilitating an online presentation session include establishing the session for a first computing device of a presenter and a plurality of second computing devices of a plurality of participants, receiving a set of first media streams comprising presentation content from the first computing device, sending a set of second media streams to the plurality of second computing devices, receiving a set of third media streams from the computing devices of a first subset of the plurality of participants including video content of first subset of the participants captured by the respective computing devices of the first subset of participants, analyzing the set of third media streams to identify a set of first reactions by the first subset participants to obtain first reaction information, determining first graphical representation information representing the first reaction information, and sending a fourth media stream to cause the first computing device to displ
    Type: Application
    Filed: October 30, 2020
    Publication date: May 5, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Ji LI, Robert Fernand GORDAN, Nicolas HIGUERA, Amit SRIVASTAVA
  • Publication number: 20220138470
    Abstract: Techniques performed by a data processing system for facilitating an online presentation session include establishing an online presentation session for conducting an online presentation for a first computing device of a presenter and a plurality of second computing devices of a plurality of participants, receiving a set of first media streams comprising presentation content from the first computing device, receiving a set of second media streams from the second computing devices of a first subset of the plurality of participants, the set of second media streams including audio content, video content, or both of first subset of the plurality of participants, analyzing the set of first media streams using one or more first machine learning models n to generate a set of first feedback results, analyzing the set of second media streams using one or more second machine learning models to identify a set of first reactions by the participants to obtain first reaction information, automatically analyzing the set of
    Type: Application
    Filed: October 30, 2020
    Publication date: May 5, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Konstantin SELESKEROV, Amit SRIVASTAVA, Derek Martin JOHNSON, Priyanka Vikram SINHA, Gencheng WU, Brittany Elizabeth MEDEROS
  • Publication number: 20220111054
    Abstract: The present disclosure provides polysorbate 20 compositions with particular fatty acid ester concentrations. In some embodiments, they may be used in pharmaceutical formulations, for example, to improve stability.
    Type: Application
    Filed: September 23, 2021
    Publication date: April 14, 2022
    Applicant: Genentech, Inc.
    Inventors: Sandeep Yadav, Nidhi Doshi, Tamanna Shobha, Anthony Tomlinson, Amit Srivastava
  • Patent number: 11289091
    Abstract: Examples are disclosed that relate to methods and computing devices for providing voice-based assistance during a presentation. In one example, a method comprises receiving content of a slide deck, processing the content of the slide deck, and populating a contextual knowledge graph based on the content of the slide deck. A voice input is received from a presenter. Using the knowledge graph, the voice input is analyzed to determine an action to be performed by a presentation program during the presentation. The action is translated into one or more commands executable by the presentation program to perform the action, and the one or more commands are sent to a client device executing the presentation program.
    Type: Grant
    Filed: August 22, 2019
    Date of Patent: March 29, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Amit Srivastava, Dachuan Zhang
  • Patent number: 11270059
    Abstract: A textual user input is received and a plurality of different text-to-content models are run on the textual user input. A selection system attempts to identify a suggested content item, based upon the outputs of the text-to-content models. The selection system first attempts to generate a completed suggestion based on outputs from a single text-to-content model. It then attempts to mix the outputs of the text-to-content models to obtain a completed content suggestion.
    Type: Grant
    Filed: August 27, 2019
    Date of Patent: March 8, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Xiaozhi Yu, Gregory Alexander DePaul, Youjun Liu, Amit Srivastava
  • Publication number: 20220038580
    Abstract: The present disclosure relates to processing operations configured to provide processing that automatically analyzes acoustic signals from attendees of a live presentation and automatically triggers corresponding reaction indications from results of analysis thereof. Exemplary reaction indications provide feedback for live presentations that can be presented in real-time (or near real-time) without requiring a user to manually take action to provide any feedback. As a non-limiting example, reaction indications may be presented in a form that is easy to visualize and understand such as emojis or icons. Another example of a reaction indication is a graphical user interface (GUI) notification that provides a predictive indication of user intent derived from analysis of acoustic signals.
    Type: Application
    Filed: August 3, 2020
    Publication date: February 3, 2022
    Inventors: Ji Li, Amit Srivastava, Derek Martin Johnson, Priyanka Vikram Sinha, Konstantin Seleskerov, Gencheng Wu
  • Publication number: 20220020375
    Abstract: Methods, systems, and computer programs are presented for detecting a mission changes in a conversation. A user utterance from a user device is received. The user utterance is part of a conversation with an intelligent assistant. The conversation includes preceding user utterances in pursuit of a first mission. It is determined that the user utterance indicates a mission change from the first mission to a second mission based on an application of a machine-learned model to the user utterance and the preceding user utterances. The machine-learned model has been trained repeatedly with past utterances of other users over a time period, the determining based on a certainty of the indication satisfying a certainty threshold. Responsive to the determining that the user utterance indicates the mission change from the first mission to a second mission, a reply to the user utterance is generated to further the second mission rather than the first mission.
    Type: Application
    Filed: September 30, 2021
    Publication date: January 20, 2022
    Inventors: Stefan Schoenmackers, Amit Srivastava, Lawrence William Colagiovanni, Sanjika Hewavitharana, Ajinkya Gorakhnath Kale, Vinh Khuc