Patents by Inventor Karim Ahmed

Karim Ahmed has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Sequence-to-sequence prediction using a neural network model

Patent number: 11928600

Abstract: A method for sequence-to-sequence prediction using a neural network model includes generating an encoded representation based on an input sequence using an encoder of the neural network model and predicting an output sequence based on the encoded representation using a decoder of the neural network model. The neural network model includes a plurality of model parameters learned according to a machine learning process. At least one of the encoder or the decoder includes a branched attention layer. Each branch of the branched attention layer includes an interdependent scaling node configured to scale an intermediate representation of the branch by a learned scaling parameter. The learned scaling parameter depends on one or more other learned scaling parameters of one or more other interdependent scaling nodes of one or more other branches of the branched attention layer.

Type: Grant

Filed: January 30, 2018

Date of Patent: March 12, 2024

Assignee: Salesforce, Inc.

Inventors: Nitish Shirish Keskar, Karim Ahmed, Richard Socher
SEQUENCE-TO-SEQUENCE PREDICTION USING A NEURAL NETWORK MODEL

Publication number: 20190130273

Abstract: A method for sequence-to-sequence prediction using a neural network model includes generating an encoded representation based on an input sequence using an encoder of the neural network model and predicting an output sequence based on the encoded representation using a decoder of the neural network model. The neural network model includes a plurality of model parameters learned according to a machine learning process. At least one of the encoder or the decoder includes a branched attention layer. Each branch of the branched attention layer includes an interdependent scaling node configured to scale an intermediate representation of the branch by a learned scaling parameter. The learned scaling parameter depends on one or more other learned scaling parameters of one or more other interdependent scaling nodes of one or more other branches of the branched attention layer.

Type: Application

Filed: January 30, 2018

Publication date: May 2, 2019

Inventors: Nitish Shirish Keskar, Karim Ahmed, Richard SOCHER

Sequence-to-sequence prediction using a neural network model

SEQUENCE-TO-SEQUENCE PREDICTION USING A NEURAL NETWORK MODEL