Patents by Inventor Moin Nabi
Moin Nabi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250028699Abstract: A method for training a machine learning model using self-contrastive decorrelation is provided. The method comprises training a machine learning model by receiving a sentence including text, performing a first encoding operation on the sentence, performing a second encoding operation on the sentence, mapping the first vector representation on which a first augmentation operation is performed to a first high dimensional vector representation and the second vector representation on which a first augmentation operation is performed to a second high dimensional vector representation, generating a correlation matrix using the first high dimensional vector representation and the second high dimensional vector representation, and performing a decorrelation operation on the correlation matrix.Type: ApplicationFiled: October 2, 2024Publication date: January 23, 2025Inventors: Tassilo Klein, Moin Nabi
-
Patent number: 12147764Abstract: Disclosed herein are system, method, and computer program product embodiments for similarity scoring of sentences, while restricting distances between tokenized pairs in the sentences. An embodiment operates by determining a similarity of tokens between a first sequence of tokens and a second sequence of tokens to generate token pairs, determining a distance of relative positioning of token pairs in the first tokenized sequence and the second tokenized sequence and generating a score value that indicates the degree to which the first sentence matches the second sentence based on restricting matches to a maximum value of the distance of relative positions of the token pairs in the first tokenized sequence and the second tokenized sequence.Type: GrantFiled: June 1, 2021Date of Patent: November 19, 2024Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Patent number: 12135701Abstract: A method for training a machine learning model using self-contrastive decorrelation is provided. The method comprises training a machine learning model by receiving a sentence including text, performing a first encoding operation on the sentence, performing a second encoding operation on the sentence, mapping the first vector representation on which a first augmentation operation is performed to a first high dimensional vector representation and the second vector representation on which a first augmentation operation is performed to a second high dimensional vector representation, generating a correlation matrix using the first high dimensional vector representation and the second high dimensional vector representation, and performing a decorrelation operation on the correlation matrix.Type: GrantFiled: December 15, 2022Date of Patent: November 5, 2024Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Publication number: 20240202176Abstract: A method for training a machine learning model using self-contrastive decorrelation is provided. The method comprises training a machine learning model by receiving a sentence including text, performing a first encoding operation on the sentence, performing a second encoding operation on the sentence, mapping the first vector representation on which a first augmentation operation is performed to a first high dimensional vector representation and the second vector representation on which a first augmentation operation is performed to a second high dimensional vector representation, generating a correlation matrix using the first high dimensional vector representation and the second high dimensional vector representation, and performing a decorrelation operation on the correlation matrix.Type: ApplicationFiled: December 15, 2022Publication date: June 20, 2024Inventors: Tassilo Klein, Moin Nabi
-
Publication number: 20240202281Abstract: A method comprises generating vectors of sentences, the sentences including text, performing an augmentation operation on a first vector of the vectors and a second vector of the vectors, the augmentation operation comprising: merging the first vector with the second vector such that at least a first element of the first vector is combined with a second element of the second vector, wherein the first element is representative of a first sentence of the plurality of sentences and the second element represents a second sentence of the plurality of sentences, inputting, the first vector that is merged with the second vector, into a first layer, and generating a third vector that is based on the first vector that is merged with the second vector, the third vector including a third element that represents the first sentence and the second sentence.Type: ApplicationFiled: December 16, 2022Publication date: June 20, 2024Inventors: Tassilo Klein, Moin Nabi
-
Publication number: 20240202546Abstract: A method and system for generating inputs specific to a multimodal learning based machine learning model. The generating of inputs specific to the multimodal learning based machine learning model from training dataset comprises image data, the generating including: determining a plurality of permutations based on the one or more images included in the image data, and applying orthogonal super-positioning relative to the plurality of permutations. The method further comprises providing the inputs that are generated, based on the orthogonal super-positioning, into the multimodal learning based machine learning model, and generating, by the multimodal learning based machine learning model, a prediction specific to at least one image of the one or more images.Type: ApplicationFiled: December 16, 2022Publication date: June 20, 2024Inventors: Tassilo Klein, Moin Nabi
-
Patent number: 12001802Abstract: Disclosed herein are various embodiments for training and enriching a natural language processing system. An embodiment operates by identifying a natural language processor (NLP) trained on a first set of documents, wherein the NLP is trained to perform a set of functionality based on the first set of documents. An industry, set of words corresponding to the industry, and set of sentences including at least a subset of the set of words in which the NLP is to be configured to perform the set of functionality are identified. A set of sentences that exceed a similarity threshold are identified. The NLP is trained with the subset of the set of sentences that exceed the similarity threshold, wherein the trained NLP with the subset is configured to perform the set of functionality within the industry with a greater accuracy than NLP trained on only the first set of documents.Type: GrantFiled: June 3, 2021Date of Patent: June 4, 2024Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Publication number: 20240161255Abstract: According to an aspect, there is provided a method that includes receiving a first image and a second image as inputs for contrastive self-supervised learning; applying a high dynamic range augmentation to the first image to generate a first pair of views; applying the high dynamic range augmentation to the second image to generate a second pair of views; applying a first convolutional neural network to the first pair of views to output a first pair of encoded representations; applying a second convolutional neural network to the second pair of views to output a second pair of encoded representations; projecting the first pair of encoded representations to form first projected representations; projecting the second pair of encoded representations to form second projected representations; and training a machine learning model using the high dynamic range augmentations and an objective function that provides contrastive self-supervised learning.Type: ApplicationFiled: November 11, 2022Publication date: May 16, 2024Inventors: Tassilo Klein, Moin Nabi
-
Publication number: 20240127616Abstract: A method for text-image integration is provided. The method may include receiving a question related to pairable data comprising text data and image data. Embeddings are generated from the text tokens and image encodings. Embeddings are generated from the text tokens and image encodings. The embeddings include text embeddings and image embeddings. A spectral conversion of the text embeddings and the image embeddings is performed to generate spectral data. The spectral data is processed to extract text-image features. The text-image features are processed to generate inferred answers to the question.Type: ApplicationFiled: October 18, 2022Publication date: April 18, 2024Inventors: Stefan Lionar, Tassilo Klein, Moin Nabi
-
Patent number: 11893347Abstract: Disclosed herein are system, method, and computer program product embodiments for utilizing non-RAM memory to implement machine learning configured with a meta-learning training set (small dataset), to create a common-sense predictive language model, thus boosting the performance for downstream tasks. An embodiment operates by receiving a base sentence and perturbation sentences as an input and tokenizing the input to generate a sequence of tokens. Tokens of the semantic perturbation sentences are embedded with tokens of the base sentence as contextually similar tokens pairs to generate training data and classified to capture relationships of the base sentence and the perturbation sentences to generate a classification, which is used to train a language model.Type: GrantFiled: June 1, 2021Date of Patent: February 6, 2024Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Patent number: 11848017Abstract: Disclosed herein are various embodiments for pronoun-based natural language processing. An embodiment operates by receiving a plurality of text-based sentences each comprising a plurality of words, and each text-based sentence including a pronoun. A plurality of candidate nouns are identified amongst the plurality of words. A trigger word is identified from the plurality of words, wherein the trigger word is associated with both the pronoun and one of the plurality of candidate nouns. A score for each of the candidate nouns is received based on a relationship with the trigger word. The candidate noun with a highest score is selected as being associated with the pronoun.Type: GrantFiled: June 10, 2021Date of Patent: December 19, 2023Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Patent number: 11816188Abstract: A machine learning model may be trained based on a training set including training images depicting various base objects. Each training images may be associated with a ground-truth segmentation corresponding to one or more pixel-wise labels. The machine learning model may be trained to learn base class prototypes corresponding to segmentations of classes of similar base objects. The machine learning model may be further trained based on a support image depicting a novel object. The support image may be associated with an image-level label corresponding to the novel object. The machine learning model may be trained to learn, based on a base class prototype identified as being similar to the support image, a novel class prototype corresponding to the novel object. The trained machine learning model to may be applied to segment a query image. Related systems and computer program products are also provided.Type: GrantFiled: August 31, 2020Date of Patent: November 14, 2023Assignee: SAP SEInventors: Moin Nabi, Tassilo Klein, Hasnain Raza, Sayyed Mahdyar Ravanbakhsh
-
Patent number: 11687733Abstract: In an example embodiment, a self-supervised learning task is used for training commonsense-aware representations in a minimally supervised fashion and a pair level mutual-exclusive loss is used to enforce commonsense knowledge during representation learning. This helps to exploit the mutual-exclusive nature of the training samples of commonsense reasoning corpora. Given two pieces of input where the only difference between them are trigger pieces of data, it may be postulated that the pairwise pronoun disambiguation is mutually exclusive. This idea is formulated using a contrastive loss and then this is used to update the language model.Type: GrantFiled: June 25, 2020Date of Patent: June 27, 2023Assignee: SAP SEInventors: Tassilo Klein, Moin Nabi
-
Patent number: 11631181Abstract: A machine learning model may be trained on a first task of puzzle solving before being tuned on a second task of image analysis. The training of the machine learning model may be self-supervised whereas the tuning of the machine learning model may be supervised. The training data may include a puzzle generated to include multiple imaging modalities. The puzzle may be generated by shuffling a position of the pieces forming an original image. The machine learning model may be trained to perform the first task by reassembling the pieces in the puzzle to generate a reconstruction of the original image. Upon being trained to perform the first task and tuned to perform the second task, the machine learning model may be deployed to perform the second task. The second task may be an image segmentation task such as tumor segmentation and a regression task such as survival prediction.Type: GrantFiled: September 2, 2020Date of Patent: April 18, 2023Assignee: SAP SEInventors: Aiham Taleb, Moin Nabi, Tassilo Klein
-
Publication number: 20230072255Abstract: Aspects of the current subject matter are directed to a variational encoder that takes into account group characteristics of data elements of a dataset. For example, a prior adjusted variational autoencoder takes into account that not all attributes in the dataset naturally follow a normal Gaussian distribution N(0,1). To illustrate by way of an example, data from the dataset may be separated into groups in which elements in a group share group characteristics; for each group, a group representation N(mu_g, sigma_g) is calculated. And, for example, other attributes of data in the dataset do not depend on the group, and the associated data elements continue to follow the normal Gaussian distribution N(0,1). The representation may introduce a flexibility in which encodings of group-related attributes will be encoded close together in the content part instead of being close to an arbitrarily chosen point.Type: ApplicationFiled: August 18, 2021Publication date: March 9, 2023Inventors: Tassilo Klein, Moin Nabi, Jan Nikolas Morshuis
-
Patent number: 11544532Abstract: A method may include training a machine learning model to perform a first task before training the machine learning model to perform the second task. The machine learning model includes a generator network and a discriminator network. The training includes training, based on a first training sample associated with the first task, the discriminator network to perform the first task. The generator network may be trained to generate a first synthetic training sample emulating the first training sample. The discriminator network trained to perform the first task may be reinitialized in order for the discriminator network to be trained, based on a second training sample, to perform the second task. The reinitialized discriminator network may be further retrained, based on the first synthetic training sample, to perform the first task. Related systems and articles of manufacture, including computer program products, are also provided.Type: GrantFiled: December 11, 2019Date of Patent: January 3, 2023Assignee: SAP SEInventors: Mihai Puscas, Moin Nabi, Tassilo Klein, Oleksiy Ostapenko
-
Publication number: 20220414482Abstract: Aspects of the current subject matter are directed to a system in which knowledge graphs are incorporated with visual question answering. A knowledge graph is integrated into a visual question answering system to provide additional knowledge from one or more sources to answer a question about an image. Aspects of the current subject matter are directed to a neural network approach that combines methods of image feature extraction and questions processing with a neural network, such as a graph neural network, that operates on knowledge graphs. The graph neural network takes input vector representations of the nodes as inputs and combines them according to their relationships into question-specific representations. The question-specific representations are then processed with the image features and the question features to generate an answer.Type: ApplicationFiled: June 29, 2021Publication date: December 29, 2022Inventors: Artur Speiser, Moin Nabi, Tassilo Klein
-
Publication number: 20220399015Abstract: Disclosed herein are various embodiments for pronoun-based natural language processing. An embodiment operates by receiving a plurality of text-based sentences each comprising a plurality of words, and each text-based sentence including a pronoun. A plurality of candidate nouns are identified amongst the plurality of words. A trigger word is identified from the plurality of words, wherein the trigger word is associated with both the pronoun and one of the plurality of candidate nouns. A score for each of the candidate nouns is received based on a relationship with the trigger word.Type: ApplicationFiled: June 10, 2021Publication date: December 15, 2022Inventors: TASSILO KLEIN, Moin Nabi
-
Publication number: 20220391592Abstract: Disclosed herein are various embodiments for training and enriching a natural language processing system. An embodiment operates by identifying a natural language processor (NLP) trained on a first set of documents, wherein the NLP is trained to perform a set of functionality based on the first set of documents. An industry, set of words corresponding to the industry, and set of sentences including at least a subset of the set of words in which the NLP is to be configured to perform the set of functionality are identified. A set of sentences that exceed a similarity threshold are identified. The NLP is trained with the subset of the set of sentences that exceed the similarity threshold, wherein the trained NLP with the subset is configured to perform the set of functionality within the industry with a greater accuracy than NLP trained on only the first set of documents.Type: ApplicationFiled: June 3, 2021Publication date: December 8, 2022Inventors: TASSILO KLEIN, Moin Nabi
-
Publication number: 20220382980Abstract: Disclosed herein are system, method, and computer program product embodiments for similarity scoring of sentences, while restricting distances between tokenized pairs in the sentences. An embodiment operates by determining a similarity of tokens between a first sequence of tokens and a second sequence of tokens to generate token pairs, determining a distance of relative positioning of token pairs in the first tokenized sequence and the second tokenized sequence and generating a score value that indicates the degree to which the first sentence matches the second sentence based on restricting matches to a maximum value of the distance of relative positions of the token pairs in the first tokenized sequence and the second tokenized sequence.Type: ApplicationFiled: June 1, 2021Publication date: December 1, 2022Inventors: Tassilo KLEIN, Moin NABI