Patents by Inventor Joseph Shtok

Joseph Shtok has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Training visual language grounding models using separation loss

Patent number: 11954144

Abstract: An example system includes a processor to receive, a randomly generated alpha-map, a pair of training images, and a pair of training texts associated with the pair of training images. The processor is to generate a blended image based on the randomly generated alpha-map and the pair of training images. The processor is to train a visual language grounding model to separate the blended image into a pair of heatmaps identifying portions of the blended image corresponding to each of the training images using a separation loss.

Type: Grant

Filed: August 26, 2021

Date of Patent: April 9, 2024

Assignee: International Business Machines Corporation

Inventors: Assaf Arbelle, Leonid Karlinsky, Sivan Doveh, Joseph Shtok, Amit Alfassy
TAFSSL: task adaptive feature sub-space learning for few-shot learning

Patent number: 11816593

Abstract: Embodiments may include novel techniques for Task-Adaptive Feature Sub-Space Learning (TAFSSL). For example, in an embodiment, a method may be implemented in a computer system comprising a processor, memory accessible by the processor, and computer program instructions stored in the memory and executable by the processor, the method comprising: training a machine learning system to classify features in images by: generating a sample set comprising one or a few labeled training samples and one or a few additional samples containing instances of target classes, performing dimensionality reduction computed on the samples in the sample set to form a dimension reduced sub-space, generating class representatives in the dimension reduced sub-space using clustering, and classifying features in images using the trained machine learning system.

Type: Grant

Filed: August 23, 2020

Date of Patent: November 14, 2023

Assignee: International Business Machines Corporation

Inventors: Leonid Karlinsky, Joseph Shtok, Eliyahu Schwartz
DETECTING GRAPHICAL ELEMENTS IN CHARTS USING PREDICTED HEATMAPS

Publication number: 20230298373

Abstract: An example system includes a processor to receive detected chart regions in a page of a document. The processor is to produce, via a graphical elements detector, predicted heatmaps and bounding boxes for graphical objects in the detected chart regions. The processor is also to apply chart type specific analysis algorithm to the predicted heatmaps and bounding boxes, to extract tabular chart data. The processor can then generate an output data file and a visualization based on the predicted heatmap and the extracted tabular chart data.

Type: Application

Filed: March 21, 2022

Publication date: September 21, 2023

Inventors: Joseph SHTOK, Leonid KARLINSKY, Sivan HARARY, Ophir AZULAI
Expert knowledge transfer using egocentric video

Patent number: 11620796

Abstract: A method, a computer program product, and a computer system for transferring knowledge from an expert to a user using a mixed reality rendering. The method includes determining a user perspective of a user viewing an object on which a procedure is to be performed. The method includes determining an anchoring of the user perspective to an expert perspective, the expert perspective associated with an expert providing a demonstration of the procedure. The method includes generating a virtual rendering of the expert at the user perspective based on the anchoring at a scene viewed by the user, the virtual rendering corresponding to the demonstration of the procedure as performed by the expert. The method includes generating a mixed reality environment in which the virtual rendering of the expert is shown in the scene viewed by the user.

Type: Grant

Filed: March 1, 2021

Date of Patent: April 4, 2023

Assignee: International Business Machines Corporation

Inventors: Joseph Shtok, Leonid Karlinsky, Adi Raz Goldfarb, Oded Dubovsky
TRAINING VISUAL LANGUAGE GROUNDING MODELS USING SEPARATION LOSS

Publication number: 20230061647

Abstract: An example system includes a processor to receive, a randomly generated alpha-map, a pair of training images, and a pair of training texts associated with the pair of training images. The processor is to generate a blended image based on the randomly generated alpha-map and the pair of training images. The processor is to train a visual language grounding model to separate the blended image into a pair of heatmaps identifying portions of the blended image corresponding to each of the training images using a separation loss.

Type: Application

Filed: August 26, 2021

Publication date: March 2, 2023

Inventors: Assaf ARBELLE, Leonid KARLINSKY, Sivan DOVEH, Joseph SHTOK, Amit ALFASSY
Systems and methods for training a model using a few-shot classification process

Patent number: 11481623

Abstract: There is provided a method of computing a model for classification of a data element, comprising: feeding a plurality of labeled auxiliary data elements and at least one labeled relevant data elements for each of a plurality of relevant classification categories, into a synthesizer component, for outputting at least one synthetic labeled relevant data element for each one of the plurality of relevant classification categories, feeding the synthetic labeled relevant data elements and a plurality of unlabelled training relevant data elements into a domain adaptation component, for outputting a respective relevant classification category for each of the plurality of unlabelled training relevant data elements, iteratively end-to-end training the synthesis component and the domain adaptation component, and providing the trained domain adaption component, for outputting a relevant classification category in response to an input of a query unlabelled data element.

Type: Grant

Filed: September 25, 2019

Date of Patent: October 25, 2022

Assignee: International Business Machines Corporation

Inventors: Leonid Karlinsky, Joseph Shtok
Unsupervised, semi-supervised, and supervised learning using deep learning based probabilistic generative models

Patent number: 11475313

Abstract: Embodiments of the present systems and methods may provide techniques to discover features such as object categories that provide improved accuracy and performance. For example, in an embodiment, a method may comprise extracting, at the computer system, features from a dataset comprising a plurality of data samples using a backbone neural network to form a features vector for each data sample, training, at the computer system, using the features vectors for at least some of the plurality of data samples, an unsupervised generative probabilistic model to perform clustering of extracted features of the at least some of the plurality of data samples by minimizing a negative Log-Likelihood function, wherein clusters of extracted features form categories, and categorizing, at the computer system, at least some different data samples of the plurality of data samples, into the formed categories.

Type: Grant

Filed: February 13, 2020

Date of Patent: October 18, 2022

Assignee: International Business Machines Corporation

Inventors: Leonid Karlinsky, Joseph Shtok
EXPERT KNOWLEDGE TRANSFER USING EGOCENTRIC VIDEO

Publication number: 20220277524

Abstract: A method, a computer program product, and a computer system for transferring knowledge from an expert to a user using a mixed reality rendering. The method includes determining a user perspective of a user viewing an object on which a procedure is to be performed. The method includes determining an anchoring of the user perspective to an expert perspective, the expert perspective associated with an expert providing a demonstration of the procedure. The method includes generating a virtual rendering of the expert at the user perspective based on the anchoring at a scene viewed by the user, the virtual rendering corresponding to the demonstration of the procedure as performed by the expert. The method includes generating a mixed reality environment in which the virtual rendering of the expert is shown in the scene viewed by the user.

Type: Application

Filed: March 1, 2021

Publication date: September 1, 2022

Inventors: Joseph Shtok, Leonid Karlinsky, Adi Raz Goldfarb, ODED DUBOVSKY
INCREMENTAL LEARNING WITHOUT FORGETTING FOR CLASSIFICATION AND DETECTION MODELS

Publication number: 20220207410

Abstract: A computing system, computer program product, and computer-implemented method for incremental learning without forgetting for a classification/detection model are provided. The method includes receiving, at a computing system, a classification/detection model including a base embedding space and corresponding base embedding vectors that are based on a base training dataset including base classes. The method also includes expanding the classification/detection model to account for a new training dataset including new classes by lifting the base embedding space to add an orthogonal subspace for the new classes, producing an expanded embedding space and corresponding expanded embedding vectors that are of a higher dimension than the base embedding vectors.

Type: Application

Filed: December 28, 2020

Publication date: June 30, 2022

Inventors: Sivan HARARY, Leonid KARLINSKY, Joseph SHTOK
SEMI-SUPERVISED LEARNING OF TRAINING GRADIENTS VIA TASK GENERATION

Publication number: 20220188639

Abstract: In an approach for augmenting a neural network with a self-supervised mechanism, a processor trains a first neural network using labeled data, the first neural network configured for a main task. A processor trains a second neural network using the labeled data and unlabeled data, the second neural network being an additional component to the first neural network. A processor computes a gradient using a second loss of the second neural network based on the unlabeled data.

Type: Application

Filed: December 14, 2020

Publication date: June 16, 2022

Inventors: Leonid Karlinsky, Joseph Shtok
AUTOMATIC CREATION OF DIFFICULT ANNOTATED DATA LEVERAGING CUES

Publication number: 20220180182

Abstract: A system and method for generating hard training data from easy training data. Training data including visual data with synthetic semantic implants (“VSSI”) having at least one cue is received. An annotator identifies at least one cue in the VSSI and annotates the VSSI to indicate the cue to create a modified training data set. A data scrambler removes at least one cue from the VSSI to create the tagged training data, which can then be used to train a classifier to identify transitions between segments when the cues are not present.

Type: Application

Filed: December 9, 2020

Publication date: June 9, 2022

Inventors: Daniel Nechemia Rotman, Yevgeny Yaroker, Udi Barzelay, Joseph Shtok
TAFSSL: TASK ADAPTIVE FEATURE SUB-SPACE LEARNING FOR FEW-SHOT LEARNING

Publication number: 20220058505

Abstract: Embodiments may include novel techniques for Task-Adaptive Feature Sub-Space Learning (TAFSSL). For example, in an embodiment, a method may be implemented in a computer system comprising a processor, memory accessible by the processor, and computer program instructions stored in the memory and executable by the processor, the method comprising: training a machine learning system to classify features in images by: generating a sample set comprising one or a few labeled training samples and one or a few additional samples containing instances of target classes, performing dimensionality reduction computed on the samples in the sample set to form a dimension reduced sub-space, generating class representatives in the dimension reduced sub-space using clustering, and classifying features in images using the trained machine learning system.

Type: Application

Filed: August 23, 2020

Publication date: February 24, 2022

Inventors: Leonid Karlinsky, Joseph Shtok, Eliyahu Schwartz
Method and system for producing digital image features

Patent number: 11176417

Abstract: A system for generating a set of digital image features, comprising at least one hardware processor adapted for: producing a plurality of input groups of features, each produced by extracting a plurality of features from one of a plurality of digital images; computing an output group of features by inputting the plurality of input groups of features into at least one prediction model trained to produce a model group of features in response to at least two groups of features, such that a model set of labels indicative of the model group of features is similar, according to at least one similarity test, to a target set of labels computed by applying at least one set operator to a plurality of input sets of labels each indicative of one of the at least two groups of features; and providing the output group of features to at least one other processor.

Type: Grant

Filed: October 6, 2019

Date of Patent: November 16, 2021

Assignee: International Business Machines Corporation

Inventors: Amit Aides, Amit Alfassy, Leonid Karlinsky, Joseph Shtok
Point depth estimation from a set of 3D-registered images

Patent number: 11176696

Abstract: Embodiments provide 3D coordinates for points in a scene that are observed to be in the correct physical position in a series of images. A method may comprise obtaining a plurality of images including a base image having at least one annotated point corresponding to a point of an object shown in the base image, and a plurality of side images showing the object from different viewpoints than the base image, wherein the plurality of side images are given with the camera poses relative to the base image, extracting from at least some of the side images, image patches showing the annotated point, wherein a plurality of sets of image patches are extracted, wherein a set of image patches is extracted at a plurality of corresponding candidate depth values, classifying each set as having a corresponding candidate depth value that is correct or incorrect, and outputting a correct depth value.

Type: Grant

Filed: May 13, 2019

Date of Patent: November 16, 2021

Assignee: International Business Machines Corporation

Inventors: Joseph Shtok, Yochay Tzur
UNSUPERVISED, SEMI-SUPERVISED, AND SUPERVISED LEARNING USING DEEP LEARNING BASED PROBABILISTIC GENERATIVE MODELS

Publication number: 20210256391

Abstract: Embodiments of the present systems and methods may provide techniques to discover features such as object categories that provide improved accuracy and performance. For example, in an embodiment, a method may comprise extracting, at the computer system, features from a dataset comprising a plurality of data samples using a backbone neural network to form a features vector for each data sample, training, at the computer system, using the features vectors for at least some of the plurality of data samples, an unsupervised generative probabilistic model to perform clustering of extracted features of the at least some of the plurality of data samples by minimizing a negative Log-Likelihood function, wherein clusters of extracted features form categories, and categorizing, at the computer system, at least some different data samples of the plurality of data samples, into the formed categories.

Type: Application

Filed: February 13, 2020

Publication date: August 19, 2021

Inventors: Leonid Karlinsky, Joseph Shtok
METHOD AND SYSTEM FOR PRODUCING DIGITAL IMAGE FEATURES

Publication number: 20210103771

Abstract: A system for generating a set of digital image features, comprising at least one hardware processor adapted for: producing a plurality of input groups of features, each produced by extracting a plurality of features from one of a plurality of digital images; computing an output group of features by inputting the plurality of input groups of features into at least one prediction model trained to produce a model group of features in response to at least two groups of features, such that a model set of labels indicative of the model group of features is similar, according to at least one similarity test, to a target set of labels computed by applying at least one set operator to a plurality of input sets of labels each indicative of one of the at least two groups of features; and providing the output group of features to at least one other processor

Type: Application

Filed: October 6, 2019

Publication date: April 8, 2021

Inventors: Amit Aides, Amit Alfassy, Leonid Karinsky, Joseph Shtok
SYSTEMS AND METHODS FOR TRAINING A MODEL USING A FEW-SHOT CLASSIFICATION PROCESS

Publication number: 20210089880

Abstract: There is provided a method of computing a model for classification of a data element, comprising: feeding a plurality of labeled auxiliary data elements and at least one labeled relevant data elements for each of a plurality of relevant classification categories, into a synthesizer component, for outputting at least one synthetic labeled relevant data element for each one of the plurality of relevant classification categories, feeding the synthetic labeled relevant data elements and a plurality of unlabelled training relevant data elements into a domain adaptation component, for outputting a respective relevant classification category for each of the plurality of unlabelled training relevant data elements, iteratively end-to-end training the synthesis component and the domain adaptation component, and providing the trained domain adaption component, for outputting a relevant classification category in response to an input of a query unlabelled data element.

Type: Application

Filed: September 25, 2019

Publication date: March 25, 2021

Inventors: Leonid Karlinsky, Joseph Shtok
System and method for a visual recognition and/or detection of a potentially unbounded set of categories with limited examples per category and restricted query scope

Patent number: 10878297

Abstract: Embodiments may provide visual recognition techniques that provide improved recognition accuracy and reduced use of computing resources in cases where only a small set of examples is used to train an unlimited number of recognized categories. For example, in an embodiment, a computer-implemented method of visual recognition may comprise generating a plurality of personal embedding models, each personal embedding model including categories relating to a person, and object, or a subject, wherein at least some of the personal embedding models include at least some different categories, training the plurality of personal embedding models using image training data having a limited number of examples of each category, wherein the examples of each category are used to train more than one category in more than one of the personal embedding models, recognizing images from image data using the plurality of personal embedding models, and outputting information relating to the recognized images.

Type: Grant

Filed: August 29, 2018

Date of Patent: December 29, 2020

Assignee: International Business Machines Corporation

Inventors: Oded Dubovsky, Leonid Karlinsky, Joseph Shtok
POINT DEPTH ESTIMATION FROM A SET OF 3D-REGISTERED IMAGES

Publication number: 20200364893

Abstract: Embodiments provide 3D coordinates for points in a scene that are observed to be in the correct physical position in a series of images. A method may comprise obtaining a plurality of images including a base image having at least one annotated point corresponding to a point of an object shown in the base image, and a plurality of side images showing the object from different viewpoints than the base image, wherein the plurality of side images are given with the camera poses relative to the base image, extracting from at least some of the side images, image patches showing the annotated point, wherein a plurality of sets of image patches are extracted, wherein a set of image patches is extracted at a plurality of corresponding candidate depth values, classifying each set as having a corresponding candidate depth value that is correct or incorrect, and outputting a correct depth value.

Type: Application

Filed: May 13, 2019

Publication date: November 19, 2020

Inventors: JOSEPH SHTOK, YOCHAY TZUR
Representative-based metric learning for classification and few-shot object detection

Patent number: 10832096

Abstract: A method can include learning a common embedding space and a set of parameters for each one of a plurality of sets of mixture models, wherein one mixture model is associated with one class of objects within a set of object categories. The method can also include adding new mixture models to the set of mixture models to support novel categories based on a set of example embedding vectors computed for each one of the novel categories. Additionally, the method includes detecting in images a plurality of boxes with associated labels and corresponding confidence scores, wherein the boxes correspond to image regions comprising objects of both known categories and the novel categories. Furthermore, the method includes, given a query image, executing an instruction based on the common embedding space and the set of mixture models, the instruction comprising identifying objects from both categories in the query image.

Type: Grant

Filed: January 7, 2019

Date of Patent: November 10, 2020

Assignee: International Business Machines Corporation

Inventors: Leonid Karlinsky, Eliyahu Schwartz, Joseph Shtok, Mattias Marder, Sivan Harary

1 2 next