Patents by Inventor Joseph Shtok
Joseph Shtok has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11954144Abstract: An example system includes a processor to receive, a randomly generated alpha-map, a pair of training images, and a pair of training texts associated with the pair of training images. The processor is to generate a blended image based on the randomly generated alpha-map and the pair of training images. The processor is to train a visual language grounding model to separate the blended image into a pair of heatmaps identifying portions of the blended image corresponding to each of the training images using a separation loss.Type: GrantFiled: August 26, 2021Date of Patent: April 9, 2024Assignee: International Business Machines CorporationInventors: Assaf Arbelle, Leonid Karlinsky, Sivan Doveh, Joseph Shtok, Amit Alfassy
-
Patent number: 11816593Abstract: Embodiments may include novel techniques for Task-Adaptive Feature Sub-Space Learning (TAFSSL). For example, in an embodiment, a method may be implemented in a computer system comprising a processor, memory accessible by the processor, and computer program instructions stored in the memory and executable by the processor, the method comprising: training a machine learning system to classify features in images by: generating a sample set comprising one or a few labeled training samples and one or a few additional samples containing instances of target classes, performing dimensionality reduction computed on the samples in the sample set to form a dimension reduced sub-space, generating class representatives in the dimension reduced sub-space using clustering, and classifying features in images using the trained machine learning system.Type: GrantFiled: August 23, 2020Date of Patent: November 14, 2023Assignee: International Business Machines CorporationInventors: Leonid Karlinsky, Joseph Shtok, Eliyahu Schwartz
-
Publication number: 20230298373Abstract: An example system includes a processor to receive detected chart regions in a page of a document. The processor is to produce, via a graphical elements detector, predicted heatmaps and bounding boxes for graphical objects in the detected chart regions. The processor is also to apply chart type specific analysis algorithm to the predicted heatmaps and bounding boxes, to extract tabular chart data. The processor can then generate an output data file and a visualization based on the predicted heatmap and the extracted tabular chart data.Type: ApplicationFiled: March 21, 2022Publication date: September 21, 2023Inventors: Joseph SHTOK, Leonid KARLINSKY, Sivan HARARY, Ophir AZULAI
-
Patent number: 11620796Abstract: A method, a computer program product, and a computer system for transferring knowledge from an expert to a user using a mixed reality rendering. The method includes determining a user perspective of a user viewing an object on which a procedure is to be performed. The method includes determining an anchoring of the user perspective to an expert perspective, the expert perspective associated with an expert providing a demonstration of the procedure. The method includes generating a virtual rendering of the expert at the user perspective based on the anchoring at a scene viewed by the user, the virtual rendering corresponding to the demonstration of the procedure as performed by the expert. The method includes generating a mixed reality environment in which the virtual rendering of the expert is shown in the scene viewed by the user.Type: GrantFiled: March 1, 2021Date of Patent: April 4, 2023Assignee: International Business Machines CorporationInventors: Joseph Shtok, Leonid Karlinsky, Adi Raz Goldfarb, Oded Dubovsky
-
Publication number: 20230061647Abstract: An example system includes a processor to receive, a randomly generated alpha-map, a pair of training images, and a pair of training texts associated with the pair of training images. The processor is to generate a blended image based on the randomly generated alpha-map and the pair of training images. The processor is to train a visual language grounding model to separate the blended image into a pair of heatmaps identifying portions of the blended image corresponding to each of the training images using a separation loss.Type: ApplicationFiled: August 26, 2021Publication date: March 2, 2023Inventors: Assaf ARBELLE, Leonid KARLINSKY, Sivan DOVEH, Joseph SHTOK, Amit ALFASSY
-
Patent number: 11481623Abstract: There is provided a method of computing a model for classification of a data element, comprising: feeding a plurality of labeled auxiliary data elements and at least one labeled relevant data elements for each of a plurality of relevant classification categories, into a synthesizer component, for outputting at least one synthetic labeled relevant data element for each one of the plurality of relevant classification categories, feeding the synthetic labeled relevant data elements and a plurality of unlabelled training relevant data elements into a domain adaptation component, for outputting a respective relevant classification category for each of the plurality of unlabelled training relevant data elements, iteratively end-to-end training the synthesis component and the domain adaptation component, and providing the trained domain adaption component, for outputting a relevant classification category in response to an input of a query unlabelled data element.Type: GrantFiled: September 25, 2019Date of Patent: October 25, 2022Assignee: International Business Machines CorporationInventors: Leonid Karlinsky, Joseph Shtok
-
Patent number: 11475313Abstract: Embodiments of the present systems and methods may provide techniques to discover features such as object categories that provide improved accuracy and performance. For example, in an embodiment, a method may comprise extracting, at the computer system, features from a dataset comprising a plurality of data samples using a backbone neural network to form a features vector for each data sample, training, at the computer system, using the features vectors for at least some of the plurality of data samples, an unsupervised generative probabilistic model to perform clustering of extracted features of the at least some of the plurality of data samples by minimizing a negative Log-Likelihood function, wherein clusters of extracted features form categories, and categorizing, at the computer system, at least some different data samples of the plurality of data samples, into the formed categories.Type: GrantFiled: February 13, 2020Date of Patent: October 18, 2022Assignee: International Business Machines CorporationInventors: Leonid Karlinsky, Joseph Shtok
-
Publication number: 20220277524Abstract: A method, a computer program product, and a computer system for transferring knowledge from an expert to a user using a mixed reality rendering. The method includes determining a user perspective of a user viewing an object on which a procedure is to be performed. The method includes determining an anchoring of the user perspective to an expert perspective, the expert perspective associated with an expert providing a demonstration of the procedure. The method includes generating a virtual rendering of the expert at the user perspective based on the anchoring at a scene viewed by the user, the virtual rendering corresponding to the demonstration of the procedure as performed by the expert. The method includes generating a mixed reality environment in which the virtual rendering of the expert is shown in the scene viewed by the user.Type: ApplicationFiled: March 1, 2021Publication date: September 1, 2022Inventors: Joseph Shtok, Leonid Karlinsky, Adi Raz Goldfarb, ODED DUBOVSKY
-
Publication number: 20220207410Abstract: A computing system, computer program product, and computer-implemented method for incremental learning without forgetting for a classification/detection model are provided. The method includes receiving, at a computing system, a classification/detection model including a base embedding space and corresponding base embedding vectors that are based on a base training dataset including base classes. The method also includes expanding the classification/detection model to account for a new training dataset including new classes by lifting the base embedding space to add an orthogonal subspace for the new classes, producing an expanded embedding space and corresponding expanded embedding vectors that are of a higher dimension than the base embedding vectors.Type: ApplicationFiled: December 28, 2020Publication date: June 30, 2022Inventors: Sivan HARARY, Leonid KARLINSKY, Joseph SHTOK
-
Publication number: 20220188639Abstract: In an approach for augmenting a neural network with a self-supervised mechanism, a processor trains a first neural network using labeled data, the first neural network configured for a main task. A processor trains a second neural network using the labeled data and unlabeled data, the second neural network being an additional component to the first neural network. A processor computes a gradient using a second loss of the second neural network based on the unlabeled data.Type: ApplicationFiled: December 14, 2020Publication date: June 16, 2022Inventors: Leonid Karlinsky, Joseph Shtok
-
Publication number: 20220180182Abstract: A system and method for generating hard training data from easy training data. Training data including visual data with synthetic semantic implants (“VSSI”) having at least one cue is received. An annotator identifies at least one cue in the VSSI and annotates the VSSI to indicate the cue to create a modified training data set. A data scrambler removes at least one cue from the VSSI to create the tagged training data, which can then be used to train a classifier to identify transitions between segments when the cues are not present.Type: ApplicationFiled: December 9, 2020Publication date: June 9, 2022Inventors: Daniel Nechemia Rotman, Yevgeny Yaroker, Udi Barzelay, Joseph Shtok
-
Publication number: 20220058505Abstract: Embodiments may include novel techniques for Task-Adaptive Feature Sub-Space Learning (TAFSSL). For example, in an embodiment, a method may be implemented in a computer system comprising a processor, memory accessible by the processor, and computer program instructions stored in the memory and executable by the processor, the method comprising: training a machine learning system to classify features in images by: generating a sample set comprising one or a few labeled training samples and one or a few additional samples containing instances of target classes, performing dimensionality reduction computed on the samples in the sample set to form a dimension reduced sub-space, generating class representatives in the dimension reduced sub-space using clustering, and classifying features in images using the trained machine learning system.Type: ApplicationFiled: August 23, 2020Publication date: February 24, 2022Inventors: Leonid Karlinsky, Joseph Shtok, Eliyahu Schwartz
-
Patent number: 11176417Abstract: A system for generating a set of digital image features, comprising at least one hardware processor adapted for: producing a plurality of input groups of features, each produced by extracting a plurality of features from one of a plurality of digital images; computing an output group of features by inputting the plurality of input groups of features into at least one prediction model trained to produce a model group of features in response to at least two groups of features, such that a model set of labels indicative of the model group of features is similar, according to at least one similarity test, to a target set of labels computed by applying at least one set operator to a plurality of input sets of labels each indicative of one of the at least two groups of features; and providing the output group of features to at least one other processor.Type: GrantFiled: October 6, 2019Date of Patent: November 16, 2021Assignee: International Business Machines CorporationInventors: Amit Aides, Amit Alfassy, Leonid Karlinsky, Joseph Shtok
-
Patent number: 11176696Abstract: Embodiments provide 3D coordinates for points in a scene that are observed to be in the correct physical position in a series of images. A method may comprise obtaining a plurality of images including a base image having at least one annotated point corresponding to a point of an object shown in the base image, and a plurality of side images showing the object from different viewpoints than the base image, wherein the plurality of side images are given with the camera poses relative to the base image, extracting from at least some of the side images, image patches showing the annotated point, wherein a plurality of sets of image patches are extracted, wherein a set of image patches is extracted at a plurality of corresponding candidate depth values, classifying each set as having a corresponding candidate depth value that is correct or incorrect, and outputting a correct depth value.Type: GrantFiled: May 13, 2019Date of Patent: November 16, 2021Assignee: International Business Machines CorporationInventors: Joseph Shtok, Yochay Tzur
-
Publication number: 20210256391Abstract: Embodiments of the present systems and methods may provide techniques to discover features such as object categories that provide improved accuracy and performance. For example, in an embodiment, a method may comprise extracting, at the computer system, features from a dataset comprising a plurality of data samples using a backbone neural network to form a features vector for each data sample, training, at the computer system, using the features vectors for at least some of the plurality of data samples, an unsupervised generative probabilistic model to perform clustering of extracted features of the at least some of the plurality of data samples by minimizing a negative Log-Likelihood function, wherein clusters of extracted features form categories, and categorizing, at the computer system, at least some different data samples of the plurality of data samples, into the formed categories.Type: ApplicationFiled: February 13, 2020Publication date: August 19, 2021Inventors: Leonid Karlinsky, Joseph Shtok
-
Publication number: 20210103771Abstract: A system for generating a set of digital image features, comprising at least one hardware processor adapted for: producing a plurality of input groups of features, each produced by extracting a plurality of features from one of a plurality of digital images; computing an output group of features by inputting the plurality of input groups of features into at least one prediction model trained to produce a model group of features in response to at least two groups of features, such that a model set of labels indicative of the model group of features is similar, according to at least one similarity test, to a target set of labels computed by applying at least one set operator to a plurality of input sets of labels each indicative of one of the at least two groups of features; and providing the output group of features to at least one other processorType: ApplicationFiled: October 6, 2019Publication date: April 8, 2021Inventors: Amit Aides, Amit Alfassy, Leonid Karinsky, Joseph Shtok
-
Publication number: 20210089880Abstract: There is provided a method of computing a model for classification of a data element, comprising: feeding a plurality of labeled auxiliary data elements and at least one labeled relevant data elements for each of a plurality of relevant classification categories, into a synthesizer component, for outputting at least one synthetic labeled relevant data element for each one of the plurality of relevant classification categories, feeding the synthetic labeled relevant data elements and a plurality of unlabelled training relevant data elements into a domain adaptation component, for outputting a respective relevant classification category for each of the plurality of unlabelled training relevant data elements, iteratively end-to-end training the synthesis component and the domain adaptation component, and providing the trained domain adaption component, for outputting a relevant classification category in response to an input of a query unlabelled data element.Type: ApplicationFiled: September 25, 2019Publication date: March 25, 2021Inventors: Leonid Karlinsky, Joseph Shtok
-
Patent number: 10878297Abstract: Embodiments may provide visual recognition techniques that provide improved recognition accuracy and reduced use of computing resources in cases where only a small set of examples is used to train an unlimited number of recognized categories. For example, in an embodiment, a computer-implemented method of visual recognition may comprise generating a plurality of personal embedding models, each personal embedding model including categories relating to a person, and object, or a subject, wherein at least some of the personal embedding models include at least some different categories, training the plurality of personal embedding models using image training data having a limited number of examples of each category, wherein the examples of each category are used to train more than one category in more than one of the personal embedding models, recognizing images from image data using the plurality of personal embedding models, and outputting information relating to the recognized images.Type: GrantFiled: August 29, 2018Date of Patent: December 29, 2020Assignee: International Business Machines CorporationInventors: Oded Dubovsky, Leonid Karlinsky, Joseph Shtok
-
Publication number: 20200364893Abstract: Embodiments provide 3D coordinates for points in a scene that are observed to be in the correct physical position in a series of images. A method may comprise obtaining a plurality of images including a base image having at least one annotated point corresponding to a point of an object shown in the base image, and a plurality of side images showing the object from different viewpoints than the base image, wherein the plurality of side images are given with the camera poses relative to the base image, extracting from at least some of the side images, image patches showing the annotated point, wherein a plurality of sets of image patches are extracted, wherein a set of image patches is extracted at a plurality of corresponding candidate depth values, classifying each set as having a corresponding candidate depth value that is correct or incorrect, and outputting a correct depth value.Type: ApplicationFiled: May 13, 2019Publication date: November 19, 2020Inventors: JOSEPH SHTOK, YOCHAY TZUR
-
Patent number: 10832096Abstract: A method can include learning a common embedding space and a set of parameters for each one of a plurality of sets of mixture models, wherein one mixture model is associated with one class of objects within a set of object categories. The method can also include adding new mixture models to the set of mixture models to support novel categories based on a set of example embedding vectors computed for each one of the novel categories. Additionally, the method includes detecting in images a plurality of boxes with associated labels and corresponding confidence scores, wherein the boxes correspond to image regions comprising objects of both known categories and the novel categories. Furthermore, the method includes, given a query image, executing an instruction based on the common embedding space and the set of mixture models, the instruction comprising identifying objects from both categories in the query image.Type: GrantFiled: January 7, 2019Date of Patent: November 10, 2020Assignee: International Business Machines CorporationInventors: Leonid Karlinsky, Eliyahu Schwartz, Joseph Shtok, Mattias Marder, Sivan Harary