Patents by Inventor Leonid Sigal

Leonid Sigal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20180096192
    Abstract: There is provided a system comprising a memory and a processor configured to receive a plurality of images, train a plurality of independent detectors for identifying a first plurality of objects in the plurality of images based on individual attributes including a first attribute and a second attribute, train a plurality of joint detectors for identifying the first plurality of objects in the plurality of images based on composite attributes including a plurality of composite attributes each including the first attribute and the second attribute, analyze features of the plurality of images to determine a difference between a first training performance of the independent detectors and a second training performance of the joint detectors, and select, based on the analyzing, between using the independent detectors and using the joint detectors for identifying a second plurality of objects in the plurality of images using a first new attribute and a second new attribute in the attribute database.
    Type: Application
    Filed: October 4, 2016
    Publication date: April 5, 2018
    Inventors: Leonid Sigal, Arik Shamir, Guy Rosenthal
  • Publication number: 20180089580
    Abstract: Techniques are disclosed for identifying and filtering outliers from a sample set of data prior to training a classifier on an object using the sample set. A data set including a plurality of samples used to train a classification model is retrieved. The samples in the data set have a feature dimensionality. A graph of the data set is built. Each node in the graph corresponds to a sample in the data set and edges connecting the nodes correspond to a measure of similarity between the nodes. The feature dimensionality of the sample data set is reduced based on a topology of the graph. One or more outliers in the data set are identified based on the reduced feature dimensionality.
    Type: Application
    Filed: September 27, 2016
    Publication date: March 29, 2018
    Inventors: Yanwei FU, Leonid SIGAL
  • Patent number: 9928442
    Abstract: Systems, methods, and computer program products to perform an operation comprising assigning each of a plurality of images in a blog post and each of a plurality of images in a collection of images to a respective node in a graph, computing an adjacency matrix for the graph, wherein the adjacency matrix defines relationships between images in the blog post and images in the collections of images, and determining a first subset of the images in the collection of images that summarize the images in the image collection, wherein the subset is determined based on the adjacency matrix, wherein the adjacency matrix is computed based on the subset of the images in the collection of images.
    Type: Grant
    Filed: March 22, 2016
    Date of Patent: March 27, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Leonid Sigal, Gunhee Kim, Seungwhan Moon
  • Patent number: 9846845
    Abstract: The disclosure provides an approach for recognizing and analyzing activities. In one embodiment, a learning application trains parameters of a hierarchical model which represents human (or object) activity at multiple levels of detail. Higher levels of detail may consider more context, and vice versa. Further, learning may be optimized for a user-preferred type of inference by adjusting a learning criterion. An inference application may use the trained model to answer queries about variable(s) at any level of detail. In one embodiment, the inference application may determine scores for each possible value of the query variable by finding the best hierarchical event representation that maximizes a scoring function while fixing the value of the query variable to its possible values. Here, the inference application may approximately determine the best hierarchical event representation by iteratively optimizing one level-of-detail variable at a time while fixing other level-of-detail variables, until convergence.
    Type: Grant
    Filed: November 21, 2012
    Date of Patent: December 19, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Leonid Sigal, Tian Lan
  • Publication number: 20170357720
    Abstract: Systems, methods and articles of manufacture for modeling a joint language-visual space. A textual query to be evaluated relative to a video library is received from a requesting entity. The video library contains a plurality of instances of video content. One or more instances of video content from the video library that correspond to the textual query are determined, by analyzing the textual query using a data model that includes a soft-attention neural network module that is jointly trained with a language Long Short-term Memory (LSTM) neural network module and a video LSTM neural network module. At least an indication of the one or more instances of video content is returned to the requesting entity.
    Type: Application
    Filed: June 12, 2017
    Publication date: December 14, 2017
    Inventors: Atousa TORABI, Leonid SIGAL
  • Patent number: 9805264
    Abstract: Techniques disclose an incrementally expanding object detection model. An object detection tool identifies, based on an object detection model, one or more objects in a sequence of video frames. The object detection model provides an object space including a plurality of object classes. Each object class includes one or more prototypes. Each object is classified as being an instance of one of the object classes. Each identified object is tracked across at least one of the frames. The object detection tool generates a measure of confidence for that object based on the tracking. Upon determining that the measure of confidence exceeds a threshold, the object detection tool adds a prototype of the instance to the object detection model.
    Type: Grant
    Filed: October 19, 2015
    Date of Patent: October 31, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Alina Kuznetsova, Sung Ju Hwang, Leonid Sigal
  • Publication number: 20170308754
    Abstract: There is provided a system comprising a label database including a plurality of label, a non-transitory memory storing an executable code, and a hardware processor executing the executable code to receive a media content including a plurality of segments, each segment including a plurality of frames, extract a first plurality of features from a segment, extract a second plurality of features from each frame of the segment, determine an attention weight for each frame of the segment based on the first plurality of features extracted from the segment and the second plurality of features extracted from the segment, and determine that the segment depicts one of the plurality of labels in a label database based on the first plurality of features, the second plurality of features, and the attention weight of each frame of the plurality of frames of the segment.
    Type: Application
    Filed: July 5, 2016
    Publication date: October 26, 2017
    Inventors: Atousa Torabi, Leonid Sigal
  • Publication number: 20170308753
    Abstract: There is provided a system including a non-transitory memory storing an executable code and a hardware processor executing the executable code to receive a plurality of training contents depicting a plurality of activities, extract training object data from the plurality of training contents including a first training object data corresponding to a first activity, extract training scene data from the plurality of training contents including a first training scene data corresponding to the first activity, determine that a probability of the first activity is maximized when the first training object data and the first training scene data both exist in a sample media content.
    Type: Application
    Filed: July 15, 2016
    Publication date: October 26, 2017
    Inventors: Zuxuan Wu, Yanwei Fu, Leonid Sigal
  • Publication number: 20170308756
    Abstract: There is provided a system comprising a memory and a processor configured to receive a media content depicting an activity, extract a first plurality of features from a first segment of the media content, make a first prediction that the media content depicts a first activity based on the first plurality of features, wherein the first prediction has a first confidence level, extract a second plurality of features from a second segment of the media content, the second segment temporally following the first segment in the media content, make a second prediction that the media content depicts the first activity based on the second plurality of features, wherein the second prediction has a second confidence level, determine that the media content depicts the first activity based on the first prediction and the second prediction, wherein the second confidence level is at least as high as the first confidence level.
    Type: Application
    Filed: July 14, 2016
    Publication date: October 26, 2017
    Inventors: Leonid Sigal, Shugao Ma
  • Publication number: 20170300534
    Abstract: Items of interest within digital information may be detected and associated with a label that provides context to the item of interest. The label may describe an item category of the item of interest. The knowledge base of item categories may be limited. Additional item categories may be learned by accessing sets of vocabulary that may relate to the known item categories.
    Type: Application
    Filed: April 13, 2016
    Publication date: October 19, 2017
    Inventors: Yanwei Fu, Leonid Sigal
  • Publication number: 20170300781
    Abstract: One or more image parameters of an image may be analyzed using a hierarchical set of models. Executing individual models in the set of models may generate outputs from analysis of different image parameters of the image. Inputs of one or more of the models may be conditioned on a set of outputs derived from one or more preceding model in the hierarchy.
    Type: Application
    Filed: September 22, 2016
    Publication date: October 19, 2017
    Inventors: Andreas Lehrmann, Leonid Sigal
  • Publication number: 20170277970
    Abstract: Systems, methods, and computer program products to perform an operation comprising assigning each of a plurality of images in a blog post and each of a plurality of images in a collection of images to a respective node in a graph, computing an adjacency matrix for the graph, wherein the adjacency matrix defines relationships between images in the blog post and images in the collections of images, and determining a first subset of the images in the collection of images that summarize the images in the image collection, wherein the subset is determined based on the adjacency matrix, wherein the adjacency matrix is computed based on the subset of the images in the collection of images.
    Type: Application
    Filed: March 22, 2016
    Publication date: September 28, 2017
    Inventors: Leonid SIGAL, Gunhee KIM, Seungwhan MOON
  • Patent number: 9740964
    Abstract: There are provided systems and methods for performing object classification through semantic mapping. Such an object classification system includes a system processor, a system memory, and an object categorizing unit stored in the system memory. The system processor is configured to execute the object categorizing unit to receive image data corresponding to an object, and to transform the image data into a directed quantity expressed at least in part in terms of semantic parameters. The system processor is further configured to determine a projection of the directed quantity onto an object representation map including multiple object categories, and to associate the object with a category from among the multiple object categories based on the projection.
    Type: Grant
    Filed: March 31, 2015
    Date of Patent: August 22, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Sung Ju Hwang, Jonghyun Choi, Leonid Sigal
  • Patent number: 9704288
    Abstract: Techniques are disclosed for providing a learning-based clothing model that enables the simultaneous animation of multiple detailed garments in real-time. A simple conditional model learns and preserves key dynamic properties of cloth motions and folding details. Such a conditional model may be generated for each garment worn by a given character. Once generated, the conditional model may be used to determine complex body/cloth interactions in order to render the character and garment from frame-to-frame. The clothing model may be used for a variety of garments worn by male and female human characters (as well as non-human characters) while performing a varied set of motions typically used in video games (e.g., walking, running, jumping, turning, etc.).
    Type: Grant
    Filed: December 21, 2010
    Date of Patent: July 11, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Edilson de Aguiar, Leonid Sigal, Adrien Treuille, Jessica K. Hodgins
  • Patent number: 9652685
    Abstract: Embodiments presented herein describe techniques for generating a story graph using a collection of digital media, such as images and video. The story graph presents a structure for activities, events, and locales commonly occurring in sets of photographs taken by different individuals across a given location (e.g., a theme park, tourist attraction, convention, etc.). To build a story graph, streams from sets of digital media are generated. Each stream corresponds to media (e.g., images or video) taken in sequence at the location by an individual (or related group of individuals) over a period of time. For each stream, features from each media are extracted relative to the stream. Clusters of media are generated and are connected by directed edges. The connections indicate a path observed to have occurred in the streams from one cluster to another cluster.
    Type: Grant
    Filed: September 30, 2014
    Date of Patent: May 16, 2017
    Assignee: Disney Enterprises, Inc.
    Inventors: Leonid Sigal, Gunhee Kim
  • Publication number: 20170109582
    Abstract: Techniques disclose an incrementally expanding object detection model. An object detection tool identifies, based on an object detection model, one or more objects in a sequence of video frames. The object detection model provides an object space including a plurality of object classes. Each object class includes one or more prototypes. Each object is classified as being an instance of one of the object classes. Each identified object is tracked across at least one of the frames. The object detection tool generates a measure of confidence for that object based on the tracking. Upon determining that the measure of confidence exceeds a threshold, the object detection tool adds a prototype of the instance to the object detection model.
    Type: Application
    Filed: October 19, 2015
    Publication date: April 20, 2017
    Inventors: Alina Kuznetsova, Sung Ju Hwang, Leonid Sigal
  • Publication number: 20170109419
    Abstract: There are provided systems and methods for performing metadata extraction and management. Such a system includes a computing platform having a hardware processor, a system memory, and metadata extraction and management unit stored in the system memory. The system is configured to extract multiple metadata types from a media asset, and to aggregate the multiple metadata types to produce an aggregated metadata describing the media asset. The system is further configured to transform the aggregated metadata into at least one database entry identifying the media asset, and to map the at least one database entry into a graphical database so as to relate the media asset to at least one other media asset represented in the graphical database.
    Type: Application
    Filed: October 15, 2015
    Publication date: April 20, 2017
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Jordi Pont-Tuset, Pablo Beltran, Nimesh Narayan, Leonid Sigal, Aljoscha Smolic, Anthony M. Accardo
  • Publication number: 20170068643
    Abstract: Systems, methods, and computer program products to perform an operation comprising assigning each of a plurality of nodes of a graph to a distinct image, of a plurality of images, wherein the graph represents a story, wherein each node corresponds to a respective element of the story, wherein each node comprises: (i) an attribute and (ii) a text of the respective element of the story, wherein the plurality of graph nodes are assigned based on a set of attributes of each of the plurality of images and the attribute of each node, and generating a visual depiction of the story, wherein the visual depiction comprises an ordered representation of each of the distinct images and the text of each respective element of the story.
    Type: Application
    Filed: September 3, 2015
    Publication date: March 9, 2017
    Inventors: Ariel SHAMIR, Oz RADIANO, Leonid SIGAL, Moshe B. MAHLER
  • Patent number: 9514363
    Abstract: The disclosure provides an approach for detecting and localizing action in video. In one embodiment, an action detection application receives training video sequences and associated eye gaze fixation data collected from a sample of human viewers. Using the training video sequences and eye gaze data, the action detection application learns a model which includes a latent regions potential term that measures the compatibility of latent spatio-temporal regions with the model, as well as a context potential term that accounts for contextual information that is not directly produced by the appearance and motion of the actor. The action detection application may train this model in, e.g., the latent structural SVM framework by minimizing a cost function which encodes the cost of an incorrect action label prediction and a mislocalization of the eye gaze. During training and thereafter, inferences using the model may be made using an efficient dynamic programming algorithm.
    Type: Grant
    Filed: April 8, 2014
    Date of Patent: December 6, 2016
    Assignee: Disney Enterprises, Inc.
    Inventors: Leonid Sigal, Nataliya Shapovalova, Michail Raptis
  • Patent number: 9477908
    Abstract: The disclosure provides an approach for detecting objects in images. An object detection application receives a set of training images with object annotations. Given these training images, the object detection application generates semantic labeling for object detections, where the labeling includes lower-level subcategories and higher-level visual composites. In one embodiment, the object detection application identifies subcategories using an exemplar support vector machine (SVM) based clustering approach. Identified subcategories are used to initialize mixture components in mixture models which the object detection application trains in a latent SVM framework, thereby learning a number of subcategory classifiers that produce, for any given image, a set of candidate windows and associated subcategory labels.
    Type: Grant
    Filed: April 10, 2014
    Date of Patent: October 25, 2016
    Assignee: Disney Enterprises, Inc.
    Inventors: Leonid Sigal, Michail Raptis, Tian Lan