Patents by Inventor Roberto Cipolla

Roberto Cipolla has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9767604
    Abstract: A method of object recognition and/or registration includes receiving a point cloud, arranging the points of the point cloud into a hierarchical search tree, and determining geometric information of the points located within a region by identifying the highest-level tree nodes where all of the descendant leaf nodes are contained within the region and selecting the leaf nodes for the points where no sub-tree is entirely contained within the region, such that the points falling within the region are represented by the smallest number of nodes, and performing statistical operations on the nodes representing the points in the region. The geometric information includes descriptors of features in the point cloud. The method further includes comparing the feature descriptors with a database of feature descriptors for a plurality of objects.
    Type: Grant
    Filed: July 23, 2015
    Date of Patent: September 19, 2017
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri Pham, Riccardo Gherardi, Frank Perbet, Bjorn Stenger, Sam Johnson, Oliver Woodford, Pablo Alcantarilla, Roberto Cipolla
  • Patent number: 9646408
    Abstract: According to one embodiment, a method of generating a three dimensional representation of a subject from a depth image, comprises comparing a depth image of the subject with a plurality of representative images, wherein each representative image is associated with a respective parametric model of a subject; identifying a representative image of the plurality of representative images as a closest representative image to the depth image of the subject; selecting the parametric model associated with the closest representative image to the depth image; and generating a three dimensional representation of the subject by fitting the selected parametric model to the depth image of the subject.
    Type: Grant
    Filed: May 6, 2014
    Date of Patent: May 9, 2017
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Sam Johnson, Frank Perbet, Bjorn Stenger, Minh-Tri Pham, Oliver Woodford, Riccardo Gherardi, Roberto Cipolla
  • Patent number: 9361722
    Abstract: A method of animating a computer generation of a head and displaying the text of an electronic book, such that the head has a mouth which moves in accordance with the speech of the text of the electronic book to be output by the head and a word or group of words from the text is displayed while simultaneously being mimed by the mouth, wherein input text is divided into a sequence of acoustic units, which are converted to a sequence of image vectors and into a sequence of text display indicators. The sequence of image vectors is outputted as video such that the mouth of said head moves to mime the speech associated with the input text with a selected expression, and the sequence of text display indicators is output as video which is synchronized with the lip movement of the head.
    Type: Grant
    Filed: August 8, 2014
    Date of Patent: June 7, 2016
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Javier Latorre-Martinez, Vincent Ping Leung Wan, Balakrishna Venkata Jagannadha Kolluru, Ioannis Stylianou, Robert Arthur Blokland, Norbert Braunschweiler, Kayoko Yanagisawa, Langzhou Chen, Ranniery Maia, Robert Anderson, Bjorn Stenger, Roberto Cipolla, Neil Baker
  • Publication number: 20160148393
    Abstract: A method of calculating a similarity measure between first and second image patches, which include respective first and second intensity values associated with respective elements of the first and second image patches, and which have a corresponding size and shape such that each element of the first image patch corresponds to an element on the second image patch. The method: determines a set of sub-regions on the second image patch corresponding to elements of the first image patch and having first intensity values within a range defined for that sub-region; calculates variance, for each sub-region of the set over all of the elements of that sub-region, of a function of the second intensity value associated with that element and the first intensity value associated with the corresponding element of the first image patch; and calculates similarity measure as the sum over all sub-regions of the calculated variances.
    Type: Application
    Filed: November 5, 2013
    Publication date: May 26, 2016
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Atsuto MAKI, Riccardo GHERARDI, Oliver WOODFORD, Frank PERBET, Minh-Tri PHAM, Bjorn STENGER, Sam JOHNSON, Roberto CIPOLLA
  • Publication number: 20160027208
    Abstract: A method for analysing a point cloud, the method comprising: receiving a point cloud, comprising a plurality of points, each point representing a spatial point in an image; arranging the points into a hierarchical search tree, with a lowest level comprising a plurality of leaf nodes, where each leaf node corresponds to a point of the point cloud, the search tree comprising a plurality of hierarchical levels with tree nodes in each of the hierarchical levels, the nodes being vertically connected to each other through the hierarchy by branches, wherein at least one moment of the property of the descendant nodes is stored in each tree node; and determining geometric information of the points located within a region, by identifying the highest-level tree nodes where all of the descendant leaf nodes are contained within the region and selecting the leaf nodes for the points where no sub-tree is entirely contained within the region, such that the points falling within the region are represented by the s
    Type: Application
    Filed: July 23, 2015
    Publication date: January 28, 2016
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri PHAM, Riccardo GHERARDI, Frank PERBET, Bjorn STENGER, Sam JOHNSON, Oliver WOODFORD, Pablo ALCANTARILLA, Roberto CIPOLLA
  • Patent number: 9165204
    Abstract: A method (100) and system (300) are described for processing video data comprising a plurality of images. The method and apparatus are for obtaining a labeling of a plurality of objects or regions in an image of a sequence of images, followed by label propagation to other images in the sequence based on an inference step and a model.
    Type: Grant
    Filed: February 28, 2011
    Date of Patent: October 20, 2015
    Assignees: TOYOTA MOTOR EUROPE NV/SA, CAMBRIDGE ENTERPRISE LIMITED
    Inventors: Gabriel Othmezouri, Ichiro Sakata, Roberto Cipolla, Vijay Badrinarayanan
  • Publication number: 20150254527
    Abstract: A method for comparing a plurality of objects, the method comprising representing at least one feature of each object as a 3D ball representation, the radius of each ball representing the scale of the feature with respect to the frame of the object, the position of each ball representing the translation of the feature in the frame of the object, the method further comprising comparing the objects by comparing the scale and translation as represented by the 3D balls to determine similarity between objects and their poses.
    Type: Application
    Filed: August 26, 2014
    Publication date: September 10, 2015
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri Pham, Frank Perbet, Bjorn Dietmar Rafael Stenger, Riccardo Gherardi, Oliver Woodford, Sam Johnson, Roberto Cipolla, Stephan Liwicki
  • Patent number: 9008439
    Abstract: A method of comparing two object poses, wherein each object pose is expressed in terms of position, orientation and scale with respect to a common coordinate system, the method comprising: calculating a distance between the two object poses, the distance being calculated using the distance function: $d_{sRt}(X, Y) = \sqrt{\frac{d_s^2(X, Y)}{\sigma_s^2} + \frac{d_r^2(X, Y)}{\sigma_r^2} + \frac{d_t^2(X, Y)}{\sigma_t^2}}$.
    Type: Grant
    Filed: August 14, 2014
    Date of Patent: April 14, 2015
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri Pham, Oliver Woodford, Frank Perbet, Atsuto Maki, Bjorn Stenger, Roberto Cipolla
  • Patent number: 9002055
    Abstract: A method (100) and system (300) are described for processing video data comprising a plurality of images. The method (100) comprises obtaining (104, 106), for each of the plurality of images, a segmentation into a plurality of regions and a set of keypoints, and tracking (108) at least one region between a first image and a subsequent image, resulting in a matched region in the subsequent image, taking into account a matching between keypoints in the first image and the subsequent image. This results in accurate tracking of regions. Furthermore, the method may optionally also perform label propagation taking into account keypoint tracking.
    Type: Grant
    Filed: October 13, 2008
    Date of Patent: April 7, 2015
    Assignees: Toyota Motor Europe NV, Cambridge Enterprise Limited
    Inventors: Ryuji Funayama, Hiromichi Yanagihara, Julien Fauqueur, Gabriel Brostow, Roberto Cipolla
  • Publication number: 20150052084
    Abstract: A system for emulating a subject, to allow a user to interact with a computer generated talking head with the subject's face and voice; said system comprising a processor, a user interface and a personality storage section, the user interface being configured to emulate the subject, by displaying a talking head which comprises the subject's face and output speech from the mouth of the face with the subject's voice, the user interface further comprising a receiver for receiving a query from the user, the emulated subject being configured to respond to the query received from the user, the processor comprising a dialogue section and a talking head generation section, wherein said dialogue section is configured to generate a response to a query inputted by a user from the user interface and generate a response to be outputted by the talking head, the response being generated by retrieving information from said personality storage section, said personality storage section comprising content created by or about
    Type: Application
    Filed: August 13, 2014
    Publication date: February 19, 2015
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Balakrishna Venkata Jagannadha KOLLURU, Vincent Ping Leung WAN, Bjorn Dietmar Rafael STENGER, Roberto CIPOLLA, Javier LATORRE-MARTINEZ, Langzhou CHEN, Ranniery Da Silva MAIA, Kayoko YANAGISAWA, Norbert BRAUNSCHWEILER, Ioannis STYLIANOU, Robert Arthur BLOKLAND
  • Publication number: 20150042662
    Abstract: A method of animating a computer generation of a head and displaying the text of an electronic book, such that the head has a mouth which moves in accordance with the speech of the text of the electronic book to be output by the head and a word or group of words from the text is displayed while simultaneously being mimed by the mouth, said method comprising: inputting the text of said book; dividing said input text into a sequence of acoustic units; determining expression characteristics for the inputted text; calculating a duration for each acoustic unit using a duration model; converting said sequence of acoustic units to a sequence of image vectors using a statistical model, wherein said model has a plurality of model parameters describing probability distributions which relate an acoustic unit to an image vector, said image vector comprising a plurality of parameters which define a face of said head; converting said sequence of acoustic units into a sequence of text display indicators using an text dis
    Type: Application
    Filed: August 8, 2014
    Publication date: February 12, 2015
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Javier Latorre-Martinez, Vincent Ping Leung Wan, Balakrishna Venkata Jagannadha Kolluru, Ioannis Stylianou, Robert Arthur Blokland, Norbert Braunschweiler, Kayoko Yanagisawa, Langzhou Chen, Ranniery MAIA, Robert Anderson, Bjorn Stenger, Roberto Cipolla, Neil Baker
  • Publication number: 20150036918
    Abstract: A method of comparing two object poses, wherein each object pose is expressed in terms of position, orientation and scale with respect to a common coordinate system, the method comprising: calculating a distance between the two object poses, the distance being calculated using the distance function: $d_{sRt}(X, Y) = \sqrt{\frac{d_s^2(X, Y)}{\sigma_s^2} + \frac{d_r^2(X, Y)}{\sigma_r^2} + \frac{d_t^2(X, Y)}{\sigma_t^2}}$.
    Type: Application
    Filed: August 14, 2014
    Publication date: February 5, 2015
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri Pham, Oliver Woodford, Frank Perbet, Atsuto Maki, Bjorn Stenger, Roberto Cipolla
  • Patent number: 8948448
    Abstract: A trajectory estimation apparatus includes: an image acceptance unit which accepts images that are temporally sequential and included in the video; a hierarchical subregion generating unit which generates subregions at hierarchical levels by performing hierarchical segmentation on each of the images accepted by the image acceptance unit such that, among subregions belonging to hierarchical levels different from each other, a spatially larger subregion includes spatially smaller subregions; and a representative trajectory estimation unit which estimates, as a representative trajectory, a trajectory, in the video, of a subregion included in a certain image, by searching for a subregion that is most similar to the subregion included in the certain image, across hierarchical levels in an image different from the certain image.
    Type: Grant
    Filed: December 23, 2011
    Date of Patent: February 3, 2015
    Assignee: Panasonic Corporation
    Inventors: Masahiro Iwasaki, Kunio Nobori, Ayako Komoto, Fabio Galasso, Roberto Cipolla
  • Publication number: 20140375635
    Abstract: According to one embodiment, a method of generating a three dimensional representation of a subject from a depth image, comprises comparing a depth image of the subject with a plurality of representative images, wherein each representative image is associated with a respective parametric model of a subject; identifying a representative image of the plurality of representative images as a closest representative image to the depth image of the subject; selecting the parametric model associated with the closest representative image to the depth image; and generating a three dimensional representation of the subject by fitting the selected parametric model to the depth image of the subject.
    Type: Application
    Filed: May 6, 2014
    Publication date: December 25, 2014
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Sam Johnson, Frank Perbet, Bjorn Stenger, Minh-Tri Pham, Oliver Woodford, Riccardo Gherardi, Roberto Cipolla
  • Patent number: 8849039
    Abstract: A method of comparing two object poses, wherein each object pose is expressed in terms of position, orientation and scale with respect to a common coordinate system, the method comprising: calculating a distance between the two object poses, the distance being calculated using the distance function: $d_{sRt}(X, Y) = \sqrt{\frac{d_s^2(X, Y)}{\sigma_s^2} + \frac{d_r^2(X, Y)}{\sigma_r^2} + \frac{d_t^2(X, Y)}{\sigma_t^2}}$.
    Type: Grant
    Filed: February 28, 2012
    Date of Patent: September 30, 2014
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Minh-Tri Pham, Oliver Woodford, Frank Perbet, Atsuto Maki, Bjorn Stenger, Roberto Cipolla
  • Publication number: 20140210830
    Abstract: A method of animating a computer generation of a head, the head having a mouth which moves in accordance with speech to be output by the head, said method comprising: providing an input related to the speech which is to be output by the movement of the lips; dividing said input into a sequence of acoustic units; selecting expression characteristics for the inputted text; converting said sequence of acoustic units to a sequence of image vectors using a statistical model, wherein said model has a plurality of model parameters describing probability distributions which relate an acoustic unit to an image vector, said image vector comprising a plurality of parameters which define a face of said head; and outputting said sequence of image vectors as video such that the mouth of said head moves to mime the speech associated with the input text with the selected expression, wherein a parameter of a predetermined type of each probability distribution in said selected expression is expressed as a weighted sum of pa
    Type: Application
    Filed: January 29, 2014
    Publication date: July 31, 2014
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Javier LATORRE-MARTINEZ, Vincent Ping Leung Wan, Bjorn Stenger, Robert Anderson, Roberto Cipolla
  • Publication number: 20140210831
    Abstract: A method of animating a computer generation of a head, the head having a mouth which moves in accordance with speech to be output by the head, said method comprising: providing an input related to the speech which is to be output by the movement of the mouth; dividing said input into a sequence of acoustic units; selecting an expression to be output by said head; converting said sequence of acoustic units to a sequence of image vectors using a statistical model, wherein said model has a plurality of model parameters describing probability distributions which relate an acoustic unit to an image vector for a selected expression, said image vector comprising a plurality of parameters which define a face of said head; and outputting said sequence of image vectors as video such that the mouth of said head moves to mime the speech associated with the input text with the selected expression, wherein the image parameters define the face of a head using an appearance model comprising a plurality of shape modes and
    Type: Application
    Filed: January 29, 2014
    Publication date: July 31, 2014
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Bjorn Stenger, Robert Anderson, Javier Latorre-Martinez, Vincent Ping Leung Wan, Roberto Cipolla
  • Patent number: 8750614
    Abstract: According to one embodiment, a method of classifying a feature in a video sequence includes selecting a target region of a frame of the video sequence, where the target region contains the feature; dividing the target region into a plurality of cells; calculating histograms of optic flow within the cells; comparing the histograms of optic flow for pairs of cells; and assigning the feature to a class based at least in part on the result of the comparison.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: June 10, 2014
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Atsuto Maki, Frank Perbet, Bjorn Stenger, Oliver Woodford, Roberto Cipolla
  • Publication number: 20140125773
    Abstract: A method of calculating a similarity measure between first and second image patches, which include respective first and second intensity values associated with respective elements of the first and second image patches, and which have a corresponding size and shape such that each element of the first image patch corresponds to an element on the second image patch. The method: determines a set of sub-regions on the second image patch corresponding to elements of the first image patch and having first intensity values within a range defined for that sub-region; calculates variance, for each sub-region of the set over all of the elements of that sub-region, of a function of the second intensity value associated with that element and the first intensity value associated with the corresponding element of the first image patch; and calculates similarity measure as the sum over all sub-regions of the calculated variances.
    Type: Application
    Filed: November 5, 2013
    Publication date: May 8, 2014
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Atsuto MAKI, Riccardo Gherardi, Oliver Woodford, Frank Perbet, Minh-Tri Pham, Bjorn Stenger, Sam Johnson, Roberto Cipolla
  • Patent number: 8451322
    Abstract: According to one embodiment, an apparatus for determining the gradients of the surface normals of an object includes a receiving unit, an establishing unit, a determining unit, and a selecting unit. The receiving unit is configured to receive data of three 2D images of the object, wherein each image is taken under illumination from a different direction. The establishing unit is configured to establish which pixels of the image are in shadow such that there is only data available from two images for these pixels. The determining unit is configured to determine a range of possible solutions for the gradient of the surface normal of a shadowed pixel using the data available for the two images. The selecting unit is configured to select a solution for the gradient using the integrability of the gradient field over an area of the object as a constraint and minimizing a cost function.
    Type: Grant
    Filed: April 8, 2011
    Date of Patent: May 28, 2013
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Carlos Hernandez, George Vogiatzis, Roberto Cipolla
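
The patch similarity measure described in the abstract of publication 20140125773 above (elements grouped into sub-regions by the first patch's intensity range, a variance computed per sub-region, and the variances summed) can be illustrated with a minimal sketch. This is not the patented implementation. The function name `patch_similarity`, the flattened-list patch layout, the equal-width intensity bins, and the choice of a plain difference as the "function of the second intensity value ... and the first intensity value" are all assumptions made for illustration.

```python
def _variance(values):
    """Population variance of a non-empty list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def patch_similarity(first, second, num_bins=4, max_intensity=256):
    """Sum of per-sub-region variances between two equally sized patches.

    first, second: flattened intensity lists of the same length
                   (hypothetical layout; element i of each patch corresponds).
    num_bins, max_intensity: hypothetical parameters defining equal-width
                   intensity ranges on the FIRST patch.
    """
    if len(first) != len(second):
        raise ValueError("patches must have the same size and shape")

    bin_width = max_intensity / num_bins
    sub_regions = {}
    for a, b in zip(first, second):
        # Assign the element to a sub-region by the first patch's intensity.
        index = min(int(a // bin_width), num_bins - 1)
        # Assumed function of the two intensities: their difference.
        sub_regions.setdefault(index, []).append(b - a)

    # Sum the variance of the function over every non-empty sub-region.
    return sum(_variance(values) for values in sub_regions.values())


if __name__ == "__main__":
    # A constant brightness offset leaves every sub-region with zero variance.
    print(patch_similarity([1, 2, 3], [11, 12, 13]))
    # Two intensity ranges; only the brighter one shows inconsistent change.
    print(patch_similarity([10, 20, 200, 210], [15, 25, 190, 230], num_bins=2))
```

With the assumed difference function, a uniform brightness shift between the two patches contributes nothing to the sum, while inconsistent changes within an intensity range increase it.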