Patents by Inventor Roberto Cipolla

Roberto Cipolla has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12085947
    Abstract: A computer-implemented method for training an agent in a first context including an entity and an environment of the entity, to allow an apparatus to perform a navigation task in a second context comprising the apparatus and a physical environment of the apparatus, the apparatus adapted to receive images of the physical environment of the apparatus and comprising a steering device adapted to control the direction of the apparatus, the method comprising: obtaining one or more navigation tasks comprising: generating a navigation task; scoring the navigation task using a machine-learned model trained to estimate the easiness of tasks; in response to the score satisfying a selection criterion, selecting the navigation task as one of the one or more navigation tasks; and training the agent using a reinforcement learning method comprising attempting to perform, by the entity, the one or more navigation tasks using images of the environment of the entity.
    Type: Grant
    Filed: February 24, 2021
    Date of Patent: September 10, 2024
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Steven Morad, Roberto Mecca, Rudra Poudel, Stephan Liwicki, Roberto Cipolla
  • Patent number: 11568595
    Abstract: A computer vision method for generating a three dimensional reconstruction of an object, the method comprising: receiving a set of photometric stereo images of the object, the set of photometric stereo images comprising a plurality of images using illumination from different directions using one or more light sources; using a trained neural network to generate a normal map of the object; and producing a 3D reconstruction of said object from said normal map, wherein using said trained neural network comprises converting said set of photometric stereo images to an input form suitable for an input layer of said neural network, wherein said input form comprises, for each pixel, a representation of the different lighting directions and their corresponding intensities which have been obtained from photometric stereo images to which a compensation has been applied, the compensation being determined from an estimate of the distance between the lighting source and a point on the object to which the pixel correspond
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: January 31, 2023
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Fotios Logothetis, Roberto Mecca, Ignas Budvytis, Roberto Cipolla
  • Patent number: 11538585
    Abstract: A method of detecting abnormalities in ECG signals by providing an ECG signal to a neural network, performing a first series of convolution operations to a first subset of layers and in a final layer, and determining a plurality of preliminary classification estimates, each preliminary classification estimate corresponding with a time segment of the ECG signal. Furthermore, determining input data for a second subset of layers of the neural network by concatenating the preliminary classification with the output of a layer of the first subset of layers that precedes the final layer of the first subset of layers. Within the second subset of layers of the neural network, performing a second series of convolution operations. In a final layer of the second subset, determining a plurality of final classification estimates, each final classification estimate corresponding with a time segment of the ECG signal.
    Type: Grant
    Filed: June 8, 2018
    Date of Patent: December 27, 2022
    Assignee: Cambridge Heartwear Limited
    Inventors: James Charles, Rameen Shakur, Roberto Cipolla
  • Patent number: 11436452
    Abstract: A method for processing video data comprising a plurality of image frames, the method comprising: obtaining a forward model and a backward model of the plurality of image frames; processing the forward model and the backward model to propagate at least one label in the region or patch to at least one other image frame of the video sequence, using a probabilistic method for estimating the label in the at least one other image frame in forward and backward correspondences, wherein, during the processing, a pixel having a most likely label with a probability lower than a threshold value is assigned a predetermined generic label; and generating a labelled result for any given image frame by applying an image label difference, based on label uncertainty between the forward and backward correspondences, to the given image frame.
    Type: Grant
    Filed: February 20, 2020
    Date of Patent: September 6, 2022
    Assignees: Toyota Motor Europe, Cambridge Enterprise Limited
    Inventors: Patrick Sauer, Ignas Budvytis, Roberto Cipolla
  • Patent number: 11394945
    Abstract: A system for producing a 3D image of an object, the system including: a processor adapted to receive multiview photometric stereo data, the multiview photometric stereo data including a first set of image data, wherein the first set of image data includes images captured by a first camera at a plurality of positions with illumination from different illumination directions such that there are a plurality of images from a different illumination direction for each position of the first camera, the processor being adapted to process the multiview photometric stereo data using a photometric stereo approach with a formulation of the irradiance using a reflection model that models both diffuse and specular reflection.
    Type: Grant
    Filed: July 28, 2020
    Date of Patent: July 19, 2022
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Roberto Mecca, Roberto Cipolla, Fotios Logothetis
  • Patent number: 11315253
    Abstract: An image processing method for segmenting an image, the method comprising: receiving an image; processing said image with a common processing stage to produce a first feature map; inputting said first feature map to a parallel processing stage, said parallel processing stage comprising first and second parallel branches that receive the first feature map; and combining the output of the first and second branches to produce a semantic segmented image, wherein the common processing stage comprises a neural network, the neural network having at least one separable convolution module configured to perform separable convolution and downsample the image to produce the first feature map, and said first branch comprises a neural network comprising at least one separable convolution module configured to perform separable convolution.
    Type: Grant
    Filed: January 14, 2020
    Date of Patent: April 26, 2022
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Rudra Prasad Poudel Karmatha, Stephan Liwicki, Roberto Cipolla
  • Publication number: 20220075383
    Abstract: A computer-implemented method for training an agent in a first context including an entity and an environment of the entity, to allow an apparatus to perform a navigation task in a second context comprising the apparatus and a physical environment of the apparatus, the apparatus adapted to receive images of the physical environment of the apparatus and comprising a steering device adapted to control the direction of the apparatus, the method comprising: obtaining one or more navigation tasks comprising: generating a navigation task; scoring the navigation task using a machine-learned model trained to estimate the easiness of tasks; in response to the score satisfying a selection criterion, selecting the navigation task as one of the one or more navigation tasks; and training the agent using a reinforcement learning method comprising attempting to perform, by the entity, the one or more navigation tasks using images of the environment of the entity.
    Type: Application
    Filed: February 24, 2021
    Publication date: March 10, 2022
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Steven Morad, Roberto Mecca, Rudra Poudel, Stephan Liwicki, Roberto Cipolla
  • Publication number: 20220051471
    Abstract: A computer vision method for generating a three dimensional reconstruction of an object, the method comprising: receiving a set of photometric stereo images of the object, the set of photometric stereo images comprising a plurality of images using illumination from different directions using one or more light sources; using a trained neural network to generate a normal map of the object; and producing a 3D reconstruction of said object from said normal map, wherein using said trained neural network comprises converting said set of photometric stereo images to an input form suitable for an input layer of said neural network, wherein said input form comprises, for each pixel, a representation of the different lighting directions and their corresponding intensities which have been obtained from photometric stereo images to which a compensation has been applied, the compensation being determined from an estimate of the distance between the lighting source and a point on the object to which the pixel correspond
    Type: Application
    Filed: February 19, 2021
    Publication date: February 17, 2022
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Fotios Logothetis, Roberto Mecca, Ignas Budvytis, Roberto Cipolla
  • Patent number: 11144597
    Abstract: A system for emulating a subject, to allow a user to interact with a computer generated talking head with the subject's face and voice; said system comprising a processor, a user interface and a personality storage section, the user interface being configured to emulate the subject, by displaying a talking head which comprises the subject's face and output speech from the mouth of the face with the subject's voice, the user interface further comprising a receiver for receiving a query from the user, the emulated subject being configured to respond to the query received from the user, the processor comprising a dialogue section and a talking head generation section, wherein said dialogue section is configured to generate a response to a query inputted by a user from the user interface and generate a response to be outputted by the talking head, the response being generated by retrieving information from said personality storage section, said personality storage section comprising content created by or about
    Type: Grant
    Filed: March 16, 2018
    Date of Patent: October 12, 2021
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Balakrishna Venkata Jagannadha Kolluru, Vincent Ping Leung Wan, Bjorn Dietmar Rafael Stenger, Roberto Cipolla, Javier Latorre-Martinez, Langzhou Chen, Ranniery Da Silva Maia, Kayoko Yanagisawa, Norbert Braunschweiler, Ioannis Stylianou, Robert Arthur Blokland
  • Patent number: 11013470
    Abstract: A method of detecting abnormalities in ECG signals, comprising receiving an ECG signal; extracting at least one ECG beat from the ECG signal; providing the at least one ECG beat to a neural network. Within the neural network, performing at least two layers of convolution operations, and for a beat of the ECG signal, determining a confidence map for the location of each of the P, Q, R, S and T points, determining, from the respective confidence maps, the position or absence of points, the points comprising each of the P, Q, R, S and T points; and reporting the positions and/or absence of the points.
    Type: Grant
    Filed: June 8, 2018
    Date of Patent: May 25, 2021
    Assignee: Cambridge Heartwear Limited
    Inventors: Rameen Shakur, James Charles, Roberto Cipolla
  • Publication number: 20210044788
    Abstract: A system for producing a 3D image of an object, the system including: a processor adapted to receive multiview photometric stereo data, the multiview photometric stereo data including a first set of image data, wherein the first set of image data includes images captured by a first camera at a plurality of positions with illumination from different illumination directions such that there are a plurality of images from a different illumination direction for each position of the first camera, the processor being adapted to process the multiview photometric stereo data using a photometric stereo approach with a formulation of the irradiance using a reflection model that models both diffuse and specular reflection.
    Type: Application
    Filed: July 28, 2020
    Publication date: February 11, 2021
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Roberto Mecca, Roberto Cipolla, Fotios Logothetis
  • Publication number: 20200234447
    Abstract: An image processing method for segmenting an image, the method comprising: receiving an image; processing said image with a common processing stage to produce a first feature map; inputting said first feature map to a parallel processing stage, said parallel processing stage comprising first and second parallel branches that receive the first feature map; and combining the output of the first and second branches to produce a semantic segmented image, wherein the common processing stage comprises a neural network, the neural network having at least one separable convolution module configured to perform separable convolution and downsample the image to produce the first feature map, and said first branch comprises a neural network comprising at least one separable convolution module configured to perform separable convolution.
    Type: Application
    Filed: January 14, 2020
    Publication date: July 23, 2020
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Rudra Prasad Poudel Karmatha, Stephan Liwicki, Roberto Cipolla
  • Publication number: 20200184285
    Abstract: A method for processing video data comprising a plurality of image frames, the method comprising: obtaining a forward model and a backward model of the plurality of image frames; processing the forward model and the backward model to propagate at least one label in the region or patch to at least one other image frame of the video sequence, using a probabilistic method for estimating the label in the at least one other image frame in forward and backward correspondences, wherein, during the processing, a pixel having a most likely label with a probability lower than a threshold value is assigned a predetermined generic label; and generating a labelled result for any given image frame by applying an image label difference, based on label uncertainty between the forward and backward correspondences, to the given image frame.
    Type: Application
    Filed: February 20, 2020
    Publication date: June 11, 2020
    Applicants: Toyota Motor Europe, Cambridge Enterprise Limited
    Inventors: Patrick Sauer, Ignas Budvytis, Roberto Cipolla
  • Patent number: 10602942
    Abstract: We disclose herein a method of detecting abnormalities in electrocardiogram (ECG) signals, the method comprising receiving a set of ECG signals from an ECG device; amplifying only the peaks of at least some of the set of ECG signals to produce ECG beat markings from which a heart rate is derivable to detect an irregular rhythm between at least two ECG beats; extracting a single ECG beat from the set of ECG signals from the ECG device by using said ECG beat markings; feeding the extracted single ECG beat into a first neural network; producing, at the first neural network, a compact representation of the extracted single ECG signal so as to generate a feature extraction output; and using, at a second neural network, the feature extraction output from the first neural network to generate a score associated with the abnormalities in the ECG signals.
    Type: Grant
    Filed: August 25, 2017
    Date of Patent: March 31, 2020
    Assignee: Cambridge Heartwear Limited
    Inventors: Rameen Shakur, Levin Chun Kiat Tan, Roberto Cipolla
  • Publication number: 20190374166
    Abstract: A method of detecting abnormalities in ECG signals, comprising receiving an ECG signal; extracting at least one ECG beat from the ECG signal; providing the at least one ECG beat to a neural network. Within the neural network, performing at least two layers of convolution operations, and for a beat of the ECG signal, determining a confidence map for the location of each of the P, Q, R, S and T points, determining, from the respective confidence maps, the position or absence of points, the points comprising each of the P, Q, R, S and T points; and reporting the positions and/or absence of the points.
    Type: Application
    Filed: June 8, 2018
    Publication date: December 12, 2019
    Inventors: James Charles, Rameen Shakur, Roberto Cipolla
  • Publication number: 20190378617
    Abstract: A method of detecting abnormalities in ECG signals by providing an ECG signal to a neural network, performing a first series of convolution operations to a first subset of layers and in a final layer, and determining a plurality of preliminary classification estimates, each preliminary classification estimate corresponding with a time segment of the ECG signal. Furthermore, determining input data for a second subset of layers of the neural network by concatenating the preliminary classification with the output of a layer of the first subset of layers that precedes the final layer of the first subset of layers. Within the second subset of layers of the neural network, performing a second series of convolution operations. In a final layer of the second subset, determining a plurality of final classification estimates, each final classification estimate corresponding with a time segment of the ECG signal.
    Type: Application
    Filed: June 8, 2018
    Publication date: December 12, 2019
    Inventors: James Charles, Rameen Shakur, Roberto Cipolla
  • Publication number: 20190059763
    Abstract: We disclose herein a method of detecting abnormalities in electrocardiogram (ECG) signals, the method comprising receiving a set of ECG signals from an ECG device; amplifying only the peaks of at least some of the set of ECG signals to produce ECG beat markings from which a heart rate is derivable to detect an irregular rhythm between at least two ECG beats; extracting a single ECG beat from the set of ECG signals from the ECG device by using said ECG beat markings; feeding the extracted single ECG beat into a first neural network; producing, at the first neural network, a compact representation of the extracted single ECG signal so as to generate a feature extraction output; and using, at a second neural network, the feature extraction output from the first neural network to generate a score associated with the abnormalities in the ECG signals.
    Type: Application
    Filed: August 25, 2017
    Publication date: February 28, 2019
    Inventors: Rameen Shakur, Levin Chun Kiat Tan, Roberto Cipolla
  • Patent number: 10097813
    Abstract: A system for capturing 3D image data of a scene, including three light sources, each configured to emit light at a different wavelength to the other two sources and to illuminate the scene to be captured; a first video camera configured to receive light from the light sources which has been reflected from the scene, to isolate light received from each of the light sources, and to output data relating to the image captured for each of the three light sources; a depth sensor configured to capture depth map data of the scene; and an analysis unit configured to receive data from the first video camera and process the data to obtain data relating to a normal field obtained from the images captured for each of the three light sources, and to combine the normal field data with the depth map data to capture 3D image data of the scene.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: October 9, 2018
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Bjorn Stenger, Atsuto Maki, Frank Perbet, Oliver Woodford, Roberto Cipolla, Robert Anderson
  • Publication number: 20180203946
    Abstract: A system for emulating a subject, to allow a user to interact with a computer generated talking head with the subject's face and voice; said system comprising a processor, a user interface and a personality storage section, the user interface being configured to emulate the subject, by displaying a talking head which comprises the subject's face and output speech from the mouth of the face with the subject's voice, the user interface further comprising a receiver for receiving a query from the user, the emulated subject being configured to respond to the query received from the user, the processor comprising a dialogue section and a talking head generation section, wherein said dialogue section is configured to generate a response to a query inputted by a user from the user interface and generate a response to be outputted by the talking head, the response being generated by retrieving information from said personality storage section, said personality storage section comprising content created by or about
    Type: Application
    Filed: March 16, 2018
    Publication date: July 19, 2018
    Applicant: Kabushiki Kaisha Toshiba
    Inventors: Balakrishna Venkata Jagannadha Kolluru, Vincent Ping Leung Wan, Bjorn Dietmar Rafael Stenger, Roberto Cipolla, Javier Latorre-Martinez, Langzhou Chen, Ranniery Da Silva Maia, Kayoko Yanagisawa, Norbert Braunschweiler, Ioannis Stylianou, Robert Arthur Blokland
  • Patent number: 9959657
    Abstract: A method of animating a computer generation of a head, the head having a mouth which moves in accordance with speech to be output by the head, said method comprising: providing an input related to the speech which is to be output by the movement of the lips; dividing said input into a sequence of acoustic units; selecting expression characteristics for the inputted text; converting said sequence of acoustic units to a sequence of image vectors using a statistical model, wherein said model has a plurality of model parameters describing probability distributions which relate an acoustic unit to an image vector, said image vector comprising a plurality of parameters which define a face of said head; and outputting said sequence of image vectors as video such that the mouth of said head moves to mime the speech associated with the input text with the selected expression, wherein a parameter of a predetermined type of each probability distribution in said selected expression is expressed as a weighted sum of pa
    Type: Grant
    Filed: January 29, 2014
    Date of Patent: May 1, 2018
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Javier Latorre-Martinez, Vincent Ping Leung Wan, Bjorn Stenger, Robert Anderson, Roberto Cipolla
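
Illustrative note on patent 11436452: the abstract states that a pixel whose most likely label has a probability below a threshold is assigned a predetermined generic label. The following is a minimal sketch of that single step only, not the patented method. The function name assign_labels, the constant GENERIC_LABEL, the threshold value and the sample probabilities are all hypothetical, chosen for illustration, and the forward and backward propagation models described in the abstract are not modelled here.

```python
from typing import List, Sequence

# Hypothetical id standing in for the "predetermined generic label".
GENERIC_LABEL = -1


def assign_labels(pixel_probs: Sequence[Sequence[float]], threshold: float) -> List[int]:
    """Return one label per pixel.

    Each entry of pixel_probs holds the per-label probabilities for one pixel.
    The most likely label is kept only if its probability reaches the
    threshold; otherwise the pixel receives GENERIC_LABEL.
    """
    labels = []
    for probs in pixel_probs:
        # Index of the most likely label for this pixel.
        best = max(range(len(probs)), key=lambda k: probs[k])
        labels.append(best if probs[best] >= threshold else GENERIC_LABEL)
    return labels


# Three pixels, three candidate labels each (made-up numbers).
probs = [
    [0.90, 0.05, 0.05],  # confident: label 0
    [0.40, 0.35, 0.25],  # uncertain: falls back to the generic label
    [0.10, 0.20, 0.70],  # confident: label 2
]
labels = assign_labels(probs, threshold=0.5)
print(labels)  # [0, -1, 2]
```

Reserving a separate generic label keeps uncertain pixels distinguishable from confidently labelled ones, so a later stage can treat them differently.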