Patents by Inventor Francesco Cricri

Francesco Cricri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Apparatus, a Method and a Computer Program for Video Coding and Decoding

Publication number: 20220141471

Abstract: A method includes maintaining a set of parameters or weights derived through online learning for a neural net; transmitting an update of the parameters or weights to a decoder; deriving a first prediction block based on an output of the neural net using the parameters or weights; deriving a first encoded prediction error block through encoding a difference of the first prediction block and a first input block; encoding the first encoded prediction error block into a bitstream; deriving a reconstructed prediction error block based on the first encoded prediction error block; deriving a second prediction block based on an output of the neural net using the parameters or weights and the reconstructed prediction error block; deriving a second encoded prediction error block through encoding a difference of the second prediction block and a second input block; and encoding the second encoded prediction error block into a bitstream.

Type: Application

Filed: January 14, 2022

Publication date: May 5, 2022

Inventors: Miska HANNUKSELA, Mikko Honkala, Jani Lainema, Francesco Cricri, Emre Aksu
AN APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR VIDEO CODING AND DECODING

Publication number: 20220141455

Abstract: A method comprising: obtaining a configuration of at least one neural network comprising a plurality of intra-prediction mode agnostic layers and one or more intra-prediction mode specific layers, the one or more intra-prediction mode specific layers corresponding to different intra-prediction modes; obtaining at least one input video frame comprising a plurality of blocks; determining to encode one or more blocks using intra prediction; determining an intra-prediction mode for each of said one or more blocks; grouping blocks having same intra-prediction mode into groups, each group being assigned with a computation path among the plurality of intra-prediction mode agnostic and the one or more intra-prediction mode specific layers; training the plurality of intra-prediction mode agnostic and/or the one or more intra-prediction mode specific layers of the neural networks based on a training loss between an output of the neural networks relating to a group of blocks and ground-truth blocks, wherein the ground-t

Type: Application

Filed: January 29, 2020

Publication date: May 5, 2022

Inventors: Francesco CRICRI, Caglar AYTEKIN, Miska HANNUKSELA, Xingyang NI
TRAINING IN NEURAL NETWORKS

Publication number: 20220044125

Abstract: A system, obtaining a first training dataset, comprising a plurality of first image and pose data pairs; obtaining a first generated dataset, comprising a plurality of first image and estimated pose data pairs, wherein estimated pose data of the first image and estimated pose data pairs are generated by a first neural network trained using the first training dataset; obtaining a second generated dataset, comprising a plurality of second image and estimated pose data pairs, wherein estimated pose data of the second image and estimated pose data pairs are generated by a second neural network trained using the first training dataset; generating the first and second generated datasets a generated training dataset, comprising image and estimated pose data pairs selected from said first generated dataset; and training a third neural network based on a combination of some or all of the first training dataset and the generated training dataset.

Type: Application

Filed: August 5, 2021

Publication date: February 10, 2022

Inventors: Goutham RANGU, Francesco CRICRI, Emre Baris AKSU
Apparatus, a method and a computer program for video coding and decoding

Patent number: 11228767

Abstract: A method comprising: deriving a first prediction block (608) at least partly based on an output of a neural net (602) using a first set of parameters; deriving a first encoded prediction error block (614-620) through encoding a difference of the first prediction block and a first input block; encoding (620) the first encoded prediction error block into a bitstream; deriving a first reconstructed prediction error block (624) from the first encoded prediction error block; deriving a training signal (628) from one or both of the first encoded prediction error block and/or the first reconstructed prediction error block (624); retraining (630) the neural net (602) with the training signal (628) to obtain a second set of parameters for the neural net (602); deriving a second prediction block (608) at least partly based on an output of the neural net using the second set of parameters; deriving a second encoded prediction error block (614-620) through encoding a difference of the second prediction block and a second

Type: Grant

Filed: December 3, 2018

Date of Patent: January 18, 2022

Assignee: Nokia Technologies Oy

Inventors: Miska Hannuksela, Mikko Honkala, Jani Lainema, Francesco Cricri, Emre Aksu
FEDERATED TEACHER-STUDENT MACHINE LEARNING

Publication number: 20220012637

Abstract: A node for a federated machine learning system that comprises the node and one or more other nodes configured for the same machine learning task, the node comprising: a federated student machine learning network configured to update a machine learning model in dependence upon updated machine learning models of the one or more node; a teacher machine learning network; means for receiving unlabeled data; means for teaching, using supervised learning, at least the federated first machine learning network using the teacher machine learning network, wherein the teacher machine learning network is configured to receive the data and produce pseudo labels for supervised learning using the data and wherein the federated student machine learning network is configured to perform supervised learning in dependence upon the same received data and the pseudo-labels.

Type: Application

Filed: July 8, 2021

Publication date: January 13, 2022

Inventors: Hamed REZAZADEGAN TAVAKOLI, Francesco CRICRI, Emre Baris AKSU
AN APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR RUNNING A NEURAL NETWORK

Publication number: 20220007084

Abstract: A method comprising: obtaining (400), in a first apparatus (500), media content, encoding (402), in a neural data compression network of the first apparatus (500), the media content wherein one or more parameters of the neural data compression network are determined based on a type of at least one analysis task to be performed on the media content; and transmitting (404) the encoded media content to a second apparatus (502).

Type: Application

Filed: September 20, 2019

Publication date: January 6, 2022

Inventors: Caglar Aytekin, Miska Hannuksela, Francesco Cricrì
Graph Diffusion for Structured Pruning of Neural Networks

Publication number: 20210397965

Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: estimate an importance of parameters of a neural network based on a graph diffusion process over at least one layer of the neural network; determine the parameters of the neural network that are suitable for pruning or sparsification; remove neurons of the neural network to prune or sparsify the neural network; and provide at least one syntax element for signaling the pruned or sparsified neural network over a communication channel, wherein the at least one syntax element comprises at least one neural network representation syntax element.

Type: Application

Filed: June 22, 2021

Publication date: December 23, 2021

Inventors: Honglei ZHANG, Francesco CRICRI, Hamed REZAZADEGAN TAVAKOLI, Joachim WABNIG, Iraj SANIEE, Miska Matias HANNUKSELA, Emre AKSU
Methods, apparatus, systems, computer programs for enabling mediated reality

Patent number: 11194439

Abstract: A method comprising: using a tracked real point of view of a user in a real space and a first mapping between the real space and a virtual space to determine a point of view of a virtual user within the virtual space; causing rendering to the user at least part of a virtual scene determined by the point of view of the virtual user within the virtual space; and using a selected one of a plurality of different mappings to map tracked user actions in the real space to actions of the virtual user in the virtual space, wherein, when a first mode is selected, the method comprises mapping tracked user actions in the real space, using the first mapping, to spatially-equivalent actions of the virtual user in the virtual space, and wherein, when a second mode is selected, the method comprises mapping tracked user actions in the real space, using a second mapping different to the first mapping, to non-spatially-equivalent actions of the virtual user in the virtual space, wherein the second mapping makes available user i

Type: Grant

Filed: September 6, 2018

Date of Patent: December 7, 2021

Assignee: Nokia Technologies Oy

Inventors: Lasse Laaksonen, Arto Lehtiniemi, Sujeet Shyamsundar Mate, Francesco Cricri
Reverse neural network for object re-identification

Patent number: 11188783

Abstract: The invention relates to a method comprising receiving, by a neural network, a first image comprising at least one target object; receiving, by the neural network, a second image comprising at least one query object; and determining, by the neural network, whether the query object corresponds to the target object, wherein the neural network comprises a discriminator neural network of a generative adversarial network (GAN). The invention further relates to an apparatus and a computer program product that perform the method.

Type: Grant

Filed: October 10, 2018

Date of Patent: November 30, 2021

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Francesco Cricrì, Emre Aksu, Xingyang Ni
Feature-Domain Residual for Video Coding for Machines

Publication number: 20210314573

Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: decode encoded data to generate decoded data, the encoded data having a bitrate lower than that of original data, and extract features from the decoded data; decode encoded residual features to generate decoded residual features; and generate enhanced decoded features as a result of combining the decoded residual features with the features extracted from the decoded data.

Type: Application

Filed: March 31, 2021

Publication date: October 7, 2021

Inventors: Honglei Zhang, Hamed Rezazadegan Tavakoli, Francesco Cricri, Miska Matias Hannuksela, Emre Aksu, Nam Le
Method for analysing media content to generate reconstructed media content

Patent number: 11068722

Abstract: The invention relates to a method, an apparatus and a computer program product for analyzing media content. The method comprises receiving media content; performing feature extraction of the media content at a plurality of convolution layers to produce a plurality of layer-specific feature maps; transmitting from the plurality of convolution layers a corresponding layer-specific feature map to a corresponding de-convolution layer of a plurality of de-convolution layers via a recurrent connection between the plurality of convolution layers and the plurality of de-convolution layers; and generating a reconstructed media content based on the plurality of feature maps.

Type: Grant

Filed: September 27, 2017

Date of Patent: July 20, 2021

Assignee: Nokia Technologies Oy

Inventors: Francesco Cricri, Mikko Honkala, Emre Baris Aksu, Xingyang Ni
Cascaded Prediction-Transform Approach for Mixed Machine-Human Targeted Video Coding

Publication number: 20210218997

Abstract: Data may be encoded to minimize distortion after decoding, but the quality required for presentation of the decoded data to a machine and the quality required for presentation to a human may be different. To accommodate different quality requirements, video data may be encoded to produce a first set of encoded data and a second set of encoded data, where the first set may be decoded for use by one of a machine consumer or a human consumer, and a combination of the first set and the second set may be decoded for use by the other of a machine consumer or a human consumer. The first and second set may be produced with a neural encoder and a neural decoder, and/or may be produced with the use of prediction and transform neural network modules. A human-targeted structure and a machine-targeted structure may produce the sets of encoded data.

Type: Application

Filed: December 30, 2020

Publication date: July 15, 2021

Inventors: Hamed Rezazadegan Tavakoli, Francesco Cricri, Miska Matias Hannuksela, Emre Baris Aksu, Honglei Zhang, Nam Le
Method and apparatus for training a neural network used for denoising

Patent number: 11062210

Abstract: A method, apparatus and computer program product provide an automated neural network training mechanism. The method, apparatus and computer program product receive a decoded noisy image and a set of input parameters for a neural network configured to optimize the decoded noisy image. A denoised image is generated based on the decoded noisy image and the set of input parameters. A denoised noisy error is computed representing an error between the denoised image and the decoded noisy image. The neural network is trained using the denoised noisy error and the set of input parameters and a ground truth noisy error value is received representing an error between the original image and the encoded image. The ground truth noisy error value is compared with the denoised noisy error to determine whether a difference between the ground truth noisy error value and the denoised noisy error is within a pre-determined threshold.

Type: Grant

Filed: October 1, 2019

Date of Patent: July 13, 2021

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Caglar Aytekin, Francesco Cricri, Xingyang Ni
High Level Syntax for Compressed Representation of Neural Networks

Publication number: 20210211733

Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.

Type: Application

Filed: January 4, 2021

Publication date: July 8, 2021

Inventors: Emre Baris Aksu, Miska Matias Hannuksela, Hamed Rezazadegan Tavakoli, Francesco Cricri
An Apparatus, A Method and a Computer Program for Video Coding and Decoding

Publication number: 20210195206

Abstract: A method comprising: deriving a first prediction block (608) at least partly based on an output of a neural net (602) using a first set of parameters; deriving a first encoded prediction error block (614-620) through encoding a difference of the first prediction block and a first input block; encoding (620) the first encoded prediction error block into a bitstream; deriving a first reconstructed prediction error block (624) from the first encoded prediction error block; deriving a training signal (628) from one or both of the first encoded prediction error block and/or the first reconstructed prediction error block (624); retraining (630) the neural net (602) with the training signal (628) to obtain a second set of parameters for the neural net (602); deriving a second prediction block (608) at least partly based on an output of the neural net using the second set of parameters; deriving a second encoded prediction error block (614-620) through encoding a difference of the second prediction block and a second

Type: Application

Filed: December 3, 2018

Publication date: June 24, 2021

Inventors: Miska Hannuksela, Mikko Honkala, Jani Lainema, Francesco Cricri, Emre Aksu
CONTROLLING AUDIO RENDERING

Publication number: 20210195358

Abstract: A method comprising: remotely sensing a real acoustic environment, in which multiple audio signals are captured; and enabling automatic control of mixing of the multiple captured audio signals based on the remote sensing of the real acoustic environment in which the multiple audio signals were captured.

Type: Application

Filed: February 15, 2017

Publication date: June 24, 2021

Inventors: Francesco Cricri, Arto Lehtiniemi, Antti Eronen
Methods and Apparatuses relating to the Handling of Visual Virtual Reality Content

Publication number: 20210191505

Abstract: This specification describes a method comprising responding to a first gesture by a first user delimiting a visual virtual reality content portion from visual virtual reality content being consumed by the first user via a first head-mounted display by selecting the delimited visual virtual reality content portion, responding to a second gesture by the first user directed towards a content consumption device associated with a second user by identifying the content consumption device as a recipient of the selected visual virtual reality content portion, and causing the selected visual virtual reality content portion to be provided to the content consumption device for consumption by t second user.

Type: Application

Filed: February 2, 2017

Publication date: June 24, 2021

Inventor: Francesco Cricri
VIDEO PROCESSING

Publication number: 20210168395

Abstract: An apparatus, a method and a computer program product are described comprising: obtaining or receiving video data; providing a current frame and/or one or more previous frames of the obtained or received video data to an input of a neural network; generating a predicted output at an output of the neural network, wherein the predicted output comprises at least one of one or more predicted future frames of the video data and predicted properties of one or more future frames of the video data; determining one or more processing decisions based, at least in part, on the predicted output; and processing the current frame of the video data at least partially according to the one or more processing decisions.

Type: Application

Filed: July 8, 2019

Publication date: June 3, 2021

Inventors: Francesco CRICRI, Antti HALLAPURO, Miska HANNUKSELA, Jani LAINEMA, Emre AKSU, Caglar AYTEKIN, Ramin GHAZNAVI YOUVALARI
AUGMENTED REALITY SYSTEM

Publication number: 20210160646

Abstract: A method, apparatus and computer program is described comprising: generating or obtaining at least one augmented reality image for presentation to a user, the augmented reality images comprising one or more virtual objects, displays or devices of an augmented reality system; attenuating real-world audio of the augmented reality system; and controlling the provision of audio to the user in at least one audio focus or beamform direction relating to at least one selected real-world object of the augmented reality system.

Type: Application

Filed: November 3, 2020

Publication date: May 27, 2021

Inventors: Jussi LEPPÄNEN, Miikka VILERMO, Arto LEHTINIEMI, Francesco CRICRÌ
AN APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR VIDEO CODING AND DECODING

Publication number: 20210127140

Abstract: A method comprising: obtaining a block of a picture or a picture in an encoder; determining if the block/picture is used for on-line learning; if affirmative, encoding the block/picture; reconstructing a coarse version of the block/picture or the respective prediction error block/picture; enhancing the coarse version using a neural net; fine-tuning the neural net with a training signal based on the coarse version; determining if the block/picture is enhanced using the neural net; and if affirmative, encoding the block/picture with enhancing using the neural net.

Type: Application

Filed: March 29, 2019

Publication date: April 29, 2021

Inventors: Miska HANNUKSELA, Jani LAINEMA, Francesco CRICRI

prev 1 2 3 4 5 6 7 … next