Patents by Inventor Emre Baris Aksu

Emre Baris Aksu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for storage and signaling of compressed point clouds

Patent number: 11581022

Abstract: A method, apparatus and computer program product are provided to signal and store compressed point clouds in video encoding. The method, apparatus and computer program product may be utilized in conjunction with a variety of video formats. Relative to encoding of compressed point clouds, the method, apparatus and computer program product access a point cloud compression coded bitstream and cause storage of the point cloud compression coded bitstream. The point cloud compression coded bitstream comprises a texture information bitstream, a geometry information bitstream, and an auxiliary metadata bitstream. Relative to the decoding of compressed point clouds, the method, apparatus and computer program product receive a point cloud compression coded bitstream and decode the point cloud compression coded bitstream.

Type: Grant

Filed: May 19, 2020

Date of Patent: February 14, 2023

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Lauri Ilola, Emre Baris Aksu, Miska Matias Hannuksela, Sebastian Schwarz
Cascaded prediction-transform approach for mixed machine-human targeted video coding

Patent number: 11575938

Abstract: Data may be encoded to minimize distortion after decoding, but the quality required for presentation of the decoded data to a machine and the quality required for presentation to a human may be different. To accommodate different quality requirements, video data may be encoded to produce a first set of encoded data and a second set of encoded data, where the first set may be decoded for use by one of a machine consumer or a human consumer, and a combination of the first set and the second set may be decoded for use by the other of a machine consumer or a human consumer. The first and second set may be produced with a neural encoder and a neural decoder, and/or may be produced with the use of prediction and transform neural network modules. A human-targeted structure and a machine-targeted structure may produce the sets of encoded data.

Type: Grant

Filed: December 30, 2020

Date of Patent: February 7, 2023

Assignee: Nokia Technologies Oy

Inventors: Hamed Rezazadegan Tavakoli, Francesco Cricri, Miska Matias Hannuksela, Emre Baris Aksu, Honglei Zhang, Nam Le
METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR VIDEO CONFERENCING

Publication number: 20230033063

Abstract: A method comprising obtaining a 360-degree video content from a video source; projecting the 360-degree video content onto a 2D image plane; dividing the projected 360-degree video content into a plurality of regions, wherein the regions are partly overlapping and each region covers a region of the 360-degree video content suitable for a viewport presentation; receiving a request for a viewport orientation of the 360-degree video content from a client; and providing the client with an viewport presentation of the region corresponding to the requested viewport orientation.

Type: Application

Filed: July 22, 2022

Publication date: February 2, 2023

Inventors: Saba Ahsan, Sujeet Shyamsundar Mate, Yu You, Emre Baris Aksu, Igor Danilo Diego Curcio, Miska Matias Hannuksela
METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR SIGNALING INFORMATION OF A MEDIA TRACK

Publication number: 20220335979

Abstract: Various embodiments provide an apparatus, a method, and a computer program product. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: define or utilize file format syntax elements to indicate samples comprising at least one of: one or more description documents, wherein the one or more description documents comprise 3 dimensional information; or one or more updates to at least one description document of the one or more description documents; and define or utilize the file format syntax elements to indicate a relationship between samples containing the one or more description document and update information to the samples.

Type: Application

Filed: March 17, 2022

Publication date: October 20, 2022

Inventors: Lukasz KONDRAD, Lauri Aleksi ILOLA, Emre Baris AKSU, Kashyap KAMMACHI SREEDHAR
AN APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR VIDEO CODING AND DECODING

Publication number: 20220335978

Abstract: A method comprising: authoring a plurality of sets of media tracks comprising at least a first set of media tracks and a second set of media tracks into a media file format, wherein a subset of tracks of the first set comprises alternate data for each other and a subset of tracks of the second set comprises alternate data for each other; and including, in or along a bitstream comprising a media file including or inferring said media tracks, an indication that said subset of tracks of the first set are alternatives to each other and said subset of tracks of the second set are alternatives to each other upon playback of the media tracks.

Type: Application

Filed: July 30, 2020

Publication date: October 20, 2022

Inventors: Lukasz KONDRAD, Lauri Aleksi ILOLA, Emre Baris AKSU, Miska Matias HANNUKSELA
Compression Framework for Distributed or Federated Learning with Predictive Compression Paradigm

Publication number: 20220335269

Abstract: An apparatus includes circuitry configured to: receive a plurality of compressed residual local weight updates from a plurality of respective institutes with a plurality of a respective first parameter, the first parameter used to determine a plurality of respective predicted local weight updates; determine a plurality of local weight updates or a plurality of adjusted local weight updates based on the plurality of compressed residual local weight updates and the plurality of respective predicted local weight updates; aggregate the plurality of determined local weight updates or the plurality of adjusted local weight updates to generate an intended global weight update, and update a model on a server based at least on the intended global weight update, the model used to perform a task; and transfer a compressed residual global weight update to the institutes with a second parameter, the second parameter used to determine a predicted global weight update.

Type: Application

Filed: April 11, 2022

Publication date: October 20, 2022

Inventors: Honglei Zhang, Hamed Rezazadegan Tavakoli, Francesco Cricri, Homayun Afrabandpey, Goutham Rangu, Emre Baris Aksu
Apparatus, a method and a computer program for video coding and decoding

Patent number: 11477489

Abstract: A method comprising: writing, in a container file, a first video-based point cloud compression (V-PCC) bitstream and a second V-PCC bitstream, wherein said first and second V-PCC bitstreams are associated with a common group based on at least one logical context; writing, in the container file, an indication about the common group between the first V-PCC bitstream and the second V-PCC bitstream; generating a media presentation description (MPD) file with a first representation belonging to a first adaptation set associated with the first V-PCC bitstream and a second representation belonging to a second adaptation set associated with the second V-PCC bitstream; and writing, in the MPD file, at least one information element describing grouping information of the first representation belonging to the first adaptation set and the second representation belonging to the second adaptation set, wherein said information element is provided with at least one attribute indicating that said first and second V-PCC bitstre

Type: Grant

Filed: April 6, 2021

Date of Patent: October 18, 2022

Assignee: Nokia Technologies Oy

Inventors: Kashyap Kammachi Sreedhar, Emre Baris Aksu, Lukasz Kondrad
Method and apparatus for incorporating location awareness in media content

Patent number: 11438731

Abstract: A method, apparatus and computer program product creates a viewpoint position structure for media content. The viewpoint position structure specifies a position of a viewpoint defined in a reference coordinate system and an offset of the reference coordinate system with respect to a geographical reference. The method, apparatus and computer program product cause storage of the viewpoint position structure. An indication may be created as to whether the media content is augmented reality media content. The augmented reality media content may comprise a background that is at least partially transparent. The offset may be determined, within the reference coordinate system, relative to a geomagnetic reference direction, based upon one or more of a viewpoint yaw angle, a viewpoint pitch angle, or a viewpoint roll angle.

Type: Grant

Filed: March 19, 2020

Date of Patent: September 6, 2022

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Sujeet Shyamsundar Mate, Emre Baris Aksu, Miska Matias Hannuksela, Igor Danilo Diego Curcio, Kashyap Kammachi-Sreedhar, Ville-Veikko Mattila
HIGH-LEVEL SYNTAX FOR SIGNALING NEURAL NETWORKS WITHIN A MEDIA BITSTREAM

Publication number: 20220256227

Abstract: An example method is provided to include receiving a media bitstream comprising one or more media units and a first enhancement information message, wherein the first enhancement information message comprises at least two independently parsable structures, a first independently parsable structure comprising information about at least one purpose of one or more neural networks (NNs) to be applied to the one or more media units, and a second independently parsable structure comprising or identifying one or more neural networks; decoding the one or more media units; and using the one or more neural networks to enhance or filter one or more frames of the decoded the one or more media units, based on the at least one purpose. An example method includes. Corresponding apparatuses and computer program products are also provided.

Type: Application

Filed: February 3, 2022

Publication date: August 11, 2022

Inventors: Hamed REZAZADEGAN TAVAKOLI, Francesco CRICRÌ, Emre Baris AKSU, Miska Matias HANNUKSELA
High level syntax for compressed representation of neural networks

Patent number: 11412266

Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.

Type: Grant

Filed: January 4, 2021

Date of Patent: August 9, 2022

Assignee: Nokia Technologies Oy

Inventors: Emre Baris Aksu, Miska Matias Hannuksela, Hamed Rezazadegan Tavakoli, Francesco Cricri
Method, An Apparatus And A Computer Program Product For Virtual Reality

Publication number: 20220247990

Abstract: A method includes generating a bitstream defining a presentation, the presentation comprising an omnidirectional visual media content and a first visual media component and a second visual media component; indicating in the bitstream a first presentation timeline and a second presentation timeline; and indicating in the bitstream a switching mode with respect to the first presentation timeline associated with the first visual media component, or with respect to the second presentation timeline associated with the second visual media component, the switching mode being indicated dependent on a viewpoint of a user; wherein the switching mode provides an indication of switching to the first visual media component or to the second visual media component, the first visual media component corresponding to content captured from a first omnidirectional camera in a first location, and the second visual media component corresponding to content captured from a second omnidirectional camera in a second location.

Type: Application

Filed: April 21, 2022

Publication date: August 4, 2022

Inventors: Kashyap Kammachi Sreedhar, Igor Danilo Diego CURCIO, Miska Matias HANNUKSELA, Sujeet Shyamsundar MATE, Emre Baris AKSU
METHOD AND APPARATUS FOR LATE BINDING IN MEDIA CONTENT

Publication number: 20220167042

Abstract: A method, apparatus and computer program product encode, into a media description, a first information item indicative of a first locator for segment metadata for a set of representations. The method, apparatus and computer program product encode, into the media description, one or more representation-specific information items indicative of representation-specific locator for segment media data for one or more representations of the set of representations. The method, apparatus and computer program product cause storage of the media description with the set of representations.

Type: Application

Filed: December 12, 2019

Publication date: May 26, 2022

Inventors: Miska Matias HANNUKSELA, Emre Baris AKSU, Ari HOURUNRANTA, Kashyap KAMMACHI-SREEDHAR, Igor Danilo Diego CURCIO
Method, an apparatus and a computer program product for virtual reality

Patent number: 11323683

Abstract: The invention relates to a solution wherein a bitstream defining a presentation is generated, the presentation comprising an omnidirectional visual media content and a first visual media component and a second visual media component; indicating in the bitstream a first presentation timeline associated with the first visual media component; indicating in the bitstream a second presentation timeline associated with the second visual media component; indicating in the bitstream a switching mode to a second presentation timeline associated with the second visual media component; and indicating in the bitstream, that the switching mode is with respect to the first presentation timeline or to the second presentation timeline.

Type: Grant

Filed: December 31, 2019

Date of Patent: May 3, 2022

Assignee: Nokia Technologies Oy

Inventors: Kashyap Kammachi Sreedhar, Igor Danilo Diego Curcio, Miska Matias Hannuksela, Sujeet Shyamsundar Mate, Emre Baris Aksu
TRAINING IN NEURAL NETWORKS

Publication number: 20220044125

Abstract: A system, obtaining a first training dataset, comprising a plurality of first image and pose data pairs; obtaining a first generated dataset, comprising a plurality of first image and estimated pose data pairs, wherein estimated pose data of the first image and estimated pose data pairs are generated by a first neural network trained using the first training dataset; obtaining a second generated dataset, comprising a plurality of second image and estimated pose data pairs, wherein estimated pose data of the second image and estimated pose data pairs are generated by a second neural network trained using the first training dataset; generating the first and second generated datasets a generated training dataset, comprising image and estimated pose data pairs selected from said first generated dataset; and training a third neural network based on a combination of some or all of the first training dataset and the generated training dataset.

Type: Application

Filed: August 5, 2021

Publication date: February 10, 2022

Inventors: Goutham RANGU, Francesco CRICRI, Emre Baris AKSU
FEDERATED TEACHER-STUDENT MACHINE LEARNING

Publication number: 20220012637

Abstract: A node for a federated machine learning system that comprises the node and one or more other nodes configured for the same machine learning task, the node comprising: a federated student machine learning network configured to update a machine learning model in dependence upon updated machine learning models of the one or more node; a teacher machine learning network; means for receiving unlabeled data; means for teaching, using supervised learning, at least the federated first machine learning network using the teacher machine learning network, wherein the teacher machine learning network is configured to receive the data and produce pseudo labels for supervised learning using the data and wherein the federated student machine learning network is configured to perform supervised learning in dependence upon the same received data and the pseudo-labels.

Type: Application

Filed: July 8, 2021

Publication date: January 13, 2022

Inventors: Hamed REZAZADEGAN TAVAKOLI, Francesco CRICRI, Emre Baris AKSU
Method and apparatus for storage and signaling of static point cloud data

Patent number: 11200701

Abstract: A method, apparatus and computer program product access a video-based point cloud compression coded bitstream. The point cloud compression coded bitstream corresponds to a non-timed video-based point cloud compression representation that comprises one or more video point cloud compression units. The method, apparatus and computer program product encapsulate the one or more video point cloud compression units as one or more video point cloud compression unit items. The method, apparatus and computer program product also cause storage of the one or more video point cloud compression unit items in a file.

Type: Grant

Filed: March 19, 2020

Date of Patent: December 14, 2021

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Emre Baris Aksu, Miska Matias Hannuksela
APPARATUS, A METHOD AND A COMPUTER PROGRAM FOR VIDEO CODING AND DECODING

Publication number: 20210314626

Abstract: A method comprising: writing, in a container file, a first video-based point cloud compression (V-PCC) bitstream and a second V-PCC bitstream, wherein said first and second V-PCC bitstreams are associated with a common group based on at least one logical context; writing, in the container file, an indication about the common group between the first V-PCC bitstream and the second V-PCC bitstream; generating a media presentation description (MPD) file with a first representation belonging to a first adaptation set associated with the first V-PCC bitstream and a second representation belonging to a second adaptation set associated with the second V-PCC bitstream; and writing, in the MPD file, at least one information element describing grouping information of the first representation belonging to the first adaptation set and the second representation belonging to the second adaptation set, wherein said information element is provided with at least one attribute indicating that said first and second V-PCC bitstre

Type: Application

Filed: April 6, 2021

Publication date: October 7, 2021

Inventors: Kashyap Kammachi Sreedhar, Emre Baris Aksu, Lukasz Kondrad
Method for analysing media content to generate reconstructed media content

Patent number: 11068722

Abstract: The invention relates to a method, an apparatus and a computer program product for analyzing media content. The method comprises receiving media content; performing feature extraction of the media content at a plurality of convolution layers to produce a plurality of layer-specific feature maps; transmitting from the plurality of convolution layers a corresponding layer-specific feature map to a corresponding de-convolution layer of a plurality of de-convolution layers via a recurrent connection between the plurality of convolution layers and the plurality of de-convolution layers; and generating a reconstructed media content based on the plurality of feature maps.

Type: Grant

Filed: September 27, 2017

Date of Patent: July 20, 2021

Assignee: Nokia Technologies Oy

Inventors: Francesco Cricri, Mikko Honkala, Emre Baris Aksu, Xingyang Ni
Cascaded Prediction-Transform Approach for Mixed Machine-Human Targeted Video Coding

Publication number: 20210218997

Abstract: Data may be encoded to minimize distortion after decoding, but the quality required for presentation of the decoded data to a machine and the quality required for presentation to a human may be different. To accommodate different quality requirements, video data may be encoded to produce a first set of encoded data and a second set of encoded data, where the first set may be decoded for use by one of a machine consumer or a human consumer, and a combination of the first set and the second set may be decoded for use by the other of a machine consumer or a human consumer. The first and second set may be produced with a neural encoder and a neural decoder, and/or may be produced with the use of prediction and transform neural network modules. A human-targeted structure and a machine-targeted structure may produce the sets of encoded data.

Type: Application

Filed: December 30, 2020

Publication date: July 15, 2021

Inventors: Hamed Rezazadegan Tavakoli, Francesco Cricri, Miska Matias Hannuksela, Emre Baris Aksu, Honglei Zhang, Nam Le
High Level Syntax for Compressed Representation of Neural Networks

Publication number: 20210211733

Abstract: An apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit having metadata or compressed neural network data of a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information unit.

Type: Application

Filed: January 4, 2021

Publication date: July 8, 2021

Inventors: Emre Baris Aksu, Miska Matias Hannuksela, Hamed Rezazadegan Tavakoli, Francesco Cricri

prev 1 2 3 4 next