Patents by Inventor George Dan Toderici

George Dan Toderici has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12225239
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an encoder neural network configured to receive a data item and to process the data item to output a compressed representation of the data item. In one aspect, a method includes, for each training data item: processing the data item using the encoder neural network to generate a latent representation of the training data item; processing the latent representation using a hyper-encoder neural network to determine a conditional entropy model; generating a compressed representation of the training data item; processing the compressed representation using a decoder neural network to generate a reconstruction of the training data item; processing the reconstruction of the training data item using a discriminator neural network to generate a discriminator network output; evaluating a first loss function; and determining an update to the current values of the encoder network parameters.
    Type: Grant
    Filed: August 25, 2023
    Date of Patent: February 11, 2025
    Assignee: Google LLC
    Inventors: George Dan Toderici, Fabian Julius Mentzer, Eirikur Thor Agustsson, Michael Tobias Tschannen
  • Patent number: 12141199
    Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame has associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
    Type: Grant
    Filed: December 13, 2021
    Date of Patent: November 12, 2024
    Assignee: Google LLC
    Inventors: Balakrishnan Varadarajan, George Dan Toderici, Apostol Natsev, Nitin Khandelwal, Sudheendra Vijayanarasimhan, Weilong Yang, Sanketh Shetty
  • Patent number: 12033077
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving, by a neural network (NN), a dataset for generating features from the dataset. A first set of features is computed from the dataset using at least a feature layer of the NN. The first set of features i) is characterized by a measure of informativeness; and ii) is computed such that a size of the first set of features is compressible into a second set of features that is smaller in size than the first set of features and that has a same measure of informativeness as the measure of informativeness of the first set of features. The second set of features is generated from the first set of features using a compression method that compresses the first set of features to generate the second set of features.
    Type: Grant
    Filed: February 27, 2023
    Date of Patent: July 9, 2024
    Assignee: Google LLC
    Inventors: Abhinav Shrivastava, Saurabh Singh, Johannes Ballé, Sami Ahmad Abu-El-Haija, Nicholas Milo Johnston, George Dan Toderici
  • Publication number: 20240223817
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing video data. In one aspect, a method comprises: receiving a video sequence of frames; generating, using a flow prediction network, an optical flow between two sequential frames, wherein the two sequential frames comprise a first frame and a second frame that is subsequent to the first frame; generating from the optical flow, using a first autoencoder neural network: a predicted optical flow between the first frame and the second frame; and warping a reconstruction of the first frame according to the predicted optical flow and subsequently applying a blurring operation to obtain an initial predicted reconstruction of the second frame.
    Type: Application
    Filed: July 5, 2022
    Publication date: July 4, 2024
    Inventors: George Dan Toderici, Eirikur Thor Agustsson, Fabian Julius Mentzer, David Charles Minnen, Johannes Balle, Nicholas Johnston
  • Patent number: 12014542
    Abstract: A computer-implemented method for selecting representative frames for videos is provided. The method includes receiving a video and identifying a set of features for each of the frames of the video. The features include frame-based features and semantic features. The semantic features identify likelihoods of semantic concepts being present as content in the frames of the video. A set of video segments for the video is subsequently generated. Each video segment includes a chronological subset of frames from the video and each frame is associated with at least one of the semantic features. The method generates a score for each frame of the subset of frames for each video segment based at least on the semantic features, and selects a representative frame for each video segment based on the scores of the frames in the video segment. The representative frame represents and summarizes the video segment.
    Type: Grant
    Filed: December 14, 2020
    Date of Patent: June 18, 2024
    Assignee: Google LLC
    Inventors: Sanketh Shetty, Tomas Izo, Min-Hsuan Tsai, Sudheendra Vijayanarasimhan, Apostol Natsev, Sami Abu-El-Haija, George Dan Toderici, Susana Ricco, Balakrishnan Varadarajan, Nicola Muscettola, WeiHsin Gu, Weilong Yang, Nitin Khandelwal, Phuong Le
  • Publication number: 20240144583
    Abstract: Example embodiments of the present disclosure relate to systems and methods for compressing attributes of volumetric and hypervolumetric datasets. An example system performs operations including obtaining a reference dataset comprising attributes indexed by a domain of multidimensional coordinates; subdividing the domain into a plurality of blocks respectively associated with a plurality of attribute subsets; inputting, to a local nonlinear operator, a latent representation for an attribute subset associated with at least one block of the plurality of blocks; obtaining, using the local nonlinear operator and based on the latent representation, an attribute representation of one or more attributes of the attribute subset; and updating the latent representation based on a comparison of the attribute representation and the reference dataset.
    Type: Application
    Filed: December 27, 2023
    Publication date: May 2, 2024
    Inventors: Philip Andrew Chou, Berivan Isik, Sung Jin Hwang, Nicholas Milo Johnston, George Dan Toderici
  • Publication number: 20240107079
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an encoder neural network configured to receive a data item and to process the data item to output a compressed representation of the data item. In one aspect, a method includes, for each training data item: processing the data item using the encoder neural network to generate a latent representation of the training data item; processing the latent representation using a hyper-encoder neural network to determine a conditional entropy model; generating a compressed representation of the training data item; processing the compressed representation using a decoder neural network to generate a reconstruction of the training data item; processing the reconstruction of the training data item using a discriminator neural network to generate a discriminator network output; evaluating a first loss function; and determining an update to the current values of the encoder network parameters.
    Type: Application
    Filed: August 25, 2023
    Publication date: March 28, 2024
    Inventors: George Dan Toderici, Fabian Julius Mentzer, Eirikur Thor Agustsson, Michael Tobias Tschannen
  • Publication number: 20240078712
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing and decompressing data. In one aspect, a method comprises: processing data using an encoder neural network to generate a latent representation of the data; processing the latent representation of the data using a hyper-encoder neural network to generate a latent representation of an entropy model; generating an entropy encoded representation of the latent representation of the entropy model; generating an entropy encoded representation of the latent representation of the data using the latent representation of the entropy model; and determining a compressed representation of the data from the entropy encoded representations of: (i) the latent representation of the data and (ii) the latent representation of the entropy model used to entropy encode the latent representation of the data.
    Type: Application
    Filed: April 25, 2023
    Publication date: March 7, 2024
    Inventors: David Charles Minnen, Saurabh Singh, Johannes Balle, Troy Chinen, Sung Jin Hwang, Nicholas Johnston, George Dan Toderici
  • Patent number: 11900525
    Abstract: Example embodiments of the present disclosure relate to systems and methods for compressing attributes of volumetric and hypervolumetric datasets. An example system performs operations including obtaining a reference dataset comprising attributes indexed by a domain of multidimensional coordinates; subdividing the domain into a plurality of blocks respectively associated with a plurality of attribute subsets; inputting, to a local nonlinear operator, a latent representation for an attribute subset associated with at least one block of the plurality of blocks; obtaining, using the local nonlinear operator and based on the latent representation, an attribute representation of one or more attributes of the attribute subset; and updating the latent representation based on a comparison of the attribute representation and the reference dataset.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: February 13, 2024
    Assignee: Google LLC
    Inventors: Philip Andrew Chou, Berivan Isik, Sung Jin Hwang, Nicholas Milo Johnston, George Dan Toderici
  • Patent number: 11750848
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an encoder neural network configured to receive a data item and to process the data item to output a compressed representation of the data item. In one aspect, a method includes, for each training data item: processing the data item using the encoder neural network to generate a latent representation of the training data item; processing the latent representation using a hyper-encoder neural network to determine a conditional entropy model; generating a compressed representation of the training data item; processing the compressed representation using a decoder neural network to generate a reconstruction of the training data item; processing the reconstruction of the training data item using a discriminator neural network to generate a discriminator network output; evaluating a first loss function; and determining an update to the current values of the encoder network parameters.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: September 5, 2023
    Assignee: Google LLC
    Inventors: George Dan Toderici, Fabian Julius Mentzer, Eirikur Thor Agustsson, Michael Tobias Tschannen
  • Publication number: 20230260197
    Abstract: Example embodiments of the present disclosure relate to systems and methods for compressing attributes of volumetric and hypervolumetric datasets. An example system performs operations including obtaining a reference dataset comprising attributes indexed by a domain of multidimensional coordinates; subdividing the domain into a plurality of blocks respectively associated with a plurality of attribute subsets; inputting, to a local nonlinear operator, a latent representation for an attribute subset associated with at least one block of the plurality of blocks; obtaining, using the local nonlinear operator and based on the latent representation, an attribute representation of one or more attributes of the attribute subset; and updating the latent representation based on a comparison of the attribute representation and the reference dataset.
    Type: Application
    Filed: March 30, 2022
    Publication date: August 17, 2023
    Inventors: Philip Andrew Chou, Berivan Isik, Sung Jin Hwang, Nicholas Milo Johnston, George Dan Toderici
  • Publication number: 20230237332
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving, by a neural network (NN), a dataset for generating features from the dataset. A first set of features is computed from the dataset using at least a feature layer of the NN. The first set of features i) is characterized by a measure of informativeness; and ii) is computed such that a size of the first set of features is compressible into a second set of features that is smaller in size than the first set of features and that has a same measure of informativeness as the measure of informativeness of the first set of features. The second set of features is generated from the first set of features using a compression method that compresses the first set of features to generate the second set of features.
    Type: Application
    Filed: February 27, 2023
    Publication date: July 27, 2023
    Inventors: Abhinav Shrivastava, Saurabh Singh, Johannes Ballé, Sami Ahmad Abu-El-Haija, Nicholas Milo Johnston, George Dan Toderici
  • Patent number: 11670010
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing and decompressing data. In one aspect, a method comprises: processing data using an encoder neural network to generate a latent representation of the data; processing the latent representation of the data using a hyper-encoder neural network to generate a latent representation of an entropy model; generating an entropy encoded representation of the latent representation of the entropy model; generating an entropy encoded representation of the latent representation of the data using the latent representation of the entropy model; and determining a compressed representation of the data from the entropy encoded representations of: (i) the latent representation of the data and (ii) the latent representation of the entropy model used to entropy encode the latent representation of the data.
    Type: Grant
    Filed: January 19, 2022
    Date of Patent: June 6, 2023
    Assignee: Google LLC
    Inventors: David Charles Minnen, Saurabh Singh, Johannes Balle, Troy Chinen, Sung Jin Hwang, Nicholas Johnston, George Dan Toderici
  • Publication number: 20230099526
    Abstract: Example aspects of the present disclosure are directed to a computer-implemented method for determining a perceptual quality of a subject video content item. The method can include inputting a subject frame set from the subject video content item into a first machine-learned model. The method can also include generating, using the first machine-learned model, a feature based at least in part on the subject frame set. The method can also include outputting, using a second machine-learned model, a score indicating the perceptual quality of the subject video content item based at least in part on the feature.
    Type: Application
    Filed: September 27, 2021
    Publication date: March 30, 2023
    Inventors: Troy Chinen, Alex Sukhanov, Eirikur Thor Agustsson, George Dan Toderici
  • Patent number: 11610124
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving, by a neural network (NN), a dataset for generating features from the dataset. A first set of features is computed from the dataset using at least a feature layer of the NN. The first set of features i) is characterized by a measure of informativeness; and ii) is computed such that a size of the first set of features is compressible into a second set of features that is smaller in size than the first set of features and that has a same measure of informativeness as the measure of informativeness of the first set of features. The second set of features is generated from the first set of features using a compression method that compresses the first set of features to generate the second set of features.
    Type: Grant
    Filed: October 29, 2019
    Date of Patent: March 21, 2023
    Assignee: Google LLC
    Inventors: Abhinav Shrivastava, Saurabh Singh, Johannes Balle, Sami Ahmad Abu-El-Haija, Nicholas Johnston, George Dan Toderici
  • Publication number: 20220207873
    Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame has associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.
    Type: Application
    Filed: December 13, 2021
    Publication date: June 30, 2022
    Inventors: Balakrishnan Varadarajan, George Dan Toderici, Apostol Natsev, Nitin Khandelwal, Sudheendra Vijayanarasimhan, Weilong Yang, Sanketh Shetty
  • Patent number: 11354822
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for image compression and reconstruction. A request to generate an encoded representation of an input image is received. The encoded representation of the input image is then generated. The encoded representation includes a respective set of binary codes at each iteration. Generating the set of binary codes for the iteration from an initial set of binary codes includes: for any tiles that have already been masked off during any previous iteration, masking off the tile. For any tiles that have not yet been masked off during any of the previous iterations, a determination is made as to whether a reconstruction error of the tile when reconstructed from binary codes at the previous iterations satisfies an error threshold. When the reconstruction quality satisfies the error threshold, the tile is masked off.
    Type: Grant
    Filed: May 16, 2018
    Date of Patent: June 7, 2022
    Assignee: Google LLC
    Inventors: Michele Covell, Damien Vincent, David Charles Minnen, Saurabh Singh, Sung Jin Hwang, Nicholas Johnston, Joel Eric Shor, George Dan Toderici
  • Publication number: 20220174328
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an encoder neural network configured to receive a data item and to process the data item to output a compressed representation of the data item. In one aspect, a method includes, for each training data item: processing the data item using the encoder neural network to generate a latent representation of the training data item; processing the latent representation using a hyper-encoder neural network to determine a conditional entropy model; generating a compressed representation of the training data item; processing the compressed representation using a decoder neural network to generate a reconstruction of the training data item; processing the reconstruction of the training data item using a discriminator neural network to generate a discriminator network output; evaluating a first loss function; and determining an update to the current values of the encoder network parameters.
    Type: Application
    Filed: November 30, 2020
    Publication date: June 2, 2022
    Inventors: George Dan Toderici, Fabian Julius Mentzer, Eirikur Thor Agustsson, Michael Tobias Tschannen
  • Publication number: 20220138991
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing and decompressing data. In one aspect, a method comprises: processing data using an encoder neural network to generate a latent representation of the data; processing the latent representation of the data using a hyper-encoder neural network to generate a latent representation of an entropy model; generating an entropy encoded representation of the latent representation of the entropy model; generating an entropy encoded representation of the latent representation of the data using the latent representation of the entropy model; and determining a compressed representation of the data from the entropy encoded representations of: (i) the latent representation of the data and (ii) the latent representation of the entropy model used to entropy encode the latent representation of the data.
    Type: Application
    Filed: January 19, 2022
    Publication date: May 5, 2022
    Inventors: David Charles Minnen, Saurabh Singh, Johannes Balle, Troy Chinen, Sung Jin Hwang, Nicholas Johnston, George Dan Toderici
  • Patent number: 11257254
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing and decompressing data. In one aspect, a method comprises: processing data using an encoder neural network to generate a latent representation of the data; processing the latent representation of the data using a hyper-encoder neural network to generate a latent representation of an entropy model; generating an entropy encoded representation of the latent representation of the entropy model; generating an entropy encoded representation of the latent representation of the data using the latent representation of the entropy model; and determining a compressed representation of the data from the entropy encoded representations of: (i) the latent representation of the data and (ii) the latent representation of the entropy model used to entropy encode the latent representation of the data.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: February 22, 2022
    Assignee: Google LLC
    Inventors: David Charles Minnen, Saurabh Singh, Johannes Balle, Troy Chinen, Sung Jin Hwang, Nicholas Johnston, George Dan Toderici
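
The tile-masking rule in the abstract of patent 11354822, listed above, can be illustrated with a minimal sketch. This is not the patented implementation. The function names `update_mask` and `run_iterations`, the use of a plain list of per-tile errors, and the reading of "satisfies an error threshold" as "error is less than or equal to the threshold" are all assumptions made for illustration.

```python
from typing import List, Set


def update_mask(masked: Set[int], tile_errors: List[float], threshold: float) -> Set[int]:
    """Return the tile indices that are masked off after one iteration.

    masked      -- tiles masked off during any previous iteration
    tile_errors -- reconstruction error of each tile, measured on the
                   reconstruction from the previous iterations' codes
    threshold   -- assumed rule: a tile is masked when error <= threshold
    """
    new_mask = set(masked)  # tiles masked earlier stay masked
    for idx, err in enumerate(tile_errors):
        if idx in new_mask:
            continue  # already masked off; its current error is ignored
        if err <= threshold:
            new_mask.add(idx)  # good enough: spend no more codes on it
    return new_mask


def run_iterations(errors_per_iteration: List[List[float]], threshold: float) -> List[List[int]]:
    """Apply update_mask once per iteration and record the mask each time."""
    masked: Set[int] = set()
    history = []
    for errors in errors_per_iteration:
        masked = update_mask(masked, errors, threshold)
        history.append(sorted(masked))
    return history


if __name__ == "__main__":
    # Hypothetical per-tile errors for three tiles over three iterations.
    errors = [[0.9, 0.2, 0.8], [0.6, 0.7, 0.1], [0.05, 0.9, 0.9]]
    print(run_iterations(errors, threshold=0.25))
```

In this sketch a tile that has been masked off is never reconsidered, so the set of masked tiles can only grow from one iteration to the next.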