Patents by Inventor Sahngwon RYOO

Sahngwon RYOO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250252137
    Abstract: Systems and methods of the present disclosure are directed to computer-implemented method for contextual processing via inter-model between pre-trained machine-learned models. The method includes obtaining, by a computing system comprising one or more computing devices, input data. The method includes processing, by the computing system, the input data with two or more pre-trained models to generate output data, wherein processing the input comprises executing a structured inter-model communication schema for inter-model communication between the two or more pre-trained models over a communications channel. The method includes providing, by the computing system, the output data as an output.
    Type: Application
    Filed: March 31, 2023
    Publication date: August 7, 2025
    Inventors: Andy Zeng, Adrian Wing Dak Wong, Stefan Welker, Krzysztof Choromanski, Federico Tombari, Aveek Ravishekhar Purohit, Michael Sahngwon Ryoo, Vikas Sindhwani, Johnny Chung Lee, Vincent Olivier Vanhoucke, Peter Raymond Florence
  • Publication number: 20250191267
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing videos and text using co-tokenization.
    Type: Application
    Filed: March 7, 2023
    Publication date: June 12, 2025
    Inventors: Anthony Jacob Piergiovanni, Anelia Angelova, Kairo Tiere Morton, Michael Sahngwon Ryoo, Weicheng Kuo
  • Publication number: 20240355109
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining one or more neural network architectures of a neural network for performing a video processing neural network task. In one aspect, a method comprises: at each of a plurality of iterations: selecting a parent neural network architecture from a set of neural network architectures; training a neural network having the parent neural network architecture to perform the video processing neural network task, comprising determining trained values of connection weight parameters of the parent neural network architecture; generating a new neural network architecture based at least in part on the trained values of the connection weight parameters of the parent neural network architecture; and adding the new neural network architecture to the set of neural network architectures.
    Type: Application
    Filed: June 18, 2024
    Publication date: October 24, 2024
    Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Mingxing Tan, Anelia Angelova
  • Patent number: 12046025
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining one or more neural network architectures of a neural network for performing a video processing neural network task. In one aspect, a method comprises: at each of a plurality of iterations: selecting a parent neural network architecture from a set of neural network architectures; training a neural network having the parent neural network architecture to perform the video processing neural network task, comprising determining trained values of connection weight parameters of the parent neural network architecture; generating a new neural network architecture based at least in part on the trained values of the connection weight parameters of the parent neural network architecture; and adding the new neural network architecture to the set of neural network architectures.
    Type: Grant
    Filed: May 22, 2020
    Date of Patent: July 23, 2024
    Assignee: Google LLC
    Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Mingxing Tan, Anelia Angelova
  • Publication number: 20240189994
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling an agent interacting with an environment. In one aspect, a method comprises: receiving a natural language text sequence that characterizes a task to be performed by the agent in the environment; generating an encoded representation of the natural language text sequence; and at each of a plurality of time steps: obtaining an observation image characterizing a state of the environment at the time step; processing the observation image to generate an encoded representation of the observation image; generating a sequence of input tokens; processing the sequence of input tokens to generate a policy output that defines an action to be performed by the agent in response to the observation image; selecting an action to be performed by the agent using the policy output; and causing the agent to perform the selected action.
    Type: Application
    Filed: December 13, 2023
    Publication date: June 13, 2024
    Inventors: Keerthana P G, Karol Hausman, Julian Ibarz, Brian Ichter, Alexander Irpan, Dmitry Kalashnikov, Yao Lu, Kanury Kanishka Rao, Michael Sahngwon Ryoo, Austin Charles Stone, Teddey Ming Xiao, Quan Ho Vuong, Sumedh Anand Sontakke
  • Publication number: 20230409899
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing a network input using a computer vision neural network with learned tokenization.
    Type: Application
    Filed: June 21, 2022
    Publication date: December 21, 2023
    Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Anelia Angelova, Anurag Arnab, Mostafa Dehghani
  • Publication number: 20230114556
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing a network input using a neural network to generate a network output.
    Type: Application
    Filed: July 14, 2021
    Publication date: April 13, 2023
    Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Anelia Angelova
  • Publication number: 20220366257
    Abstract: Generally, the present disclosure is directed to a neural architecture search process for finding small and fast video processing networks for understanding of video data. The neural architecture search process can automatically design networks that provide comparable video processing performance at a fraction of the computational and storage cost of larger existing models, thereby conserving computing resources such as memory and processor usage.
    Type: Application
    Filed: September 16, 2020
    Publication date: November 17, 2022
    Inventors: Anthony J. Piergiovanni, Anelia Angelova, Michael Sahngwon Ryoo
  • Publication number: 20220189154
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining one or more neural network architectures of a neural network for performing a video processing neural network task. In one aspect, a method comprises: at each of a plurality of iterations: selecting a parent neural network architecture from a set of neural network architectures; training a neural network having the parent neural network architecture to perform the video processing neural network task, comprising determining trained values of connection weight parameters of the parent neural network architecture; generating a new neural network architecture based at least in part on the trained values of the connection weight parameters of the parent neural network architecture; and adding the new neural network architecture to the set of neural network architectures.
    Type: Application
    Filed: May 22, 2020
    Publication date: June 16, 2022
    Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Mingxing Tan, Anelia Angelova
  • Publication number: 20120155711
    Abstract: Disclosed are an apparatus and a method for analyzing a video. The video analyzing apparatus includes a route analyzing unit configured to analyze a subject of a first-person view video and an object represented in the first-person view video based on the first-person view video and generate route information of the subject and the object; and an event analyzing unit configured to classify the first-person view video as a semantic event using the route information.
    Type: Application
    Filed: December 16, 2011
    Publication date: June 21, 2012
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Sahngwon RYOO, Jae Yeong LEE, Sung Lok CHOI, Won Pil YU
  • Publication number: 20120141094
    Abstract: Disclosed are a method and an apparatus for generating training videos and recognizing situations, using composed videos. The method for generating training videos using composed videos according to an exemplary embodiment of the present invention includes generating composed videos based on configuration information of an original video; selecting the composed videos satisfying structural constraints of situations among the generated composed videos; and configuring the training videos including the selected composed videos.
    Type: Application
    Filed: December 2, 2011
    Publication date: June 7, 2012
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Sahngwon RYOO, Won Pil YU