Patents by Inventor Ziyu Wang

Ziyu Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD FOR PREPARING HIGH-BULK-DENSITY IBUPROFEN SPHERICAL CRYSTAL AND PRODUCT THEREOF

Publication number: 20240076261

Abstract: The present invention discloses a method for preparing an ibuprofen spherical crystal having high bulk density and a product thereof. The method includes: (1) heating ibuprofen until a molten liquid state is reached; and (2) pressurizing the molten liquid state of ibuprofen, dropping the molten liquid state of ibuprofen into a crystallizer filled with water through a liquid distributor for crystallization under stirring conditions until the molten liquid state of ibuprofen is completely dropped, lowering the rotation speed of stirring to allow crystal growing, and finally, subjecting a resulting crystal slurry to post-treatment to obtain the ibuprofen spherical crystal having high bulk density. The ibuprofen spherical crystal has a bulk density of 0.50-0.70 g/mL, a tap density of 0.63-0.89 g/mL, a median particle size of 300-1,000 ?m and an angle of repose of 22-29°.

Type: Application

Filed: March 24, 2022

Publication date: March 7, 2024

Inventors: JINLONG LIU, SHUMIN LV, RUIJUN HU, XUEZHONG HE, MENGLIANG DU, XIAOCHUN WANG, ZIYU WANG, LANYING LIANG
DATA-DRIVEN ROBOT CONTROL

Publication number: 20240042600

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for data-driven robotic control. One of the methods includes maintaining robot experience data; obtaining annotation data; training, on the annotation data, a reward model; generating task-specific training data for the particular task, comprising, for each experience in a second subset of the experiences in the robot experience data: processing the observation in the experience using the trained reward model to generate a reward prediction, and associating the reward prediction with the experience; and training a policy neural network on the task-specific training data for the particular task, wherein the policy neural network is configured to receive a network input comprising an observation and to generate a policy output that defines a control policy for a robot performing the particular task.

Type: Application

Filed: June 8, 2023

Publication date: February 8, 2024

Inventors: Serkan Cabi, Ziyu Wang, Alexander Novikov, Ksenia Konyushkova, Sergio Gomez Colmenarejo, Scott Ellison Reed, Misha Man Ray Denil, Jonathan Karl Scholz, Oleg O. Sushkov, Rae Chan Jeong, David Barker, David Budden, Mel Vecerik, Yusuf Aytar, Joao Ferdinando Gomes de Freitas
REINFORCEMENT AND IMITATION LEARNING FOR A TASK

Publication number: 20230330848

Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.

Type: Application

Filed: April 25, 2023

Publication date: October 19, 2023

Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, János Kramár, Ziyu Wang, Nicolas Manfred Otto Heess
Iterative multiscale image generation using neural networks

Patent number: 11734797

Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.

Type: Grant

Filed: May 23, 2022

Date of Patent: August 22, 2023

Assignee: DeepMind Technologies Limited

Inventors: Nal Emmerich Kalchbrenner, Daniel Belov, Sergio Gomez Colmenarejo, Aaron Gerard Antonius van den Oord, Ziyu Wang, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
Data-driven robot control

Patent number: 11712799

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for data-driven robotic control. One of the methods includes maintaining robot experience data; obtaining annotation data; training, on the annotation data, a reward model; generating task-specific training data for the particular task, comprising, for each experience in a second subset of the experiences in the robot experience data: processing the observation in the experience using the trained reward model to generate a reward prediction, and associating the reward prediction with the experience; and training a policy neural network on the task-specific training data for the particular task, wherein the policy neural network is configured to receive a network input comprising an observation and to generate a policy output that defines a control policy for a robot performing the particular task.

Type: Grant

Filed: September 14, 2020

Date of Patent: August 1, 2023

Assignee: DeepMind Technologies Limited

Inventors: Serkan Cabi, Ziyu Wang, Alexander Novikov, Ksenia Konyushkova, Sergio Gomez Colmenarejo, Scott Ellison Reed, Misha Man Ray Denil, Jonathan Karl Scholz, Oleg O. Sushkov, Rae Chan Jeong, David Barker, David Budden, Mel Vecerik, Yusuf Aytar, Joao Ferdinando Gomes de Freitas
SYSTEM AND METHOD FOR FAST, POST-QUANTUM BLOCKCHAIN CONCENSUS GENERATION AND SMART CONTRACTS EXECUTION

Publication number: 20230186293

Abstract: A system for performing real-time quantum-safe computation of a digital transaction using in a blockchain consensus protocol, comprising a plurality of permissioned verification servers being a plurality of distributed participants that are adapted to create common randomization to all of said participants which remains unrevealed until being used by said participants, by assigning to each participant a unique polynomial having a maximal degree being common to all participants; allowing each participant to select a random value; allowing each participant to send his selected random value to all other participants using a secret sharing scheme based on points on his unique polynomial, such that said secret hides the details of said selected random value and all other participants that receive shares of said selected random value will not be able to reconstruct said selected random value from the received shares; create a pool of all shares of all participants; build a quantum-safe consensus of honest participa

Type: Application

Filed: April 27, 2021

Publication date: June 15, 2023

Inventors: Shlomi DOLEV, Ziyu WANG
Action selection neural network training using imitation learning in latent space

Patent number: 11663441

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection policy neural network, wherein the action selection policy neural network is configured to process an observation characterizing a state of an environment to generate an action selection policy output, wherein the action selection policy output is used to select an action to be performed by an agent interacting with an environment. In one aspect, a method comprises: obtaining an observation characterizing a state of the environment subsequent to the agent performing a selected action; generating a latent representation of the observation; processing the latent representation of the observation using a discriminator neural network to generate an imitation score; determining a reward from the imitation score; and adjusting the current values of the action selection policy neural network parameters based on the reward using a reinforcement learning training technique.

Type: Grant

Filed: September 27, 2019

Date of Patent: May 30, 2023

Assignee: DeepMind Technologies Limited

Inventors: Scott Ellison Reed, Yusuf Aytar, Ziyu Wang, Tom Paine, Sergio Gomez Colmenarejo, David Budden, Tobias Pfaff, Aaron Gerard Antonius van den Oord, Oriol Vinyals, Alexander Novikov
HYBRID POLYMERIC MATERIALS AND USES THEREOF

Publication number: 20230040485

Abstract: Disclosed herein is a hybrid polymeric material comprising a tropoelastin and a copolymer of a polyol monomer and a polycarboxylic acid monomer. The hybrid polymeric material is suitable for use as a tissue scaffold.

Type: Application

Filed: December 18, 2020

Publication date: February 9, 2023

Inventors: Anthony Steven WEISS, Suzanne Marie MITHIEUX, Ziyu WANG
ITERATIVE MULTISCALE IMAGE GENERATION USING NEURAL NETWORKS

Publication number: 20220284546

Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.

Type: Application

Filed: May 23, 2022

Publication date: September 8, 2022

Inventors: Nal Emmerich Kalchbrenner, Daniel Belov, Sergio Gomez Colmenarejo, Aaron Gerard Antonius van den Oord, Ziyu Wang, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
TRAINING A NEURAL NETWORK TO CONTROL AN AGENT USING TASK-RELEVANT ADVERSARIAL IMITATION LEARNING

Publication number: 20220261639

Abstract: A method is proposed of training a neural network to generate action data for controlling an agent to perform a task in an environment. The method includes obtaining, for each of a plurality of performances of the task, one or more first tuple datasets, each first tuple dataset comprising state data characterizing a state of the environment at a corresponding time during the performance of the task; and a concurrent process of training the neural network and a discriminator network. The training process comprises a plurality of neural network update steps and a plurality of discriminator network update steps.

Type: Application

Filed: July 16, 2020

Publication date: August 18, 2022

Inventors: Konrad Zolna, Scott Ellison Reed, Ziyu Wang, Alexander Novikov, Sergio Gomez Colmenarejo, Joao Ferdinando Gomes de Freitas, David Budden, Serkan Cabi
Iterative multiscale image generation using neural networks

Patent number: 11361403

Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.

Type: Grant

Filed: February 26, 2018

Date of Patent: June 14, 2022

Assignee: DeepMind Technologies Limited

Inventors: Nal Emmerich Kalchbrenner, Daniel Belov, Sergio Gomez Colmenarejo, Aaron Gerard Antonius van den Oord, Ziyu Wang, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
PROBE FOR DETECTING TRACE URANYL IONS AND PORTABLE ECL DETECTOR BASED ON THE SAME

Publication number: 20220018783

Abstract: A high-sensitivity, high-selectivity and portable detection method for trace uranyl ion is described. The method has an ultralow detection limit of 11 pM/2.6 ppt and is useful in precise monitoring of the uranium content in agricultural and sideline products, foods, environments and so on. The test instrument is miniaturized and low in cost to achieve high-precision portable measurement in the field. A conjugated polymer with aggregation-induced emission (AIE) activity is synthesized, and prepared into Pdots, and a uranyl-responsive electrochemiluminescence (ECL) probe is developed by modifying the Pdots with DNA or RNA, which serves as an adsorption ligand of uranyl ion. The probe exhibits good biocompatibility. The ECL technology can be used in uranyl ion detection and the method has extremely high sensitivity. A uranyl ion probe with AIE activity is also disclosed, which can be applied in portable precise monitoring of trace uranyl ion by means of the ECL technology.

Type: Application

Filed: November 20, 2019

Publication date: January 20, 2022

Inventors: Daoben HUA, Ziyu WANG, Jianbin PAN, Jingjuan XU
DATA-DRIVEN ROBOT CONTROL

Publication number: 20210078169

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for data-driven robotic control. One of the methods includes maintaining robot experience data; obtaining annotation data; training, on the annotation data, a reward model; generating task-specific training data for the particular task, comprising, for each experience in a second subset of the experiences in the robot experience data: processing the observation in the experience using the trained reward model to generate a reward prediction, and associating the reward prediction with the experience; and training a policy neural network on the task-specific training data for the particular task, wherein the policy neural network is configured to receive a network input comprising an observation and to generate a policy output that defines a control policy for a robot performing the particular task.

Type: Application

Filed: September 14, 2020

Publication date: March 18, 2021

Inventors: Serkan Cabi, Ziyu Wang, Alexander Novikov, Ksenia Konyushkova, Sergio Gomez Colmenarejo, Scott Ellison Reed, Misha Man Ray Denil, Jonathan Karl Scholz, Oleg O. Sushkov, Rae Chan Jeong, David Barker, David Budden, Mel Vecerik, Yusuf Aytar, Joao Ferdinando Gomes de Freitas
MULTISCALE IMAGE GENERATION

Publication number: 20210027425

Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.

Type: Application

Filed: February 26, 2018

Publication date: January 28, 2021

Inventors: Nal Emmerich Kalchbrenner, Daniel Belov, Sergio Gomez Colmenarejo, Aaron Gerard Antonius van den Oord, Ziyu Wang, Joao Gomes de Freitas, Scott Ellison Reed
TRAINING ACTION SELECTION NEURAL NETWORKS USING OFF-POLICY ACTOR CRITIC REINFORCEMENT LEARNING

Publication number: 20200293862

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection neural network. One of the methods includes maintaining a replay memory that stores trajectories generated as a result of interaction of an agent with an environment; and training an action selection neural network having policy parameters on the trajectories in the replay memory, wherein training the action selection neural network comprises: sampling a trajectory from the replay memory; and adjusting current values of the policy parameters by training the action selection neural network on the trajectory using an off-policy actor critic reinforcement learning technique.

Type: Application

Filed: May 28, 2020

Publication date: September 17, 2020

Inventors: Ziyu Wang, Nicolas Manfred Otto Heess, Victor Constant Bapst
End effector for autonomous object retrieval

Patent number: 10751882

Abstract: Features are disclosed for an end effector for automated identification and handling of an object. The end effector includes an end effector that can be positioned over a pick point of an overpackage in which a desired object is location using sensors. Using the location information, the end effector can identify a path to the pick point and detect whether the pick point is engaged by detecting environmental changes at the end effector.

Type: Grant

Filed: May 14, 2018

Date of Patent: August 25, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Tye Michael Brady, Anna Buchele, Juan Carlos del Rio, Rocco DiVerdi, Yuzhong Huang, Hunter Normandeau, Timothy Stallman, Ziyu Wang
Training action selection neural networks using off-policy actor critic reinforcement learning

Patent number: 10706352

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection neural network. One of the methods includes maintaining a replay memory that stores trajectories generated as a result of interaction of an agent with an environment; and training an action selection neural network having policy parameters on the trajectories in the replay memory, wherein training the action selection neural network comprises: sampling a trajectory from the replay memory; and adjusting current values of the policy parameters by training the action selection neural network on the trajectory using an off-policy actor critic reinforcement learning technique.

Type: Grant

Filed: May 3, 2019

Date of Patent: July 7, 2020

Assignee: DeepMind Technologies Limited

Inventors: Ziyu Wang, Nicolas Manfred Otto Heess, Victor Constant Bapst
ACTION SELECTION NEURAL NETWORK TRAINING USING IMITATION LEARNING IN LATENT SPACE

Publication number: 20200104680

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection policy neural network, wherein the action selection policy neural network is configured to process an observation characterizing a state of an environment to generate an action selection policy output, wherein the action selection policy output is used to select an action to be performed by an agent interacting with an environment. In one aspect, a method comprises: obtaining an observation characterizing a state of the environment subsequent to the agent performing a selected action; generating a latent representation of the observation; processing the latent representation of the observation using a discriminator neural network to generate an imitation score; determining a reward from the imitation score; and adjusting the current values of the action selection policy neural network parameters based on the reward using a reinforcement learning training technique.

Type: Application

Filed: September 27, 2019

Publication date: April 2, 2020

Inventors: Scott Ellison Reed, Yusuf Aytar, Ziyu Wang, Tom Paine, Sergio Gomez Colmenarejo, David Budden, Tobias Pfaff, Aaron Gerard Antonius van den Oord, Oriol Vinyals, Alexander Novikov
DATA EFFICIENT IMITATION OF DIVERSE BEHAVIORS

Publication number: 20200090042

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. One of the methods includes: obtaining data identifying a set of trajectories, each trajectory comprising a set of observations characterizing a set of states of the environment and corresponding actions performed by another agent in response to the states; obtaining data identifying an encoder that maps the observations onto embeddings for use in determining a set of imitation trajectories; determining, for each trajectory, a corresponding embedding by applying the encoder to the trajectory; determining a set of imitation trajectories by applying a policy defined by the neural network to the embedding for each trajectory; and adjusting parameters of the neural network based on the set of trajectories, the set of imitation trajectories and the embeddings.

Type: Application

Filed: November 19, 2019

Publication date: March 19, 2020

Inventors: Gregory Duncan Wayne, Joshua Merel, Ziyu Wang, Nicolas Manfred Otto Heess, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
Dueling deep neural networks

Patent number: 10572798

Abstract: Systems, methods, and apparatus, including computer programs encoded on a computer storage medium, for selecting an actions from a set of actions to be performed by an agent interacting with an environment. In one aspect, the system includes a dueling deep neural network. The dueling deep neural network includes a value subnetwork, an advantage subnetwork, and a combining layer. The value subnetwork processes a representation of an observation to generate a value estimate. The advantage subnetwork processes the representation of the observation to generate an advantage estimate for each action in the set of actions. The combining layer combines the value estimate and the respective advantage estimate for each action to generate a respective Q value for the action. The system selects an action to be performed by the agent in response to the observation using the respective Q values for the actions in the set of actions.

Type: Grant

Filed: November 11, 2016

Date of Patent: February 25, 2020

Assignee: DeepMind Technologies Limited

Inventors: Ziyu Wang, Joao Ferdinando Gomes de Freitas, Marc Lanctot

1 2 next