Patents by Inventor Stefan Schaal
Stefan Schaal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11986958Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using skill templates for robotic demonstration learning. One of the methods includes receiving a skill template for a task to be performed by a robot, wherein the skill template defines a state machine having a plurality of subtasks and one or more respective transition conditions between one or more of the subtasks. Local demonstration data for a demonstration subtask of the skill template is received, where the local demonstration data is generated from a user demonstrating how to perform the demonstration subtask with the robot. A machine learning model is refined for the demonstration subtask and the skill template is executed on the robot, causing the robot to transition through the state machine defined by the skill template to perform the task.Type: GrantFiled: May 21, 2020Date of Patent: May 21, 2024Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Benjamin M. Davis, Ralf Oliver Michael Schönherr, Ning Ye
-
Publication number: 20240157554Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using simulated local demonstration data for robotic demonstration learning. One of the methods includes receiving perceptual data of a workcell of a robot to be configured to execute a task according to a skill template, wherein the skill template specifies one or more subtasks required to perform the skill, wherein at least one of the subtasks is a demonstration subtask that relies on learning visual characteristics of the workcell. A virtual model is generated of a portion of the workcell. A training system generates simulated local demonstration data from the virtual model of the portion of the workcell and tunes a base control policy for the demonstration subtask using the simulated local demonstration data generated from the virtual model of the portion of the workcell.Type: ApplicationFiled: November 20, 2023Publication date: May 16, 2024Inventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Benjamin M. Davis, Ning Ye
-
Publication number: 20240111571Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a robot using symbolic states. One of the methods includes receiving a definition of a task having multiple task states, wherein each task state is associated with a different respective control policy; executing the task using an initial control policy associated with an initial task state; during execution of the task, continually generating, from sensor data, a prediction of the task state of the task; and upon determining that a transition to a different task state has occurred, transitioning the robot to a different control policy associated with the different task state.Type: ApplicationFiled: September 22, 2022Publication date: April 4, 2024Inventors: Wenzhao Lian, Stefan Schaal, Takatoki Migimatsu
-
Publication number: 20240096077Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training an autoencoder. One of the methods includes receiving a pair of images, wherein a first image of the pair represents a first state of a working environment at a first time step earlier than that of a second image of the pair. A first and a second latent representation is generated respectively for the first and the second image by the autoencoder; A predicted reward for an action executed in the first state is generated by a reward prediction neural network. A predicted next latent representation for the first state and the action is generated by a dynamics prediction neural network. An overall loss is determined based on the predicted reward, the predicted next latent representation, and the second latent representation. Model parameters of the autoencoder are updated to reduce the overall loss.Type: ApplicationFiled: September 15, 2022Publication date: March 21, 2024Inventors: Jianlan Luo, Stefan Schaal
-
Patent number: 11820014Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using simulated local demonstration data for robotic demonstration learning. One of the methods includes receiving perceptual data of a workcell of a robot to be configured to execute a task according to a skill template, wherein the skill template specifies one or more subtasks required to perform the skill, wherein at least one of the subtasks is a demonstration subtask that relies on learning visual characteristics of the workcell. A virtual model is generated of a portion of the workcell. A training system generates simulated local demonstration data from the virtual model of the portion of the workcell and tunes a base control policy for the demonstration subtask using the simulated local demonstration data generated from the virtual model of the portion of the workcell.Type: GrantFiled: May 21, 2020Date of Patent: November 21, 2023Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Benjamin M. Davis, Ning Ye
-
Publication number: 20230356393Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing skill templates for robotic demonstration learning. One of the methods includes receiving from the user device by a skill template distribution system, a selection of an available skill template. The skill template distribution system provides a skill template, wherein the skill template comprises information representing a state machine of one or more tasks, and wherein the skill template specifies which of the one or more tasks are demonstration subtasks requiring local demonstration data. The skill template distribution system trains a machine learning model for the demonstration subtask using a local demonstration data to generate learned parameter values.Type: ApplicationFiled: June 26, 2023Publication date: November 9, 2023Inventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Benjamin M. Davis, Ralf Oliver Michael Schönherr, Ning Ye
-
Patent number: 11780086Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a demonstration device for robotic demonstration learning. One of the methods includes generating, by a demonstration device for a robot, a representation of a sequence of states input by a user of the demonstration device. The representation is provided by the demonstration device to a robot execution system. The representation of the sequence of actions is translated into a plurality of robot commands corresponding to the representation of the sequence of states input by the user on the demonstration device. The plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device are executed. Demonstration data is generated from one or more sensor streams of the robot while executing the plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device.Type: GrantFiled: October 17, 2022Date of Patent: October 10, 2023Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Ning Ye
-
Publication number: 20230241773Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for robotic control using demonstrations to learn category-level manipulation task. One of the methods includes obtaining a collection of object models for a plurality of different types of objects belonging to a same object category and training a category-level representation in a category-level space from the collection of object models. A category-level trajectory is generated the demonstration data of a demonstration object. For a new object in the object category, a trajectory projection is generated in the category-level space, which is used to cause a robot to perform the robotic manipulation task on the new object.Type: ApplicationFiled: January 30, 2023Publication date: August 3, 2023Inventors: Wenzhao Lian, Bowen Wen, Stefan Schaal
-
Patent number: 11685047Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing skill templates for robotic demonstration learning. One of the methods includes receiving, from the user device by a skill template distribution system, a selection of an available skill template. The skill template distribution system provides a skill template, wherein the skill template comprises information representing a state machine of one or more tasks, and wherein the skill template specifies which of the one or more tasks are demonstration subtasks requiring local demonstration data. The skill template distribution system trains a machine learning model for the demonstration subtask using a local demonstration data to generate learned parameter values.Type: GrantFiled: May 21, 2020Date of Patent: June 27, 2023Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Benjamin M. Davis, Ralf Oliver Michael Schönherr, Ning Ye
-
Patent number: 11679497Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributed robotic demonstration learning. One of the methods includes receiving a skill template to be trained to cause a robot to perform a particular skill having a plurality of subtasks. One or more demonstration subtasks defined by the skill template are identified, wherein each demonstration subtask is an action to be refined using local demonstration data. On online execution system uploads sets of local demonstration data to a cloud-based training system. The cloud-based training system generates respective trained model parameters for each set of local demonstration data. The skill template is executed on the robot using the trained model parameters generated by the cloud-based training system.Type: GrantFiled: May 21, 2020Date of Patent: June 20, 2023Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Benjamin M. Davis, Ralf Oliver Michael Schönherr, Ning Ye
-
Patent number: 11638936Abstract: A cleaning device includes: a treatment container; a workpiece carrier arranged in the treatment container and configured to hold at least one workpiece; at least one nozzle configured to discharge a cleaning jet directed onto the workpiece carrier and mounted such that the at least one nozzle is moveable on a circulation track about the workpiece carrier and is pivotable about a pivoting axis extending parallel to an axis of rotation of the workpiece carrier; a pivoting device configured to pivot the at least one nozzle; and a controller configured to control a circulating movement of the at least one nozzle on the circulation track and a pivoting movement of the at least one nozzle, such that a specified point on a surface of the workpiece is impacted repeatedly by the cleaning jet at a respectively different angle, within a specified timeframe.Type: GrantFiled: August 11, 2022Date of Patent: May 2, 2023Assignees: MAFAC—E. SCHWARZ GMBH & CO., KG MASCHINENFABRIKInventors: Stefan Schaal, Steffen Haas
-
Publication number: 20230114561Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a demonstration device for robotic demonstration learning. One of the methods includes generating, by a demonstration device for a robot, a representation of a sequence of states input by a user of the demonstration device. The representation is provided by the demonstration device to a robot execution system. The representation of the sequence of actions is translated into a plurality of robot commands corresponding to the representation of the sequence of states input by the user on the demonstration device. The plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device are executed. Demonstration data is generated from one or more sensor streams of the robot while executing the plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device.Type: ApplicationFiled: October 17, 2022Publication date: April 13, 2023Inventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Ning Ye
-
Publication number: 20230095351Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a robotic control policy to perform a particular task. One of the methods includes performing a meta reinforcement learning phase including using training data collected for a plurality of different robotic control tasks and updating a robotic control policy according to the training data, wherein the robotic control policy is conditioned on an encoder network that is trained to predict which task is being performed from a context of a robotic operating environment; and performing an adaptation phase using a plurality of demonstrations for the particular task, including iteratively updating the encoder network after processing each demonstration of the plurality of demonstrations, thereby training the encoder network to learn environmental features of successful task runs.Type: ApplicationFiled: September 15, 2022Publication date: March 30, 2023Inventors: Jianlan Luo, Stefan Schaal, Sergey Vladimir Levine, Zihao Zhao
-
Patent number: 11534913Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for integrating sensor streams for robotic demonstration learning. One of the methods includes selecting, by a learning system for a robot, a base update rate for combining multiple sensor streams into a task state representation. The learning system repeatedly generates the task state representation at the base update rate, including combining, during each time period defined by the update rate, the task state representation from most recently updated sensor data processed by the plurality of neural networks. The learning system repeatedly uses the task state representations to generate commands for the robot at the base update rate.Type: GrantFiled: May 21, 2020Date of Patent: December 27, 2022Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Benjamin M. Davis, Ning Ye
-
Publication number: 20220402140Abstract: A computer-implemented method comprising, receiving data representing a successful trajectory for an insertion task using a robot to insert a connector into a receptacle, performing a parameter optimization process for the robot to perform the insertion task. This parameter optimization includes defining an objective function that measures a similarity of a current trajectory generated with a current set of parameters to the successful trajectory and repeatedly modifying the current set of parameters and evaluating the modified set of parameters according to the objective function until generating a final set of parameters.Type: ApplicationFiled: June 21, 2022Publication date: December 22, 2022Inventors: Wenzhao Lian, Stefan Schaal, Zheng Wu
-
Publication number: 20220402128Abstract: A computer-implemented method includes obtaining a collection of object models for a plurality of different types of objects belonging to a same object category, generating a canonical representation for objects belonging to the object category, performing a plurality of downstream tasks using a plurality of different robot grasps on instances of objects belonging to the category and evaluating each grasp according to success or failure of the downstream task; and generating one or more category-level grasping areas for the canonical representation for objects belonging to the object category including aggregating the evaluations of grasps according to the downstream task.Type: ApplicationFiled: June 21, 2022Publication date: December 22, 2022Inventors: Wenzhao Lian, Stefan Schaal, Bowen Wen
-
Publication number: 20220388039Abstract: A cleaning device includes: a treatment container; a workpiece carrier arranged in the treatment container and configured to hold at least one workpiece; at least one nozzle configured to discharge a cleaning jet directed onto the workpiece carrier and mounted such that the at least one nozzle is moveable on a circulation track about the workpiece carrier and is pivotable about a pivoting axis extending parallel to an axis of rotation of the workpiece carrier; a pivoting device configured to pivot the at least one nozzle; and a controller configured to control a circulating movement of the at least one nozzle on the circulation track and a pivoting movement of the at least one nozzle, such that a specified point on a surface of the workpiece is impacted repeatedly by the cleaning jet at a respectively different angle, within a specified timeframe.Type: ApplicationFiled: August 11, 2022Publication date: December 8, 2022Inventors: Stefan Schaal, Steffen Haas
-
Patent number: 11472025Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a demonstration device for robotic demonstration learning. One of the methods includes generating, by a demonstration device for a robot, a representation of a sequence of states input by a user of the demonstration device. The representation is provided by the demonstration device to a robot execution system. The representation of the sequence of actions is translated into a plurality of robot commands corresponding to the representation of the sequence of states input by the user on the demonstration device. The plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device are executed. Demonstration data is generated from one or more sensor streams of the robot while executing the plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device.Type: GrantFiled: May 21, 2020Date of Patent: October 18, 2022Assignee: Intrinsic Innovation LLCInventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Ning Ye
-
Publication number: 20210362328Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a demonstration device for robotic demonstration learning. One of the methods includes generating, by a demonstration device for a robot, a representation of a sequence of states input by a user of the demonstration device. The representation is provided by the demonstration device to a robot execution system. The representation of the sequence of actions is translated into a plurality of robot commands corresponding to the representation of the sequence of states input by the user on the demonstration device. The plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device are executed. Demonstration data is generated from one or more sensor streams of the robot while executing the plurality of robot commands corresponding to the sequence of actions input by the user on the demonstration device.Type: ApplicationFiled: May 21, 2020Publication date: November 25, 2021Inventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Ning Ye
-
Publication number: 20210362329Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for integrating sensor streams for robotic demonstration learning. One of the methods includes selecting, by a learning system for a robot, a base update rate for combining multiple sensor streams into a task state representation. The learning system repeatedly generates the task state representation at the base update rate, including combining, during each time period defined by the update rate, the task state representation from most recently updated sensor data processed by the plurality of neural networks. The learning system repeatedly uses the task state representations to generate commands for the robot at the base update rate.Type: ApplicationFiled: May 21, 2020Publication date: November 25, 2021Inventors: Bala Venkata Sai Ravi Krishna Kolluri, Stefan Schaal, Ralf Oliver Michael Schönherr, Benjamin M. Davis, Ning Ye