Patents by Inventor Arash Rezaei
Arash Rezaei has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240385873
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for performing preflight checks of a distributed computing system, are described. In one aspect, a method includes assigning a computing workload to a first subset of hardware accelerator machines each having one or more hardware accelerators. A preflight check on the first subset is performed before performing the computing workload to verify the functionality of each machine in the first subset. For each hardware accelerator machine of the first subset, a program code package is installed, including a task action based at least in part on characteristics of the computing workload. The task action including a sequence of operations is performed on the hardware accelerator machine to determine whether the task action fails. Whenever the task action fails, the computing workload is re-assigned to a second subset of hardware accelerator machines different from the first subset.
Type: Application
Filed: May 17, 2024
Publication date: November 21, 2024
Inventors: Jiafan Zhu, Jianqiao Liu, Xiangyu Dong, Xiao Zhang, Jikai Tang, Kexin Yang, Yong Zhao, Alireza Ghaffarkhah, Arash Rezaei, Dayou Du, Yazhou Zu, Xiangling Kong, Hoang-Vu Dang, Alexander Vadimovich Kolbasov
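The preflight flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: `Machine`, `run_task_action`, and `assign_with_preflight` are hypothetical names, the `healthy` flag stands in for real accelerator state, and trying candidate subsets in order is an assumption about how the "second subset" is chosen.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass(frozen=True)
class Machine:
    """Hypothetical hardware accelerator machine."""
    name: str
    healthy: bool  # stand-in for real accelerator state


# A task action is modeled as a sequence of operations; each returns True on success.
TaskAction = Sequence[Callable[[Machine], bool]]


def run_task_action(machine: Machine, action: TaskAction) -> bool:
    """Run every operation in order; the action fails if any operation fails."""
    return all(op(machine) for op in action)


def assign_with_preflight(
    subsets: Sequence[List[Machine]], action: TaskAction
) -> Optional[List[Machine]]:
    """Return the first subset in which every machine passes the preflight check.

    When a machine in the current subset fails, the workload moves on to the
    next candidate subset. None means no candidate subset passed.
    """
    for subset in subsets:
        if all(run_task_action(machine, action) for machine in subset):
            return subset
    return None


first = [Machine("a0", True), Machine("a1", False)]
second = [Machine("b0", True), Machine("b1", True)]
chosen = assign_with_preflight([first, second], [lambda m: m.healthy])
print([m.name for m in chosen])
```

In this sketch the task action is a single health probe; the abstract instead describes a task action chosen from the characteristics of the workload, which the sketch does not model.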
-
Patent number: 12020063
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for performing preflight checks of a distributed computing system, are described. In one aspect, a method includes assigning a computing workload to a first subset of hardware accelerator machines each having one or more hardware accelerators. A preflight check on the first subset is performed before performing the computing workload to verify the functionality of each machine in the first subset. For each hardware accelerator machine of the first subset, a program code package is installed, including a task action based at least in part on characteristics of the computing workload. The task action including a sequence of operations is performed on the hardware accelerator machine to determine whether the task action fails. Whenever the task action fails, the computing workload is re-assigned to a second subset of hardware accelerator machines different from the first subset.
Type: Grant
Filed: December 1, 2021
Date of Patent: June 25, 2024
Assignee: Google LLC
Inventors: Jiafan Zhu, Jianqiao Liu, Xiangyu Dong, Xiao Zhang, Jikai Tang, Kexin Yang, Yong Zhao, Alireza Ghaffarkhah, Arash Rezaei, Dayou Du, Yazhou Zu, Xiangling Kong, Hoang-Vu Dang, Alexander Vadimovich Kolbasov
-
Publication number: 20230168919
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for performing preflight checks of a distributed computing system, are described. In one aspect, a method includes assigning a computing workload to a first subset of hardware accelerator machines each having one or more hardware accelerators. A preflight check on the first subset is performed before performing the computing workload to verify the functionality of each machine in the first subset. For each hardware accelerator machine of the first subset, a program code package is installed, including a task action based at least in part on characteristics of the computing workload. The task action including a sequence of operations is performed on the hardware accelerator machine to determine whether the task action fails. Whenever the task action fails, the computing workload is re-assigned to a second subset of hardware accelerator machines different from the first subset.
Type: Application
Filed: December 1, 2021
Publication date: June 1, 2023
Inventors: Jiafan Zhu, Jianqiao Liu, Xiangyu Dong, Xiao Zhang, Jikai Tang, Kexin Yang, Yong Zhao, Alireza Ghaffarkhah, Arash Rezaei, Dayou Du, Yazhou Zu, Xiangling Kong, Hoang-Vu Dang, Alexander Vadimovich Kolbasov
-
Patent number: 10678550
Abstract: Methods are provided. A method includes capturing a snapshot of an offload process being executed by one or more many-core processors. The offload process is in signal communication with a host process being executed by a host processor. At least the offload is in signal communication with a monitoring process. The method further includes terminating the offload process on the one or more many-core processors, by the monitor process responsive to a communication between the monitor process and the offload processing being disrupted. The snapshot includes a respective predetermined minimum set of information required to restore a same state of the process as when the snapshot was taken.
Type: Grant
Filed: December 16, 2014
Date of Patent: June 9, 2020
Assignee: NEC Corporation
Inventors: Cheng-Hong Li, Giuseppe Coviello, Srimat Chakradhar, Arash Rezaei
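As a rough illustration of the monitor behavior described in this abstract, the sketch below terminates an offload process once communication with it is considered disrupted. Everything here is hypothetical: the class names, the dictionary used as process state, and in particular the heartbeat timeout, which is only one assumed way to detect a disrupted connection and is not stated in the abstract.

```python
class OffloadProcess:
    """Hypothetical offload process running on a many-core processor."""

    def __init__(self):
        self.alive = True
        self.state = {"iteration": 3}  # stand-in for the minimum restorable state

    def snapshot(self):
        """Capture a copy of the state needed to restore the process later."""
        return dict(self.state)

    def terminate(self):
        self.alive = False


class Monitor:
    """Hypothetical monitor that treats a missed heartbeat as disrupted communication."""

    def __init__(self, process, timeout):
        self.process = process
        self.timeout = timeout
        self.last_heartbeat = 0.0

    def heartbeat(self, now):
        """Record that the offload process was heard from at time `now`."""
        self.last_heartbeat = now

    def check(self, now):
        """Terminate the process if it has been silent longer than the timeout."""
        if self.process.alive and now - self.last_heartbeat > self.timeout:
            self.process.terminate()
        return self.process.alive


proc = OffloadProcess()
saved = proc.snapshot()           # snapshot captured while the process is running
monitor = Monitor(proc, timeout=5.0)
monitor.heartbeat(now=1.0)
still_alive = monitor.check(now=4.0)   # within the timeout
terminated = not monitor.check(now=10.0)  # silent for 9 time units
```

Time is passed in explicitly so the behavior is deterministic; a real monitor would read a clock and would communicate over an actual channel.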
-
Patent number: 10061523
Abstract: An embodiment includes a storage device, comprising: a memory; and a controller including a memory interface coupled to the memory, the controller configured to: receive write data to write to an address associated with first data stored in the memory and a first differentially compressed value stored in the memory; calculate a second differentially compressed value based on the write data and the first data; store the second differentially compressed value in the memory; and change the association of the address to reference the second differentially compressed value instead of the first differentially compressed value.
Type: Grant
Filed: March 30, 2016
Date of Patent: August 28, 2018
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Inventors: Arash Rezaei, Tameesh Suri, Bob Brennan
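The write path in this abstract can be sketched in a few lines. This is an illustration under stated assumptions, not the claimed controller: `DeltaStore` is a hypothetical name, the delta is computed as a byte-wise XOR against the first data and compressed with `zlib`, and writes are assumed to be the same length as the first data. The abstract does not specify any of these choices.

```python
import zlib


class DeltaStore:
    """Hypothetical in-memory model of a differentially compressed store."""

    def __init__(self):
        self._base = {}     # address -> first data (bytes)
        self._deltas = {}   # delta id -> compressed delta against the first data
        self._map = {}      # address -> delta id currently referenced (or None)
        self._next_id = 0

    def write(self, address, data):
        base = self._base.get(address)
        if base is None:
            # First write: store the data itself as the base for later deltas.
            self._base[address] = data
            self._map[address] = None
            return
        if len(data) != len(base):
            raise ValueError("this sketch assumes fixed-size writes")
        # Compute the new delta from the write data and the first data.
        delta = bytes(a ^ b for a, b in zip(base, data))
        delta_id = self._next_id
        self._next_id += 1
        self._deltas[delta_id] = zlib.compress(delta)
        # Re-point the address at the new delta instead of the previous one.
        self._map[address] = delta_id

    def read(self, address):
        base = self._base[address]
        delta_id = self._map[address]
        if delta_id is None:
            return base
        delta = zlib.decompress(self._deltas[delta_id])
        return bytes(a ^ b for a, b in zip(base, delta))


store = DeltaStore()
store.write(0, b"hello world")
store.write(0, b"hello WORLD")
store.write(0, b"HELLO world")
latest = store.read(0)
```

Each overwrite leaves the first data untouched and only changes which delta the address references; reclaiming superseded deltas is outside the scope of this sketch.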
-
Publication number: 20170206024
Abstract: An embodiment includes a storage device, comprising: a memory; and a controller including a memory interface coupled to the memory, the controller configured to: receive write data to write to an address associated with first data stored in the memory and a first differentially compressed value stored in the memory; calculate a second differentially compressed value based on the write data and the first data; store the second differentially compressed value in the memory; and change the association of the address to reference the second differentially compressed value instead of the first differentially compressed value.
Type: Application
Filed: March 30, 2016
Publication date: July 20, 2017
Inventors: Arash REZAEI, Tameesh SURI, Bob BRENNAN
-
Patent number: 9652247
Abstract: Methods are provided. A method for swapping-out an offload process from a coprocessor includes issuing a snapify_pause request from a host processor to the coprocessor to initiate a pausing of the offload process executing by the coprocessor and another process executing by the host processor using a plurality of locks. The offload process is previously offloaded from the host processor to the coprocessor. The method further includes issuing a snapify_capture request from the host processor to the coprocessor to initiate a local snapshot capture and saving of the local snapshot capture by the coprocessor. The method also includes issuing a snapify_wait request from the host processor to the coprocessor to wait for the local snapshot capture and the saving of the local snapshot capture to complete by the coprocessor.
Type: Grant
Filed: December 16, 2014
Date of Patent: May 16, 2017
Assignee: NEC Corporation
Inventors: Cheng-Hong Li, Giuseppe Coviello, Srimat Chakradhar, Arash Rezaei
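The ordering of the three requests named in this abstract (snapify_pause, snapify_capture, snapify_wait) can be shown with a small simulation. Only the request names come from the abstract; `FakeCoprocessor`, `swap_out`, the method signatures, and the use of a Python thread to model the asynchronous capture are assumptions, and the locks mentioned in the abstract are not modeled.

```python
import threading


class FakeCoprocessor:
    """Hypothetical stand-in for a coprocessor running an offload process."""

    def __init__(self):
        self.paused = False
        self.saved_snapshot = None
        self._state = {"counter": 7}  # stand-in for offload process state
        self._capture_thread = None

    def snapify_pause(self):
        """Pause the offload process so its state stops changing."""
        self.paused = True

    def snapify_capture(self):
        """Start capturing and saving a local snapshot without blocking the host."""
        if not self.paused:
            raise RuntimeError("capture requested before pause")

        def capture():
            self.saved_snapshot = dict(self._state)

        self._capture_thread = threading.Thread(target=capture)
        self._capture_thread.start()

    def snapify_wait(self):
        """Block until the capture and save started earlier have completed."""
        if self._capture_thread is None:
            raise RuntimeError("wait requested before capture")
        self._capture_thread.join()


def swap_out(coprocessor):
    """Issue the three requests in the order the abstract describes."""
    coprocessor.snapify_pause()
    coprocessor.snapify_capture()
    coprocessor.snapify_wait()
    return coprocessor.saved_snapshot


cop = FakeCoprocessor()
snapshot = swap_out(cop)
```

Splitting capture and wait into separate requests lets the host do other work while the snapshot is being saved, which is why the sketch models capture as a background thread.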
-
Publication number: 20150212823
Abstract: Methods are provided. A method for swapping-out an offload process from a coprocessor includes issuing a snapify_pause request from a host processor to the coprocessor to initiate a pausing of the offload process executing by the coprocessor and another process executing by the host processor using a plurality of locks. The offload process is previously offloaded from the host processor to the coprocessor. The method further includes issuing a snapify_capture request from the host processor to the coprocessor to initiate a local snapshot capture and saving of the local snapshot capture by the coprocessor. The method also includes issuing a snapify_wait request from the host processor to the coprocessor to wait for the local snapshot capture and the saving of the local snapshot capture to complete by the coprocessor.
Type: Application
Filed: December 16, 2014
Publication date: July 30, 2015
Inventors: Cheng-Hong Li, Giuseppe Coviello, Srimat Chakradhar, Arash Rezaei
-
Publication number: 20150212892
Abstract: Methods are provided. A method includes capturing a snapshot of an offload process being executed by one or more many-core processors. The offload process is in signal communication with a host process being executed by a host processor. At least the offload is in signal communication with a monitoring process. The method further includes terminating the offload process on the one or more many-core processors, by the monitor process responsive to a communication between the monitor process and the offload processing being disrupted. The snapshot includes a respective predetermined minimum set of information required to restore a same state of the process as when the snapshot was taken.
Type: Application
Filed: December 16, 2014
Publication date: July 30, 2015
Inventors: Cheng-Hong Li, Giuseppe Coviello, Srimat Chakradhar, Arash Rezaei