Restarting Or Rejuvenating (epo) Patents (Class 714/E11.137)

Electronic device and method with on-demand accelerator checkpointing

Patent number: 12019520

Abstract: An electronic device and method with on-demand accelerator checkpointing are provided. In one general aspect, an electronic device includes a host processor, and an accelerator configured to operate according to instructions transmitted by the host processor to the accelerator, wherein a memory of the host processor and a memory of the accelerator are respectively checkpointed to a storage at respective different intervals, and in response to a determination that a failure has occurred in the host processor, the memory of the accelerator is checkpointed to the storage.

Type: Grant

Filed: November 21, 2022

Date of Patent: June 25, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jaehyung Ahn, Seongbeom Kim, Byungwoo Bang, Uiseok Song, Junyeon Lee, Wooseok Chang, Hun Seong Choi
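
The checkpointing behaviour described in this abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: the class name `CheckpointScheduler`, the tick-based clock, and the interval values are all hypothetical, and a Python list stands in for persistent storage.

```python
from dataclasses import dataclass, field


@dataclass
class CheckpointScheduler:
    """Toy model: host and accelerator memory are saved at different intervals."""
    host_interval: int   # ticks between host checkpoints (assumed unit)
    accel_interval: int  # ticks between accelerator checkpoints (assumed unit)
    storage: list = field(default_factory=list)  # stands in for persistent storage

    def tick(self, now: int, host_failed: bool = False) -> None:
        if host_failed:
            # On-demand path: a host failure triggers an immediate
            # accelerator checkpoint, regardless of its regular interval.
            self.storage.append((now, "accelerator", "on-demand"))
            return
        if now % self.host_interval == 0:
            self.storage.append((now, "host", "periodic"))
        if now % self.accel_interval == 0:
            self.storage.append((now, "accelerator", "periodic"))


def simulate(failure_tick: int, total_ticks: int = 10) -> list:
    """Run the scheduler until the host fails or the ticks run out."""
    sched = CheckpointScheduler(host_interval=2, accel_interval=5)
    for now in range(1, total_ticks + 1):
        failed = now == failure_tick
        sched.tick(now, host_failed=failed)
        if failed:
            break
    return sched.storage
```

With a failure at tick 7, the last record in the returned list is the on-demand accelerator checkpoint, so the accelerator state saved is newer than its last periodic checkpoint at tick 5.
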
Software safe shutdown system, software safe shutdown method, and program to prevent a problem caused by a system failure

Patent number: 9588798

Abstract: Provided is a software safe shutdown system, comprising: an OS anomalous shutdown detection unit, which detects that an operating system of a virtual machine which is executed by a computer system has had an anomalous shutdown due to a memory leak; a memory resource securing unit which secures, from among a usable memory resource within the computer system, a memory resource which is necessary to recover a console function of the anomalously shutdown virtual machine; and a console acquisition unit which allocates the secured memory resource to the anomalously shutdown virtual machine, and which, after recovering the console function, normally shuts down the virtual machine using the console function.

Type: Grant

Filed: August 26, 2013

Date of Patent: March 7, 2017

Assignee: NEC CORPORATION

Inventor: Fumio Machida
System and Methods for Self-Healing From Operating System Faults in Kernel/Supervisory Mode

Publication number: 20140032962

Abstract: A method, apparatus, and computing system are provided for self-healing from a fault in an operating system (OS) without crashing the OS. In one example, the method includes detecting, isolating, and repairing a faulty kernel code module without affecting the run-time operation of other, properly functioning portions of the OS. In another example, the apparatus comprises a variety of modules, including, for instance, a kernel level fault detector and a kernel level fault manager, that are configured to execute steps of the method. In another example, the computing system includes an OS that has at least some modularized kernel code sections, and a self-healing code for detecting, isolating, and repairing failures occurring in one or more of the modularized kernel code sections without crashing the OS.

Type: Application

Filed: July 26, 2012

Publication date: January 30, 2014

Applicant: FUTUREWEI TECHNOLOGIES, INC.

Inventor: Bhalchandra Dattatray Deshpande
METHODS, APPARATUS, AND SYSTEMS FOR ELECTRONIC DEVICE RECOVERY

Publication number: 20130283091

Abstract: Methods, apparatus, and systems for electronic device recovery are disclosed. An example method includes determining that a software request received from a computing device includes an indication of a repair mode of an electronic device, determining a characteristic of the electronic device, determining software to be provided to the electronic device based on the characteristic, and in response to determining that the software request includes the indication of the repair mode, transmitting location information for the software to the computing device.

Type: Application

Filed: April 19, 2012

Publication date: October 24, 2013

Inventors: Spencer Leavitt George Quin, Andrey Feldman, Robert David Turner, Timothy Richard Tyhurst
METHOD FOR AUTOMATICALLY RELOADING SOFTWARE AND A DEVICE FOR AUTOMATICALLY RELOADING SOFTWARE

Publication number: 20130024727

Abstract: Method for automatically reloading software characterized in that it comprises: a step of detecting corruption (E101) of at least one part of a software package of an on-board programmable device (10-1, 10-2, 10-n); and, in response to signaling, a step of reloading (E103) a non-corrupt version of the said at least one corrupt part of the software in order to replace the said at least one corrupt part of the software.

Type: Application

Filed: July 11, 2012

Publication date: January 24, 2013

Applicant: AIRBUS OPERATIONS (S.A.S.)

Inventors: Anne Frayssignes, Nicolas Caule
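
The detect-then-reload sequence in this abstract can be sketched as follows. The abstract does not say how corruption is detected; comparing SHA-256 digests is an assumption made here for illustration, and the function names, part names, and byte strings are hypothetical.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Digest used here as the (assumed) corruption check."""
    return hashlib.sha256(data).hexdigest()


def reload_corrupt_parts(installed: dict, pristine: dict) -> list:
    """Replace every installed part whose digest differs from the pristine copy.

    Both arguments map a part name to its bytes. Returns the names reloaded.
    """
    reloaded = []
    for name, good in pristine.items():
        expected = sha256_hex(good)
        current = installed.get(name)
        if current is None or sha256_hex(current) != expected:
            installed[name] = good  # reload only the corrupt part
            reloaded.append(name)
    return reloaded


# Hypothetical software package with one damaged part.
installed = {"nav": b"v1-nav", "display": b"v1-disp\x00garbage"}
pristine = {"nav": b"v1-nav", "display": b"v1-display"}
print(reload_corrupt_parts(installed, pristine))  # ['display']
```

Only the damaged part is replaced; parts whose digests match are left untouched, which mirrors the abstract's "at least one corrupt part" wording.
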
ADDING SCALABILITY AND FAULT TOLERANCE TO GENERIC FINITE STATE MACHINE FRAMEWORKS FOR USE IN AUTOMATED INCIDENT MANAGEMENT OF CLOUD COMPUTING INFRASTRUCTURES

Publication number: 20120331342

Abstract: A scalable and fault tolerant finite state machine engine, for example, for use in an automated incident management system, logs or records data in persistent storage at different points or levels during various internal processing of an event associated with an information technology element, and action taken associated with the event, by executing a finite state machine instance that encodes policies for handling incidents on such types of information technology elements. In the event that the finite state machine engine is shut down during processing, the finite state machine engine is able to pick up from where it left off when it was shut down, for each abnormally terminated finite state machine instance, by using the data logged in the persistent storage and determining a point of processing from where it should continue its execution.

Type: Application

Filed: September 5, 2012

Publication date: December 27, 2012

Applicant: International Business Machines Corporation

Inventors: Michael M. Behrendt, Rafah A. Hosn, Ruchi Mahindru, Harigovind V. Ramasamy, Soumitra Sarkar, Mahesh Viswanathan, Norbert G. Vogl
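
The resume-from-log idea in this abstract can be sketched with a toy state machine that records each completed step in a file and, on restart, continues after the last recorded step. This is a minimal sketch under assumptions: the step names, the JSON-lines log format, and the `crash_after` parameter are hypothetical, and a step counts as complete once its record is written.

```python
import json
import os
import tempfile

STEPS = ["received", "diagnosed", "remediated", "closed"]  # hypothetical incident states


def run_instance(log_path, crash_after=None):
    """Advance one FSM instance, logging each completed step to disk.

    Returns the steps executed in this call. `crash_after` simulates an
    engine shutdown after that many steps have run in this call.
    """
    done = []
    if os.path.exists(log_path):
        with open(log_path) as fh:
            done = [json.loads(line)["step"] for line in fh if line.strip()]
    executed = []
    # Resume point: everything already in the log is skipped.
    for step in STEPS[len(done):]:
        if crash_after is not None and len(executed) >= crash_after:
            break  # simulated abnormal termination
        with open(log_path, "a") as fh:
            fh.write(json.dumps({"step": step}) + "\n")
            fh.flush()
            os.fsync(fh.fileno())  # make the record durable before moving on
        executed.append(step)
    return executed


def demo():
    """Run once with a simulated shutdown, then restart and finish."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "instance-1.log")
        first = run_instance(path, crash_after=2)
        second = run_instance(path)
        return first, second
```

In `demo()`, the first run stops after two steps and the second run picks up with the remaining two, without repeating logged work.
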
METHOD, SYSTEM AND ARTICLE OF MANUFACTURE FOR SYSTEM RECOVERY

Publication number: 20120216075

Abstract: Provided are a method, system, and article of manufacture for system recovery. An operating system and a backup copy of the operating system are both maintained in a partition of a computational device. A boot loader receives an indication to load the backup copy of the operating system. The boot loader loads the backup copy of the operating system. The computational device is rebooted with the loaded backup copy of the operating system.

Type: Application

Filed: May 3, 2012

Publication date: August 23, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Christopher J. Stakutis, Kevin Michael Stearns, Jennifer Martin Shaw
ENHANCED ERROR HANDLING FOR SELF-VIRTUALIZING INPUT/OUTPUT DEVICE IN LOGICALLY-PARTITIONED DATA PROCESSING SYSTEM

Publication number: 20120159245

Abstract: Error handling is simplified for a self-virtualizing IO resource that utilizes a physical function adjunct partition for a physical function in the self-virtualizing IO resource to coordinate error recovery for the self-virtualizing IO resource, by restarting each virtual function adjunct partition associated with that physical function to avoid the need to coordinate error recovery within the logical partitions to which such virtual function adjunct partitions are assigned.

Type: Application

Filed: December 15, 2010

Publication date: June 21, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sean T. Brownlow, Charles S. Graham, Andrew T. Koch, Adam C. Lange-Pearson, Kyle A. Lucke, Gregory M. Nordstrom, John R. Oberly, III
Hardware turnkey mobility

Publication number: 20120151257

Abstract: A system and method for producing a factory build of an embedded appliance is disclosed. The method comprises providing an embedded appliance having first and second digital storage locations, with the second location including a boot manager and a recovery operating system. A factory image of operational software is copied to the secondary digital storage location. The embedded appliance is configured to boot from the second digital storage location at an initial startup and load a recovery operating system to copy the factory image to the primary digital storage location to enable the embedded appliance to operate from the operational section using the operational software.

Type: Application

Filed: February 14, 2011

Publication date: June 14, 2012

Inventor: Terry Tam
Method and System for Cluster Resource Management in a Virtualized Computing Environment

Publication number: 20110231696

Abstract: Methods and systems for cluster resource management in virtualized computing environments are described. VM spares are used to reserve (or help discover or otherwise obtain) a set of computing resources for a VM. While VM spares may be used for a variety of scenarios, particular uses of VM spares include using spares to ensure resource availability for requests to power on VMs as well as for discovering, obtaining, and defragmenting the resources and VMs on a cluster, e.g., in response to requests to reserve resources for a VM or to respond to a notification of a failure for a given VM.

Type: Application

Filed: March 17, 2010

Publication date: September 22, 2011

Applicant: VMWARE, INC.

Inventors: Minwen JI, Elisha ZISKIND, Anne Marie HOLLER
REPRODUCING APPARATUS, INTEGRATED CIRCUIT, AND REPRODUCING METHOD

Publication number: 20110087921

Abstract: A reproducing apparatus 100 that reproduces content recorded in a recording medium 101 avoids reentering the same fault state and becoming unrecoverable at the time of occurrence of an unrecoverable CPU error, such as misalignment. Fault information is stored in nonvolatile memory, or the like, and the thus-stored fault information is used to cause reproduction control processing to branch before execution of a processing routine to be invoked at the time of occurrence of an unrecoverable CPU error, such as misalignment, thereby preventing the reproducing apparatus from reentering the same fault state.

Type: Application

Filed: December 6, 2010

Publication date: April 14, 2011

Applicant: PANASONIC CORPORATION

Inventors: Kenichi TACHIBANA, Hisashi Fukugawa, Hiroko Mori, Michimasa Okubo, Shinya Taira
METHOD FOR COMMANDING AND PERFORMING NETWORK ENTRY

Publication number: 20110016350

Abstract: A method for commanding and performing network entry is disclosed. The method for commanding network entry using a non-periodic message in a Base Station (BS) includes broadcasting, one or more times, a first pattern indicating readiness for restart to a Mobile Station (MS) when the BS determines to restart due to a serious error of the BS; performing a restart procedure; and broadcasting a second pattern indicating network entry to the MS upon completion of the restart procedure.

Type: Application

Filed: April 2, 2009

Publication date: January 20, 2011

Inventors: Hee Jeong Cho, Ki Seon Ryu
HIGH AVAILABILITY SUPPORT FOR VIRTUAL MACHINES

Publication number: 20100325471

Abstract: A computer implemented method, a tangible computer storage medium, and a data processing system provide high availability support for virtual machines in a logical partitioned platform. A monitoring system detects a failure in the virtual machine. Partition management firmware then restarts the virtual machine in a consistency failover image node utilizing a consistency failover image. If a subsequent failure of the virtual machine is detected within a predetermined time, partition management firmware restarts the virtual machine in a boot failover image node utilizing a boot failover image.

Type: Application

Filed: June 17, 2009

Publication date: December 23, 2010

Applicant: International Business Machines Corporation

Inventors: Rajeev Mishra, Lance Warren Russell, Ravi A. Shankar
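
The two-tier restart policy in this abstract can be sketched as a small decision function. It is an illustration only: the function name and the timestamps are hypothetical, and measuring the "predetermined time" from the previous failure (rather than from the previous restart) is an assumption made for simplicity.

```python
def choose_failover_image(failure_times, window_seconds):
    """Return which image to restart from for each detected failure.

    A failure that follows the previous one within `window_seconds` is taken
    to mean the consistency image did not help, so the boot image is used.
    """
    choices = []
    last_failure = None
    for t in failure_times:
        if last_failure is not None and t - last_failure <= window_seconds:
            choices.append("boot")
        else:
            choices.append("consistency")
        last_failure = t
    return choices


# Failures at 0 s, 30 s and 500 s with a 60 s window.
print(choose_failover_image([0, 30, 500], 60))
# ['consistency', 'boot', 'consistency']
```

The second failure arrives 30 seconds after the first and so falls back to the boot image, while the third is far enough away to be treated as a fresh failure.
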
Restarting Method Using a Snapshot

Publication number: 20100088543

Abstract: The active server A101 notifies its own operating status to the administration server 106, and the administration server 106 acquires a snapshot and copies a disk according to the policy established by a user. When the active server A101 or the active disk 113 fails, the administration server 106 chooses a backup server A104 and a status storage disk A114 to be booted and restarts them by using the snapshot according to the policy.

Type: Application

Filed: December 9, 2009

Publication date: April 8, 2010

Applicant: Hitachi, Ltd.

Inventors: Yasuyuki KAMBARA, Yoshifumi Takamoto, Keisuke Hatasaki, Kouji Masuda
APPARATUS AND METHODS FOR RESTORING SYSTEM OPERATION STATES

Publication number: 20090259886

Abstract: A process for restoring an operational state of a portable handheld device is provided. The device may include multiple computing units and persistent storage. The operational state may be generated by a sequence of events. The operational state may receive signals corresponding to a plurality of event types. The process may include selecting an event type for storage, storing in the persistent storage events corresponding to the selected event type, receiving a signal indicating an interruption of operation, and transmitting the stored events to restore the device to the operational state.

Type: Application

Filed: April 15, 2008

Publication date: October 15, 2009

Applicant: Broadcom Corporation

Inventors: Li Li, Jin Guo
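
The select, store, and replay steps in this abstract can be sketched as follows. This is a hedged illustration: the event tuple layout `(type, key, value)`, the event type names, and the two function names are hypothetical, and a list stands in for persistent storage.

```python
def record(events, selected_types):
    """Keep only events whose type was selected for persistent storage."""
    return [e for e in events if e[0] in selected_types]


def replay(stored, initial_state=None):
    """Rebuild the operational state by re-applying stored events in order."""
    state = dict(initial_state or {})
    for _kind, key, value in stored:
        state[key] = value
    return state


# Hypothetical event stream: only "setting" events are worth restoring.
events = [
    ("setting", "volume", 3),
    ("sensor", "temp", 40),
    ("setting", "volume", 7),
    ("setting", "mode", "silent"),
]
stored = record(events, {"setting"})
print(replay(stored))  # {'volume': 7, 'mode': 'silent'}
```

Because the state is generated by a sequence of events, replaying the stored subset in order reproduces the selected part of the state after an interruption, while unselected event types (here the sensor reading) are not restored.
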
Method, operating system and computing hardware for running a computer program

Publication number: 20090254773

Abstract: A method for running a computer program on computing hardware, in particular on a microprocessor, is described, the computer program including multiple program objects and errors being detected in the method while running the computer program on the computing hardware, the program objects being subdivided into at least two classes and program objects of the first class being repeated when an error is detected and, when an error is detected in one program object of the first class, which has already been sent for execution, this program object of the first class being restarted instead of a program object of the second class.

Type: Application

Filed: October 19, 2005

Publication date: October 8, 2009

Inventors: Reinhard Weiberle, Bernd Mueller, Werner Harter, Ralf Angerbauer, Thomas Kottke, Yorck von Collani, Rainer Gmehlich
FAILURE RECOVERY METHOD IN CLUSTER SYSTEM

Publication number: 20090138757

Abstract: There is provided a method executed in a cluster system comprising a first computer and at least one second computer that stores the data transmitted from the first computer. The method comprises the steps of: judging, based on the system information, whether the processing in which a failure has occurred is to be restarted by the first computer or switched over to the second computer; when it is judged that the processing is to be restarted by the first computer, receiving the stored data from the second computer and re-executing, by the first computer, the processing in which the failure has occurred; and when it is judged that the processing is to be switched over to the at least one second computer, executing, by the second computer, the processing in which the failure has occurred.

Type: Application

Filed: March 3, 2008

Publication date: May 28, 2009

Inventors: Hirokazu Matsumoto, Tsunehiko Baba, Shinji Hamada, Takashi Ichimura, Noriaki Takahashi
SELF-HEALING CACHE OPERATIONS

Publication number: 20080307268

Abstract: A method, apparatus and program storage device for performing a self-healing cache process is described. At least one error affecting a cache is detected. The cache may have a matching address tag for a fetching operation. Based on the type of error, a self-healing cache process is performed.

Type: Application

Filed: June 9, 2008

Publication date: December 11, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael J. Azevedo, Carol Spanel, Andrew D. Walls