Restarting or Rejuvenating (EPO) Patents (Class 714/E11.137)
-
Patent number: 12019520
Abstract: An electronic device and method with on-demand accelerator checkpointing are provided. In one general aspect, an electronic device includes a host processor, and an accelerator configured to operate according to instructions transmitted by the host processor to the accelerator, wherein a memory of the host processor and a memory of the accelerator are respectively checkpointed to a storage at respective different intervals, and in response to a determination that a failure has occurred in the host processor, the memory of the accelerator is checkpointed to the storage.
Type: Grant
Filed: November 21, 2022
Date of Patent: June 25, 2024
Assignee: Samsung Electronics Co., Ltd.
Inventors: Jaehyung Ahn, Seongbeom Kim, Byungwoo Bang, Uiseok Song, Junyeon Lee, Wooseok Chang, Hun Seong Choi
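The checkpointing policy in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Checkpointer` class, the step-based intervals, and the log format are all assumptions made for illustration. It only shows host and accelerator memory being checkpointed at different periods, with an extra accelerator checkpoint taken on demand when a host failure is reported.

```python
from dataclasses import dataclass, field


@dataclass
class Checkpointer:
    """Hypothetical model: intervals are counted in abstract 'steps'."""
    host_interval: int
    accel_interval: int
    log: list = field(default_factory=list)

    def tick(self, step: int, host_failed: bool = False) -> None:
        if host_failed:
            # Host failure: checkpoint accelerator memory immediately,
            # regardless of where it is in its own periodic schedule.
            self.log.append((step, "accelerator", "on-demand"))
            return
        if step % self.host_interval == 0:
            self.log.append((step, "host", "periodic"))
        if step % self.accel_interval == 0:
            self.log.append((step, "accelerator", "periodic"))


cp = Checkpointer(host_interval=2, accel_interval=5)
for step in range(1, 7):
    cp.tick(step, host_failed=(step == 3))
```

In this run the host is checkpointed at steps 2, 4, and 6, the accelerator periodically at step 5, and the accelerator once more on demand at step 3, when the simulated host failure occurs.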
-
Patent number: 9588798
Abstract: Provided is a software safe shutdown system, comprising: an OS anomalous shutdown detection unit, which detects that an operating system of a virtual machine which is executed by a computer system has had an anomalous shutdown due to a memory leak; a memory resource securing unit which secures, from among a usable memory resource within the computer system, a memory resource which is necessary to recover a console function of the anomalously shutdown virtual machine; and a console acquisition unit which allocates the secured memory resource to the anomalously shutdown virtual machine, and which, after recovering the console function, normally shuts down the virtual machine using the console function.
Type: Grant
Filed: August 26, 2013
Date of Patent: March 7, 2017
Assignee: NEC CORPORATION
Inventor: Fumio Machida
-
Publication number: 20140032962
Abstract: A method, apparatus, and computing system are provided for self-healing from a fault in an operating system (OS) without crashing the OS. In one example, the method includes detecting, isolating, and repairing a faulty kernel code module without affecting the run-time operation of other, properly functioning portions of the OS. In another example, the apparatus comprises a variety of modules, including, for instance, a kernel level fault detector and a kernel level fault manager, that are configured to execute steps of the method. In another example, the computing system includes an OS that has at least some modularized kernel code sections, and a self-healing code for detecting, isolating, and repairing failures occurring in one or more of the modularized kernel code sections without crashing the OS.
Type: Application
Filed: July 26, 2012
Publication date: January 30, 2014
Applicant: FUTUREWEI TECHNOLOGIES, INC.
Inventor: Bhalchandra Dattatray Deshpande
-
Publication number: 20130283091
Abstract: Methods, apparatus, and systems for electronic device recovery are disclosed. An example method includes determining that a software request received from a computing device includes an indication of a repair mode of an electronic device, determining a characteristic of the electronic device, determining software to be provided to the electronic device based on the characteristic, and in response to determining that the software request includes the indication of the repair mode, transmitting location information for the software to the computing device.
Type: Application
Filed: April 19, 2012
Publication date: October 24, 2013
Inventors: Spencer Leavitt George Quin, Andrey Feldman, Robert David Turner, Timothy Richard Tyhurst
-
Publication number: 20130024727
Abstract: Method for automatically reloading software characterized in that it comprises: a step of detecting corruption (E101) of at least one part of a software package of an on-board programmable device (10-1, 10-2, 10-n); and, in response to signaling, a step of reloading (E103) a non-corrupt version of the said at least one corrupt part of the software in order to replace the said at least one corrupt part of the software.
Type: Application
Filed: July 11, 2012
Publication date: January 24, 2013
Applicant: AIRBUS OPERATIONS (S.A.S.)
Inventors: Anne Frayssignes, Nicolas Caule
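The detect-then-reload sequence in this abstract can be sketched as follows. The abstract does not say how corruption is detected; this sketch assumes a SHA-256 digest comparison against a manifest, and the names `reload_if_corrupt`, `manifest`, and `fetch_clean` are hypothetical.

```python
import hashlib


def sha256(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def reload_if_corrupt(installed: dict, manifest: dict, fetch_clean) -> list:
    """Replace each installed part whose digest differs from the manifest.

    installed:   part name -> bytes currently on the device (mutated in place)
    manifest:    part name -> expected SHA-256 hex digest (assumed detection method)
    fetch_clean: callable returning a non-corrupt copy of a named part
    Returns the names of the parts that were reloaded.
    """
    reloaded = []
    for name, expected in manifest.items():
        if sha256(installed.get(name, b"")) != expected:
            installed[name] = fetch_clean(name)  # reload only the corrupt part
            reloaded.append(name)
    return reloaded


clean = {"nav": b"nav-v1", "fms": b"fms-v1"}
manifest = {name: sha256(data) for name, data in clean.items()}
installed = {"nav": b"nav-v1", "fms": b"fms-v1\x00corrupt"}
reloaded = reload_if_corrupt(installed, manifest, clean.__getitem__)
```

Only the part that fails verification is replaced; parts that pass the check are left untouched.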
-
Publication number: 20120331342
Abstract: A scalable and fault tolerant finite state machine engine, for example, for use in an automated incident management system, logs or records data in persistent storage at different points or levels during various internal processing of an event associated with an information technology element, and action taken associated with the event, by executing a finite state machine instance that encodes policies for handling incidents on such types of information technology elements. In the event that the finite state machine engine is shut down during processing, the finite state machine engine is able to pick up from where it left off when it was shut down, for each abnormally terminated finite state machine instance, by using the data logged in the persistent storage and determining a point of processing from where it should continue its execution.
Type: Application
Filed: September 5, 2012
Publication date: December 27, 2012
Applicant: International Business Machines Corporation
Inventors: Michael M. Behrendt, Rafah A. Hosn, Ruchi Mahindru, Harigovind V. Ramasamy, Soumitra Sarkar, Mahesh Viswanathan, Norbert G. Vogl
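The resume-from-log behaviour described in this abstract can be illustrated with a small Python sketch. Everything here is an assumption for illustration: the state names, the `TRANSITIONS` table, and the use of a plain dict (`store`) as a stand-in for persistent storage. The point is only that each state is logged before the engine moves on, so a restarted engine can continue from the last logged state.

```python
# Hypothetical incident-handling policy: each state maps to its successor.
TRANSITIONS = {
    "received": "diagnosed",
    "diagnosed": "remediated",
    "remediated": "closed",
}


def run_instance(instance_id, store, crash_after=None):
    """Advance one FSM instance, logging every state it reaches to `store`.

    `store` stands in for persistent storage. `crash_after` simulates an
    engine shutdown after that many transitions. Returns True when the
    instance reaches a final state, False if it was interrupted.
    """
    history = store.setdefault(instance_id, ["received"])
    steps = 0
    while history[-1] in TRANSITIONS:
        if crash_after is not None and steps == crash_after:
            return False  # simulated shutdown mid-processing
        history.append(TRANSITIONS[history[-1]])
        steps += 1
    return True


store = {}
first_run = run_instance("inc-1", store, crash_after=1)   # interrupted
logged_at_crash = list(store["inc-1"])
second_run = run_instance("inc-1", store)                 # resumes from the log
```

The second call does not start over: it reads the last logged state (`"diagnosed"`) and continues from there to `"closed"`.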
-
Publication number: 20120216075
Abstract: Provided are a method, system, and article of manufacture for system recovery. An operating system and a backup copy of the operating system are both maintained in a partition of a computational device. A boot loader receives an indication to load the backup copy of the operating system. The boot loader loads the backup copy of the operating system. The computational device is rebooted with the loaded backup copy of the operating system.
Type: Application
Filed: May 3, 2012
Publication date: August 23, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Christopher J. Stakutis, Kevin Michael Stearns, Jennifer Martin Shaw
-
Publication number: 20120159245
Abstract: Error handling is simplified for a self-virtualizing IO resource that utilizes a physical function adjunct partition for a physical function in the self-virtualizing IO resource to coordinate error recovery for the self-virtualizing IO resource, by restarting each virtual function adjunct partition associated with that physical function to avoid the need to coordinate error recovery within the logical partitions to which such virtual function adjunct partitions are assigned.
Type: Application
Filed: December 15, 2010
Publication date: June 21, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Sean T. Brownlow, Charles S. Graham, Andrew T. Koch, Adam C. Lange-Pearson, Kyle A. Lucke, Gregory M. Nordstrom, John R. Oberly, III
-
Publication number: 20120151257
Abstract: A system and method for producing a factory build of an embedded appliance is disclosed. The method comprises providing an embedded appliance having first and second digital storage locations, with the second location including a boot manager and a recovery operating system. A factory image of operational software is copied to the secondary digital storage location. The embedded appliance is configured to boot from the second digital storage location at an initial startup and load a recovery operating system to copy the factory image to the primary digital storage location to enable the embedded appliance to operate from the operational section using the operational software.
Type: Application
Filed: February 14, 2011
Publication date: June 14, 2012
Inventor: Terry Tam
-
Publication number: 20110231696
Abstract: Methods and systems for cluster resource management in virtualized computing environments are described. VM spares are used to reserve (or help discover or otherwise obtain) a set of computing resources for a VM. While VM spares may be used for a variety of scenarios, particular uses of VM spares include using spares to ensure resource availability for requests to power on VMs as well as for discovering, obtaining, and defragmenting the resources and VMs on a cluster, e.g., in response to requests to reserve resources for a VM or to respond to a notification of a failure for a given VM.
Type: Application
Filed: March 17, 2010
Publication date: September 22, 2011
Applicant: VMWARE, INC.
Inventors: Minwen JI, Elisha ZISKIND, Anne Marie HOLLER
-
Publication number: 20110087921
Abstract: A reproducing apparatus 100 that reproduces content recorded in a recording medium 101 avoids reentering the same fault state and becoming unrecoverable at the time of occurrence of an unrecoverable CPU error, such as misalignment. Fault information is stored in nonvolatile memory, or the like, and the thus-stored fault information is used to cause reproduction control processing to branch before execution of a processing routine to be invoked at the time of occurrence of an unrecoverable CPU error, such as misalignment, thereby preventing the reproducing apparatus from reentering the same fault state.
Type: Application
Filed: December 6, 2010
Publication date: April 14, 2011
Applicant: PANASONIC CORPORATION
Inventors: Kenichi TACHIBANA, Hisashi Fukugawa, Hiroko Mori, Michimasa Okubo, Shinya Taira
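The idea of recording fault information so that the same faulting routine is bypassed on the next attempt can be sketched in Python. This is a loose illustration, not the apparatus described: a Python exception stands in for an unrecoverable CPU error, a set (`nvram`) stands in for nonvolatile memory, and the function and routine names are hypothetical.

```python
def reproduce(content_id, routines, nvram):
    """Run (name, routine) pairs in order, skipping any routine that is
    recorded in `nvram` as having faulted for this content before.

    Returns (names of routines executed, True if the run completed).
    """
    executed = []
    for name, routine in routines:
        key = (content_id, name)
        if key in nvram:
            continue  # branch around the routine that faulted last time
        try:
            routine()
        except Exception:
            nvram.add(key)  # persist the fault information, then stop
            return executed, False
        executed.append(name)
    return executed, True


def ok():
    pass


def misaligned():
    raise RuntimeError("simulated unrecoverable error")


nvram = set()
routines = [("menu", ok), ("bonus", misaligned), ("main", ok)]
first_attempt = reproduce("disc-1", routines, nvram)
second_attempt = reproduce("disc-1", routines, nvram)
```

The first attempt stops at the faulting routine and records it; the second attempt skips that routine and completes instead of hitting the same fault again.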
-
Publication number: 20110016350
Abstract: A method for commanding and performing network entry is disclosed. The method for commanding network entry using a non-periodic message in a Base Station (BS) includes broadcasting a first pattern indicating ready for restart once or more times to a Mobile Station (MS), when the BS determines to restart due to a serious error of the BS, and performing a restart procedure, and broadcasting a second pattern indicating network entry to the MS, upon completion of the restart procedure.
Type: Application
Filed: April 2, 2009
Publication date: January 20, 2011
Inventors: Hee Jeong Cho, Ki Seon Ryu
-
Publication number: 20100325471
Abstract: A computer implemented method, a tangible computer storage medium, and a data processing system provide high availability support for virtual machines in a logical partitioned platform. A monitoring system detects a failure in the virtual machine. Partition management firmware then restarts the virtual machine in a consistency failover image node utilizing a consistency failover image. If a subsequent failure of the virtual machine is detected within a predetermined time, partition management firmware restarts the virtual machine in a boot failover image node utilizing a boot failover image.
Type: Application
Filed: June 17, 2009
Publication date: December 23, 2010
Applicant: International Business Machines Corporation
Inventors: Rajeev Mishra, Lance Warren Russell, Ravi A. Shankar
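The two-level escalation in this abstract reduces to a small decision rule, sketched here in Python. The function name, the numeric timestamps, and the choice to measure the predetermined time from the previous failure are assumptions; the abstract does not specify the reference point.

```python
def choose_failover_image(failure_time, previous_failure_time, window):
    """Pick the image to restart a failed VM from.

    A first failure (or one long after the previous failure) restarts from
    the consistency failover image. A repeat failure within `window` of the
    previous one escalates to the boot failover image.
    """
    if (previous_failure_time is not None
            and failure_time - previous_failure_time <= window):
        return "boot"
    return "consistency"


first = choose_failover_image(100, None, window=60)
repeat_soon = choose_failover_image(130, 100, window=60)
repeat_late = choose_failover_image(500, 100, window=60)
```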
-
Publication number: 20100088543
Abstract: The active server A101 notifies its own operating status to the administration server 106, and the administration server 106 acquires a snapshot and copies a disk according to the policy established by a user. When the active server A101 or the active disk 113 fails, the administration server 106 chooses a backup server A104 and a status storage disk A114 to be booted and restarts them by using the snapshot according to the policy.
Type: Application
Filed: December 9, 2009
Publication date: April 8, 2010
Applicant: Hitachi, Ltd.
Inventors: Yasuyuki KAMBARA, Yoshifumi Takamoto, Keisuke Hatasaki, Kouji Masuda
-
Publication number: 20090259886
Abstract: A process for restoring an operational state of a portable handheld device is provided. The device may include multiple computing units and persistent storage. The operational state may be generated by a sequence of events. The operational state may receive signals corresponding to a plurality of event types. The process may include selecting an event type for storage, storing in the persistent storage events corresponding to the selected event type, receiving a signal indicating an interruption of operation, and transmitting the stored events to restore the device to the operational state.
Type: Application
Filed: April 15, 2008
Publication date: October 15, 2009
Applicant: Broadcom Corporation
Inventors: Li Li, Jin Guo
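The select, store, and replay steps in this abstract can be sketched in Python. The event shapes, the `apply_event` rule, and the use of a list as persistent storage are illustrative assumptions, not details from the application.

```python
def store_selected(events, selected_types):
    """Keep only events whose type was selected for persistence."""
    return [(kind, payload) for kind, payload in events if kind in selected_types]


def apply_event(state, event):
    """Hypothetical state update: the latest payload per event type wins."""
    kind, payload = event
    return {**state, kind: payload}


def restore(stored_events, initial_state):
    """Replay the stored events in order to rebuild the operational state."""
    state = initial_state
    for event in stored_events:
        state = apply_event(state, event)
    return state


events = [("volume", 7), ("keypress", "a"), ("volume", 3), ("track", 12)]
stored = store_selected(events, selected_types={"volume", "track"})
restored_state = restore(stored, initial_state={})
```

Transient events such as key presses are never written to storage, so the replay after an interruption rebuilds only the state that the selected event types determine.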
-
Publication number: 20090254773
Abstract: A method for running a computer program on computing hardware, in particular on a microprocessor, is described, the computer program including multiple program objects and errors being detected in the method while running the computer program on the computing hardware, the program objects being subdivided into at least two classes and program objects of the first class being repeated when an error is detected and, when an error is detected in one program object of the first class, which has already been sent for execution, this program object of the first class being restarted instead of a program object of the second class.
Type: Application
Filed: October 19, 2005
Publication date: October 8, 2009
Inventors: Reinhard Weiberle, Bernd Mueller, Werner Harter, Ralf Angerbauer, Thomas Kottke, Yorck von Collani, Rainer Gmehlich
-
Publication number: 20090138757
Abstract: There is provided a method executed in a cluster system comprising a first computer and at least one second computer that stores the data transmitted from the first computer. The method comprises the steps of: judging, based on the system information, whether the processing in which the failure has occurred is restarted by the first computer or switched over to the second computer; receiving the stored data from the second computer and re-executing the processing in which the failure has occurred by the first computer, in the case where it is judged that the processing in which the failure has occurred is restarted by the first computer; and executing, by the second computer, the processing in which the failure has occurred, in the case where it is judged that the processing in which the failure has occurred is switched over to the at least one second computer.
Type: Application
Filed: March 3, 2008
Publication date: May 28, 2009
Inventors: Hirokazu Matsumoto, Tsunehiko Baba, Shinji Hamada, Takashi Ichimura, Noriaki Takahashi
-
Publication number: 20080307268
Abstract: A method, apparatus and program storage device for performing a self-healing cache process is described. At least one error affecting a cache is detected. The cache may have a matching address tag for a fetching operation. Based on the type of error, a self-healing cache process is performed.
Type: Application
Filed: June 9, 2008
Publication date: December 11, 2008
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Michael J. Azevedo, Carol Spanel, Andrew D. Walls