By Masking Or Reconfiguration Patents (Class 714/3)

Of network (Class 714/4.1)

Of peripheral subsystem (Class 714/5.1)

Access processor affected (e.g., I/O processor, MMU, or DMA processor, etc.) (Class 714/5.11)

Of memory (Class 714/6.1)

Within single memory device (e.g., disk, etc.) (Class 714/6.11)

Plurality of memory devices (e.g., array, etc.) (Class 714/6.2)

Backup or standby (e.g., failover, etc.) (Class 714/6.3)

Remote repair (Class 714/6.31)

Replacement of failed memory device (Class 714/6.32)

Of processor (Class 714/10)

Concurrent, redundantly operating processors (Class 714/11)

Synchronization maintenance of processors (Class 714/12)

Prepared backup processor (e.g., initializing cold backup) or updating backup processor (e.g., by checkpoint message) (Class 714/13)

Of power supply (Class 714/14)

APPARATUS AND METHOD TO MINIMIZE PERFORMANCE DEGRADATION DURING COMMUNICATION PATH FAILURE IN A DATA PROCESSING SYSTEM

Publication number: 20090319822

Abstract: A method to minimize performance degradation during communication path failure in a data processing system, comprising a host computer, a storage controller, and a plurality of physical communication paths in communication with the host computer and the storage controller, where the method establishes a. threshold communication path error rate, and determines an (i)th actual communication path error rate for an (i)th physical communication path, wherein that (i)th communication path is one of the plurality of physical communication paths. If the (i)th actual communication path error rate is greater than the threshold communication path error rate, the method discontinues use of the (i)th physical communication path.

Type: Application

Filed: June 18, 2008

Publication date: December 24, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Juan Alonso Coronado, Roger Gregory Hathorn, Bret Wayne Holley, Clarisa Valencia
RUN-TIME FAULT RESOLUTION FROM DEVELOPMENT-TIME FAULT AND FAULT RESOLUTION PATH IDENTIFICATION

Publication number: 20090319823

Abstract: Embodiments of the present invention address deficiencies of the art in respect to fault handling and provide a method, system and computer program product for run-time fault resolution from development time fault and fault resolution path identification. In an embodiment of the invention, a method for run-time fault resolution from development time fault and fault resolution path identification can be provided. The method can include detecting a recoverable fault condition in a computing system, selecting a fault resolution path from amongst a multiple development time specified fault resolution paths to match the recoverable fault condition, prompting an operator with the selected fault resolution path, and resuming operation of the computing system without restart subsequent to the operator performing the selected resolution fault path.

Type: Application

Filed: June 20, 2008

Publication date: December 24, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Mark C. Hampton
Failover Enabled Telemetry Systems

Publication number: 20090313497

Abstract: The present invention discloses several techniques for providing failover in telemetry systems. The invention allows the continuous and uninterrupted connection between gathering units and a central data collection server, thereby ensuring the proper operation of telemetry systems.

Type: Application

Filed: August 5, 2009

Publication date: December 17, 2009

Applicant: LAROTEC LTD.

Inventors: Uzi Lumbroso, Andrey Nekhamkin
Method and apparatus for analyzing serial data streams

Patent number: 7634693

Abstract: An apparatus and method for processing a data signal is provided. An acquisition unit of a test instrument acquires a data signal for a predetermined time. The data signal is stored in a memory of the test instrument and a clock recovery unit recovers a clock signal from the stored data signal. The stored data signal is sliced by a processor into a plurality of data segments of a predetermined length in accordance with the recovered clock signal.

Type: Grant

Filed: September 22, 2008

Date of Patent: December 15, 2009

Assignee: LeCroy Corporation

Inventors: Martin Miller, Yaron Habot, Joseph Schachner, Michael Schnecker, Peter J. Pupalaikis
Failure recovery method

Patent number: 7634681

Abstract: The reliability is improved at a low cost even in a virtualized server environment. The number of spare servers is reduced for improving the reliability and for saving a licensing fee for software on the spare servers. A server system comprises a plurality of physical servers on which a plurality of virtual servers run, a single standby server, a module for detecting an active virtual server, and a module for switching the correspondence of boot disks of virtualization modules for controlling virtual servers to the physical servers. When a physical server fails, the boot disk of the associated virtualization module is connected to a spare server to automatically activate on the spare server those virtual servers which have been active upon occurrence of the failure.

Type: Grant

Filed: January 26, 2006

Date of Patent: December 15, 2009

Assignee: Hitachi, Ltd.

Inventors: Yoshifumi Takamoto, Takao Nakajima, Toshihiko Kashiyama
Managing Execution Stability Of An Application Carried Out Using A Plurality Of Pluggable Processing Components

Publication number: 20090300404

Abstract: Methods, apparatus, and products are disclosed for managing execution stability of an application carried out using a plurality of pluggable processing components. Managing execution stability of an application includes: receiving, by an application manager, component stability metrics for a particular pluggable processing component; determining, by the application manager, that the particular pluggable processing component is unstable in dependence upon the component stability metrics for the particular pluggable processing component; and notifying, by the application manager, a system administrator that the particular pluggable processing component is unstable.

Type: Application

Filed: May 29, 2008

Publication date: December 3, 2009

Applicant: INTERNATINAL BUSINESS MACHINES CORPORATION

Inventors: Michael J. Branson, Zachary A. Garbow, John M. Santosuosso
BACKUP COORDINATOR FOR DISTRIBUTED TRANSACTIONS

Publication number: 20090300405

Abstract: A primary coordinator generates a prepare message for a two-phase commit distributed transaction, the prepare message including an address of a backup coordinator. The primary coordinator maintains a transaction log of the distributed transaction, wherein the transaction log is accessible to both the primary coordinator and the backup coordinator. The prepare message is sent to a plurality of participants. The primary coordinator fails over to the backup coordinator without interrupting the distributed transaction.

Type: Application

Filed: May 29, 2008

Publication date: December 3, 2009

Inventor: Mark Cameron Little
INFORMATION PROCESSING SYSTEM AND INFORMATION PROCESSING DEVICE

Publication number: 20090300406

Abstract: An information processing system includes a plurality of server devices including a main server device and at a standby server device, and a client device coupled to said server devices via a network. The client device includes a monitor unit to asynchronously monitor an operation state of each of the plurality of server devices, and a display control unit to acquire a content from the main server device and display the content in a display area on a screen once the monitor unit detects an operation state of the main server device is active, and to acquire from the standby server device a content for a process that the standby server device has taken over from the main server device and displays the content on the screen once the monitor unit detects an operation state of the standby server device is switched from standby state to active state.

Type: Application

Filed: May 30, 2009

Publication date: December 3, 2009

Applicant: FUJITSU LIMITED

Inventor: Yousuke KATOU
STORAGE SYSTEM

Publication number: 20090287953

Abstract: A storage system encrypts plain text from an external device and stores the cryptogram into a disk unit, decrypts stored data in the disk unit and transmits decrypted text to the external device. The plain and decrypted text must be in agreement when seen from the external device. If a failure occurs in the encrypting or decrypting process, the plain and decrypted text disagree. The storage system includes an encryption unit for encrypting first data, a decryption unit for decrypting the encrypted data into second data, and a comparison unit for comparing the first and second data. When the first and second data do not agree, the first data is encrypted by a different encryption unit and the encrypted data is decrypted into third data, whereupon the first and third data are compared. When the first and third data do not agree, a failure report is sent.

Type: Application

Filed: July 22, 2009

Publication date: November 19, 2009

Inventors: Makio MIZUNO, Kazuhisa Fujimoto
STORAGE DEVICE AND RECOVERY METHOD

Publication number: 20090276656

Abstract: A storage device including a plurality of storage units for storing data dispersively among the storage units, includes: a processor for controlling boot-up of the storage units; and a memory for storing operation history indicative of the sequence of any failure causing any of the storage units to become inoperative, the processor controlling reboot-up of the storage units, when a plurality of the storage units becomes inoperative on account of a plurality of failures, in accordance with process including: determining the order of the reboot up of the storage units that is reversal of the sequence of the failures causing the storage units to become inoperative in reference to the operation history in the memory; rebooting the inoperative storage units successively in accordance with the determined order.

Type: Application

Filed: July 10, 2009

Publication date: November 5, 2009

Inventor: Hiroshi Sakurai
Method of managing paths for an externally-connected storage system and method of detecting a fault site

Publication number: 20090265577

Abstract: Provided is a method of controlling a computer system that includes: a computer; a first storage device connected to the computer via a first path and a second path; and a second storage device externally-connected to the first storage system via a third path and connected to the computer via a fourth path, the first storage device providing a first storage area to the computer, the second storage device including a second storage area corresponding to the first storage area, the method including: judging whether or not a fault has occurred in at least one of the first to fourth paths; selecting, a path used for access to the first or second storage area; and transmitting the access request for the first or second storage area by using the selected path. Accordingly, in the computer system, an application can be prevented from being stopped despite a fault in a path.

Type: Application

Filed: June 23, 2008

Publication date: October 22, 2009

Inventors: Naoki Haramai, Hiroshi Yokouchi, Ryu Gemba, Atsushi Kondo, Kazuhiro Oyama
Systems and methods for CPU repair

Patent number: 7607038

Abstract: In one embodiment, a method for repairing a faulty cache element is provided. Once a monitored cache element is determined to be faulty, the system stores the repair information, and cache configuration in an EEPROM or non-volatile memory on the CPU module. Then the computer is rebooted. During the reboot, the faulty cache element is repaired by being swapped out for a spare cache element based on the information stored in the EEPROM or the non-volatile memory.

Type: Grant

Filed: February 17, 2006

Date of Patent: October 20, 2009

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Jeff Barlow, Jeff Brauch, Howard Calkin, Raymond Gratias, Stephen Hack, Lacey Joyal, Guy Kuntz, Ken Pomaranski, Michael Sedmak
Recovering from a failed I/O controller in an information handling system

Patent number: 7600157

Abstract: An information handling system includes first and second input/output (I/O) controllers, a detector for detecting an I/O controller failure, and an I/O recovery unit. The first I/O controller adaptively controls a first and a second I/O slot. The second I/O controller adaptively controls a third and a fourth I/O slot. Lastly, the I/O recovery unit, responsive to a detected I/O controller failure, operatively couples/decouples the first and second I/O slot to/from the first I/O controller, operatively couples/decouples the third and fourth I/O slot to/from the second I/O controller, and operatively decouples/couples the first and second I/O slot from/to the third and fourth I/O slot according to an I/O failure recovery protocol, the I/O failure recovery protocol provided for adapting one of either the first and second I/O controllers to operatively couple to the first, second, third and fourth I/O slots as a function of the detected I/O controller failure.

Type: Grant

Filed: October 14, 2008

Date of Patent: October 6, 2009

Assignee: Dell Products L.P.

Inventors: Martin McAfee, Bharath Vasudevan
System and method for application monitoring and automatic disaster recovery for high-availability

Patent number: 7600146

Abstract: Stable and automated recovery of an application executing on a primary computer system by transitioning to secondary computer system is provided. Intelligent agents installed on a primary client server allow the primary environment to be replicated on a host, secondary environment. This creates continuous availability of applications executing on the primary system via the mirrored host environment. Every transaction on the client environment (e.g., every piece of data) is replicated in real-time on the host environment. A simple mechanism (e.g., a button) is provided to allow access to the secondary environment thereby switching over from the client environment to the host environment in the case of, with respect to the primary environment, a disaster, a need to perform maintenance, system failure and the like.

Type: Grant

Filed: April 3, 2007

Date of Patent: October 6, 2009

Assignee: Dell Marketing USA L.P.

Inventors: John Liccione, Kelly M. Jones
Dual access pathways to serially-connected mass data storage units

Patent number: 7594134

Abstract: A group of data storage units are serially connected in a sequential data communication path to communicate read and write operations to first and second interfaces of each data storage unit in the group. A data management computer device (“filer”) manages read and write operations of the data storage units of the group through an adapter of the filer. Main and redundant primary communication pathway connectors extend from the filer to the interfaces of the data storage unit, thereby establishing redundancy through multiple pathways to communicate the read and write operations to the data storage units of the group. Main and redundant secondary communication pathway connectors extend from partner filers to the groups of data storage units associated with each partner filer, thereby further enhancing redundancy.

Type: Grant

Filed: August 14, 2006

Date of Patent: September 22, 2009

Assignee: Network Appliance, Inc.

Inventors: Sue Coatney, George Kong, Doug Coatney
INPUT/OUTPUT CONTROL METHOD, INFORMATION PROCESSING APPARATUS, COMPUTER READABLE RECORDING MEDIUM

Publication number: 20090235110

Abstract: An input/output control method for an information processing apparatus that is connected to an input/output device through first and second paths, monitors an input/output response to an input/output request issued to the input/output device through the first path, and performs a timeout process when the input/output response is not present within a timeout time. The input/output control method includes predicting a timeout time to the input/output request on the basis of statistic information that the information processing apparatus obtains by monitoring the input/output response, detecting an error on the first path when an input/output response to the input/output request is not present within the predicted timeout time and disconnecting the first path when the error on the first path is detected.

Type: Application

Filed: March 16, 2009

Publication date: September 17, 2009

Applicant: FUJITSU LIMITED

Inventor: Kazushige KUROKAWA
SELF MAINTAINED COMPUTER SYSTEM UTILIZING ROBOTICS

Publication number: 20090222686

Abstract: A self-maintained computer system includes a computer system having a plurality of interconnected computer components and a robot associated with the computer system that is configured to carry a spare computer component and further configured to replace a computer component of the computer system with the spare computer component. The robot automatically replaces an individual computer component when a failure of the individual computer component is detected.

Type: Application

Filed: March 3, 2008

Publication date: September 3, 2009

Applicant: SUN MICROSYSTEMS, INC.

Inventors: John P. Nibarger, Kevin D. McKinstry
Method and system for troubleshooting a misconfiguration of a computer system based on configurations of other computer systems

Patent number: 7584382

Abstract: A method and system for identifying a configuration parameter of a “sick” computer system that is at fault for causing an undesired behavior based on analysis of configuration parameters from other computer systems is provided. In one embodiment, a troubleshooting system collects “suspect” values for “suspect” configuration parameters used by a “sick” application when the undesired behavior was exhibited by the sick computer system. The troubleshooting system then compares the suspect values to sample values of the suspect configuration parameters retrieved from sample computer systems. The troubleshooting system uses that comparison to identify one or more suspect configuration parameters that are likely at fault for causing the application to exhibit the undesired behavior.

Type: Grant

Filed: August 13, 2004

Date of Patent: September 1, 2009

Assignee: Microsoft Corporation

Inventors: Chad E. Verbowski, Jiahe Helen Wang, John C. Platt, Ruyun Zhang, Yu Chen
METHOD, SYSTEM AND PROGRAM PRODUCT FOR NON-DISRUPTIVE I/O ADAPTER DIAGNOSTIC TESTING

Publication number: 20090210751

Abstract: A primary I/O adapter and a redundant I/O adapter of a data processing system are assigned to support access to a system resource. While the primary I/O adapter is in service and the redundant I/O adapter is not in service in providing access to the system resource, a fail over command is issued to remove the primary I/O adapter from service and place the redundant I/O adapter in service in supporting access to the system resource. While the redundant I/O adapter is in service and the primary I/O adapter is not in service in providing access to the system resource, diagnostic testing on the primary I/O adapter is performed. In response to the diagnostic testing revealing no fault in the primary I/O adapter, a fail back command is issued to restore the primary I/O adapter to service and to remove the redundant I/O adapter from service.

Type: Application

Filed: February 14, 2008

Publication date: August 20, 2009

Inventors: Rafael G. Cabezas, David D. Galvin, Binh K. Hua, Sivarama K. Kodukula
Method and system for controlling command execution

Patent number: 7577870

Abstract: Method and system for performing a task in a control system is provided. The method includes, transmitting a command to a governing agent; re-transmitting the command from the governing agent to a primary, a secondary and an observer agent; validating the task information; and performing the task wherein, the task is executed by the primary agent while the secondary agent monitors task execution and if the primary agent fails to execute the task and/or a threshold value is exceeded, the secondary agent performs the task to the extent the secondary agent perceives that the task is not being performed and to the extent that it has sufficient capacity. The system includes an input module that transmits commands for executing a task to the governing agent that is operationally coupled to a processing unit.

Type: Grant

Filed: December 21, 2005

Date of Patent: August 18, 2009

Assignee: The Boeing Company

Inventor: William E. Combs
Control method of computer, program, and virtual computer system

Patent number: 7577864

Abstract: Provided is a technology for increasing reliability of communication carried out by OSes and application programs operating on logical partitions set on a computer. The computer has multiple logical partitions constructed therein by a control program, the physical interfaces are shared by virtual interfaces respectively set for the multiple logical partitions, and the memory module stores management information indicating correspondences between the physical interface and the virtual interface.

Type: Grant

Filed: September 1, 2006

Date of Patent: August 18, 2009

Assignee: Hitachi, Ltd.

Inventors: Kazuhide Horimoto, Toshiomi Moriki, Yuji Tsushima, Takuichi Hoshina
Memory management method, medium, and apparatus based on access time in multi-core system

Publication number: 20090193287

Abstract: A memory management method and apparatus based on an access time in a multi-core system. In the memory management method of the multi-core system, it is easy to estimate the execution time of a task to be performed by a processing core and it is possible to secure the same memory access time when a task is migrated between processing cores by setting a memory allocation order according to distances from the processing cores to the memories in correspondence with the processing cores, translating a logical address to be processed by one of the processing cores according to the set memory allocation order into a physical address of one of the memories, and allocating a memory corresponding to the translated physical address to the processing core.

Type: Application

Filed: July 2, 2008

Publication date: July 30, 2009

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jae-yong Jeong
Procedure for dynamic reconfiguration of resources of logical partitions

Patent number: 7565398

Abstract: A method, system and apparatus for dynamically reconfiguring partitions of a partitioned computer system are provided. Here, dynamically reconfiguring a partition is defined as being able to reconfigure the partition while the partition is operational. Using a graphical user interface (GUI), a user may indicate a plurality of resources that are to be involved in a partition reconfiguration. However, the partition is actually reconfigured one resource at a time. If one of the resources is memory, then memory is allocated to the partition or de-allocated from the partition one logical memory block (LMB) at a time. In reconfiguring partitions, resources may be removed, added or moved from one partition to another. Moving resources from one partition to another entails removing the resources from one partition and adding it to the other partition. After a partition re-configuration, a resource allocation table is updated. All this is done transparently to the user.

Type: Grant

Filed: June 27, 2002

Date of Patent: July 21, 2009

Assignee: International Business Machines Corporation

Inventors: Shamsundar Ashok, Robert Kimberlin Foster, Walter Manfred Lipp, Minh Nguyen
Providing independent clock failover for scalable blade servers

Patent number: 7562247

Abstract: Methods and systems are disclosed for providing independent clock failover for scalable blade servers that include assigning a server blade to one of a plurality of clock failover groups, providing a plurality of independent clock signals to the clock generator of the server blade, wherein one of the plurality of independent clock signals is an active clock signal, detecting a failover condition for the clock failover group assigned to the server blade, and switching the active clock signal, in response to the detected failover condition, from one independent clock signal to another independent clock signal.

Type: Grant

Filed: May 16, 2006

Date of Patent: July 14, 2009

Assignee: International Business Machines Corporation

Inventors: Marcus A. Baker, Justin P. Bandholz, Patrick M. Bland, Andrew S. Heinzmann
RECONFIGURABLE CIRCUIT WITH REDUNDANT RECONFIGURABLE CLUSTER(S)

Publication number: 20090177912

Abstract: A reconfigurable circuit having redundant reconfigurable clusters is described herein.

Type: Application

Filed: December 9, 2008

Publication date: July 9, 2009

Applicant: M2000

Inventor: Frederic Reblewski
APPARATUS, SYSTEM, AND METHOD TO PREVENT QUEUE STALLING

Publication number: 20090177911

Abstract: An apparatus, system, and method are disclosed to prevent queue stalling. The apparatus to prevent queue stalling is provided with a plurality of modules configured to functionally execute the necessary steps of detecting a connection failure on a first logical path, wherein the first logical path is associated with a first entry in a queue, and wherein the first logical path is configured to define a communication path between an entity associated with a first entry in the queue and a queue manager, scanning the queue to identify a second entry associated with a second logical path in response to the connection failure, and advancing the second entry to a position within the queue that is ahead of the first entry. These modules in the described embodiments include a detection module, a scanning module, and an advancing module.

Type: Application

Filed: January 3, 2008

Publication date: July 9, 2009

Inventors: Brian Dow Clark, Juan Alonso Coronado, Dinh Hai Le, Beth Ann Peterson, Clarisa Valencia
Reducing the soft error vulnerability of stored data

Patent number: 7558992

Abstract: Embodiments of apparatuses and methods for reducing the soft error vulnerability of stored data are disclosed. In one embodiment, an apparatus includes storage logic, determination logic, and selection logic. The determination logic is to determine a condition of a dataword. The storage logic includes logic to store a first portion of the dataword, a second portion of the dataword, and a result generated by the determination logic. The selection logic is to select, based on the contents of the storage logic to store the result, either the contents of the storage logic to store the second portion of the dataword, or a replacement value. The replacement value depends on the contents of a predetermined bit of the storage logic to store the first portion of the dataword.

Type: Grant

Filed: October 10, 2005

Date of Patent: July 7, 2009

Assignee: Intel Corporation

Inventors: Oguz Ergin, Osman Unsal, Xavier Vera, Antonio González
Systems and methods for implementing reliability, availability and serviceability in a computer system

Patent number: 7555671

Abstract: Embodiments include systems and methods for processing Reliability, Availability and Serviceability (RAS) events in a computer system. Embodiments comprise processing critical events in a first portion of a Management Interrupt (MI) period. The MI period is chosen to be not greater than a maximum tolerable Operating System (OS) latency period. If time remains in a current MI period after processing critical events, the system then processes non-critical events during the time remaining in the current MI period. If at the end of the current MI period, some non-critical events remain to be processed, a subsequent MI period is scheduled to process the remaining non-critical events.

Type: Grant

Filed: August 31, 2006

Date of Patent: June 30, 2009

Assignee: Intel Corporation

Inventors: Murugasamy Nachimuthu, Singaravelan Nallasellan, Mohan J. Kumar
IMAGE FORMING APPARATUS MONITORING SYSTEM AND METHOD THEREFOR

Publication number: 20090161143

Abstract: A monitoring server that monitors an image forming apparatus and detects an error if any has occurred thereon, identifies a type of firmware that has caused the error, and issues an instruction for replacing a version of the identified type of firmware. The image forming apparatus issues, to a transmission server, a request for transmitting a replacement version of the identified type of firmware. The transmission server refers to a firmware version list table to determine another type of firmware whose version is to be replaced in association with the replacement version of the identified type of firmware. The transmission server transmits the replacement version of the identified type of firmware and the determined another type of firmware to the image forming apparatus. The image forming apparatus receives each firmware transmitted from the transmission server and applies the received firmware to the image forming apparatus.

Type: Application

Filed: December 12, 2008

Publication date: June 25, 2009

Applicant: CANON KABUSHIKI KAISHA

Inventor: Hisashi Nakamoto
Failover Of Blade Servers In A Data Center

Publication number: 20090158081

Abstract: Failover of blade servers in a data center including powering off a failing blade server by a system management server through a blade server management module (‘BSMM’) managing the failing blade server, the failing blade server characterized by a machine type, one or more network addresses, and one or more storage addresses, the addresses being virtual addresses; identifying, by the system management server from a pool of standby blade servers, a replacement blade server, the replacement blade server managed by a BSMM; assigning, by the system management server through the BSMM managing the replacement blade server, the one or more network addresses and the one or more storage addresses of the failing blade server to the replacement blade server, including enabling in the replacement blade server the assigned addresses; and powering on the replacement blade server by the system management server through the BSMM managing the replacement blade server.

Type: Application

Filed: December 13, 2007

Publication date: June 18, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Gregory W. Dake, Eric R. Kern, Andrew B. McNeill, JR., Martin J. Tross, Theodore B. Vojnovich, Ben-Ami Yassour
Fail over method through disk take over and computer system having fail over function

Patent number: 7549076

Abstract: When a primary server executing a task fails in a computer system where a plurality of servers are connected to an external disk device via a network and the servers boot an operation system from the external disk device, task processing is taken over from the primary server to a server that is not executing a task in accordance with the following method. The method for taking over a task includes the steps of detecting that the primary server fails; searching the computer system for a server that has the same hardware configuration as that of the primary server and that is not running a task; enabling the server, searched for as a result of the search, to access the external disk device; and booting the server from the external disk device.

Type: Grant

Filed: January 13, 2005

Date of Patent: June 16, 2009

Assignee: Hitachi, Ltd.

Inventors: Keisuke Hatasaki, Takao Nakajima
DELIVERY OF STREAMS TO REPAIR ERRORED MEDIA STREAMS IN PERIODS OF INSUFFICIENT RESOURCES

Publication number: 20090150715

Abstract: In one embodiment, a method includes ingesting a program stream from a program source on a first channel. The method also includes storing the program stream, and receiving notification from a client of unrecoverable error in a stream received at the client. The unrecoverable error corresponds to at least a portion of the stored program stream. The method also includes distributing the corresponding portion of the stored program stream to the client on a second channel in response to the notification.

Type: Application

Filed: December 6, 2007

Publication date: June 11, 2009

Inventors: John Pickens, William C. VerSteeg
Methods and Apparatus for Handling Errors Involving Virtual Machines

Publication number: 20090144579

Abstract: A virtual machine monitor (VMM) in a data processing system handles errors involving virtual machines (VMs) in the processing system. For instance, an error manager in the VMM may detect an uncorrectable error in involving a component associated with a first VM in the processing system. In response to detection of that error, the error manager may terminate the first VM, while allowing a second VM in the processing system to continue operating. In one embodiment, the error manager automatically determines which VM is affected by the uncorrectable error, in response to detecting the uncorrectable error. The error manager may also automatically spawn a new VM to replace the first VM, if the processing system has sufficient resources to support the new VM. Other embodiments are described and claimed.

Type: Application

Filed: December 4, 2007

Publication date: June 4, 2009

Inventor: ROBERT C. SWANSON
Method for managing cluster node-specific quorum roles

Patent number: 7543046

Abstract: A method for managing node-specific quorum roles in a cluster of nodes. The method comprises assigning a configuration change of node role attributes to a node in the cluster, with each attribute being represented by a bit of a binary value. The binary value corresponding to the specified node role attributes is converted to an integer value and the integer value is stored in a node record associated with the node. The node record is accessed to isolate each bit of the integer value associated with the node and the node role attributes assigned to the node are identified from each of the isolated bits of the integer value associated with the node. A determination of whether the configuration change can be safely supported by a configuration quorum of nodes required to represent the cluster's configuration accurately is made based on the identified node role attributes.

Type: Grant

Filed: May 30, 2008

Date of Patent: June 2, 2009

Assignee: International Business Machines Corporation

Inventors: Myung M. Bae, Steven R. Champagne, Robert K. Gardner, Felipe Knop, Johannes M. Sayre
REDUNDANT 3-WIRE COMMUNICATION SYSTEM AND METHOD

Publication number: 20090138750

Abstract: A redundant communication system and method for providing data communication between a first computing node and a second computing node. A transmitter is provided as part of the first computing node. A receiver is provided as part of the second computing node. A first signal line carries a first data signal. The first signal line electrically couples the transmitter with the receiver. A second signal line carries a second data signal redundant to the first signal. The second signal line electrically couples the transmitter with the receiver. The receiver evaluates the first data signal to determine the presence of an error and the second node uses the second data signal if an error is detected in the first data signal.

Type: Application

Filed: January 29, 2009

Publication date: May 28, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Alfredo Aldereguia, Grace Ann Richter, Jeffrey B. Williams
SYSTEM AND METHOD FOR RECOVERING FROM TRANSIENT FAULTS IN AN IMPLANTABLE MEDICAL DEVICE

Publication number: 20090132061

Abstract: A system and method is disclosed for system fault recovery by an implantable medical device which employs a global fault response. The system enables the device to consistently recover from transient faults while maintaining a history of the reason for the device fault. Upon detection of a fault, the primary controller of the device signals a reset controller which then issues a reset command. All sub-systems of the primary device controller are then reset together rather than resetting individual sub-systems independently to ensure deterministic behavior.

Type: Application

Filed: January 26, 2009

Publication date: May 21, 2009

Applicant: Cardiac Pacemakers, Inc.

Inventors: Scott Stubbs, Conrad L. Sowder, William J. Linder, Lynn S. Elliott, Kenneth P. Hoyme, Hiten J. Doshi
Method and Computer Program for Selecting Circuit Repairs Using Redundant Elements with Consideration of Aging Effects

Publication number: 20090132849

Abstract: A method and computer program for selecting circuit repairs using redundant elements with consideration of aging effects provides a mechanism for raising short-term and long-term performance of memory arrays beyond present levels/yields. Available redundant elements are used as replacements for selected elements in the array. The elements for replacement are selected by BOL (beginning-of-life) testing at a selected operating point that maximizes the end-of-life (EOL) yield distribution as among a set of operating points at which post-repair yield requirements are met at beginning-of-life (BOL). The selected operating point is therefore the “best” operating point to improve yield at EOL for a desired range of operating points or maximize the EOL operating range. For a given BOL repair operating point, the yield at EOL is computed. The operating point having the best yield at EOL is selected and testing is performed at that operating point to select repairs.

Type: Application

Filed: November 16, 2007

Publication date: May 21, 2009

Inventors: Chad A. Adams, Rajiv V. Joshi, Rouwaida N. Kanj, Sani R. Nassif
ERROR HANDLING SCHEME FOR TIME-CRITICAL PROCESSING ENVIRONMENTS

Publication number: 20090132850

Abstract: As a result of detecting an error, command routing logic for device driver logic is reconfigured so that command processing logic of the device driver is not invoked and to return from commands in a manner indicative of successful completion of command processing.

Type: Application

Filed: August 29, 2008

Publication date: May 21, 2009

Applicant: SEAWEED SYSTEMS

Inventor: Christopher J. Hall
Systems And Methods For Managing A Redundant Management Module

Publication number: 20090125752

Abstract: Systems and methods for managing a redundant management module are provided. In this regard, a representative system, among others, includes first and second management modules that are configured to manage a computing device; and a programmable logic device that is configured to: instruct the first management module to manage the computing device responsive to detecting that the first management module is ready to manage the computing device, and instruct the second management module to manage the computing device responsive to detecting that the first management module failed to manage the computing device.

Type: Application

Filed: August 21, 2008

Publication date: May 14, 2009

Inventors: Kum Cheong Adam Chan, Chee Cheng Jeffrey Liang, Boon Siang Choo, Dale Shidla
Semiconductor memory test device and method thereof

Patent number: 7533310

Abstract: A semiconductor memory test device and method thereof are provided. The example semiconductor memory test device may include a fail memory configured to store at least one test result of a memory under test, a mode selecting unit configured to output a selection signal for selecting a memory address protocol of the fail memory based upon which one of a plurality of test modes is active in the memory under test and an address arranging unit configured to arrange address signals to conform with the selected memory address protocol in response to the selection signal received from the mode selecting unit.

Type: Grant

Filed: December 19, 2006

Date of Patent: May 12, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Je-Young Park, Ki-Sang Kang
Semiconductor test system

Patent number: 7533308

Abstract: The semiconductor test system comprises a test device for testing semiconductor devices including redundant circuits to obtain fail information of defective parts of the semiconductor devices; a redundant remedy judging device which includes fail memories for storing the fail information, and a redundant remedy judging unit for judging based on the fail information stored in the fail memories as to whether or not the redundant remedy replacing the defective parts of the semiconductor devices with the redundant circuits can be made, and which is provided independent of the test device, wherein the test device and the redundant remedy judging device are interconnected with each other via a network, and fail information is transmitted from the test device to the redundant remedy judging device.

Type: Grant

Filed: March 17, 2005

Date of Patent: May 12, 2009

Assignee: Fujitsu Microelectronics Limited

Inventors: Tomomi Yano, Kozo Okamoto, Takumi Morimoto
Error reporting and correcting method for peripheral

Patent number: 7526667

Abstract: An error reporting and correcting method applied to a peripheral. The peripheral can be connected to the user terminal via the network. In the error reporting and correcting method, an error record file is generated immediately after an error operation occurs to the peripheral. The error record file is uploaded via the network by the user host, and a correction program corresponding to the error file is downloaded to the user host via the network. After being downloaded, the correction program for automatically performing error correction is automatically installed in the user host.

Type: Grant

Filed: August 16, 2006

Date of Patent: April 28, 2009

Inventor: Wen-Yung Huang
Failover method of remotely-mirrored clustered file servers

Patent number: 7526668

Abstract: By incorporating a referral mechanism, a failover method and system for remotely mirrored clustered file servers enables a client computer to transparently access a remotely mirrored file system during a failover of a first NAS (network attached storage) storing the file system. The first NAS is clustered with a second NAS that takes over the IP address of the first NAS on failover. A mirroring relationship is established between the first NAS and a third NAS whereby a copy of the file system is replicated to the third NAS. A referral is created on the second NAS, such that an access attempt from the client computer for accessing the first file system that is directed to the second NAS following IP address takeover from the first NAS after failover is referred to the third NAS by the referral to enable the client computer to access the copy of the file system.

Type: Grant

Filed: June 8, 2006

Date of Patent: April 28, 2009

Assignee: Hitachi, Ltd.

Inventors: Hidehisa Shitomi, Manabu Kitamura
FAST SOFTWARE FAULT DETECTION AND NOTIFICATION TO A BACKUP UNIT

Publication number: 20090100288

Abstract: A method and system for quickly informing a backup unit that a primary unit has failed. Normally an exception handler is activated when a software failure occurs and network controller chips or the ASIC interface to a signal bus can operate even though there is a software failure. A software failure notification packet is programmed and stored in a location that is not affected by a software system failure. When a software failure occurs, control is shifted to the exception handler. The exception handler sends a pre-established and pre-addressed packet to the network controller card which transmits this packet to the backup unit. Upon receipt of the packet, the backup unit goes into operation. In some alternate embodiments that include multiple line cards in a single unit, the exception handler sends a signal to a backup unit via a signal bus or a data bus.

Type: Application

Filed: December 16, 2008

Publication date: April 16, 2009

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Neeraj Khurana, Alain Jebara, Neil Joffe, Venkatram Krishnamoorthi
RECOVERY OF APPLICATION FAULTS IN A MIRRORED APPLICATION ENVIRONMENT

Publication number: 20090094478

Abstract: Provided are a method, system, and article of manufacture for recovery of application faults in a mirrored application environment. Application events are recorded at a primary system executing an instruction for an application. The recorded events are transferred to a buffer. The recorded events are transferred from the buffer to a secondary system, wherein the secondary system implements processes indicated in the recorded events to execute the instructions indicated in the events. An error is detected at the primary system. A determination is made of a primary order in which the events are executed by processes in the primary system. A determination is made of a modified order of the execution of the events comprising a different order of executing the events than the primary order in response to detecting the error. The secondary system processes execute the instructions indicated in the recorded events according to the modified order.

Type: Application

Filed: October 5, 2007

Publication date: April 9, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Richard Edwin Harper, Dinesh Kumar Subhraveti
Fall over method through disk take over and computer system having failover function

Patent number: 7516353

Abstract: When a primary server executing a task fails in a computer system where a plurality of servers are connected to an external disk device via a network and the servers boot an operation system from the external disk device, task processing is taken over from the primary server to a server that is not executing a task in accordance with the following method. The method for taking over a task includes the steps of detecting that the primary server fails; searching the computer system for a server that has the same hardware configuration as that of the primary server and that is not running a task; enabling the server, searched for as a result of the search, to access the external disk device; and booting the server from the external disk device.

Type: Grant

Filed: May 16, 2008

Date of Patent: April 7, 2009

Assignee: Hitachi, Ltd.

Inventors: Keisuke Hatasaki, Takao Nakajima
Isolating a drive from disk array for diagnostic operations

Patent number: 7516352

Abstract: A storage system includes a RAID adapter, disk array switches, sub-processors, and hard disk drives (HDDs). The system permits the isolation of a suspected faulty HDD to allow diagnostics to be performed without impacting operation of the rest of the system. Upon detection of a possible fault in a target HDD, a private zone is established including the target HDD and one of the sub-processors, thereby isolating the target HDD. The sub-processor performs diagnostic operations, then transmits its results to the adapter. A faulty HDD can then be fully isolated and the private zone is disassembled, allowing the sub-processor to rejoin the network.

Type: Grant

Filed: March 21, 2006

Date of Patent: April 7, 2009

Assignee: International Business Machines Corporation

Inventors: John C. Elliott, Robert A. Kubo, Gregg S. Lucas
Method for detecting sources of faults or defective measuring sensors by positive case modeling and partial suppression of equations

Publication number: 20090083573

Abstract: A method establishes a global system model equation including model equations, which contain parameters, of individual components that form the global system. According to said method, the parameters of the individual components are detected using sensor values from the sensors that are allocated to the individual components and it is determined whether it is determined whether it is possible to adapt the parameters to the sensor values and to solve the global system model equation.

Type: Application

Filed: July 18, 2005

Publication date: March 26, 2009

Inventors: Claus Hillermeier, Georg Hoever, Hans Mauser
METHOD FOR OPERATING A MANAGEMENT SYSTEM OF FUNCTION MODULES

Publication number: 20090083574

Abstract: Methods for operating a management system that manages a large number of first function modules and second function modules. An inhibitor module I sets first control statuses to designating blocking when associated events are detected by an event detecting device, and then the management system no longer makes associated first function modules available for execution. The inhibitor module I sets second control statuses to designating executable when associated events are detected by an event detecting device, and then the management system makes associated second function modules available for execution.

Type: Application

Filed: September 13, 2005

Publication date: March 26, 2009

Inventors: Bernd Kesch, Hans Hillner, Matthias Knirsch, Alexander Hinz
Routing system and method for transparently rocovering routing states after a failover or during a software upgrade

Patent number: 7506194

Abstract: Methods and apparatus for efficiently enabling routing states to be recovered after a failover or during a software upgrade in a system which supports graceful restart and stateful switchover are disclosed. According to one aspect of the present invention, a method for restarting a network device which has a plurality of routers and is in communication with a first peer being arranged to support graceful restart and a second peer includes performing a graceful restart with respect to the first peer. A peer transparent failover is performed with respect to the second peer. The graceful restart and the peer transparent failover are performed in response to a failure associated with the network device.

Type: Grant

Filed: March 24, 2004

Date of Patent: March 17, 2009

Assignee: Cisco Technology, Inc.

Inventors: Chandrashekhar Appanna, Ruchi Kapoor, Barry Friedman, Himanshu Shah, Gargi Nalawade, Shyam Suri, John Harper

prev … 7 8 9 10 11 12 13 14 15 next