By Masking Or Reconfiguration Patents (Class 714/3)
  • Publication number: 20090319822
    Abstract: A method to minimize performance degradation during communication path failure in a data processing system, comprising a host computer, a storage controller, and a plurality of physical communication paths in communication with the host computer and the storage controller, where the method establishes a. threshold communication path error rate, and determines an (i)th actual communication path error rate for an (i)th physical communication path, wherein that (i)th communication path is one of the plurality of physical communication paths. If the (i)th actual communication path error rate is greater than the threshold communication path error rate, the method discontinues use of the (i)th physical communication path.
    Type: Application
    Filed: June 18, 2008
    Publication date: December 24, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Juan Alonso Coronado, Roger Gregory Hathorn, Bret Wayne Holley, Clarisa Valencia
  • Publication number: 20090319823
    Abstract: Embodiments of the present invention address deficiencies of the art in respect to fault handling and provide a method, system and computer program product for run-time fault resolution from development time fault and fault resolution path identification. In an embodiment of the invention, a method for run-time fault resolution from development time fault and fault resolution path identification can be provided. The method can include detecting a recoverable fault condition in a computing system, selecting a fault resolution path from amongst a multiple development time specified fault resolution paths to match the recoverable fault condition, prompting an operator with the selected fault resolution path, and resuming operation of the computing system without restart subsequent to the operator performing the selected resolution fault path.
    Type: Application
    Filed: June 20, 2008
    Publication date: December 24, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Mark C. Hampton
  • Publication number: 20090313497
    Abstract: The present invention discloses several techniques for providing failover in telemetry systems. The invention allows the continuous and uninterrupted connection between gathering units and a central data collection server, thereby ensuring the proper operation of telemetry systems.
    Type: Application
    Filed: August 5, 2009
    Publication date: December 17, 2009
    Applicant: LAROTEC LTD.
    Inventors: Uzi Lumbroso, Andrey Nekhamkin
  • Patent number: 7634693
    Abstract: An apparatus and method for processing a data signal is provided. An acquisition unit of a test instrument acquires a data signal for a predetermined time. The data signal is stored in a memory of the test instrument and a clock recovery unit recovers a clock signal from the stored data signal. The stored data signal is sliced by a processor into a plurality of data segments of a predetermined length in accordance with the recovered clock signal.
    Type: Grant
    Filed: September 22, 2008
    Date of Patent: December 15, 2009
    Assignee: LeCroy Corporation
    Inventors: Martin Miller, Yaron Habot, Joseph Schachner, Michael Schnecker, Peter J. Pupalaikis
  • Patent number: 7634681
    Abstract: The reliability is improved at a low cost even in a virtualized server environment. The number of spare servers is reduced for improving the reliability and for saving a licensing fee for software on the spare servers. A server system comprises a plurality of physical servers on which a plurality of virtual servers run, a single standby server, a module for detecting an active virtual server, and a module for switching the correspondence of boot disks of virtualization modules for controlling virtual servers to the physical servers. When a physical server fails, the boot disk of the associated virtualization module is connected to a spare server to automatically activate on the spare server those virtual servers which have been active upon occurrence of the failure.
    Type: Grant
    Filed: January 26, 2006
    Date of Patent: December 15, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Yoshifumi Takamoto, Takao Nakajima, Toshihiko Kashiyama
  • Publication number: 20090300404
    Abstract: Methods, apparatus, and products are disclosed for managing execution stability of an application carried out using a plurality of pluggable processing components. Managing execution stability of an application includes: receiving, by an application manager, component stability metrics for a particular pluggable processing component; determining, by the application manager, that the particular pluggable processing component is unstable in dependence upon the component stability metrics for the particular pluggable processing component; and notifying, by the application manager, a system administrator that the particular pluggable processing component is unstable.
    Type: Application
    Filed: May 29, 2008
    Publication date: December 3, 2009
    Applicant: INTERNATINAL BUSINESS MACHINES CORPORATION
    Inventors: Michael J. Branson, Zachary A. Garbow, John M. Santosuosso
  • Publication number: 20090300405
    Abstract: A primary coordinator generates a prepare message for a two-phase commit distributed transaction, the prepare message including an address of a backup coordinator. The primary coordinator maintains a transaction log of the distributed transaction, wherein the transaction log is accessible to both the primary coordinator and the backup coordinator. The prepare message is sent to a plurality of participants. The primary coordinator fails over to the backup coordinator without interrupting the distributed transaction.
    Type: Application
    Filed: May 29, 2008
    Publication date: December 3, 2009
    Inventor: Mark Cameron Little
  • Publication number: 20090300406
    Abstract: An information processing system includes a plurality of server devices including a main server device and at a standby server device, and a client device coupled to said server devices via a network. The client device includes a monitor unit to asynchronously monitor an operation state of each of the plurality of server devices, and a display control unit to acquire a content from the main server device and display the content in a display area on a screen once the monitor unit detects an operation state of the main server device is active, and to acquire from the standby server device a content for a process that the standby server device has taken over from the main server device and displays the content on the screen once the monitor unit detects an operation state of the standby server device is switched from standby state to active state.
    Type: Application
    Filed: May 30, 2009
    Publication date: December 3, 2009
    Applicant: FUJITSU LIMITED
    Inventor: Yousuke KATOU
  • Publication number: 20090287953
    Abstract: A storage system encrypts plain text from an external device and stores the cryptogram into a disk unit, decrypts stored data in the disk unit and transmits decrypted text to the external device. The plain and decrypted text must be in agreement when seen from the external device. If a failure occurs in the encrypting or decrypting process, the plain and decrypted text disagree. The storage system includes an encryption unit for encrypting first data, a decryption unit for decrypting the encrypted data into second data, and a comparison unit for comparing the first and second data. When the first and second data do not agree, the first data is encrypted by a different encryption unit and the encrypted data is decrypted into third data, whereupon the first and third data are compared. When the first and third data do not agree, a failure report is sent.
    Type: Application
    Filed: July 22, 2009
    Publication date: November 19, 2009
    Inventors: Makio MIZUNO, Kazuhisa Fujimoto
  • Publication number: 20090276656
    Abstract: A storage device including a plurality of storage units for storing data dispersively among the storage units, includes: a processor for controlling boot-up of the storage units; and a memory for storing operation history indicative of the sequence of any failure causing any of the storage units to become inoperative, the processor controlling reboot-up of the storage units, when a plurality of the storage units becomes inoperative on account of a plurality of failures, in accordance with process including: determining the order of the reboot up of the storage units that is reversal of the sequence of the failures causing the storage units to become inoperative in reference to the operation history in the memory; rebooting the inoperative storage units successively in accordance with the determined order.
    Type: Application
    Filed: July 10, 2009
    Publication date: November 5, 2009
    Inventor: Hiroshi Sakurai
  • Publication number: 20090265577
    Abstract: Provided is a method of controlling a computer system that includes: a computer; a first storage device connected to the computer via a first path and a second path; and a second storage device externally-connected to the first storage system via a third path and connected to the computer via a fourth path, the first storage device providing a first storage area to the computer, the second storage device including a second storage area corresponding to the first storage area, the method including: judging whether or not a fault has occurred in at least one of the first to fourth paths; selecting, a path used for access to the first or second storage area; and transmitting the access request for the first or second storage area by using the selected path. Accordingly, in the computer system, an application can be prevented from being stopped despite a fault in a path.
    Type: Application
    Filed: June 23, 2008
    Publication date: October 22, 2009
    Inventors: Naoki Haramai, Hiroshi Yokouchi, Ryu Gemba, Atsushi Kondo, Kazuhiro Oyama
  • Patent number: 7607038
    Abstract: In one embodiment, a method for repairing a faulty cache element is provided. Once a monitored cache element is determined to be faulty, the system stores the repair information, and cache configuration in an EEPROM or non-volatile memory on the CPU module. Then the computer is rebooted. During the reboot, the faulty cache element is repaired by being swapped out for a spare cache element based on the information stored in the EEPROM or the non-volatile memory.
    Type: Grant
    Filed: February 17, 2006
    Date of Patent: October 20, 2009
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Jeff Barlow, Jeff Brauch, Howard Calkin, Raymond Gratias, Stephen Hack, Lacey Joyal, Guy Kuntz, Ken Pomaranski, Michael Sedmak
  • Patent number: 7600157
    Abstract: An information handling system includes first and second input/output (I/O) controllers, a detector for detecting an I/O controller failure, and an I/O recovery unit. The first I/O controller adaptively controls a first and a second I/O slot. The second I/O controller adaptively controls a third and a fourth I/O slot. Lastly, the I/O recovery unit, responsive to a detected I/O controller failure, operatively couples/decouples the first and second I/O slot to/from the first I/O controller, operatively couples/decouples the third and fourth I/O slot to/from the second I/O controller, and operatively decouples/couples the first and second I/O slot from/to the third and fourth I/O slot according to an I/O failure recovery protocol, the I/O failure recovery protocol provided for adapting one of either the first and second I/O controllers to operatively couple to the first, second, third and fourth I/O slots as a function of the detected I/O controller failure.
    Type: Grant
    Filed: October 14, 2008
    Date of Patent: October 6, 2009
    Assignee: Dell Products L.P.
    Inventors: Martin McAfee, Bharath Vasudevan
  • Patent number: 7600146
    Abstract: Stable and automated recovery of an application executing on a primary computer system by transitioning to secondary computer system is provided. Intelligent agents installed on a primary client server allow the primary environment to be replicated on a host, secondary environment. This creates continuous availability of applications executing on the primary system via the mirrored host environment. Every transaction on the client environment (e.g., every piece of data) is replicated in real-time on the host environment. A simple mechanism (e.g., a button) is provided to allow access to the secondary environment thereby switching over from the client environment to the host environment in the case of, with respect to the primary environment, a disaster, a need to perform maintenance, system failure and the like.
    Type: Grant
    Filed: April 3, 2007
    Date of Patent: October 6, 2009
    Assignee: Dell Marketing USA L.P.
    Inventors: John Liccione, Kelly M. Jones
  • Patent number: 7594134
    Abstract: A group of data storage units are serially connected in a sequential data communication path to communicate read and write operations to first and second interfaces of each data storage unit in the group. A data management computer device (“filer”) manages read and write operations of the data storage units of the group through an adapter of the filer. Main and redundant primary communication pathway connectors extend from the filer to the interfaces of the data storage unit, thereby establishing redundancy through multiple pathways to communicate the read and write operations to the data storage units of the group. Main and redundant secondary communication pathway connectors extend from partner filers to the groups of data storage units associated with each partner filer, thereby further enhancing redundancy.
    Type: Grant
    Filed: August 14, 2006
    Date of Patent: September 22, 2009
    Assignee: Network Appliance, Inc.
    Inventors: Sue Coatney, George Kong, Doug Coatney
  • Publication number: 20090235110
    Abstract: An input/output control method for an information processing apparatus that is connected to an input/output device through first and second paths, monitors an input/output response to an input/output request issued to the input/output device through the first path, and performs a timeout process when the input/output response is not present within a timeout time. The input/output control method includes predicting a timeout time to the input/output request on the basis of statistic information that the information processing apparatus obtains by monitoring the input/output response, detecting an error on the first path when an input/output response to the input/output request is not present within the predicted timeout time and disconnecting the first path when the error on the first path is detected.
    Type: Application
    Filed: March 16, 2009
    Publication date: September 17, 2009
    Applicant: FUJITSU LIMITED
    Inventor: Kazushige KUROKAWA
  • Publication number: 20090222686
    Abstract: A self-maintained computer system includes a computer system having a plurality of interconnected computer components and a robot associated with the computer system that is configured to carry a spare computer component and further configured to replace a computer component of the computer system with the spare computer component. The robot automatically replaces an individual computer component when a failure of the individual computer component is detected.
    Type: Application
    Filed: March 3, 2008
    Publication date: September 3, 2009
    Applicant: SUN MICROSYSTEMS, INC.
    Inventors: John P. Nibarger, Kevin D. McKinstry
  • Patent number: 7584382
    Abstract: A method and system for identifying a configuration parameter of a “sick” computer system that is at fault for causing an undesired behavior based on analysis of configuration parameters from other computer systems is provided. In one embodiment, a troubleshooting system collects “suspect” values for “suspect” configuration parameters used by a “sick” application when the undesired behavior was exhibited by the sick computer system. The troubleshooting system then compares the suspect values to sample values of the suspect configuration parameters retrieved from sample computer systems. The troubleshooting system uses that comparison to identify one or more suspect configuration parameters that are likely at fault for causing the application to exhibit the undesired behavior.
    Type: Grant
    Filed: August 13, 2004
    Date of Patent: September 1, 2009
    Assignee: Microsoft Corporation
    Inventors: Chad E. Verbowski, Jiahe Helen Wang, John C. Platt, Ruyun Zhang, Yu Chen
  • Publication number: 20090210751
    Abstract: A primary I/O adapter and a redundant I/O adapter of a data processing system are assigned to support access to a system resource. While the primary I/O adapter is in service and the redundant I/O adapter is not in service in providing access to the system resource, a fail over command is issued to remove the primary I/O adapter from service and place the redundant I/O adapter in service in supporting access to the system resource. While the redundant I/O adapter is in service and the primary I/O adapter is not in service in providing access to the system resource, diagnostic testing on the primary I/O adapter is performed. In response to the diagnostic testing revealing no fault in the primary I/O adapter, a fail back command is issued to restore the primary I/O adapter to service and to remove the redundant I/O adapter from service.
    Type: Application
    Filed: February 14, 2008
    Publication date: August 20, 2009
    Inventors: Rafael G. Cabezas, David D. Galvin, Binh K. Hua, Sivarama K. Kodukula
  • Patent number: 7577870
    Abstract: Method and system for performing a task in a control system is provided. The method includes, transmitting a command to a governing agent; re-transmitting the command from the governing agent to a primary, a secondary and an observer agent; validating the task information; and performing the task wherein, the task is executed by the primary agent while the secondary agent monitors task execution and if the primary agent fails to execute the task and/or a threshold value is exceeded, the secondary agent performs the task to the extent the secondary agent perceives that the task is not being performed and to the extent that it has sufficient capacity. The system includes an input module that transmits commands for executing a task to the governing agent that is operationally coupled to a processing unit.
    Type: Grant
    Filed: December 21, 2005
    Date of Patent: August 18, 2009
    Assignee: The Boeing Company
    Inventor: William E. Combs
  • Patent number: 7577864
    Abstract: Provided is a technology for increasing reliability of communication carried out by OSes and application programs operating on logical partitions set on a computer. The computer has multiple logical partitions constructed therein by a control program, the physical interfaces are shared by virtual interfaces respectively set for the multiple logical partitions, and the memory module stores management information indicating correspondences between the physical interface and the virtual interface.
    Type: Grant
    Filed: September 1, 2006
    Date of Patent: August 18, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Kazuhide Horimoto, Toshiomi Moriki, Yuji Tsushima, Takuichi Hoshina
  • Publication number: 20090193287
    Abstract: A memory management method and apparatus based on an access time in a multi-core system. In the memory management method of the multi-core system, it is easy to estimate the execution time of a task to be performed by a processing core and it is possible to secure the same memory access time when a task is migrated between processing cores by setting a memory allocation order according to distances from the processing cores to the memories in correspondence with the processing cores, translating a logical address to be processed by one of the processing cores according to the set memory allocation order into a physical address of one of the memories, and allocating a memory corresponding to the translated physical address to the processing core.
    Type: Application
    Filed: July 2, 2008
    Publication date: July 30, 2009
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Jae-yong Jeong
  • Patent number: 7565398
    Abstract: A method, system and apparatus for dynamically reconfiguring partitions of a partitioned computer system are provided. Here, dynamically reconfiguring a partition is defined as being able to reconfigure the partition while the partition is operational. Using a graphical user interface (GUI), a user may indicate a plurality of resources that are to be involved in a partition reconfiguration. However, the partition is actually reconfigured one resource at a time. If one of the resources is memory, then memory is allocated to the partition or de-allocated from the partition one logical memory block (LMB) at a time. In reconfiguring partitions, resources may be removed, added or moved from one partition to another. Moving resources from one partition to another entails removing the resources from one partition and adding it to the other partition. After a partition re-configuration, a resource allocation table is updated. All this is done transparently to the user.
    Type: Grant
    Filed: June 27, 2002
    Date of Patent: July 21, 2009
    Assignee: International Business Machines Corporation
    Inventors: Shamsundar Ashok, Robert Kimberlin Foster, Walter Manfred Lipp, Minh Nguyen
  • Patent number: 7562247
    Abstract: Methods and systems are disclosed for providing independent clock failover for scalable blade servers that include assigning a server blade to one of a plurality of clock failover groups, providing a plurality of independent clock signals to the clock generator of the server blade, wherein one of the plurality of independent clock signals is an active clock signal, detecting a failover condition for the clock failover group assigned to the server blade, and switching the active clock signal, in response to the detected failover condition, from one independent clock signal to another independent clock signal.
    Type: Grant
    Filed: May 16, 2006
    Date of Patent: July 14, 2009
    Assignee: International Business Machines Corporation
    Inventors: Marcus A. Baker, Justin P. Bandholz, Patrick M. Bland, Andrew S. Heinzmann
  • Publication number: 20090177912
    Abstract: A reconfigurable circuit having redundant reconfigurable clusters is described herein.
    Type: Application
    Filed: December 9, 2008
    Publication date: July 9, 2009
    Applicant: M2000
    Inventor: Frederic Reblewski
  • Publication number: 20090177911
    Abstract: An apparatus, system, and method are disclosed to prevent queue stalling. The apparatus to prevent queue stalling is provided with a plurality of modules configured to functionally execute the necessary steps of detecting a connection failure on a first logical path, wherein the first logical path is associated with a first entry in a queue, and wherein the first logical path is configured to define a communication path between an entity associated with a first entry in the queue and a queue manager, scanning the queue to identify a second entry associated with a second logical path in response to the connection failure, and advancing the second entry to a position within the queue that is ahead of the first entry. These modules in the described embodiments include a detection module, a scanning module, and an advancing module.
    Type: Application
    Filed: January 3, 2008
    Publication date: July 9, 2009
    Inventors: Brian Dow Clark, Juan Alonso Coronado, Dinh Hai Le, Beth Ann Peterson, Clarisa Valencia
  • Patent number: 7558992
    Abstract: Embodiments of apparatuses and methods for reducing the soft error vulnerability of stored data are disclosed. In one embodiment, an apparatus includes storage logic, determination logic, and selection logic. The determination logic is to determine a condition of a dataword. The storage logic includes logic to store a first portion of the dataword, a second portion of the dataword, and a result generated by the determination logic. The selection logic is to select, based on the contents of the storage logic to store the result, either the contents of the storage logic to store the second portion of the dataword, or a replacement value. The replacement value depends on the contents of a predetermined bit of the storage logic to store the first portion of the dataword.
    Type: Grant
    Filed: October 10, 2005
    Date of Patent: July 7, 2009
    Assignee: Intel Corporation
    Inventors: Oguz Ergin, Osman Unsal, Xavier Vera, Antonio González
  • Patent number: 7555671
    Abstract: Embodiments include systems and methods for processing Reliability, Availability and Serviceability (RAS) events in a computer system. Embodiments comprise processing critical events in a first portion of a Management Interrupt (MI) period. The MI period is chosen to be not greater than a maximum tolerable Operating System (OS) latency period. If time remains in a current MI period after processing critical events, the system then processes non-critical events during the time remaining in the current MI period. If at the end of the current MI period, some non-critical events remain to be processed, a subsequent MI period is scheduled to process the remaining non-critical events.
    Type: Grant
    Filed: August 31, 2006
    Date of Patent: June 30, 2009
    Assignee: Intel Corporation
    Inventors: Murugasamy Nachimuthu, Singaravelan Nallasellan, Mohan J. Kumar
  • Publication number: 20090161143
    Abstract: A monitoring server that monitors an image forming apparatus and detects an error if any has occurred thereon, identifies a type of firmware that has caused the error, and issues an instruction for replacing a version of the identified type of firmware. The image forming apparatus issues, to a transmission server, a request for transmitting a replacement version of the identified type of firmware. The transmission server refers to a firmware version list table to determine another type of firmware whose version is to be replaced in association with the replacement version of the identified type of firmware. The transmission server transmits the replacement version of the identified type of firmware and the determined another type of firmware to the image forming apparatus. The image forming apparatus receives each firmware transmitted from the transmission server and applies the received firmware to the image forming apparatus.
    Type: Application
    Filed: December 12, 2008
    Publication date: June 25, 2009
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Hisashi Nakamoto
  • Publication number: 20090158081
    Abstract: Failover of blade servers in a data center including powering off a failing blade server by a system management server through a blade server management module (‘BSMM’) managing the failing blade server, the failing blade server characterized by a machine type, one or more network addresses, and one or more storage addresses, the addresses being virtual addresses; identifying, by the system management server from a pool of standby blade servers, a replacement blade server, the replacement blade server managed by a BSMM; assigning, by the system management server through the BSMM managing the replacement blade server, the one or more network addresses and the one or more storage addresses of the failing blade server to the replacement blade server, including enabling in the replacement blade server the assigned addresses; and powering on the replacement blade server by the system management server through the BSMM managing the replacement blade server.
    Type: Application
    Filed: December 13, 2007
    Publication date: June 18, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gregory W. Dake, Eric R. Kern, Andrew B. McNeill, JR., Martin J. Tross, Theodore B. Vojnovich, Ben-Ami Yassour
  • Patent number: 7549076
    Abstract: When a primary server executing a task fails in a computer system where a plurality of servers are connected to an external disk device via a network and the servers boot an operation system from the external disk device, task processing is taken over from the primary server to a server that is not executing a task in accordance with the following method. The method for taking over a task includes the steps of detecting that the primary server fails; searching the computer system for a server that has the same hardware configuration as that of the primary server and that is not running a task; enabling the server, searched for as a result of the search, to access the external disk device; and booting the server from the external disk device.
    Type: Grant
    Filed: January 13, 2005
    Date of Patent: June 16, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Keisuke Hatasaki, Takao Nakajima
  • Publication number: 20090150715
    Abstract: In one embodiment, a method includes ingesting a program stream from a program source on a first channel. The method also includes storing the program stream, and receiving notification from a client of unrecoverable error in a stream received at the client. The unrecoverable error corresponds to at least a portion of the stored program stream. The method also includes distributing the corresponding portion of the stored program stream to the client on a second channel in response to the notification.
    Type: Application
    Filed: December 6, 2007
    Publication date: June 11, 2009
    Inventors: John Pickens, William C. VerSteeg
  • Publication number: 20090144579
    Abstract: A virtual machine monitor (VMM) in a data processing system handles errors involving virtual machines (VMs) in the processing system. For instance, an error manager in the VMM may detect an uncorrectable error in involving a component associated with a first VM in the processing system. In response to detection of that error, the error manager may terminate the first VM, while allowing a second VM in the processing system to continue operating. In one embodiment, the error manager automatically determines which VM is affected by the uncorrectable error, in response to detecting the uncorrectable error. The error manager may also automatically spawn a new VM to replace the first VM, if the processing system has sufficient resources to support the new VM. Other embodiments are described and claimed.
    Type: Application
    Filed: December 4, 2007
    Publication date: June 4, 2009
    Inventor: ROBERT C. SWANSON
  • Patent number: 7543046
    Abstract: A method for managing node-specific quorum roles in a cluster of nodes. The method comprises assigning a configuration change of node role attributes to a node in the cluster, with each attribute being represented by a bit of a binary value. The binary value corresponding to the specified node role attributes is converted to an integer value and the integer value is stored in a node record associated with the node. The node record is accessed to isolate each bit of the integer value associated with the node and the node role attributes assigned to the node are identified from each of the isolated bits of the integer value associated with the node. A determination of whether the configuration change can be safely supported by a configuration quorum of nodes required to represent the cluster's configuration accurately is made based on the identified node role attributes.
    Type: Grant
    Filed: May 30, 2008
    Date of Patent: June 2, 2009
    Assignee: International Business Machines Corporation
    Inventors: Myung M. Bae, Steven R. Champagne, Robert K. Gardner, Felipe Knop, Johannes M. Sayre
  • Publication number: 20090138750
    Abstract: A redundant communication system and method for providing data communication between a first computing node and a second computing node. A transmitter is provided as part of the first computing node. A receiver is provided as part of the second computing node. A first signal line carries a first data signal. The first signal line electrically couples the transmitter with the receiver. A second signal line carries a second data signal redundant to the first signal. The second signal line electrically couples the transmitter with the receiver. The receiver evaluates the first data signal to determine the presence of an error and the second node uses the second data signal if an error is detected in the first data signal.
    Type: Application
    Filed: January 29, 2009
    Publication date: May 28, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alfredo Aldereguia, Grace Ann Richter, Jeffrey B. Williams
  • Publication number: 20090132061
    Abstract: A system and method is disclosed for system fault recovery by an implantable medical device which employs a global fault response. The system enables the device to consistently recover from transient faults while maintaining a history of the reason for the device fault. Upon detection of a fault, the primary controller of the device signals a reset controller which then issues a reset command. All sub-systems of the primary device controller are then reset together rather than resetting individual sub-systems independently to ensure deterministic behavior.
    Type: Application
    Filed: January 26, 2009
    Publication date: May 21, 2009
    Applicant: Cardiac Pacemakers, Inc.
    Inventors: Scott Stubbs, Conrad L. Sowder, William J. Linder, Lynn S. Elliott, Kenneth P. Hoyme, Hiten J. Doshi
  • Publication number: 20090132849
    Abstract: A method and computer program for selecting circuit repairs using redundant elements with consideration of aging effects provides a mechanism for raising short-term and long-term performance of memory arrays beyond present levels/yields. Available redundant elements are used as replacements for selected elements in the array. The elements for replacement are selected by BOL (beginning-of-life) testing at a selected operating point that maximizes the end-of-life (EOL) yield distribution as among a set of operating points at which post-repair yield requirements are met at beginning-of-life (BOL). The selected operating point is therefore the “best” operating point to improve yield at EOL for a desired range of operating points or maximize the EOL operating range. For a given BOL repair operating point, the yield at EOL is computed. The operating point having the best yield at EOL is selected and testing is performed at that operating point to select repairs.
    Type: Application
    Filed: November 16, 2007
    Publication date: May 21, 2009
    Inventors: Chad A. Adams, Rajiv V. Joshi, Rouwaida N. Kanj, Sani R. Nassif
  • Publication number: 20090132850
    Abstract: As a result of detecting an error, command routing logic for device driver logic is reconfigured so that command processing logic of the device driver is not invoked and to return from commands in a manner indicative of successful completion of command processing.
    Type: Application
    Filed: August 29, 2008
    Publication date: May 21, 2009
    Applicant: SEAWEED SYSTEMS
    Inventor: Christopher J. Hall
  • Publication number: 20090125752
    Abstract: Systems and methods for managing a redundant management module are provided. In this regard, a representative system, among others, includes first and second management modules that are configured to manage a computing device; and a programmable logic device that is configured to: instruct the first management module to manage the computing device responsive to detecting that the first management module is ready to manage the computing device, and instruct the second management module to manage the computing device responsive to detecting that the first management module failed to manage the computing device.
    Type: Application
    Filed: August 21, 2008
    Publication date: May 14, 2009
    Inventors: Kum Cheong Adam Chan, Chee Cheng Jeffrey Liang, Boon Siang Choo, Dale Shidla
  • Patent number: 7533310
    Abstract: A semiconductor memory test device and method thereof are provided. The example semiconductor memory test device may include a fail memory configured to store at least one test result of a memory under test, a mode selecting unit configured to output a selection signal for selecting a memory address protocol of the fail memory based upon which one of a plurality of test modes is active in the memory under test and an address arranging unit configured to arrange address signals to conform with the selected memory address protocol in response to the selection signal received from the mode selecting unit.
    Type: Grant
    Filed: December 19, 2006
    Date of Patent: May 12, 2009
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Je-Young Park, Ki-Sang Kang
  • Patent number: 7533308
    Abstract: The semiconductor test system comprises a test device for testing semiconductor devices including redundant circuits to obtain fail information of defective parts of the semiconductor devices; a redundant remedy judging device which includes fail memories for storing the fail information, and a redundant remedy judging unit for judging based on the fail information stored in the fail memories as to whether or not the redundant remedy replacing the defective parts of the semiconductor devices with the redundant circuits can be made, and which is provided independent of the test device, wherein the test device and the redundant remedy judging device are interconnected with each other via a network, and fail information is transmitted from the test device to the redundant remedy judging device.
    Type: Grant
    Filed: March 17, 2005
    Date of Patent: May 12, 2009
    Assignee: Fujitsu Microelectronics Limited
    Inventors: Tomomi Yano, Kozo Okamoto, Takumi Morimoto
  • Patent number: 7526667
    Abstract: An error reporting and correcting method applied to a peripheral. The peripheral can be connected to the user terminal via the network. In the error reporting and correcting method, an error record file is generated immediately after an error operation occurs to the peripheral. The error record file is uploaded via the network by the user host, and a correction program corresponding to the error file is downloaded to the user host via the network. After being downloaded, the correction program for automatically performing error correction is automatically installed in the user host.
    Type: Grant
    Filed: August 16, 2006
    Date of Patent: April 28, 2009
    Inventor: Wen-Yung Huang
  • Patent number: 7526668
    Abstract: By incorporating a referral mechanism, a failover method and system for remotely mirrored clustered file servers enables a client computer to transparently access a remotely mirrored file system during a failover of a first NAS (network attached storage) storing the file system. The first NAS is clustered with a second NAS that takes over the IP address of the first NAS on failover. A mirroring relationship is established between the first NAS and a third NAS whereby a copy of the file system is replicated to the third NAS. A referral is created on the second NAS, such that an access attempt from the client computer for accessing the first file system that is directed to the second NAS following IP address takeover from the first NAS after failover is referred to the third NAS by the referral to enable the client computer to access the copy of the file system.
    Type: Grant
    Filed: June 8, 2006
    Date of Patent: April 28, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Hidehisa Shitomi, Manabu Kitamura
  • Publication number: 20090100288
    Abstract: A method and system for quickly informing a backup unit that a primary unit has failed. Normally an exception handler is activated when a software failure occurs and network controller chips or the ASIC interface to a signal bus can operate even though there is a software failure. A software failure notification packet is programmed and stored in a location that is not affected by a software system failure. When a software failure occurs, control is shifted to the exception handler. The exception handler sends a pre-established and pre-addressed packet to the network controller card which transmits this packet to the backup unit. Upon receipt of the packet, the backup unit goes into operation. In some alternate embodiments that include multiple line cards in a single unit, the exception handler sends a signal to a backup unit via a signal bus or a data bus.
    Type: Application
    Filed: December 16, 2008
    Publication date: April 16, 2009
    Applicant: CISCO TECHNOLOGY, INC.
    Inventors: Neeraj Khurana, Alain Jebara, Neil Joffe, Venkatram Krishnamoorthi
  • Publication number: 20090094478
    Abstract: Provided are a method, system, and article of manufacture for recovery of application faults in a mirrored application environment. Application events are recorded at a primary system executing an instruction for an application. The recorded events are transferred to a buffer. The recorded events are transferred from the buffer to a secondary system, wherein the secondary system implements processes indicated in the recorded events to execute the instructions indicated in the events. An error is detected at the primary system. A determination is made of a primary order in which the events are executed by processes in the primary system. A determination is made of a modified order of the execution of the events comprising a different order of executing the events than the primary order in response to detecting the error. The secondary system processes execute the instructions indicated in the recorded events according to the modified order.
    Type: Application
    Filed: October 5, 2007
    Publication date: April 9, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Richard Edwin Harper, Dinesh Kumar Subhraveti
  • Patent number: 7516353
    Abstract: When a primary server executing a task fails in a computer system where a plurality of servers are connected to an external disk device via a network and the servers boot an operation system from the external disk device, task processing is taken over from the primary server to a server that is not executing a task in accordance with the following method. The method for taking over a task includes the steps of detecting that the primary server fails; searching the computer system for a server that has the same hardware configuration as that of the primary server and that is not running a task; enabling the server, searched for as a result of the search, to access the external disk device; and booting the server from the external disk device.
    Type: Grant
    Filed: May 16, 2008
    Date of Patent: April 7, 2009
    Assignee: Hitachi, Ltd.
    Inventors: Keisuke Hatasaki, Takao Nakajima
  • Patent number: 7516352
    Abstract: A storage system includes a RAID adapter, disk array switches, sub-processors, and hard disk drives (HDDs). The system permits the isolation of a suspected faulty HDD to allow diagnostics to be performed without impacting operation of the rest of the system. Upon detection of a possible fault in a target HDD, a private zone is established including the target HDD and one of the sub-processors, thereby isolating the target HDD. The sub-processor performs diagnostic operations, then transmits its results to the adapter. A faulty HDD can then be fully isolated and the private zone is disassembled, allowing the sub-processor to rejoin the network.
    Type: Grant
    Filed: March 21, 2006
    Date of Patent: April 7, 2009
    Assignee: International Business Machines Corporation
    Inventors: John C. Elliott, Robert A. Kubo, Gregg S. Lucas
  • Publication number: 20090083573
    Abstract: A method establishes a global system model equation including model equations, which contain parameters, of individual components that form the global system. According to said method, the parameters of the individual components are detected using sensor values from the sensors that are allocated to the individual components and it is determined whether it is determined whether it is possible to adapt the parameters to the sensor values and to solve the global system model equation.
    Type: Application
    Filed: July 18, 2005
    Publication date: March 26, 2009
    Inventors: Claus Hillermeier, Georg Hoever, Hans Mauser
  • Publication number: 20090083574
    Abstract: Methods for operating a management system that manages a large number of first function modules and second function modules. An inhibitor module I sets first control statuses to designating blocking when associated events are detected by an event detecting device, and then the management system no longer makes associated first function modules available for execution. The inhibitor module I sets second control statuses to designating executable when associated events are detected by an event detecting device, and then the management system makes associated second function modules available for execution.
    Type: Application
    Filed: September 13, 2005
    Publication date: March 26, 2009
    Inventors: Bernd Kesch, Hans Hillner, Matthias Knirsch, Alexander Hinz
  • Patent number: 7506194
    Abstract: Methods and apparatus for efficiently enabling routing states to be recovered after a failover or during a software upgrade in a system which supports graceful restart and stateful switchover are disclosed. According to one aspect of the present invention, a method for restarting a network device which has a plurality of routers and is in communication with a first peer being arranged to support graceful restart and a second peer includes performing a graceful restart with respect to the first peer. A peer transparent failover is performed with respect to the second peer. The graceful restart and the peer transparent failover are performed in response to a failure associated with the network device.
    Type: Grant
    Filed: March 24, 2004
    Date of Patent: March 17, 2009
    Assignee: Cisco Technology, Inc.
    Inventors: Chandrashekhar Appanna, Ruchi Kapoor, Barry Friedman, Himanshu Shah, Gargi Nalawade, Shyam Suri, John Harper