Using active fault-masking, e.g., by switching out faulty elements or by switching in spare elements, etc. (EPO) Patents (Class 714/E11.071)

E Subclasses

In systems, e.g., multiprocessors, etc. (EPO) (Class 714/E11.072)

In distributed systems (EPO) (Class 714/E11.073)

In regular structures (EPO) (Class 714/E11.074)

In interconnections, e.g., rings, etc. (EPO) (Class 714/E11.078)

Bus (EPO) (Class 714/E11.079)

Data exchange between units, e.g., for updating backup units, etc. (EPO) (Class 714/E11.08)

For control, e.g., actuators, etc. (EPO) (Class 714/E11.081)

In arithmetic units (EPO) (Class 714/E11.082)

Redundant power supplies (EPO) (Class 714/E11.083)

Masking faults in storage systems using spares and/or by reconfiguring (EPO) (Class 714/E11.084)

Removing defective units from operation (EPO) (Class 714/E11.085)

Bypassing defective units on a serial bus (EPO) (Class 714/E11.086)

With address translations and modifications (EPO) (Class 714/E11.087)

Handling defects in a Redundant Array of Inexpensive Disks (RAID) by remapping (EPO) (Class 714/E11.088)

Managing spare storage units (EPO) (Class 714/E11.089)

Hot spares (EPO) (Class 714/E11.09)

Via redundancy in hardware accessing the storage components (EPO) (Class 714/E11.091)

Using redundant I/O processors, storage control units or array controllers (EPO) (Class 714/E11.092)

Connection redundancy between storage system components (EPO) (Class 714/E11.095)

Using the replication of data, e.g., with two or more copies, etc. (EPO) (Class 714/E11.098)

Duplex memories, e.g., twin boot ROMs, etc. (EPO) (Class 714/E11.099)

Duplexed caches, e.g., cache paired with non-volatile storage, etc. (EPO) (Class 714/E11.1)

Mirroring, i.e., the concept of maintaining data on two or more units in the same state at all times (EPO) (Class 714/E11.101)

In logic arrays, e.g., programmable or iterative logic arrays, etc. (EPO) (Class 714/E11.111)

Selectable error control for memory device

Patent number: 11816339

Abstract: Methods, systems, and devices for selectable error control for memory device are described. An apparatus may include a memory array and a circuit configurable to perform a first error control operation and a second error control operation on data stored by the memory array. The circuit may include a first plurality of gates enabled during the first error control operation and configured to generate a first set of bits associated with a first matrix of the first error control operation. The circuit may also include a second plurality of gates enabled during the second error control operation and configured to generate a second set of bits associated with the second matrix of the second error control operation. The circuit may further include a third plurality of gates configured to generate a third set of bits that are common to both the first matrix and the second matrix.

Type: Grant

Filed: August 2, 2021

Date of Patent: November 14, 2023

Assignee: Micron Technology, Inc.

Inventor: Scott E. Schaefer
RAID system with fault resilient storage devices

Patent number: 11775391

Abstract: A storage system, and a method for operating a storage system. In some embodiments, the system includes a first storage device and a second storage device, and the method includes: determining that the first storage device is in a read-only state and that the second storage device is in a read-write state; performing a write operation, of a first stripe, to the storage system; performing a first read operation, of a second stripe, from the storage system; and performing a second read operation, of the first stripe, from the storage system, wherein: the performing of the write operation includes: writing a portion of the first stripe to the second storage device, and making an entry in a mapping table for the first stripe.

Type: Grant

Filed: December 1, 2020

Date of Patent: October 3, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Sung Wook Ryu, Yang Seok Ki, Dong Gi Daniel Lee, Changho Choi, Ehsan Najafabadi
Method for reducing impact of flapping links on performance of network devices

Patent number: 11606299

Abstract: The method for reducing the impact of flapping links on the performance of network devices is implemented in three steps. Step 1: initialize a number of points for the interface to define the interface status. Step 2: add a number of points to T and change the interface status to isolated. Step 3: the interface returns to the normal state. With these steps, routers can reduce the impact of flapping links on route calculation and on their performance.

Type: Grant

Filed: December 31, 2020

Date of Patent: March 14, 2023

Assignee: VIETTEL GROUP

Inventors: Hanh Trang Nguyen, Van Huong Tran
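
The three steps in the flapping-links abstract above (patent 11606299) can be pictured with a small penalty-point sketch. This is only an illustration under stated assumptions, not the patented method: the abstract does not define T, so the sketch treats it as the running point total, and the constants `PENALTY_PER_FLAP`, `ISOLATE_THRESHOLD`, and `DECAY_PER_TICK`, along with the decay-based return to normal, are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical tuning values; the abstract gives no numbers.
PENALTY_PER_FLAP = 10
ISOLATE_THRESHOLD = 30
DECAY_PER_TICK = 5


@dataclass
class Interface:
    name: str
    t: int = 0              # Step 1: point total (assumed meaning of "T") starts at zero
    isolated: bool = False  # an isolated interface is ignored by route calculation


def on_flap(iface: Interface) -> None:
    """Step 2: add points for each flap and isolate once the threshold is reached."""
    iface.t += PENALTY_PER_FLAP
    if iface.t >= ISOLATE_THRESHOLD:
        iface.isolated = True


def on_tick(iface: Interface) -> None:
    """Step 3: decay points during quiet periods; return to normal at zero."""
    iface.t = max(0, iface.t - DECAY_PER_TICK)
    if iface.isolated and iface.t == 0:
        iface.isolated = False


def usable_for_routing(iface: Interface) -> bool:
    """Route calculation would skip interfaces that are currently isolated."""
    return not iface.isolated
```

With these assumed values, three flaps isolate the interface and six quiet ticks bring it back, so repeated flaps during isolation never reach the route calculation.
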
LUT-based focused ion beam friendly fill-cell design

Patent number: 10735004

Abstract: An integrated circuit includes a plurality of logic function circuits disposed on the integrated circuit and interconnected by metal interconnect lines to form a logic network. A plurality of configurable logic function circuits is also disposed on the integrated circuit, each configurable logic function circuit being disposed on a respective area on the integrated circuit and not interconnected by the metal interconnect lines to form the logic network.

Type: Grant

Filed: May 9, 2019

Date of Patent: August 4, 2020

Assignee: Microchip Technology Inc.

Inventors: Matthew Kian Chin Yap, Alan Nakamoto
Automatically Scaled Network Overlay with Heuristic Monitoring in a Hybrid Cloud Environment

Publication number: 20130268799

Abstract: Techniques are provided for a management application in a first virtual network to start a first cloud gateway in the first virtual network. First messages are sent to a second virtual network, the first messages comprising information configured to start a second cloud gateway and a first virtual switch in the second virtual network. A connection is established between the first cloud gateway and the second cloud gateway, where the first cloud gateway, the second cloud gateway, and the first virtual switch form a first scalable cloud network element. One or more second messages are sent to the second virtual network, the one or more second messages comprising information configured to start a virtual machine and a first virtual machine interface configured to allow the virtual machine to access processing resources in the second virtual network. Data are stored that associates the virtual machine with the first virtual switch.

Type: Application

Filed: April 4, 2012

Publication date: October 10, 2013

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Kyle Andrew Donald Mestery, David Chang, Murali Anantha
ENHANCED DEBUGGING FOR EMBEDDED DEVICES

Publication number: 20130212425

Abstract: Methods, machine-readable tangible storage media, and data processing systems that enable a debug host device to acquire memory dump information from a debug target device after the target device suffers an unrecoverable system malfunction are disclosed. In one embodiment, data in the volatile memory on a debug target device is accessed via a hardware integrated debug framework, which is also used to access data on a nonvolatile electronically erasable semiconductor memory of a debug target device, and one or more registers of one or more processors on a debug target device, and a core dump is created on the debug host device.

Type: Application

Filed: September 14, 2012

Publication date: August 15, 2013

Inventors: Russell A. Blaine, Matthew Byom, Kevi Rathbun Walker, Daniel S. Heller, Shantonu Sen
UNFUSING A FAILING PART OF AN OPERATOR GRAPH

Publication number: 20130166942

Abstract: Techniques for managing a fused processing element are described. Embodiments receive streaming data to be processed by a plurality of processing elements. Additionally, an operator graph of the plurality of processing elements is established. The operator graph defines at least one execution path and wherein at least one of the processing elements of the operator graph is configured to receive data from at least one upstream processing element and transmit data to at least one downstream processing element. Embodiments detect an error condition has been satisfied at a first one of the plurality of processing elements, wherein the first processing element contains a plurality of fused operators. At least one of the plurality of fused operators is selected for removal from the first processing element. Embodiments then remove the selected at least one fused operator from the first processing element.

Type: Application

Filed: December 22, 2011

Publication date: June 27, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael J. Branson, John M. Santosuosso, Brandon W. Schulz
SELECTIVE MESSAGE LOSS HANDLING IN A CLUSTER OF REPLICATED SERVERS

Publication number: 20130124909

Abstract: A computer-implemented method, a computerized system and a product for providing a cluster of replicated servers. The method performed by a computerized server in a cluster of servers, wherein the cluster of servers are executing replicated instances of an application, wherein the replicated instances are configured to perform the same processing of the same input, comprising: detecting a message loss in the server; selectively determining a responsive action to the message loss; and notifying the cluster of servers of the responsive action determined by the server, whereby other servers of the cluster of servers are able to mimic operation of the server by simulating the responsive action.

Type: Application

Filed: November 15, 2011

Publication date: May 16, 2013

Applicant: International Business Machines Corporation

Inventors: Avraham Harpaz, Nir Naaman, Idan Zach
PERFORMANCE ENHANCEMENT TECHNIQUE FOR RAIDS UNDER REBUILD

Publication number: 20130073900

Abstract: A method for improving the performance of a RAID under rebuild is disclosed. In one embodiment, such a method includes identifying a RAID requiring rebuild, such as by identifying a RAID having one or more failed storage-drive components. The method then automatically performs the following in response to identifying the RAID: the method identifies hot extents (i.e., extents most heavily accessed) in the RAID; the method migrates the hot extents from the identified failed RAID to a normal RAID not requiring rebuild, such as to an underused RAID; and the method rebuilds the failed RAID. The migration of the hot extents will ideally occur while the RAID is being rebuilt but may also be performed prior to the rebuild process. A corresponding apparatus and computer program product are also disclosed.

Type: Application

Filed: September 20, 2011

Publication date: March 21, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Chao Guang Li, Yao Peng, Cheng-Chung Song, Zhi Qiang Wang, Hui Zhang
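
The hot-extent migration described in the RAID-rebuild abstract above (publication 20130073900) might look roughly like the following. It is a sketch under assumptions: the dictionaries standing in for RAIDs, the `access_counts` map, and the `top_n` cutoff are all hypothetical, and the abstract does not say how "most heavily accessed" is measured.

```python
from collections import Counter
from typing import Dict, List


def pick_hot_extents(access_counts: Dict[str, int], top_n: int) -> List[str]:
    """Return the ids of the top_n most heavily accessed extents."""
    return [extent for extent, _ in Counter(access_counts).most_common(top_n)]


def migrate_hot_extents(
    degraded: Dict[str, bytes],
    healthy: Dict[str, bytes],
    access_counts: Dict[str, int],
    top_n: int,
) -> List[str]:
    """Move the hottest extents from the RAID under rebuild to a normal RAID.

    Both RAIDs are modeled as plain dicts of extent id -> data (an assumption
    made only to keep the sketch self-contained).
    """
    counts_here = {e: access_counts.get(e, 0) for e in degraded}
    hot = pick_hot_extents(counts_here, top_n)
    for extent_id in hot:
        healthy[extent_id] = degraded.pop(extent_id)
    return hot
```

After migration, the heaviest I/O lands on the healthy RAID, leaving the degraded one freer to rebuild.
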
DISASTER RECOVERY APPLIANCE

Publication number: 20130031405

Abstract: A disaster recovery appliance is described herein. The disaster recovery appliance is coupled to one or more servers. The disaster recovery appliance continuously receives backup data for each of the one or more servers. When a server fails, the disaster recovery appliance replaces the failed server. While the failed server is inaccessible, the disaster recovery appliance is able to mimic the functionality of the failed server. In some embodiments, the disaster recovery appliance is able to act as a server in addition to a backup device for the other servers.

Type: Application

Filed: October 4, 2012

Publication date: January 31, 2013

Applicant: MAXSP CORPORATION

Inventor: MaxSP Corporation
MANAGING RECOVERY VIRTUAL MACHINES IN CLUSTERED ENVIRONMENT

Publication number: 20130007506

Abstract: Techniques involving replication of virtual machines in a clustered environment are described. One representative technique includes receiving a replication request to replicate a primary virtual machine. A clustering broker is configured to act on the replication request on behalf of a cluster of recovery nodes, by at least placing a replicated virtual machine corresponding to the source virtual machine on a recovery node and facilitating tracking of the migration of the replicated virtual machine within the cluster. The clustering broker returns an address of the recovery node that has been placed or found through tracking for the particular virtual machine.

Type: Application

Filed: July 1, 2011

Publication date: January 3, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Manoj Kumar Jain, Krishan Kumar Attre, Prasad Thiruveedu, Praveen Vijayaraghavan, Vrijesh Kothari, Ahmed Bisht
HIGH AVAILABILITY DATA STORAGE SYSTEMS AND METHODS

Publication number: 20130007504

Abstract: Provided are systems and methods for accessing a storage device from a node when a local connection failure occurs between the node and the storage device. A failure is determined to have occurred at a first node access path between a first node and a storage device that prevents an application at the first node from accessing the storage device from the first node access path. An access request is sent from the first node to a second node. The second node has a second node access path to the storage device. A determination is made that the second node can communicate with the storage device. The storage device is accessed by an application at the first node via the second node access path.

Type: Application

Filed: June 30, 2011

Publication date: January 3, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Manohar Rao Bodke, Lakshmi Priya Kanduru, Rohit Krishna Prasad, Ravi A. Shankar
REPURPOSING DATA LANE AS CLOCK LANE BY MIGRATING TO REDUCED SPEED LINK OPERATION

Publication number: 20130007502

Abstract: Methods and apparatus relating to repurposing a data lane as a clock lane by migrating to reduced speed link operation are described. In one embodiment, speed of a link is reduced upon detection of failure on a clock lane of the link and one of a plurality of data lanes of a link is repurposed as a replacement clock lane. Other embodiments are also disclosed and claimed.

Type: Application

Filed: July 1, 2011

Publication date: January 3, 2013

Inventors: VENKATRAMAN IYER, ROBERT G. BLANKENSHIP, ALLEN J. BAUM
Automatically Performing Failover Operations With A Load Balancer

Publication number: 20130007505

Abstract: A load balancer includes a failover logic unit to identify servers to execute services, generate and store in the load balancer a failover rule and a service rule, and to determine a failure in a first server that executes a first service responsive to a lack of response by the first server to a keepalive message sent by the load balancer to the first server. The load balancer can then perform an operation to cause an automatic failover of the first service to another server based on the failover and service rules.

Type: Application

Filed: June 30, 2011

Publication date: January 3, 2013

Inventor: Thomas D. Spear
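
The keepalive-driven failover in the load-balancer abstract above (publication 20130007505) can be sketched as follows. The shapes chosen for the rules are assumptions: here the service rule maps a service to the server executing it, the failover rule maps a server to an ordered list of backups, and `send_keepalive` is a hypothetical callable that returns whether a server answered.

```python
from typing import Callable, Dict, List, Optional


class LoadBalancer:
    def __init__(
        self,
        service_rule: Dict[str, str],
        failover_rule: Dict[str, List[str]],
        send_keepalive: Callable[[str], bool],
    ) -> None:
        self.service_rule = dict(service_rule)    # service -> server running it
        self.failover_rule = dict(failover_rule)  # server -> ordered backup servers
        self.send_keepalive = send_keepalive      # True if the server responded

    def _first_live_backup(self, server: str) -> Optional[str]:
        """Return the first backup that answers a keepalive, if any."""
        for backup in self.failover_rule.get(server, []):
            if self.send_keepalive(backup):
                return backup
        return None

    def check(self, server: str) -> List[str]:
        """Probe one server; if it is silent, fail its services over.

        Returns the names of the services that were moved.
        """
        if self.send_keepalive(server):
            return []
        moved = []
        for service, host in list(self.service_rule.items()):
            if host != server:
                continue
            backup = self._first_live_backup(server)
            if backup is not None:
                self.service_rule[service] = backup
                moved.append(service)
        return moved
```

A caller would invoke `check` periodically for each server; services with no live backup simply stay assigned to the failed server in this sketch.
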
DATA ROUTING FOR POWER OUTAGE MANAGEMENT

Publication number: 20120324273

Abstract: In one embodiment, a particular node in a computer network, that is, one receiving electrical power from a grid source, may determine routing metrics to a plurality of neighbor nodes of the particular node in the computer network. In addition, the node also determines power grid connectivity of the plurality of neighbor nodes. Traffic may be routed from the particular node to one or more select neighbor nodes having preferred routing metrics, until a power outage condition at the particular node is detected, at which time the traffic (e.g., last gasp messages) may be routed from the particular node to one or more select neighbor nodes having diverse power grid connectivity from the particular node. In this manner, traffic may be routed via a device that is not also experiencing the power outage condition.

Type: Application

Filed: June 20, 2011

Publication date: December 20, 2012

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Shmuel Shaffer, Sandeep J. Shetty, Jean-Philippe Vasseur
OPTICAL COMMUNICATION SYSTEM, INTERFACE BOARD AND CONTROL METHOD PERFORMED IN INTERFACE BOARD

Publication number: 20120324272

Abstract: An embodiment of the invention is an optical communication system including: a plurality of interface boards which transmit and receive optical signals to and from interface boards facing the plurality of interface boards; and a monitoring control device which monitors states of the plurality of interface boards. A first interface board of the plurality of interface boards includes: a replacement unit capable of monitoring the states of the plurality of interface boards on behalf of the monitoring control device and independently receiving supply of power; and a control unit configured to start the replacement unit in a case where a fault occurs in the monitoring control device and stop or halt the replacement unit in a case where there is no fault in the monitoring control device.

Type: Application

Filed: June 1, 2012

Publication date: December 20, 2012

Inventors: Takuya IWASAWA, Kenta Noda
REPLAYING JOBS AT A SECONDARY LOCATION OF A SERVICE

Publication number: 20120311377

Abstract: Jobs submitted to a primary location of a service within a period of time before and/or after a fail-over event are determined and are resubmitted to a secondary location of the service. For example, jobs that are submitted fifteen minutes before the fail-over event and jobs that are submitted to the primary network before the fail-over to the second location is completed are resubmitted at the secondary location. After the fail-over event occurs, the jobs are updated with the secondary network that is taking the place of the primary location of the service. A mapping of job input parameters (e.g. identifiers and/or secrets) from the primary location to the secondary location is used by the jobs when they are resubmitted to the secondary location. Each job determines what changes are to be made to the job request based on the job being resubmitted.

Type: Application

Filed: June 1, 2011

Publication date: December 6, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Daniel Blood, Alexander Hopmann, Siddharth Rajendra Shah, Viktoriya Taranov, Tarkan Sevilmis, Nikita Voronkov
JOB MIGRATION IN RESPONSE TO LOSS OR DEGRADATION OF A SEMI-REDUNDANT COMPONENT

Publication number: 20120290874

Abstract: A method of managing the workload in a computer system having one or more semi-redundant hardware components is provided. The method comprises detecting loss or degradation of the level of performance of one or more of the semi-redundant hardware components, identifying hardware components affected by the loss or degradation, migrating a critical job from an affected hardware component to an unaffected hardware component, and performing less-critical jobs on an affected hardware component. Loss or degradation of the semi-redundant component reduces the capacity of affected hardware components in the computer system without entirely disabling the computer system. Jobs identified as critical run on hardware components having the most capacity and reliability, while less-critical jobs use the remaining capacity of affected hardware components. Examples of semi-redundant hardware components include a memory module, CPU core, Ethernet port, power supply, fan, disk drive, and an input output port.

Type: Application

Filed: July 3, 2012

Publication date: November 15, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Fred A. Bower, III, Scott A. Piper, Gregory B. Pruett
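
The placement policy in the job-migration abstract above (publication 20120290874) can be illustrated with a short sketch: critical jobs leave affected hardware, less-critical jobs stay and use the remaining capacity. The `Job` record, host names, and round-robin choice of target are assumptions made for the example; the abstract does not specify how the unaffected component is chosen.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class Job:
    name: str
    critical: bool
    host: str


def rebalance(jobs: List[Job], affected: Set[str], hosts: List[str]) -> Dict[str, str]:
    """Migrate critical jobs off affected hosts; leave less-critical jobs in place.

    Returns a mapping of job name -> new host for every job that moved.
    """
    unaffected = [h for h in hosts if h not in affected]
    moves: Dict[str, str] = {}
    if not unaffected:
        return moves  # nowhere safer to go; everything keeps running degraded
    next_target = 0
    for job in jobs:
        if job.critical and job.host in affected:
            # Round-robin over unaffected hosts (a hypothetical policy).
            job.host = unaffected[next_target % len(unaffected)]
            next_target += 1
            moves[job.name] = job.host
    return moves
```
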
READ XF INSTRUCTION FOR PROCESSING VECTORS

Publication number: 20120284560

Abstract: The described embodiments include a processor that handles faults. The processor first receives a first input vector, a control vector, and a predicate vector, each vector comprising a plurality of elements. For each element in the first input vector for which a corresponding element in the control vector and the predicate vector are active, the processor then performs a read operation using an address from the element of the first input vector. When a fault condition is encountered while performing the read operation, the processor determines if the element is a first element where a corresponding element of the control vector is active. If so, the processor handles/processes the fault. Otherwise, the processor masks the fault for the element.

Type: Application

Filed: July 18, 2012

Publication date: November 8, 2012

Applicant: APPLE INC.

Inventor: Jeffry E. Gonion
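
The fault rule in the Read XF abstract above (publication 20120284560) can be modeled in software as follows. This is a behavioral sketch only: memory is a dictionary, a missing address stands in for a fault condition, and the `masked` flags and the choice to keep reading after a masked fault are assumptions not stated in the abstract.

```python
from typing import Dict, List, Optional, Sequence, Tuple


class ReadFault(Exception):
    """Raised when the fault must be handled rather than masked."""


def read_xf(
    addresses: Sequence[int],
    control: Sequence[bool],
    predicate: Sequence[bool],
    memory: Dict[int, int],
) -> Tuple[List[Optional[int]], List[bool]]:
    """Read each element whose control and predicate bits are both active.

    A fault on the first element with an active control bit is raised;
    a fault on any other element is masked.
    """
    result: List[Optional[int]] = [None] * len(addresses)
    masked = [False] * len(addresses)
    # Index of the first element whose control bit is active, if any.
    first_active = next((i for i, c in enumerate(control) if c), None)
    for i, addr in enumerate(addresses):
        if not (control[i] and predicate[i]):
            continue
        if addr in memory:
            result[i] = memory[addr]
        elif i == first_active:
            raise ReadFault(f"fault at element {i}, address {addr:#x}")
        else:
            masked[i] = True  # fault suppressed for this element
    return result, masked
```
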
BYZANTINE FAULT TOLERANT DYNAMIC QUORUM USING A TRUSTED PLATFORM MODULE

Publication number: 20120254655

Abstract: A method implemented in a computer infrastructure having computer executable code tangibly embodied on a computer readable medium. The computer executable code is operable to dynamically adjust quorum requirements for a voting set V of a server cluster, including a plurality of servers, to ensure that a response of the server cluster to a client request remains Byzantine fault tolerant when at least one of: a failed server of the server cluster is replaced with at least one new server, such that a total set S of servers that have ever been members of the server cluster is increased, and an existing server is removed from the voting set V.

Type: Application

Filed: March 29, 2012

Publication date: October 4, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Henry E. BUTTERWORTH
PARALLEL COMPUTER SYSTEM, CONTROL DEVICE, AND CONTROLLING METHOD

Publication number: 20120246512

Abstract: The control device detects a failed node in which a failure has occurred from a plurality of computation nodes included in a plurality of computation units included in the parallel computer. The control device chooses execution nodes for executing the program from the computation nodes of the parallel computer except the detected failed nodes based on the number of computation nodes needed to execute the program. The control device selects paths to connect the computation nodes from a plurality of links each connecting two computation units adjacent to each other through a plurality of paths configured to connect computation nodes included in two computation units adjacent to each other in a one-to-one manner included in the links connecting two computation units adjacent to each other in the plurality of computation units including the chosen execution nodes except the path connected to the detected failed node.

Type: Application

Filed: January 18, 2012

Publication date: September 27, 2012

Applicant: FUJITSU LIMITED

Inventor: Hidetoshi IWASHITA
Method and Device for Link Protection in Virtual Private Local Area Network

Publication number: 20120239965

Abstract: The present invention discloses a method and device for link protection in a virtual private local area network, which relates to network data communication technology. The method of the present invention includes: in a networking process of a VPLS network, a link protection device establishing a main tunnel and a standby tunnel of MPLS TE for a link, and creating a VPLS forwarding table to deal with the information of the established MPLS TE main tunnel and standby tunnel; and when receiving a VPLS message, the link protection device searching the information of the MPLS TE main tunnel of the VPLS message according to a way of accessing the VPLS network of the VPLS message and the VPLS forwarding table, and if the found MPLS TE main tunnel is invalid, then transmitting the received VPLS message by adopting the standby tunnel of the MPLS TE main tunnel.

Type: Application

Filed: September 25, 2010

Publication date: September 20, 2012

Inventors: Shihua Wu, Shimeng Chen, Guangping Niu
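
The lookup-then-fallback step in the VPLS link-protection abstract above (publication 20120239965) reduces to a small table lookup. The sketch below assumes a forwarding table keyed by a hypothetical access identifier; the `Tunnel` and `ForwardingEntry` types and the `valid` flag are illustrative and not taken from the application.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Tunnel:
    name: str
    valid: bool = True


@dataclass
class ForwardingEntry:
    main: Tunnel     # MPLS TE main tunnel for the link
    standby: Tunnel  # MPLS TE standby tunnel protecting the main tunnel


def select_tunnel(table: Dict[str, ForwardingEntry], access_key: str) -> Tunnel:
    """Pick the tunnel for a received VPLS message.

    The entry is found from how the message accessed the VPLS network
    (modeled here as a string key); the standby tunnel is used only when
    the main tunnel is invalid.
    """
    entry = table[access_key]
    return entry.main if entry.main.valid else entry.standby
```
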
Transmitting network information using link or port aggregation protocols

Publication number: 20120233492

Abstract: In one embodiment, a method includes receiving at a network device, a packet from a component in a virtual network device, the packet transmitted across a link aggregation bundle connecting the virtual network device to the network device and indicating if the component is a master component in the virtual network device, and determining if an error exists in operation of the component as the master component or a slave component. An apparatus for assigning services to physical links in an aggregated link bundle is also disclosed.

Type: Application

Filed: March 10, 2011

Publication date: September 13, 2012

Applicant: CISCO TECHNOLOGY, INC.

Inventors: Norman Finn, Samer Salam, Ali Sajassi
Reducing Data Stream Interruption During Failure of a Firewall Device

Publication number: 20120226932

Abstract: A method includes a first firewall device performing a firewall function on a first redundant input data packet and outputting the first redundant input data packet as a first redundant output data packet according to the firewall function. A second firewall device performs only the same firewall function on a second redundant input data packet and outputs the second redundant input data packet as a second redundant output data packet according to the firewall function. The redundant output data packets are at least substantially similar when the firewall devices are functioning properly. A controller receives the redundant output data packets and transmits at a given time one of the redundant output data packets to a target. The controller transmits the first redundant output data packet to the target while the second device is failed.

Type: Application

Filed: May 8, 2012

Publication date: September 6, 2012

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Thomas H. Adams, Michael Raftelis
Migrating Virtual Machines Among Networked Servers Upon Detection Of Degrading Network Link Operation

Publication number: 20120221887

Abstract: Migrating virtual machines among networked servers, the servers coupled for data communications with a data communications network that includes a networking device, where migrating includes: establishing, by a virtual machine management module (‘VMMM’), one or more virtual machines on a particular server; querying, by the VMMM, the networking device for link statistics of a link coupling the network device to the particular server for data communications; determining, by the VMMM in dependence upon the link statistics, whether the link coupling the network device to the particular server is degrading; and if the link coupling the network device to the particular server is degrading, migrating a virtual machine executing on the particular server to a destination server. In some embodiments, migrating is carried out only if a non-degrading link is available. If no non-degrading links are available, the network device, rather than the link, may be failing.

Type: Application

Filed: May 3, 2012

Publication date: August 30, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Nils P. Hansson, Bruce A. Smith, Edward S. Suffern, James L. Wooldridge
METHOD OF DYNAMIC ALLOCATION ON A STATICALLY ALLOCATED AND EMBEDDED SOFTWARE ARCHITECTURE

Publication number: 20120210160

Abstract: A method of dynamically allocating a task or a signal on a statically allocated and embedded software architecture of a vehicle includes identifying a faulty component. The faulty component may include a software component, a hardware component or a signal or communications link between components. Once the faulty component is identified, any tasks performed by or signals associated with the faulty component are identified, and the tasks performed by or the signals associated with the faulty component are re-allocated to an embedded standby component so that performance of the re-allocated task and/or signal for future system operations is performed by the standby component.

Type: Application

Filed: February 10, 2011

Publication date: August 16, 2012

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Thomas E. Fuhrman, Sandeep Menon
MOBILE ROUTER NETWORK METHOD

Publication number: 20120204057

Abstract: A method comprises the steps of providing a plurality of mobile routers; providing a main server for tracking and monitoring the plurality of mobile routers; initially configuring each mobile router of the plurality of mobile routers to communicate with the main server; providing a first linked communication between each mobile router and the main server; registering each mobile router with the main server and uploading configuration information from each mobile router to the main server; assigning each mobile router with a predetermined group; subsequent to the registering and assigning steps, providing a second linked communication between each mobile router and the main server; and operating the main server such that when the second linked communication occurs, the main server reassigns each mobile router to communicate with at least one group server assigned to communicate with the predetermined group.

Type: Application

Filed: March 8, 2012

Publication date: August 9, 2012

Inventors: DOUGLAS S. MOELLER, Ronald W. Pashby
HOLISTIC TASK SCHEDULING FOR DISTRIBUTED COMPUTING

Publication number: 20120159236

Abstract: Embodiments of the invention include a method for fault tolerance management of worker nodes during map/reduce computing in a computing cluster. The method includes subdividing a computational problem into a set of sub-problems, mapping a selection of the sub-problems in the set to respective nodes in the cluster, directing processing of the sub-problems in the respective nodes, and collecting results from completion of processing of the sub-problems. During a first early temporal portion of processing the computational problem, failed nodes are detected and the sub-problems currently being processed by the failed nodes are re-processed. Conversely, during a second later temporal portion of processing the computational problem, sub-problems in nodes not yet completely processed are replicated into other nodes, processing of the replicated sub-problems directed, and the results from completion of processing of sub-problems collected.

Type: Application

Filed: February 28, 2012

Publication date: June 21, 2012

Applicant: International Business Machines Corporation

Inventor: David L. Kaminsky
Generation of Standby Images of Applications

Publication number: 20120144232

Abstract: Embodiments that generate checkpoint images of an application for use as warm standby are contemplated. The embodiments may monitor accesses of external references by threads. An external reference may comprise a connection or use of services of an entity that is external to the set of processes that constitute the application, to which a process of the application attempts to connect by means of a socket or inter-process communication (IPC). Various embodiments comprise two or more computing devices, such as two or more servers. One of the computing devices may generate a checkpoint image of an application at a suitable point in time during initialization, when the state of the application is not yet dependent on interactions with external references. The second computing device may preload the checkpoint image for the application and activate the checkpoint image when needed, following the specific resource management rules of the distributed subsystem.

Type: Application

Filed: December 3, 2010

Publication date: June 7, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Douglas J. Griffith, Angela A. Jaehde, Manjunath B. Muttur
Systems and Methods of Providing Fast Leader Elections in Distributed Systems of Simple Topologies

Publication number: 20120124412

Abstract: Systems and computer-implemented methods of electing a new leader node in distributed systems of simple topologies connecting a plurality of nodes on at least one computer system. The computer-implemented method comprises several steps including at least one node, which detected the absence of a leader, starting a first round for its approval as an Approved Election Initiator. If a quorum accepts the StartElection request during the first round, then the Election Initiator starts a second round to set the leader. If a quorum of all nodes has not been reached during the first round, then the first round fails. The method repeats until a leader is set and is repeated each time a node discovers that the network does not have an active leader. Also provided herein is a computer readable medium having computer executable instructions stored thereon for performing the computer-implemented method.

Type: Application

Filed: November 15, 2010

Publication date: May 17, 2012

Applicant: Microsoft Corporation

Inventor: Konstantin Surkov
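
The two-round election in the abstract above can be illustrated with a minimal Python sketch. It is not the patented method: nodes are modeled as in-memory objects, a quorum is assumed to be a strict majority of all nodes, the initiator is assumed to set itself as leader in the second round, and the names Node, accept_start_election, set_leader, and run_election are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    node_id: int
    reachable: bool = True                    # assumption: unreachable nodes never answer
    approved_initiator: Optional[int] = None
    leader: Optional[int] = None

    def accept_start_election(self, initiator_id: int) -> bool:
        # Assumption: a node approves the first initiator it hears from.
        if not self.reachable:
            return False
        if self.approved_initiator in (None, initiator_id):
            self.approved_initiator = initiator_id
            return True
        return False

    def set_leader(self, leader_id: int) -> None:
        if self.reachable:
            self.leader = leader_id
            self.approved_initiator = None


def run_election(initiator: Node, nodes: List[Node]) -> bool:
    """Return True if the initiator's two rounds set a leader."""
    quorum = len(nodes) // 2 + 1              # assumption: strict majority of all nodes
    # Round 1: ask every node to approve this initiator.
    votes = sum(n.accept_start_election(initiator.node_id) for n in nodes)
    if votes < quorum:
        return False                          # first round fails; the caller may retry
    # Round 2: the approved initiator sets the leader (assumed here to be itself).
    for n in nodes:
        n.set_leader(initiator.node_id)
    return True
```

A caller would invoke run_election again whenever a node notices that no leader is active. A fuller version would also expire stale approvals left behind by a failed first round.
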
JOB MIGRATION IN RESPONSE TO LOSS OR DEGRADATION OF A SEMI-REDUNDANT COMPONENT

Publication number: 20120072765

Abstract: A computer program product and method of managing the workload in a computer system having one or more semi-redundant hardware components are provided. The method comprises detecting loss or degradation of the level of performance of one or more of the semi-redundant hardware components, identifying hardware components that are affected by the loss or degradation of the one or more semi-redundant components, migrating a critical job from an affected hardware component to an unaffected hardware component, and performing less-critical jobs on an affected hardware component. Loss or degradation of the semi-redundant component reduces the capacity of affected hardware components in the computer system without entirely disabling the computer system. Jobs identified as being critical are run on hardware components having the most capacity and reliability, while allowing less-critical jobs to make use of the remaining capacity of affected hardware components.

Type: Application

Filed: September 20, 2010

Publication date: March 22, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Fred A. Bower, III, Scott A. Piper, Gregory B. Pruett
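
As a rough illustration of the migration policy in the abstract above, the following Python sketch moves critical jobs off affected components and leaves less-critical jobs where they are. The Job type, the rebalance function, and the rule of choosing the unaffected component with the fewest jobs are assumptions made for the example; the abstract does not say how a target is chosen.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class Job:
    name: str
    critical: bool


def rebalance(placement: Dict[str, List[Job]], affected: Set[str]) -> Dict[str, List[Job]]:
    """Move critical jobs off affected components; keep less-critical jobs in place."""
    unaffected = [c for c in placement if c not in affected]
    if not unaffected:
        return placement                      # nowhere to migrate to
    result = {c: list(jobs) for c, jobs in placement.items()}
    for comp in affected:
        for job in list(result.get(comp, [])):
            if job.critical:
                # Assumption: pick the unaffected component with the fewest jobs.
                target = min(unaffected, key=lambda c: len(result[c]))
                result[comp].remove(job)
                result[target].append(job)
    return result
```
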
REMOTE MONITORING APPARATUS, WIND TURBINE GENERATOR SYSTEM, AND METHOD OF CONTROLLING REMOTE MONITORING APPARATUS

Publication number: 20120066544

Abstract: A SCADA system includes a main switching hub and a backup switching hub that relay transmission data between a wind turbine generator and terminals provided in another SCADA system and client terminals, and a network switch for performing switching between the main switching hub and the backup switching hub for relaying transmission data between the wind turbine generator and the terminals. A backup remote I/O connected to the backup switching hub causes the network switch to perform switching based on a switching command from a SCADA terminal input via the backup switching hub. This serves to solve a data transmission problem caused by a problem in a switching hub on the wind turbine generator side from a remote location.

Type: Application

Filed: September 14, 2011

Publication date: March 15, 2012

Applicant: MITSUBISHI HEAVY INDUSTRIES, LTD.

Inventor: Hidekazu Ichinose
CONTROLLED AUTOMATIC HEALING OF DATA-CENTER SERVICES

Publication number: 20120066541

Abstract: Subject matter described herein is directed to reallocating an application component from a faulty data-center resource to a non-faulty data-center resource. Background monitors identify data-center resources that are faulty and schedule migration of application components from the faulty data-center resources to non-faulty data-center resources. Migration is carried out in an automatic manner that allows an application to remain available. Thresholds are in place to control a rate of migration, as well as to detect when resource failure might be resulting from data-center-wide processes or from an application failure.

Type: Application

Filed: September 10, 2010

Publication date: March 15, 2012

Applicant: MICROSOFT CORPORATION

Inventors: PAVEL A. DOURNOV, HAIZHI XU
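
The role of the thresholds in the abstract above might be sketched as follows in Python. The function name select_migrations and both threshold parameters are hypothetical, and their default values are arbitrary; the abstract states only that thresholds limit the migration rate and help detect systemic failures.

```python
from typing import List, Set


def select_migrations(
    faulty: Set[str],
    total_resources: int,
    max_per_cycle: int = 2,            # assumption: rate threshold per monitoring cycle
    widespread_fraction: float = 0.5,  # assumption: share of faulty resources treated as systemic
) -> List[str]:
    """Return the faulty resources whose components should migrate this cycle."""
    if total_resources <= 0:
        return []
    if len(faulty) / total_resources >= widespread_fraction:
        # Too many failures at once: suspect a data-center-wide process or an
        # application failure, so hold off instead of migrating everything.
        return []
    return sorted(faulty)[:max_per_cycle]
```
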
Non-Disruptive Hardware Change

Publication number: 20120054544

Abstract: A method, system, and computer program product for changing hardware in a data processing system without disrupting processes executing on the data processing system. A hardware change to a selected portion of hardware in the data processing system may be required, such as to repair hardware errors or to implement a system update. Responsive to a determination that a hardware change to the selected portion of the hardware is required, a process being performed by the selected portion is moved from the selected portion of the hardware to an alternate portion of the hardware. The hardware change is applied to the selected portion of the hardware. The selected portion of the hardware is returned for use by the data processing system after the hardware change is applied.

Type: Application

Filed: August 24, 2010

Publication date: March 1, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael S. Floyd, Ryan J. Pennington, Harmony L. Prince, Kevin F. Reick, David D. Sanner
Method and Apparatus for Calendaring Reminders

Publication number: 20120030505

Abstract: An electronic calendar includes such features as recurring reminders, dividing unpredictable work loads into equal pieces, template free parsing, a reminders scheduling algorithm to reduce spikes, dynamic delivery and recovery algorithms, methods for splitting the work load between controllers and workers and for monitoring progress, all within the context of a calendar architecture for a large enterprise.

Type: Application

Filed: October 6, 2011

Publication date: February 2, 2012

Inventors: Rupesh CHHATRAPATI, Richard J. Treitel, Jon Chester, David Sobotka, Louis Lao, Andrew Zaeske, Meera Vengadasubbu
HIGH RELIABILITY COMPUTER SYSTEM AND ITS CONFIGURATION METHOD

Publication number: 20120030504

Abstract: [Object] To increase the speed of copy processing from an online computer 101 to a standby computer 102. [Solution] When copying copy target information from the online computer 101 to the standby computer 102, a status copy processing unit 110: recognizes, as a synchronous point, a point in time when execution of a first application (AP #1), from among applications 108, is completed; extracts only information stored in a use area 401 of an OS 106 and a use area 403 of a second application (AP #2) from a storage area 400 of a memory at this synchronous point; and transfers the extracted information, as the copy target information necessary to continue the processing, from the online computer 101 to the standby computer 102. [Selected Drawing] FIG.

Type: Application

Filed: November 5, 2009

Publication date: February 2, 2012

Inventors: Hiroyasu Nishiyama, Tomoya Ohta, Daisuke Yokota, Ken Nomura, Toshiaki Arai
System and method for provisioning and running a cross-cloud test grid

Publication number: 20120017112

Abstract: An automated method for provisioning a grid used to run a load test on a target website includes sending one or more requests in a multi-threaded manner to at least one cloud provider, the one or more requests for an allocation of N load server instances and M result server instances which comprise the grid. Requests received back from the cloud provider are also handled in a multi-threaded manner; any errors occurring during the allocation being corrected automatically. The N load server instances and the M result server instances are then verified to be operational and correctly running software deployed to provide defined test services. Errors identified during the verification are automatically corrected either by attempting to restart a failed instance or allocating a different instance.

Type: Application

Filed: July 19, 2010

Publication date: January 19, 2012

Applicant: Power Integrations, Inc.

Inventors: Tal Broda, Matthew Solnit, Kenneth C. Gardner, Craig R. Powers, Michael Hemmert, Charles A. Vazac, Kendall Cosby
INTRA-REALM AAA FALLBACK MECHANISM

Publication number: 20120005523

Abstract: There is provided an intra-realm AAA (authentication, authorization and accounting) fallback mechanism, wherein the single global realm may be divided in one or more sub-realms. The thus presented mechanism exemplarily comprises detecting a failure of an authentication server serving at least one authentication client within a first sub-realm of a single-realm authentication system, and routing authentication messages of the at least one authentication client to a fallback authentication server within a second sub-realm of the single-realm authentication system, wherein routing may exemplarily comprise sub-realm based source routing.

Type: Application

Filed: April 21, 2009

Publication date: January 5, 2012

Applicant: NOKIA SIEMENS NETWORKS OY

Inventors: Jouni Korhonen, Hannes Tschofenig
FAULT TOLERANCE FOR MAP/REDUCE COMPUTING

Publication number: 20120005522

Abstract: Embodiments of the invention include a method for fault tolerance management of worker nodes during map/reduce computing in a computing cluster. The method includes subdividing a computational problem into a set of sub-problems, mapping a selection of the sub-problems in the set to respective nodes in the cluster, directing processing of the sub-problems in the respective nodes, and collecting results from completion of processing of the sub-problems. During a first early temporal portion of processing the computational problem, failed nodes are detected and the sub-problems currently being processed by the failed nodes are re-processed. Conversely, during a second later temporal portion of processing the computational problem, sub-problems in nodes not yet completely processed are replicated into other nodes, processing of the replicated sub-problems directed, and the results from completion of processing of sub-problems collected.

Type: Application

Filed: June 30, 2010

Publication date: January 5, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: David L. Kaminski
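
The two-phase policy in the abstract above (re-process on detected failure early on, replicate unfinished work later) can be sketched in Python. The function plan_actions, the progress fraction, and the 0.8 boundary between the early and late portions are assumptions for illustration; the abstract does not say how the two portions are delimited.

```python
from typing import Dict, List, Set, Tuple


def plan_actions(
    running: Dict[str, str],      # node -> sub-problem currently assigned
    failed: Set[str],             # nodes detected as failed
    spare_nodes: List[str],       # nodes free to take work
    fraction_done: float,         # overall progress in [0, 1]
    late_threshold: float = 0.8,  # assumption: boundary between the two phases
) -> List[Tuple[str, str, str]]:
    """Return (action, sub_problem, target_node) tuples."""
    spares = list(spare_nodes)
    actions: List[Tuple[str, str, str]] = []
    if fraction_done < late_threshold:
        # Early phase: react only to detected failures by re-processing.
        candidates = [(n, "reprocess") for n in sorted(running) if n in failed]
    else:
        # Late phase: replicate every unfinished sub-problem onto another node.
        candidates = [(n, "replicate") for n in sorted(running)]
    for node, action in candidates:
        if not spares:
            break
        actions.append((action, running[node], spares.pop(0)))
    return actions
```

Replicating late in the run trades extra work for a shorter tail, since a failure near the end would otherwise force a full re-run of the affected sub-problem.
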
Communication system, communication unit, control unit, and controlling method

Publication number: 20110296232

Abstract: Whether or not a signal which is transmitted from a current operation side node to a stand-by side node has been disconnected is determined on a communication route for the signal. When the signal is determined to have been disconnected, the network route is switched to the network route used when a client unit and the stand-by side node perform data communication. With this, the switching of the network route when processing having been performed by a node of a system is performed by another node in place of the node can be made quickly.

Type: Application

Filed: August 8, 2011

Publication date: December 1, 2011

Applicant: NEC CORPORATION

Inventor: Tetsu Izawa
CLUSTER-FREE TECHNIQUES FOR ENABLING A DIRECTORY PROTOCOL-BASED DOMAIN NAME SYSTEM (DNS) SERVICE FOR HIGH AVAILABILITY

Publication number: 20110271139

Abstract: Cluster-free techniques for enabling a directory protocol-based Domain Name System (DNS) service for high availability are presented. A DNS service monitors a node for a wild-carded IP address that migrates to the node when a primary node fails to service DNS requests for a directory of the network. The DNS service forwards the wild-carded IP address to a distributed directory service for resolution and uses the distributed directory service to dynamically configure the DNS service for directly handling subsequent DNS requests made to the directory over the network while the primary node remains inoperable over the network.

Type: Application

Filed: July 14, 2011

Publication date: November 3, 2011

Inventor: Madhavi R. Phanse
CONFIGURING A SYSTEM TO COLLECT AND AGGREGATE DATASETS

Publication number: 20110246816

Abstract: Methods for configuring a system to collect and aggregate datasets are disclosed. One embodiment includes identifying a data source in the system from which a dataset is to be collected, configuring a machine in the system that generates the dataset to be collected to send the dataset to the data source, identifying an arrival location where the collected dataset is to be aggregated or written, and/or configuring an agent node by specifying a source for the agent node as the data source in the system and specifying a sink for the agent node as the arrival location.

Type: Application

Filed: September 8, 2010

Publication date: October 6, 2011

Applicant: Cloudera, Inc.

Inventors: Jonathan Ming-Cyn Hsieh, Henry Noel Robinson
REPURPOSABLE RECOVERY ENVIRONMENT

Publication number: 20110246813

Abstract: A reconfiguration manager is operable to reconfigure a repurposable recovery environment between a recovery environment for a production environment and a second environment different from the recovery environment. A storage system in the repurposable recovery environment periodically saves production information from the production environment while the repurposable recovery environment is operating as the second environment. The production information in the storage system is used to reconfigure the repurposable recovery environment from the second environment to the recovery environment.

Type: Application

Filed: April 1, 2010

Publication date: October 6, 2011

Applicant: ACCENTURE GLOBAL SERVICES GmbH

Inventors: Ryan COLLIER, Sandro LORENZOTTI, Gaurang PATEL, Jay CORN
MANAGEMENT APPARATUS, MANAGEMENT METHOD, AND PROGRAM

Publication number: 20110239038

Abstract: When a fault occurs in a guest machine 2b of a virtual machine system 100b, a stop control part 5a of a guest machine 2a of a virtual machine system 100a requests a stop control part 4b of a host machine 1b to stop operation of the guest machine 2b. If the guest machine 2b does not stop operation normally, the stop control part 5a requests a BMC 8b to stop operation of the host machine 1b. The BMC 8b stops the host machine 1b, so that the machine where the fault occurs can be stopped.

Type: Application

Filed: January 6, 2009

Publication date: September 29, 2011

Applicant: MITSUBISHI ELECTRIC CORPORATION

Inventor: Takayuki Ito
APPARATUS TO MAINTAIN REDUNDANCY IN FOUR LINE SYSTEM

Publication number: 20110225452

Abstract: Exemplary embodiments adapted to distribute power from four input lines to a plurality of power supply units (PSUs) configured in an N+1 architecture are provided. In one such embodiment, a plurality of rectifier devices have first and second ends, each of the plurality of rectifier devices connected at the first end to one of the four input lines, and adapted to be bypassed by a first relay in a first operating mode and provide rectified input current in a second operating mode. A plurality of second relays is connected between each of the second ends of the plurality of rectifier devices. The plurality of second relays are adapted to be closed in the second operating mode to sum the rectified input current from each of the plurality of rectifier devices in a single node connecting each of the plurality of PSUs.

Type: Application

Filed: March 10, 2010

Publication date: September 15, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Jiwu DUAN, Steven M. GROFF, Trung LE
System And Method for Handling a Failover Event

Publication number: 20110191627

Abstract: A system comprises a memory storing a set of instructions executable by a processor. The instructions are operable to monitor progress of an application executing in a first operating system (OS) instance, the progress occurring on data stored within a shared memory area, detect a failover event in the application, and copy, upon the detection of the failover event, the data from the shared memory area to a fail memory area of a second instance of the OS, the fail memory area being an area of memory mapped for receiving data from another instance of the OS only if the application executing on the other instance experiences a failover event.

Type: Application

Filed: January 29, 2010

Publication date: August 4, 2011

Inventors: Maarten Koning, Felix Burton, Matt Sherer
COMPUTER SYSTEM AND BOOT CONTROL METHOD

Publication number: 20110191622

Abstract: When a primary computer is taken over to a secondary computer in a redundancy configuration computer system where booting is performed via a storage area network (SAN), a management server delivers an information collecting/setting program to the secondary computer before the user's operating system of the secondary computer is started. This program assigns a unique ID (World Wide Name), assigned to the fibre channel port of the primary computer, to the fibre channel port of the secondary computer to allow a software image to be taken over from the primary computer to the secondary computer.

Type: Application

Filed: April 14, 2011

Publication date: August 4, 2011

Applicant: HITACHI, LTD.

Inventors: Tetsuhiro GOTO, Kazuhiro ADACHI
PROCESS FOR SECURE BACKSPACING TO A FIRST DATA CENTER AFTER FAILOVER THROUGH A SECOND DATA CENTER AND A NETWORK ARCHITECTURE WORKING ACCORDINGLY

Publication number: 20110179305

Abstract: A method (100) is proposed for safely switching back to a first computer center (DC1) following failover by a second computer center (DC2), where at least one transaction terminal is connected as a client (CLA) to the second computer center (DC2) during the downtime of the first computer center (DC1) and at the end of the downtime is switched back again to the first computer center (DC1), where the following steps are performed: the second computer center (DC2) is shown that the first computer center (DC1) is operational again (step 110); the client (CLA) is prompted to go out of service (step 120); synchronization (STD) of configuration and/or transaction data between the first and second computer centers is performed (step 130); at the conclusion of synchronization (STD), the client (CLA) is prompted to switch back to the first computer center (DC1) (step 140).

Type: Application

Filed: January 20, 2011

Publication date: July 21, 2011

Applicant: WINCOR NIXDORF INTERNATIONAL GMBH

Inventors: Nico LAMMEL, Helmut FINKE
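
Steps 110 to 140 of the abstract above can be walked through with a small Python sketch. The DataCenter and Client types and the switch_back function are hypothetical. Synchronization is simplified to copying the transactions that DC1 lacks, and returning the client to service after the switch is an assumption the abstract does not state.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataCenter:
    name: str
    operational: bool = True
    transactions: List[str] = field(default_factory=list)


@dataclass
class Client:
    connected_to: str
    in_service: bool = True


def switch_back(dc1: DataCenter, dc2: DataCenter, client: Client) -> List[int]:
    """Run steps 110-140 in order; return the step numbers completed."""
    done: List[int] = []
    if not dc1.operational:
        return done                      # step 110 cannot be signalled yet
    done.append(110)                     # DC2 learns that DC1 is operational again
    client.in_service = False            # step 120: client goes out of service
    done.append(120)
    # Step 130: synchronize transaction data (assumption: copy what DC1 lacks).
    for tx in dc2.transactions:
        if tx not in dc1.transactions:
            dc1.transactions.append(tx)
    done.append(130)
    client.connected_to = dc1.name       # step 140: client switches back to DC1
    client.in_service = True             # assumption: client resumes service afterwards
    done.append(140)
    return done
```

Taking the client out of service before synchronizing means no new transactions reach DC2 while its data is being copied, which is what makes the switch-back safe.
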
HIGH AVAILABILITY FOR NETWORK SECURITY DEVICES

Publication number: 20110173490

Abstract: In one example, a backup intrusion detection and prevention (IDP) device includes one or more network interfaces to receive a state update message from a primary IDP device, wherein the state update message indicates a network session being inspected by the primary IDP device and an identified application-layer protocol for the device, to receive an indication that the primary device has switched over or failed over to the backup device, and to receive a plurality of packets of the network session after receiving the indication, each of the plurality of packets comprising a respective payload including application-layer data, a protocol decoder to detect a beginning of a new transaction from the application-layer data of one of the plurality of packets, and a control unit to statefully process only the application-layer data of the network session that include and follow the beginning of the new transaction.

Type: Application

Filed: January 8, 2010

Publication date: July 14, 2011

Applicant: Juniper Networks, Inc.

Inventors: Krishna Narayanaswamy, Rajiv Ranjan
Systems and Methods for a Communication Protocol Between a Local Controller and a Master Controller

Publication number: 20110161722

Abstract: Systems and methods for local management units in a photovoltaic energy system. In one embodiment, a method implemented in a computer system includes: attempting to communicate on a first active channel with a master management unit from a local management unit that controls a solar module; if communication with the master management unit on the first active channel has not been established, attempting to communicate on a second active channel with the master management unit.

Type: Application

Filed: September 30, 2010

Publication date: June 30, 2011

Applicant: TIGO ENERGY

Inventors: Maxym Makhota, Daniel Eizips, Shmuel Arditi, Ron Hadar
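
The channel fallback in the abstract above reduces to trying candidate channels in order until the master management unit answers. A minimal Python sketch, in which find_master_channel and the try_channel callback are hypothetical stand-ins for the real radio or power-line communication layer:

```python
from typing import Callable, Iterable, Optional


def find_master_channel(
    channels: Iterable[int],
    try_channel: Callable[[int], bool],
) -> Optional[int]:
    """Try each candidate channel in order; return the first that reaches the master."""
    for ch in channels:
        if try_channel(ch):
            return ch
    return None                          # no channel established communication
```
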
