Fault Recovery Patents (Class 714/2)
  • Publication number: 20140215257
    Abstract: A method and a system to provide daisy chain distribution in data centers are provided. A node identification module identifies three or more data nodes of a plurality of data nodes. The identification of three or more data nodes indicates that the respective data nodes are to receive a copy of a data file. A connection creation module to, using one or more processors, create communication connections between the three or more data nodes. The communication connections form a daisy chain beginning at a seeder data node of the three or more data nodes and ending at a terminal data node of the three or more data nodes.
    Type: Application
    Filed: January 30, 2013
    Publication date: July 31, 2014
    Applicant: eBay Inc
    Inventors: Aswani Kumar Raveendran Nair, Sergiy Pereshyvaylo, Bin Yu, Sudheendra Vasudeva Murthy
  • Publication number: 20140215256
    Abstract: A distributed computer system includes components. The components include embedded computer processors that make up an application within the distributed computer system. The computer processors are accessible by an end user of the system. The computer processors are operable to communicate with a plurality of system analyzers, to generate an operational status of the application in the system based on the communication with the plurality of system analyzers, to generate one or more recommendations to address or troubleshoot a non-desired operational status of the application within the system, and to provide a unified interface to the end user that provides to the end user the one or more recommendations to address or troubleshoot the non-desired operational status of the application within the system.
    Type: Application
    Filed: January 28, 2013
    Publication date: July 31, 2014
    Applicant: CA, Inc.
    Inventors: Ramanjaneyulu Malisetti, Venkata Ramana Koneti, Nagamohan Koganti
  • Patent number: 8793449
    Abstract: A storage server receives a write request from a client system including new data and a location to store the new data. The storage server transmits a copy instruction to a storage subsystem to relocate old data at the location and transmits a write instruction to the storage subsystem to overwrite the old data with the new data. The storage subsystem includes fast stable storage in which the copy instruction and the write instruction are stored. After receiving each instruction, the storage subsystem sends an acknowledgement to the storage server. When both instructions have been acknowledged, the storage server sends an acknowledgement to the client system. The storage subsystem performs the instructions asynchronously from the client system's write request.
    Type: Grant
    Filed: October 15, 2010
    Date of Patent: July 29, 2014
    Assignee: NetApp, Inc.
    Inventor: Jeffrey S. Kimmel
  • Publication number: 20140208151
    Abstract: An electronic system includes circuitry to detect errors in logic state in the system and to initiate corrective action when one or more errors are detected. In some embodiments, redundant information is stored within a system that is associated with an operational state of the system. If the operational state of the system is subsequently corrupted as a result of an electrical or mechanical overstress condition, resulting errors may be detected by comparing or otherwise processing the stored operational state information and the redundant information.
    Type: Application
    Filed: January 23, 2013
    Publication date: July 24, 2014
    Inventor: Devon Fernandez
  • Publication number: 20140208150
    Abstract: Approaches for problem determination and resolution process cross communication are provided. Embodiments provide cross communication of a problem determination and resolution among similar data center devices. Specifically, symptoms of an error condition encountered for one data center device are captured by a first enterprise group, along with an associated resolution solution, and made available to an another enterprise group managing a commonly configured data center device, which may be faced with a similar error condition. The error signature and resolution steps captured by the first enterprise group are subsequently made available within and across multiple management domains operating within a common model (e.g., a publication-subscription system). Within this model, both the originator of the error determination and resolution (i.e., publisher), and one or more commonly configured data center devices susceptible to the same error condition (i.e.
    Type: Application
    Filed: January 22, 2013
    Publication date: July 24, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Tamer E. Abuelsaad, Gregory J. Boss, Clifford A. Pickover, Randy A. Rendahl
  • Patent number: 8788579
    Abstract: An application instance identifier is employed with various systems and methods in order to provide a requestor with continuous access to a resource when operating in a client clustered environment. A requestor residing on a first client may attempt to access a resource. The first client sends a request to access the resource. The request may be associated with an application instance identifier that identifies the requestor. At some point, the first client fails and the requestor is associated with a second client via a failover mechanism. The second client sends a second request to access the resource on behalf of the requestor. The second request is associated with the requestor's application instance identifier. The application instance identifier is used to identify the second request as belonging to the same requestor as the first request, thereby granting the second request to access the resource while avoiding a conflict situation.
    Type: Grant
    Filed: September 9, 2011
    Date of Patent: July 22, 2014
    Assignee: Microsoft Corporation
    Inventors: David M. Kruse, Diaa Fathalla, James T. Pinkerton, Mathew George, Prashanth Prahalad, Thomas E. Jolly
  • Patent number: 8788872
    Abstract: Managing failover operations on a cluster of computers, including: identifying, by a failover hold module, a failure to access data storage in the cluster of computers; preventing the execution of all read operations directed to the data storage that were received after the failure to access data storage was identified; executing all write operations directed to the data storage that were received after the failure to access data storage was identified, including writing data to a cache; identifying that a failover to alternative data storage is complete; executing the held read operations, including reading data from the alternative data storage; and copying, from cache to the alternative data storage, the data written to the cache as part of the write operations.
    Type: Grant
    Filed: November 29, 2012
    Date of Patent: July 22, 2014
    Assignee: International Business Machines Corporation
    Inventors: Travis M. Drucker, Joel C. Dubbels, Thomas J. Eggebraaten, Janice R. Glowacki, Richard J. Stevens, David A. Wall
  • Patent number: 8782342
    Abstract: An information handling system includes a plurality of storage enclosures, a plurality of logical storage units located in each storage enclosure, a controller connected to each of the plurality of logical storage units. The controller is configured to receive data regarding the plurality of logical storage units, and automatically execute an algorithm based on the received logical storage unit data to generate a mirrored storage configuration to be implemented, the mirrored storage configuration defining mirrored pairs of the plurality of logical storage units.
    Type: Grant
    Filed: April 30, 2013
    Date of Patent: July 15, 2014
    Assignee: Dell Products L.P.
    Inventors: William C. Edwards, III, Patrick O. Boyd
  • Patent number: 8782647
    Abstract: Method for issuing and monitoring a remote batch job, method for processing a batch job, and system for processing a remote batch job. The method for issuing and monitoring a remote batch job includes formatting a command to be sent to a remote server to include a sequence identification composed of an issuing server identification and a time stamp, forwarding the command from the issuing server to the remote server for processing, and determining success or failure of the processing of the command at the remote server. When the failure of the processing of the command at the remote server is determined, the method further includes instructing the remote server to retry the command processing.
    Type: Grant
    Filed: March 2, 2012
    Date of Patent: July 15, 2014
    Assignee: International Business Machines Corporation
    Inventor: Tsunehiro Kajita
  • Publication number: 20140195845
    Abstract: In response to a notification of a fault captured in a system, a fault isolator serially analyzes each clock object to determine captured faults associated with the clock object. For each of the clock objects determined to have a captured fault, the fault isolator initiates a repair action for the chip represented by the clock object. The fault isolator concurrently analyzes the non-clock objects to determine captured faults associated with the non-clock objects after analysis of the clock objects. For each of the non-clock objects determined to have a captured fault, the fault isolator initiates a repair action for the chip represented by the non-clock object.
    Type: Application
    Filed: January 4, 2013
    Publication date: July 10, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
  • Patent number: 8775858
    Abstract: Providing heterogeneous recovery in a redundant memory system that includes a memory controller, a plurality of memory channels in communication with the memory controller, an error detection code mechanism configured for detecting a failing memory channel, and an error recovery mechanism. The error recovery mechanism is configured for receiving notification of the failing memory channel, for performing a recovery operation on the failing memory channel while other memory channels are performing normal system operations, for bringing the recovered channel back into operational mode with the other memory channels for store operations, for continuing to mark the recovered channel to guard against stale data, for removing any stale data after the recovery operation is complete, and for removing the mark on the recovered channel to allow the normal system operations with all of the memory channels, the removing based on the removing any stale data being complete.
    Type: Grant
    Filed: March 11, 2013
    Date of Patent: July 8, 2014
    Assignee: International Business Machines Corporation
    Inventors: Kevin C. Gower, Luis A. Lastras-Montano, Patrick J. Meaney, Vesselina K. Papazova, Eldee Stephens
  • Patent number: 8775386
    Abstract: In a DBMS, in response to a request from application programs, a base table update function updates a base table and a recovery start point, and a log recording function records update logs for the base table. Then, at predetermined timing, a backup function outputs the content of the base table to a backup file and copies the recovery start point associated with the base table to a recovery start point. Subsequently, when a user has specified a quiesce point and the base table, a copy generation function generates a target table that is a copy of the base table at the quiesce point by updating the backup file of the specified base table with ones of the update logs from the recovery start point to the quiesce point.
    Type: Grant
    Filed: August 18, 2010
    Date of Patent: July 8, 2014
    Assignee: International Business Machines Corporation
    Inventors: Keisuke Asakura, Ritsuko Boh, Noriaki Kohno
  • Patent number: 8773054
    Abstract: The present invention is directed to a method of estimating a motor rotation angle that does not affect precision of a detected angle of a crank angle sensor, which is an alternative sensor, when abnormality occurs in a resolver and a peripheral circuit, and performing a motor control without failure of an inverter or peripheral device. A vehicle system includes a motor, a resolver detecting a rotor rotation angle of the motor, a motor control circuit controlling the motor based on information on rotor rotation angle and torque command value, an engine connected to the motor through a crankshaft, and a crankshaft sensor detecting revolutions of the crankshaft, in which the motor control circuit estimates rotor rotation angle based on a variation rate of the number of revolutions of the crankshaft when abnormality of the resolver is detected, and performs a weak field control based on estimated rotor rotation angle.
    Type: Grant
    Filed: December 9, 2011
    Date of Patent: July 8, 2014
    Assignee: Hitachi Automotive Systems, Ltd.
    Inventors: Kosei Goto, Yukihiko Oishi
  • Publication number: 20140189417
    Abstract: An apparatus and method are described for performing partial memory mirroring operations. For example, one embodiment of a processor comprises: a processor core for generating a read or write transaction having a system memory address; a home agent identified to service the read or write transaction based on the system memory address; one or more target address decoders (TADs) associated with the home agent to determine whether the system memory address is within a mirrored memory region or a non-mirrored memory region, wherein: if the system memory address is within a mirrored memory region, then the one or more TADs identifying multiple mirrored memory channels for the read or write transaction; and if the system memory address is not within a mirrored memory region, then the one or more TADs identifying a single memory channel for the read or write transaction.
    Type: Application
    Filed: December 28, 2012
    Publication date: July 3, 2014
    Inventors: Herbert H. Hum, Ganesh Kumar, Robert C. Swanson, David Bubien
  • Publication number: 20140189429
    Abstract: Disclosed is an approach for implementing disaster recovery for virtual machines. Consistency groups are implemented for virtual machines, where the consistency group link together two or more VMs. The consistency group includes any set of VMs which need to be managed on a consistent basis in the event of a disaster recovery scenario.
    Type: Application
    Filed: December 27, 2012
    Publication date: July 3, 2014
    Applicant: NUTANIX, INC.
    Inventor: NUTANIX, INC.
  • Patent number: 8768642
    Abstract: The present invention systems and methods facilitate configuration of functional components included in a remotely located integrated circuit die. In one exemplary implementation, a die functional component reconfiguration request process is engaged in wherein a system requests a reconfiguration code from a remote centralized resource. A reconfiguration code production process is executed in which a request for a reconfiguration code and a permission indicator are received, validity of permission indicator is analyzed, and a reconfiguration code is provided if the permission indicator is valid. A die functional component configuration process is performed on the die when an appropriate reconfiguration code is received by the die. The functional component configuration process includes directing alteration of a functional component configuration. Workflow is diverted from disabled functional components to enabled functional components.
    Type: Grant
    Filed: December 18, 2003
    Date of Patent: July 1, 2014
    Assignee: Nvidia Corporation
    Inventors: Michael B. Diamond, John S. Montrym, James M. Van Dyke, Michael B. Nagy, Sean J. Treichler
  • Patent number: 8769334
    Abstract: Techniques for providing instant disaster recovery are disclosed. In one particular exemplary embodiment, the techniques may be realized as a method for providing instant disaster recovery comprising, maintaining, in a data store, data associated with a first host system, wherein the data comprises a first data portion and a second data portion, storing, in the first data portion, a disaster recovery agent, and exposing, to a second host system, the first data portion and the second data portion, wherein the disaster recovery agent is configured to initiate, on the second host system, a disaster recovery process, boot the second host system using the first data portion, and copy, from the data store, the second data portion in accordance with a first copy procedure and a second copy procedure.
    Type: Grant
    Filed: February 25, 2013
    Date of Patent: July 1, 2014
    Assignee: Symantec Corporation
    Inventors: Geeta Gharpure, Taher M. Vohra
  • Patent number: 8769356
    Abstract: A memory device comprises a memory cell array and a bad page map. The memory cell array comprises a plurality of memory cells arranged in pages and columns, wherein the memory cell array is divided into a first memory block and a second memory block each corresponding to an array of the memory cells. The bad page map stores bad page location information indicating whether each of the pages of the first memory block is good or bad. A fail page address of the first memory block is replaced by a pass page address of the second memory block according to the bad page location information.
    Type: Grant
    Filed: August 9, 2012
    Date of Patent: July 1, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hak-Soo Yu, Chul-Woo Park, Uk-Song Kang, Joo-Sun Choi, Hong-Sun Hwang, Jong-Pil Son
  • Patent number: 8767532
    Abstract: In one embodiment, a set of tunnels is determined that traverse a particular link connected to an intermediate node in a network. The intermediate node computes, in a coordinated path computation, paths for tunnels of the set of tunnels that do not include the particular link. The coordinated path computation considers each of the tunnels of the set of tunnels. The intermediate node selects one or more tunnels of the set of tunnels for preemption. The one or more tunnels are selected as tunnels that are reroutable by respective head-end nodes of the one or more other tunnels. Notifications are sent to one or more other intermediate nodes that inform the one or more other intermediate nodes of the one or more tunnels selected for preemption.
    Type: Grant
    Filed: October 26, 2012
    Date of Patent: July 1, 2014
    Assignee: Cisco Technology, Inc.
    Inventors: Jean-Philippe Vasseur, Francois Le Faucheur, Anna Charny
  • Patent number: 8769535
    Abstract: The present invention is directed to a virtualization system using a solid-state drive for disaster recovery.
    Type: Grant
    Filed: September 24, 2009
    Date of Patent: July 1, 2014
    Assignee: Avaya Inc.
    Inventor: Mahalingam Mani
  • Patent number: 8769335
    Abstract: A computer implemented method for providing homogeneous recovery in a redundant memory system. The method includes receiving a notification that a memory channel has failed, where the memory channel is one of a plurality of memory channels in a memory system. New operations are blocked from starting on the memory channels in response to the notification, and any pending operations on the memory channels are completed in response to the notification. A recovery operation is performed on the memory channels in response to the completing. The new operations are started on at least a first subset of the memory channels in response to the recovery operation completing. The memory system is configured to operate with the first subset of the memory channels.
    Type: Grant
    Filed: March 11, 2013
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Kevin C. Gower, Luis A. Lastras-Montano, Patrick J. Meaney, Vesselina K. Papazova, Eldee Stephens
  • Patent number: 8762801
    Abstract: A system includes a first device, a first storage element, a comparator and a second device. The first device is configured to test memory cells in an array of memory cells to detect defective memory cells. The defective memory cells include a first memory cell and a second memory cell. The first storage element is configured to store a first address of the first memory cell. The comparator is configured to compare a second address of the second memory cell to the first address.
    Type: Grant
    Filed: April 15, 2013
    Date of Patent: June 24, 2014
    Assignee: Marvell International Ltd.
    Inventors: Winston Lee, Albert Wu, Chorng-Lii Liou
  • Patent number: 8762339
    Abstract: Aspects of the present invention provide a solution for implementing disaster recovery for an application. A subscription backup site for the application is selected from among a plurality of sites. The site that is selected will have a virtual space that is able to accommodate the workload requirements for the application. The application is deflated on the site in such as way as to operate in a dormant state. In the dormant state, the application has a small fraction of its required resources allocated to it. Thereafter, when disaster recovery is needed, the application can be inflated on the subscription backup site by allocating resources to enable the application to execute on the subscription backup site.
    Type: Grant
    Filed: November 29, 2010
    Date of Patent: June 24, 2014
    Assignee: International Business Machines Corporation
    Inventors: Erik Bartholomy, Frank J. Giordano, Gordan G. Greenlee
  • Patent number: 8761002
    Abstract: Particular embodiments of the disclosed subject matter provide methods and systems to support a multicast source selection system. In an example embodiment, the system includes a network element in data communication with a network, the network element being operable to: receive a request for withdrawal of a server as a source of a multicast data stream; and propagate information to the network indicating withdrawal of the server as a source of the multicast data stream, the propagation of information by the network element being responsive to the request for withdrawal of the server as a source of the multicast data stream.
    Type: Grant
    Filed: January 22, 2013
    Date of Patent: June 24, 2014
    Assignee: AT&T Intellectual Property I, LP
    Inventors: Donald M. Smith, Kuo-Hui Liu, Gregory Harp, Thomas Bradley Scholl
  • Publication number: 20140173327
    Abstract: Methods, apparatuses, and computer program products for correcting a failure associated with a current firmware image are provided. Embodiments include a firmware selection module detecting the failure associated with the current firmware image stored in firmware memory corresponding to a component of a system. Embodiments also include the firmware selection module selecting from a plurality of backup firmware images, a replacement firmware image based on a status of at least one backup firmware image in response to detecting the failure. Embodiments also include the firmware selection module storing the selected replacement firmware image in the firmware memory.
    Type: Application
    Filed: December 19, 2012
    Publication date: June 19, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: SHIVA R. DASARI, SCOTT N. DUNHAM, EDWARD J. KLODNICKI
  • Publication number: 20140173328
    Abstract: Methods, apparatuses, and computer program products for correcting a failure associated with a current firmware image are provided. Embodiments include a firmware selection module detecting the failure associated with the current firmware image stored in firmware memory corresponding to a component of a system. Embodiments also include the firmware selection module selecting from a plurality of backup firmware images, a replacement firmware image based on a status of at least one backup firmware image in response to detecting the failure. Embodiments also include the firmware selection module storing the selected replacement firmware image in the firmware memory.
    Type: Application
    Filed: December 27, 2012
    Publication date: June 19, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: SHIVA R. DASARI, SCOTT N. DUNHAM, EDWARD J. KLODNICKI
  • Publication number: 20140173330
    Abstract: The invention provides for split brain detection and recovery in a DAS cluster data storage system through a secondary network interconnection, such as a SAS link, directly between the DAS controllers. In the event of a communication failure detected on the secondary network, the DAS controllers initiate communications over the primary network, such as an Ethernet used for clustering and failover operations, to diagnose the nature of the failure, which may include a crash of a data storage node or loss of a secondary network link. Once the nature of the failure has been determined, the DAS controllers continue to serve all I/O from the surviving nodes to honor high availability. When the failure has been remedied, the DAS controllers restore any local cache memory that has become stale and return to regular I/O operations.
    Type: Application
    Filed: December 14, 2012
    Publication date: June 19, 2014
    Applicant: LSI CORPORATION
    Inventors: Sumanesh Samanta, Luca Bert, Sujan Biswas
  • Publication number: 20140173326
    Abstract: Embodiments of the invention relate to supporting transaction data committed to a stable storage. Committed data in the cluster is stored in the persistent cache layer and replicated and stored in the cache layer of one or more secondary nodes. One copy is designated as a master copy and all other copies are designated as replica, with an exclusive write lock assigned to the master and a shared write lock extended to the replica. An acknowledgement of receiving the data is communicated following confirmation that the data has been replicated to each node designated to receive the replica. Managers and a director are provided to support management of the master copy and the replicas within the file system, including invalidation of replicas, fault tolerance associated with failure of a node holding a master copy, recovery from a failed node, recovered of the file system from a power failure, and transferring master and replica copies within the file system.
    Type: Application
    Filed: December 19, 2012
    Publication date: June 19, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Wendy A. Belluomini, Karan Gupta, Dean Hildebrand, Anna S. Povzner, Himabindu Pucha, Renu Tewari
  • Patent number: 8751856
    Abstract: Provided are techniques for determining a recovery time for a resource in a heterogeneous computing environment comprising interdependent resources. A graph for the resource representing all sequence dependencies and all group relations are created. The recovery time may be a cumulative startup time or a cumulative shutdown time of the resource considering interdependencies of the resource to other resources. The recovery time for all support resources having sequence dependencies with the resource is calculated and each node representing the support resources are removed from the graph. Then the recovery time for all member resources left in the graph that have group relations with the resource is calculated per a group type of the resource. The recovery time for the resource is a sum of the recovery time of all support resources, the recovery time of all member resources, and a unit recovery time of the resource.
    Type: Grant
    Filed: October 10, 2011
    Date of Patent: June 10, 2014
    Assignee: International Business Machines Corporation
    Inventors: Thomas Lumpp, David B. Petersen, Wolfgang Schaeberle, Jeurgen Schneider, Isabell Schwertle
  • Publication number: 20140157036
    Abstract: In one embodiment, a test case run analyzer may filter out failure events with known causes from a test report. The test case run analyzer may receive a test report of a test case run of an application process. The test case run analyzer may automatically identify a failure event in the test case run. The test case run analyzer may automatically compare the failure event to a failure pattern set. The test case run analyzer may filter the test report based on the failure pattern set.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 5, 2014
    Applicant: Microsoft Corporation
    Inventors: Thomas Walton, Herman Widjaja, Anish Swaminathan, Andrew Precious, Edwin Bruce Shankle, III, Andrew Campbell, Sean Edmison, Jacob Beaudoin
  • Publication number: 20140157039
    Abstract: A method of detecting uninitialized memory reads is shown where either all or a subset of a random access memory system is initialized to a know value. One or more watch points are implemented where after a memory read is detected the value read is compared to the value written during initialization. If the values match debug information is captured and appropriate corrective action is taken.
    Type: Application
    Filed: December 4, 2012
    Publication date: June 5, 2014
    Applicant: TEXAS INSTRUMENTS INCORPORATED
    Inventor: Jason L. Peck
  • Publication number: 20140157040
    Abstract: A computer-implemented method detects a stability change in a computer system, and identifies a first set of at least one capability of the computer system that is affected by the stability change. In response to detecting the stability change, the method identifies a software application that was installed prior to the stability change, and identifies a second set of at least one capability of the computer system that is utilized by the identified software application. The method compares the first and second capability sets to determine a degree of similarity, and compares the time that the stability change was detected to the time that the identified software application was installed to determine a temporal proximity. The method then identifies the likelihood that the identified software application is the cause of the stability change, wherein the identified likelihood is a function of the degree of similarity and the temporal proximity.
    Type: Application
    Filed: December 10, 2012
    Publication date: June 5, 2014
    Applicant: International Business Machines Corporation
    Inventors: Albert D. Bennah, Adrian X. Rodriguez, Eric Woods
  • Publication number: 20140157037
    Abstract: A computer program product includes computer usable program code for: detecting a stability change in a computer system; identifying a first set of at least one capability of the computer system that is affected by the stability change; identifying, in response to detecting the stability change, a software application that was installed or updated prior to the stability change; identifying a second set of at least one capability that is utilized by the identified software application; comparing the first set to the second set to determine a degree of similarity; comparing a first time that the stability change was detected to a second time that the identified software application was installed or updated to determine a temporal proximity; and identifying the likelihood that the identified software application is the cause of the stability change, wherein the identified likelihood is a function of the degree of similarity and the temporal proximity.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 5, 2014
    Applicant: International Business Machines Corporation
    Inventors: Albert D. Bennah, Adrian X. Rodriguez, Eric Woods
  • Publication number: 20140157038
    Abstract: A method, system and computer program product for reducing the failure of processes. After a job is received, a determination is made as whether the received job is a “short-lived job” or a “long-lived job.” A short-lived job refers to a job who accomplishes a given task in less than a threshold period of time. A long-lived job refers to a job who accomplishes a given task in greater than a threshold period of time. For an identified long-lived job, the long-lived job is executed on a single process apart from other processes; whereas, the short-lived job is executed on at least one process separate from the processes executing long-lived jobs. As a result of executing the long-lived jobs on separate processes from the short-lived jobs, the likelihood of having a process fail is lessened since the duration of time that the process is running will be lessened.
    Type: Application
    Filed: December 4, 2012
    Publication date: June 5, 2014
    Applicant: International Business Machines Corporation
    Inventors: Rohith K. Ashok, Roy F. Brabson, Michael J. Burr, Hugh E. Hockett, Kristin R. Whetstone
  • Publication number: 20140149781
    Abstract: A system and a method are disclosed for batch execution of system calls in an operating system. In one implementation, a processing device configures a system call batching buffer table in a user space of an operating system, the system call batching buffer table including a plurality of system call units, associates a system call number with the system call batching buffer table, and issues a trap instruction to a kernel of the operating system to execute at least one of the plurality of system call units, the trap instruction including the system call number.
    Type: Application
    Filed: November 26, 2012
    Publication date: May 29, 2014
    Applicant: RED HAT, INC.
    Inventor: Neil R. T. Horman
  • Patent number: 8738965
    Abstract: A test method for restarting a computing device communicating with a remote computer. The computing device is shut down and awakened by the remote computer. A second hardware information of the computing device after restarting the operating system of the computing device is compared with initial hardware information of the computing device when the computing device is initial started. Test results are stored to a predetermined storage path and displayed on a screen after the test ends.
    Type: Grant
    Filed: August 9, 2011
    Date of Patent: May 27, 2014
    Assignees: Hong Fu Jin Precision Industry (ShenZhen) Co., Ltd., Hon Hai Precision Industry Co., Ltd.
    Inventors: Hai-Li Wang, Yong-Qian Deng
  • Patent number: 8738996
    Abstract: A system includes a flash memory, an encoder, a first interface, a decoder and a controller. The encoder is configured to (i) receive data, and (ii) encode the data based on an error correction code. The first interface is configured to (i) write the encoded data to a memory cells in the flash memory, and (ii) read the encoded data back from the memory cells. The decoder is configured to (i) decode the encoded data read back from the memory cells, and (ii) based on the decoded data, determine a number of decoding errors for the plurality of memory cells. The controller is configured to, in response to the number of decoding errors being greater than or equal to a first threshold, cease accessing the memory cells. The first threshold is less than a maximum number of errors correctable by the error correction code for the memory cells.
    Type: Grant
    Filed: October 1, 2013
    Date of Patent: May 27, 2014
    Assignee: Marvell International Ltd.
    Inventors: Chen Kuo Huang, Sui-Hung Fred Au, Xueshi Yang, Lau Nguyen
  • Patent number: 8737202
    Abstract: Various exemplary embodiments relate to a method and related network node including one or more of the following: selecting a connection of a plurality of connections; testing the selected connection to determine whether the selected connection is usable; and based on a determination that the selected connection is not usable: closing the selected connection and at least one additional connection of the plurality of connections, and reestablishing the selected connection and the at least one additional connection.
    Type: Grant
    Filed: May 29, 2012
    Date of Patent: May 27, 2014
    Assignee: Alcatel Lucent
    Inventors: Shanawaz Shaik, Kugendran Sabaratnam, Mike Vihtari, Xiong Xue
  • Patent number: 8738760
    Abstract: An approach for supporting automated fault isolation is provided. An alarm is received indicating a fault within a customer network that includes one or more network elements. The customer network being monitored by a service provider. A script is retrieved for specifying a dialogue with the network elements for data collection. A command is transmitted to the customer network according to the one script to obtain data from the network elements, wherein the data is utilized to determine the fault. Further, a workflow event corresponding to the alarm is generated, wherein resolution of the fault is automated according to the workflow event.
    Type: Grant
    Filed: December 23, 2005
    Date of Patent: May 27, 2014
    Assignee: Verizon Business Global LLC
    Inventors: Frank Paul Adams, Jr., Sanjiv Purushottam Damle, Richard Brian Drescher, Joshua Jerome Morris, Michael Robert Smith, Stephen Michael Smith, Jonathan Lyle Spieker, Christopher Lawson White
  • Publication number: 20140143588
    Abstract: An apparatus comprises at least one processor and at least one memory including computer program code, the at least one memory and the computer program code configured, with the at least one processor, to cause the apparatus at least to perform at least commanding a slave node to activate an immediate error response mode; and receiving an instant response from the slave node in response to a communication error.
    Type: Application
    Filed: November 21, 2012
    Publication date: May 22, 2014
    Applicant: Nokia Corporation
    Inventor: Pekka E. Leinonen
  • Patent number: 8732524
    Abstract: Methods, systems and computing devices are provided for using a completed corrective action as evidence of a fault. The methods, systems and computing devices receive equipment status evidence and determine an equipment fault based on the equipment status evidence. The methods, systems and computing devices also create and rank a list of potential failure modes based at least in part on the determined equipment fault, recommend a corrective action to correct the equipment fault based at least in part on the ranking of the potential failure modes and receiving additional equipment status evidence indicating that the recommended corrective action failed to correct the equipment fault. The methods, systems and computing devices then associate a detection probability and a false negative rate with the failed corrective action to create additional status evidence, and re-rank the list of potential failure modes for subsequent performance based on the additional status evidence.
    Type: Grant
    Filed: August 3, 2011
    Date of Patent: May 20, 2014
    Assignee: Honeywell International Inc.
    Inventors: Bradley John Barton, David Michael Kolbet, Qingqiu Ginger Shao, Randy R. Magnuson
  • Patent number: 8732515
    Abstract: A method and apparatus for detecting, curing and remedying invasive software installation inadvertently, negligently, or intentionally marketed by a vendor. A party may procure a product that sends back invasive data to a source. A testing regimen may identify and defeat sources of any invasive executables found. Accordingly, a party may identify those software packages deemed invasive, and may optionally provide a solution to either defeat or monitor them, where practicable. An independent developer may obtain intellectual property rights in the testing, solution or both of the counter-invasive software system or product. An independent developer may become a supplier of testing or solution systems, motivating a supplier by one of several mechanisms. The developer or damaged party may obtain a legal status with respect to the vendor or of a host of software as a customer, user, clients, shareholder, etc.
    Type: Grant
    Filed: August 11, 2011
    Date of Patent: May 20, 2014
    Inventor: Phillip M. Adams
  • Patent number: 8732396
    Abstract: A DAS system that implements RAID technology is provided in which an array of solid state disks (SSDs) that is external to the DAS controllers of the DAS system is used by the DAS controllers as WB cache memory for performing WB caching operations. Using the external SSD array as WB cache memory allows the DAS system to be fully cache coherent without significantly increasing the complexity of the DAS system and without increasing the amount of bandwidth that is utilized for performing caching operations. In addition, using the external SSD array as WB cache memory obviates the need to mirror DAS controllers.
    Type: Grant
    Filed: June 4, 2010
    Date of Patent: May 20, 2014
    Assignee: LSI Corporation
    Inventor: Luca Bert
  • Patent number: 8726066
    Abstract: A method, system, and computer product for data replication comprising enabling a second site to receive I/O from a first site; wherein the second site stores the I/O in a journal until transferring the I/O to an image, regulating, at the first site, an application by delaying acknowledgements of I/O requests from the application, and controlling the size of a do stream in the journal by slowing a response of a splitter.
    Type: Grant
    Filed: March 31, 2011
    Date of Patent: May 13, 2014
    Assignee: EMC Corporation
    Inventors: Assaf Natanzon, Jehuda Shemer
  • Patent number: 8726065
    Abstract: Managing failover operations on a cluster of computers, including: identifying, by a failover hold module, a failure to access data storage in the cluster of computers; preventing the execution of all read operations directed to the data storage that were received after the failure to access data storage was identified; executing all write operations directed to the data storage that were received after the failure to access data storage was identified, including writing data to a cache; identifying that a failover to alternative data storage is complete; executing the held read operations, including reading data from the alternative data storage; and copying, from cache to the alternative data storage, the data written to the cache as part of the write operations.
    Type: Grant
    Filed: October 18, 2011
    Date of Patent: May 13, 2014
    Assignee: International Business Machines Corporation
    Inventors: Travis M. Drucker, Joel C. Dubbels, Thomas J. Eggebraaten, Janice R. Glowacki, Richard J. Stevens, David A. Wall
  • Patent number: 8724449
    Abstract: In one embodiment, a method includes configuring a first node, located at an edge of a core network and connected to an access ring, with a maintenance end point for a virtual local area network on a port connecting the first node to the access ring. The first node operates in an active mode for the virtual local area network and is in communication with a second node located at the edge of the core network and connected to the access ring. The method further includes receiving and processing at a processor at the first node, continuity check messages from access nodes on the access ring, synchronizing the first node with the second node, and communicating from the first node to the second node to initiate switching of the second node from a backup mode to the active mode for the virtual local area network upon identification of a failure.
    Type: Grant
    Filed: June 10, 2009
    Date of Patent: May 13, 2014
    Assignee: Cisco Technology, Inc.
    Inventors: Samer Salam, Ali Sajassi
  • Publication number: 20140129871
    Abstract: Some aspects of the present disclosure provide for a system and method for fault mitigation of a non-volatile memory (NVM) store subject to error correction code (ECC) checking. A simple and robust means to test the integrity of failsafe code stored within the non-volatile memory prior to execution are disclosed. In some embodiments, the failsafe code comprises program elements to communicate the memory failure to other parts of the system, or to execute an orderly shutdown. In the event that an ECC error occurs, the failsafe code can be verified, and upon successful verification, executed.
    Type: Application
    Filed: November 7, 2012
    Publication date: May 8, 2014
    Applicant: Infineon Technologies AG
    Inventors: Dragos Poenaru, Thomas LeMense, Thomas Lange
  • Publication number: 20140129872
    Abstract: A method includes calculating a first syndrome of a codeword read from a memory location under a first set of conditions and calculating a second syndrome of the codeword read from the memory location under a second set of conditions. The method also includes analyzing the first and second syndromes and applying one of the first and second syndromes to the codeword to find the codeword having a minimum number of errors.
    Type: Application
    Filed: January 9, 2014
    Publication date: May 8, 2014
    Applicant: Micron Technology, Inc.
    Inventors: Christopher Bueb, Sean Eilert
  • Patent number: 8713562
    Abstract: Exemplary system and computer program product embodiments for deploying code in a computing sysplex environment are provided. In one embodiment, by way of example only, a system-wide trending mechanism is applied. At least one of an idle time and a low Central Processing Unit (CPU) utilization time of one system in the sysplex environment is matched with an estimated deployment time obtained from at least one of a latest measured period of time and a calculated time trend. A system-wide coordinating mechanism is applied. A staggered code deployment operation is recommended for at least one node of the system at an optimum system time generated from the matching. Data obtained from the code deployment operation is recorded, including a new latest measured period of time. The data is used to generate an updated time trend.
    Type: Grant
    Filed: January 6, 2012
    Date of Patent: April 29, 2014
    Assignee: International Business Machines Corporation
    Inventor: Joseph Whitney Dain
  • Publication number: 20140115376
    Abstract: An integrated circuit is provided with error detection circuitry and error repair circuitry. Error tolerance circuitry is responsive to a control parameter to selectively disable the error repair circuitry. The control parameter is dependent on the processing performed within the circuit. For example, the control parameter may be generated in dependence upon the program instruction being executed, the output signal value which is in error, the previous behavior of the circuit or in other ways.
    Type: Application
    Filed: December 30, 2013
    Publication date: April 24, 2014
    Applicant: ARM Limited
    Inventors: Shidhartha DAS, David Michael Bull, Emre Ozer