Patents Assigned to Stratus Technologies Bermuda LTD
  • Publication number: 20220066887
    Abstract: In part, the disclosure relates to a real-time fault tolerant system. The system may include a first computing device, a second computing, and a hardware interconnect. The first computing device may include one or more memory devices, one or more processors, a first network interface operable to receive device data and transmit output data over a time-slot-based bus, wherein the output data is generated from processing device data, and a first real-time checkpoint engine. The second computing device may include similar components or the same components as the first computing device. The hardware interconnect is operable to permit data exchange between the first computing device and the second computing device. Checkpoints may be generated by checkpoint engines during lower-priority communication time slots allocated on the time slot-based bus to avoid interfering with any real-time communications to or from the first and second computing devices.
    Type: Application
    Filed: August 26, 2020
    Publication date: March 3, 2022
    Applicant: Stratus Technologies Bermuda, Ltd.
    Inventors: Charles J. Horvath, Lei Cao
  • Publication number: 20210034464
    Abstract: A method and system of checkpointing in a computing system having a primary node and a secondary node is disclosed. In one embodiment the method includes the steps of determining by the primary node to initiate a checkpoint process; sending a notification to the secondary node, by the primary node, of an impending checkpoint process; blocking, by the primary node, I/O requests from the Operating System (OS) that arrive at the primary node after the determination to initiate the checkpoint process; completing, by the primary node, active I/O requests for data received from the OS prior to the determination to initiate the checkpoint process, by accessing the primary node data storage; and upon receiving, by the primary node, a notice of checkpoint readiness from the secondary node, initiating a checkpoint process to move state and data from the primary node to the secondary node.
    Type: Application
    Filed: June 13, 2020
    Publication date: February 4, 2021
    Applicant: Stratus Technologies Bermuda, Ltd.
    Inventors: Nathaniel Horwitch Dailey, Stephen J. Wark, Angel L. Pagan
  • Publication number: 20210034523
    Abstract: In part, the disclosure relates to a method of performing a checkpoint process in an active-active computer system including a first node and a second node, wherein each node includes an active checkpoint cache, flush cache, and data storage. In various embodiments, flush operations are coordinated between nodes. The method includes receiving a request for a checkpoint operation at the first node; pausing activity at the first node; notifying the second node of the impending checkpoint operation; performing the checkpoint operation, wherein data associated with the checkpoint operation includes the active checkpoint cache and the flush cache; merging the active checkpoint cache into the flush cache; and resuming activity at the first node. The method may also include each node informing the other node of the completion of cache flush operations.
    Type: Application
    Filed: June 13, 2020
    Publication date: February 4, 2021
    Applicant: Stratus Technologies Bermuda, Ltd.
    Inventor: Nathaniel Horwitch Dailey
  • Publication number: 20210037092
    Abstract: In part, disclosure relates to a method of regulating checkpointing in an active active fault tolerant system. The method includes receiving a request from a client through a network at a primary computer; copying, by the primary computer, the request from the client to a secondary computer; processing the request from the client, using the primary computer, to generate a primary computer result; processing the copy of the request from the client, using the secondary computer, to generate a secondary computer result; comparing the primary computer result and the secondary computer result to obtain a comparison metric; determining whether a minimum checkpoint interval has been met or exceeded; and if the minimum checkpoint interval has not been met or exceeded, delay initiating a checkpoint process from primary computer to secondary computer.
    Type: Application
    Filed: June 13, 2020
    Publication date: February 4, 2021
    Applicant: Stratus Technologies Bermuda, Ltd.
    Inventor: Lei Cao
  • Publication number: 20210034483
    Abstract: In part, the disclosure relates to systems and methods to rapidly copy the computer operating system, drivers and applications from a source computer to a target computer using a duplication engine. Once the copy is complete the source computer will resume execution, and the target computer will first alter its configuration (also referred to as a role or personality) and then resume execution conforming to its new configuration as indicated by a profile stored in protected or specialized memory. The profile can be value, a file, or other memory structure and is protected in the sense that the profile (and or the region of memory where it is stored) must not be overwritten by a state transfer from the source computer to the target computer.
    Type: Application
    Filed: June 13, 2020
    Publication date: February 4, 2021
    Applicant: Stratus Technologies Bermuda, Ltd.
    Inventor: Steven Michael Haid
  • Publication number: 20200050523
    Abstract: A fault tolerant computer system and method are disclosed. The system may include a plurality of CPU nodes, each including: a processor and a memory; at least two IO domains, wherein at least one of the IO domains is designated an active IO domain performing communication functions for the active CPU nodes; and a switching fabric connecting each CPU node to each IO domain. One CPU node is designated a standby CPU node and the remainder are designated as active CPU nodes. If a failure, a beginning of a failure, or a predicted failure occurs in an active node, the state and memory of the active CPU node are transferred to the standby CPU node which becomes the new active CPU node. If a failure occurs in an active IO domain, the communication functions performed by the failing active IO domain are transferred to the other IO domain.
    Type: Application
    Filed: August 9, 2019
    Publication date: February 13, 2020
    Applicant: STRATUS TECHNOLOGIES BERMUDA, LTD.
    Inventors: Chester W. Pawlowski, John M. Chaves, Andrew Alden, Craig D. Keefer, Christopher D. Cotton, Michael Egan
  • Patent number: 10360117
    Abstract: A checkpointing method in a network device fault tolerant system using virtual machines. In one embodiment, the network device has an input port, an output port, an active virtual machine and a standby virtual machine, a network application on the active virtual machine which manipulates data present on the input port and transmits the manipulated data from the output port; a checkpoint engine on the active virtual machine; and an interface agent, on the active virtual machine, having callable functions to move data from the input port to the output port. The method includes the steps of determining, by the checkpoint engine, that a checkpoint is required; requesting by the checkpoint engine that the interface agent quiescent itself; returning, by the interface agent to the network application, an indicator that no packets are available regardless of whether or not packets are arriving at the input port.
    Type: Grant
    Filed: June 19, 2017
    Date of Patent: July 23, 2019
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Steven Michael Haid, Lei Cao, Aaron Tyrone Smith
  • Patent number: 10216598
    Abstract: A method of transferring memory from an active to a standby memory in an FT Server system. The method includes the steps of: reserving a portion of memory using BIOS; loading and initializing an FT Kernel Mode Driver; loading and initializing an FT Virtual Machine Manager (FTVMM) including the Second Level Address Translation table SLAT into the reserved memory. In another embodiment, the method includes tracking memory accesses using the FTVMM's SLAT in Reserved Memory and tracking “L2” Guest memory accesses by tracking the current Guest's SLAT and intercepting the Hypervisor's writes to the SLAT. In yet another embodiment, the method includes entering Brownout by collecting the D-Bits; invalidating the processor's cached SLAT translation entries, and copying the dirtied pages from the active memory to memory in the second Subsystem. In one embodiment, the method includes entering Blackout and moving the final dirty pages from active to the mirror memory.
    Type: Grant
    Filed: July 11, 2017
    Date of Patent: February 26, 2019
    Assignee: Stratus Technologies Bermuda LTD.
    Inventors: Steven Michael Haid, John Rogers MacLeod
  • Publication number: 20190018746
    Abstract: A method of transferring memory from an active to a standby memory in an FT Server system. The method includes the steps of: reserving a portion of memory using BIOS; loading and initializing an FT Kernel Mode Driver; loading and initializing an FT Virtual Machine Manager (FTVMM) including the Second Level Address Translation table SLAT into the reserved memory. In another embodiment, the method includes tracking memory accesses using the FTVMM's SLAT in Reserved Memory and tracking “L2” Guest memory accesses by tracking the current Guest's SLAT and intercepting the Hypervisor's writes to the SLAT. In yet another embodiment, the method includes entering Brownout by collecting the D-Bits; invalidating the processor's cached SLAT translation entries, and copying the dirtied pages from the active memory to memory in the second Subsystem. In one embodiment, the method includes entering Blackout and moving the final dirty pages from active to the mirror memory.
    Type: Application
    Filed: July 11, 2017
    Publication date: January 17, 2019
    Applicant: Stratus Technologies Bermuda Ltd.
    Inventors: Steven Michael Haid, John Rogers MacLeod
  • Publication number: 20170364425
    Abstract: A checkpointing method in a network device fault tolerant system using virtual machines. In one embodiment, the network device has an input port, an output port, an active virtual machine and a standby virtual machine, a network application on the active virtual machine which manipulates data present on the input port and transmits the manipulated data from the output port; a checkpoint engine on the active virtual machine; and an interface agent, on the active virtual machine, having callable functions to move data from the input port to the output port. The method includes the steps of determining, by the checkpoint engine, that a checkpoint is required; requesting by the checkpoint engine that the interface agent quiescent itself; returning, by the interface agent to the network application, an indicator that no packets are available regardless of whether or not packets are arriving at the input port.
    Type: Application
    Filed: June 19, 2017
    Publication date: December 21, 2017
    Applicant: STRATUS TECHNOLOGIES BERMUDA LTD.
    Inventors: Steven Michael Haid, Lei Cao, Aaron Tyrone Smith
  • Patent number: 9760442
    Abstract: A method of delaying checkpointing in a virtual machine system. In one embodiment, the method includes the steps of examining a network frame to determine if it is a deferrable frame and if the frame is a deferrable frame, delaying a checkpoint associated with the frame. In another embodiment, the deferrable frame is one of a group comprising: an IP packet tagged with the ‘more fragments’ attribute; TCP data segments that lack the PSH flag and carry no flags other than ‘ACK’; and TCP segments that contain no data and carry only the ‘ACK’ flag; and any frame originating from or destined to a designated network address or port number. In still another embodiment, the method includes the step of concatenating the delays due to deferrable frames. In still yet another embodiment, the method further includes setting an upper limit to the amount of delay that can be generated.
    Type: Grant
    Filed: December 16, 2014
    Date of Patent: September 12, 2017
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Thomas D. Bissett, Paul A. Leveille, Srinivasu Chinta
  • Patent number: 9652338
    Abstract: A method for determining a delay in a dynamic, event driven, checkpoint interval. In one embodiment, the method includes the steps of determining the number of network bits to be transferred; determining the target bit transfer rate; calculating the next cycle delay as the number of bits to be transferred divided by the target bit transfer rate. In another aspect, the invention relates to a method for delaying a checkpoint interval. In one embodiment, the method includes the steps of monitoring the transfer of a prior batch of network data and delaying a subsequent checkpoint until the transfer of a prior batch of network data has reached a certain predetermined level of completion. In another embodiment, the predetermined level of completion is 100%.
    Type: Grant
    Filed: December 16, 2014
    Date of Patent: May 16, 2017
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Thomas D. Bissett, Paul A. Leveille, Srinivasu Chinta
  • Patent number: 9588844
    Abstract: In one aspect, the invention relates to a fault tolerant computing system. The system includes a primary virtual machine and a secondary virtual machine, wherein the primary and secondary virtual machines are in communication, wherein the primary virtual machine comprises a first checkpointing engine and a first network interface, wherein the secondary virtual machine comprises a second network interface, wherein the first checkpointing engine forwards a page of memory of the primary virtual machine to the second virtual machine such that the first checkpointing engine can checkpoint the page of memory without pausing the primary virtual machine.
    Type: Grant
    Filed: December 16, 2014
    Date of Patent: March 7, 2017
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Thomas D. Bissett, Paul A. Leveille
  • Patent number: 9251002
    Abstract: In part, the invention relates to a system and method for writing checkpointing data to a computer having a standby virtual machine for each checkpointed component on a computer having an active virtual machine. In one embodiment, the checkpointing data is processed on a per virtual machine basis. This is performed in a way that allows checkpointing data packets from multiple sources to be transferred asynchronously, subsequently reassembled into a coherent checkpoint message, and applied asynchronously.
    Type: Grant
    Filed: December 16, 2013
    Date of Patent: February 2, 2016
    Assignee: STRATUS TECHNOLOGIES BERMUDA LTD.
    Inventors: Robert Manchek, Steven Haid, Kimball A. Murray
  • Patent number: 8381012
    Abstract: An apparatus and method for fault-tolerant and spread spectrum clocking. In one embodiment a master clock synthesizer circuit generates an output clock signal of varying frequency within a predetermined range of frequencies. A slave clock synthesizer circuit is provided to track the output clock signal generated by the master clock synthesizer circuit. If the master clock synthesizer circuit fails or generates an invalid output clock signal, the slave clock synthesizer circuit takes over and functions as the master clock synthesizer circuit. In one embodiment a method of fault-tolerant spread spectrum clocking includes generating a first digital data stream; receiving the first digital data stream, a first input reference signal and a first clock signal in a master clock synthesizer circuit; generating an first output clock signal of varying frequency by the master clock synthesizer circuit in response to the first digital data stream and the first clock signal.
    Type: Grant
    Filed: March 15, 2012
    Date of Patent: February 19, 2013
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventor: Garth Dylan Wiebe
  • Patent number: 8312318
    Abstract: A transparent high-availability solution utilizing virtualization technology is presented. A cluster environment and management thereof is implemented through an automated installation and setup procedure resulting in a cluster acting as a single system. The cluster is setup in an isolated virtual machine on each of a number of physical nodes of the system. Customer applications are run within separate application virtual machines on one physical node at a time and are run independently and unaware of their configuration as part of a high-availability cluster. Upon detection of a failure, traffic is rerouted through a redundant node and the application virtual machines are migrated from the failing node to another node using live migration techniques.
    Type: Grant
    Filed: January 18, 2012
    Date of Patent: November 13, 2012
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Simon P. Graham, Daniel Lussier
  • Patent number: 8271416
    Abstract: A method for dynamically determining a predetermined previous condition of a rule-based system comprising a plurality of rules. Each of the plurality of rules is associated with a transaction and a condition under which the transaction is executed. In one embodiment, the method includes the steps of (a) determining a system condition; (b) determining an immediate previous condition that caused the system condition; (c) setting the immediate previous condition to the system condition; and (d) repeating steps (b) and (c) until the predetermined previous condition is reached.
    Type: Grant
    Filed: August 12, 2008
    Date of Patent: September 18, 2012
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Haitham Mahmoud Al-Beik, Bjorn Bergsten
  • Patent number: 8234521
    Abstract: A system is provided for rapidly synchronizing two or more processing elements in a fault-tolerant computing system. Embodiments of this system allow for the rapid synchronization of two processing elements through partial copies of the contents of memory associate with each processing element.
    Type: Grant
    Filed: September 30, 2008
    Date of Patent: July 31, 2012
    Assignee: Stratus Technologies Bermuda Ltd.
    Inventors: Simon Graham, Daniel Lussier, Timothy Wegner, Jeffrey Somers, Steven Haid, John W. Edwards, Jr.
  • Patent number: 8161311
    Abstract: An apparatus and method for fault-tolerant and spread spectrum clocking. In one embodiment a master clock synthesizer circuit generates an output clock signal of varying frequency within a predetermined range of frequencies. A slave clock synthesizer circuit is provided to track the output clock signal generated by the master clock synthesizer circuit. If the master clock synthesizer circuit fails or generates an invalid output clock signal, the slave clock synthesizer circuit takes over and functions as the master clock synthesizer circuit. In one embodiment a method of fault-tolerant spread spectrum clocking includes generating a first digital data stream; receiving the first digital data stream, a first input reference signal and a first clock signal in a master clock synthesizer circuit; generating an first output clock signal of varying frequency by the master clock synthesizer circuit in response to the first digital data stream and the first clock signal.
    Type: Grant
    Filed: August 23, 2007
    Date of Patent: April 17, 2012
    Assignee: Stratus Technologies Bermuda Ltd
    Inventor: Garth Dylan Wiebe
  • Patent number: 8117495
    Abstract: A transparent high-availability solution utilizing virtualization technology is presented. A cluster environment and management thereof is implemented through an automated installation and setup procedure resulting in a cluster acting as a single system. The cluster is setup in an isolated virtual machine on each of a number of physical nodes of the system. Customer applications are run within separate application virtual machines on one physical node at a time and are run independently and unaware of their configuration as part of a high-availability cluster. Upon detection of a failure, traffic is rerouted through a redundant node and the application virtual machines are migrated from the failing node to another node using live migration techniques.
    Type: Grant
    Filed: November 26, 2007
    Date of Patent: February 14, 2012
    Assignee: Stratus Technologies Bermuda Ltd
    Inventors: Simon Graham, Dan Lussier