Patents by Inventor Aaron F. Godfrey

Aaron F. Godfrey has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9154400
    Abstract: A system for allowing dynamic changing of routing information of a network interconnect while avoiding deadlocks and preserving packet ordering. A network resiliency system detects when an error in the network interconnect occurs and dynamically generates new routing information for the routers that factors in the detected error. The network resiliency system then generates new routing information that factors in the failure. The network resiliency system then directs the network interconnect to enter a quiescent state in which no packets are transiting through the network interconnect. After the network interconnect enters the quiescent state, the network resiliency system directs the loading of the new routing information into the routing tables of the network interconnect and then directs the network interconnect to start injecting request packets into the network interconnect.
    Type: Grant
    Filed: May 10, 2011
    Date of Patent: October 6, 2015
    Assignee: Cray Inc.
    Inventors: Aaron F. Godfrey, Christopher B. Johns
  • Patent number: 8982688
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a causation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a causation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Grant
    Filed: March 9, 2011
    Date of Patent: March 17, 2015
    Assignee: Cray Inc
    Inventors: Edwin L. Froese, Christopher B. Johns, Aaron F. Godfrey, Laurence S. Kaplan, Matthew P. Kelly, Brent T. Shields
  • Patent number: 8953442
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a cauzation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a cauzation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Grant
    Filed: March 9, 2011
    Date of Patent: February 10, 2015
    Assignee: Cray Inc.
    Inventors: Laurence S. Kaplan, Edwin L. Froese, Christopher B. Johns, Matthew P. Kelly, Aaron F. Godfrey, Brent T. Shields
  • Patent number: 8885467
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a causation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a causation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Grant
    Filed: March 9, 2011
    Date of Patent: November 11, 2014
    Assignee: Cray Inc.
    Inventors: Aaron F. Godfrey, Christopher B. Johns, Edwin L. Froese, Matthew P. Kelly, Laurence S. Kaplan, Brent T. Shields
  • Patent number: 8854951
    Abstract: A system for allowing dynamic changing of routing information of a network interconnect while avoiding deadlocks and preserving packet ordering. A network resiliency system detects when an error in the network interconnect occurs and dynamically generates new routing information for the routers that factors in the detected error. The network resiliency system then generates new routing information that factors in the failure. The network resiliency system then directs the network interconnect to enter a quiescent state in which no packets are transiting through the network interconnect. After the network interconnect enters the quiescent state, the network resiliency system directs the loading of the new routing information into the routing tables of the network interconnect and then directs the network interconnect to start injecting request packets into the network interconnect.
    Type: Grant
    Filed: May 10, 2011
    Date of Patent: October 7, 2014
    Assignee: Cray Inc.
    Inventors: Aaron F. Godfrey, Christopher B. Johns
  • Publication number: 20120287821
    Abstract: A system for allowing dynamic changing of routing information of a network interconnect while avoiding deadlocks and preserving packet ordering. A network resiliency system detects when an error in the network interconnect occurs and dynamically generates new routing information for the routers that factors in the detected error. The network resiliency system then generates new routing information that factors in the failure. The network resiliency system then directs the network interconnect to enter a quiescent state in which no packets are transiting through the network interconnect. After the network interconnect enters the quiescent state, the network resiliency system directs the loading of the new routing information into the routing tables of the network interconnect and then directs the network interconnect to start injecting request packets into the network interconnect.
    Type: Application
    Filed: May 10, 2011
    Publication date: November 15, 2012
    Inventors: Aaron F. Godfrey, Christopher B. Johns
  • Publication number: 20120287777
    Abstract: A system for allowing dynamic changing of routing information of a network interconnect while avoiding deadlocks and preserving packet ordering. A network resiliency system detects when an error in the network interconnect occurs and dynamically generates new routing information for the routers that factors in the detected error. The network resiliency system then generates new routing information that factors in the failure. The network resiliency system then directs the network interconnect to enter a quiescent state in which no packets are transiting through the network interconnect. After the network interconnect enters the quiescent state, the network resiliency system directs the loading of the new routing information into the routing tables of the network interconnect and then directs the network interconnect to start injecting request packets into the network interconnect.
    Type: Application
    Filed: May 10, 2011
    Publication date: November 15, 2012
    Inventors: Aaron F. Godfrey, Christopher B. Johns
  • Publication number: 20120230177
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a causation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a causation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Application
    Filed: March 9, 2011
    Publication date: September 13, 2012
    Inventors: Edwin L. Froese, Christopher B. Johns, Aaron F. Godfrey, Laurence S. Kaplan, Matthew P. Kelly, Brent T. Shields
  • Publication number: 20120230188
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a causation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a causation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Application
    Filed: March 9, 2011
    Publication date: September 13, 2012
    Inventors: Aaron F. Godfrey, Christopher B. Johns, Edwin L. Froese, Matthew P. Kelly, Laurence S. Kaplan, Brent T. Shields
  • Publication number: 20120230212
    Abstract: A method and system for detecting congestion in a network of nodes, abating the network congestion, and identifying the cause of the network congestion is provided. A congestion detection system may comprise a detection system, an abatement system, and a causation system. The detection system monitors the performance of network components such as the network interface controllers and tiles of routers to determine whether the network is congested such that a delay in delivering packets becomes unacceptable. Upon detecting that the network is congested, an abatement system abates the congestion by limiting the rate at which packets are injected into the network from the nodes. Upon detecting that the network is congested, a causation system may identify the job that is executing on a node that is the cause of the network congestion.
    Type: Application
    Filed: March 9, 2011
    Publication date: September 13, 2012
    Inventors: Laurence S. Kaplan, Edwin L. Froese, Christopher B. Johns, Matthew P. Kelly, Aaron F. Godfrey, Brent T. Shields
  • Patent number: 8261134
    Abstract: A multiprocessor computer system comprises one or more watchdog timers operable to detect failure of a memory operation based on passage of a certain timing period from a memory operation being issued without a valid response. An error handler is operable to take corrective action regarding the failed memory operation, such as to provide at least one of hardware state management and application state management.
    Type: Grant
    Filed: January 28, 2010
    Date of Patent: September 4, 2012
    Assignee: Cray Inc.
    Inventors: Dennis C. Abts, Steven L. Scott, Aaron F. Godfrey
  • Patent number: 8239704
    Abstract: In some embodiments, the present invention relates to a method of maintaining a global clock within a multiprocessor system having a plurality of nodes that are connected in a network via links. A virtual spanning tree is mapped onto the network and the nodes and the links are configured such that each node is in a parent-child relationship with one or more other nodes in the virtual spanning tree. A global clock is generated in a root of the virtual spanning tree and global clock signals are communicated down the virtual spanning tree to each of the nodes.
    Type: Grant
    Filed: June 12, 2009
    Date of Patent: August 7, 2012
    Assignee: Cray Inc.
    Inventors: Steven L. Scott, Dennis C. Abts, Aaron F. Godfrey
  • Patent number: 8095759
    Abstract: A multiprocessor computer system comprises a plurality of processors and a plurality of nodes, each node comprising one or more processors. A local memory in each of the plurality of nodes is coupled to the processors in each node, and a hardware firewall comprising a part of one or more of the nodes is operable to prevent a write from an unauthorized processor from writing to the local memory.
    Type: Grant
    Filed: May 29, 2009
    Date of Patent: January 10, 2012
    Assignee: Cray Inc.
    Inventors: Dennis C. Abts, Steven L. Scott, Aaron F. Godfrey
  • Publication number: 20100318831
    Abstract: In some embodiments, the present invention relates to a method of maintaining a global clock within a multiprocessor system having a plurality of nodes that are connected in a network via links. A virtual spanning tree is mapped onto the network and the nodes and the links are configured such that each node is in a parent-child relationship with one or more other nodes in the virtual spanning tree. A global clock is generated in a root of the virtual spanning tree and global clock signals are communicated down the virtual spanning tree to each of the nodes.
    Type: Application
    Filed: June 12, 2009
    Publication date: December 16, 2010
    Applicant: Cray Inc.
    Inventors: Steven L. Scott, Dennis C. Abts, Aaron F. Godfrey
  • Publication number: 20100318774
    Abstract: A multiprocessor computer system comprises a plurality of processors distributed across a plurality of node coupled by a processor interconnect network. One or more of the processors is operable to manage hung processor instructions by setting a graduation timeout counter after a first program instruction graduates, resetting the graduation timeout counter if a subsequent program instruction graduates before the graduation timeout counter expires, and resetting the processor if the graduation timeout counter expires before the subsequent program instruction graduates.
    Type: Application
    Filed: June 12, 2009
    Publication date: December 16, 2010
    Applicant: Cray Inc.
    Inventors: Dennis C. Abts, Aaron F. Godfrey
  • Publication number: 20100306489
    Abstract: A multiprocessor computer system comprises a plurality of processors and a plurality of nodes, each node comprising one or more processors. A local memory in each of the plurality of nodes is coupled to the processors in each node, and a hardware firewall comprising a part of one or more of the nodes is operable to prevent a write from an unauthorized processor from writing to the local memory.
    Type: Application
    Filed: May 29, 2009
    Publication date: December 2, 2010
    Applicant: Cray Inc.
    Inventors: Dennis C. Abts, Steven L. Scott, Aaron F. Godfrey
  • Publication number: 20100199121
    Abstract: A multiprocessor computer system comprises one or more watchdog timers operable to detect failure of a memory operation based on passage of a certain timing period from a memory operation being issued without a valid response. An error handler is operable to take corrective action regarding the failed memory operation, such as to provide at least one of hardware state management and application state management.
    Type: Application
    Filed: January 28, 2010
    Publication date: August 5, 2010
    Applicant: Cray Inc
    Inventors: Dennis C. Abts, Steven L. Scott, Aaron F. Godfrey