Patents by Inventor John E Attinella

John E Attinella has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Patent number: 10831701

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Grant

Filed: September 6, 2019

Date of Patent: November 10, 2020

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy
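The handshake in the abstract above (announce, prepare, report ready, then write) can be pictured with a small in-process simulation. This is only a sketch under stated assumptions: no real RDMA hardware or library is involved, the class and method names (`SourceNode`, `TargetNode`, `prepare`, `receive_ready`) are hypothetical, and the ready message is assumed to travel back to the source node.

```python
class TargetNode:
    """Hypothetical stand-in for a target compute node."""

    def __init__(self, node_id):
        self.node_id = node_id
        self.memory = None  # receive buffer the source will write into

    def prepare(self, size, source):
        # Reserve a buffer for the incoming configuration, then report ready.
        self.memory = bytearray(size)
        source.receive_ready(self.node_id)


class SourceNode:
    """Hypothetical stand-in for the source compute node."""

    def __init__(self):
        self.ready_from = set()

    def receive_ready(self, node_id):
        self.ready_from.add(node_id)

    def broadcast_config(self, config, targets):
        # Step 1: initiate the broadcast so every target can prepare.
        for target in targets:
            target.prepare(len(config), self)
        # Step 2: continue only once every target has reported ready.
        waiting = [t.node_id for t in targets if t.node_id not in self.ready_from]
        if waiting:
            raise RuntimeError(f"targets not ready: {waiting}")
        # Step 3: write the bytes directly into each target's buffer
        # (a plain memory copy simulates the RDMA write).
        for target in targets:
            target.memory[:] = config
        return len(targets)
```

Calling `SourceNode().broadcast_config(config_bytes, targets)` leaves a copy of the configuration in every target's `memory`, without the targets taking part in the copy itself.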
Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Patent number: 10810155

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Grant

Filed: August 13, 2019

Date of Patent: October 20, 2020

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy

Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Publication number: 20190391955

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Application

Filed: September 6, 2019

Publication date: December 26, 2019

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy

Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Publication number: 20190370213

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Application

Filed: August 13, 2019

Publication date: December 5, 2019

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy

Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Patent number: 10474625

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Grant

Filed: January 17, 2012

Date of Patent: November 12, 2019

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy

Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’)

Patent number: 10474626

Abstract: Configuring compute nodes in a parallel computer using remote direct memory access (‘RDMA’), the parallel computer comprising a plurality of compute nodes coupled for data communications via one or more data communications networks, including: initiating, by a source compute node of the parallel computer, an RDMA broadcast operation to broadcast binary configuration information to one or more target compute nodes in the parallel computer; preparing, by each target compute node, the target compute node for receipt of the binary configuration information from the source compute node; transmitting, by each target compute node, a ready message to the target compute node, the ready message indicating that the target compute node is ready to receive the binary configuration information from the source compute node; and performing, by the source compute node, an RDMA broadcast operation to write the binary configuration information into memory of each target compute node.

Type: Grant

Filed: December 10, 2012

Date of Patent: November 12, 2019

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy

Collectively loading programs in a multiple program multiple data environment

Patent number: 10104202

Abstract: Techniques are disclosed for loading programs efficiently in a parallel computing system. In one embodiment, nodes of the parallel computing system receive a load description file which indicates, for each program of a multiple program multiple data (MPMD) job, nodes which are to load the program. The nodes determine, using collective operations, a total number of programs to load and a number of programs to load in parallel. The nodes further generate a class route for each program to be loaded in parallel, where the class route generated for a particular program includes only those nodes on which the program needs to be loaded. For each class route, a node is selected using a collective operation to be a load leader which accesses a file system to load the program associated with a class route and broadcasts the program via the class route to other nodes which require the program.

Type: Grant

Filed: March 13, 2013

Date of Patent: October 16, 2018

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Samuel J. Miller
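The planning step described in the abstract above can be sketched as follows. Everything here is an illustrative assumption rather than the patented method: the load description is modeled as a dictionary, the degree of parallelism is passed in as `max_parallel` instead of being agreed through collective operations, and the load leader is taken to be the lowest node id, standing in for a collective minimum reduction. The function name `plan_mpmd_load` is hypothetical.

```python
def plan_mpmd_load(load_description, max_parallel):
    """Group programs into batches and pick a class route and leader for each.

    load_description: dict mapping program name -> iterable of node ids
                      (each program is assumed to have at least one node).
    max_parallel:     how many programs to load at the same time (>= 1).
    """
    programs = sorted(load_description)
    plan = []
    for start in range(0, len(programs), max_parallel):
        batch = []
        for program in programs[start:start + max_parallel]:
            # The class route contains only the nodes that need this program.
            route = sorted(set(load_description[program]))
            # Assumption: lowest node id wins, as a min-reduction would pick.
            leader = min(route)
            batch.append(
                {"program": program, "class_route": route, "load_leader": leader}
            )
        plan.append(batch)
    return plan
```

Each entry tells one leader which program to read from the file system and which nodes to broadcast it to; only the leaders touch the file system.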
Collectively loading programs in a multiple program multiple data environment

Publication number: 20170212766

Abstract: Techniques are disclosed for loading programs efficiently in a parallel computing system. In one embodiment, nodes of the parallel computing system receive a load description file which indicates, for each program of a multiple program multiple data (MPMD) job, nodes which are to load the program. The nodes determine, using collective operations, a total number of programs to load and a number of programs to load in parallel. The nodes further generate a class route for each program to be loaded in parallel, where the class route generated for a particular program includes only those nodes on which the program needs to be loaded. For each class route, a node is selected using a collective operation to be a load leader which accesses a file system to load the program associated with a class route and broadcasts the program via the class route to other nodes which require the program.

Type: Application

Filed: March 13, 2013

Publication date: July 27, 2017

Applicant: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Samuel J. Miller

Collectively loading programs in a multiple program multiple data environment

Patent number: 9491259

Abstract: Techniques are disclosed for loading programs efficiently in a parallel computing system. In one embodiment, nodes of the parallel computing system receive a load description file which indicates, for each program of a multiple program multiple data (MPMD) job, nodes which are to load the program. The nodes determine, using collective operations, a total number of programs to load and a number of programs to load in parallel. The nodes further generate a class route for each program to be loaded in parallel, where the class route generated for a particular program includes only those nodes on which the program needs to be loaded. For each class route, a node is selected using a collective operation to be a load leader which accesses a file system to load the program associated with a class route and broadcasts the program via the class route to other nodes which require the program.

Type: Grant

Filed: March 13, 2013

Date of Patent: November 8, 2016

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Samuel J. Miller

Collectively loading an application in a parallel computer

Patent number: 9229782

Abstract: Collectively loading an application in a parallel computer, the parallel computer comprising a plurality of compute nodes, including: identifying, by a parallel computer control system, a subset of compute nodes in the parallel computer to execute a job; selecting, by the parallel computer control system, one of the subset of compute nodes in the parallel computer as a job leader compute node; retrieving, by the job leader compute node from computer memory, an application for executing the job; and broadcasting, by the job leader to the subset of compute nodes in the parallel computer, the application for executing the job.

Type: Grant

Filed: March 27, 2012

Date of Patent: January 5, 2016

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Samuel J. Miller, Michael B. Mundy

Servicing a globally broadcast interrupt signal in a multi-threaded computer

Patent number: 9223728

Abstract: Methods, apparatuses, and computer program products for servicing a globally broadcast interrupt signal in a multi-threaded computer comprising a plurality of processor threads. Embodiments include an interrupt controller indicating in a plurality of local interrupt status locations that a globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include a thread determining that a local interrupt status location corresponding to the thread indicates that the globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include the thread processing one or more entries in a global interrupt status bit queue based on whether global interrupt status bits associated with the globally broadcast interrupt signal are locked. Each entry in the global interrupt status bit queue corresponds to a queued global interrupt.

Type: Grant

Filed: March 12, 2013

Date of Patent: December 29, 2015

Assignee: International Business Machines Corporation

Inventors: John E. Attinella, Kristan D. Davis, Roy G. Musselman, David L. Satterfield

Servicing a globally broadcast interrupt signal in a multi-threaded computer

Patent number: 9223729

Abstract: Methods, apparatuses, and computer program products for servicing a globally broadcast interrupt signal in a multi-threaded computer comprising a plurality of processor threads. Embodiments include an interrupt controller indicating in a plurality of local interrupt status locations that a globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include a thread determining that a local interrupt status location corresponding to the thread indicates that the globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include the thread processing one or more entries in a global interrupt status bit queue based on whether global interrupt status bits associated with the globally broadcast interrupt signal are locked. Each entry in the global interrupt status bit queue corresponds to a queued global interrupt.

Type: Grant

Filed: March 14, 2013

Date of Patent: December 29, 2015

Assignee: International Business Machines Corporation

Inventors: John E. Attinella, Kristan D. Davis, Roy G. Musselman, David L. Satterfield

Aggregating job exit statuses of a plurality of compute nodes executing a parallel application

Patent number: 9086962

Abstract: Aggregating job exit statuses of a plurality of compute nodes executing a parallel application, including: identifying a subset of compute nodes in the parallel computer to execute the parallel application; selecting one compute node in the subset of compute nodes in the parallel computer as a job leader compute node; initiating execution of the parallel application on the subset of compute nodes; receiving an exit status from each compute node in the subset of compute nodes, where the exit status for each compute node includes information describing execution of some portion of the parallel application by the compute node; aggregating each exit status from each compute node in the subset of compute nodes; and sending an aggregated exit status for the subset of compute nodes in the parallel computer.

Type: Grant

Filed: June 15, 2012

Date of Patent: July 21, 2015

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy
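As a rough illustration of the aggregation step in the abstract above, the sketch below folds per-node exit statuses into one summary. The abstract does not say how statuses are combined, so the rules used here are assumptions: code 0 means normal completion, the job-level code is the largest per-node code, and the names `ExitStatus` and `aggregate_exit_statuses` are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ExitStatus:
    node_id: int
    code: int  # assumption: 0 means this node's portion finished normally


def aggregate_exit_statuses(statuses):
    """Combine one exit status per compute node into a single summary."""
    counts = Counter(s.code for s in statuses)
    failed = sorted(s.node_id for s in statuses if s.code != 0)
    return {
        "nodes": len(statuses),
        "counts_by_code": dict(counts),
        "failed_nodes": failed,
        # Assumption: report the largest code as the job-level status.
        "job_code": max((s.code for s in statuses), default=0),
    }
```

A job leader could send this one dictionary onward instead of forwarding a separate message for every compute node in the subset.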
Core file limiter for abnormally terminating processes

Patent number: 9003226

Abstract: Computer program product and system to limit core file generation in a massively parallel computing system comprising a plurality of compute nodes each executing at least one task, of a plurality of tasks, by: upon determining that a first task executing on a first compute node has failed, performing an atomic load and increment operation on a core file count; generating a first core file upon determining that the core file count is below a predefined threshold; and not generating the first core file upon determining that the core file count is not below the predefined threshold.

Type: Grant

Filed: November 14, 2012

Date of Patent: April 7, 2015

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding
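The limiting rule in the abstract above comes down to one atomic load-and-increment per failing task, followed by a comparison with the threshold. The sketch below simulates that with threads; it is a minimal illustration, with a lock standing in for the atomic operation and with hypothetical names (`CoreFileLimiter`, `should_write_core`, `simulate_failures`).

```python
import threading


class CoreFileLimiter:
    """Caps how many failing tasks are allowed to write a core file."""

    def __init__(self, threshold):
        self.threshold = threshold      # the predefined threshold
        self._count = 0                 # shared core file count
        self._lock = threading.Lock()   # stands in for a hardware atomic

    def _load_and_increment(self):
        # Return the old count and add one, as a single indivisible step.
        with self._lock:
            old = self._count
            self._count += 1
            return old

    def should_write_core(self):
        # Write a core file only while the loaded count is below the threshold.
        return self._load_and_increment() < self.threshold


def simulate_failures(num_tasks, threshold):
    """Fail num_tasks tasks concurrently; return ids that wrote a core file."""
    limiter = CoreFileLimiter(threshold)
    written = []
    written_lock = threading.Lock()

    def fail(task_id):
        if limiter.should_write_core():
            with written_lock:
                written.append(task_id)

    threads = [threading.Thread(target=fail, args=(i,)) for i in range(num_tasks)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return sorted(written)
```

Because every failing task increments the counter exactly once, the number of core files never exceeds the threshold, regardless of how many tasks fail at the same moment.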
Core file limiter for abnormally terminating processes

Patent number: 8996911

Abstract: Computer program product and system to limit core file generation in a massively parallel computing system comprising a plurality of compute nodes each executing at least one task, of a plurality of tasks, by: upon determining that a first task executing on a first compute node has failed, performing an atomic load and increment operation on a core file count; generating a first core file upon determining that the core file count is below a predefined threshold; and not generating the first core file upon determining that the core file count is not below the predefined threshold.

Type: Grant

Filed: December 5, 2012

Date of Patent: March 31, 2015

Assignee: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding

Servicing a globally broadcast interrupt signal in a multi-threaded computer

Publication number: 20140281090

Abstract: Methods, apparatuses, and computer program products for servicing a globally broadcast interrupt signal in a multi-threaded computer comprising a plurality of processor threads. Embodiments include an interrupt controller indicating in a plurality of local interrupt status locations that a globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include a thread determining that a local interrupt status location corresponding to the thread indicates that the globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include the thread processing one or more entries in a global interrupt status bit queue based on whether global interrupt status bits associated with the globally broadcast interrupt signal are locked. Each entry in the global interrupt status bit queue corresponds to a queued global interrupt.

Type: Application

Filed: March 14, 2013

Publication date: September 18, 2014

Applicant: International Business Machines Corporation

Inventors: John E. Attinella, Kristan D. Davis, Roy G. Musselman, David L. Satterfield

Collectively loading programs in a multiple program multiple data environment

Publication number: 20140282599

Abstract: Techniques are disclosed for loading programs efficiently in a parallel computing system. In one embodiment, nodes of the parallel computing system receive a load description file which indicates, for each program of a multiple program multiple data (MPMD) job, nodes which are to load the program. The nodes determine, using collective operations, a total number of programs to load and a number of programs to load in parallel. The nodes further generate a class route for each program to be loaded in parallel, where the class route generated for a particular program includes only those nodes on which the program needs to be loaded. For each class route, a node is selected using a collective operation to be a load leader which accesses a file system to load the program associated with a class route and broadcasts the program via the class route to other nodes which require the program.

Type: Application

Filed: March 13, 2013

Publication date: September 18, 2014

Applicant: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Samuel J. Miller

Core file limiter for abnormally terminating processes

Publication number: 20140136890

Abstract: Computer program product and system to limit core file generation in a massively parallel computing system comprising a plurality of compute nodes each executing at least one task, of a plurality of tasks, by: upon determining that a first task executing on a first compute node has failed, performing an atomic load and increment operation on a core file count; generating a first core file upon determining that the core file count is below a predefined threshold; and not generating the first core file upon determining that the core file count is not below the predefined threshold.

Type: Application

Filed: November 14, 2012

Publication date: May 15, 2014

Applicant: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding

Core file limiter for abnormally terminating processes

Publication number: 20140136888

Abstract: Computer program product and system to limit core file generation in a massively parallel computing system comprising a plurality of compute nodes each executing at least one task, of a plurality of tasks, by: upon determining that a first task executing on a first compute node has failed, performing an atomic load and increment operation on a core file count; generating a first core file upon determining that the core file count is below a predefined threshold; and not generating the first core file upon determining that the core file count is not below the predefined threshold.

Type: Application

Filed: December 5, 2012

Publication date: May 15, 2014

Applicant: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding

Aggregating job exit statuses of a plurality of compute nodes executing a parallel application

Publication number: 20130339805

Abstract: Aggregating job exit statuses of a plurality of compute nodes executing a parallel application, including: identifying a subset of compute nodes in the parallel computer to execute the parallel application; selecting one compute node in the subset of compute nodes in the parallel computer as a job leader compute node; initiating execution of the parallel application on the subset of compute nodes; receiving an exit status from each compute node in the subset of compute nodes, where the exit status for each compute node includes information describing execution of some portion of the parallel application by the compute node; aggregating each exit status from each compute node in the subset of compute nodes; and sending an aggregated exit status for the subset of compute nodes in the parallel computer.

Type: Application

Filed: June 15, 2012

Publication date: December 19, 2013

Applicant: International Business Machines Corporation

Inventors: Michael E. Aho, John E. Attinella, Thomas M. Gooding, Michael B. Mundy
