Patents by Inventor Sayantan Sur

Sayantan Sur has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190102236
    Abstract: Methods, software, and systems for improved data transfer operations using overlapped rendezvous memory registration. Techniques are disclosed for transferring data between a first process operating as a sender and a second process operating as a receiver. The sender sends a PUT request message to the receiver including payload data stored in a send buffer and first and second match indicia. Subsequent to or in conjunction with sending the PUT request message, the send buffer is exposed on the sender. The first match indicia is used to determine whether the PUT request is expected or unexpected. If the PUT request is unexpected, an RMA GET operation is performed using the second match indicia to pull data from the send buffer and write the data to a memory region in the user space of the process associated with the receiver. The RMA GET operation may be retried one or more times in the event that the send buffer has yet to be exposed.
    Type: Application
    Filed: September 30, 2017
    Publication date: April 4, 2019
    Inventors: Sayantan Sur, Keith Underwood, Ravindra Babu Ganapathi, Andrew Friedley
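The expected/unexpected flow described in this abstract can be illustrated with a minimal Python sketch. All class and parameter names here are illustrative, not the patent's actual interfaces; the exposure timing is modeled synchronously for simplicity.

```python
# Hypothetical simulation of the overlapped rendezvous PUT / RMA GET flow.

class Sender:
    def __init__(self, payload):
        self.payload = payload
        self.exposed = False          # send buffer not yet exposed for RMA

    def put_request(self):
        # PUT request carries the eager payload plus two match keys
        return {"data": self.payload, "match1": "tag42", "match2": "rma-key-7"}

    def expose(self):
        self.exposed = True           # make the send buffer visible to RMA GET

    def rma_get(self, key):
        if not self.exposed:
            return None               # caller must retry until buffer is exposed
        assert key == "rma-key-7"
        return self.payload


def receive(sender, posted_tags, max_retries=3):
    req = sender.put_request()
    if req["match1"] in posted_tags:
        return req["data"]            # expected: consume the eager payload
    # unexpected: pull the data with a (possibly retried) RMA GET
    for _ in range(max_retries):
        data = sender.rma_get(req["match2"])
        if data is not None:
            return data
        sender.expose()               # stand-in for "exposure completes later"
    raise RuntimeError("RMA GET never completed")
```

The expected path consumes the eager payload directly; the unexpected path retries the RMA GET until the send buffer becomes exposed, mirroring the retry behavior the abstract describes.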
  • Publication number: 20190068501
    Abstract: Techniques are disclosed to throttle bandwidth imbalanced data transfers. In some examples, an example computer-implemented method may include splitting a payload of a data transfer operation over a network fabric into multiple chunk get operations, starting the execution of a threshold number of the chunk get operations, and scheduling the remaining chunk get operations for subsequent execution. The method may also include executing a scheduled chunk get operation in response to determining a completion of an executing chunk get operation. In some embodiments, the chunk get operations may be implemented as triggered operations.
    Type: Application
    Filed: August 25, 2017
    Publication date: February 28, 2019
    Applicant: Intel Corporation
    Inventors: Timo Schneider, Keith D. Underwood, Mario Flajslik, Sayantan Sur, James Dinan
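The throttling scheme above can be sketched in a few lines of Python. This is an illustrative model (the function and parameter names are assumptions): at most `threshold` chunk gets are in flight, and each completion triggers the next scheduled chunk.

```python
# Illustrative sketch: split one payload into chunk gets and keep at most
# `threshold` of them in flight at once.

from collections import deque

def chunked_get(payload_size, chunk_size, threshold, issue):
    chunks = deque(range(0, payload_size, chunk_size))  # chunk start offsets
    in_flight = []
    # start only the first `threshold` chunk gets
    while chunks and len(in_flight) < threshold:
        in_flight.append(chunks.popleft())
    completed = []
    while in_flight:
        done = in_flight.pop(0)       # a chunk get completes
        completed.append(issue(done, min(chunk_size, payload_size - done)))
        if chunks:                    # its completion triggers the next one
            in_flight.append(chunks.popleft())
    return completed
```

For a 10-byte payload with 3-byte chunks and a threshold of 2, the chunks complete as (offset, length) pairs (0, 3), (3, 3), (6, 3), (9, 1), with never more than two outstanding at a time.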
  • Publication number: 20190042946
    Abstract: An embodiment of a semiconductor package apparatus may include technology to embed one or more trigger operations in one or more messages related to collective operations for a neural network, and issue the one or more messages related to the collective operations to a hardware-based message scheduler in a desired order of execution. Other embodiments are disclosed and claimed.
    Type: Application
    Filed: September 11, 2018
    Publication date: February 7, 2019
    Applicant: Intel Corporation
    Inventors: Sayantan Sur, James Dinan, Maria Garzaran, Anupama Kurpad, Andrew Friedley, Nusrat Islam, Robert Zak
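A triggered operation, as the abstract uses the term, is a message released only when an event counter reaches a preset threshold. The sketch below models the hardware message scheduler in software; all names are illustrative assumptions.

```python
# Hedged sketch of triggered operations: messages embedded with a trigger
# threshold fire, in their pre-programmed order, once a counter reaches it.

class TriggeredScheduler:
    def __init__(self):
        self.counter = 0
        self.pending = []             # (threshold, message) in desired order
        self.issued = []

    def embed(self, threshold, message):
        self.pending.append((threshold, message))

    def bump(self):                   # e.g. a prior collective step completed
        self.counter += 1
        still = []
        for thr, msg in self.pending:
            if self.counter >= thr:
                self.issued.append(msg)   # fire in the embedded order
            else:
                still.append((thr, msg))
        self.pending = still
```

Embedding the steps of a collective with increasing thresholds lets each completion event release the next message without host-CPU involvement, which is the property the abstract attributes to the hardware-based scheduler.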
  • Publication number: 20180287954
    Abstract: Technologies for offloaded management of communication are disclosed. In order to manage communication with information that may be available to applications in a compute device, the compute device may offload communication management to a host fabric interface using a credit management system. A credit limit is established, and each message to be sent is added to a queue with a corresponding number of credits required to send the message. The host fabric interface of the compute device may send out messages as credits become available and decrease the number of available credits based on the number of credits required to send a particular message. When an acknowledgement of receipt of a message is received, the number of credits required to send the corresponding message may be added back to an available credit pool.
    Type: Application
    Filed: March 29, 2017
    Publication date: October 4, 2018
    Inventors: James Dinan, Sayantan Sur, Mario Flajslik, Keith D. Underwood
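The credit management system described above admits a compact sketch. This is a minimal illustrative model (class and method names are assumptions): each message carries a credit cost, sends proceed while credits remain, and an acknowledgement returns the message's credits to the pool.

```python
# Illustrative credit-managed send queue.

from collections import deque

class CreditedSender:
    def __init__(self, credit_limit):
        self.credits = credit_limit
        self.queue = deque()          # (msg, cost) waiting to be sent
        self.sent = {}                # msg -> cost, awaiting acknowledgement

    def submit(self, msg, cost):
        self.queue.append((msg, cost))
        self._drain()

    def ack(self, msg):
        self.credits += self.sent.pop(msg)  # return credits to the pool
        self._drain()

    def _drain(self):
        # send in order while the head message's cost fits the credit pool
        while self.queue and self.queue[0][1] <= self.credits:
            msg, cost = self.queue.popleft()
            self.credits -= cost
            self.sent[msg] = cost
```

With a limit of 5 credits and two 3-credit messages, the second message waits in the queue until the first is acknowledged, matching the behavior the abstract ascribes to the host fabric interface.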
  • Publication number: 20180267742
    Abstract: Technologies for fine-grained completion tracking of memory buffer accesses include a compute device. The compute device is to establish multiple counter pairs for a memory buffer. Each counter pair includes a locally managed offset and a completion counter. The compute device is also to receive a request from a remote compute device to access the memory buffer, assign one of the counter pairs to the request, advance the locally managed offset of the assigned counter pair by the amount of data to be read or written, and advance the completion counter of the assigned counter pair as the data is read from or written to the memory buffer. Other embodiments are also described and claimed.
    Type: Application
    Filed: March 20, 2017
    Publication date: September 20, 2018
    Inventors: James Dinan, Keith D. Underwood, Sayantan Sur, Charles A. Giefer, Mario Flajslik
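A single counter pair from the abstract can be modeled directly: the locally managed offset advances when an access is assigned, and the completion counter advances as data actually lands, so their difference is the number of bytes still in flight. Names below are illustrative.

```python
# Sketch of one locally-managed-offset / completion-counter pair.

class CounterPair:
    def __init__(self):
        self.offset = 0               # bytes of the buffer reserved so far
        self.completed = 0            # bytes actually read/written so far

    def assign(self, nbytes):
        start = self.offset
        self.offset += nbytes         # reserve [start, start + nbytes)
        return start

    def complete(self, nbytes):
        self.completed += nbytes      # data for a reserved region has landed

    def quiescent(self):
        # all assigned accesses have completed
        return self.completed == self.offset
```

Because each request gets its own reserved region, completions can arrive in any order; the pair only reports quiescence once the completion counter catches up with the offset.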
  • Publication number: 20180183857
    Abstract: Particular embodiments described herein provide for an electronic device that can be configured to consolidate data from one or more processes on a node, where the node is part of a first collection of nodes, communicate the consolidated data to a second node, where the second node is in the first collection of nodes, where the first collection of nodes is part of a first group of a collection of nodes, and communicate the consolidated data to a third node, wherein the third node is in a second collection of nodes, where the second collection of nodes is part of the first group of the collection of nodes. In an example, the node is part of a multi-tiered dragonfly topology network and the data is part of a gather or scatter process.
    Type: Application
    Filed: December 23, 2016
    Publication date: June 28, 2018
    Applicant: Intel Corporation
    Inventors: Akhil Langer, Sayantan Sur
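The consolidation steps the abstract walks through amount to a two-level gather: data is first consolidated within each collection of nodes, then combined across collections. A minimal illustrative sketch, with invented names and flat lists standing in for nodes:

```python
# Hypothetical two-level gather over collections of nodes.

def hierarchical_gather(collections):
    # collections: list of collections, each a list of per-node data lists
    consolidated = []
    for collection in collections:
        local = []                    # step 1: consolidate inside a collection
        for node_data in collection:
            local.extend(node_data)
        consolidated.append(local)
    result = []                       # step 2: combine across collections
    for local in consolidated:
        result.extend(local)
    return result
```

Consolidating locally first reduces the number of inter-collection messages, which is the payoff in a multi-tiered topology such as the dragonfly network the abstract mentions.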
  • Patent number: 9811403
    Abstract: In one embodiment, an apparatus includes: a plurality of queues having a plurality of first entries to store receive information for a process; a master queue having a plurality of second entries to store wild card receive information, where redundant information of the plurality of second entries is to be included in a plurality of redundant entries of the plurality of queues; and a control circuit to match an incoming receive operation within one of the plurality of queues. Other embodiments are described and claimed.
    Type: Grant
    Filed: June 22, 2016
    Date of Patent: November 7, 2017
    Assignee: Intel Corporation
    Inventor: Sayantan Sur
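The master-queue scheme above can be sketched as follows. This is an illustrative model, not the claimed circuit: per-source queues hold ordinary posted receives, a master queue holds wild-card receives, and each wild-card entry is mirrored into every per-source queue so matching only ever consults one queue.

```python
# Sketch of per-source receive queues with a master wild-card queue.

class ReceiveQueues:
    WILD = "*"

    def __init__(self, sources):
        self.queues = {s: [] for s in sources}  # per-source posted receives
        self.master = []                        # wild-card receives

    def post(self, source, tag):
        if source == self.WILD:
            self.master.append(tag)
            for q in self.queues.values():      # redundant copies per queue
                q.append((tag, True))
        else:
            self.queues[source].append((tag, False))

    def match(self, source, tag):
        for i, (t, is_wild) in enumerate(self.queues[source]):
            if t == tag:
                self.queues[source].pop(i)
                if is_wild:                     # consume the master entry and
                    self.master.remove(t)       # its redundant copies
                    for q in self.queues.values():
                        if (t, True) in q:
                            q.remove((t, True))
                return True
        return False
```

When a wild-card receive matches in one queue, its redundant copies in the other queues must be retired too; otherwise a single posted receive could match twice.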
  • Publication number: 20170104692
    Abstract: In an embodiment, at least one interface mechanism may be provided. The mechanism may permit, at least in part, at least one process to allocate, at least in part, and/or configure, at least in part, at least one network-associated object. Such allocation and/or configuration, at least in part, may be in accordance with at least one parameter set that may correspond, at least in part, to at least one query issued by the at least one process via the mechanism. Many modifications are possible without departing from this embodiment.
    Type: Application
    Filed: September 29, 2016
    Publication date: April 13, 2017
    Inventors: William R. Magro, Todd M. Rimmer, Robert J. Woodruff, Mark S. Hefty, Sayantan Sur
  • Patent number: 9560117
    Abstract: An embodiment includes a low-latency mechanism for performing a checkpoint on a distributed application. More specifically, an embodiment of the invention includes processing a first application on a compute node, which is included in a cluster, to produce first computed data and then storing the first computed data in volatile memory included locally in the compute node; halting the processing of the first application, based on an initiated checkpoint, and storing first state data corresponding to the halted first application in the volatile memory; storing the first state information and the first computed data in non-volatile memory included locally in the compute node; and resuming processing of the halted first application and then continuing the processing of the first application to produce second computed data while simultaneously pulling the first state information and the first computed data from the non-volatile memory to an input/output (IO) node.
    Type: Grant
    Filed: December 30, 2011
    Date of Patent: January 31, 2017
    Assignee: Intel Corporation
    Inventors: Mark S. Hefty, Arlin Davis, Robert Woodruff, Sayantan Sur, Shiow-wen Cheng
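The phases of the checkpoint above can be modeled in a few lines. This is a hedged sketch with invented data structures (dicts and a list stand in for volatile memory, node-local NVM, and the IO node), and the "simultaneous" drain is modeled sequentially:

```python
# Sketch of checkpoint-to-local-NVM with background drain to the IO node.

def checkpoint_and_resume(volatile, nonvolatile, io_node):
    # halt: snapshot state and computed data into local NVM (fast, local write)
    nonvolatile["snapshot"] = dict(volatile)
    # resume: the application keeps computing against volatile memory...
    volatile["step"] += 1
    # ...while the IO node pulls the snapshot from NVM concurrently
    io_node.append(nonvolatile["snapshot"])
    return volatile, io_node
```

The latency win the abstract claims comes from the ordering: the application only waits for the local NVM copy, not for the slower transfer to the IO node, which drains the snapshot after computation has already resumed.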
  • Patent number: 9479506
    Abstract: In an embodiment, at least one interface mechanism may be provided. The mechanism may permit, at least in part, at least one process to allocate, at least in part, and/or configure, at least in part, at least one network-associated object. Such allocation and/or configuration, at least in part, may be in accordance with at least one parameter set that may correspond, at least in part, to at least one query issued by the at least one process via the mechanism. Many modifications are possible without departing from this embodiment.
    Type: Grant
    Filed: April 16, 2014
    Date of Patent: October 25, 2016
    Assignee: Intel Corporation
    Inventors: William R. Magro, Todd M. Rimmer, Robert J. Woodruff, Mark S. Hefty, Sayantan Sur
  • Publication number: 20150305006
    Abstract: In an embodiment, at least one interface mechanism may be provided. The mechanism may permit, at least in part, at least one process to allocate, at least in part, and/or configure, at least in part, at least one network-associated object. Such allocation and/or configuration, at least in part, may be in accordance with at least one parameter set that may correspond, at least in part, to at least one query issued by the at least one process via the mechanism. Many modifications are possible without departing from this embodiment.
    Type: Application
    Filed: April 16, 2014
    Publication date: October 22, 2015
    Inventors: William R. Magro, Todd M. Rimmer, Robert J. Woodruff, Mark S. Hefty, Sayantan Sur
  • Publication number: 20140129635
    Abstract: An embodiment includes a low-latency mechanism for performing a checkpoint on a distributed application. More specifically, an embodiment of the invention includes processing a first application on a compute node, which is included in a cluster, to produce first computed data and then storing the first computed data in volatile memory included locally in the compute node; halting the processing of the first application, based on an initiated checkpoint, and storing first state data corresponding to the halted first application in the volatile memory; storing the first state information and the first computed data in non-volatile memory included locally in the compute node; and resuming processing of the halted first application and then continuing the processing of the first application to produce second computed data while simultaneously pulling the first state information and the first computed data from the non-volatile memory to an input/output (IO) node.
    Type: Application
    Filed: December 30, 2011
    Publication date: May 8, 2014
    Inventors: Mark S. Hefty, Arlin Davis, Robert Woodruff, Sayantan Sur, Shiow-wen Cheng