Patents by Inventor Arslan Zulfiqar

Arslan Zulfiqar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240168799
    Abstract: Apparatuses, systems, and techniques to modify graphs. In at least one embodiment, a processor comprises one or more circuits to modify an execution order of at least one graph portion.
    Type: Application
    Filed: November 21, 2022
    Publication date: May 23, 2024
    Inventors: David Fontaine, Houston Thompson Hoffman, Arslan Zulfiqar, Stephen Jones, James Dinan, Jiri Johannes Kraus
  • Publication number: 20230005096
    Abstract: Apparatuses, systems, and techniques to generate one or more graph code nodes to allocate memory. In at least one embodiment, one or more graph code nodes to allocate memory are generated, based on, for example, CUDA or other parallel computing platform code.
    Type: Application
    Filed: November 12, 2021
    Publication date: January 5, 2023
    Inventors: Steven Arthur Gurfinkel, Stephen Anthony Bernard Jones, Jason David Gaiser, Fnu Vishnuswaroop Ramesh, Houston Thompson Hoffman, Michael Francis Carilli, David Anthony Fontaine, Arslan Zulfiqar
  • Publication number: 20230005097
    Abstract: Apparatuses, systems, and techniques to generate one or more graph code nodes to deallocate memory. In at least one embodiment, one or more graph code nodes to deallocate memory are generated, based on, for example, CUDA or other parallel computing platform code.
    Type: Application
    Filed: November 12, 2021
    Publication date: January 5, 2023
    Inventors: Steven Arthur Gurfinkel, Stephen Anthony Bernard Jones, Jason David Gaiser, Fnu Vishnuswaroop Ramesh, Houston Thompson Hoffman, Michael Francis Carilli, David Anthony Fontaine, Arslan Zulfiqar
  • Publication number: 20200372337
    Abstract: System and methods to train a neural network to systematically find a cross-over point, given the number of devices (e.g., Graphical Processing Units) used to train a deep learning (DL) model, that indicates which parallelization strategy to implement when optimizing the training of the DL model on a particular system to achieve maximum efficiency gains.
    Type: Application
    Filed: May 21, 2019
    Publication date: November 26, 2020
    Inventors: Eiman Ebrahimi, Arslan Zulfiqar, Saptadeep Pal
  • Patent number: 9948587
    Abstract: A method for data deduplication during execution of an application on a plurality of computing nodes, including: generating, by a first processor in a first computing node executing the application, a first message to process application data owned by a second computing node executing the application; receiving, by a first network interface (NI) of the first computing node, the first message; extracting, by the first NI, a first key from the first message; determining, by the first NI, the first key is not a duplicate; and placing, by the first NI and in response to the first key not being a duplicate, the first message on a network connecting the first computing node to the second computing node.
    Type: Grant
    Filed: August 8, 2014
    Date of Patent: April 17, 2018
    Assignee: Oracle International Corporation
    Inventors: Herbert Dewitt Schwetman, Jr., Pranay Koka, Arslan Zulfiqar
  • Patent number: 9535842
    Abstract: Each computing node of a distributed computing system may implement a hardware mechanism at the network interface for message driven prefetching of application data. For example, a parallel data-intensive application that employs function shipping may distribute respective portions of a large data set to main memory on multiple computing nodes. The application may send messages to one of the computing nodes referencing data that is stored locally on the node. For each received message, the network interface on the recipient node may extract the reference, initiate the prefetching of referenced data into a local cache (e.g., an LLC), and then store the message for subsequent interpretation and processing by a local processor core. When the processor core retrieves a stored message for processing, the referenced data may already be in the LLC, avoiding a CPU stall while retrieving it from memory. The hardware mechanism may be configured via software.
    Type: Grant
    Filed: August 28, 2014
    Date of Patent: January 3, 2017
    Assignee: Oracle International Corporation
    Inventors: Herbert D. Schwetman, Jr., Mohammad Arslan Zulfiqar, Pranay Koka
  • Publication number: 20160062894
    Abstract: Each computing node of a distributed computing system may implement a hardware mechanism at the network interface for message driven prefetching of application data. For example, a parallel data-intensive application that employs function shipping may distribute respective portions of a large data set to main memory on multiple computing nodes. The application may send messages to one of the computing nodes referencing data that is stored locally on the node. For each received message, the network interface on the recipient node may extract the reference, initiate the prefetching of referenced data into a local cache (e.g., an LLC), and then store the message for subsequent interpretation and processing by a local processor core. When the processor core retrieves a stored message for processing, the referenced data may already be in the LLC, avoiding a CPU stall while retrieving it from memory. The hardware mechanism may be configured via software.
    Type: Application
    Filed: August 28, 2014
    Publication date: March 3, 2016
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Herbert D. Schwetman, JR., Mohammad Arslan Zulfiqar, Pranay Koka
  • Publication number: 20160043977
    Abstract: A method for data deduplication during execution of an application on a plurality of computing nodes, including: generating, by a first processor in a first computing node executing the application, a first message to process application data owned by a second computing node executing the application; receiving, by a first network interface (NI) of the first computing node, the first message; extracting, by the first NI, a first key from the first message; determining, by the first NI, the first key is not a duplicate; and placing, by the first NI and in response to the first key not being a duplicate, the first message on a network connecting the first computing node to the second computing node.
    Type: Application
    Filed: August 8, 2014
    Publication date: February 11, 2016
    Inventors: Herbert Dewitt Schwetman, JR., Pranay Koka, Arslan Zulfiqar
  • Patent number: 8909051
    Abstract: In a multi-chip module (MCM), integrated circuits are coupled by optical waveguides that convey optical signals. The optical waveguides provide dedicated point-to-point optical links between all pairs of the integrated circuits. Moreover, for a given point-to-point optical link between a given pair of integrated circuits, other integrated circuits in the integrated circuits steal access on the given point-to-point optical link when communicating information to one of the given pair of integrated circuits so that the given point-to-point optical link is shared by more than the given pair of integrated circuits. Furthermore, the integrated circuits recover errors in messages in the optical signals corrupted by collisions on the given point-to-point optical link using erasure coding. In this way, the MCM may provide an optical network with increased bandwidth relative to a point-to-point optical network.
    Type: Grant
    Filed: October 9, 2012
    Date of Patent: December 9, 2014
    Assignee: Oracle International Corporation
    Inventors: Arslan Zulfiqar, Pranay Koka, Herbert D. Schwetman, Jr.