Patents by Inventor Surendra Byna

Surendra Byna has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8643656
    Abstract: A method includes configuring a shared library, stored in a memory, to be loaded into applications to intercept graphics processing unit (GPU) computation requests for different types of workload kernels corresponding to the applications. The method further includes generating a power prediction and a performance prediction for at least one candidate kernel combination for execution on a GPU responsive to the GPU computation requests. The at least one candidate kernel combination pertains to at least two of the workload kernels. The method also includes rendering a decision of whether to execute the at least one candidate kernel combination or to execute the at least two of the workload kernels pertaining thereto separately, based on the power prediction and the performance prediction.
    Type: Grant
    Filed: September 8, 2011
    Date of Patent: February 4, 2014
    Assignee: NEC Laboratories America, Inc.
    Inventors: Dong Li, Surendra Byna, Srimat Chakradhar
  • Patent number: 8375392
    Abstract: Systems and methods for data-aware scheduling of applications on a heterogeneous platform having at least one central processing unit (CPU) and at least one accelerator. Such systems and methods include a function call handling module configured to intercept, analyze, and schedule library calls on a processing element. The function call handling module further includes a function call interception module configured to intercept function calls to predefined libraries, a function call analysis module configured to analyze argument size and location, and a function call redirection module configured to schedule library calls and data transfers. The systems and methods also use a memory unification module, configured to keep data coherent between memories associated with the at least one CPU and the at least one accelerator based on the output of the function call redirection module.
    Type: Grant
    Filed: August 20, 2010
    Date of Patent: February 12, 2013
    Assignee: NEC Laboratories America, Inc.
    Inventors: Michela Becchi, Surendra Byna, Srihari Cadambi, Srimat Chakradhar
  • Publication number: 20120081373
    Abstract: A method includes configuring a shared library, stored in a memory, to be loaded into applications to intercept graphics processing unit (GPU) computation requests for different types of workload kernels corresponding to the applications. The method further includes generating a power prediction and a performance prediction for at least one candidate kernel combination for execution on a GPU responsive to the GPU computation requests. The at least one candidate kernel combination pertains to at least two of the workload kernels. The method also includes rendering a decision of whether to execute the at least one candidate kernel combination or to execute the at least two of the workload kernels pertaining thereto separately, based on the power prediction and the performance prediction.
    Type: Application
    Filed: September 8, 2011
    Publication date: April 5, 2012
    Applicant: NEC Laboratories America, Inc.
    Inventors: Dong Li, Surendra Byna, Srimat Chakradhar
  • Publication number: 20110173155
    Abstract: Systems and methods for data-aware scheduling of applications on a heterogeneous platform having at least one central processing unit (CPU) and at least one accelerator. Such systems and methods include a function call handling module configured to intercept, analyze, and schedule library calls on a processing element. The function call handling module further includes a function call interception module configured to intercept function calls to predefined libraries, a function call analysis module configured to analyze argument size and location, and a function call redirection module configured to schedule library calls and data transfers. The systems and methods also use a memory unification module, configured to keep data coherent between memories associated with the at least one CPU and the at least one accelerator based on the output of the function call redirection module.
    Type: Application
    Filed: August 20, 2010
    Publication date: July 14, 2011
    Applicant: NEC Laboratories America, Inc.
    Inventors: Michela Becchi, Surendra Byna, Srihari Cadambi, Srimat Chakradhar
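
As an illustration only, the data-aware scheduling idea in the abstract above (analyze argument size and location, then pick a processing element and move data accordingly) can be sketched in a few lines of Python. This is a toy model and not the patented method: the cost constants, the `Arg` type, and the functions `transfer_time`, `choose_target`, and `dispatch` are all hypothetical names and numbers invented for this sketch.

```python
from dataclasses import dataclass

# Hypothetical cost-model constants; none of these values come from the patent.
TRANSFER_BYTES_PER_SEC = 1e9   # assumed CPU<->accelerator link bandwidth
CPU_SEC_PER_BYTE = 2e-9        # assumed CPU compute cost per argument byte
ACCEL_SEC_PER_BYTE = 0.5e-9    # assumed accelerator compute cost per argument byte
ACCEL_LAUNCH_SEC = 20e-6       # assumed fixed overhead of an accelerator call


@dataclass
class Arg:
    size_bytes: int
    location: str  # "cpu" or "accel": where the up-to-date copy currently lives


def transfer_time(args, target):
    """Time to move every argument that is not already resident on `target`."""
    moved = sum(a.size_bytes for a in args if a.location != target)
    return moved / TRANSFER_BYTES_PER_SEC


def choose_target(args):
    """Pick the processing element with the lowest estimated total time."""
    total = sum(a.size_bytes for a in args)
    cost = {
        "cpu": total * CPU_SEC_PER_BYTE + transfer_time(args, "cpu"),
        "accel": (total * ACCEL_SEC_PER_BYTE + ACCEL_LAUNCH_SEC
                  + transfer_time(args, "accel")),
    }
    return min(cost, key=cost.get)


def dispatch(args):
    """Stand-in for an intercepted library call: schedule it, then record
    that the arguments now reside on the chosen processing element."""
    target = choose_target(args)
    for a in args:
        a.location = target  # crude stand-in for keeping memories coherent
    return target
```

Under these assumed constants, a 16 KB argument that already resides on the accelerator is scheduled there, while the same argument residing in CPU memory stays on the CPU because the transfer and launch overhead outweigh the faster compute.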