Patents by Inventor Karthikeyan Vaithianathan

Karthikeyan Vaithianathan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10120691
    Abstract: In one embodiment, a processor includes an accelerator, a decoder to decode a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction, and an execution unit to execute the first decoded instruction to, for a thread executing on the accelerator that is to be placed in an inactive state, cause a save of context information for the thread, and a save of a vector identifying the accelerator corresponding to the context information, and execute the second decoded instruction to read the vector to determine the accelerator to restore saved context information into for the thread, read the saved context information, and restore the saved context information into the accelerator.
    Type: Grant
    Filed: July 19, 2016
    Date of Patent: November 6, 2018
    Assignee: INTEL CORPORATION
    Inventors: Boris Ginzburg, Ronny Ronen, Eliezer Weissmann, Karthikeyan Vaithianathan, Ehud Cohen
  • Patent number: 9658829
    Abstract: Embodiments of a near optimal configurable adder tree for arbitrary shaped 2D block sum of absolute differences (SAD) calculation engine are generally described herein. Other embodiments may be described and claimed. In some embodiments, a configurable two-dimensional adder tree architecture for computing a sum of absolute differences (SAD) for various block sizes up to 16 by 16 comprises a first stage of one-dimensional adder trees and a second stage of one-dimensional adder trees, wherein each one-dimensional adder tree comprises an input routing network, a plurality of adder units, and an output routing network.
    Type: Grant
    Filed: October 19, 2009
    Date of Patent: May 23, 2017
    Assignee: Intel Corporation
    Inventors: Karthikeyan Vaithianathan, Arvind Sudarsanam
  • Publication number: 20160328234
    Abstract: In one embodiment, a processor includes an accelerator, a decoder to decode a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction, and an execution unit to execute the first decoded instruction to, for a thread executing on the accelerator that is to be placed in an inactive state, cause a save of context information for the thread, and a save of a vector identifying the accelerator corresponding to the context information, and execute the second decoded instruction to read the vector to determine the accelerator to restore saved context information into for the thread, read the saved context information, and restore the saved context information into the accelerator.
    Type: Application
    Filed: July 19, 2016
    Publication date: November 10, 2016
    Inventors: BORIS GINZBURG, RONNY RONEN, ELIEZER WEISSMANN, KARTHIKEYAN VAITHIANATHAN, EHUD COHEN
  • Patent number: 9396020
    Abstract: An apparatus is described having multiple cores, each core having: a) an accelerator; and, b) a general purpose CPU coupled to the accelerator. The general purpose CPU has functional unit logic circuitry to execute an instruction that returns an amount of storage space to store context information of the accelerator.
    Type: Grant
    Filed: March 30, 2012
    Date of Patent: July 19, 2016
    Assignee: Intel Corporation
    Inventors: Boris Ginzburg, Ronny Ronen, Eliezer Weissmann, Karthikeyan Vaithianathan, Ehud Cohen
  • Patent number: 9152572
    Abstract: Some implementations disclosed herein provide techniques and arrangements for an specialized logic engine that includes translation lookaside buffer to support multiple threads executing on multiple cores. The translation lookaside buffer enables the specialized logic engine to directly access a virtual address of a thread executing on one of the plurality of processing cores. For example, an acceleration compute engine may receive one or more instructions from a thread executed by a processing core. The acceleration compute engine may retrieve, based on an address space identifier associated with the one or more instructions, a physical address associated with the one or more instructions from the translation lookaside buffer to execute the one or more instructions using the physical address.
    Type: Grant
    Filed: December 30, 2011
    Date of Patent: October 6, 2015
    Assignee: Intel Corporation
    Inventors: Ronny Ronen, Boris Ginzburg, Eliezer Weissmann, Karthikeyan Vaithianathan
  • Publication number: 20130262816
    Abstract: Some implementations disclosed herein provide techniques and arrangements for an specialized logic engine that includes translation lookaside buffer to support multiple threads executing on multiple cores. The translation lookaside buffer enables the specialized logic engine to directly access a virtual address of a thread executing on one of the plurality of processing cores. For example, an acceleration compute engine may receive one or more instructions from a thread executed by a processing core. The acceleration compute engine may retrieve, based on an address space identifier associated with the one or more instructions, a physical address associated with the one or more instructions from the translation lookaside buffer to execute the one or more instructions using the physical address.
    Type: Application
    Filed: December 30, 2011
    Publication date: October 3, 2013
    Inventors: Ronny Ronen, Boris Ginzburg, Eliezer Weissmann, Karthikeyan Vaithianathan
  • Publication number: 20130027416
    Abstract: Apparatus, systems and methods are described including dividing cache lines into at least most significant portions and next most significant portions, storing cache line contents in a register array so that the most significant portion of each cache line is stored in a first row of the register array and the next most significant portion of each cache line is stored in a second row of the register array. Contents of a first register portion of the first row may be provided to a barrel shifter where the contents may be aligned and then stored in a buffer.
    Type: Application
    Filed: July 25, 2011
    Publication date: January 31, 2013
    Inventors: Karthikeyan Vaithianathan, Bhargava G. Reddy
  • Publication number: 20110093518
    Abstract: Embodiments of a near optimal configurable adder tree for arbitrary shaped 2D block sum of absolute differences (SAD) calculation engine are generally described herein. Other embodiments may be described and claimed. In some embodiments, a configurable two-dimensional adder tree architecture for computing a sum of absolute differences (SAD) for various block sizes up to 16 by 16 comprises a first stage of one-dimensional adder trees and a second stage of one-dimensional adder trees, wherein each one-dimensional adder tree comprises an input routing network, a plurality of adder units, and an output routing network.
    Type: Application
    Filed: October 19, 2009
    Publication date: April 21, 2011
    Inventors: Karthikeyan Vaithianathan, Arvind Sudarsanam