Patents by Inventor Dheemanth Nagaraj
Dheemanth Nagaraj has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12294368Abstract: The present disclosure is directed to 3-D stacked architecture for Programmable Fabrics and Central Processing Units (CPUs). The 3-D stacked orientation enables reconfigurability of the fabric, and allows the fabric to function using coarse-grained and fine-grained acceleration for offloading CPU processing. Additionally, the programmable fabric may be able to function to interface with multiple other compute chiplet components in the 3-D stacked orientation. This enables multiple compute components to communicate without the need for offloading the data communications between the compute chiplets.Type: GrantFiled: September 24, 2021Date of Patent: May 6, 2025Assignee: Intel CorporationInventors: Rahul Pal, Dheeraj Subbareddy, Mahesh Kumashikar, Dheemanth Nagaraj, Rajesh Vivekanandham, Anshuman Thakur, Ankireddy Nalamalpu, Md Altaf Hossain, Atul Maheshwari
-
Publication number: 20240118892Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.Type: ApplicationFiled: December 18, 2023Publication date: April 11, 2024Inventors: Swagath VENKATARAMANI, Dipankar DAS, Ashish RANJAN, Subarno BANERJEE, Sasikanth AVANCHA, Ashok JAGANNATHAN, Ajaya V. DURG, Dheemanth NAGARAJ, Bharat KAUL, Anand RAGHUNATHAN
-
Publication number: 20220050683Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.Type: ApplicationFiled: October 26, 2021Publication date: February 17, 2022Inventors: Swagath VENKATARAMANI, Dipankar DAS, Ashish RANJAN, Subarno BANERJEE, Sasikanth AVANCHA, Ashok JAGANNATHAN, Ajaya V. DURG, Dheemanth NAGARAJ, Bharat KAUL, Anand RAGHUNATHAN
-
Publication number: 20220044123Abstract: Processors may be enhanced by embedding programmable logic devices, such as field-programmable gate arrays. For instance, an application-specific integrated circuit device may include main fixed function circuitry operable to perform a main fixed function of the application-specific integrated circuit device. The application-specific integrated circuit also includes a support processor that performs operations outside of the main fixed function of the application-specific integrated circuit device, wherein the support processor comprises an embedded programmable fabric to provide programmable flexibility to application-specific integrated circuit device.Type: ApplicationFiled: September 24, 2021Publication date: February 10, 2022Inventors: Rajesh Vivekanandham, Dheeraj Subbareddy, Dheemanth Nagaraj, Vijay S. R. Degalahal, Anshuman Thakur, Ankireddy Nalamalpu, MD Altaf Hossain, Mahesh Kumashikar, Atul Maheshwari
-
Publication number: 20220014202Abstract: The present disclosure is directed to 3-D stacked architecture for Programmable Fabrics and Central Processing Units (CPUs). The 3-D stacked orientation enables reconfigurability of the fabric, and allows the fabric to function using coarse-grained and fine-grained acceleration for offloading CPU processing. Additionally, the programmable fabric may be able to function to interface with multiple other compute chiplet components in the 3-D stacked orientation. This enables multiple compute components to communicate without the need for offloading the data communications between the compute chiplets.Type: ApplicationFiled: September 24, 2021Publication date: January 13, 2022Inventors: Rahul Pal, Dheeraj Subbareddy, Mahesh Kumashikar, Dheemanth Nagaraj, Rajesh Vivekanandham, Anshuman Thakur, Ankireddy Nalamalpu, MD Altaf Hossain, Atul Maheshwari
-
Publication number: 20190303743Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.Type: ApplicationFiled: September 27, 2016Publication date: October 3, 2019Inventors: Swagath VENKATARAMANI, Dipankar DAS, Ashish RANJAN, Subarno BANERJEE, Sasikanth AVANCHA, Ashok JAGANNATHAN, Ajaya V. DURG, Dheemanth NAGARAJ, Bharat KAUL, Anand RAGHUNATHAN
-
Patent number: 9298629Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: GrantFiled: November 21, 2014Date of Patent: March 29, 2016Assignee: Intel CorporationInventors: Sailesh Kottapalli, Henk G. Neefs, Rahul Pal, Manoj K. Arora, Dheemanth Nagaraj
-
Patent number: 9235520Abstract: Embodiments of the invention describe a cache coherency protocol that eliminates the need for ordering between message classes and also eliminates home tracker preallocation. Embodiments of the invention describe a less complex conflict detection and resolution mechanism (at the home agent) without any performance degradation in form of bandwidth or latency compared to prior art solutions. Embodiments of the invention describe a home agent that may receive request messages, e.g., data ownership request messages and data request messages, which include issuance data indicating an order of the respective message issued. Said home agent may determine whether an early or late conflict exists based, at least in part, on a received conflict response message and the issuance data of a most recent completed transaction.Type: GrantFiled: December 20, 2011Date of Patent: January 12, 2016Assignee: Intel CorporationInventors: Manoj K. Arora, Robert G. Blankenship, Rahul Pal, Dheemanth Nagaraj
-
Publication number: 20150081977Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: ApplicationFiled: November 21, 2014Publication date: March 19, 2015Inventors: SAILESH KOTTAPALLI, HENK G. NEEFS, RAHUL PAL, MANOJ K. ARORA, DHEEMANTH NAGARAJ
-
Patent number: 8918592Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: GrantFiled: December 27, 2013Date of Patent: December 23, 2014Assignee: Intel CorporationInventors: Sailesh Kottapalli, Henk G. Neefs, Rahul Pal, Manoj K. Arora, Dheemanth Nagaraj
-
Publication number: 20140359230Abstract: Embodiments of the invention describe a cache coherency protocol that eliminates the need for ordering between message classes and also eliminates home tracker preallocation. Embodiments of the invention describe a less complex conflict detection and resolution mechanism (at the home agent) without any performance degradation in form of bandwidth or latency compared to prior art solutions. Embodiments of the invention describe a home agent that may receive request messages, e.g., data ownership request messages and data request messages, which include issuance data indicating an order of the respective message issued. Said home agent may determine whether an early or late conflict exists based, at least in part, on a received conflict response message and the issuance data of a most recent completed transaction.Type: ApplicationFiled: December 20, 2011Publication date: December 4, 2014Inventors: Manoj K. Arora, Robert G. Blankenship, Rahul Pal, Dheemanth Nagaraj
-
Patent number: 8862918Abstract: Systems and methods of operating a computing system may involve identifying a plurality of state values, wherein each state value corresponds to a computing thread associated with a processor. An average value can be determined for the plurality of state values, wherein a determination may be made as to whether to grant a frequency boost request based at least in part on the average value.Type: GrantFiled: July 1, 2011Date of Patent: October 14, 2014Assignee: Intel CorporationInventors: Baskaran Ganesan, James S. Burns, Suresh Sugumar, Devadatta V. Bodas, Sundaravarathan R. Iyengar, Feranak Nelson, Dheemanth Nagaraj, Russell J. Fenger
-
Patent number: 8799586Abstract: Methods and apparatus relating to memory mirroring and migration at a Home Agent (HA) are described. In one embodiment, a home agent may mirror its data at a slave agent. In some embodiments, a bit in a directory may indicate status of cache lines. Other embodiments are also disclosed.Type: GrantFiled: September 30, 2009Date of Patent: August 5, 2014Assignee: Intel CorporationInventors: Ganesh Kumar, Dheemanth Nagaraj, Vincent R. Freytag, Eric Delano, Gregory S. Averill
-
Patent number: 8782347Abstract: In one embodiment, a method includes receiving a read request from a first caching agent and if a directory entry associated with the request is in an unknown state, an invalidating snoop message is sent to at least one other caching agent to invalidate information in a cache location of the other caching agent corresponding to the location of the read request, to enable setting of the directory entry into a known state. Other embodiments are described and claimed.Type: GrantFiled: June 26, 2009Date of Patent: July 15, 2014Assignee: Intel CorporationInventors: Ganesh Kumar, Dheemanth Nagaraj, Vincent R. Freytag, Eric DeLano, Gregory S. Averill
-
Publication number: 20140115274Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: ApplicationFiled: December 27, 2013Publication date: April 24, 2014Inventors: SAILESH KOTTAPALLI, HENK G. NEEFS, RAHUL PAL, MANOJ K. ARORA, DHEEMANTH NAGARAJ
-
Patent number: 8656115Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: GrantFiled: August 20, 2010Date of Patent: February 18, 2014Assignee: Intel CorporationInventors: Sailesh Kottapalli, Henk G. Neefs, Rahul Pal, Manoj K. Arora, Dheemanth Nagaraj
-
Patent number: 8473791Abstract: A method comprises detecting a defective area in a Dynamic Random Access Memory (DRAM). The method further comprises establishing a redundant memory buffer at a per-memory module level. The method still further comprises loading the redundant memory buffer with a copy of data from the defective area. The method additionally comprises substituting data from the redundant memory buffer for data stored in the defective area upon a memory access to the defective area.Type: GrantFiled: April 30, 2007Date of Patent: June 25, 2013Assignee: Hewlett-Packard Development Company, L.P.Inventors: Mark Shaw, Larry Thayer, Chris Petersen, Lidia Warnes, Dheemanth Nagaraj
-
Publication number: 20130007475Abstract: Systems and methods of operating a computing system may involve identifying a plurality of state values, wherein each state value corresponds to a computing thread associated with a processor. An average value can be determined for the plurality of state values, wherein a determination may be made as to whether to grant a frequency boost request based at least in part on the average value.Type: ApplicationFiled: July 1, 2011Publication date: January 3, 2013Inventors: Baskaran Ganesan, James S. Burns, Suresh Sugumar, Devadatta V. Bodas, Sundaravarathan R. Iyengar, Feranak Nelson, Dheemanth Nagaraj, Russell J. Fenger
-
Patent number: 8327228Abstract: Methods and apparatus relating to home agent data and memory management are described. In one embodiment, a scrubber logic corrects an error at a location in a memory corresponding to a target address by writing back the corrected version of data to the target location. In an embodiment, a map out logic maps out an index or way of a directory cache in response to a number of errors, corresponding to the directory cache, exceeding a threshold value. Other embodiments are also disclosed.Type: GrantFiled: September 30, 2009Date of Patent: December 4, 2012Assignee: Intel CorporationInventors: Ganesh Kumar, Dheemanth Nagaraj, Vincent R. Freytag, Eric Delano, Gregory S. Averill
-
Publication number: 20120047333Abstract: In one embodiment, a method includes receiving a read request from a first caching agent, determining whether a directory entry associated with the memory location indicates that the information is not present in a remote caching agent, and if so, transmitting the information from the memory location to the first caching agent before snoop processing with respect to the read request is completed. Other embodiments are described and claimed.Type: ApplicationFiled: August 20, 2010Publication date: February 23, 2012Inventors: SAILESH KOTTAPALLI, Henk G. Neefs, Rahul Pal, Manoj K. Arora, Dheemanth Nagaraj