Patents by Inventor Ahmed Louri

Ahmed Louri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11502934
    Abstract: With the advent of manycore architecture, on-chip interconnect connects a number of cores, caches, memory modules, accelerators, graphic processing unit (GPU) or chiplets in one system. However, on-chip interconnect architecture consumes a significant portion of total parallel computing chip power. Power-gating is an effective technique to reduce power consumption by powering off the routers, but it suffers from a large wake-up latency to resume the full activity of routers. Recent research aims to improve the wake-up latency penalty by hiding it through early wake-up techniques. However, these techniques do not exploit the full advantage of power-gating due to the early wake-up. Consequently, they do not achieve significant power savings. The present invention provides a new router architecture that remedies the large wake-up latency overheads while providing significant power savings. The invention takes advantage of a simple switch to transmit packets without waking up the router.
    Type: Grant
    Filed: August 21, 2019
    Date of Patent: November 15, 2022
    Assignee: The George Washington Univesity
    Inventors: Hao Zheng, Ahmed Louri
  • Patent number: 11489788
    Abstract: An interconnection network for a processing unit having an array of cores. The interconnection network includes routers and adaptable links that selectively connect routers in the interconnection network. For example, each router may be electrically connected to one or more of the adaptable links via one or more multiplexers and a link controller may control the multiplexers to selectively connect routers via the adaptable links. In another example, adaptable links may be formed as part of an interposer and the link controller selectively connect routers via the adaptable links in the interposer using interposer switches. The adaptable links enable the interconnection network to be dynamically partitioned. Each of those partitions may be dynamically reconfigured to form a topology.
    Type: Grant
    Filed: October 30, 2020
    Date of Patent: November 1, 2022
    Assignee: The George Washington University
    Inventors: Hao Zheng, Ke Wang, Ahmed Louri
  • Patent number: 11483256
    Abstract: Systems and methods are disclosed for reducing latency and power consumption of on-chip movement through an approximate communication framework for network-on-chips (“NoCs”). The technology leverages the fact that big data applications (e.g., recognition, mining, and synthesis) can tolerate modest error and transfers data with the necessary accuracy, thereby improving the energy-efficiency and performance of multi-core processors.
    Type: Grant
    Filed: May 4, 2021
    Date of Patent: October 25, 2022
    Assignee: The George Washington University
    Inventors: Ahmed Louri, Yuechen Chen
  • Publication number: 20220188600
    Abstract: Systems and methods are disclosed for a centrosymmetric convolutional neural network (CSCNN), an algorithm/hardware co-design framework for CNN compression and acceleration that mitigates the effects of computational irregularity and effectively exploits computational reuse and sparsity for increased performance and energy efficiency.
    Type: Application
    Filed: December 15, 2021
    Publication date: June 16, 2022
    Inventors: Jiajun Li, Ahmed Louri
  • Publication number: 20220188613
    Abstract: We introduce SGCNAX, a scalable GCN accelerator architecture for the high-performance and energy-efficient acceleration of GCNs. Unlike prior GCN accelerators that either employ limited loop optimization techniques, or determine the design variables based on random sampling, we systematically explore the loop optimization techniques for GCN acceleration and provide a flexible GCN dataflow that adapts to different GCN configurations to achieve optimal efficiency. We further provide two hardware-based techniques to address the workload imbalance problem caused by the unbalanced distribution of zeros in GCNs. Specifically, SGCNAX exploits an outer-product-based computation architecture that mitigates the intra-PE (Processing Elements) workload imbalance, and employs a group-and-shuffle approach to mitigate the inter-PE workload imbalance.
    Type: Application
    Filed: December 15, 2021
    Publication date: June 16, 2022
    Inventors: Jiajun LI, Ahmed LOURI
  • Publication number: 20210342690
    Abstract: Systems and methods are disclosed for improving on-chip security, while minimizes the latency and cost of security techniques to improve system-level performance and power simultaneously. The framework uses machine learning algorithms, such as an artificial neural network (ANN), for runtime attack detection with higher accuracy. Further, a learning-based attack mitigation method using deep reinforcement learning is disclosed, where the method may be used to isolate the malicious components and to optimize network latency and energy-efficiency.
    Type: Application
    Filed: May 4, 2021
    Publication date: November 4, 2021
    Inventors: Ke WANG, Hao ZHENG, Ahmed LOURI
  • Publication number: 20210344617
    Abstract: Systems and methods are disclosed for reducing latency and power consumption of on-chip movement through an approximate communication framework for network-on-chips (“NoCs”). The technology leverages the fact that big data applications (e.g., recognition, mining, and synthesis) can tolerate modest error and transfers data with the necessary accuracy, thereby improving the energy-efficiency and performance of multi-core processors.
    Type: Application
    Filed: May 4, 2021
    Publication date: November 4, 2021
    Inventors: Ahmed LOURI, Yuecben Chen
  • Publication number: 20210344618
    Abstract: An interconnection network for a processing unit having an array of cores. The interconnection network includes routers and adaptable links that selectively connect routers in the interconnection network. For example, each router may be electrically connected to one or more of the adaptable links via one or more multiplexers and a link controller may control the multiplexers to selectively connect routers via the adaptable links. In another example, adaptable links may be formed as part of an interposer and the link controller selectively connect routers via the adaptable links in the interposer using interposer switches. The adaptable links enable the interconnection network to be dynamically partitioned. Each of those partitions may be dynamically reconfigured to form a topology.
    Type: Application
    Filed: October 30, 2020
    Publication date: November 4, 2021
    Inventors: Hao ZHENG, Ke WANG, Ahmed LOURI
  • Publication number: 20200067814
    Abstract: With the advent of manycore architecture, on-chip interconnect connects a number of cores, caches, memory modules, accelerators, graphic processing unit (GPU) or chiplets in one system. However, on-chip interconnect architecture consumes a significant portion of total parallel computing chip power. Power-gating is an effective technique to reduce power consumption by powering off the routers, but it suffers from a large wake-up latency to resume the full activity of routers. Recent research aims to improve the wake-up latency penalty by hiding it through early wake-up techniques. However, these techniques do not exploit the full advantage of power-gating due to the early wake-up. Consequently, they do not achieve significant power savings. The present invention provides a new router architecture that remedies the large wake-up latency overheads while providing significant power savings. The invention takes advantage of a simple switch to transmit packets without waking up the router.
    Type: Application
    Filed: August 21, 2019
    Publication date: February 27, 2020
    Inventors: Hao ZHENG, Ahmed Louri
  • Publication number: 20200067637
    Abstract: A proactive fault-tolerant scheme which improves performance and energy efficiency for NoCs. The fault-tolerant scheme allows routers to switch among several different fault-tolerant operations. Each operation mode has different trade-offs among fault-tolerant capability, retransmission traffic, latency, and energy efficiency. Another example provides a proactive, dynamic control policy to balance and optimize the dynamic interactions and trade-offs. The example control policy uses example machine learning algorithm called reinforcement learning (RL). The example RL-based controller independently observes a set of NoC system parameters at runtime, and over time they evolve optimal per-router control policies. By automatically and optimally switching among the four fault-tolerant modes, the trained control policy results in minimizing system level network latency and maximizing energy efficiency while detecting and correcting errors.
    Type: Application
    Filed: August 21, 2019
    Publication date: February 27, 2020
    Inventors: Ke WANG, Ahmed LOURI
  • Patent number: 10148593
    Abstract: A first example provides a circuit configured to operate in four modes. A first mode includes propagating data from a first terminal of the circuit to a second terminal of the circuit. A second mode includes propagating data from the second terminal of the circuit to the first terminal of the circuit. A third mode includes storing data received by the first terminal. A fourth mode includes storing data received by the second terminal. A second example provides a circuit configured to cause one or more communication links to operate in one of two modes based on data traffic detected on the one or more communication links. The first mode includes propagating data from a first router to a second router. The second mode includes propagating data to the first router from the second router.
    Type: Grant
    Filed: February 13, 2015
    Date of Patent: December 4, 2018
    Assignees: Ohio University, Arizona Board of Regents on Behalf of the University of Arizona
    Inventors: Avinash Karanth Kodi, Dominic Ditomaso, Ahmed Louri
  • Publication number: 20170048166
    Abstract: A first example provides a circuit configured to operate in four modes. A first mode includes propagating data from a first terminal of the circuit to a second terminal of the circuit. A second mode includes propagating data from the second terminal of the circuit to the first terminal. A fourth mode includes storing data received by the second terminal. A second example provides a circuit configured to cause one or more communication links to operate in one of two modes based on data traffic detected on the one or more communication links. The first mode includes propagating data from a first router to a second router. The second mode includes propagating data to the first router from the second router.
    Type: Application
    Filed: February 13, 2015
    Publication date: February 16, 2017
    Inventors: Avinash Karanth Kodi, Dominic Ditomaso, Ahmed Louri
  • Publication number: 20150263972
    Abstract: The present invention provides methods for detecting and correcting transmission errors in inter-router links of Network-on-Chip (NoC) architectures. A NoC has repeaters along its bus lines. The output of a main repeater is compared and multiplexed with the output of a shadow repeater. If these outputs are the same the multiplexer outputs the output of the main repeater, otherwise an error is detected and the multiplexer outputs the output of the shadow repeater.
    Type: Application
    Filed: May 13, 2015
    Publication date: September 17, 2015
    Inventors: Ahmed Louri, Janet Meiling Wang Roveda, Avinash K. Kodi, Ashwini Sarathy
  • Publication number: 20110191646
    Abstract: The present invention provides methods for detecting and correcting transmission errors in inter-router links of Network-on-Chip (NoC) architectures. A NoC has repeaters along its bus lines. The output of a main repeater is compared and multiplexed with the output of a shadow repeater. If these outputs are the same the multiplexer outputs the output of the main repeater, otherwise an error is detected and the multiplexer outputs the output of the shadow repeater.
    Type: Application
    Filed: April 6, 2009
    Publication date: August 4, 2011
    Applicants: on behalf of the University of Arizona, Ohio University
    Inventors: Ahmed Louri, Janet Meiling Wang Roveda, Avinash K. Kodi, Ashwini Sarathy
  • Publication number: 20100002581
    Abstract: The present invention provides methods for connecting routers and transmitting data along inter-router links within Nework-on-Chip (NoC) architectures.
    Type: Application
    Filed: April 6, 2009
    Publication date: January 7, 2010
    Applicant: The Arizona Board of Regents on Behalf of The University of Arizona
    Inventors: Ahmed Louri, Janet Meiling Wang Roveda, Avinash K. Kodi, Ashwini Sarathy