Patents by Inventor Dejan S. Milojicic

Dejan S. Milojicic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240111993
    Abstract: Systems and methods are provided for performing object store offloading. A user query can be received from a client device to access a data object. The semantic structure associated with the data object can be identified, as well as one or more relationships associated with the semantic structure of the data object. A view of the data object can be determined based on the one or more relationships and said view can be provided to a user interface.
    Type: Application
    Filed: September 30, 2022
    Publication date: April 4, 2024
    Inventors: DARIO KOROLIJA, Kun Wu, Sai Rahul Chalamalasetti, Lance Mackimmie Evans, Dejan S. Milojicic
  • Publication number: 20240111970
    Abstract: In some examples, a device includes a first processing core comprising a resistive memory array to perform an analog computation, and a digital processing core comprising a digital memory programmable with different values to perform different computations responsive to respective different conditions. The device further includes a controller to selectively apply input data to the first processing core and the digital processing core.
    Type: Application
    Filed: December 4, 2023
    Publication date: April 4, 2024
    Inventors: John Paul Strachan, Dejan S. Milojicic, Martin Foltin, Sai Rahul Chalamalasetti, Amit S. Sharma
  • Publication number: 20240112029
    Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.
    Type: Application
    Filed: December 5, 2023
    Publication date: April 4, 2024
    Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, John Paul Strachan, Sergey Serebryakov
  • Publication number: 20240045726
    Abstract: Systems and methods are provided for maintaining a desired efficiency of use of resources in a computing system, such as a high performance computing (HPC) system in conjunction with a desired quality of service (QoS) associated with performance of an application executed by the resources. Efficiency and QoS may be considered together, and the provided systems and methods optimize both during application runtime.
    Type: Application
    Filed: July 27, 2022
    Publication date: February 8, 2024
    Inventors: KENNETH LEACH, DEJAN S. MILOJICIC, MAXIM ALT
  • Publication number: 20240036938
    Abstract: Systems and methods are provided for a modular switch system that comprises disaggregated components, plugins, and managers that enable flexibility to adjust the dynamic configuration of a switch system. This can create modularity and customizability at different times of the lifecycle of the currently configured switch system.
    Type: Application
    Filed: July 28, 2022
    Publication date: February 1, 2024
    Inventors: DEJAN S. MILOJICIC, DUNCAN ROWETH, DEREK SCHUMACHER
  • Publication number: 20240020155
    Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.
    Type: Application
    Filed: September 28, 2023
    Publication date: January 18, 2024
    Inventors: Dejan S. Milojicic, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
  • Patent number: 11868855
    Abstract: In exemplary aspects, a golden data structure can be used to validate the stability of machine learning (ML) models and weights. The golden data structure includes golden input data and corresponding golden output data. The golden output data represents the known correct results that should be output by a ML model when it is run with the golden input data as inputs. The golden data structure can be stored in a secure memory and retrieved for validation separately or together with the deployment of the ML model for a requested ML operation. If the golden data structure is used to validate the model and/or weights concurrently with the performance of the requested operation, the golden input data is combined with the input data for the requested operation and run through the model. Relevant outputs are compared with the golden output data to validate the stability of the model and weights.
    Type: Grant
    Filed: November 4, 2019
    Date of Patent: January 9, 2024
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Sai Rahul Chalamalasetti, Sergey Serebryakov, Dejan S. Milojicic
  • Patent number: 11861429
    Abstract: In some examples, a device includes a first processing core comprising a resistive memory array to perform an analog computation, and a digital processing core comprising a digital memory programmable with different values to perform different computations responsive to respective different conditions. The device further includes a controller to selectively apply input data to the first processing core and the digital processing core.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: January 2, 2024
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: John Paul Strachan, Dejan S. Milojicic, Martin Foltin, Sai Rahul Chalamalasetti, Amit S. Sharma
  • Patent number: 11853846
    Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: December 26, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, John Paul Strachan, Sergey Serebryakov
  • Patent number: 11809218
    Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.
    Type: Grant
    Filed: March 11, 2021
    Date of Patent: November 7, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Dejan S. Milojicic, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
  • Publication number: 20230281682
    Abstract: Examples related to providing cloud-based computational services to a requesting entity having an infrastructure management agent. Revenue accounting corresponding to a first time period is performed for computational services utilized by the requesting entity. If revenue has changed over the first period of time beyond a pre-selected threshold amount, the infrastructure management agent adjusts quality of service (QoS) parameters or modifies billing discounts for the cloud-based computational services.
    Type: Application
    Filed: March 7, 2022
    Publication date: September 7, 2023
    Inventors: Dejan S. Milojicic, Kenneth Leach, Max Alt
  • Patent number: 11650953
    Abstract: A method of computing in memory, the method including inputting a packet including data into a computing memory unit having a control unit, loading the data into at least one computing in memory micro-unit, processing the data in the computing in memory micro-unit, and outputting the processed data. Also, a computing in memory system including a computing in memory unit having a control unit, wherein the computing in memory unit is configured to receive a packet having data and a computing in memory micro-unit disposed in the computing in memory unit, the computing in memory micro-unit having at least one of a memory matrix and a logic elements matrix.
    Type: Grant
    Filed: October 16, 2020
    Date of Patent: May 16, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Dejan S. Milojicic, Kirk M. Bresniker, Paolo Faraboschi, John Paul Strachan
  • Patent number: 11561607
    Abstract: Encoding of domain logic rules in an analog content addressable memory (aCAM) is disclosed. By encoding domain logic in an aCAM, rapid and flexible search capabilities are enabled, including the capability to search ranges of analog values, fuzzy match capabilities, and optimized parameter search capabilities. This is achieved with low latency by using only a small number of clock cycles at low power. A domain logic ruleset may be represented using various data structures such as decision trees, directed graphs, or the like. These representations can be converted to a table of values, where each table column can be directly mapped to a corresponding row of the aCAM.
    Type: Grant
    Filed: October 30, 2020
    Date of Patent: January 24, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Catherine Graves, Can Li, John Paul Strachan, Dejan S. Milojicic, Kimberly Keeton
  • Publication number: 20220291952
    Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.
    Type: Application
    Filed: March 11, 2021
    Publication date: September 15, 2022
    Inventors: DEJAN S. MILOJICIC, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
  • Patent number: 11397836
    Abstract: Quantifying power usage for a service. An example method may include identifying a dependency model for the service, the dependency model based in part on infrastructure providing the service. The method may also include determining power usage for the service using the dependency model.
    Type: Grant
    Filed: November 9, 2018
    Date of Patent: July 26, 2022
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Yuan Chen, Dejan S Milojicic, Daniel Juergen Gmach, Cullen E. Bash
  • Patent number: 11385863
    Abstract: Disclosed techniques provide for dynamically changing precision of a multi-stage compute process. For example, changing neural network (NN) parameters on a per-layer basis depending on properties of incoming data streams and per-layer performance of an NN among other considerations. NNs include multiple layers that may each be calculated with a different degree of accuracy and therefore, compute resource overhead (e.g., memory, processor resources, etc.). NNs are usually trained with 32-bit or 16-bit floating-point numbers. Once trained, an NN may be deployed in production. One approach to reduce compute overhead is to reduce parameter precision of NNs to 16 or 8 for deployment. The conversion to an acceptable lower precision is usually determined manually before deployment and precision levels are fixed while deployed.
    Type: Grant
    Filed: August 1, 2018
    Date of Patent: July 12, 2022
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, Sergey Serebryakov, John Paul Strachan
  • Publication number: 20220138204
    Abstract: Encoding of domain logic rules in an analog content addressable memory (aCAM) is disclosed. By encoding domain logic in an aCAM, rapid and flexible search capabilities are enabled, including the capability to search ranges of analog values, fuzzy match capabilities, and optimized parameter search capabilities. This is achieved with low latency by using only a small number of clock cycles at low power. A domain logic ruleset may be represented using various data structures such as decision trees, directed graphs, or the like. These representations can be converted to a table of values, where each table column can be directly mapped to a corresponding row of the aCAM.
    Type: Application
    Filed: October 30, 2020
    Publication date: May 5, 2022
    Inventors: CATHERINE GRAVES, CAN LI, JOHN PAUL STRACHAN, DEJAN S. MILOJICIC, KIMBERLY KEETON
  • Publication number: 20220121885
    Abstract: Testing for bias in a machine learning (ML) model in a manner that is independent of the code/weights deployment path is described. If bias is detected, an alert for bias is generated, and optionally, the ML model can be incrementally re-trained to mitigate the detected bias. Re-training the ML model to mitigate the bias may include enforcing a bias cost function to maintain a level of bias in the ML model below a threshold bias level. One or more statistical metrics representing the level of bias present in the ML model may be determined and compared against one or more threshold values. If one or more metrics exceed corresponding threshold value(s), the level of bias in the ML model may be deemed to exceed a threshold level of bias, and re-training of the ML model to mitigate the bias may be initiated.
    Type: Application
    Filed: October 19, 2020
    Publication date: April 21, 2022
    Inventors: Sai Rahul Chalamalasetti, Dejan S. Milojicic, Sergey Serebryakov
  • Patent number: 11294763
    Abstract: A computer system includes multiple memory array components that include respective analog memory arrays which are sequenced to implement a multi-layer process. An error array data structure is obtained for at least a first memory array component, and from which a determination is made as to whether individual nodes (or cells) of the error array data structure are significant. A determination can be made as to any remedial operations that can be performed to mitigate errors of significance.
    Type: Grant
    Filed: August 28, 2018
    Date of Patent: April 5, 2022
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: John Paul Strachan, Catherine Graves, Dejan S. Milojicic, Paolo Faraboschi, Martin Foltin, Sergey Serebryakov
  • Patent number: 11182134
    Abstract: Systems and methods are provided for optimizing parameters of a system across an entire stack, including algorithms layer, toolchain layer, execution or runtime layer, and hardware layer. Results from the layer-specific optimization functions of each domain can be consolidated using one or more consolidation optimization functions to consolidate the layer-specific optimization results, capturing the relationship between the different layers of the stack. Continuous monitoring of the programming model during execution may be implemented and can enable the programming model to self-adjust based on real-time performance metrics. In this way, programmers and system administrators are relieved of the need for domain knowledge and are offered a systematic way for continuous optimization (rather than an ad hoc approach).
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: November 23, 2021
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Dejan S. Milojicic, Mehmet Kivanc Ozonat, Sergey Serebryakov