Patents by Inventor Dejan S. Milojicic

Dejan S. Milojicic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

OBJECT STORE OFFLOADING

Publication number: 20240111993

Abstract: Systems and methods are provided for performing object store offloading. A user query can be received from a client device to access a data object. The semantic structure associated with the data object can be identified, as well as one or more relationships associated with the semantic structure of the data object. A view of the data object can be determined based on the one or more relationships and said view can be provided to a user interface.

Type: Application

Filed: September 30, 2022

Publication date: April 4, 2024

Inventors: DARIO KOROLIJA, Kun Wu, Sai Rahul Chalamalasetti, Lance Mackimmie Evans, Dejan S. Milojicic
RESISTIVE AND DIGITAL PROCESSING CORES

Publication number: 20240111970

Abstract: In some examples, a device includes a first processing core comprising a resistive memory array to perform an analog computation, and a digital processing core comprising a digital memory programmable with different values to perform different computations responsive to respective different conditions. The device further includes a controller to selectively apply input data to the first processing core and the digital processing core.

Type: Application

Filed: December 4, 2023

Publication date: April 4, 2024

Inventors: John Paul Strachan, Dejan S. Milojicic, Martin Foltin, Sai Rahul Chalamalasetti, Amit S. Sharma
ACCELERATION OF MODEL/WEIGHT PROGRAMMING IN MEMRISTOR CROSSBAR ARRAYS

Publication number: 20240112029

Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.

Type: Application

Filed: December 5, 2023

Publication date: April 4, 2024

Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, John Paul Strachan, Sergey Serebryakov
RUNTIME-SUSTAINED QOS AND OPTIMIZED RESOURCE EFFICIENCY

Publication number: 20240045726

Abstract: Systems and methods are provided for maintaining a desired efficiency of use of resources in a computing system, such as a high performance computing (HPC) system in conjunction with a desired quality of service (QoS) associated with performance of an application executed by the resources. Efficiency and QoS may be considered together, and the provided systems and methods optimize both during application runtime.

Type: Application

Filed: July 27, 2022

Publication date: February 8, 2024

Inventors: KENNETH LEACH, DEJAN S. MILOJICIC, MAXIM ALT
DYNAMICALLY MODULAR AND CUSTOMIZABLE COMPUTING ENVIRONMENTS

Publication number: 20240036938

Abstract: Systems and methods are provided for a modular switch system that comprises disaggregated components, plugins, and managers that enable flexibility to adjust the dynamic configuration of a switch system. This can create modularity and customizability at different times of the lifecycle of the currently configured switch system.

Type: Application

Filed: July 28, 2022

Publication date: February 1, 2024

Inventors: DEJAN S. MILOJICIC, DUNCAN ROWETH, DEREK SCHUMACHER
OPTIMAL DISPATCHING OF FUNCTION-AS-A-SERVICE IN HETEROGENEOUS ACCELERATOR ENVIRONMENTS

Publication number: 20240020155

Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.

Type: Application

Filed: September 28, 2023

Publication date: January 18, 2024

Inventors: Dejan S. Milojicic, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
Resiliency for machine learning workloads

Patent number: 11868855

Abstract: In exemplary aspects, a golden data structure can be used to validate the stability of machine learning (ML) models and weights. The golden data structure includes golden input data and corresponding golden output data. The golden output data represents the known correct results that should be output by a ML model when it is run with the golden input data as inputs. The golden data structure can be stored in a secure memory and retrieved for validation separately or together with the deployment of the ML model for a requested ML operation. If the golden data structure is used to validate the model and/or weights concurrently with the performance of the requested operation, the golden input data is combined with the input data for the requested operation and run through the model. Relevant outputs are compared with the golden output data to validate the stability of the model and weights.

Type: Grant

Filed: November 4, 2019

Date of Patent: January 9, 2024

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Sai Rahul Chalamalasetti, Sergey Serebryakov, Dejan S. Milojicic
Resistive and digital processing cores

Patent number: 11861429

Abstract: In some examples, a device includes a first processing core comprising a resistive memory array to perform an analog computation, and a digital processing core comprising a digital memory programmable with different values to perform different computations responsive to respective different conditions. The device further includes a controller to selectively apply input data to the first processing core and the digital processing core.

Type: Grant

Filed: April 30, 2018

Date of Patent: January 2, 2024

Assignee: Hewlett Packard Enterprise Development LP

Inventors: John Paul Strachan, Dejan S. Milojicic, Martin Foltin, Sai Rahul Chalamalasetti, Amit S. Sharma
Acceleration of model/weight programming in memristor crossbar arrays

Patent number: 11853846

Abstract: A crossbar array includes a number of memory elements. An analog-to-digital converter (ADC) is electronically coupled to the vector output register. A digital-to-analog converter (DAC) is electronically coupled to the vector input register. A processor is electronically coupled to the ADC and to the DAC. The processor may be configured to determine whether division of input vector data by output vector data from the crossbar array is within a threshold value, and if not within the threshold value, determine changed data values as between the output vector data and the input vector data, and write the changed data values to the memory elements of the crossbar array.

Type: Grant

Filed: April 30, 2018

Date of Patent: December 26, 2023

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, John Paul Strachan, Sergey Serebryakov
Optimal dispatching of function-as-a-service in heterogeneous accelerator environments

Patent number: 11809218

Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.

Type: Grant

Filed: March 11, 2021

Date of Patent: November 7, 2023

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Dejan S. Milojicic, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
INFRASTRUCTURE MANAGEMENT AGENT MANAGING PARAMETERS CORRESPONDING TO CLOUD-BASED COMPUTATIONAL SERVICES

Publication number: 20230281682

Abstract: Examples related to providing cloud-based computational services to a requesting entity having an infrastructure management agent. Revenue accounting corresponding to a first time period is performed for computational services utilized by the requesting entity. If revenue has changed over the first period of time beyond a pre-selected threshold amount, the infrastructure management agent adjusts quality of service (QoS) parameters or modifies billing discounts for the cloud-based computational services.

Type: Application

Filed: March 7, 2022

Publication date: September 7, 2023

Inventors: Dejan S. Milojicic, Kenneth Leach, Max Alt
Methods and systems for computing in memory

Patent number: 11650953

Abstract: A method of computing in memory, the method including inputting a packet including data into a computing memory unit having a control unit, loading the data into at least one computing in memory micro-unit, processing the data in the computing in memory micro-unit, and outputting the processed data. Also, a computing in memory system including a computing in memory unit having a control unit, wherein the computing in memory unit is configured to receive a packet having data and a computing in memory micro-unit disposed in the computing in memory unit, the computing in memory micro-unit having at least one of a memory matrix and a logic elements matrix.

Type: Grant

Filed: October 16, 2020

Date of Patent: May 16, 2023

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Dejan S. Milojicic, Kirk M. Bresniker, Paolo Faraboschi, John Paul Strachan
Accelerating constrained, flexible, and optimizable rule look-ups in hardware

Patent number: 11561607

Abstract: Encoding of domain logic rules in an analog content addressable memory (aCAM) is disclosed. By encoding domain logic in an aCAM, rapid and flexible search capabilities are enabled, including the capability to search ranges of analog values, fuzzy match capabilities, and optimized parameter search capabilities. This is achieved with low latency by using only a small number of clock cycles at low power. A domain logic ruleset may be represented using various data structures such as decision trees, directed graphs, or the like. These representations can be converted to a table of values, where each table column can be directly mapped to a corresponding row of the aCAM.

Type: Grant

Filed: October 30, 2020

Date of Patent: January 24, 2023

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Catherine Graves, Can Li, John Paul Strachan, Dejan S. Milojicic, Kimberly Keeton
OPTIMAL DISPATCHING OF FUNCTION-AS-A-SERVICE IN HETEROGENEOUS ACCELERATOR ENVIRONMENTS

Publication number: 20220291952

Abstract: Systems and methods are provided for incorporating an optimized dispatcher with an FaaS infrastructure to permit and restrict access to resources. For example, the dispatcher may assign requests to “warm” resources and initiate a fault process if the resource is overloaded or a cache-miss is identified (e.g., by restarting or rebooting the resource). The warm instances or accelerators associated with the allocation size that are identified may be commensurate to the demand and help dynamically route requests to faster accelerators.

Type: Application

Filed: March 11, 2021

Publication date: September 15, 2022

Inventors: DEJAN S. MILOJICIC, Kimberly Keeton, Paolo Faraboschi, Cullen E. Bash
Quantifying power usage for a service

Patent number: 11397836

Abstract: Quantifying power usage for a service. An example method may include identifying a dependency model for the service, the dependency model based in part on infrastructure providing the service. The method may also include determining power usage for the service using the dependency model.

Type: Grant

Filed: November 9, 2018

Date of Patent: July 26, 2022

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Yuan Chen, Dejan S Milojicic, Daniel Juergen Gmach, Cullen E. Bash
Adjustable precision for multi-stage compute processes

Patent number: 11385863

Abstract: Disclosed techniques provide for dynamically changing precision of a multi-stage compute process. For example, changing neural network (NN) parameters on a per-layer basis depending on properties of incoming data streams and per-layer performance of an NN among other considerations. NNs include multiple layers that may each be calculated with a different degree of accuracy and therefore, compute resource overhead (e.g., memory, processor resources, etc.). NNs are usually trained with 32-bit or 16-bit floating-point numbers. Once trained, an NN may be deployed in production. One approach to reduce compute overhead is to reduce parameter precision of NNs to 16 or 8 for deployment. The conversion to an acceptable lower precision is usually determined manually before deployment and precision levels are fixed while deployed.

Type: Grant

Filed: August 1, 2018

Date of Patent: July 12, 2022

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Sai Rahul Chalamalasetti, Paolo Faraboschi, Martin Foltin, Catherine Graves, Dejan S. Milojicic, Sergey Serebryakov, John Paul Strachan
ACCELERATING CONSTRAINED, FLEXIBLE, AND OPTIMIZABLE RULE LOOK-UPS IN HARDWARE

Publication number: 20220138204

Abstract: Encoding of domain logic rules in an analog content addressable memory (aCAM) is disclosed. By encoding domain logic in an aCAM, rapid and flexible search capabilities are enabled, including the capability to search ranges of analog values, fuzzy match capabilities, and optimized parameter search capabilities. This is achieved with low latency by using only a small number of clock cycles at low power. A domain logic ruleset may be represented using various data structures such as decision trees, directed graphs, or the like. These representations can be converted to a table of values, where each table column can be directly mapped to a corresponding row of the aCAM.

Type: Application

Filed: October 30, 2020

Publication date: May 5, 2022

Inventors: CATHERINE GRAVES, CAN LI, JOHN PAUL STRACHAN, DEJAN S. MILOJICIC, KIMBERLY KEETON
MACHINE LEARNING MODEL BIAS DETECTION AND MITIGATION

Publication number: 20220121885

Abstract: Testing for bias in a machine learning (ML) model in a manner that is independent of the code/weights deployment path is described. If bias is detected, an alert for bias is generated, and optionally, the ML model can be incrementally re-trained to mitigate the detected bias. Re-training the ML model to mitigate the bias may include enforcing a bias cost function to maintain a level of bias in the ML model below a threshold bias level. One or more statistical metrics representing the level of bias present in the ML model may be determined and compared against one or more threshold values. If one or more metrics exceed corresponding threshold value(s), the level of bias in the ML model may be deemed to exceed a threshold level of bias, and re-training of the ML model to mitigate the bias may be initiated.

Type: Application

Filed: October 19, 2020

Publication date: April 21, 2022

Inventors: Sai Rahul Chalamalasetti, Dejan S. Milojicic, Sergey Serebryakov
Determining significance levels of error values in processes that include multiple layers

Patent number: 11294763

Abstract: A computer system includes multiple memory array components that include respective analog memory arrays which are sequenced to implement a multi-layer process. An error array data structure is obtained for at least a first memory array component, and from which a determination is made as to whether individual nodes (or cells) of the error array data structure are significant. A determination can be made as to any remedial operations that can be performed to mitigate errors of significance.

Type: Grant

Filed: August 28, 2018

Date of Patent: April 5, 2022

Assignee: Hewlett Packard Enterprise Development LP

Inventors: John Paul Strachan, Catherine Graves, Dejan S. Milojicic, Paolo Faraboschi, Martin Foltin, Sergey Serebryakov
Self-adjustable end-to-end stack programming

Patent number: 11182134

Abstract: Systems and methods are provided for optimizing parameters of a system across an entire stack, including algorithms layer, toolchain layer, execution or runtime layer, and hardware layer. Results from the layer-specific optimization functions of each domain can be consolidated using one or more consolidation optimization functions to consolidate the layer-specific optimization results, capturing the relationship between the different layers of the stack. Continuous monitoring of the programming model during execution may be implemented and can enable the programming model to self-adjust based on real-time performance metrics. In this way, programmers and system administrators are relieved of the need for domain knowledge and are offered a systematic way for continuous optimization (rather than an ad hoc approach).

Type: Grant

Filed: February 24, 2020

Date of Patent: November 23, 2021

Assignee: Hewlett Packard Enterprise Development LP

Inventors: Dejan S. Milojicic, Mehmet Kivanc Ozonat, Sergey Serebryakov

1 2 3 4 5 … next