Patents by Inventor Michael Mantor

Michael Mantor has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Configurable multiple-die graphics processing unit

Patent number: 12361628

Abstract: A graphics processing unit (GPU) of a processing system is partitioned into multiple dies (referred to as GPU chiplets) that are configurable to collectively function and interface with an application as a single GPU in a first mode and as multiple GPUs in a second mode. By dividing the GPU into multiple GPU chiplets, the processing system flexibly and cost-effectively configures an amount of active GPU physical resources based on an operating mode. In addition, a configurable number of GPU chiplets are assembled into a single GPU, such that multiple different GPUs having different numbers of GPU chiplets can be assembled using a small number of tape-outs and a multiple-die GPU can be constructed out of GPU chiplets that implement varying generations of technology.

Type: Grant

Filed: December 8, 2022

Date of Patent: July 15, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Mark Fowler, Samuel Naffziger, Michael Mantor, Mark Leather
LOCAL LAUNCH IN WORKGROUP PROCESSORS

Publication number: 20250217195

Abstract: Workgroup processors associated with a shader program interface are provided with local launchers capable of launching shader threads partially or completely independently from the shader program interface. The local launchers maintain local queues separately from the shader program interface. The local launchers allocate resources for shader thread execution at an associated workgroup processor either directly or through a request to the shader program interface. In some implementations, the shader program interface leases resources to the local launcher in response to a request for resources and terminates the lease when the local launcher notifies the shader program interface that execution of the shader thread is complete.

Type: Application

Filed: December 30, 2023

Publication date: July 3, 2025

Inventors: Matthaeus G. Chajdas, Christopher J. Brennan, Michael Mantor, Dominik Joerg Baumeister, Fabian Robert Sebastian Wildgrube, Nicolai Haehnle, Randy Ramsey
RECONFIGURABLE VIRTUAL GRAPHICS AND COMPUTE PROCESSOR PIPELINE

Publication number: 20250191111

Abstract: A plurality of programmable processing cores is configured to process graphics primitives and corresponding data and a plurality of fixed-function hardware units. The processing cores and the fixed-function hardware units are configured to implement a configurable number of virtual pipelines. Each virtual pipeline includes a configurable number of fragments and an operational state of each virtual pipeline is specified by a different context. The configurable number of virtual pipelines can be modified from a first number to a second number that is different than the first number. An emulation of a fixed-function hardware unit can be instantiated on one or more of the graphics processing cores in response to detection of a bottleneck in a fixed-function hardware unit. One or more of the virtual pipelines can then be reconfigured to utilize the emulation instead of the fixed-function hardware unit.

Type: Application

Filed: February 25, 2025

Publication date: June 12, 2025

Inventors: Timour T. Paltashev, Michael Mantor, Rex Eldon McCrary
Vertical and horizontal broadcast of shared operands

Patent number: 12327124

Abstract: An array processor includes processor element arrays distributed in rows and columns. The processor element arrays perform operations on parameter values. The array processor also includes memory interfaces that broadcast sets of the parameter values to mutually exclusive subsets of the rows and columns of the processor element arrays. In some cases, the array processor includes single-instruction-multiple-data (SIMD) units including subsets of the processor element arrays in corresponding rows, workgroup processors (WGPs) including subsets of the SIMD units, and a memory fabric configured to interconnect with an external memory that stores the parameter values. The memory interfaces broadcast the parameter values to the SIMD units that include the processor element arrays in rows associated with the memory interfaces and columns of processor element arrays that are implemented across the SIMD units in the WGPs. The memory interfaces access the parameter values from the external memory via the memory fabric.

Type: Grant

Filed: March 30, 2023

Date of Patent: June 10, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Sateesh Lagudu, Allen H. Rush, Michael Mantor, Arun Vaidyanathan Ananthanarayan, Prasad Nagabhushanamgari, Maxim V. Kazakov
Dual vector arithmetic logic unit

Patent number: 12299413

Abstract: A processing system executes wavefronts at multiple arithmetic logic unit (ALU) pipelines of a single instruction multiple data (SIMD) unit in a single execution cycle. The ALU pipelines each include a number of ALUs that execute instructions on wavefront operands that are collected from vector general-purpose register (VGPR) banks at a cache and output results of the instructions executed on the wavefronts at a buffer. By storing wavefronts supplied by the VGPR banks at the cache, a greater number of wavefronts can be made available to the SIMD unit without increasing the VGPR bandwidth, enabling multiple ALU pipelines to execute instructions during a single execution cycle.

Type: Grant

Filed: January 16, 2024

Date of Patent: May 13, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Bin He, Brian Emberling, Mark Leather, Michael Mantor
FLOATING POINT BIAS SWITCHING

Publication number: 20250130774

Abstract: The disclosed circuit can interpret a bit sequence as a value based on one of multiple floating point number formats in a bias mode indicated by a bias mode indicator. The circuit can perform an operation using the value in the bias mode. Various other methods, systems, and computer-readable media are also disclosed.

Type: Application

Filed: December 22, 2023

Publication date: April 24, 2025

Applicant: Advanced Micro Devices, Inc.

Inventors: Shubh Shah, Ashutosh Garg, Bin He, Michael Mantor, Shubra Marwaha, Subramaniam Maiyuran
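
Illustrative sketch (not part of the patent record): the floating point bias switching abstract above describes interpreting the same bit sequence under different exponent biases. The Python below is one way to picture that in software, assuming an 8-bit format with 4 exponent bits and 3 mantissa bits. The function name `decode_fp8`, the table `BIAS_BY_MODE`, and the two bias values are assumptions made for illustration; special values (infinity, NaN) are deliberately not modeled.

```python
# Hypothetical mapping from a bias mode indicator to an exponent bias.
# The values 7 and 8 are illustrative assumptions, not taken from the abstract.
BIAS_BY_MODE = {0: 7, 1: 8}

EXP_BITS = 4  # assumed exponent width
MAN_BITS = 3  # assumed mantissa width


def decode_fp8(bits: int, bias_mode: int) -> float:
    """Interpret an 8-bit pattern as a float using the bias selected by bias_mode."""
    bias = BIAS_BY_MODE[bias_mode]
    sign = -1.0 if (bits >> (EXP_BITS + MAN_BITS)) & 1 else 1.0
    exponent = (bits >> MAN_BITS) & ((1 << EXP_BITS) - 1)
    mantissa = bits & ((1 << MAN_BITS) - 1)
    if exponent == 0:
        # Subnormal: no implicit leading one, fixed minimum exponent.
        return sign * mantissa * 2.0 ** (1 - bias - MAN_BITS)
    # Normal: implicit leading one, exponent offset by the selected bias.
    return sign * (1 + mantissa / (1 << MAN_BITS)) * 2.0 ** (exponent - bias)
```

Under these assumptions the same pattern `0x38` decodes to 1.0 in mode 0 and to 0.5 in mode 1, which is the effect of switching the bias without changing the stored bits.
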
STOCHASTIC ROUNDING CIRCUIT

Publication number: 20250130769

Abstract: The disclosed circuit is configured to round a value in a first number format using a random value. Using the rounded value, the circuit can convert the rounded value to a second number format that has a lower precision than a precision of the first number format. Various other methods, systems, and computer-readable media are also disclosed.

Type: Application

Filed: December 22, 2023

Publication date: April 24, 2025

Applicant: Advanced Micro Devices, Inc.

Inventors: Shubh Shah, Ashutosh Garg, Bin He, Michael Mantor, Shubra Marwaha, Subramaniam Maiyuran
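
Illustrative sketch (not part of the patent record): the stochastic rounding abstract above describes rounding a value with a random value and then converting it to a lower-precision format. A common software analogue, shown here as an assumption rather than as the disclosed circuit, adds random bits below the target precision and truncates. The formats (32-bit float to a bfloat16-style 16-bit layout) and the function names are hypothetical choices; only finite inputs well below the overflow range are handled.

```python
import random
import struct


def f32_to_bits(x: float) -> int:
    """Return the IEEE 754 single-precision bit pattern of x."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_f32(bits: int) -> float:
    """Return the float whose single-precision bit pattern is bits."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]


def stochastic_round_to_bf16(x: float, rng: random.Random) -> float:
    """Round x to a bfloat16-representable value using 16 random bits.

    The 16 low bits that the narrower format drops are compared against
    random noise by adding the noise and truncating, so the chance of
    rounding away from zero equals the discarded fraction.
    """
    noise = rng.getrandbits(16)
    rounded = (f32_to_bits(x) + noise) & 0xFFFF0000
    return bits_to_f32(rounded)
```

With this scheme a value that is already representable in the narrower format is returned unchanged, and the average of many rounded samples approaches the original value, which is the property that distinguishes stochastic rounding from round-to-nearest.
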
FLOATING-POINT CONVERSION CIRCUIT

Publication number: 20250130767

Abstract: The disclosed circuit can select micro-operations specifically for converting a value in a first number format to a second number format. The circuit can include micro-operations for various conversions between different number formats, including number formats of different floating-point precisions. Various other methods, systems, and computer-readable media are also disclosed.

Type: Application

Filed: October 21, 2024

Publication date: April 24, 2025

Applicant: Advanced Micro Devices, Inc.

Inventors: Shubh Shah, Ashutosh Garg, Bin He, Michael Mantor, Shubra Marwaha, Subramaniam Maiyuran
MULTI-FORMAT OPERAND CIRCUIT

Publication number: 20250130794

Abstract: The disclosed processing circuit can perform an operation with a first operand having a first number format and a second operand having a second number format by directly using the first operand in the first number format and the second operand in the second number format to produce an output result. Various other methods, systems, and computer-readable media are also disclosed.

Type: Application

Filed: December 28, 2023

Publication date: April 24, 2025

Applicant: Advanced Micro Devices, Inc.

Inventors: Shubh Shah, Ashutosh Garg, Bin He, Michael Mantor, Shubra Marwaha, Subramaniam Maiyuran
ACCELERATION UNIT WITH MODULAR ARCHITECTURE

Publication number: 20250117352

Abstract: A processing system includes one or more accelerator units (AUs) each having a modular architecture. To this end, each AU includes a connection circuitry and one or more memory stacks disposed on the connection circuitry. Further, each AU includes one or more interposer dies each disposed on the connection circuitry such that each interposer die of the one or more interposer dies is communicatively coupled to a corresponding memory stack of the memory stacks via the connection circuitry. Further, each interposer die of each AU includes circuitry configured to concurrently support two or more types of compute dies.

Type: Application

Filed: October 9, 2024

Publication date: April 10, 2025

Inventors: Alan D. Smith, Michael Mantor, Mark Fowler, Vydhyanathan Kalyanasundharam, Samuel Naffziger
MULTIMODAL PERSONA CONFIGURATION FOR NON-PLAYABLE CHARACTERS

Publication number: 20250111578

Abstract: Methods and systems are provided for generating a stylized representation of a non-player character (NPC) in a virtual environment. A multimodal plurality of inputs regarding characteristics of the NPC is received, which is processed to generate visual data representing the NPC's appearance and to generate behavior data representing the NPC's actions. The generated visual data and behavior data are adapted to a selected character model to create an adapted configuration model, which is used to generate rendering information for the NPC.

Type: Application

Filed: September 27, 2024

Publication date: April 3, 2025

Inventors: Karthik Mohan Kumar, Archana Ramalingam, Michael Mantor, Pedro Antonio Pena
Reconfigurable virtual graphics and compute processor pipeline

Patent number: 12254527

Abstract: A graphics processing unit (GPU) includes a plurality of programmable processing cores configured to process graphics primitives and corresponding data and a plurality of fixed-function hardware units. The plurality of processing cores and the plurality of fixed-function hardware units are configured to implement a configurable number of virtual pipelines to concurrently process different command flows. Each virtual pipeline includes a configurable number of fragments and an operational state of each virtual pipeline is specified by a different context. The configurable number of virtual pipelines can be modified from a first number to a second number that is different than the first number. An emulation of a fixed-function hardware unit can be instantiated on one or more of the graphics processing cores in response to detection of a bottleneck in a fixed-function hardware unit. One or more of the virtual pipelines can then be reconfigured to utilize the emulation instead of the fixed-function hardware unit.

Type: Grant

Filed: May 21, 2020

Date of Patent: March 18, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Timour T. Paltashev, Michael Mantor, Rex Eldon McCrary
STREAMING WAVE COALESCER CIRCUIT

Publication number: 20250068429

Abstract: A Streaming Wave Coalescer (SWC) circuit stores a first set of state values associated with a first subset of threads of a first wave in a bin based on each of the first subset of threads including a first set of instructions to be executed. A second set of state values associated with a second subset of threads of a second wave is stored in the bin based on each of the second subset of threads including the first set of instructions to be executed and based on the first wave and the second wave both being associated with a hard key. A third wave is formed from the threads of the first subset and the second subset and is emitted for execution. As a result of reorganizing the threads and reconstituting a different wave, thread divergence of waves sent for execution is reduced.

Type: Application

Filed: December 12, 2023

Publication date: February 27, 2025

Inventors: John Stephen Junkins, Christopher J. Brennan, Ian Richard Beaumont, Kellie Marks, Matthaeus G. Chajdas, Max Oberberger, Michael John Bedy, Michael Mantor, Sean Keely
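
Illustrative sketch (not part of the patent record): the streaming wave coalescer abstract above describes binning thread state by the instructions each thread will execute next, restricted to waves sharing a hard key, and emitting a new wave from a bin. The Python below models that regrouping in software. The class name `WaveCoalescer`, the `wave_size` parameter, and the policy of emitting a bin as soon as it fills are assumptions for illustration.

```python
from collections import defaultdict


class WaveCoalescer:
    """Toy model: regroup threads so each emitted wave shares one target."""

    def __init__(self, wave_size: int = 4):
        # wave_size is a hypothetical parameter; real wave widths differ.
        self.wave_size = wave_size
        # One bin per (hard_key, target) pair; threads with different
        # hard keys are never mixed, even if they share a target.
        self.bins = defaultdict(list)

    def submit(self, hard_key, threads):
        """Add (thread_state, target) pairs from one incoming wave.

        Returns a list of (target, [thread_state, ...]) waves that became
        full as a result of this submission.
        """
        emitted = []
        for state, target in threads:
            bin_ = self.bins[(hard_key, target)]
            bin_.append(state)
            if len(bin_) == self.wave_size:
                emitted.append((target, list(bin_)))
                bin_.clear()
        return emitted
```

For example, two incoming waves whose threads split evenly between two targets produce two outgoing waves, each uniform in its target, so neither outgoing wave diverges.
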
HIERARCHICAL WORK SCHEDULING

Publication number: 20250068464

Abstract: A method for hierarchical work scheduling includes consuming a work item at a first scheduling domain having a local scheduler circuit and one or more workgroup processing elements. Consuming the work item produces a set of new work items. Subsequently, the local scheduler circuit distributes at least one new work item of the set of new work items to be executed locally at the first scheduling domain. If the local scheduler circuit of the first scheduling domain determines that the set of new work items includes one or more work items that would overload the first scheduling domain with work if scheduled for local execution, those work items are distributed to the next higher-level scheduler circuit in a scheduling domain hierarchy for redistribution to one or more other scheduling domains.

Type: Application

Filed: November 8, 2024

Publication date: February 27, 2025

Inventors: Matthaeus G. Chajdas, Christopher J. Brennan, Michael Mantor, Robert W. Martin, Nicolai Haehnle
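
Illustrative sketch (not part of the patent record): the hierarchical work scheduling abstract above describes a local scheduler that keeps newly produced work items it can handle and passes the rest to a higher-level scheduler for redistribution. The Python below models a two-level hierarchy. The class name `Scheduler`, the fixed `capacity` used as the overload test, and the "most free slots" placement rule are assumptions for illustration.

```python
from collections import deque


class Scheduler:
    """Toy scheduler node: leaves hold work, the root only redistributes."""

    def __init__(self, name, capacity=0, parent=None):
        self.name = name
        self.capacity = capacity  # hypothetical overload threshold
        self.parent = parent
        self.children = []
        self.queue = deque()      # work accepted for local execution
        self.pending = []         # work no domain could accept yet
        if parent is not None:
            parent.children.append(self)

    def free_slots(self):
        return self.capacity - len(self.queue)

    def schedule_new_work(self, items):
        """Keep what fits locally; send the overflow one level up."""
        overflow = []
        for item in items:
            if self.free_slots() > 0:
                self.queue.append(item)
            else:
                overflow.append(item)
        if overflow:
            if self.parent is not None:
                self.parent.redistribute(overflow, source=self)
            else:
                self.pending.extend(overflow)

    def redistribute(self, items, source):
        """Place each item on the sibling domain with the most free slots."""
        for item in items:
            candidates = [c for c in self.children
                          if c is not source and c.free_slots() > 0]
            if candidates:
                max(candidates, key=Scheduler.free_slots).queue.append(item)
            else:
                self.pending.append(item)
```

A short usage example: with `root = Scheduler("root")`, `a = Scheduler("a", 2, root)` and `b = Scheduler("b", 3, root)`, calling `a.schedule_new_work(["w1", "w2", "w3", "w4"])` keeps two items on `a` and moves the other two to `b` through `root`.
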
Processing unit with small footprint arithmetic logic unit

Patent number: 12217021

Abstract: A parallel processing unit employs an arithmetic logic unit (ALU) having a relatively small footprint, thereby reducing the overall power consumption and circuit area of the processing unit. To support the smaller footprint, the ALU includes multiple stages to execute operations corresponding to a received instruction. The ALU executes at least one operation at a precision indicated by the received instruction, and then reduces the resulting data of the at least one operation to a smaller size before providing the results to another stage of the ALU to continue execution of the instruction.

Type: Grant

Filed: July 7, 2023

Date of Patent: February 4, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Bin He, Shubh Shah, Michael Mantor
Spatial partitioning in a multi-tenancy graphics processing unit

Patent number: 12205218

Abstract: A graphics processing unit (GPU) or other apparatus includes a plurality of shader engines. The apparatus also includes a first front end (FE) circuit and one or more second FE circuits. The first FE circuit is configured to schedule geometry workloads for the plurality of shader engines in a first mode. The first FE circuit is configured to schedule geometry workloads for a first subset of the plurality of shader engines and the one or more second FE circuits are configured to schedule geometry workloads for a second subset of the plurality of shader engines in a second mode. In some cases, a partition switch is configured to selectively connect the first FE circuit or the one or more second FE circuits to the second subset of the plurality of shader engines depending on whether the apparatus is in the first mode or the second mode.

Type: Grant

Filed: March 29, 2022

Date of Patent: January 21, 2025

Assignee: Advanced Micro Devices, Inc.

Inventors: Mark Leather, Michael Mantor
ADAPTIVE MULTIMODAL FUSING FOR NON-PLAYER CHARACTER GENERATION AND CONFIGURATION

Publication number: 20240424407

Abstract: Systems and techniques for generating and animating non-player characters (NPCs) within virtual digital environments are provided. Multimodal input data is received that comprises a plurality of input modalities for interaction with an NPC having a set of body features and a set of facial features. The multimodal input data is processed through one or more neural networks to generate animation sequences for both the body features and facial features of the NPC. Generating such animation sequences includes disentangling the multimodal input data to generate substantially disentangled latent representations, combining these representations with the multimodal input data, and using a large-language model (LLM) to generate speech data for the NPC. Further processing using reverse diffusion generates face vertex displacement data and joint trajectory data based on the combined representation and generated speech data.

Type: Application

Filed: June 20, 2024

Publication date: December 26, 2024

Inventors: Karthik Mohan Kumar, Michael Mantor, Pedro Antonio Pena, Archana Ramalingam
MULTIMODAL CONTEXTUALIZER FOR NON-PLAYER CHARACTER GENERATION AND CONFIGURATION

Publication number: 20240428494

Abstract: Systems and techniques for generating and animating non-player characters (NPCs) within virtual digital environments are provided. Multimodal input data is received that comprises a plurality of input modalities for interaction with an NPC having a set of body features and a set of facial features. The multimodal input data is processed through one or more neural networks to generate animation sequences for both the body features and facial features of the NPC. Generating such animation sequences includes disentangling the multimodal input data to generate substantially disentangled latent representations, combining these representations with the multimodal input data, and using a large-language model (LLM) to generate speech data for the NPC. Further processing using reverse diffusion generates face vertex displacement data and joint trajectory data based on the combined representation and generated speech data.

Type: Application

Filed: June 20, 2024

Publication date: December 26, 2024

Inventors: Karthik Mohan Kumar, Michael Mantor, Pedro Antonio Pena, Archana Ramalingam
FUSED MULTIMODAL FRAMEWORK FOR NON-PLAYER CHARACTER GENERATION AND CONFIGURATION

Publication number: 20240424398

Abstract: Systems and techniques for generating and animating non-player characters (NPCs) within virtual digital environments are provided. Multimodal input data is received that comprises a plurality of input modalities for interaction with an NPC having a set of body features and a set of facial features. The multimodal input data is processed through one or more neural networks to generate animation sequences for both the body features and facial features of the NPC. Generating such animation sequences includes disentangling the multimodal input data to generate substantially disentangled latent representations, combining these representations with the multimodal input data, and using a large-language model (LLM) to generate speech data for the NPC. Further processing using reverse diffusion generates face vertex displacement data and joint trajectory data based on the combined representation and generated speech data.

Type: Application

Filed: June 20, 2024

Publication date: December 26, 2024

Inventors: Karthik Mohan Kumar, Michael Mantor, Pedro Antonio Pena, Archana Ramalingam
Hierarchical work scheduling

Patent number: 12153957

Abstract: A method for hierarchical work scheduling includes consuming a work item at a first scheduling domain having a local scheduler circuit and one or more workgroup processing elements. Consuming the work item produces a set of new work items. Subsequently, the local scheduler circuit distributes at least one new work item of the set of new work items to be executed locally at the first scheduling domain. If the local scheduler circuit of the first scheduling domain determines that the set of new work items includes one or more work items that would overload the first scheduling domain with work if scheduled for local execution, those work items are distributed to the next higher-level scheduler circuit in a scheduling domain hierarchy for redistribution to one or more other scheduling domains.

Type: Grant

Filed: September 30, 2022

Date of Patent: November 26, 2024

Assignee: Advanced Micro Devices, Inc.

Inventors: Matthaeus G. Chajdas, Christopher J. Brennan, Michael Mantor, Robert W. Martin, Nicolai Haehnle
