Patents by Inventor Baoguang Yang
Baoguang Yang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12229864Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for runtime optimization of the shader execution flow. A graphics processor may obtain instruction execution data associated with a graphics workload, the instruction execution data including graphics data for a set of shader operations. The graphics processor may configure, at a first iteration, at least one predication value based on the instruction execution data including the graphics data for the set of shader operations. The graphics processor may adjust, at a second iteration, an execution flow of the graphics workload based on the configured at least one predication value, the execution flow of the graphics workload including the set of shader operations. The graphics processor may execute or refrain from executing, at the second iteration, each of the set of shader operations based on the adjusted execution flow of the graphics workload.Type: GrantFiled: August 5, 2022Date of Patent: February 18, 2025Assignee: QUALCOMM IncorporatedInventors: Yun Du, Eric Demers, Andrew Evan Gruber, Chun Yu, Baoguang Yang, Chihong Zhang, Yuehai Du, Avinash Seetharamaiah, Jonnala Gadda Nagendra Kumar, Gang Zhong, Zilin Ying, Fei Wei
-
Patent number: 12212559Abstract: A security defending method, a coprocessor, and a processing apparatus are disclosed. The security defending method is applicable in a coprocessor, including: receiving a jump destination encryption request for the operation task; using mask configuration to perform first mask processing on the first jump destination address value to obtain a first intermediate jump destination address value; performing an authentication operation based on the first jump destination storage address, a key reference value corresponding to the operation task and the first intermediate jump destination address value, to obtain a first encryption result value; using the mask configuration to perform second mask processing on the first encryption result value to obtain a first intermediate encryption result value; performing an authentication operation on the first intermediate encryption result value and the first jump destination address value to obtain a first encryption jump destination address value.Type: GrantFiled: December 29, 2022Date of Patent: January 28, 2025Assignee: Beijing ESWIN Computing Technology Co., Ltd.Inventors: Baoguang Liu, Dan Liu, Liu Cao, Lele Ma, Wenjuan Zhang, Xun Zhang, Xianshuai Yang, Bin Liu, Xinyu Qin, Yifan Liu, Kaixuan Wang, Jun Han
-
Patent number: 12067666Abstract: Aspects presented herein relate to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may receive a set of draw call instructions corresponding to a graphics workload, where the set of draw call instructions is associated with at least one run-time parameter. The apparatus may also obtain a first shader program associated with storing data in a system memory and at least one second shader program associated with storing data in a constant memory. Further, the apparatus may execute the first shader program or the at least one second shader program based on whether the at least one run-time parameter is less than or equal to a size of the constant memory. The apparatus may also update or maintain a configuration of a shader processor or a streaming processor based on executing the first shader program or the at least one second shader program.Type: GrantFiled: May 18, 2022Date of Patent: August 20, 2024Assignee: QUALCOMM IncorporatedInventors: Yun Du, Eric Demers, Andrew Evan Gruber, Chun Yu, Chihong Zhang, Baoguang Yang, Yuehai Du, Gang Zhong, Avinash Seetharamaiah, Jonnala Gadda Nagendra Kumar
-
Publication number: 20240212257Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for workload packing in a graphics texture pipeline. A graphics processor may combine a set of samples into one or more hardware transactions, where the set of samples is associated with at least one of a first type of texture filtering or a first type of shader requested texture format component and the one or more hardware transactions are associated with at least one of a second type of texture filtering or a second type of shader requested texture format component. The graphics processor may process, in a texture pipeline at the graphics processor, the one or more hardware transactions including the set of samples. The graphics processor may output an indication of the processed one or more hardware transactions including the set of samples.Type: ApplicationFiled: December 22, 2022Publication date: June 27, 2024Inventors: Liang LI, Andrew Evan GRUBER, Baoguang YANG
-
Patent number: 11954758Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select a first execution slot of a plurality of execution slots for executing the one or more GPU workloads. The selection may be based on one of a plurality of granularities. The graphics processor may execute, at the selected first execution slot, the one or more GPU workloads at the one of the plurality of granularities.Type: GrantFiled: February 24, 2022Date of Patent: April 9, 2024Assignee: QUALCOMM IncorporatedInventors: Yun Du, Andrew Evan Gruber, Zilin Ying, Chunling Hu, Baoguang Yang, Yang Xia, Gang Zhong, Chun Yu, Eric Demers
-
Publication number: 20240046543Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for runtime optimization of the shader execution flow. A graphics processor may obtain instruction execution data associated with a graphics workload, the instruction execution data including graphics data for a set of shader operations. The graphics processor may configure, at a first iteration, at least one predication value based on the instruction execution data including the graphics data for the set of shader operations. The graphics processor may adjust, at a second iteration, an execution flow of the graphics workload based on the configured at least one predication value, the execution flow of the graphics workload including the set of shader operations. The graphics processor may execute or refrain from executing, at the second iteration, each of the set of shader operations based on the adjusted execution flow of the graphics workload.Type: ApplicationFiled: August 5, 2022Publication date: February 8, 2024Inventors: Yun DU, Eric DEMERS, Andrew Evan GRUBER, Chun YU, Baoguang YANG, Chihong ZHANG, Yuehai DU, Avinash SEETHARAMAIAH, Jonnala Gadda NAGENDRA KUMAR, Gang ZHONG, Zilin YING, Fei WEI
-
Patent number: 11893654Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may configure a portion of a GPU to include at least one depth processing block, the at least one depth processing block being associated with at least one depth buffer. The apparatus may also identify one or more depth passes of each of a plurality of graphics workloads, the plurality of graphics workloads being associated with a plurality of frames. Further, the apparatus may process each of the one or more depth passes in the portion of the GPU including the at least one depth processing block, each of the one or more depth passes being processed by the at least one depth processing block, the one or more depth passes being associated with the at least one depth buffer.Type: GrantFiled: July 12, 2021Date of Patent: February 6, 2024Assignee: QUALCOMM IncorporatedInventors: Sreyas Kurumanghat, Kalyan Kumar Bhiravabhatla, Andrew Evan Gruber, Tao Wang, Baoguang Yang, Pavan Kumar Akkaraju
-
Publication number: 20230377240Abstract: Aspects presented herein relate to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may receive a set of draw call instructions corresponding to a graphics workload, where the set of draw call instructions is associated with at least one run-time parameter. The apparatus may also obtain a first shader program associated with storing data in a system memory and at least one second shader program associated with storing data in a constant memory. Further, the apparatus may execute the first shader program or the at least one second shader program based on whether the at least one run-time parameter is less than or equal to a size of the constant memory. The apparatus may also update or maintain a configuration of a shader processor or a streaming processor based on executing the first shader program or the at least one second shader program.Type: ApplicationFiled: May 18, 2022Publication date: November 23, 2023Inventors: Yun DU, Eric DEMERS, Andrew Evan GRUBER, Chun YU, Chihong ZHANG, Baoguang YANG, Yuehai DU, Gang ZHONG, Avinash SEETHARAMAIAH, Jonnala Gadda NAGENDRA KUMAR
-
Publication number: 20230267567Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select a first execution slot of a plurality of execution slots for executing the one or more GPU workloads. The selection may be based on one of a plurality of granularities. The graphics processor may execute, at the selected first execution slot, the one or more GPU workloads at the one of the plurality of granularities.Type: ApplicationFiled: February 24, 2022Publication date: August 24, 2023Inventors: Yun DU, Andrew Evan GRUBER, Zilin YING, Chunling HU, Baoguang YANG, Yang XIA, Gang ZHONG, Chun YU, Eric DEMERS
-
Patent number: 11657471Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may generate a table including a plurality of entries to store data associated with at least one of a constant value or an immediate value. The apparatus may also process, upon generating the table, first data including at least one of a constant value or an immediate value. Further, the apparatus may store, in the generated table, at least one of the constant value or the immediate value of the first data. The apparatus may also transmit, upon storing at least one of the constant value or the immediate value in the table, the table including the stored at least one of the constant value or the immediate value of the first data.Type: GrantFiled: June 23, 2021Date of Patent: May 23, 2023Assignee: QUALCOMM IncorporatedInventors: Yun Du, Andrew Evan Gruber, Chihong Zhang, Jian Jiang, Gang Zhong, Baoguang Yang, Yang Xia, Chun Yu, Eric Demers
-
Publication number: 20230017522Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may configure a portion of a GPU to include at least one depth processing block, the at least one depth processing block being associated with at least one depth buffer. The apparatus may also identify one or more depth passes of each of a plurality of graphics workloads, the plurality of graphics workloads being associated with a plurality of frames. Further, the apparatus may process each of the one or more depth passes in the portion of the GPU including the at least one depth processing block, each of the one or more depth passes being processed by the at least one depth processing block, the one or more depth passes being associated with the at least one depth buffer.Type: ApplicationFiled: July 12, 2021Publication date: January 19, 2023Inventors: Sreyas KURUMANGHAT, Kalyan Kumar BHIRAVABHATLA, Andrew Evan GRUBER, Tao WANG, Baoguang YANG, Pavan Kumar AKKARAJU
-
Publication number: 20220414814Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may generate a table including a plurality of entries to store data associated with at least one of a constant value or an immediate value. The apparatus may also process, upon generating the table, first data including at least one of a constant value or an immediate value. Further, the apparatus may store, in the generated table, at least one of the constant value or the immediate value of the first data. The apparatus may also transmit, upon storing at least one of the constant value or the immediate value in the table, the table including the stored at least one of the constant value or the immediate value of the first data.Type: ApplicationFiled: June 23, 2021Publication date: December 29, 2022Inventors: Yun DU, Andrew Evan GRUBER, Chihong ZHANG, Jian JIANG, Gang ZHONG, Baoguang YANG, Yang XIA, Chun YU, Eric DEMERS
-
Publication number: 20220357983Abstract: The present disclosure relates to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may receive a plurality of workloads based on a workload order, each of the plurality of workloads being received in the workload order including at least a first workload and a second workload. The apparatus may also allocate one or more workloads of the plurality of workloads to one or more wave slots. Additionally, the apparatus may execute the one or more allocated workloads at the one or more wave slots, such that at least the first workload is executed at the first wave slot and the second workload is executed at the second wave slot. The apparatus may also allocate at least one other workload of the plurality of workloads to at least one previously-allocated wave slot of the one or more wave slots.Type: ApplicationFiled: May 7, 2021Publication date: November 10, 2022Inventors: Yun DU, Andrew Evan GRUBER, Zilin YING, Gang ZHONG, Baoguang YANG, Yang YU, Yang XIA, Ravindra KUMAR, Chun YU, Eric DEMERS
-
Patent number: 11204765Abstract: A graphics processing unit (GPU) utilizes block general purpose registers (bGPRs) to load multiple waves of samples for an instruction group into a processing pipeline and receive processed samples from the pipeline. The GPU acquires a credit for the bGPR for execution of the instruction group for a first wave using a persistent GPR and the bGPR. The GPU refunds the credit upon loading the first wave into the pipeline. The GPU executes a subsequent wave for the instruction group to load samples to the pipeline when at least one credit is available and the pipeline is processing the first wave. The GPU stores an indication of each wave that has been loaded into the pipeline in a queue. The GPU returns samples for a next wave in the queue from the pipeline to the bGPR for further processing when the physical slot of the bGPR is available.Type: GrantFiled: August 26, 2020Date of Patent: December 21, 2021Assignee: QUALCOMM IncorporatedInventors: Yun Du, Fei Wei, Gang Zhong, Minjie Huang, Jian Jiang, Zilin Ying, Baoguang Yang, Yang Xia, Jing Han, Liangxiao Hu, Chihong Zhang, Chun Yu, Andrew Evan Gruber, Eric Demers
-
Patent number: 11094032Abstract: Methods, systems, and devices for image processing are described. A device may determine, based on a test operation, to terminate a first wave associated with a first slot of a set of slots. The device may update a terminated wave bit associated with the first slot based on the determination to terminate the first wave. In some aspects, the device may update a number of invocations field associated with the first wave based on the determination to terminate the first wave. The device may release the first slot based on updating the terminated wave bit and the number of invocations field. In some examples, the device may output the number of invocations field to a rendering backend of the device based on the terminated wave bit.Type: GrantFiled: January 3, 2020Date of Patent: August 17, 2021Assignee: QUALCOMM IncorporatedInventors: Yun Du, Chun Yu, Andrew Evan Gruber, Zilin Ying, Baoguang Yang
-
Publication number: 20210209717Abstract: Methods, systems, and devices for image processing are described. A device may determine, based on a test operation, to terminate a first wave associated with a first slot of a set of slots. The device may update a terminated wave bit associated with the first slot based on the determination to terminate the first wave. In some aspects, the device may update a number of invocations field associated with the first wave based on the determination to terminate the first wave. The device may release the first slot based on updating the terminated wave bit and the number of invocations field. In some examples, the device may output the number of invocations field to a rendering backend of the device based on the terminated wave bit.Type: ApplicationFiled: January 3, 2020Publication date: July 8, 2021Inventors: Yun DU, Chun YU, Andrew Evan GRUBER, Zilin YING, Baoguang YANG
-
Patent number: 8605088Abstract: A method for reconstructing geometry mapping of a rasterized area is provided. The method includes: finding a testing pixel within the rasterized area; finding an occluding point corresponding to the testing pixel in a geometry shadow map of the rasterized area; determining weight values of the occluding point according to the (x, y) coordinate values of the testing pixel and vertices of a triangle occluding the testing pixel in the rasterized area; determining depth value of the occluding point according to the weight value and z coordinate of the vertices of the occluding triangle; and comparing the depth value of the occluding point with the depth value of the testing pixel so as to determine whether the testing pixel is drawn in light or in shadow.Type: GrantFiled: July 26, 2012Date of Patent: December 10, 2013Assignee: VIA Technologies, Inc.Inventors: Qinghua Dai, Baoguang Yang
-
Patent number: 8471853Abstract: A reconstructable geometry mapping method is provided. The reconstructable geometry mapping method includes: extracting geometry information of a plurality of occluding geometry shapes of an object's front-face with respect to a light source's point of view; performing a consistency test on a testing pixel so as to determine an occluding geometry shape corresponding to the testing pixel from the object's front-face among the plurality of occluding geometry shapes, in which the occluding geometry shape includes an occluding point, and the testing pixel overlaps with the occluding point when viewing from the light's point of view; reconstructing a depth value of an occluding point corresponding to the testing pixel; and performing a shadow determination of the testing pixel.Type: GrantFiled: October 10, 2008Date of Patent: June 25, 2013Assignee: VIA Technologies, Inc.Inventors: Qinghua Dai, Baoguang Yang
-
Publication number: 20120287132Abstract: A method for reconstructing geometry mapping of a rasterized area is provided. The method includes: finding a testing pixel within the rasterized area; finding an occluding point corresponding to the testing pixel in a geometry shadow map of the rasterized area; determining weight values of the occluding point according to the (x, y) coordinate values of the testing pixel and vertices of a triangle occluding the testing pixel in the rasterized area; determining depth value of the occluding point according to the weight value and z coordinate of the vertices of the occluding triangle; and comparing the depth value of the occluding point with the depth value of the testing pixel so as to determine whether the testing pixel is drawn in light or in shadow.Type: ApplicationFiled: July 26, 2012Publication date: November 15, 2012Applicant: VIA TECHNOLOGIES, INC.Inventors: Qinghua Dai, Baoguang Yang
-
Publication number: 20090109222Abstract: A reconstructable geometry mapping method is provided. The reconstructable geometry mapping method includes: extracting geometry information of a plurality of occluding geometry shapes of an object's front-face with respect to a light source's point of view; performing a consistency test on a testing pixel so as to determine an occluding geometry shape corresponding to the testing pixel from the object's front-face among the plurality of occluding geometry shapes, in which the occluding geometry shape includes an occluding point, and the testing pixel overlaps with the occluding point when viewing from the light's point of view; reconstructing a depth value of an occluding point corresponding to the testing pixel; and performing a shadow determination of the testing pixel.Type: ApplicationFiled: October 10, 2008Publication date: April 30, 2009Applicant: VIA Technologies, Inc.Inventors: Qinghua Dai, Baoguang Yang