Parallel Processors (e.g., Identical Processors) Patents (Class 345/505)

Apparatus and method for virtualized scheduling of multiple duplicate graphics engines

Patent number: 10937119

Abstract: An apparatus and method for virtualized scheduling. For example, one embodiment of a graphics processing apparatus comprises: a graphics processor comprising a plurality of graphics processing engines, each of the graphics processing engines usable to execute graphics program code for a plurality of graphics contexts, each of the graphics contexts associated with a particular user mode driver (UMD); and a scheduler to schedule the graphics program code for execution on the plurality of graphics engines, the scheduler comprising an integrated context queue to store program code from all of the graphics contexts, the scheduler to select graphics processing engines to execute the program code from each context based on a detected load and/or availability of each graphics processing engine and to determine an order for executing the program code from each context based on relative priorities associated with the different contexts.

Type: Grant

Filed: February 13, 2020

Date of Patent: March 2, 2021

Assignee: Intel Corporation

Inventors: Murali Ramadoss, Penne Lee, Ankur Shah, Ping Liu, Joseph Koston
Deterministic labeled data generation and artificial intelligence training pipeline

Patent number: 10922585

Abstract: Labeled data is deterministically generated for training or validating machine learning models for image analysis. Approaches are described that allow this training data to be generated, for example, in real-time, and in response to the conditions at the location where images are generated by image sensors.

Type: Grant

Filed: February 12, 2019

Date of Patent: February 16, 2021

Assignee: Recogni Inc.

Inventors: Shabarivas Abhiram, Eugene M. Feinberg
System and method for editing dynamically aggregated data

Patent number: 10891303

Abstract: Disclosed are systems and methods for editing aggregated data based on edit requests that are received. An edit file is updated to include a newly-received edit. The edit file includes anchor points that each point to one of a plurality of aggregate data objects. The anchor points are used to determine one of the objects to which the edit file corresponds. The edit file is then applied to update data in this object including making the new edit.

Type: Grant

Filed: April 19, 2018

Date of Patent: January 12, 2021

Assignee: VERIZON MEDIA INC.

Inventors: Brian Hein, Peter Monaco, Richard Sarvate, Nitish Dalal
Method and apparatus for transitioning in-process applications to remote devices

Patent number: 10831433

Abstract: A method and apparatus that incorporate teachings of the present disclosure may include, for example, receiving at a mobile communication device a video stream from a computing device. The video stream is associated with images generated by a software application and is transmitted by the computing device responsive to a request to redirect control of the software application to the mobile communication device. The method may also include presenting the streamed video at the mobile communication device and transmitting to the computing device a stimulation of a remote user input function associated with the mobile communication device, where the transmitted stimulation corresponds to at least one action of a plurality of associable actions that can be executed by the software application. Additional embodiments are disclosed.

Type: Grant

Filed: July 12, 2018

Date of Patent: November 10, 2020

Assignee: STEELSERIES ApS

Inventors: Bruce Hawver, Jacob Wolff-Petersen
Performance optimization of hardware accelerators

Patent number: 10831541

Abstract: Example embodiments of the present disclosure provide methods and devices for optimizing performance of hardware accelerators. The accelerator device may detect status information of a current acceleration task being executed. The detected status information is provided to a host associated with the accelerator device. The host makes preparation for a subsequent acceleration task based on the status information before termination of the current running acceleration task. The accelerator device may execute the subsequent acceleration task based on the preparation. In this way, the performance of hardware accelerator is optimized.

Type: Grant

Filed: July 19, 2018

Date of Patent: November 10, 2020

Assignee: International Business Machines Corporation

Inventors: Yang Liu, Yong Lu, Peng Ou, Hong Bo Peng
Data processing systems

Patent number: 10825128

Abstract: A data processing system comprises processing circuitry arranged to generate data to form an output array of data, processing circuitry arranged to store the generated data in an output buffer 15 by writing compressed blocks of data representing particular regions of the output array of data to the output buffer, processing circuitry 14 arranged to read a compressed block of data representing a particular region of the array of data from the output buffer, processing circuitry 16 arranged to acquire meta-data from the compressed block of data, and processing circuitry 21 arranged to process the block of data. The acquired meta-data is used to affect the processing of the block of data.

Type: Grant

Filed: August 10, 2015

Date of Patent: November 3, 2020

Assignee: Arm Limited

Inventors: Daren Croxford, Ben James, Sean Ellis, Edward Charles Plowman
Method and apparatus for simultaneously executing multiple contexts on a graphics engine

Patent number: 10796472

Abstract: Apparatus and method for simultaneous command streamers. For example, one embodiment of an apparatus comprises: a plurality of work element queues to store work elements for a plurality of thread contexts, each work element associated with a context descriptor identifying a context storage region in memory; a plurality of command streamers, each command streamer associated with one of the plurality of work element queues, the command streamers to independently submit instructions for execution as specified by the work elements; a thread dispatcher to evaluate the thread contexts including priority values, to tag each instruction with an execution identifier (ID), and to responsively dispatch each instruction including the execution ID in accordance with the thread context; and a plurality of graphics functional units to independently execute each instruction dispatched by the thread dispatcher and to associate each instruction with a thread context based on its execution ID.

Type: Grant

Filed: June 30, 2018

Date of Patent: October 6, 2020

Assignee: Intel Corporation

Inventors: Michael Apodaca, Ankur Shah, Ben Ashbaugh, Brandon Fliflet, Hema Nalluri, Pattabhiraman K, Peter Doyle, Joseph Koston, James Valerio, Murali Ramadoss, Altug Koker, Aditya Navale, Prasoonkumar Surti, Balaji Vembu
Glyph rendering in three-dimensional space

Patent number: 10783695

Abstract: In one embodiment, a computing system may determine a pixel area in a two-dimensional coordinate system associated with a display. The system may project the pixel area into a three-dimensional coordinate system to determine a projected area in the three-dimensional coordinate system. Based on the projected area, the system may access a portion of an analytical definition of a glyph, the portion of the analytical definition defining one or more areas of the glyph. The system may compute a coverage proportion of the pixel area that overlaps with the one or more areas of the glyph. The system may then determine a color for the pixel area based on the coverage proportion.

Type: Grant

Filed: August 23, 2019

Date of Patent: September 22, 2020

Assignee: Facebook Technologies, LLC

Inventor: Warren Andrew Hunt
Apparatus and method for processing fractional reciprocal operations

Patent number: 10768896

Abstract: An apparatus and method for performing a reciprocal. For example one embodiment of a processor comprises: a decoder to decode a reciprocal instruction to generate a decoded reciprocal instruction; a source register to store at least one packed input data element; a destination register to store a result data element; and reciprocal execution circuitry to execute the decoded reciprocal instruction, the reciprocal execution circuitry to use a first portion of the packed input data element as an index to a data structure containing a plurality of sets of coefficients to identify a first set of coefficients from the plurality of sets, the reciprocal execution circuitry to generate a reciprocal of the packed input data element using a combination of the coefficients and a second portion of the packed input data element.

Type: Grant

Filed: December 21, 2017

Date of Patent: September 8, 2020

Assignee: Intel Corporation

Inventors: Cristina Anderson, Elmoustapha Ould-Ahmed-Vall, Marius Cornea-Hasegan, Robert Valentine, Mark Charney, Jesus Corbal, Venkateswara Madduri
Geometry to tiling arbiter for tile-based rendering system

Patent number: 10740950

Abstract: A geometry processing phase of tile-based rendering includes a plurality of parallel geometry pipelines, a plurality of tiling pipelines and a geometry to tiling arbiter situated between the plurality of geometry pipelines and the plurality of tiling pipelines. Each geometry pipeline is configured to: generate one or more geometry blocks for each geometry group of a subset of ordered geometry groups; generate a corresponding primitive position block for each geometry block, and compress each geometry blocks to generate a corresponding compressed geometry block. The tiling pipelines are configured to generate, from the primitive position blocks, a list for each tile indicating primitives that fall within the bounds of that tile. The geometry to tiling arbiter is configured to forward the primitive position blocks generated by the plurality of geometry pipelines to the plurality of tiling pipelines in the correct order based on the order of the geometry groups.

Type: Grant

Filed: October 10, 2018

Date of Patent: August 11, 2020

Assignee: Imagination Technologies Limited

Inventors: Tim Rollingson, Jairaj Dave
Storage system having a host that manages physical data locations of a storage device

Patent number: 10732855

Abstract: A storage system includes a storage device including a controller and a nonvolatile memory unit, and a host including a processor configured to determine whether or not the host is going to access the storage device within a predetermined range of time, and cause the storage device to be powered off when it is determined that the host is not going to access the storage device within the predetermined range of time.

Type: Grant

Filed: October 15, 2018

Date of Patent: August 4, 2020

Assignee: TOSHIBA MEMORY CORPORATION

Inventor: Daisuke Hashimoto
Graphics library extensions

Patent number: 10699464

Abstract: Methods for enabling graphics features in processors are described herein. Methods are provided to enable trinary built-in functions in the shader, allow separation of the graphics processor's address space from the requirement that all textures must be physically backed, enable use of a sparse buffer allocated in virtual memory, allow a reference value used for stencil test to be generated and exported from a fragment shader, provide support for use specific operations in the stencil buffers, allow capture of multiple transform feedback streams, allow any combination of streams for rasterization, allow a same set of primitives to be used with multiple transform feedback streams as with a single stream, allow rendering to be directed to layered framebuffer attachments with only a vertex and fragment shader present, and allow geometry to be directed to one of an array of several independent viewport rectangles without a geometry shader.

Type: Grant

Filed: October 31, 2016

Date of Patent: June 30, 2020

Assignee: Advanced Micro Devices, Inc.

Inventors: Graham Sellers, Eric Zolnowski, Pierre Boudier, Juraj Obert
Safe buffer transfer mechanism in a distributed storage system

Patent number: 10691616

Abstract: A method is provided for use in a distributed storage system having a write cache, the method comprising: receiving, by a first node in the distributed storage system, a first Remote Procedure Call (RPC) command to perform a partial update to a buffer that is stored in a memory of the first node, the partial update including replacing old data that is present in the buffer with new data that is stored in the write cache; transmitting, from the first node to a second node, a second RPC command instructing the second node to provide the new data to the first node by executing a Remote Direct Memory Access (RDMA) write operation into the memory of the first node; and performing the partial update based on the new data when the RDMA. write operation is completed and the new data is received by the first node as a result.

Type: Grant

Filed: January 29, 2019

Date of Patent: June 23, 2020

Assignee: EMC IP Holding Company LLC

Inventors: Vladimir Shveidel, Lior Kamran
Computer systems that are able to reduce memory data flow and graphics processing methods thereof

Patent number: 10679318

Abstract: A graphics processing method is provided, adapted to a graphic processing unit, the steps including: receiving, via a CSP, a first command associated with all render targets from a display driver; determining, via the display driver, sizes and areas of a plurality of tiles in each frame; repeatedly controlling, via a scissor pool unit, a graphics processing unit to perform drawing processing for each tile according to the first command; comparing, via a signature comparing unit of a cache memory, a signature of a current tile of a current frame and a signature of a tile corresponding to the same position of a previous frame and generating a comparison result; and determining whether to flush the dirty data of the current tile stored in the cache memory from the cache memory to a memory access unit according to the comparison result.

Type: Grant

Filed: April 11, 2019

Date of Patent: June 9, 2020

Assignee: SHANGHAI ZHAOXIN SEMICONDUCTOR CO., LTD.

Inventors: Fengxia Wu, Deming Gu, Heng Que, Yi Zhou, Ying Wang
Application-specific virtualized graphics processing

Patent number: 10628908

Abstract: Methods, systems, and computer-readable media for application-specific virtualized graphics processing are disclosed. A virtual compute instance is provisioned from a provider network. The provider network comprises a plurality of computing devices configured to implement a plurality of virtual compute instances with multi-tenancy. A virtual GPU is attached to the virtual compute instance. The virtual GPU is selected based at least in part on requirements of an application. The virtual GPU is implemented using a physical GPU, and the physical GPU is accessible to the virtual compute instance over a network. The application is executed using the virtual GPU on the virtual compute instance.

Type: Grant

Filed: February 26, 2018

Date of Patent: April 21, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Nicholas Patrick Wilt, Ashutosh Tambe, Nathan Lee Burns
Memory access compression using clear code for tile pixels

Patent number: 10621109

Abstract: One embodiment provides for a graphics processor comprising a translation lookaside buffer (TLB) to cache a first page table entry for a virtual to physical address mapping for use by the graphics processor, the first page table entry to indicate that a first virtual page is cleared to a clear color and a graphics pipeline to bypass a memory access for the first virtual page based on the first page table entry, wherein the graphics pipeline is to read a field in the first page table entry to determine a value of the clear color.

Type: Grant

Filed: February 27, 2019

Date of Patent: April 14, 2020

Assignee: Intel Corporation

Inventors: Prasoonkumar Surti, Abhishek R. Appu, Kiran C. Veernapu
Control device and processing apparatus system

Patent number: 10613806

Abstract: A control device includes: a receiving unit that receives a request for processing; a determination unit that determines a processing apparatus to be moved from among a plurality of processing apparatuses based on a predetermined standard in a case where the receiving unit receives a request for processing; and a control unit that controls movement of the processing apparatus based on a result of determination performed by the determination unit.

Type: Grant

Filed: December 14, 2016

Date of Patent: April 7, 2020

Assignee: FUJI XEROX CO., LTD.

Inventors: Naoya Ogata, Hajime Kajiyama, Hideki Fujimoto, Kunitoshi Yamamoto, Akira Ichikawa, Mariko Miyazaki, Tetsuya Kobayashi
Display method and terminal device

Patent number: 10614772

Abstract: A display method and a terminal device are provided, so as to resolve a prior-art problem that CPU power consumption of the terminal device is relatively high because a display system of the terminal device processes three procedures in parallel in a Vsync period. The method is: determining, by the display system of the terminal device, a start moment of a first procedure in an (M+1)th Vsync period according to first processing duration of the first procedure in an Mth Vsync period, so that the display system starts to execute a device display procedure before starting to execute the first procedure, the display system delays executing the first procedure in the (M+1)th Vsync period, so that a time required for processing three procedures in parallel in the (M+1)th Vsync period by the display system is reduced, and CPU power consumption of the terminal device is reduced.

Type: Grant

Filed: March 8, 2016

Date of Patent: April 7, 2020

Assignee: Huawei Technologies Co., Ltd.

Inventors: Jing Zhao, Bo Yu
System and method for parallelization of data processing in a processor

Patent number: 10558466

Abstract: Systems, apparatuses, and methods for adjusting group sizes to match a processor lane width are described. In early iterations of an algorithm, a processor partitions a dataset into groups of data points which are integer multiples of the processing lane width of the processor. For example, when performing a K-means clustering algorithm, the processor determines that a first plurality of data points belong to a first group during a given iteration. If the first plurality of data points is not an integer multiple of the number of processing lanes, then the processor reassigns a first number of data points from the first plurality of data points to one or more other groups. The processor then performs the next iteration with these first number of data points assigned to other groups even though the first number of data points actually meets the algorithmic criteria for belonging to the first group.

Type: Grant

Filed: June 23, 2016

Date of Patent: February 11, 2020

Assignee: Advanced Micro Devices, Inc.

Inventors: Mauricio Breternitz, Mayank Daga
Geo-enrichment of data based on shapes

Patent number: 10554766

Abstract: Some embodiments provide a non-transitory machine-readable medium that stores a program. The program receives a request to geo-enrich data comprising a set of location data. The program further accesses shape data comprising a plurality of shapes associated with a plurality of geographical regions. The program also associates, for each location data in the set of location data, a shape in the plurality of shape with the location data.

Type: Grant

Filed: December 29, 2016

Date of Patent: February 4, 2020

Assignee: SAP SE

Inventors: Christopher Bolognese, Jonathan Tiu, Xing Jin, Sae-Won Om, Lyndon Hiew
Cloud-offloaded position calculation with on-device acquisition

Patent number: 10545244

Abstract: Methods, systems, and computer programs are presented for cloud-offloaded position calculation with on-device acquisition. One method includes operations for collecting raw global positioning system (GPS) signals for a set of positions, and storing in memory the raw GPS signals. The method further includes processing the raw GPS signals in batch mode at the computing device to obtain acquisition data for the positions, where the processing includes identifying tasks for parallel processing by a graphics processing unit (GPU), performing, by the GPU, the tasks in parallel by assigning each of the tasks to a core processor within the GPU, and combining results from performing the tasks to obtain the acquisition data. Additionally, the method includes an operation for transmitting the acquisition data to a server for calculating locations for the set positions at the server, the server storing the locations and making the locations available for location-tracking applications.

Type: Grant

Filed: June 15, 2017

Date of Patent: January 28, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jie Liu, Nissanka Arachchige Bodhi Priyantha, Lucas N Joppa, Qiang Xu
Directed acyclic graph path enumeration with application in multilevel instancing

Patent number: 10546411

Abstract: Embodiments provide for an apparatus including one or more processors having logic to enumerate a directed path through nodes of a directed acyclic graph, the logic to determine a key for a node and a path identifier for a directed path between nodes of the directed acyclic graph.

Type: Grant

Filed: March 31, 2016

Date of Patent: January 28, 2020

Assignee: INTEL CORPORATION

Inventor: Louis Feng
Acceleration of convolutional neural network training using stochastic perforation

Patent number: 10540583

Abstract: Technical solutions are described to accelerate training of a multi-layer convolutional neural network. According to one aspect, a computer implemented method is described. A convolutional layer includes input maps, convolutional kernels, and output maps. The method includes a forward pass, a backward pass, and an update pass that each include convolution calculations. The described method performs the convolutional operations involved in the forward, the backward, and the update passes based on a first, a second, and a third perforation map respectively. The perforation maps are stochastically generated, and distinct from each other. The method further includes interpolating results of the selective convolution operations to obtain remaining results. The method includes iteratively repeating the forward pass, the backward pass, and the update pass until the convolutional neural network is trained. Other aspects such as a system, apparatus, and computer program product are also described.

Type: Grant

Filed: November 30, 2015

Date of Patent: January 21, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Leland Chang, Suyog Gupta
Adaptive compute size per workload

Patent number: 10521880

Abstract: Methods and apparatus relating to techniques for power management. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to receive one or more frames for a workload, determine one or more compute resource parameters for the workload, and store the one or more compute resource parameters for the workload in a memory in association with workload context data for the workload. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: April 17, 2017

Date of Patent: December 31, 2019

Assignee: INTEL CORPORATION

Inventors: Balaji Vembu, Josh B. Mastronarde, Altug Koker, Nikos Kaburlasos, Abhishek R. Appu, Joydeep Ray
Data processing system for a graphical interface and graphical interface comprising such a data processing system

Patent number: 10509737

Abstract: A data processing system for a graphical interface includes at least one graphics processing unit (GPU) and at least one central processing unit (CPU) which communicates with the graphics processing unit, said processing unit and said central processing unit each including a group of data processing cores (C1, . . . , C6, C?1, . . . C?6). The data processing cores of the graphics processing unit are each connected to a data processing core of the central processing unit via a single dedicated bus (B1, . . . , B6) in such a way as to carry out a data transfer in parallel between said graphics processing unit and said central processing unit.

Type: Grant

Filed: March 25, 2015

Date of Patent: December 17, 2019

Assignee: Zodiac Aero Electric

Inventors: Etienne Denis Marie Zante, Rémi Andreoletti
Policies and architecture to dynamically offload VR processing to HMD based on external cues

Patent number: 10475149

Abstract: Virtual reality systems and methods are described. For example, one embodiment of an apparatus comprises: a communications interface to provide frame data of a virtual reality scene to a head mounted display (HMD); at least one performance monitor coupled to at least one component of the apparatus the at least one performance monitor to monitor performance of the at least one component and to send an alert based on the performance of the at least one component; a processor to process the frame data; a controller to receive the alert based on the performance of the at least one component and to offload processing of the frame data from the processor to the HMD for processing; and a display to show the rendered view of the scene.

Type: Grant

Filed: September 25, 2017

Date of Patent: November 12, 2019

Assignee: INTEL CORPORATION

Inventors: Paul S. Diefenbaugh, Karthik Veeramani, Deepak S. Vembar, Rajneesh Chowdhury, Atsuo Kuwahara
Fast determination of workgroup batches from multi-dimensional kernels

Patent number: 10467724

Abstract: Techniques are disclosed relating to dispatching compute work from a compute stream. In some embodiments, workgroup batch circuitry is configured to select (e.g., in a single clock cycle) multiple workgroups to be distributed to different shader circuitry. In some embodiments, iterator circuitry is configured to determine next positions in different dimensions at least partially in parallel. For example, in some embodiments, first circuitry is configured to determine a next position in a first dimension and an increment amount for a second dimension. In some embodiments, second circuitry is configured to determine at least partially in parallel with the determination of the next position in the first dimension, next positions in the second dimension for multiple possible increment amounts in the second dimension. In some embodiments, this may facilitate a configurable number of workgroups per batch and may increase performance, e.g., by increasing the overall number of workgroups dispatched per clock cycle.

Type: Grant

Filed: February 14, 2018

Date of Patent: November 5, 2019

Assignee: Apple Inc.

Inventors: Andrew M. Havlir, Jeffrey T. Brady
Unmanned aerial vehicle and method for operating an unmanned aerial vehicle

Patent number: 10459445

Abstract: According to various aspects, an unmanned aerial vehicle may be described, the unmanned aerial vehicle including: one or more sensors configured to gather thermal information associated with a vicinity of the unmanned aerial vehicle; one or more processors configured to determine at least one control information based on the thermal information and to control the unmanned aerial vehicle based on the at least one control information.

Type: Grant

Filed: September 28, 2017

Date of Patent: October 29, 2019

Assignee: Intel IP Corporation

Inventors: Stefan Menzel, Daniel Pohl, Thomas Seiler, Markus Achtelik
General purpose distributed data parallel computing using a high level language

Patent number: 10437573

Abstract: General-purpose distributed data-parallel computing using a high-level language is disclosed. Data parallel portions of a sequential program that is written by a developer in a high-level language are automatically translated into a distributed execution plan. The distributed execution plan is then executed on large compute clusters. Thus, the developer is allowed to write the program using familiar programming constructs in the high level language. Moreover, developers without experience with distributed compute systems are able to take advantage of such systems.

Type: Grant

Filed: July 10, 2017

Date of Patent: October 8, 2019

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Yuan Yu, Dennis Fetterly, Michael Isard, Ulfar Erlingsson, Mihai Budiu
Image processing apparatus, image processing method, and surgical system

Patent number: 10440241

Abstract: The present technology relates to an image processing apparatus, an image processing method, and a surgical system, by which a captured image can be displayed with low latency in almost real time. A DMA controller 51 of a CPU 31 divides image data, which is input via an IF card 34, by the number of GPU cards 35-1, 35-2 in a horizontal direction and allocates them. In each of the GPU cards 35-1, 35-2, the image data is subjected to time division processing in the vertical direction. With this, the use of the plurality of GPU cards 35-1, 35-2 increases the speed of processing associated with display for the image data. High-speed display is realized due to reduction in latency. The present technology is applicable to an endoscopic camera, a surgical microscope, and the like.

Type: Grant

Filed: April 13, 2015

Date of Patent: October 8, 2019

Assignee: SONY CORPORATION

Inventors: Masahito Yamane, Tsuneo Hayashi
Sensor-derived object flight performance tracking

Patent number: 10420999

Abstract: Systems and techniques for sensor-derived object flight performance tracking are described herein. A set of magnetometer readings may be obtained from a magnetometer included with an object. A local rotation axis of the object may be determined at a time using the set of magnetometer readings. The local rotation axis may describe rotation of the object around a local magnetic target. A global rotation axis may be calculated based on an initial orientation of the object. The global rotation axis may describe a fixed rotation axis of the object during flight in a global coordinate frame, wherein an angle between the global rotation axis and magnetic north remains constant during the flight. An orientation of the object may be determined for the time using the global rotation axis and the local rotation axis of the object at the time.

Type: Grant

Filed: March 27, 2017

Date of Patent: September 24, 2019

Assignee: Intel Corporation

Inventors: Narayan Sundararajan, Xue Yang, Suresh V. Golwalkar, Mahanth Gowda, Romit Roy Choudhury
Exactly-once semantics for streaming analytics in non-idempotent output operations

Patent number: 10423625

Abstract: An approach for distributed stream computing in non-idempotent output operations is provided. The approach assigns an eventid to a corresponding entityid. The approach determines a minibatchid and a partitionid for a partition. The approach determines whether the partition was previously processed. The approach generates a new minibatchid and a new partitionid for a new partition based upon determining the partition was not previously processed. The approach determines whether a record was previously processed based upon determining the partition was previously processed. The approach processes the record of the partition based upon determining the record was not previously processed.

Type: Grant

Filed: October 11, 2016

Date of Patent: September 24, 2019

Assignee: SAMSUNG SDS AMERICA, INC.

Inventors: Partho Datta, Girish Kathalagiri Somashekariah
Apparatus and method for efficiently accessing memory when performing a horizontal data reduction

Patent number: 10409571

Abstract: Apparatus and method for optimizing shader execution. For example, one embodiment of a graphics processing apparatus comprises: a plurality of execution units to execute shader programs; optimization detection circuitry and/or logic to identify one or more portions of shader program code to be optimized including one or more reduction operations which require read/write memory operations and associated barrier operations; and optimization circuitry and/or logic to optimize the shader program code by converting a plurality of the read/write memory operations to read/write register operations and removing one or more barrier operations to generate optimized shader program code; the execution units to execute the optimized shader program code.

Type: Grant

Filed: March 15, 2018

Date of Patent: September 10, 2019

Assignee: Intel Corporation

Inventor: Marek Targowski
Enhanced three-dimensional point cloud rendering

Patent number: 10410406

Abstract: A method of rendering a three-dimensional point cloud in a two-dimensional display includes inputting the three-dimensional point cloud that includes three-dimensional coordinates of a set of points, creating a depth buffer for the three-dimensional point cloud that includes depth data for the set of points from a viewpoint location. The method further includes determining a foreground depth buffer by, for each respective pixel area of the two-dimensional display, determining a foreground depth by detecting a closest point to the viewpoint location among a subset of the set of points corresponding to the respective pixel area, and assigning a depth of the closest point as the foreground depth for the respective pixel area. The method further includes filtering the depth buffer to obtain a filtered depth buffer by removing points that are not in the foreground, and outputting the filtered depth buffer to the two-dimensional display.

Type: Grant

Filed: February 27, 2017

Date of Patent: September 10, 2019

Assignee: Trimble AB

Inventors: Andrés Serna, Thomas Chaperon
Intelligently distributed stage data mining system

Patent number: 10402415

Abstract: An intelligently distributed stage data mining system is disclosed herein, including an intelligent central server, a first WLAN receiving and transmitting unit, a local cluster control unit, a second WLAN receiving and transmitting unit, a third WLAN receiving and transmitting unit, a self-adaptive multi-dimensional transmission processing unit, a plurality of ZigBee receiving and transmitting units and a distributed data extraction unit. The intelligent central server is used for sending data acquisition and stage correction instruction to the local cluster control unit, and for receiving the stage real-time data uploaded by the self-adaptive multi-dimensional transmission processing unit. The local cluster control unit is used for receiving the data acquisition instruction sent by the intelligent central server, and forwarding instructions to the self-adaptive multi-dimensional transmission processing unit.

Type: Grant

Filed: July 10, 2016

Date of Patent: September 3, 2019

Assignee: ZHEJIANG DAFENG INDUSTRY CO., LTD

Inventors: Hua Feng, Qiyun Feng, Zhen Liu, Haihong Tian, Dong Wang, Lifeng Wu
Texture processing apparatus and method

Patent number: 10388033

Abstract: A texture processor includes: a texture cache configured to store textures; a controller configured to determine a texture address corresponding to a requested texture among the stored textures and read a texture corresponding to the texture address from the texture cache; a format converter configured to convert a format of the read texture into another format, based on a degree of texture precision required by a graphics processing unit (GPU); and a texture filter configured to perform texture filtering using the read texture having its format converted into the another format.

Type: Grant

Filed: April 14, 2016

Date of Patent: August 20, 2019

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Yeongon Cho, Seok Kang, Soojung Ryu, Jeongae Park, Woong Seo, Sangheon Lee
Acceleration of convolutional neural network training using stochastic perforation

Patent number: 10380479

Abstract: Technical solutions are described to accelerate training of a multi-layer convolutional neural network. According to one aspect, a computer implemented method is described. A convolutional layer includes input maps, convolutional kernels, and output maps. The method includes a forward pass, a backward pass, and an update pass that each include convolution calculations. The described method performs the convolutional operations involved in the forward, the backward, and the update passes based on a first, a second, and a third perforation map respectively. The perforation maps are stochastically generated, and distinct from each other. The method further includes interpolating results of the selective convolution operations to obtain remaining results. The method includes iteratively repeating the forward pass, the backward pass, and the update pass until the convolutional neural network is trained. Other aspects such as a system, apparatus, and computer program product are also described.

Type: Grant

Filed: October 8, 2015

Date of Patent: August 13, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Leland Chang, Suyog Gupta
Neural network unit employing user-supplied reciprocal for normalizing an accumulated value

Patent number: 10380064

Abstract: A neural network unit including a register programmable with a representation of a reciprocal value of a divisor and a plurality of neural processing units (NPU). Each NPU has an ALU, an accumulator, and a reciprocal multiplier unit. The ALU performs arithmetic and logical operations on a sequence of operands to generate a sequence of results and accumulates the sequence of results as an accumulated value into the accumulator. The reciprocal multiplier unit receives the representation of the reciprocal value and the accumulated value and in response generates a result that is the quotient of the accumulated value and the divisor.

Type: Grant

Filed: April 5, 2016

Date of Patent: August 13, 2019

Assignee: VIA ALLIANCE SEMICONDUCTOR CO., LTD.

Inventors: G. Glenn Henry, Terry Parks
Method and processing apparatus for gating redundant threads

Patent number: 10360177

Abstract: Described is a method and processing apparatus to improve power efficiency by gating redundant threads processing. In particular, the method for gating redundant threads in a graphics processor includes determining if data for a thread and data for at least another thread are within a predetermined similarity threshold, gating execution of the at least another thread if the data for the thread and the data for the at least another thread are within the predetermined similarity threshold, and using an output data from the thread as an output data for the at least another thread.

Type: Grant

Filed: June 22, 2016

Date of Patent: July 23, 2019

Assignees: ATI Technologies ULC, Advanced Micro Devices, Inc.

Inventors: Syed Zohaib M. Gilani, Jiasheng Chen, QingCheng Wang, YunXiao Zou, Michael Mantor, Bin He, Timour T. Paltashev
Method for performing random read access to a block of data using parallel LUT read instruction in vector processors

Patent number: 10331347

Abstract: This disclosure is directed to the problem of paralleling random read access within a reasonably sized block of data for a vector SIMD processor. The invention sets up plural parallel look up tables, moves data from main memory to each plural parallel look up table and then employs a look up table read instruction to simultaneously move data from each parallel look up table to a corresponding part a vector destination register. This enables data processing by vector single instruction multiple data (SIMD) operations. This vector destination register load can be repeated if the tables store more used data. New data can be loaded into the original tables if appropriate. A level one memory is preferably partitioned as part data cache and part directly addressable memory. The look up table memory is stored in the directly addressable memory.

Type: Grant

Filed: May 29, 2018

Date of Patent: June 25, 2019

Assignee: TEXAS INSTRUMENTS INCORPORATED

Inventors: Jayasree Sankaranarayanan, Dipan Kumar Mandal
Method and system for distributed shader optimization

Patent number: 10296345

Abstract: Embodiments of the present invention are operable to communicate a list of important shaders and their current best-known compilations to remote client devices over a communications network. Client devices are allowed to produce modified shader compilations by varying optimizations. If a client device produces a modified compilation that beats an important shader's current best-known compilation, embodiments of the present invention can communicate this new best-known shader compilation back to a host computer system. Furthermore, embodiments of the present invention may periodically broadcast the new best-known shader compilation back to client devices for possible further optimization or for efficient rendering operations using the best-known shader compilation.

Type: Grant

Filed: August 8, 2017

Date of Patent: May 21, 2019

Assignee: Nvidia Corporation

Inventor: Jeremy Zelsnack
Moving image encoding device, moving image decoding device, moving image coding method, and moving image decoding method

Patent number: 10291939

Abstract: When carrying out an intra-frame prediction process to generate an intra prediction image by using an already-encoded image signal in a frame, an intra prediction part 4 selects a filter from one or more filters which are prepared in advance according to the states of various parameters associated with the encoding of a target block to be filtered, and carries out a filtering process on a prediction image by using the filter. As a result, prediction errors which occur locally can be reduced, and the image quality can be improved.

Type: Grant

Filed: July 21, 2011

Date of Patent: May 14, 2019

Assignee: MITSUBISHI ELECTRIC CORPORATION

Inventors: Akira Minezawa, Shunichi Sekiguchi, Kazuo Sugimoto
Control program and control method for programmable logic device and information processing apparatus including the same

Patent number: 10289787

Abstract: A PLD control program causes a computer to execute process including: outputting a configuration request for configuring a logic circuit in any of a plurality of areas where logic circuits are enabled to be reconfigured in a programmable logic device (hereinafter referred to as a PLD); and selecting, when a total size or number of a plurality of first logic circuits already configured in the plurality of areas and a second logic circuit newly configured in accordance with the configuration request exceeds a size or a number of logic circuits that are enabled to be configured in the plurality of areas, a logic circuit to be replaced with the second logic circuit, as a replacement target, from among the plurality of first logic circuits, based on a save and restore time needed to save and restore state data in the first logic circuits.

Type: Grant

Filed: July 12, 2017

Date of Patent: May 14, 2019

Assignee: FUJITSU LIMITED

Inventor: Hidetoshi Matsumura
Display controller

Patent number: 10283089

Abstract: A display controller comprises an input stage 20 operable to read at least one input surface, a composition stage 28 operable to compose plural input surfaces to generate a composited output surface, an output stage 30 operable to provide the composited output surface to a display for display, a scaling engine 31 operable to scale a composited output surface generated by the composition stage 28, and a write-out stage 32 operable to write a composited and/or scaled output surface to external memory.

Type: Grant

Filed: June 1, 2015

Date of Patent: May 7, 2019

Assignee: Arm Limited

Inventors: Damian Modrzyk, Pawel Duc, Piotr Chrobak, Michal Bogusz, Daren Croxford
Coexistence of message-passing-like algorithms and procedural coding

Patent number: 10275289

Abstract: First logical cores supported on physical processor cores in a computing system can be designated for execution of message-passing workers of a plurality of message workers while at least second logical cores supported on the physical processor cores can be designated for execution of procedural code such that resources of a physical processor core supporting the first logical core and the second logical core are shared between a first logical core and a second logical core. A database object in a repository can be assigned to one message-passing worker, which can execute operations on the database object while procedurally coded operations are processed using the second logical core on one or more of the plurality of physical processor cores while the first logical core executes the message-passing worker.

Type: Grant

Filed: February 13, 2017

Date of Patent: April 30, 2019

Assignee: SAP SE

Inventor: Ivan Schreter
Memory access compression using clear code for tile pixels

Patent number: 10268596

Abstract: One embodiment provides for a graphics processor comprising a translation lookaside buffer (TLB) to cache a first page table entry for a virtual to physical address mapping for use by the graphics processor, the first page table entry to indicate that a first virtual page is cleared to a clear color and a graphics pipeline module to bypass a memory access for the first virtual page based on the first page table entry.

Type: Grant

Filed: April 21, 2017

Date of Patent: April 23, 2019

Assignee: Intel Corporation

Inventors: Prasoonkumar Surti, Abhishek R. Appu, Kiran C. Veernapu
Distributed frame buffer and API for scalable parallel rendering

Patent number: 10249073

Abstract: Embodiments provide for a graphics processing apparatus comprising multiple compute nodes coupled to a communication layer, a rendering system executing on the multiple compute nodes, wherein the communication layer enables a distributed object executing on one of the multiple compute nodes to communicate with the rendering system, and a distributed framebuffer logic to subdivide a logical screen space for a frame into multiple regions and subdivide ownership of the regions among the multiple compute nodes.

Type: Grant

Filed: July 21, 2015

Date of Patent: April 2, 2019

Assignee: INTEL CORPORATION

Inventors: Ingo Wald, Gregory P. Johnson
Moving image encoding device, moving image decoding device, moving image coding method, and moving image decoding method

Patent number: 10244264

Abstract: When carrying out an intra-frame prediction process to generate an intra prediction image by using an already-encoded image signal in a frame, an intra prediction part 4 selects a filter from one or more filters which are prepared in advance according to the states of various parameters associated with the encoding of a target block to be filtered, and carries out a filtering process on a prediction image by using the filter. As a result, prediction errors which occur locally can be reduced, and the image quality can be improved.

Type: Grant

Filed: February 27, 2015

Date of Patent: March 26, 2019

Assignee: MITSUBISHI ELECTRIC CORPORATION

Inventors: Akira Minezawa, Shunichi Sekiguchi, Kazuo Sugimoto
Moving image encoding device, moving image decoding device, moving image coding method, and moving image decoding method

Patent number: 10244233

Abstract: When carrying out an intra-frame prediction process to generate an intra prediction image by using an already-encoded image signal in a frame, an intra prediction part 4 selects a filter from one or more filters which are prepared in advance according to the states of various parameters associated with the encoding of a target block to be filtered, and carries out a filtering process on a prediction image by using the filter. As a result, prediction errors which occur locally can be reduced, and the image quality can be improved.

Type: Grant

Filed: February 27, 2015

Date of Patent: March 26, 2019

Assignee: MITSUBISHI ELECTRIC CORPORATION

Inventors: Akira Minezawa, Shunichi Sekiguchi, Kazuo Sugimoto
Visibility-based state updates in graphical processing units

Patent number: 10242481

Abstract: In general, techniques are described for visibility-based state updates in graphical processing units (GPUs). A device that renders image data comprising a memory configured to store state data and a GPU may implement the techniques. The GPU may be configured to perform a multi-pass rendering process to render an image from the image data. The GPU determines visibility information for a plurality of objects defined by the image data during a first pass of the multi-pass rendering process. The visibility information indicates whether each of the plurality of objects will be visible in the image rendered from the image data during a second pass of the multi-pass rendering process. The GPU then retrieves the state data from the memory for use by the second pass of the multi-pass rendering process in rendering the plurality of objects of the image data based on the visibility information.

Type: Grant

Filed: March 15, 2012

Date of Patent: March 26, 2019

Assignee: QUALCOMM Incorporated

Inventors: Andrew Gruber, Ravi Somnath Jenkal

prev 1 2 3 4 5 6 … next