For A Parallel Or Multiprocessor System Patents (Class 717/149)

Loop compiling (Class 717/150)

Propagating reduced-precision on computation graphs

Patent number: 11972238

Abstract: Methods, systems, and apparatus for propagating reduced-precision on computation graphs are described. In one aspect, a method includes receiving data specifying a directed graph that includes operators for a program. The operators include first operators that each represent a numerical operation performed on numerical values having a first level of precision and second operators that each represent a numerical operation performed on numerical values having a second level of precision. One or more downstream operators are identified for a first operator. A determination is made whether each downstream operator represents a numerical operation that is performed on input values having the second level of precision. Whenever each downstream operator represents a numerical operation that is performed on input values having the second level of precision, a precision of numerical values output by the operation represented by the first operator is adjusted to the second level of precision.

Type: Grant

Filed: June 13, 2022

Date of Patent: April 30, 2024

Assignee: Google LLC

Inventor: Yuanzhong Xu
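
The propagation rule in the abstract of patent 11972238 can be illustrated with a minimal sketch. It is not the patented implementation: the function name, the dictionary-based graph encoding, and the precision labels "f32" and "bf16" are all assumptions made for illustration. An operator's output is lowered only when every downstream operator consumes inputs at the reduced precision.

```python
from collections import defaultdict

def propagate_reduced_precision(edges, input_precision, output_precision, low="bf16"):
    """Return output precisions after one propagation pass.

    edges: (producer, consumer) pairs of the directed graph.
    input_precision: operator -> precision at which it reads its inputs.
    output_precision: operator -> precision at which it writes its outputs.
    All names and labels here are hypothetical.
    """
    consumers = defaultdict(list)
    for producer, consumer in edges:
        consumers[producer].append(consumer)

    adjusted = dict(output_precision)
    for op, downstream in consumers.items():
        # Lower the output only if *every* consumer reads low-precision input.
        if all(input_precision[d] == low for d in downstream):
            adjusted[op] = low
    return adjusted

edges = [("matmul", "cast_a"), ("matmul", "cast_b"),
         ("add", "cast_a"), ("add", "reduce")]
input_precision = {"cast_a": "bf16", "cast_b": "bf16", "reduce": "f32"}
output_precision = {"matmul": "f32", "add": "f32",
                    "cast_a": "bf16", "cast_b": "bf16", "reduce": "f32"}
result = propagate_reduced_precision(edges, input_precision, output_precision)
```

In this toy graph "matmul" is lowered because both consumers read bf16, while "add" keeps f32 because one consumer ("reduce") still reads f32.
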
Mere-parsing with boundary and semantic driven scoping

Patent number: 11966695

Abstract: Methods, systems and computer program products for implementing a mere-parser are disclosed. Text data is processed to generate one or more parse items. A boundary based attribute associated with one of the parse items is identified, and the identified mere attribute is associated with one or more of the remaining parse items that are not blocked from association with the boundary based attribute.

Type: Grant

Filed: October 27, 2020

Date of Patent: April 23, 2024

Assignee: Optum360, LLC

Inventors: Daniel T. Heinze, Mark L. Morsch
Logic fabric based on microsector infrastructure with data register having scan registers

Patent number: 11960734

Abstract: Systems and methods described herein may relate to providing dynamically configurable circuitry able to be programmed using a microsector granularity. Furthermore, selective partial reconfiguration operations may be performed using write operations to write a new configuration over existing configurations to selectively reprogram a portion of programmable logic. A quasi-delay insensitive (QDI) shift register and/or control circuitry receiving data and commands from an access register disposed between portions of programmable logic may enable at least some of the operations described.

Type: Grant

Filed: September 25, 2020

Date of Patent: April 16, 2024

Assignee: Intel Corporation

Inventors: Sean R Atsatt, Ilya K. Ganusov
Multiprocessor programming toolkit for design reuse

Patent number: 11914989

Abstract: Techniques for specifying and implementing a software application targeted for execution on a multiprocessor array (MPA). The MPA may include a plurality of processing elements, supporting memory, and a high bandwidth interconnection network (IN), communicatively coupling the plurality of processing elements and supporting memory. In some embodiments, software code may specify one or more cell definitions that include: program instructions executable to perform a function and one or more language constructs. The software code may further instantiate first, second, and third cell instances, each of which is an instantiation of one of the one or more cell definitions, where the instantiation includes configuration of the one or more language constructs such that: the first and second cell instances communicate via respective communication ports and the first and second cell instances are included in the third cell instance.

Type: Grant

Filed: October 28, 2021

Date of Patent: February 27, 2024

Assignee: Coherent Logix, Incorporated

Inventors: Stephen E. Lim, Viet N. Ngo, Jeffrey M. Nicholson, John Mark Beardslee, Teng-I Wang, Zhong Qing Shang, Michael Lyle Purnell
System for managing calculation processing graph of artificial neural network and method of managing calculation processing graph by using the same

Patent number: 11915149

Abstract: Provided are a system for managing a calculation processing graph of an artificial neural network and a method of managing a calculation processing graph by using the system. A system for managing a calculation processing graph of an artificial neural network run by a plurality of heterogeneous resources includes: a task manager configured to allocate the plurality of heterogeneous resources to a first subgraph and a second subgraph that are to be run, the first subgraph and the second subgraph being included in the calculation processing graph; a first compiler configured to compile the first subgraph to be executable on a first resource among the plurality of heterogeneous resources; and a second compiler configured to compile the second subgraph to be executable on a second resource among the plurality of heterogeneous resources, wherein the first subgraph and the second subgraph are respectively managed through separate calculation paths.

Type: Grant

Filed: September 5, 2019

Date of Patent: February 27, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Seung-Soo Yang
Job decomposition processing method for distributed computing

Patent number: 11907693

Abstract: A job decomposition processing method for distributed computing, which comprises: analyzing a source program to be run by program static analysis to determine a function call graph contained in the source program; determining feature information of functions contained in the source program by program dynamic analysis and/or a program intelligent decomposition algorithm, wherein the feature information of the functions is used to characterize relevant information when each function is running; decomposing the source program based on the feature information of the functions, a function relationship and available resource information of a computing platform to form an execution recommendation for each function on the computing platform, i.e., which hardware resources are used for computing each function; and finally inserting a modifier in the source program and starting computation on the computing platform.

Type: Grant

Filed: February 17, 2023

Date of Patent: February 20, 2024

Assignee: ZHEJIANG LAB

Inventors: Wenyuan Bai, Feng Gao
Hybrid virtual GPU co-scheduling

Patent number: 11900157

Abstract: An embodiment of a semiconductor package apparatus may include technology to manage one or more virtual graphic processor units, and co-schedule the one or more virtual graphic processor units based on both general processor instructions and graphics processor instructions. Other embodiments are disclosed and claimed.

Type: Grant

Filed: September 19, 2018

Date of Patent: February 13, 2024

Assignee: Intel Corporation

Inventors: Yan Zhao, Zhi Wang, Weinan Li
Multi-use chip-to-chip interface

Patent number: 11892966

Abstract: Systems, methods, and apparatuses are described that enable IC architectures to enable a single anchor to connect to and accept a variety of chiplets at any port by way of a programming model that enables the anchor or chiplet to dynamically adapt to configurations, requirements, or aspects of any coupled component and provide an interface for the coupled components.

Type: Grant

Filed: December 14, 2021

Date of Patent: February 6, 2024

Assignee: XILINX, INC.

Inventors: Krishnan Srinivasan, Ygal Arbel, Sagheer Ahmad
Binary translation using raw binary code with compiler produced metadata

Patent number: 11886848

Abstract: A method, system, and computer-readable medium for binary translation cause a binary translator to combine raw binary code and compiler-produced metadata associated with a compiled program module. The binary translator is caused to further reconcile, using the compiler-produced metadata, original compiler-produced control flow information with how lower-level machine instructions comprise a control flow in the raw binary code, and original compiler-produced aliasing information with how lower-level machine instructions access the memory locations described by the aliasing information according to predetermined criteria. The binary translator is further caused to prevent copy propagation of values in temporary variables for decimal computations beyond offsets in the machine instructions where the temporary variables are killed.

Type: Grant

Filed: May 25, 2022

Date of Patent: January 30, 2024

Assignee: International Business Machines Corporation

Inventors: Toshihiko Koju, Reid Copeland, David Kevin Siegwart, Jordan Ryan Zannier, Allan H. Kielstra
Method for constructing behavioural software signatures

Patent number: 11868473

Abstract: A method for constructing behavioral software signatures. The method includes: embedding execution traces of a set of software in a vector space, an execution trace of a software agent including at least one event and being representative of the execution of the software, the embedding representing an event of the execution trace by a vector encoding a context for occurrence of the event; partitioning the vectors associated with the software of the set to generate a data group representative of a behavior, a behavioral label being associated with the data group; associating a behavioral label with a vector, which is representative of the data group to which the vector belongs, and associating a trace of behavioral labels with a trace of vectors, the trace of labels being representative of execution of a software agent, and extracting in the trace of labels at least one behavioral signature associated with the software.

Type: Grant

Filed: January 30, 2020

Date of Patent: January 9, 2024

Assignee: ORANGE

Inventors: Baptiste Olivier, Xiao Han
Processor chip and control methods thereof

Patent number: 11842265

Abstract: Disclosed is a processor chip configured to perform neural network processing. The processor chip includes a memory, a first processor configured to perform neural network processing on data stored in the memory, a second processor and a third processor, and the second processor is configured to transmit a control signal to the first processor and the third processor to cause the first processor and the third processor to perform an operation.

Type: Grant

Filed: June 19, 2020

Date of Patent: December 12, 2023

Assignee: Samsung Electronics Co., Ltd.

Inventors: Yongmin Tai, Insang Cho, Wonjae Lee, Chanyoung Hwang
Isolating tenants executing in multi-tenant software containers

Patent number: 11842217

Abstract: Mechanisms for resource isolation allow tenants executing in a multi-tenant software container to be isolated in order to prevent resource starvation by one or more of the tenants. Mechanisms for dependency isolation may be utilized to prevent one tenant executing in a multi-tenant software container from using another tenant in the same container in a manner that requires co-tenancy. Mechanisms for security isolation may be utilized to prevent one tenant in a multi-tenant software container from accessing protected data or functionality of another tenant. Mechanisms for fault isolation may be utilized to prevent tenants in a multi-tenant software container from causing faults or other types of errors that affect other tenants executing in the same software container.

Type: Grant

Filed: July 29, 2021

Date of Patent: December 12, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Keian Christopher, Kevin Michael Beranek, Christopher Keakini Kaulia, Vijay Ravindra Kulkarni, Samuel Leonard Moniz, Kyle Bradley Peterson, Ajit Ashok Varangaonkar, Jun Xu
Method and system for optimizing data storage of query statistics of graph database

Patent number: 11816132

Abstract: Disclosed are a method and a system for optimizing data storage of query statistics of a graph database. The method includes: periodically scanning, on storage servers in which partitions are located, all edges in the partitions; determining, according to all the edges in the partitions, partitions to which start points and end points belong, and calculating outgoing-edge correlation and incoming-edge correlation between partitions; calculating relevancies between partitions through a preset correlation matrix weight according to the outgoing-edge correlation and the incoming-edge correlation between partitions; and storing partitions with high relevancies on a same storage server.

Type: Grant

Filed: September 22, 2021

Date of Patent: November 14, 2023

Assignee: Vesoft Inc.

Inventors: Tong Yue, Xiaomeng Ye, Yujue Wang, Yu Liu, Min Wu, Chenguang Wang
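
The partition-relevancy calculation in the abstract of patent 11816132 can be sketched roughly as follows. This is an illustrative reading, not the patented method: the function name is hypothetical, the "preset correlation matrix weight" is reduced to two scalar weights (`w_out`, `w_in`), and correlation is taken to be a simple count of cross-partition edges.

```python
from collections import Counter

def partition_relevancies(edges, partition_of, w_out=0.5, w_in=0.5):
    """Score unordered partition pairs by their cross-partition edges.

    edges: (start_vertex, end_vertex) pairs found by scanning the partitions.
    partition_of: vertex -> partition id.
    w_out, w_in: assumed weights for outgoing and incoming correlation.
    """
    cross = Counter()
    for src, dst in edges:
        p, q = partition_of[src], partition_of[dst]
        if p != q:                      # edges inside one partition are ignored
            cross[(p, q)] += 1          # outgoing from p, incoming to q

    relevancy = {}
    for p, q in list(cross):
        a, b = sorted((p, q))
        # Outgoing correlation a->b plus incoming correlation b->a.
        relevancy[(a, b)] = w_out * cross[(a, b)] + w_in * cross[(b, a)]
    return relevancy

edges = [(1, 2), (1, 3), (2, 3), (3, 1), (4, 1)]
partition_of = {1: "A", 2: "A", 3: "B", 4: "C"}
relevancy = partition_relevancies(edges, partition_of)
# The pair with the highest score is the candidate for co-location.
best_pair = max(relevancy, key=relevancy.get)
```

Here partitions "A" and "B" exchange three edges and score highest, so under this sketch they would be the first candidates for placement on the same storage server.
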
Global modulo allocation in neural network compilation

Patent number: 11809849

Abstract: In one example, a method performed by a compiler comprises: receiving a dataflow graph of a neural network, the neural network comprising a neural network operator; receiving information of computation resources and memory resources of a neural network hardware accelerator intended to execute the neural network operator; determining, based on the dataflow graph, iterations of an operation on elements of a tensor included in the neural network operator; determining, based on the information, a mapping between the elements of the tensor to addresses in the portion of the local memory, and a number of the iterations of the operation to be included in a batch, wherein the number of the iterations in the batch are to be executed in parallel by the neural network hardware accelerator; and generating a schedule of execution of the batches of the iterations of the operations.

Type: Grant

Filed: May 20, 2021

Date of Patent: November 7, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Hongbin Zheng, Randy Renfu Huang, Robert Geva
Diagnosing and mitigating memory leak in computing nodes

Patent number: 11775407

Abstract: The present disclosure relates to systems, methods, and computer readable media for diagnosing and mitigating memory impact events, such as memory leaks, high memory usage, or other memory issues preventing a host node from performing as expected on a cloud computing system. The systems described herein involve receiving locally generated memory usage data from a plurality of host nodes. The systems described herein may aggregate the memory usage data and determine a memory impact diagnosis based on a subset of the aggregated memory usage data. The systems described herein may further apply a mitigation model for mitigating the memory impact event. The systems described herein provide an end-to-end solution for diagnosing and mitigating a variety of memory issues using a dynamic and scalable system that reduces a negative impact of memory leaks and other memory issues on a cloud computing system.

Type: Grant

Filed: March 7, 2022

Date of Patent: October 3, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Cong Chen, Xinsheng Yang, Yingnong Dang, Si Qin
Transaction nesting depth testing instruction

Patent number: 11775297

Abstract: In a system providing transactional memory support, a transaction nesting depth testing instruction is provided for triggering processing circuitry to set at least one status value to one of a plurality of states depending on a transaction nesting depth indicative of a number of executed transaction start instructions of a given thread for which the corresponding transaction remains unaborted and uncommitted, the plurality of states including a first state selected when the transaction nesting depth is 1 and at least one further state selected when the transaction nesting depth is greater than or less than 1. The supported ISA enables the setting of the at least one status value and a conditional branch conditional on the at least one status value being in the first state to be performed in response to a single transaction nesting depth testing instruction and a single conditional branch instruction.

Type: Grant

Filed: August 21, 2018

Date of Patent: October 3, 2023

Assignee: Arm Limited

Inventors: Grigorios Magklis, Matthew James Horsnell, Stephan Diestelhorst
Parameter optimization device, method and program

Patent number: 11720080

Abstract: An optimum combination of a loop unrolling number and a circuit parallel number in a high-level synthesis is determined. A circuit synthesis information generation unit sets, as parameter candidates, a plurality of combinations of a loop unrolling number and a circuit parallel number to generate circuit synthesis information indicating a synthesis circuit obtained by high-level synthesis processing for each of the combinations. An optimum parameter determination unit calculates, for each piece of the generated circuit synthesis information, an estimation processing performance related to the synthesis circuit indicated by the circuit synthesis information, and determines an optimum combination of the loop unrolling number and the circuit parallel number based on the circuit synthesis information based on which a maximum estimation processing performance is obtained.

Type: Grant

Filed: May 21, 2019

Date of Patent: August 8, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Syuhei Yoshida, Yuta Ukon, Koji Yamazaki, Koyo Nitta
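
The search described in the abstract of patent 11720080 amounts to enumerating candidate (loop unrolling number, circuit parallel number) pairs, estimating performance for each synthesized circuit, and keeping the maximum. A minimal sketch follows. The `toy_synthesize` model and its constants are invented stand-ins for a real high-level synthesis report, and the performance formula is an assumption.

```python
from itertools import product

def toy_synthesize(unroll, parallel):
    """Hypothetical stand-in for high-level synthesis of one candidate."""
    return {
        "cycles": 1024 // unroll + 10,            # fewer cycles with more unrolling
        "clock_mhz": 300 - 9 * unroll * parallel, # larger circuits clock slower
        "parallel": parallel,
    }

def estimated_performance(info):
    # Assumed metric: work completed per unit time across parallel circuits.
    return info["parallel"] * info["clock_mhz"] / info["cycles"]

def best_parameters(unroll_candidates, parallel_candidates, synthesize):
    best, best_perf = None, float("-inf")
    for unroll, parallel in product(unroll_candidates, parallel_candidates):
        perf = estimated_performance(synthesize(unroll, parallel))
        if perf > best_perf:
            best, best_perf = (unroll, parallel), perf
    return best

best = best_parameters([1, 2, 4, 8], [1, 2, 4], toy_synthesize)
```

With this toy model the largest candidate (8, 4) is not optimal, because its clock penalty outweighs the extra parallelism; the search settles on (4, 4).
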
Apparatuses, methods, and systems for instructions of a matrix operations accelerator

Patent number: 11714875

Abstract: Systems, methods, and apparatuses relating to a matrix operations accelerator are described.

Type: Grant

Filed: December 28, 2019

Date of Patent: August 1, 2023

Assignee: Intel Corporation

Inventors: Amit Gradstein, Simon Rubanovich, Sagi Meller, Saeed Kharouf, Gavri Berger, Zeev Sperber, Jose Yallouz, Ron Schneider
Deployment of applications conforming to application data sharing and decision service platform schema

Patent number: 11663175

Abstract: Systems, methods, and software are disclosed herein for facilitating deployment of a decision service for sharing application data among multiple isolated applications executing on one or more application platforms. In an implementation, a method of deploying applications conforming to a platform schema for facilitating sharing of the application data among isolated applications executing on one or more application platforms is described. The method includes receiving a request to submit a third party application to an application deployment system, identifying a validation manifest associated with a platform schema responsive to receiving the request, and automatically verifying that the third party application conforms to the platform schema by performing a set of pre-defined validation checks. The request identifies the platform schema and platform capability information associated with the third party application. The validation manifest includes the set of pre-defined validation checks.

Type: Grant

Filed: August 28, 2019

Date of Patent: May 30, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: David Mowatt, Stephen O'Driscoll
Efficient parallelized computation of global behavior profiles in real-time transaction scoring systems

Patent number: 11636485

Abstract: Parallelized computation by a real-time transaction scoring system that incorporates global behavior profiling of transacting entities includes dividing a global profile computing component of a transaction scoring model of a real-time behavioral analytics transaction scoring system into a plurality of global profile component instances. The transaction scoring model uses a plurality of global profile variables, each of the plurality of global profile component instances using its own global profile partition that contains the estimate of global profile variables and being configured for update by a dedicated thread of execution of the real-time transaction scoring system, each dedicated thread being configured for receiving and scoring a portion of input transactions.

Type: Grant

Filed: April 6, 2018

Date of Patent: April 25, 2023

Assignee: Fair Isaac Corporation

Inventors: Scott Michael Zoldi, Alexei Betin
Systems and methods for systolic array design from a high-level program

Patent number: 11604758

Abstract: Systems and methods for automated systolic array design from a high-level program are disclosed. One implementation of a systolic array design supporting a convolutional neural network includes a two-dimensional array of reconfigurable processing elements arranged in rows and columns. Each processing element has an associated SIMD vector and is connected through a local connection to at least one other processing element. An input feature map buffer having a double buffer is configured to store input feature maps, and an interconnect system is configured to pass data to neighboring processing elements in accordance with a processing element scheduler. A CNN computation is mapped onto the two-dimensional array of reconfigurable processing elements using an automated system configured to determine suitable reconfigurable processing element parameters.

Type: Grant

Filed: November 12, 2020

Date of Patent: March 14, 2023

Assignee: Xilinx, Inc.

Inventors: Peng Zhang, Cody Hao Yu, Xuechao Wei, Peichen Pan
Device profiling in GPU accelerators by using host-device coordination

Patent number: 11579852

Abstract: System and method of compiling a program having a mixture of host code and device code to enable Profile Guided Optimization (PGO) for device code execution. An exemplary integrated compiler can compile source code programmed to be executed by a host processor (e.g., CPU) and a co-processor (e.g., a GPU) concurrently. The compilation can generate an instrumented executable code which includes: profile instrumentation counters for the device functions; and instructions for the host processor to allocate and initialize device memory for the counters and to retrieve collected profile information from the device memory to generate instrumentation output. The output is fed back to the compiler for compiling the source code a second time to generate optimized executable code for the device functions defined in the source code.

Type: Grant

Filed: July 27, 2020

Date of Patent: February 14, 2023

Assignee: NVIDIA Corporation

Inventors: Hariharan Sandanagobalane, Sean Lee, Vinod Grover
Systems and methods for extending a live range of a virtual scalar register

Patent number: 11556319

Abstract: Systems and methods are described for extending a live range for a virtual scalar register during compiling of a program, comprising: receiving an intermediate representation (IR) of a source code configured for implementing single-instruction-multiple-thread (SIMT) execution, the IR representing the source code as a control flow graph including a plurality of basic blocks (BB); and when a virtual scalar register defined in a first BB of the IR is last used in a second BB of the IR that is a divergent BB, modifying the IR to extend the live range of the virtual scalar register.

Type: Grant

Filed: September 1, 2020

Date of Patent: January 17, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Abraham Davidson Fai Chung Chan, Tyler Bryce Nowicki, Guansong Zhang, Ahmed Mohammed ElShafiey Mohammed Eltantawy
Systems and methods for configurable ordered transformation of database content

Patent number: 11550579

Abstract: A system includes processor hardware and memory hardware that stores instructions. The instructions include, in response to receiving a request, determining a request type of the request, retrieving a first set of collected information, and selecting a first set of instructions corresponding to the request type. The instructions include constructing a first result by executing each instruction of the first set of instructions to create the first entry as a nested entry within the first result including data of the first set of collected information identified in the first set of instructions as nested or retrieve first data of the first set of collected information identified by the first instruction and add the first data to the first entry of the first result. The instructions include transforming a display of the operator device to complete a set of fields displayed on the display with corresponding entries of the first result.

Type: Grant

Filed: March 12, 2020

Date of Patent: January 10, 2023

Assignee: TD Ameritrade IP Company, Inc.

Inventors: Sean William Watts, Igor Vornovitskiy, IV
Asynchronous execution mechanism

Patent number: 11494867

Abstract: An apparatus to facilitate asynchronous execution at a processing unit. The apparatus includes one or more processors to detect independent task passes that may be executed out of order in a pipeline of the processing unit, schedule a first set of processing tasks to be executed at a first set of processing elements at the processing unit and schedule a second set of tasks to be executed at a second set of processing elements, wherein execution of the first set of tasks at the first set of processing elements is to be performed simultaneously and in parallel with execution of the second set of tasks at the second set of processing elements.

Type: Grant

Filed: December 8, 2020

Date of Patent: November 8, 2022

Assignee: Intel Corporation

Inventors: Saurabh Sharma, Michael Apodaca, Aditya Navale, Travis Schluessler, Vamsee Vardhan Chivukula, Abhishek Venkatesh, Subramaniam Maiyuran
Methods and apparatus for finding long methods in code

Patent number: 11467829

Abstract: A method and apparatus are disclosed for finding overlong source code segments (e.g., methods) by evaluating input source code segments for a plurality of predetermined code metric values in order to identify candidate source code segments (e.g., non-autogenerated methods) which do not meet a first code metric value and to assess each candidate source code segment against a second code metric value to identify different sets of candidate source code segments (e.g., test methods and normal methods) so that each set of candidate source code segments may be assessed against a tailored set of code length thresholds to identify any overlong source code segment having a code length which meets or exceeds at least two of the tailored set of code length thresholds.

Type: Grant

Filed: May 29, 2020

Date of Patent: October 11, 2022

Assignee: DevFactory Innovations FZ-LLC

Inventor: Aditya T. Kadam
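
The filtering steps in the abstract of patent 11467829 can be sketched as follows, under stated assumptions: the metric names ("lines", "statements", "tokens"), the threshold values, and the record layout are all hypothetical, since the abstract gives no concrete numbers. Autogenerated methods are skipped, the remaining methods are split into test and normal sets, and a method is flagged when it meets or exceeds at least two thresholds tailored to its set.

```python
# Hypothetical per-category thresholds; the abstract specifies no values.
THRESHOLDS = {
    "test":   {"lines": 80, "statements": 60, "tokens": 900},
    "normal": {"lines": 50, "statements": 40, "tokens": 600},
}

def find_overlong(methods):
    """Return names of methods that hit at least two tailored thresholds."""
    overlong = []
    for m in methods:
        if m["autogenerated"]:          # first metric: drop autogenerated code
            continue
        kind = "test" if m["is_test"] else "normal"   # second metric: pick a set
        hits = sum(m[name] >= limit for name, limit in THRESHOLDS[kind].items())
        if hits >= 2:
            overlong.append(m["name"])
    return overlong

methods = [
    {"name": "parse", "autogenerated": False, "is_test": False,
     "lines": 55, "statements": 45, "tokens": 300},
    {"name": "test_parse", "autogenerated": False, "is_test": True,
     "lines": 55, "statements": 45, "tokens": 300},
    {"name": "gen_stub", "autogenerated": True, "is_test": False,
     "lines": 500, "statements": 400, "tokens": 9000},
]
overlong = find_overlong(methods)
```

The same size that flags "parse" as overlong leaves "test_parse" alone, because test methods are held to the looser assumed thresholds.
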
Call graph enhancement using stitching algorithm

Patent number: 11379198

Abstract: A code base is parsed to identify methods encapsulated therein. Thereafter, a call graph is generated based on the parsing using a graph generation technique. The call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It can then be determined, based on one or more design patterns used to generate the code base, that at least a portion of the nodes of the generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected using a stitching algorithm to result in a modified call graph. Data characterizing the modified call graph can then be provided (e.g., displayed in a graphical user interface, stored in a database, loaded into memory, transmitted to a remote computing device, etc.).

Type: Grant

Filed: December 14, 2020

Date of Patent: July 5, 2022

Assignee: SAP SE

Inventors: Amitabh Goswami, Amrit Shankar Dutta Dutta, Abhishek Hondad, Alok Kumar
Verifiable testcase workflow

Patent number: 11379349

Abstract: Verifiable test case workflow is provided by creating a secure database for actions taken regarding a source file that is stored on a first computer; creating a test executable from one or more source files and storing it on the first computer; finalizing the source file for test on a second computer different from the first computer; hashing a test environment related to the source file and the second computer; and in response to determining that a version of the test executable provided to the second computer matches a version of the test executable provided to the secure database: executing the test executable on the second computer; hashing test results from testing the source file on the second computer; and adding the test executable as hashed and the test results as hashed to the secure database to actions already stored in the secure database.

Type: Grant

Filed: January 3, 2020

Date of Patent: July 5, 2022

Assignee: International Business Machines Corporation

Inventors: Ann Barnette Umberhocker, Ariba Siddiqui, Sowmya Janakiraman, George Conerly Wilson
Job analytics aggregation tool

Patent number: 11368452

Abstract: An analytics tool includes a network interface and an analytics engine. The network interface receives a request for job analytics of a job. The job comprises uploading a plurality of batches, each of the plurality of batches comprising a subset of information of a data table. A network node of a plurality of network nodes uploads a batch of the plurality of batches. The analytics engine determines the plurality of network nodes used to complete the job. The analytics engine retrieves network node data for each of the plurality of network nodes. The analytics engine generates the job analytics by aggregating the network node data for each of the plurality of network nodes.

Type: Grant

Filed: November 11, 2019

Date of Patent: June 21, 2022

Assignee: Bank of America Corporation

Inventor: John Abraham
Framework for user-directed profile-driven optimizations

Patent number: 11321061

Abstract: A method for using profiling to obtain application-specific, preferred parameter values for an application is disclosed. First, a parameter for which to obtain an application-specific value is identified. Code is then augmented for application-specific profiling of the parameter. The parameter is profiled and profile data is collected. The profile data is then analyzed to determine the application's preferred parameter value for the profile parameter.

Type: Grant

Filed: July 29, 2019

Date of Patent: May 3, 2022

Assignee: Google LLC

Inventors: Teresa Louise Johnson, Xinliang David Li
System for fully integrated predictive decision-making and simulation

Patent number: 11295262

Abstract: A system for fully integrated predictive decision-making and simulation having a high-volume deep web scraper system, a data retrieval engine, a directed computational graph module, and a decision and action path simulation engine.

Type: Grant

Filed: October 30, 2020

Date of Patent: April 5, 2022

Assignee: QOMPLX, INC.

Inventors: Jason Crabtree, Andrew Sellers
Automatic out-of-bound access prevention in GPU kernels executed in a managed environment

Patent number: 11288108

Abstract: Techniques are provided for an automated method of adding out-of-bound access prevention in GPU kernels executed in a managed environment. In an embodiment, a system of computers compiles a GPU kernel code function that includes one or more array references that are memory address dependent. The system of computers compiles the kernel code function by generating a rewritten GPU kernel code module that includes, within the function signature of the rewritten GPU kernel code module, a respective array size parameter for each array reference of the one or more array references included in the GPU kernel code function. The system of computers further compiles the kernel code function by adding bounding protection instructions to the one or more potential out-of-bound access instructions in the rewritten GPU kernel code module. The potential out-of-bound access instructions comprise instructions that reference each respective array size parameter of the one or more array references.

Type: Grant

Filed: December 3, 2019

Date of Patent: March 29, 2022

Assignee: Oracle International Corporation

Inventors: Alberto Parravicini, Davide Bartolini, Lukas Stadler, Arnaud Delamare
Computer-readable recording medium recording port switching program and port switching method

Patent number: 11265266

Abstract: A non-transitory computer-readable recording medium stores a port switching program for causing a computer to execute a process including: transmitting, in response to a mirror switching instruction that specifies a migration source port and a migration destination port, a first mirror switching notification to a virtual switch that has the migration destination port to request a change of mirror setting in the migration destination port; canceling mirror setting for a transmission packet to the migration destination port in the migration source port; and canceling mirror setting for a received packet from the migration destination port in the migration source port in response to a second mirror switching notification from the virtual switch, the second mirror switching notification indicating the change of the mirror setting in the migration destination port.

Type: Grant

Filed: June 21, 2019

Date of Patent: March 1, 2022

Assignee: FUJITSU LIMITED

Inventor: Kazuhiro Suzuki
System on chip including a multi-core processor and task scheduling method thereof

Patent number: 11243806

Abstract: A scheduling method of a system on chip including a multi-core processor includes receiving a schedule-requested task, converting a priority assigned to the schedule-requested task into a linear priority weight, selecting a plurality of candidate cores, to which the schedule-requested task will be assigned, from among cores of the multi-core processor, calculating a preemption compare index indicating a current load state of each of the plurality of candidate cores, comparing the linear priority weight with the preemption compare index of each of the plurality of candidate cores to generate a comparison result, and assigning the schedule-requested task to one candidate core of the plurality of candidate cores depending on the comparison result.

Type: Grant

Filed: July 22, 2019

Date of Patent: February 8, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Jong-Lae Park, Soohyun Kim, Youngtae Lee, Byung-Soo Kwon
Task allocation among devices in a distributed data storage system

Patent number: 11240305

Abstract: In one example, a processor may receive a first request to process a first task, the first request including a first estimated central processing unit utilization for the first task and a first estimated memory utilization for the first task and receive central processing unit capacities and memory capacities of a plurality of sub-data routers including at least a first sub-data router. The processor may further determine that the first sub-data router has a lowest central processing unit capacity from among the plurality of sub-data routers that is sufficient to accommodate the first estimated central processing unit utilization for the first task and determine that the first sub-data router has a memory capacity that is sufficient to accommodate the first estimated memory utilization for the first task. The processor may then assign the first task to the first sub-data router.
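
The selection rule in this abstract resembles a best-fit placement. The following is a minimal sketch under stated assumptions: the `SubDataRouter` class and `assign_task` function are hypothetical names, and the behavior when the lowest-capacity router lacks memory (falling through to the next candidate) is an assumption, since the abstract does not describe that case.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class SubDataRouter:
    name: str
    cpu_capacity: float      # available CPU capacity (arbitrary units)
    memory_capacity: float   # available memory capacity (arbitrary units)


def assign_task(
    routers: Sequence[SubDataRouter], est_cpu: float, est_mem: float
) -> Optional[SubDataRouter]:
    # Keep only routers that can accommodate both estimates.
    candidates = [
        r for r in routers
        if r.cpu_capacity >= est_cpu and r.memory_capacity >= est_mem
    ]
    # Among those, pick the one with the lowest sufficient CPU capacity.
    # Returns None when no router fits (an assumed convention).
    return min(candidates, key=lambda r: r.cpu_capacity, default=None)


routers = [
    SubDataRouter("a", cpu_capacity=8, memory_capacity=16),
    SubDataRouter("b", cpu_capacity=4, memory_capacity=8),
    SubDataRouter("c", cpu_capacity=2, memory_capacity=32),
]
chosen = assign_task(routers, est_cpu=3, est_mem=4)
```

Choosing the smallest router that still fits leaves larger routers free for heavier tasks, which is the usual motivation for best-fit placement.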

Type: Grant

Filed: July 28, 2016

Date of Patent: February 1, 2022

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Sheldon Kent Meredith, William Cottrill, Juliette Zerick
Heterogeneous parallel primitives programming model

Patent number: 11231962

Abstract: With the success of programming models such as OpenCL and CUDA, heterogeneous computing platforms are becoming mainstream. However, these heterogeneous systems are low-level, not composable, and their behavior is often implementation defined even for standardized programming models. In contrast, the method and system embodiments for the heterogeneous parallel primitives (HPP) programming model disclosed herein provide a flexible and composable programming platform that guarantees behavior even in the case of developing high-performance code.

Type: Grant

Filed: October 30, 2017

Date of Patent: January 25, 2022

Assignee: Advanced Micro Devices, Inc.

Inventors: Benedict R. Gaster, Lee W. Howes
Graph database management system and method for a distributed computing environment

Patent number: 11222072

Abstract: A graph database management system includes a computing system in communication with a distributed computing environment comprising a plurality of elements and a database that stores element records associated with corresponding elements of the distributed computing environment. The computing system generates a graph database having a plurality of vertices representing the element records of the distributed computing environment and at least one edge representing a specified relationship between at least one pair of the element records. Thereafter, the computing system may receive a request to view the vertices associated with the at least one pair of element records and their associated edge, and facilitate the display of the vertices and their associated edge on a display in response to the request.

Type: Grant

Filed: July 17, 2015

Date of Patent: January 11, 2022

Assignee: EMC IP Holding Company LLC

Inventor: Geoffrey D. Bourne
Completing decision logic to avoid a side effect

Patent number: 11144840

Abstract: An approach is provided for completing a decision logic. For statements in a syntax tree of the decision logic and using a symbolic execution technique, path expression(s) that refer to respective input object(s) are identified. A statement in the decision logic is detected that modifies an attribute value of a path expression included in the path expression(s) and that refers to an input object included in the input object(s). A copy instruction is inserted as a new node in the syntax tree so that the attribute value of the path expression is a copy of the input object. Responsive to inserting the copy instruction, the path expression is prevented from modifying the input object.

Type: Grant

Filed: July 26, 2018

Date of Patent: October 12, 2021

Assignee: International Business Machines Corporation

Inventors: Jean-Michel G. B. Bernelas, Ulrich M. Junker, Remi Van Keisbelck
Automated concurrency and repetition with minimal syntax

Patent number: 11113064

Abstract: A processor core receives a request to execute application code including a trigger instruction and an instruction block that reads a row of data values from a data structure and outputs a data value from a function using the row as input. The data structure is divided into multiple portions and the trigger instruction indicates that multiple instances of the instruction block are to be executed concurrently. In response to the request and to identification of the instruction block and trigger instruction, the processor core generates multiple instances of a support block that causes independent repetitive execution of each instance of the instruction block until all rows of the corresponding portion of the data structure are used as input. The processor core assigns instances of the instruction and support blocks to multiple processor cores, and provides each instance of the instruction block with the corresponding portion of the data structure.
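
The division of work described here can be approximated in ordinary Python. This is a rough sketch, not the patented mechanism: threads stand in for processor cores, and `split_rows`, `support_block`, and `run_concurrently` are hypothetical names. The "support block" is modeled as a loop that applies the instruction block to every row of its assigned portion.

```python
from concurrent.futures import ThreadPoolExecutor


def split_rows(rows, parts):
    # Divide rows into `parts` contiguous portions of near-equal size.
    size, extra = divmod(len(rows), parts)
    portions, start = [], 0
    for i in range(parts):
        end = start + size + (1 if i < extra else 0)
        portions.append(rows[start:end])
        start = end
    return portions


def support_block(instruction_block, portion):
    # Repeat the instruction block until every row of the portion is used.
    return [instruction_block(row) for row in portion]


def run_concurrently(instruction_block, rows, workers):
    portions = split_rows(rows, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # One support-block instance per portion; map preserves order.
        chunks = list(
            pool.map(lambda p: support_block(instruction_block, p), portions)
        )
    return [value for chunk in chunks for value in chunk]


table = [[1, 2], [3, 4], [5, 6], [7, 8], [9, 10]]
row_sums = run_concurrently(sum, table, workers=2)
```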

Type: Grant

Filed: November 27, 2020

Date of Patent: September 7, 2021

Assignee: SAS INSTITUTE INC.

Inventors: Jack Joseph Rouse, Robert William Pratt, Jared Carl Erickson, Manoj Keshavmurthi Chari
Queries based on ranges of hash values

Patent number: 11106672

Abstract: A system includes a database client, and a distributed database comprising database nodes. The distributed database may receive a database query from the client, determine that the query comprises a range of hash values of a table partition stored by a node of the distributed database, and determine that the range of hash values is not stored by other nodes of the distributed database. Responsive to determining that the range of hash values of the query is stored by the node and not by the other nodes, the database may generate an optimized distributed execution plan that includes the node that stores the range of hash values and excludes the nodes that do not include the range of hash values.
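
The node pruning step can be illustrated with a short sketch. It assumes each node stores one contiguous, half-open range of hash values, which the abstract does not state; `Node` and `plan_nodes` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Node:
    name: str
    hash_lo: int  # inclusive lower bound of stored hash values
    hash_hi: int  # exclusive upper bound of stored hash values


def plan_nodes(nodes: Sequence[Node], query_lo: int, query_hi: int) -> List[str]:
    # Include a node only if its stored range overlaps the queried
    # half-open range [query_lo, query_hi); all other nodes are excluded
    # from the execution plan.
    return [
        n.name for n in nodes
        if n.hash_lo < query_hi and query_lo < n.hash_hi
    ]


cluster = [Node("n0", 0, 100), Node("n1", 100, 200), Node("n2", 200, 300)]
plan = plan_nodes(cluster, 120, 180)
```

When the queried range falls entirely inside one node's range, the plan contains that node alone, matching the case the abstract describes.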

Type: Grant

Filed: September 25, 2015

Date of Patent: August 31, 2021

Assignee: MICRO FOCUS LLC

Inventors: Rui Liu, Qiming Chen, Jeff Lefevre, Malu G. Castellanos, Meichun Hsu
Information processing apparatus, control method, and program to control allocation of computer resources for different types of tasks

Patent number: 11093281

Abstract: An information processing apparatus determines computer resources to be allocated to each task execution entity, based on upper limit value information and processing amount information. The upper limit value information indicates an upper limit value of the total amount of computer resources to be allocated to all task execution entities. The processing amount information indicates an amount of tasks to be processed by each task execution entity.
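
One simple way to combine an upper limit with per-entity processing amounts is a proportional split. The abstract does not say the allocation is proportional, so the rule below is an assumption, and `allocate` is a hypothetical name. The sketch distributes whole resource units and never exceeds the upper limit.

```python
from typing import Dict


def allocate(upper_limit: int, pending: Dict[str, int]) -> Dict[str, int]:
    # pending maps each task execution entity to its amount of queued work.
    total = sum(pending.values())
    if total == 0:
        return {name: 0 for name in pending}
    # Integer share proportional to each entity's processing amount.
    shares = {
        name: upper_limit * amount // total
        for name, amount in pending.items()
    }
    # Hand out units lost to rounding, largest remainder first.
    leftover = upper_limit - sum(shares.values())
    by_remainder = sorted(
        pending, key=lambda n: (upper_limit * pending[n]) % total, reverse=True
    )
    for name in by_remainder[:leftover]:
        shares[name] += 1
    return shares


shares = allocate(8, {"resize": 3, "encode": 1})
```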

Type: Grant

Filed: March 29, 2019

Date of Patent: August 17, 2021

Assignee: NEC CORPORATION

Inventor: Takashi Yagi
System and method of storing and analyzing information

Patent number: 10990587

Abstract: A system and method of storing and analyzing information is disclosed. The system includes a compiler layer to convert user queries to data parallel executable code. The system further includes a library of multithreaded algorithms, processes, and data structures. The system also includes a multithreaded runtime library for implementing compiled code at runtime. The executable code is dynamically loaded on computing elements and contains calls to the library of multithreaded algorithms, processes, and data structures and the multithreaded runtime library.

Type: Grant

Filed: October 22, 2018

Date of Patent: April 27, 2021

Assignee: Battelle Memorial Institute

Inventors: John T. Feo, David J. Haglin, Alessandro Morari, Antonino Tumeo, Oreste Villa, Jesse R. Weaver
Joint compilation method and system for heterogeneous hardware architecture

Patent number: 10963229

Abstract: The present invention provides a joint compilation method and system for a heterogeneous hardware architecture. The method comprises steps of: determining, according to calculation characteristics of heterogeneous units in the hardware architecture, a strategy for dividing an overall calculation task graph into a plurality of subtasks, and allocating the plurality of divided subtasks to corresponding heterogeneous unit compilers for compilation to generate corresponding target machine instruction codes; and, linking the generated target machine instruction codes to form a set of machine instruction codes oriented to the heterogeneous hardware architecture. With the joint compilation method and system of the present invention, an executable program body, which can run on a heterogeneous hardware architecture system and be mixed with hardware machine instruction codes of various heterogeneous units at different levels, can be automatically compiled, optimized and generated by activating one compilation.

Type: Grant

Filed: July 26, 2019

Date of Patent: March 30, 2021

Assignee: SHANGHAI DENGLIN TECHNOLOGIES CO., LTD

Inventors: Chenhui Wang, Fan Peng, Xiaoquan Li, Can Li, Ping Wang
Systems and methods for generating code for parallel processing units

Patent number: 10949182

Abstract: Systems and methods generate code from a source program where the generated code may be compiled and executed on a Graphics Processing Unit (GPU). A parallel loop analysis check may be performed on regions of the source program identified for parallelization. One or more optimizations also may be applied to the source program that convert mathematical operations into a parallel form. The source program may be partitioned into segments for execution on a host and a device. Kernels may be created for the segments to be executed on the device. The size of the kernels may be determined, and memory transfers between the host and device may be optimized.

Type: Grant

Filed: November 17, 2017

Date of Patent: March 16, 2021

Assignee: The MathWorks, Inc.

Inventors: Girish Venkataramani, Rama P. Kokku, Jayaprabha Shankar, James L. Brock, Chun-Yu Shei, Vijaya Raghavan
Loading models on nodes having multiple model service frameworks

Patent number: 10929191

Abstract: This disclosure relates to model loading. In one aspect, a method includes determining, based on a preset execution script and resource information of multiple execution nodes, loading-tasks corresponding to the execution nodes. Each execution node is deployed on a corresponding cluster node. Loading requests are sent to the execution nodes, thereby causing the execution nodes to start execution processes based on the corresponding loading requests. The execution processes start multiple model service frameworks on each cluster node. Multiple models are loaded onto each of the model service frameworks. Each loading request includes loading-tasks corresponding to the execution node to which the loading request was sent. The execution processes include a respective execution process for each model service framework.

Type: Grant

Filed: July 27, 2020

Date of Patent: February 23, 2021

Assignee: Advanced New Technologies Co., Ltd.

Inventors: Yueming Wang, Jiliang Li
Runtime GPU/CPU selection

Patent number: 10929161

Abstract: A method, computer program product, and system include a processor(s) obtaining, during runtime, from a compiler, two versions of a data parallel loop for an operation. The host computing system includes a CPU, and a GPU is accessible to the host. The processor(s) online profiles the two versions by asynchronously executing the first version, in a profile mode, with the GPU and executing the second version, in the profile mode, with the CPU. The processor(s) generates execution times for the first version and the second version. The processor(s) stores the execution times and performance data in a storage, where the performance data comprises a size of the data parallel loop for the operation. The processor(s) updates a regression model(s) to predict performance numbers for a process of an unknown loop size. The processor(s) executes the operation with the CPU or the GPU based on the performance data.
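
The idea of predicting run time from loop size and picking the faster device can be sketched as follows. The abstract does not specify the form of the regression model, so this example assumes a per-device least-squares line fitted to recorded (loop size, execution time) pairs; `DeviceSelector` and its methods are hypothetical names, and the timings are made-up illustration values rather than measurements.

```python
from collections import defaultdict


class DeviceSelector:
    def __init__(self):
        # device name -> list of (loop_size, seconds) profile samples
        self.samples = defaultdict(list)

    def record(self, device, loop_size, seconds):
        self.samples[device].append((loop_size, seconds))

    def predict(self, device, loop_size):
        points = self.samples[device]
        if not points:
            raise ValueError(f"no profile data for {device}")
        n = len(points)
        mean_x = sum(x for x, _ in points) / n
        mean_y = sum(y for _, y in points) / n
        var_x = sum((x - mean_x) ** 2 for x, _ in points)
        if var_x == 0:
            return mean_y  # only one distinct loop size seen so far
        slope = sum((x - mean_x) * (y - mean_y) for x, y in points) / var_x
        return mean_y + slope * (loop_size - mean_x)

    def choose(self, loop_size):
        # Pick the device with the lower predicted execution time.
        return min(("cpu", "gpu"), key=lambda d: self.predict(d, loop_size))


selector = DeviceSelector()
# Illustrative timings: the GPU pays a fixed launch cost but scales better.
selector.record("cpu", 1_000, 0.001)
selector.record("cpu", 10_000, 0.010)
selector.record("gpu", 1_000, 0.0011)
selector.record("gpu", 10_000, 0.0020)
small_choice = selector.choose(100)
large_choice = selector.choose(1_000_000)
```

With these sample values the fitted lines cross near a loop size of about 1,100, so small loops stay on the CPU and large loops go to the GPU.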

Type: Grant

Filed: August 27, 2019

Date of Patent: February 23, 2021

Assignee: International Business Machines Corporation

Inventors: Gita Koblents, Alon Shalev Housfater, Kazuaki Ishizaki, Akihiro Hayashi
Techniques for context switching using distributed compute workload parsers

Patent number: 10901777

Abstract: Techniques are disclosed relating to context switching using distributed compute workload parsers. In some embodiments, an apparatus includes a plurality of shader units configured to perform operations for compute workgroups included in compute kernels, a plurality of distributed workload parser circuits each configured to dispatch workgroups to a respective set of the shader units, a communications fabric, and a master workload parser circuit configured to communicate with the distributed workload parser circuits via the communications fabric. In some embodiments, the master workload parser circuit maintains a first set of master state information that does not change for a compute kernel based on operations by the shader units and a second set of master state information that may be changed by operations specified by the kernel. In some embodiments, the master workload parser circuit performs a multi-phase state storage process in communications with the distributed workload parser circuits.

Type: Grant

Filed: September 26, 2018

Date of Patent: January 26, 2021

Assignee: Apple Inc.

Inventors: Andrew M. Havlir, Jeffrey T. Brady
Method of executing a tuple graph program across a network

Patent number: 10887235

Abstract: A programming model provides a method for executing a program in a distributed architecture. One or more first shards of the distributed architecture execute one or more operations and send tuples to at least one second shard, the tuples being part of a stream and being based on the one or more operations. The one or more first shards send a token value to the at least one second shard when the sending of the tuples in the stream is complete. The at least one second shard determines whether a total of the token values matches a number of the one or more first shards, and takes a first action in response to determining that the total of the token values matches the number of the one or more first shards. The first action may include marking the stream as being complete and/or generating a message indicating that the stream is complete.
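
The completion check on the receiving side can be sketched in a few lines. This example assumes each sending shard contributes a token value of 1 when it finishes, which is one way for the total to match the number of senders; `ReceivingShard` and its methods are hypothetical names.

```python
class ReceivingShard:
    def __init__(self, num_senders):
        self.num_senders = num_senders  # number of first (sending) shards
        self.token_total = 0
        self.tuples = []
        self.complete = False

    def on_tuple(self, item):
        # Tuples of the stream may arrive from any sender in any order.
        self.tuples.append(item)

    def on_token(self, value=1):
        # A sender emits its token once it has sent all of its tuples.
        self.token_total += value
        if self.token_total == self.num_senders:
            # First action: mark the stream as complete.
            self.complete = True
        return self.complete


shard = ReceivingShard(num_senders=2)
shard.on_tuple(("k1", 10))
first = shard.on_token()   # first sender done
shard.on_tuple(("k2", 20))
second = shard.on_token()  # second sender done
```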

Type: Grant

Filed: August 24, 2017

Date of Patent: January 5, 2021

Assignee: Google LLC

Inventors: Gautham Thambidorai, Matthew Rosencrantz, Sanjay Ghemawat, Srdjan Petrovic, Ivan Posva
Emergency accurate control method and system for large-scale interruptible loads

Patent number: 10886740

Abstract: Provided is an emergency accurate control method and system for large-scale interruptible loads. The method includes: acquiring, by a region control master station, a sheddable load sequence table; acquiring, by the region control master station, a first to-be-shed load; performing, by the region control master station, minimum under-shedding matching layer by layer according to the first to-be-shed load, and shedding a sheddable load corresponding to a control substation matching the first to-be-shed load; and sending, by the region control master station, a second to-be-shed load to the corresponding control substation for load shedding if the second to-be-shed load exists.

Type: Grant

Filed: May 10, 2019

Date of Patent: January 5, 2021

Assignees: State Grid Jiangsu Electric Power Co., Ltd., Nari Technology Co., Ltd.

Inventors: Jijun Yin, Qing Chen, Gang Chen, Xiao Lu, Jianyu Luo, Haifeng Li, Xueming Li, Kaiming Luo, Lin Liu, Yunsong Yan, Yefeng Jiang, Jianfeng Ren, Haifeng Xia
Dependency-based streamlined processing

Patent number: 10853079

Abstract: A method and computer program product for performing a plurality of processing operations. A plurality of processor nodes each include one or more operational instances. Each processor node includes criteria for generating its operational instances. The processor nodes are linked together in a directed acyclic processing graph in which dependent nodes use data from the operational instances of upstream nodes to perform a node-specific set of processing operations. Dependency relationships between the processor nodes are defined on an operational instance basis, where operational instances in dependent processor nodes identify data associated with, or generated by, specific upstream operational instances that is used to perform the node-specific set of operations for that dependent operational instance. The processing graph may also include connector nodes defining instance-level dependency relationships between processor nodes.

Type: Grant

Filed: September 19, 2019

Date of Patent: December 1, 2020

Assignee: Side Effects Software Inc.

Inventors: Ken Xu, Taylor James Petrick
