For A Parallel Or Multiprocessor System Patents (Class 717/149)
  • Patent number: 11972238
    Abstract: Methods, systems, and apparatus for propagating reduced-precision on computation graphs are described. In one aspect, a method includes receiving data specifying a directed graph that includes operators for a program. The operators include first operators that each represent a numerical operation performed on numerical values having a first level of precision and second operators that each represent a numerical operation performed on numerical values having a second level of precision. One or more downstream operators are identified for a first operator. A determination is made whether each downstream operator represents a numerical operation that is performed on input values having the second level of precision. Whenever each downstream operator represents a numerical operation that is performed on input values having the second level of precision, a precision of numerical values output by the operation represented by the first operator is adjusted to the second level of precision.
    Type: Grant
    Filed: June 13, 2022
    Date of Patent: April 30, 2024
    Assignee: Google LLC
    Inventor: Yuanzhong Xu
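The precision-propagation rule in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the `Operator` class, the `propagate_reduced_precision` function, and the `float32`/`bfloat16` labels are all hypothetical names chosen for the example, and the sketch makes a single pass over the graph.

```python
from dataclasses import dataclass, field


@dataclass
class Operator:
    """Hypothetical graph node; field names are assumptions for illustration."""
    name: str
    input_precision: str    # precision of the values the operation consumes
    output_precision: str   # precision of the values the operation produces
    downstream: list = field(default_factory=list)  # names of consuming operators


def propagate_reduced_precision(graph, high="float32", low="bfloat16"):
    """Lower the output precision of `high` operators whose consumers all take `low` inputs.

    Returns the names of the operators that were adjusted.
    """
    adjusted = []
    for op in graph.values():
        # Only operators that currently emit the first (higher) precision
        # and have at least one downstream operator are candidates.
        if op.output_precision != high or not op.downstream:
            continue
        # Adjust only when every downstream operator works on low-precision inputs.
        if all(graph[name].input_precision == low for name in op.downstream):
            op.output_precision = low
            adjusted.append(op.name)
    return adjusted


graph = {
    "matmul": Operator("matmul", "float32", "float32", ["relu", "add"]),
    "relu": Operator("relu", "bfloat16", "bfloat16"),
    "add": Operator("add", "bfloat16", "bfloat16"),
    "norm": Operator("norm", "float32", "float32", ["softmax"]),
    "softmax": Operator("softmax", "float32", "float32"),
}
adjusted = propagate_reduced_precision(graph)
```

Here `matmul` is adjusted because both of its consumers take reduced-precision inputs, while `norm` is left alone because `softmax` still expects full precision.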
  • Patent number: 11966695
    Abstract: Methods, systems and computer program products for implementing a mere-parser are disclosed. Text data is processed to generate one or more parse items. A boundary based attribute associated with one of the parse items is identified, and the identified mere attribute is associated with one or more of the remaining parse items that is not blocked from being associated with the boundary based attribute.
    Type: Grant
    Filed: October 27, 2020
    Date of Patent: April 23, 2024
    Assignee: Optum360, LLC
    Inventors: Daniel T. Heinze, Mark L. Morsch
  • Patent number: 11960734
    Abstract: Systems and methods described herein may relate to providing a dynamically configurable circuitry able to be programmed using a microsector granularity. Furthermore, selective partial reconfiguration operations may be performed using write operations to write a new configuration over existing configurations to selectively reprogram a portion of programmable logic. A quasi-delay insensitive (QDI) shift register and/or control circuitry receiving data and commands from an access register disposed between portions of programmable logic may enable at least some of the operations described.
    Type: Grant
    Filed: September 25, 2020
    Date of Patent: April 16, 2024
    Assignee: Intel Corporation
    Inventors: Sean R Atsatt, Ilya K. Ganusov
  • Patent number: 11914989
    Abstract: Techniques for specifying and implementing a software application targeted for execution on a multiprocessor array (MPA). The MPA may include a plurality of processing elements, supporting memory, and a high bandwidth interconnection network (IN), communicatively coupling the plurality of processing elements and supporting memory. In some embodiments, software code may specify one or more cell definitions that include: program instructions executable to perform a function and one or more language constructs. The software code may further instantiate first, second, and third cell instances, each of which is an instantiation of one of the one or more cell definitions, where the instantiation includes configuration of the one or more language constructs such that: the first and second cell instances communicate via respective communication ports and the first and second cell instances are included in the third cell instance.
    Type: Grant
    Filed: October 28, 2021
    Date of Patent: February 27, 2024
    Assignee: Coherent Logix, Incorporated
    Inventors: Stephen E. Lim, Viet N. Ngo, Jeffrey M. Nicholson, John Mark Beardslee, Teng-I Wang, Zhong Qing Shang, Michael Lyle Purnell
  • Patent number: 11915149
    Abstract: Provided are a system for managing a calculation processing graph of an artificial neural network and a method of managing a calculation processing graph by using the system. A system for managing a calculation processing graph of an artificial neural network run by a plurality of heterogeneous resources includes: a task manager configured to allocate the plurality of heterogeneous resources to a first subgraph and a second subgraph that are to be run, the first subgraph and the second subgraph being included in the calculation processing graph; a first compiler configured to compile the first subgraph to be executable on a first resource among the plurality of heterogeneous resources; and a second compiler configured to compile the second subgraph to be executable on a second resource among the plurality of heterogeneous resources, wherein the first subgraph and the second subgraph are respectively managed through separate calculation paths.
    Type: Grant
    Filed: September 5, 2019
    Date of Patent: February 27, 2024
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Seung-Soo Yang
  • Patent number: 11907693
    Abstract: A job decomposition processing method for distributed computing, which comprises: analyzing a source program to be run by program static analysis to determine a function call graph contained in the source program; determining feature information of functions contained in the source program by program dynamics analysis and/or a program intelligent decomposition algorithm, wherein the feature information of the functions is used to characterize relevant information when each function is running; decomposing the source program based on the feature information of the functions, a function relationship and available resource information of a computing platform to form an execution recommendation for each function on the computing platform, i.e., which hardware resources are used for computing each function; finally inserting a modifier in the source program and starting computation on the computing platform.
    Type: Grant
    Filed: February 17, 2023
    Date of Patent: February 20, 2024
    Assignee: ZHEJIANG LAB
    Inventors: Wenyuan Bai, Feng Gao
  • Patent number: 11900157
    Abstract: An embodiment of a semiconductor package apparatus may include technology to manage one or more virtual graphic processor units, and co-schedule the one or more virtual graphic processor units based on both general processor instructions and graphics processor instructions. Other embodiments are disclosed and claimed.
    Type: Grant
    Filed: September 19, 2018
    Date of Patent: February 13, 2024
    Assignee: Intel Corporation
    Inventors: Yan Zhao, Zhi Wang, Weinan Li
  • Patent number: 11892966
    Abstract: Systems, methods, and apparatuses are described that enable IC architectures to enable a single anchor to connect to and accept a variety of chiplets at any port by way of a programming model that enables the anchor or chiplet to dynamically adapt to configurations, requirements, or aspects of any coupled component and provide an interface for the coupled components.
    Type: Grant
    Filed: December 14, 2021
    Date of Patent: February 6, 2024
    Assignee: XILINX, INC.
    Inventors: Krishnan Srinivasan, Ygal Arbel, Sagheer Ahmad
  • Patent number: 11886848
    Abstract: A method, system, and computer-readable medium for binary translation cause a binary translator to combine raw binary code and compiler-produced metadata associated with a compiled program module. The binary translator is caused to further reconcile, using the compiler-produced metadata, original compiler-produced control flow information with how lower-level machine instructions comprise a control flow in the raw binary code, and original compiler-produced aliasing information with how lower-level machine instructions access the memory locations described by the aliasing information according to predetermined criteria. The binary translator is further caused to prevent copy propagation of values in temporary variables for decimal computations beyond offsets in the machine instructions where the temporary variables are killed.
    Type: Grant
    Filed: May 25, 2022
    Date of Patent: January 30, 2024
    Assignee: International Business Machines Corporation
    Inventors: Toshihiko Koju, Reid Copeland, David Kevin Siegwart, Jordan Ryan Zannier, Allan H. Kielstra
  • Patent number: 11868473
    Abstract: A method for constructing behavioral software signatures. The method includes: embedding execution traces of a set of software in a vector space, an execution trace of a software agent including at least one event and being representative of the execution of the software, the embedding representing an event of the execution trace by a vector encoding a context for occurrence of the event; partitioning the vectors associated with the software of the set to generate a data group representative of a behavior, a behavioral label being associated with the data group; associating a behavioral label with a vector, which is representative of the data group to which the vector belongs, and associating a trace of behavioral labels with a trace of vectors, the trace of labels being representative of execution of a software agent, and extracting in the trace of labels at least one behavioral signature associated with the software.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: January 9, 2024
    Assignee: ORANGE
    Inventors: Baptiste Olivier, Xiao Han
  • Patent number: 11842265
    Abstract: Disclosed is a processor chip configured to perform neural network processing. The processor chip includes a memory, a first processor configured to perform neural network processing on data stored in the memory, a second processor and a third processor, and the second processor is configured to transmit a control signal to the first processor and the third processor to cause the first processor and the third processor to perform an operation.
    Type: Grant
    Filed: June 19, 2020
    Date of Patent: December 12, 2023
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Yongmin Tai, Insang Cho, Wonjae Lee, Chanyoung Hwang
  • Patent number: 11842217
    Abstract: Mechanisms for resource isolation allow tenants executing in a multi-tenant software container to be isolated in order to prevent resource starvation by one or more of the tenants. Mechanisms for dependency isolation may be utilized to prevent one tenant executing in a multi-tenant software container from using another tenant in the same container in a manner that requires co-tenancy. Mechanisms for security isolation may be utilized to prevent one tenant in a multi-tenant software container from accessing protected data or functionality of another tenant. Mechanisms for fault isolation may be utilized to prevent tenants in a multi-tenant software container from causing faults or other types of errors that affect other tenants executing in the same software container.
    Type: Grant
    Filed: July 29, 2021
    Date of Patent: December 12, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Keian Christopher, Kevin Michael Beranek, Christopher Keakini Kaulia, Vijay Ravindra Kulkarni, Samuel Leonard Moniz, Kyle Bradley Peterson, Ajit Ashok Varangaonkar, Jun Xu
  • Patent number: 11816132
    Abstract: Disclosed are a method and a system for optimizing data storage of query statistics of a graph database. The method includes: periodically scanning, on storage servers in which partitions are located, all edges in the partitions; determining, according to all the edges in the partitions, partitions to which start points and end points belong, and calculating outgoing-edge correlation and incoming-edge correlation between partitions; calculating relevancies between partitions through a preset correlation matrix weight according to the outgoing-edge correlation and the incoming-edge correlation between partitions; and storing partitions with high relevancies on a same storage server.
    Type: Grant
    Filed: September 22, 2021
    Date of Patent: November 14, 2023
    Assignee: Vesoft Inc.
    Inventors: Tong Yue, Xiaomeng Ye, Yujue Wang, Yu Liu, Min Wu, Chenguang Wang
  • Patent number: 11809849
    Abstract: In one example, a method performed by a compiler comprises: receiving a dataflow graph of a neural network, the neural network comprising a neural network operator; receiving information of computation resources and memory resources of a neural network hardware accelerator intended to execute the neural network operator; determining, based on the dataflow graph, iterations of an operation on elements of a tensor included in the neural network operator; determining, based on the information, a mapping between the elements of the tensor to addresses in the portion of the local memory, and a number of the iterations of the operation to be included in a batch, wherein the number of the iterations in the batch are to be executed in parallel by the neural network hardware accelerator; and generating a schedule of execution of the batches of the iterations of the operations.
    Type: Grant
    Filed: May 20, 2021
    Date of Patent: November 7, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Hongbin Zheng, Randy Renfu Huang, Robert Geva
  • Patent number: 11775407
    Abstract: The present disclosure relates to systems, methods, and computer readable media for diagnosing and mitigating memory impact events, such as memory leaks, high memory usage, or other memory issues preventing a host node from performing as expected on a cloud computing system. The systems described herein involve receiving locally generated memory usage data from a plurality of host nodes. The systems described herein may aggregate the memory usage data and determine a memory impact diagnosis based on a subset of the aggregated memory usage data. The systems described herein may further apply a mitigation model for mitigating the memory impact event. The systems described herein provide an end-to-end solution for diagnosing and mitigating a variety of memory issues using a dynamic and scalable system that reduces a negative impact of memory leaks and other memory issues on a cloud computing system.
    Type: Grant
    Filed: March 7, 2022
    Date of Patent: October 3, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Cong Chen, Xinsheng Yang, Yingnong Dang, Si Qin
  • Patent number: 11775297
    Abstract: In a system providing transactional memory support, a transaction nesting depth testing instruction is provided for triggering processing circuitry to set at least one status value to one of a plurality of states depending on a transaction nesting depth indicative of a number of executed transaction start instructions of a given thread for which the corresponding transaction remains unaborted and uncommitted, the plurality of states including a first state selected when the transaction nesting depth is 1 and at least one further state selected when the transaction nesting depth is greater than or less than 1. The supported ISA enables the setting of the at least one status value and a conditional branch conditional on the at least one status value being in the first state to be performed in response to a single transaction nesting depth testing instruction and a single conditional branch instruction.
    Type: Grant
    Filed: August 21, 2018
    Date of Patent: October 3, 2023
    Assignee: Arm Limited
    Inventors: Grigorios Magklis, Matthew James Horsnell, Stephan Diestelhorst
  • Patent number: 11720080
    Abstract: An optimum combination of a loop unrolling number and a circuit parallel number in a high-level synthesis is determined. A circuit synthesis information generation unit sets, as parameter candidates, a plurality of combinations of a loop unrolling number and a circuit parallel number to generate circuit synthesis information indicating a synthesis circuit obtained by high-level synthesis processing for each of the combinations. An optimum parameter determination unit calculates, for each piece of the generated circuit synthesis information, an estimation processing performance related to the synthesis circuit indicated by the circuit synthesis information, and determines an optimum combination of the loop unrolling number and the circuit parallel number based on the circuit synthesis information based on which a maximum estimation processing performance is obtained.
    Type: Grant
    Filed: May 21, 2019
    Date of Patent: August 8, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Syuhei Yoshida, Yuta Ukon, Koji Yamazaki, Koyo Nitta
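The parameter search described in the abstract above amounts to an exhaustive search over candidate combinations, keeping the one with the highest estimated performance. The sketch below illustrates only that search loop. The `synthesize` and `estimate_performance` callables, the area budget, and the toy cost model are assumptions made up for the example and stand in for real high-level synthesis output.

```python
from itertools import product

AREA_BUDGET = 1600  # hypothetical resource limit for the toy model


def toy_synthesize(unroll, parallel):
    """Stand-in for high-level synthesis; returns made-up circuit synthesis information."""
    return {
        "area": 100 * unroll * parallel,   # more unrolling and more copies cost more area
        "cycles": 1024 // unroll + 10,     # unrolling shortens the loop
        "parallel": parallel,
    }


def toy_estimate_performance(info):
    """Estimated throughput of the toy circuit; zero if it does not fit the budget."""
    if info["area"] > AREA_BUDGET:
        return 0.0
    return info["parallel"] / info["cycles"]


def choose_parameters(unroll_candidates, parallel_candidates,
                      synthesize, estimate_performance):
    """Return (performance, unroll, parallel) for the best-scoring combination."""
    best = None
    for unroll, parallel in product(unroll_candidates, parallel_candidates):
        info = synthesize(unroll, parallel)
        perf = estimate_performance(info)
        if best is None or perf > best[0]:
            best = (perf, unroll, parallel)
    return best


best = choose_parameters([1, 2, 4, 8], [1, 2, 4, 8],
                         toy_synthesize, toy_estimate_performance)
```

Under this toy model the search settles on an unrolling number of 2 with 8 parallel circuits, since heavier unrolling leaves no room in the area budget for as many copies.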
  • Patent number: 11714875
    Abstract: Systems, methods, and apparatuses relating to a matrix operations accelerator are described.
    Type: Grant
    Filed: December 28, 2019
    Date of Patent: August 1, 2023
    Assignee: Intel Corporation
    Inventors: Amit Gradstein, Simon Rubanovich, Sagi Meller, Saeed Kharouf, Gavri Berger, Zeev Sperber, Jose Yallouz, Ron Schneider
  • Patent number: 11663175
    Abstract: Systems, methods, and software are disclosed herein for facilitating deployment of a decision service for sharing application data among multiple isolated applications executing on one or more application platforms. In an implementation, a method of deploying applications conforming to a platform schema for facilitating sharing of the application data among isolated applications executing on one or more application platforms is described. The method includes receiving a request to submit a third party application to an application deployment system, identifying a validation manifest associated with a platform schema responsive to receiving the request, and automatically verifying that the third party application conforms to the platform schema by performing a set of pre-defined validation checks. The request identifies the platform schema and platform capability information associated with the third party application. The validation manifest includes the set of pre-defined validation checks.
    Type: Grant
    Filed: August 28, 2019
    Date of Patent: May 30, 2023
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: David Mowatt, Stephen O'Driscoll
  • Patent number: 11636485
    Abstract: Parallelized computation by a real-time transaction scoring system that incorporates global behavior profiling of transacting entities includes dividing a global profile computing component of a transaction scoring model of a real-time behavioral analytics transaction scoring system into a plurality of global profile component instances. The transaction scoring model uses a plurality of global profile variables, each of the plurality of global profile component instances using its own global profile partition that contains the estimate of global profile variables and being configured for update by a dedicated thread of execution of the real-time transaction scoring system, each dedicated thread being configured for receiving and scoring a portion of input transactions.
    Type: Grant
    Filed: April 6, 2018
    Date of Patent: April 25, 2023
    Assignee: Fair Isaac Corporation
    Inventors: Scott Michael Zoldi, Alexei Betin
  • Patent number: 11604758
    Abstract: Systems and methods for automated systolic array design from a high-level program are disclosed. One implementation of a systolic array design supporting a convolutional neural network includes a two-dimensional array of reconfigurable processing elements arranged in rows and columns. Each processing element has an associated SIMD vector and is connected through a local connection to at least one other processing element. An input feature map buffer having a double buffer is configured to store input feature maps, and an interconnect system is configured to pass data to neighboring processing elements in accordance with a processing element scheduler. A CNN computation is mapped onto the two-dimensional array of reconfigurable processing elements using an automated system configured to determine suitable reconfigurable processing element parameters.
    Type: Grant
    Filed: November 12, 2020
    Date of Patent: March 14, 2023
    Assignee: Xilinx, Inc.
    Inventors: Peng Zhang, Cody Hao Yu, Xuechao Wei, Peichen Pan
  • Patent number: 11579852
    Abstract: System and method of compiling a program having a mixture of host code and device code to enable Profile Guided Optimization (PGO) for device code execution. An exemplary integrated compiler can compile source code programmed to be executed by a host processor (e.g., CPU) and a co-processor (e.g., a GPU) concurrently. The compilation can generate an instrumented executable code which includes: profile instrumentation counters for the device functions; and instructions for the host processor to allocate and initialize device memory for the counters and to retrieve collected profile information from the device memory to generate instrumentation output. The output is fed back to the compiler for compiling the source code a second time to generate optimized executable code for the device functions defined in the source code.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: February 14, 2023
    Assignee: NVIDIA Corporation
    Inventors: Hariharan Sandanagobalane, Sean Lee, Vinod Grover
  • Patent number: 11556319
    Abstract: Systems and methods are described for extending a live range for a virtual scalar register during compiling of a program, comprising: receiving an intermediate representation (IR) of a source code configured for implementing single-instruction-multiple-thread (SIMT) execution, the IR representing the source code as a control flow graph including a plurality of basic blocks (BB); and when a virtual scalar register defined in a first BB of the IR is last used in a second BB of the IR that is a divergent BB, modifying the IR to extend the live range of the virtual scalar register.
    Type: Grant
    Filed: September 1, 2020
    Date of Patent: January 17, 2023
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Abraham Davidson Fai Chung Chan, Tyler Bryce Nowicki, Guansong Zhang, Ahmed Mohammed ElShafiey Mohammed Eltantawy
  • Patent number: 11550579
    Abstract: A system includes processor hardware and memory hardware that stores instructions. The instructions include, in response to receiving a request, determining a request type of the request, retrieving a first set of collected information, and selecting a first set of instructions corresponding to the request type. The instructions include constructing a first result by executing each instruction of the first set of instructions to create the first entry as a nested entry within the first result including data of the first set of collected information identified in the first set of instructions as nested or retrieve first data of the first set of collected information identified by the first instruction and add the first data to the first entry of the first result. The instructions include transforming a display of the operator device to complete a set of fields displayed on the display with corresponding entries of the first result.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: January 10, 2023
    Assignee: TD Ameritrade IP Company, Inc.
    Inventors: Sean William Watts, Igor Vornovitskiy, IV
  • Patent number: 11494867
    Abstract: An apparatus to facilitate asynchronous execution at a processing unit. The apparatus includes one or more processors to detect independent task passes that may be executed out of order in a pipeline of the processing unit, schedule a first set of processing tasks to be executed at a first set of processing elements at the processing unit and schedule a second set of tasks to be executed at a second set of processing elements, wherein execution of the first set of tasks at the first set of processing elements is to be performed simultaneously and in parallel to execution of the second set of tasks at the second set of processing elements.
    Type: Grant
    Filed: December 8, 2020
    Date of Patent: November 8, 2022
    Assignee: Intel Corporation
    Inventors: Saurabh Sharma, Michael Apodaca, Aditya Navale, Travis Schluessler, Vamsee Vardhan Chivukula, Abhishek Venkatesh, Subramaniam Maiyuran
  • Patent number: 11467829
    Abstract: A method and apparatus are disclosed for finding overlong source code segments (e.g., methods) by evaluating input source code segments for a plurality of predetermined code metric values in order to identify candidate source code segments (e.g., non-autogenerated methods) which do not meet a first code metric value and to assess each candidate source code segment against a second code metric value to identify different sets of candidate source code segments (e.g., test methods and normal methods) so that each set of candidate source code segments may be assessed against a tailored set of code length thresholds to identify any overlong source code segment having a code length which meets or exceeds at least two of the tailored set of code length thresholds.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: October 11, 2022
    Assignee: DevFactory Innovations FZ-LLC
    Inventor: Aditya T. Kadam
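One way to picture the filtering described in the abstract above is shown below. The metric names, threshold values, and the dictionary layout of each method record are invented for illustration; the abstract only states that autogenerated segments are excluded, that test and normal methods get their own thresholds, and that a segment is overlong when it meets or exceeds at least two of them.

```python
# Hypothetical tailored thresholds; the values are placeholders, not from the patent.
THRESHOLDS = {
    "test": {"lines": 80, "statements": 60, "tokens": 600},
    "normal": {"lines": 40, "statements": 30, "tokens": 300},
}


def find_overlong(methods, thresholds=THRESHOLDS, min_exceeded=2):
    """Return names of methods whose length meets or exceeds at least `min_exceeded` thresholds."""
    overlong = []
    for method in methods:
        # First metric: skip autogenerated code entirely.
        if method["autogenerated"]:
            continue
        # Second metric: pick the threshold set for test or normal methods.
        limits = thresholds["test" if method["is_test"] else "normal"]
        exceeded = sum(method["metrics"][key] >= limit for key, limit in limits.items())
        if exceeded >= min_exceeded:
            overlong.append(method["name"])
    return overlong


methods = [
    {"name": "parse", "autogenerated": False, "is_test": False,
     "metrics": {"lines": 55, "statements": 35, "tokens": 200}},
    {"name": "helper", "autogenerated": False, "is_test": False,
     "metrics": {"lines": 45, "statements": 10, "tokens": 100}},
    {"name": "test_roundtrip", "autogenerated": False, "is_test": True,
     "metrics": {"lines": 70, "statements": 50, "tokens": 500}},
    {"name": "generated_stub", "autogenerated": True, "is_test": False,
     "metrics": {"lines": 500, "statements": 400, "tokens": 4000}},
]
overlong = find_overlong(methods)
```

Only `parse` is reported: `helper` crosses a single threshold, `test_roundtrip` stays under the looser test limits, and `generated_stub` is never assessed.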
  • Patent number: 11379198
    Abstract: A code base is parsed to identify methods encapsulated therein. Thereafter, a call graph is generated based on the parsing using a graph generation technique. The call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It can then be determined, based on one or more design patterns used to generate the code base, that at least a portion of the nodes of the generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected using a stitching algorithm to result in a modified call graph. Data characterizing the modified call graph can then be provided (e.g., displayed in a graphical user interface, stored in a database, loaded into memory, transmitted to a remote computing device, etc.).
    Type: Grant
    Filed: December 14, 2020
    Date of Patent: July 5, 2022
    Assignee: SAP SE
    Inventors: Amitabh Goswami, Amrit Shankar Dutta Dutta, Abhishek Hondad, Alok Kumar
  • Patent number: 11379349
    Abstract: Verifiable test case workflow is provided by creating a secure database for actions taken regarding a source file that is stored on a first computer; creating a test executable from one or more source files and storing it on the first computer; finalizing the source file for test on a second computer different from the first computer; hashing a test environment related to the source file and the second computer; and in response to determining that a version of the test executable provided to the second computer matches a version of the test executable provided to the secure database: executing the test executable on the second computer; hashing test results from testing the source file on the second computer; and adding the test executable as hashed and the test results as hashed to the secure database to actions already stored in the secure database.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: July 5, 2022
    Assignee: International Business Machines Corporation
    Inventors: Ann Barnette Umberhocker, Ariba Siddiqui, Sowmya Janakiraman, George Conerly Wilson
  • Patent number: 11368452
    Abstract: An analytics tool includes a network interface and an analytics engine. The network interface receives a request for job analytics of a job. The job comprises uploading a plurality of batches, each of the plurality of batches comprising a subset of information of a data table. A network node of a plurality of network nodes uploads a batch of the plurality of batches. The analytics engine determines the plurality of network nodes used to complete the job. The analytics engine retrieves network node data for each of the plurality of network nodes. The analytics engine generates the job analytics by aggregating the network node data for each of the plurality of network nodes.
    Type: Grant
    Filed: November 11, 2019
    Date of Patent: June 21, 2022
    Assignee: Bank of America Corporation
    Inventor: John Abraham
  • Patent number: 11321061
    Abstract: A method for using profiling to obtain application-specific, preferred parameter values for an application is disclosed. First, a parameter for which to obtain an application-specific value is identified. Code is then augmented for application-specific profiling of the parameter. The parameter is profiled and profile data is collected. The profile data is then analyzed to determine the application's preferred parameter value for the profile parameter.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: May 3, 2022
    Assignee: Google LLC
    Inventors: Teresa Louise Johnson, Xinliang David Li
  • Patent number: 11295262
    Abstract: A system for fully integrated predictive decision-making and simulation having a high-volume deep web scraper system, a data retrieval engine, a directed computational graph module, and a decision and action path simulation engine.
    Type: Grant
    Filed: October 30, 2020
    Date of Patent: April 5, 2022
    Assignee: QOMPLX, INC.
    Inventors: Jason Crabtree, Andrew Sellers
  • Patent number: 11288108
    Abstract: Techniques are provided for an automated method of adding out-of-bound access prevention in GPU kernels executed in a managed environment. In an embodiment, a system of computers compiles a GPU kernel code function that includes one or more array references that are memory address dependent. The system of computers compiles the kernel code function by generating a rewritten GPU kernel code module that includes, within the function signature of the rewritten GPU kernel code module, a respective array size parameter for each array reference of the one or more array references included in the GPU kernel code function. The system of computers further compiles the kernel code function by adding bounding protection instructions to the one or more potential out-of-bound access instructions in the rewritten GPU kernel code module. The potential out-of-bound access instructions comprise instructions that reference each respective array size parameter of the one or more array references.
    Type: Grant
    Filed: December 3, 2019
    Date of Patent: March 29, 2022
    Assignee: Oracle International Corporation
    Inventors: Alberto Parravicini, Davide Bartolini, Lukas Stadler, Arnaud Delamare
  • Patent number: 11265266
    Abstract: A non-transitory computer-readable recording medium is provided in which a port switching program for causing a computer to execute a process including: transmitting, in response to a mirror switching instruction that specifies a migration source port and a migration destination port, a first mirror switching notification to a virtual switch that has the migration destination port to request a change of mirror setting in the migration destination port; canceling mirror setting for a transmission packet to the migration destination port in the migration source port; and canceling mirror setting for a received packet from the migration destination port in the migration source port in response to a second mirror switching notification from the virtual switch, the second mirror switching notification indicating the change of the mirror setting in the migration destination port is stored.
    Type: Grant
    Filed: June 21, 2019
    Date of Patent: March 1, 2022
    Assignee: FUJITSU LIMITED
    Inventor: Kazuhiro Suzuki
  • Patent number: 11243806
    Abstract: A scheduling method of a system on chip including a multi-core processor includes receiving a schedule-requested task, converting a priority assigned to the schedule-requested task into a linear priority weight, selecting a plurality of candidate cores, to which the schedule-requested task will be assigned, from among cores of the multi-core processor, calculating a preemption compare index indicating a current load state of each of the plurality of candidate cores, comparing the linear priority weight with the preemption compare index of the each of the plurality of candidate cores to generate a comparison result, and assigning the schedule-requested task to one candidate core of the plurality of candidate cores depending on the comparison result.
    Type: Grant
    Filed: July 22, 2019
    Date of Patent: February 8, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jong-Lae Park, Soohyun Kim, Youngtae Lee, Byung-Soo Kwon
  • Patent number: 11240305
    Abstract: In one example, a processor may receive a first request to process a first task, the first request including a first estimated central processing unit utilization for the first task and a first estimated memory utilization for the first task and receive central processing unit capacities and memory capacities of a plurality of sub-data routers including at least a first sub-data router. The processor may further determine that the first sub-data router has a lowest central processing unit capacity from among the plurality of sub-data routers that is sufficient to accommodate the first estimated central processing unit utilization for the first task and determine that the first sub-data router has a memory capacity that is sufficient to accommodate the first estimated memory utilization for the first task. The processor may then assign the first task to the first sub-data router.
    Type: Grant
    Filed: July 28, 2016
    Date of Patent: February 1, 2022
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Sheldon Kent Meredith, William Cottrill, Juliette Zerick
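The assignment rule in the abstract of patent 11240305 above resembles a best-fit placement. The following is a minimal sketch under stated assumptions, not the patented implementation: the `Router` class, the `assign_task` function, and the unitless capacity numbers are all hypothetical names chosen for illustration. The abstract does not say what happens when the lowest-CPU candidate lacks memory; this sketch assumes the next-smallest candidate is tried.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Router:
    """Hypothetical stand-in for a sub-data router and its spare capacity."""
    name: str
    cpu_capacity: float  # available CPU, arbitrary units (assumption)
    mem_capacity: float  # available memory, arbitrary units (assumption)


def assign_task(routers: Sequence[Router], est_cpu: float,
                est_mem: float) -> Optional[Router]:
    """Pick the router with the lowest CPU capacity that still fits the task.

    Returns None when no router can accommodate both estimates.
    """
    # Keep only routers that can hold both the CPU and memory estimates.
    fitting = [r for r in routers
               if r.cpu_capacity >= est_cpu and r.mem_capacity >= est_mem]
    # Best fit: the smallest sufficient CPU capacity wins.
    return min(fitting, key=lambda r: r.cpu_capacity, default=None)
```

Choosing the smallest sufficient capacity leaves the larger routers free for tasks with larger estimates, which is the usual motivation for best-fit placement.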
  • Patent number: 11231962
    Abstract: With the success of programming models such as OpenCL and CUDA, heterogeneous computing platforms are becoming mainstream. However, these heterogeneous systems are low-level, not composable, and their behavior is often implementation defined even for standardized programming models. In contrast, the method and system embodiments for the heterogeneous parallel primitives (HPP) programming model disclosed herein provide a flexible and composable programming platform that guarantees behavior even in the case of developing high-performance code.
    Type: Grant
    Filed: October 30, 2017
    Date of Patent: January 25, 2022
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Benedict R. Gaster, Lee W. Howes
  • Patent number: 11222072
    Abstract: A graph database management system includes a computing system in communication with a distributed computing environment comprising a plurality of elements and a database that stores element records associated with corresponding elements of the distributed computing environment. The computing system generates a graph database having a plurality of vertices representing the element records of the distributed computing environment and at least one edge representing a specified relationship between at least one pair of the element records. Thereafter, the computing system may receive a request to view the vertices associated with the at least one pair of element records and their associated edge, and facilitate the display of the vertices and their associated edge on a display in response to the request.
    Type: Grant
    Filed: July 17, 2015
    Date of Patent: January 11, 2022
    Assignee: EMC IP Holding Company LLC
    Inventor: Geoffrey D. Bourne
  • Patent number: 11144840
    Abstract: An approach is provided for completing a decision logic. For statements in a syntax tree of the decision logic and using a symbolic execution technique, path expression(s) that refer to respective input object(s) are identified. A statement in the decision logic is detected that modifies an attribute value of a path expression included in the path expression(s) and that refers to an input object included in the input object(s). A copy instruction is inserted as a new node in the syntax tree so that the attribute value of the path expression is a copy of the input object. Responsive to inserting the copy instruction, the path expression is prevented from modifying the input object.
    Type: Grant
    Filed: July 26, 2018
    Date of Patent: October 12, 2021
    Assignee: International Business Machines Corporation
    Inventors: Jean-Michel G. B. Bernelas, Ulrich M. Junker, Remi Van Keisbelck
  • Patent number: 11113064
    Abstract: A processor core receives a request to execute application code including a trigger instruction and an instruction block that reads a row of data values from a data structure and outputs a data value from a function using the row as input. The data structure is divided into multiple portions and the trigger instruction indicates that multiple instances of the instruction block are to be executed concurrently. In response to the request and to identification of the instruction block and trigger instruction, the processor core generates multiple instances of a support block that causes independent repetitive execution of each instance of the instruction block until all rows of the corresponding portion of the data structure are used as input. The processor core assigns instances of the instruction and support blocks to multiple processor cores, and provides each instance of the instruction block with the corresponding portion of the data structure.
    Type: Grant
    Filed: November 27, 2020
    Date of Patent: September 7, 2021
    Assignee: SAS INSTITUTE INC.
    Inventors: Jack Joseph Rouse, Robert William Pratt, Jared Carl Erickson, Manoj Keshavmurthi Chari
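The row-partitioned execution described in the abstract of patent 11113064 above can be sketched roughly as follows. This is only an illustration under assumptions: Python threads stand in for processor cores, and `split`, `support_block`, and `run_concurrently` are hypothetical names, not terms defined by the patent beyond the "instruction block" and "support block" roles it mentions.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, List, Sequence


def split(rows: Sequence[Any], parts: int) -> List[Sequence[Any]]:
    """Divide rows into `parts` contiguous portions of near-equal size."""
    size, extra = divmod(len(rows), parts)
    portions, start = [], 0
    for i in range(parts):
        end = start + size + (1 if i < extra else 0)
        portions.append(rows[start:end])
        start = end
    return portions


def support_block(instruction_block: Callable[[Any], Any],
                  portion: Sequence[Any]) -> List[Any]:
    """Repeat the instruction block until every row of the portion is used."""
    return [instruction_block(row) for row in portion]


def run_concurrently(instruction_block: Callable[[Any], Any],
                     rows: Sequence[Any], workers: int) -> List[Any]:
    """Run one support-block instance per portion, one portion per worker."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda p: support_block(instruction_block, p),
                           split(rows, workers))
    # pool.map yields results in submission order, so row order is preserved.
    return [value for portion in results for value in portion]
```

For example, `run_concurrently(sum, rows, 2)` would apply `sum` to each row, with the first half of the rows handled by one worker and the second half by another.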
  • Patent number: 11106672
    Abstract: A system includes a database client, and a distributed database comprising database nodes. The distributed database may receive a database query from the client, determine that the query comprises a range of hash values of a table partition stored by a node of the distributed database, and determine that the range of hash values is not stored by other nodes of the distributed database. Responsive to determining that the range of hash values of the query is stored by the node and not by the other nodes, the database may generate an optimized distributed execution plan that includes the node that stores the range of hash values and excludes the nodes that do not include the range of hash values.
    Type: Grant
    Filed: September 25, 2015
    Date of Patent: August 31, 2021
    Assignee: MICRO FOCUS LLC
    Inventors: Rui Liu, Qiming Chen, Jeff Lefevre, Malu G. Castellanos, Meichun Hsu
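The node-pruning idea in the abstract of patent 11106672 above can be illustrated with a small sketch. It assumes, purely for illustration, that each node owns one or more inclusive integer hash ranges and that a query carries a single inclusive range; `plan_nodes` and `overlaps` are hypothetical helper names.

```python
from typing import Dict, List, Tuple

HashRange = Tuple[int, int]  # inclusive (low, high) bounds, an assumption


def overlaps(a: HashRange, b: HashRange) -> bool:
    """True when two inclusive ranges share at least one hash value."""
    return a[0] <= b[1] and b[0] <= a[1]


def plan_nodes(node_ranges: Dict[str, List[HashRange]],
               query_range: HashRange) -> List[str]:
    """Return only the nodes that store some part of the queried hash range.

    Nodes whose ranges do not intersect the query are left out of the plan.
    """
    return sorted(
        node for node, ranges in node_ranges.items()
        if any(overlaps(r, query_range) for r in ranges)
    )
```

With three nodes owning `(0, 99)`, `(100, 199)`, and `(200, 299)`, a query over `(120, 150)` would produce a plan that touches only the second node.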
  • Patent number: 11093281
    Abstract: An information processing apparatus determines computer resources to be allocated to each task execution entity, based on upper limit value information and processing amount information. The upper limit value information indicates an upper limit value of the total amount of computer resources to be allocated to all task execution entities. The processing amount information indicates an amount of tasks to be processed by each task execution entity.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: August 17, 2021
    Assignee: NEC CORPORATION
    Inventor: Takashi Yagi
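The abstract of patent 11093281 above does not state how resources are derived from the upper limit and the processing amounts. One plausible reading, shown here strictly as an assumption, is a proportional split that never exceeds the limit. The function name `allocate` and the integer resource units are hypothetical.

```python
from typing import Dict


def allocate(upper_limit: int, pending: Dict[str, int]) -> Dict[str, int]:
    """Split `upper_limit` resource units in proportion to pending work.

    Proportional sharing is an assumption; the abstract only says that the
    allocation is based on the upper limit and the processing amounts.
    """
    total = sum(pending.values())
    if total == 0:
        return {name: 0 for name in pending}
    # Floor of each proportional share, so the sum cannot exceed the limit.
    shares = {name: upper_limit * amount // total
              for name, amount in pending.items()}
    leftover = upper_limit - sum(shares.values())
    # Give the remaining units to the largest fractional remainders.
    by_remainder = sorted(pending,
                          key=lambda n: (upper_limit * pending[n]) % total,
                          reverse=True)
    for name in by_remainder[:leftover]:
        shares[name] += 1
    return shares
```

The largest-remainder step is a design choice that keeps allocations integral while using the full budget; other rounding rules would fit the abstract equally well.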
  • Patent number: 10990587
    Abstract: A system and method of storing and analyzing information is disclosed. The system includes a compiler layer to convert user queries to data parallel executable code. The system further includes a library of multithreaded algorithms, processes, and data structures. The system also includes a multithreaded runtime library for implementing compiled code at runtime. The executable code is dynamically loaded on computing elements and contains calls to the library of multithreaded algorithms, processes, and data structures and the multithreaded runtime library.
    Type: Grant
    Filed: October 22, 2018
    Date of Patent: April 27, 2021
    Assignee: Battelle Memorial Institute
    Inventors: John T. Feo, David J. Haglin, Alessandro Morari, Antonino Tumeo, Oreste Villa, Jesse R. Weaver
  • Patent number: 10963229
    Abstract: The present invention provides a joint compilation method and system for a heterogeneous hardware architecture. The method comprises steps of: determining, according to calculation characteristics of heterogeneous units in the hardware architecture, a strategy for dividing an overall calculation task graph into a plurality of subtasks, and allocating the plurality of divided subtasks to corresponding heterogeneous unit compilers for compilation to generate corresponding target machine instruction codes; and, linking the generated target machine instruction codes to form a set of machine instruction codes oriented to the heterogeneous hardware architecture. With the joint compilation method and system of the present invention, an executable program body, which can run on a heterogeneous hardware architecture system and be mixed with hardware machine instruction codes of various heterogeneous units at different levels, can be automatically compiled, optimized and generated by activating one compilation.
    Type: Grant
    Filed: July 26, 2019
    Date of Patent: March 30, 2021
    Assignee: SHANGHAI DENGLIN TECHNOLOGIES CO., LTD
    Inventors: Chenhui Wang, Fan Peng, Xiaoquan Li, Can Li, Ping Wang
  • Patent number: 10949182
    Abstract: Systems and methods generate code from a source program where the generated code may be compiled and executed on a Graphics Processing Unit (GPU). A parallel loop analysis check may be performed on regions of the source program identified for parallelization. One or more optimizations also may be applied to the source program that convert mathematical operations into a parallel form. The source program may be partitioned into segments for execution on a host and a device. Kernels may be created for the segments to be executed on the device. The size of the kernels may be determined, and memory transfers between the host and device may be optimized.
    Type: Grant
    Filed: November 17, 2017
    Date of Patent: March 16, 2021
    Assignee: The MathWorks, Inc.
    Inventors: Girish Venkataramani, Rama P. Kokku, Jayaprabha Shankar, James L. Brock, Chun-Yu Shei, Vijaya Raghavan
  • Patent number: 10929191
    Abstract: This disclosure relates to model loading. In one aspect, a method includes determining, based on a preset execution script and resource information of multiple execution nodes, loading-tasks corresponding to the execution nodes. Each execution node is deployed on a corresponding cluster node. Loading requests are sent to the execution nodes, thereby causing the execution nodes to start execution processes based on the corresponding loading requests. The execution processes start multiple model service frameworks on each cluster node. Multiple models are loaded onto each of the model service frameworks. Each loading request includes loading-tasks corresponding to the execution node to which the loading request was sent. The execution processes include a respective execution process for each model service framework.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: February 23, 2021
    Assignee: Advanced New Technologies Co., Ltd.
    Inventors: Yueming Wang, Jiliang Li
  • Patent number: 10929161
    Abstract: A method, computer program product, and system include a processor(s) obtaining, during runtime, from a compiler, two versions of a data parallel loop for an operation. The host computing system includes a CPU, and a GPU is accessible to the host. The processor(s) online profiles the two versions by asynchronously executing the first version, in a profile mode, with the GPU and executing the second version, in the profile mode, with the CPU. The processor(s) generates execution times for the first version and the second version. The processor(s) stores the execution times and performance data in a storage, where the performance data comprises a size of the data parallel loop for the operation. The processor(s) updates a regression model(s) to predict performance numbers for a process of an unknown loop size. The processor(s) executes the operation with the CPU or the GPU based on the performance data.
    Type: Grant
    Filed: August 27, 2019
    Date of Patent: February 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Gita Koblents, Alon Shalev Housfater, Kazuaki Ishizaki, Akihiro Hayashi
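The profile-then-predict flow in the abstract of patent 10929161 above can be sketched as follows. The abstract does not name the regression model, so this sketch assumes a simple least-squares line of execution time against loop size for each device; `fit_line` and `choose_device` are hypothetical names, and any timing samples passed in are the caller's own measurements.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (loop size, measured execution time)


def fit_line(samples: List[Sample]) -> Tuple[float, float]:
    """Least-squares fit of time = slope * size + intercept."""
    n = len(samples)
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in samples)
    if var_x == 0:
        # All samples share one size: fall back to a constant prediction.
        return 0.0, mean_y
    slope = sum((x - mean_x) * (y - mean_y) for x, y in samples) / var_x
    return slope, mean_y - slope * mean_x


def choose_device(cpu_samples: List[Sample], gpu_samples: List[Sample],
                  loop_size: float) -> str:
    """Predict both execution times for `loop_size` and pick the faster device."""
    cpu_slope, cpu_intercept = fit_line(cpu_samples)
    gpu_slope, gpu_intercept = fit_line(gpu_samples)
    cpu_time = cpu_slope * loop_size + cpu_intercept
    gpu_time = gpu_slope * loop_size + gpu_intercept
    return "GPU" if gpu_time < cpu_time else "CPU"
```

A linear model captures the common pattern where the GPU has a higher fixed launch cost but a lower per-iteration cost, so small loops tend to stay on the CPU and large ones move to the GPU.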
  • Patent number: 10901777
    Abstract: Techniques are disclosed relating to context switching using distributed compute workload parsers. In some embodiments, an apparatus includes a plurality of shader units configured to perform operations for compute workgroups included in compute kernels, a plurality of distributed workload parser circuits each configured to dispatch workgroups to a respective set of the shader units, a communications fabric, and a master workload parser circuit configured to communicate with the distributed workload parser circuits via the communications fabric. In some embodiments, the master workload parser circuit maintains a first set of master state information that does not change for a compute kernel based on operations by the shader units and a second set of master state information that may be changed by operations specified by the kernel. In some embodiments, the master workload parser circuit performs a multi-phase state storage process in communication with the distributed workload parser circuits.
    Type: Grant
    Filed: September 26, 2018
    Date of Patent: January 26, 2021
    Assignee: Apple Inc.
    Inventors: Andrew M. Havlir, Jeffrey T. Brady
  • Patent number: 10887235
    Abstract: A programming model provides a method for executing a program in a distributed architecture. One or more first shards of the distributed architecture execute one or more operations and send tuples to at least one second shard, the tuples being part of a stream and being based on the one or more operations. The one or more first shards send a token value to the at least one second shard when the sending of the tuples in the stream is complete. The at least one second shard determines whether a total of the token values matches a number of the one or more first shards, and takes a first action in response to determining that the total of the token values matches the number of the one or more first shards. The first action may include marking the stream as being complete and/or generating a message indicating that the stream is complete.
    Type: Grant
    Filed: August 24, 2017
    Date of Patent: January 5, 2021
    Assignee: Google LLC
    Inventors: Gautham Thambidorai, Matthew Rosencrantz, Sanjay Ghemawat, Srdjan Petrovic, Ivan Posva
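The token-counting completion check in the abstract of patent 10887235 above can be illustrated with a minimal sketch. It assumes that each sending shard contributes a token value of 1 when it finishes; the class `ReceivingShard` and its method names are hypothetical.

```python
from typing import Any, List, Tuple


class ReceivingShard:
    """Hypothetical second shard that collects tuples and end-of-stream tokens."""

    def __init__(self, num_senders: int) -> None:
        self.num_senders = num_senders  # number of first shards feeding the stream
        self.token_total = 0
        self.tuples: List[Tuple[Any, ...]] = []
        self.complete = False

    def on_tuple(self, item: Tuple[Any, ...]) -> None:
        """Store one tuple of the stream."""
        self.tuples.append(item)

    def on_token(self, value: int = 1) -> None:
        """Add a sender's token; mark the stream complete once totals match."""
        self.token_total += value
        if self.token_total == self.num_senders:
            self.complete = True
```

Counting tokens lets the receiver detect the end of the stream without knowing which sender finishes last or in what order tuples arrive.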
  • Patent number: 10886740
    Abstract: Provided is an emergency accurate control method and system for large-scale interruptible loads. The method includes: acquiring, by a region control master station, a sheddable load sequence table; acquiring, by the region control master station, a first to-be-shed load; performing, by the region control master station, minimum under-shedding matching layer by layer according to the first to-be-shed load, and shedding a sheddable load corresponding to a control substation matching the first to-be-shed load; and sending, by the region control master station, a second to-be-shed load to the corresponding control substation for load shedding if the second to-be-shed load exists.
    Type: Grant
    Filed: May 10, 2019
    Date of Patent: January 5, 2021
    Assignees: State Grid Jiangsu Electric Power Co., Ltd., Nari Technology Co., Ltd.
    Inventors: Jijun Yin, Qing Chen, Gang Chen, Xiao Lu, Jianyu Luo, Haifeng Li, Xueming Li, Kaiming Luo, Lin Liu, Yunsong Yan, Yefeng Jiang, Jianfeng Ren, Haifeng Xia
  • Patent number: 10853079
    Abstract: A method and computer program product for performing a plurality of processing operations. A plurality of processor nodes each include one or more operational instances. Each processor node includes criteria for generating its operational instances. The processor nodes are linked together in a directed acyclic processing graph in which dependent nodes use data from the operational instances of upstream nodes to perform a node-specific set of processing operations. Dependency relationships between the processor nodes are defined on an operational instance basis, where operational instances in dependent processor nodes identify data associated with, or generated by, specific upstream operational instances that is used to perform the node-specific set of operations for that dependent operational instance. The processing graph may also include connector nodes defining instance-level dependency relationships between processor nodes.
    Type: Grant
    Filed: September 19, 2019
    Date of Patent: December 1, 2020
    Assignee: Side Effects Software Inc.
    Inventors: Ken Xu, Taylor James Petrick