Instruction Alignment Patents (Class 712/204)
-
Patent number: 12144055Abstract: A method of operating a terminal device (101) includes configuring a data connection (189) on a wireless link (114) between the terminal device (101) and a network (100) based on a first control signal (4001-4003) native to a first layer (255) of a transmission protocol stack (250) associated with the wireless link (114). The method also includes participating in a data communication via the data connection (189). The method also includes receiving a second control signal (205) native to a second layer (251) of the transmission protocol stack (250), the second layer (251) being arranged lower in hierarchy of the transmission protocol stack (250) than the first layer (255). The method also includes in response to receiving the second control signal (205): activating a discontinuous reception (390) for the data communication.Type: GrantFiled: May 2, 2019Date of Patent: November 12, 2024Assignee: Sony Group CorporationInventors: Basuki Priyanto, Nafiseh Mazloum, Anders Berggren, Rickard Ljung
-
Patent number: 11743128Abstract: Aspects described herein relate to determining a multicast broadcast (MB) bandwidth size for receiving an MB service, the MB bandwidth being different than each bandwidth size in the set of multiple configurable bandwidth sizes for receiving unicast services.Type: GrantFiled: February 3, 2021Date of Patent: August 29, 2023Assignee: QUALCOMM IncorporatedInventors: Alberto Rico Alvarino, Umesh Phuyal, Ayan Sengupta
-
Patent number: 11671901Abstract: Some of the present implementations provide a method for assembling a target SIB for a target service. The method receives, from a first cell on a first frequency carrier, a plurality of SIB segments of the target SIB, each of the plurality of SIB segments associated with a corresponding value tag. The method stores a first SIB segment in the plurality of SIB segments in a memory of the UE. For each subsequent SIB segment in the plurality of SIB segments, the method determines whether a corresponding value tag of the subsequent SIB segment is the same as the corresponding value tag of the first SIB segment and if they are the same, stores the subsequent SIB segment in the memory of the UE. The method then assembles the target SIB using the stored plurality of SIB segments.Type: GrantFiled: May 28, 2021Date of Patent: June 6, 2023Assignee: FG Innovation Company LimitedInventors: Yung-Lan Tseng, Hung-Chen Chen, Mei-Ju Shih
-
Patent number: 11650825Abstract: An instruction set architecture including instructions for a processor and instructions for a coprocessor may include synchronizing instructions that may be used to begin and end instruction sequences that include coprocessor instructions (coprocessor sequences). If a terminating synchronizing instruction is followed by an initial synchronizing instruction and the pair are detected in the coprocessor concurrently, the coprocessor may suppress execution of the pair of instructions.Type: GrantFiled: February 10, 2022Date of Patent: May 16, 2023Assignee: Apple Inc.Inventors: Aditya Kesiraju, Rajdeep L. Bhuyar, Ran A. Chachick, Andrew J. Beaumont-Smith
-
Patent number: 11609785Abstract: Systems, apparatuses, and methods for efficient parallel execution of multiple work units in a processor by reducing a number of memory accesses are disclosed. A computing system includes a processor core with a parallel data architecture. The processor core executes a software application with matrix operations. The processor core supports the broadcast of shared data to multiple compute units of the processor core. A compiler or other code assigns thread groups to compute units based on detecting shared data among the compute units. Rather than send multiple read accesses to a memory subsystem for the shared data, the processor core generates a single access request. The single access request includes information to identify the multiple compute units for receiving the shared data when broadcasted by the processor core.Type: GrantFiled: December 30, 2019Date of Patent: March 21, 2023Assignee: Advanced Micro Devices, Inc.Inventors: Li Peng, Jian Yang, Chi Tang
-
Patent number: 11582749Abstract: Techniques for transmitting and receiving wireless communications over an unlicensed radio frequency spectrum band are disclosed, including techniques for transmitting and receiving service information blocks over the unlicensed radio frequency spectrum band, techniques for gaining access to the unlicensed radio frequency spectrum band by performing extended clear channel assessments (eCCAs), techniques for transmitting and receiving synchronization signals and reference signals over the unlicensed radio frequency spectrum band, techniques for communicating locations of reference signals, and techniques for communicating availability of certain resources to be combined across multiple different transmissions.Type: GrantFiled: July 1, 2015Date of Patent: February 14, 2023Assignee: QUALCOMM IncorporatedInventors: Durga Prasad Malladi, Tao Luo, Aleksandar Damnjanovic, Yongbin Wei, Madhavan Srinivasan Vajapeyam, Wanshi Chen
-
Patent number: 11500643Abstract: In one embodiment, a microprocessor, comprising: a branch prediction table comprising plural entries, wherein at least a portion of the plural entries corresponds to an indirect branch type; and an indirect valid table; wherein based on an indirect branch instruction fetch, an entry corresponding to an indirect branch instruction in the branch prediction table is configured as invalid based on clearing a corresponding entry in the indirect valid table.Type: GrantFiled: November 13, 2020Date of Patent: November 15, 2022Assignee: CENTAUR TECHNOLOGY, INC.Inventors: Thomas C. McDonald, Timothy Jon Sulzbach
-
Patent number: 11429507Abstract: A system and method determines a unique performance benchmark for specific computer object code for a particular microprocessor. By generating multiple unique benchmarks for a single, same code module on multiple different processors, the method determines which processor is optimal for the code module. By generating for a single designated processor a performance benchmark for each code modules of multiple modules, where the multiple modules have a same/similar functionality but variations in detailed code or algorithms, the system and method identifies code variation(s) which is/are optimal for the single designated processor. The system and method may entail first extracting selected features of object code (as actually executed) into a code profile, and then generating the performance benchmark based on the code profile and in machine-level timing data for the selected microprocessor. In this way, code security is achieved by fire-walling the object code from the second stage of the method.Type: GrantFiled: June 10, 2020Date of Patent: August 30, 2022Assignee: General Electric CompanyInventors: Andrea M. Schmitz, Andrew W. Berner, Matthew B. Pfenninger, Jeffrey S. Gilton
-
Patent number: 11337142Abstract: Techniques for receiving repeatedly transmitted broadcast information appropriately are disclosed. One aspect of the present invention relates to a base station including a communication control unit configured to control radio communication with user equipment and a broadcast information transmission unit configured to transmit broadcast information, wherein the broadcast information transmission unit includes scheduling information for second system information in first system information and broadcasts the second system information in accordance with the scheduling information.Type: GrantFiled: May 12, 2016Date of Patent: May 17, 2022Assignee: NTT DOCOMO, INC.Inventors: Hideaki Takahashi, Kazuaki Takeda, Wuri Andarmawanti Hapsari
-
Patent number: 11249763Abstract: An arithmetic processing unit includes an instruction decoder which decodes a fetch instruction to issue an execution instruction; a reservation station which temporarily stores the execution instruction; and an arithmetic unit which executes the execution instruction, and the fetch instruction includes a multi-flow instruction which is divided into divided instructions and a single instruction. The instruction decoder includes: a pre-decoder including N number of slots each of which divides the multi-flow instruction into divided instructions; a main decoder including N number of slots each of which decodes the instructions to issue an execution instruction; and a pre-decoder buffer including N?K number of slots each of which temporarily stores instructions in the pre-decoder. The instruction decoder repeats transferring the divided instructions and the single instructions from the slots of the pre-decoder and the slots of the pre-decoder buffer to the main decoder as much as possible in order.Type: GrantFiled: February 4, 2019Date of Patent: February 15, 2022Assignee: FUJITSU LIMITEDInventors: Ryohei Okazaki, Yasunobu Akizuki
-
Patent number: 11200055Abstract: Embodiments detailed herein relate to matrix operations. In particular, support for matrix (tile) addition, subtraction, and multiplication is described. For example, circuitry to support instructions for element-by-element matrix (tile) addition, subtraction, and multiplication are detailed. In some embodiments, for matrix (tile) addition, decode circuitry is to decode an instruction having fields for an opcode, a first source matrix operand identifier, a second source matrix operand identifier, and a destination matrix operand identifier; and execution circuitry is to execute the decoded instruction to, for each data element position of the identified first source matrix operand: add a first data value at that data element position to a second data value at a corresponding data element position of the identified second source matrix operand, and store a result of the addition into a corresponding data element position of the identified destination matrix operand.Type: GrantFiled: July 1, 2017Date of Patent: December 14, 2021Assignee: Intel CorporationInventors: Robert Valentine, Dan Baum, Zeev Sperber, Jesus Corbal, Elmoustapha Ould-Ahmed-Vall, Bret L. Toll, Mark J. Charney, Barukh Ziv, Alexander Heinecke, Milind Girkar, Simon Rubanovich
-
Patent number: 11080057Abstract: A processing device includes an instruction decode circuit including decoders that decode instructions respectively assigned an instruction number that is determined for every one of the decoders, an instruction execution circuit that executes the instructions decoded by the instruction decode circuit, an instruction complete holding circuit including hold blocks provided in correspondence with each of the decoders and respectively including hold regions assigned the instruction number, and used for an instruction complete process, and an instruction complete controller that stores instruction information that is generated by decoding the instructions by the decoders, in one of the hold regions of the hold block corresponding to the decoder that decodes the instruction, based on the instruction number, and obtain, in order, the instruction information corresponding to the instructions executed by the instruction execution circuit from the instruction complete holding circuit, to perform the instruction compleType: GrantFiled: November 4, 2019Date of Patent: August 3, 2021Assignee: FUJITSU LIMITEDInventor: Atushi Fusejima
-
Patent number: 11071044Abstract: A mobile telecommunications system entity for a mobile telecommunications system has at least one entity, which serves at least one user equipment. The mobile telecommunications system entity has circuitry which is configured to transmit scheduling information for on-demand system information, which can be requested by the at least one user equipment.Type: GrantFiled: October 26, 2017Date of Patent: July 20, 2021Assignee: SONY CORPORATIONInventors: Yuxin Wei, Brian Alexander Martin, Vivek Sharma, Hideji Wakabayashi, Shinichiro Tsuda
-
Patent number: 11036545Abstract: Accelerated synchronization operations using fine grain dependency check are disclosed. A graphics multiprocessor includes a plurality of execution units and synchronization circuitry that is configured to determine availability of at least one execution unit. The synchronization circuitry to perform a fine grain dependency check of availability of dependent data or operands in shared local memory or cache when at least one execution unit is available.Type: GrantFiled: March 15, 2019Date of Patent: June 15, 2021Assignee: Intel CorporationInventors: Subramaniam Maiyuran, Varghese George, Altug Koker, Aravindh Anantaraman, SungYe Kim, Valentin Andrei, Joydeep Ray
-
Patent number: 11036506Abstract: Some example memory systems include a load and store unit (LSU) operable to load a memory reference. The LSU may include an alignment register, a current memory reference register, and a vector register. The memory system may include a memory coupled to the LSU. The memory may be operable to store a memory reference. The memory reference may be aligned or unaligned in the memory, and the LSU may be operable to efficiently load both unaligned and aligned memory references. Some example memory systems include a load and store unit (LSU) operable to store to the memory at a memory address. The LSU may be operable to efficiently store to both unaligned and aligned memory addresses. The LSU may perform loads and stores in forward and reverse stride.Type: GrantFiled: December 11, 2019Date of Patent: June 15, 2021Assignee: Motorola Solutions, Inc.Inventors: Jerry Redington, Johnny R. Ferreira, Charles R. Ruelke
-
Patent number: 10963186Abstract: Implementations disclosed herein provide a method of receiving a command from a host, the command providing a starting logical block address (LBA) and a length of the command, generating a multiplicity bit mask (MBM) for the command in response to receiving a command from a host, and storing the MBM to an MBM table in a row corresponding to a stream that the command is part of.Type: GrantFiled: March 21, 2019Date of Patent: March 30, 2021Assignee: SEAGATE TECHNOLOGY LLCInventors: Andi Sumaryo Sutiawan, Brandon Mun Hon Yuen, Xu Huang
-
Patent number: 10956439Abstract: Examples of the present disclosure provide apparatuses and methods for direct data transfer. An example method comprises transferring data between a first device and a second device, wherein the first device is a bit vector operation device, and transforming the data using a data transform engine (DTE) by rearranging the data to enable the data to be stored on the first device when transferring the data between the second device and first memory device.Type: GrantFiled: February 19, 2016Date of Patent: March 23, 2021Assignee: Micron Technology, Inc.Inventors: Isom Crawford, Jr., Graham Kirsch, John D. Leidel
-
Patent number: 10771931Abstract: Provided are a method for a terminal for requesting a system information block (SIB), and an apparatus for supporting the method in a wireless communication system. The method may comprise the steps of: receiving, from a radio access network (RAN), an SIB list comprising one or more SIBs supported by a cell; receiving, from the RAN, SIB broadcast information indicating whether the SIB supported by the cell is broadcast in a broadcast control channel (BCCH) section; detecting an omitted SIB on the basis of the SIB list and SIB broadcast information; and requesting the omitted SIB from the RAN.Type: GrantFiled: May 10, 2017Date of Patent: September 8, 2020Assignee: LG Electronics Inc.Inventors: Youngdae Lee, Sangwon Kim, Jaewook Lee
-
Patent number: 10747543Abstract: At least some instructions executed in a pipeline are each associated with corresponding trace information that characterizes execution of that instruction in the pipeline. A predetermined type of store instructions flow through a subset of contiguous stages of the pipeline. A signal is received to store a portion of the trace information. A stage before the subset of contiguous stages is stalled. A store instruction of the predetermined type is inserted into a stage at the beginning of the subset of contiguous stages to enable the store instruction to reach the memory access stage at which an operand of the store instruction including the portion of the trace information is sent out of the pipeline. The store instruction is filtered from a stage of the subset of contiguous stages that occurs earlier in the pipeline than a stage in which trace information is generated.Type: GrantFiled: December 28, 2018Date of Patent: August 18, 2020Assignee: Marvell Asia Pte, Ltd.Inventors: Gerald Lampert, Nitin Prakash, Shubhendu Sekhar Mukherjee, David Albert Carlson
-
Patent number: 10721108Abstract: Various communication systems may benefit from efficient communication of system information. For example, certain wireless communication systems may benefit from system information block enhancement for low complexity user equipment and/or user equipment in coverage enhancement mode. A method can include decoding a transport block size (TBS) index in a compact downlink control information. The method can also include monitoring for SIB based on the decoded TBS index. The method may optionally include monitoring for the SIB based on a predefined transmission pattern of physical downlink control channel for machine type communication. The method may also optionally include decoding of M-SI messages from a subframe according to a pattern indicated by an information element in M-SIB1.Type: GrantFiled: January 29, 2016Date of Patent: July 21, 2020Assignee: NOKIA SOLUTIONS AND NETWORKS OYInventors: Yan Ji Zhang, Rapeepat Ratasuk
-
Patent number: 10719322Abstract: A technique includes determining whether one or more instructions in an instruction group require cracking. Whether the instructions that require cracking are associated with a decode-time instruction optimization (DTIO) sequence is also determined. In response to a first instruction, included in the one or more instructions, requiring cracking and the first instruction not being part of a DTIO sequence, the first instruction is cracked into internal operations (IOPs). In response to a second instruction, included in the one or more instructions, requiring cracking and the second instruction being part of a DTIO sequence, an IOP sequence (that includes at least one IOP that is associated with at least a cracked version of the second instruction and at least a third instruction that is included in the one or more instructions and at least one other IOP that is associated with the cracked version of the second instruction) is generated.Type: GrantFiled: June 10, 2015Date of Patent: July 21, 2020Assignee: International Business Machines CorporationInventors: Michael K. Gschwind, Valentina Salapura
-
Patent number: 10671393Abstract: A technique includes determining whether one or more instructions in an instruction group require cracking. Whether the instructions that require cracking are associated with a decode-time instruction optimization (DTIO) sequence is also determined. In response to a first instruction, included in the one or more instructions, requiring cracking and the first instruction not being part of a DTIO sequence, the first instruction is cracked into internal operations (IOPs). In response to a second instruction, included in the one or more instructions, requiring cracking and the second instruction being part of a DTIO sequence, an IOP sequence (that includes at least one IOP that is associated with at least a cracked version of the second instruction and at least a third instruction that is included in the one or more instructions and at least one other IOP that is associated with the cracked version of the second instruction) is generated.Type: GrantFiled: April 24, 2015Date of Patent: June 2, 2020Assignee: International Business Machines CorporationInventors: Michael K. Gschwind, Valentina Salapura
-
Patent number: 10540512Abstract: A parallel processing method, system, and/or computer program product for performing data parallel wide accesses on an unstructured text is provided. The parallel processing includes creating a pointer that points to a beginning of the unstructured text and loading into a vector register a string segment of the unstructured text based on the pointer. Then, access permissions of a first byte of the string segment are automatically tested. In turn, a determination is made as to whether the string segment includes an end indication, and a remaining portion of the unstructured text is validated by accessing and loading a last character identified by the end indication into the vector register when the string segment is determined to include the end indication.Type: GrantFiled: September 29, 2015Date of Patent: January 21, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Michael K. Gschwind, Brett Olsson
-
Patent number: 10496407Abstract: An apparatus and method for performing addition of signed packed data values using rotation and halving.Type: GrantFiled: December 21, 2017Date of Patent: December 3, 2019Assignee: Intel CorporationInventors: Venkateswara Madduri, Elmoustapha Ould-Ahmed-Vall, Robert Valentine, Mark Charney, Jesus Corbal, Binwei Yang
-
Patent number: 10338927Abstract: A hardware/software co-design for an optimized dynamic out-of-order Very Long Instruction Word (VLIW) pipeline.Type: GrantFiled: April 3, 2017Date of Patent: July 2, 2019Assignee: Intel CorporationInventors: Denis M. Khartikov, Naveen Neelakantam, John H. Kelm, Polychronis Xekalakis
-
Patent number: 10333847Abstract: An embodiment of the invention includes receiving packet header vectors where a header vector includes a number of packet header words. Match operations are performed on the packet header words. At least one packet header word is modified based on the match operations. At least one processor is used for each packet header word to perform the packet match operations and modify at least one packet header word. Instructions are received from an instruction word where a VLIW instruction word includes all of the instruction words. Each processor performs an operation in response to the instruction word.Type: GrantFiled: May 23, 2018Date of Patent: June 25, 2019Assignee: TEXAS INSTRUMENTS INCORPORATEDInventors: Patrick W. Bosshart, Hun-Seok Kim
-
Patent number: 10275217Abstract: According to one general aspect, a load unit may include a load circuit configured to load at least one piece of data from a memory. The load unit may include an alignment circuit configured to align the data to generate an aligned data. The load unit may also include a mathematical operation execution circuit configured to generate a resultant of a predetermined mathematical operation with the at least one piece of data as an operand. Wherein the load unit is configured to, if an active instruction is associated with the predetermined mathematical operation, bypass the alignment circuit and input the piece of data directly to the mathematical operation execution circuit.Type: GrantFiled: June 2, 2017Date of Patent: April 30, 2019Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Rama S. Gopal, Paul E. Kitchin, Karthik Sundaram
-
Patent number: 10061705Abstract: A technique for processing instructions includes examining instructions in an instruction stream of a processor to determine properties of the instructions. The properties indicate whether the instructions may belong in an instruction sequence subject to decode-time instruction optimization (DTIO). Whether the properties of multiple ones of the instructions are compatible for inclusion within an instruction sequence of a same group is determined. The instructions with compatible ones of the properties are grouped into a first instruction group. The instructions of the first instruction group are decoded subsequent to formation of the first instruction group. Whether the first instruction group actually includes a DTIO sequence is verified based on the decoding. Based on the verifying, DTIO is performed on the instructions of the first instruction group or is not performed on the instructions of the first instruction group.Type: GrantFiled: June 9, 2015Date of Patent: August 28, 2018Assignee: International Business Machines CorporationInventors: Michael K. Gschwind, Valentina Salapura
-
Patent number: 10057849Abstract: A system for, and method of, reducing power consumed obtaining system information from a cell, the system information contained in at least a master information block, a scheduling information block and a system information block. In one embodiment, the system includes: (1) a broadcast control channel (BCCH) frame cache configured to buffer received BCCH frames bearing portions of the system information and (2) a system information verifier associated with the BCCH frame cache and configured to determine version consistency in the master information block and the scheduling information block by employing the check numbers associated therewith.Type: GrantFiled: June 2, 2015Date of Patent: August 21, 2018Assignee: Nvidia CorporationInventors: Timothy Rogers, Rene-Cedric Vanderbergh
-
Patent number: 10054943Abstract: A UAV has two rotors. First and second sensors sense a first and second type of input respectively. The second type of input is different than the first type, the first sensor providing a first sensor output and the second sensor providing a second sensor output. The first sensor output is input to a first computer and the second sensor output is input to a second computer. The first and second computer communicate in parallel to process the first and second sensor outputs to create a control signal having a predetermined number of variables therein, each variable having an exclusive position within the signal. The first computer outputs a first variable and the second computer outputs a second variable, each output being assigned an exclusive position within the control signal. At least one of the first and second computers outputting the control signal to the rotors.Type: GrantFiled: October 24, 2012Date of Patent: August 21, 2018Assignee: Hoverfly Technologies, Inc.Inventors: George Richard Sapp, II, Alfred D. Ducharme, Daniel Burroughs, Stacey L. Ducharme
-
Patent number: 10009276Abstract: An embodiment of the invention includes receiving packet header vectors where a header vector includes a number of packet header words. Match operations are performed on the packet header words. At least one packet header word is modified based on the match operations. At least one processor is used for each packet header word to perform the packet match operations and modify at least one packet header word. Instructions are received from an instruction word where a VLIW instruction word includes all of the instruction words. Each processor performs an operation in response to the instruction word.Type: GrantFiled: February 26, 2014Date of Patent: June 26, 2018Assignee: TEXAS INSTRUMENTS INCORPORATEDInventors: Patrick W. Bosshart, Hun-Seok Kim
-
Patent number: 9927787Abstract: Method and a system for managing distributing computing suitable for implementation in an automation system are provided, wherein in a first step, a set of program instructions is partitioned into a plurality of unit blocks, where each unit block comprises at least one program instruction and, in a second set, at least one complementary block corresponding to at least one unit block are identified from the remainder of the plurality of unit blocks, where the complementary blocks are identified based on a comparison between read-write access of global variables in the unit block and corresponding complementary blocks and, in another step, the plurality of unit blocks are executed on a set of multiple processors within the automation system such that at least one complementary block is executed in parallel with a corresponding unit block.Type: GrantFiled: March 19, 2014Date of Patent: March 27, 2018Assignee: Siemens AktiengesellschaftInventors: Elvis Antony, Grace Leelavathy
-
Patent number: 9910824Abstract: A computer processor is disclosed. The computer processor may comprise a vector unit comprising a vector register file comprising at least one register to hold a varying number of elements. The computer processor may further comprise processing logic configured to operate on the varying number of elements in the vector register file using one or more instructions that separate a vector or combine two vectors. The computer processor may be implemented as a monolithic integrated circuit.Type: GrantFiled: June 1, 2015Date of Patent: March 6, 2018Assignee: Optimum Semiconductor Technologies, Inc.Inventors: Mayan Moudgill, Gary J. Nacer, C. John Glossner, Arthur Joseph Hoane, Paul Hurtley, Murugappan Senthilvelan, Pablo Balzola
-
Patent number: 9910770Abstract: There is a need to provide a microcomputer capable of eliminating an external terminal for endian selection. Flash memory includes a user boot area for storing a program executed in user boot mode and corresponding endian information and a user area for storing a program executed in user mode and corresponding endian information. A data transfer circuit reads endian information stored in the user boot area or the user area in accordance with operation mode and supplies the endian information to a CPU before reset release of the CPU. Accordingly, an external terminal for endian selection can be eliminated.Type: GrantFiled: June 29, 2016Date of Patent: March 6, 2018Assignee: RENESAS ELECTRONICS CORPORATIONInventors: Mamoru Sakugawa, Tomohiro Sakurai, Katsuyoshi Watanabe, Seiji Ikari, Takashi Nasu, Tsutomu Kumagai
-
Patent number: 9891927Abstract: A microprocessor includes a plurality of processing cores and an uncore random access memory (RAM) readable and writable by each of the plurality of processing cores. Each core of the plurality of processing cores comprises microcode run by the core that implements architectural instructions of an instruction set architecture of the microprocessor. The microcode is configured to both read and write the uncore RAM to accomplish inter-core communication between the plurality of processing cores.Type: GrantFiled: May 19, 2014Date of Patent: February 13, 2018Assignee: VIA TECHNOLOGIES, INC.Inventors: G. Glenn Henry, Terry Parks, Rodney E. Hooker, Stephan Gaskins
-
Patent number: 9773534Abstract: A non-volatile memory accelerator and a method for speeding up data access are provided. The non-volatile memory accelerator includes a data pre-fetching unit, a cache unit, and an access interface circuit. The data pre-fetching unit has a plurality of line buffers. One of the line buffers provides read data according to a read command, or the data pre-fetching unit reads at least one cache data as the read data according to the read command. The data pre-fetching unit further stores in at least one of the line buffers a plurality of pre-stored data with continuous addresses according to the read command. The cache unit stores the at least one cache data and the pre-stored data with the continuous addresses. The access interface circuit is configured to be an interface circuit of the non-volatile memory.Type: GrantFiled: May 19, 2016Date of Patent: September 26, 2017Assignee: Faraday Technology Corp.Inventors: Kun-Chih Chen, Hsiao-An Chuang
-
Patent number: 9740488Abstract: Methods and apparatus are provided for optimizing a processor core. Common processor subcircuitry is used to perform calculations for various types of instructions, including branch and non-branch instructions. Increasing the commonality of calculations across different instruction types allows branch instructions to jump to byte aligned memory address even if supported instructions are multi-byte or word aligned.Type: GrantFiled: October 15, 2014Date of Patent: August 22, 2017Assignee: Altera CorporationInventor: James Loran Ball
-
Patent number: 9632778Abstract: Embodiments relate to packed loading and storing of data. An aspect includes a system for packed loading and storing of distributed data. The system includes memory and a processing element configured to communicate with the memory. The processing element is configured to perform a method including fetching and decoding an instruction for execution by the processing element. A plurality of individually addressable data elements is gathered from non-contiguous locations in the memory which are narrower than a nominal width of register file elements in the processing element based on the instruction. The processing element packs and loads the data elements into register file elements of a register file entry based on the instruction, such that at least two of the data elements gathered from the non-contiguous locations in the memory are packed and loaded into a single register file element of the register file entry.Type: GrantFiled: August 8, 2012Date of Patent: April 25, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Bruce M. Fleischer, Thomas W. Fox, Hans M. Jacobson, Jaime H. Moreno, Ravi Nair, Daniel A. Prener
-
Gather/scatter of multiple data elements with packed loading/storing into/from a register file entry
Patent number: 9632777Abstract: Embodiments relate to packed loading and storing of data. An aspect includes a method for packed loading and storing of data distributed in a system that includes memory and a processing element. The method includes fetching and decoding an instruction for execution by the processing element. The processing element gathers a plurality of individually addressable data elements from non-contiguous locations in the memory which are narrower than a nominal width of register file elements in the processing element based on the instruction. The data elements are packed and loaded into register file elements of a register file entry by the processing element based on the instruction, such that at least two of the data elements gathered from the non-contiguous locations in the memory are packed and loaded into a single register file element of the register file entry.Type: GrantFiled: August 3, 2012Date of Patent: April 25, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Bruce M. Fleischer, Thomas W. Fox, Hans M. Jacobson, Jaime H. Moreno, Ravi Nair, Daniel A. Prener -
Patent number: 9619226Abstract: Embodiments of systems, apparatuses, and methods for performing in a computer processor vector packed horizontal add or subtract of packed data elements in response to a single vector packed horizontal add or subtract instruction that includes a destination vector register operand, a source vector register operand, and an opcode are describes.Type: GrantFiled: December 23, 2011Date of Patent: April 11, 2017Assignee: Intel CorporationInventors: Mostafa Hagog, Elmoustapha Ould-Ahmed-Vall, Robert Valentine, Amit Gradstein, Simon Rubanovich, Zeev Sperber
-
Patent number: 9594687Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for pre-fetching content. One of the systems includes a pre-fetcher configured to perform operations including determining, for a virtual machine executing on a device and using a first virtual machine physical address associated with the virtual machine, a second virtual machine physical address for data to pre-fetch for the execution of the virtual machine on the device, determining, using the second virtual machine physical address and an address mapping that associates virtual machine physical addresses for the virtual machine with device physical addresses for the device, a device physical address for the data, and requesting the data from a memory using the device physical address.Type: GrantFiled: April 14, 2015Date of Patent: March 14, 2017Assignee: Google Inc.Inventors: Richard Yoo, Liqun Cheng, Parthasarathy Ranganathan, Rama Krishna Govindaraju
-
Patent number: 9582413Abstract: Embodiments relate to accessing data in a memory. A method for accessing data in a memory coupled to a processor is provided. The method receives a memory reference instruction for accessing data of a first size at an address in the memory. The method determines an alignment size of the address in the memory. The method accesses the data of the first size in one or more groups of data by accessing each group of data block concurrently. The groups of data have sizes that are multiples of the alignment size.Type: GrantFiled: December 4, 2014Date of Patent: February 28, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Jonathan D. Bradbury, Michael K. Gschwind, Christian Jacobi, Timothy J. Slegel
-
Patent number: 9554308Abstract: Methods, systems, and devices are described for transmitting scheduling requests for uplink transmission resources following a handover. A user equipment (UE) may determine that a handover from a first base station to a second base station has occurred, and may implement one or more processes to enhance efficiency in communications following the handover. A UE, for example, may wait for successful acquisition and/or derivation of timing information from a base station before attempting to schedule uplink resources with the base station.Type: GrantFiled: August 19, 2014Date of Patent: January 24, 2017Assignee: QUALCOMM IncorporatedInventors: Mouaffac Ambriss, Mutaz Zuhier Afif Shukair, Salil Sawhney, Deepak Krishnamoorthi
-
Patent number: 9536110Abstract: The instruction code including an instruction code stored in the area where the encrypted instruction code is stored in a non-rewritable format is authenticated using a specific key which is specific to the core where the instruction code is executed or an authenticated key by a specific key to perform an encryption processing for the input and output data between the core and the outside.Type: GrantFiled: November 27, 2013Date of Patent: January 3, 2017Assignee: SOCIONEXT INC.Inventors: Seiji Goto, Jun Kamada, Taiji Tamiya
-
Patent number: 9529043Abstract: The disclosure describes novel methods and apparatuses for controlling a device's TCA circuit when the device exists in a JTAG daisy-chain arrangement with other devices. The methods and apparatuses allow the TCA test pattern set used during device manufacturing to be reused when the device is placed in a JTAG daisy-chain arrangement with other devices, such as in a customers system using the device. Additional embodiments are also provided and described in the disclosure.Type: GrantFiled: April 30, 2015Date of Patent: December 27, 2016Assignee: Texas Instruments IncorporatedInventor: Lee D. Whetsel
-
Patent number: 9495163Abstract: A data processing apparatus is provided comprising processing circuitry and an instruction decoder responsive to program instructions to control processing circuitry to perform the data processing. The instruction decoder is responsive to an address calculating instruction to perform an address calculating operation for calculating a partial address result from a non-fixed reference address and a partial offset value such that a full address specifying a memory location of an information entity is calculable from said partial address result using at least one supplementary program instruction. The partial offset value has a bit-width greater than or equal to said instruction size and is encoded within at least one partial offset field of said address calculating instruction. A corresponding data processing method, virtual machine and computer program product are also provided.Type: GrantFiled: December 17, 2014Date of Patent: November 15, 2016Assignee: ARM LimitedInventors: Nigel John Stephens, David James Seal
-
Patent number: 9489203Abstract: The present application describes a method and apparatus for prefetching instructions based on predicted branch target addresses. Some embodiments of the method include providing a second cache line to a second cache when a target address for a branch instruction in a first cache line of a first cache is included in the second cache line of the first cache and when the second cache line is not resident in the second cache.Type: GrantFiled: December 11, 2012Date of Patent: November 8, 2016Assignee: Advanced Micro Devices, Inc.Inventor: James D. Dundas
-
Patent number: 9430425Abstract: Systems and methods for resource sharing of pipelined circuitry of an integrated circuit (IC) are provided. For example, in one embodiment, a method for sharing a functional unit of an integrated circuit (IC) includes receiving two or more threads configured to access the functional unit through two or more data entry points associated with corresponding data exit points configured to receive processed thread data. The method further includes arbitrating the processing of the two or more threads by the functional unit to obtain the processed thread data. To arbitrate, the exit points that cannot receive additional data are determined. Threads are only received from data entry points with corresponding data exit points that can receive additional data. The processed output data is provided to a corresponding exit point.Type: GrantFiled: December 27, 2012Date of Patent: August 30, 2016Assignee: Altera CorporationInventor: Tomasz Czajkowski
-
Patent number: 9230002Abstract: A method for sharing information between a publisher and multiple subscribers is provided. The publisher uses a latch-free, single publisher, multiple subscriber shared queue to share information. Logical change records representing changes made to a database are enqueued in the shared queue as messages in a stream of messages, and subscribers read the logical change records. Subscribers may filter logical change records before sending to apply processes for processing. An identifying property of the source instance of a change encapsulated in a logical change record may be included with each message enqueued.Type: GrantFiled: January 30, 2009Date of Patent: January 5, 2016Assignee: ORACLE INTERNATIONAL CORPORATIONInventors: Lik Wong, Nimar Arora, Lei Gao, Thuvan Hoang, Haobo Xu
-
Patent number: 9128698Abstract: Disclosed herein are systems, apparatuses, and methods performing in a computer processor of performing a rotate and XOR in response to a single XOR and rotate instruction, wherein the rotate and XOR instruction includes a first and second source operand, a destination operand, and an immediate value.Type: GrantFiled: September 28, 2012Date of Patent: September 8, 2015Assignee: Intel CorporationInventors: Vinodh Gopal, Gilbert M. Wolrich, James D. Guilford, Kirk S. Yap