Patents by Inventor Vojin G. Oklobdzija
Vojin G. Oklobdzija has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11967955Abstract: A clocked storage element comprises a first latch having an input data node, a clock input node and a first latch output data node, and a second latch having an input connected to the first latch output data node, a clock input node and a second latch output data node. The first and second latches can have a clocked pull-up current path consisting of two p-channel transistors between their respective output data nodes and the VDD supply line, and a clocked pull-down current path consisting of two n-channel transistors between their respective output data nodes and the VSS supply line.Type: GrantFiled: January 12, 2023Date of Patent: April 23, 2024Assignee: SambaNova Systems, Inc.Inventor: Vojin G. Oklobdzija
-
Patent number: 11831316Abstract: A system of free running oscillators synchronized to the lowest frequency running one and following PVT variation generates a system clock. A method is particularly applicable to clock relatively small clock domains within a multi-core chip containing thousands of cores, and where the clock domain encompasses one or more cores and additional logic blocks. The resulting system clock is divided by 2k using latches or flip-flops to achieve a symmetric 50-50 duty cycle of the system clock. Further, such PVT insensitive system clock can be used as a reference for a PLL or DLL generated clock for the domain.Type: GrantFiled: July 29, 2022Date of Patent: November 28, 2023Inventor: Vojin G. Oklobdzija
-
Publication number: 20230155579Abstract: A clocked storage element comprises a first latch having an input data node, a clock input node and a first latch output data node, and a second latch having an input connected to the first latch output data node, a clock input node and a second latch output data node. The first and second latches can have a clocked pull-up current path consisting of two p-channel transistors between their respective output data nodes and the VDD supply line, and a clocked pull-down current path consisting of two n-channel transistors between their respective output data nodes and the VSS supply line.Type: ApplicationFiled: January 12, 2023Publication date: May 18, 2023Applicant: SambaNova Systems, Inc.Inventor: Vojin G. OKLOBDZIJA
-
Publication number: 20230087096Abstract: A system of free running oscillators synchronized to the lowest frequency running one and following PVT variation generates a system clock. A method is particularly applicable to clock relatively small clock domains within a multi-core chip containing thousands of cores, and where the clock domain encompasses one or more cores and additional logic blocks. The resulting system clock is divided by 2k using latches or flip-flops to achieve a symmetric 50-50 duty cycle of the system clock. Further, such PVT insensitive system clock can be used as a reference for a PLL or DLL generated clock for the domain.Type: ApplicationFiled: July 29, 2022Publication date: March 23, 2023Inventor: Vojin G. Oklobdzija
-
Publication number: 20230045265Abstract: A clocked storage element comprises a first latch having an input data node, a clock input node and a first latch output data node, and a second latch having an input connected to the first latch output data node, a clock input node and a second latch output data node. The first and second latches can have a clocked pull-up current path consisting of two p-channel transistors between their respective output data nodes and the VDD supply line, and a clocked pull-down current path consisting of two n-channel transistors between their respective output data nodes and the VSS supply line.Type: ApplicationFiled: December 15, 2021Publication date: February 9, 2023Applicant: SambaNova Systems, Inc.Inventor: Vojin G. OKLOBDZIJA
-
Publication number: 20230015430Abstract: A floating-point accumulator circuit includes an addend input register having an addend exponent and an addend significand and an accumulation register with a first portion to hold a representation of an accumulation exponent and a second portion to hold a representation of an accumulation significand. A control circuit is also included to generate an accumulator zero control signal and an addend zero control signal based on the addend exponent and the accumulation exponent. It also includes an adder circuit with an output an input of the accumulation register. A first zeroing circuit sends either a zero or a value based on the addend significand to a first input of the adder circuit based on the addend zero control signal, and a second zeroing circuit sends either zeros or a value based on the accumulator significand to a second input of the adder circuit, based on the accumulator zero control signal.Type: ApplicationFiled: September 12, 2022Publication date: January 19, 2023Applicant: SambaNova Systems, Inc.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Patent number: 11558041Abstract: A clocked storage element comprises a first latch having an input data node, a clock input node and a first latch output data node, and a second latch having an input connected to the first latch output data node, a clock input node and a second latch output data node. The first and second latches can have a clocked pull-up current path consisting of two p-channel transistors between their respective output data nodes and the VDD supply line, and a clocked pull-down current path consisting of two n-channel transistors between their respective output data nodes and the VSS supply line.Type: GrantFiled: December 15, 2021Date of Patent: January 17, 2023Assignee: SambaNova Systems, Inc.Inventor: Vojin G. Oklobdzija
-
Publication number: 20230004353Abstract: A floating-point accumulator circuit includes a floating-point input having an input significand field and a first shifter coupled to the input significand field and providing an output of the input significand field shifted by a first amount. A carry-save adder has a first, second, and third input and an output. The first input is coupled to the output of the first shifter and the output provides carry bits and sum bits representing a summation of the first input, the second input, and the third input as a significand of the accumulated value. Shifters are also coupled to the carry bits and the sum bits of the output of the carry-save adder to respectively provide the carry bits and the sum bits, both shifted by a second amount, to the second input and the third input of the carry-save adder.Type: ApplicationFiled: September 12, 2022Publication date: January 5, 2023Applicant: SambaNova Systems, Inc.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Publication number: 20220308834Abstract: A Floating point Multiply-Add, Accumulate Unit, supporting BF16 format for Multiply-Accumulate operations, and FP32 Single-Precision Addition complying with the IEEE 754 Standard is described with exception handling. Operations including exception handling in a way that does not interfere with execution of data flow operations, overflow detection, zero detection and sign extension are adopted for 2's complement and Carry-Save format.Type: ApplicationFiled: November 23, 2021Publication date: September 29, 2022Applicant: SAMBANOVA SYSTEMS, INC.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Patent number: 11442696Abstract: A Floating point Multiply-Add, Accumulate Unit, supporting BF16 format for Multiply-Accumulate operations, and FP32 Single-Precision Addition complying with the IEEE 754 Standard is described with exception handling. Operations including exception handling in a way that does not interfere with execution of data flow operations, overflow detection, zero detection and sign extension are adopted for 2's complement and Carry-Save format.Type: GrantFiled: November 23, 2021Date of Patent: September 13, 2022Assignee: SAMBANOVA SYSTEMS, INC.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Patent number: 11429349Abstract: Floating point Multiply-Add, Accumulate Unit, supporting BF16 format for Multiply-Accumulate operations, and FP32 Single-Precision Addition complying with the IEEE 754 Standard. The Multiply-Accumulate unit uses higher radix and longer internal 2's complement significand representation to facilitate precision as well as comparison and operation with negative numbers. The addition is performed using Carry-Save format to avoid long carry propagation and speed up the operation. Operations including overflow detection, zero detection and sign extension are adopted for 2s complement and Carry-Save format. Handling of Overflow and Sign Extension allows for fast operation relatively independent on the size of the accumulator.Type: GrantFiled: August 9, 2021Date of Patent: August 30, 2022Assignee: SambaNova Systems, Inc.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Patent number: 11411553Abstract: A system of free running oscillators synchronized to the lowest frequency running one and following PVT variation generates a system clock. A method is particularly applicable to clock relatively small clock domains within a multi-core chip containing thousands of cores, and where the clock domain encompasses one or more cores and additional logic blocks. The resulting system clock is divided by 2k using latches or flip-flops to achieve a symmetric 50-50 duty cycle of the system clock. Further, such PVT insensitive system clock can be used as a reference for a PLL or DLL generated clock for the domain.Type: GrantFiled: June 12, 2020Date of Patent: August 9, 2022Inventor: Vojin G. Oklobdzija
-
Patent number: 11366638Abstract: Floating point Multiply-Add, Accumulate Unit, supporting BF16 format for Multiply-Accumulate operations, and FP32 Single-Precision Addition complying with the IEEE 754 Standard. The Multiply-Accumulate unit uses higher radix and longer internal 2's complement significand representation to facilitate precision as well as comparison and operation with negative numbers. The addition can be performed using Carry-Save format to avoid long carry propagation and speed up the operation. The circuit uses early exponent comparison to shorten the accumulate pipeline stage. Operations including overflow detection, zero detection and sign extension are adopted for 2s complement and Carry-Save format.Type: GrantFiled: September 2, 2021Date of Patent: June 21, 2022Assignee: SambaNova Systems, Inc.Inventors: Vojin G. Oklobdzija, Matthew M. Kim
-
Patent number: 10707839Abstract: A system of free running oscillators synchronized to the lowest frequency running one and following PVT variation generates a system clock. A method is particularly applicable to clock relatively small clock domains within a multi-core chip containing thousands of cores, and where the clock domain encompasses one or more cores and additional logic blocks. The resulting system clock is divided by 2k using latches or flip-flops to achieve a symmetric 50-50 duty cycle of the system clock. Further, such PVT insensitive system clock can be used as a reference for a PLL or DLL generated clock for the domain.Type: GrantFiled: March 28, 2018Date of Patent: July 7, 2020Inventor: Vojin G. Oklobdzija
-
Patent number: 7509486Abstract: Methods and apparatus for an encryption processor for performing accelerated computations to establish secure network sessions. The encryption processor includes an execution unit and a decode unit. The execution unit is configured to execute Montgomery operations and including at least one adder and at least two multipliers. The decode unit is configured to determine if a square operation or a product operation needs to be performed and to issue the appropriate instructions so that certain multiply and/or addition operations are performed in parallel in the execution unit while performing either the Montgomery square or Montgomery product operation.Type: GrantFiled: July 7, 2000Date of Patent: March 24, 2009Assignee: Broadcom CorporationInventors: David K. Chin, Vojin G. Oklobdzija, Aamir Farooqui
-
Patent number: 6553541Abstract: Reduction of the complexity of a Viterbi-type sequence detector is disclosed. It was based on elimination of less probably taken branches in the trellis. The method is applied to the design of the E2PR4 channel with 8/9 rate sliding block trellis code. Coding, by itself eliminates two states by coding constraints, and the disclosed method reduces the number of required ACS units from 14 to 11, while reducing their complexity as well. For the implementation of E2PR4 detection, 4 4-way, 3 3-way, 3 2-way and one 1-way ACSs are needed. System simulations show no BER performance drop at common SNRs when compared with a full 16-state E2PR4 implementation in magnetic disk drives.Type: GrantFiled: April 5, 2000Date of Patent: April 22, 2003Assignee: Texas Instruments IncorporatedInventors: Borivoje Nikolic, Leo Fu, Michael Leung, Vojin G. Oklobdzija, Richard Yamasaki
-
Publication number: 20030062925Abstract: The present invention provides techniques, including a system and method, for improving speed in a flip-flop, having a pre-charged stage coupled to an evaluation stage. In one exemplary embodiment delay is reduced by using a conditional rather than an unconditional keeper, where the conditional keeper has the function of a keeper only under certain conditions. In some embodiments there is a conditional keeper in either the pre-charged stage or the evaluation stage or both stages. Another embodiment provides for the combining of the evaluation stage with one or more external logic functions.Type: ApplicationFiled: January 11, 2002Publication date: April 3, 2003Applicant: Fujitsu LimitedInventors: Nikola Nedovic, Vojin G. Oklobdzija, William W. Walker
-
Publication number: 20030062940Abstract: The invention relates generally to the field of electronic circuit design, and in particular to techniques for reducing hazards in a digital logic circuit, for example, a digital logic flip-flop circuit. In an embodiment of the present invention a method for reducing hazards in a flip-flop, including, a pre-charged stage coupled to an evaluation stage by at least an internal node, is provided. First, the pre-charged stage sets the internal node based on a data input. The evaluation stage is prevented from evaluating the internal node for a predetermined time period. After the predetermined time period, the internal node is evaluated by the evaluation stage to determine an output of the flip-flop.Type: ApplicationFiled: January 11, 2002Publication date: April 3, 2003Applicant: Fujitsu LimitedInventors: Nikola Nedovic, Vojin G. Oklobdzija, William W. Walker
-
Publication number: 20030056129Abstract: Techniques, including a system and method, are disclosed for conditionally pre-charging a memory circuit, for example a flip-flop, and thus reducing power consumption. In an embodiment a method for reducing power consumption in a memory circuit, including, a pre-charged stage coupled to an evaluation stage by at least an internal node, is provided. The method includes setting an input of the pre-charged stage to a first high logic level. Next, responsive to the setting of the input, the internal node is set to a low logic level within a first transparency window. Then responsive to the setting of the internal node, the evaluation stage changes the output of the evaluation stage to a second high logic level within the first transparency window. Lastly, when the input remains at the first high-logic level, the internal node is maintained at the low logic level through at least a second transparency window.Type: ApplicationFiled: January 11, 2002Publication date: March 20, 2003Applicant: Fujitsu LimitedInventors: Nikola Nedovic, Vojin G. Oklobdzija, William W. Walker
-
Publication number: 20020143841Abstract: A multiplexer based adder circuit. The novel adder design is suitable for a number of bit sizes, but in one exemplary embodiment is a 64-bit adder. A complete 16-bit scaled adder is taught. The adder circuit is efficient and reconfigurable in that the adder can be partitioned to support a variety of data formats. The adder can add two 64-bit operands, four 32-bit operands, eight 16-bit operands, or sixteen 8-bit operands. The reconfigurability of the adder for different word sizes is achieved using only a small number of control signals for partitioning without increasing the adder size or reducing its speed. The novel adder circuit is designed using multiplexer circuits and two input inverted logic gates making the adder very fast. The adder design recognizes that pass transistor based multiplexer circuits and inverted logic gates are the fastest circuit elements for standard CMOS logic.Type: ApplicationFiled: August 20, 2001Publication date: October 3, 2002Applicant: SONY CORPORATION AND SONY ELECTRONICS, INC.Inventors: Aamir A. Farooqui, Vojin G. Oklobdzija, Farzad Chehrazi