Patents by Inventor Donglok Kim
Donglok Kim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7777749Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.Type: GrantFiled: November 16, 2006Date of Patent: August 17, 2010Assignee: University of WashingtonInventors: Chris Yoochang Chung, Donglok Kim, Yongmin Kim
-
Patent number: 7409530Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.Type: GrantFiled: December 17, 2004Date of Patent: August 5, 2008Assignee: University of WashingtonInventors: Donglok Kim, Stefan G. Berg, Weiyun Sun, Yongmin Kim
-
Patent number: 7234040Abstract: Data are prefetched into a cache from a prefetch region of memory, based on a program instruction reference and on compile-time information that indicates the bounds of the prefetch region, a size of a prefetch block, and a location of the prefetch block. If the program reference address lies with the prefetch region, an offset distance is used to determine the address of the prefetch block. Prefetching is performed either from a continuous one-dimensional prefetch region, or an embedded multi-dimensional prefetch region. The prefetch block address is respectively determined in one dimension or multiple dimensions. Program-directed prefetching is implemented by a media processor or by a separate processing component in communication with the media processor. The primary components include a program-directed prefetch controller, a cache, a function unit, and a memory. Preferably, region registers store the compile-time information, and the prefetched data are stored in a cache prefetch buffer.Type: GrantFiled: July 20, 2004Date of Patent: June 19, 2007Assignee: University of WashingtonInventors: Stefan G. Berg, Donglok Kim, Yongmin Kim
-
Patent number: 7233931Abstract: A feature regulation application method for hierarchical decision learning systems receives feature regulation training data and invokes a plurality of hierarchical decision learning to create feature subset information output. The method receives learning data and uses the feature subset information and the learning data to create feature subset learning data output. The hierarchical decision learning method uses the feature subset learning data to create hierarchical decision output. The feature regulation method also outputs feature ranking information that can be used to create hierarchical decision output. The invention provides a computationally feasible method for feature selection that considers the hierarchical decision learning systems used for decision making.Type: GrantFiled: December 26, 2003Date of Patent: June 19, 2007Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
-
Patent number: 7209141Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.Type: GrantFiled: August 17, 2004Date of Patent: April 24, 2007Assignee: University of WashingtonInventors: Rohit Garg, Chris Yoochang Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
-
Publication number: 20070070079Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.Type: ApplicationFiled: November 16, 2006Publication date: March 29, 2007Applicant: University of WashingtonInventors: Chris Chung, Donglok Kim, Yongmin Kim
-
Patent number: 7158141Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.Type: GrantFiled: January 17, 2002Date of Patent: January 2, 2007Assignee: University of WashingtonInventors: Chris Yoochang Chung, Donglok Kim, Yongmin Kim
-
Patent number: 7142718Abstract: An accumulation transformation method for fast pattern search accurately locates general patterns of interest. The method can be used for fast invariant search to match patterns of interest in images where the searched pattern varies in size or orientation or aspect ratio, when pattern appearance is degraded, when the pattern is partially occluded, where the searched image is large, multidimensional, or very high resolution, or where the pattern size is large. The accumulation transformations of the input image are determined based upon the searched projection directions. Projection profile result images are derived from the accumulation transformed input image and used for fast matching with template pattern projection profiles.Type: GrantFiled: October 28, 2002Date of Patent: November 28, 2006Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
-
Patent number: 7096207Abstract: A learning acceleration method is disclosed that can be applied to multiple types and stages of learning to enhance the learning efficiency and outcome. Artificially created training samples can improve representation of all classes in the training set, decrease the difficulty of obtaining sufficient training samples, and decrease the difficulty of unequal sample prevalence. Two specific embodiments of learning acceleration are disclosed: learning accelerated algorithm training and learning accelerated start-up learning. Three objects of interest implantation methods are disclosed: texture mapping of defects, parametric synthesis of negative samples, and manual image editing.Type: GrantFiled: March 22, 2002Date of Patent: August 22, 2006Inventors: Donglok Kim, Shih-Jong J. Lee, Seho Oh
-
Patent number: 7054492Abstract: An accumulation method for fast pattern search can accurately locate regular shaped patterns of interest. The method can be used for invariant search to match patterns of interest in images where the searched pattern varies in size or orientation or aspect ratio, when pattern appearance is degraded, and even when the pattern is partially occluded, where the searched image is large, multidimensional, or very high resolution, or where the pattern size is large. The computation requirement is independent of the size of the pattern region.Type: GrantFiled: September 24, 2002Date of Patent: May 30, 2006Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
-
Publication number: 20050144147Abstract: A feature regulation application method for hierarchical decision learning systems receives a feature regulation training data. A feature regulation method uses the feature regulation training data and invokes a plurality of the hierarchical decision learning to create feature subset information output. The feature regulation application method also receives a learning data. A feature sampling method uses the feature subset information and the learning data to create a feature subset learning data output. A hierarchical decision learning method uses the feature subset learning data to create a hierarchical decision system output. The feature regulation method also outputs feature ranking information. A feature regulated hierarchical decision learning method uses the feature subset learning data and the feature ranking information to create a hierarchical decision system output.Type: ApplicationFiled: December 26, 2003Publication date: June 30, 2005Inventors: Shih-Jong Lee, Seho Oh, Donglok Kim
-
Publication number: 20050102489Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.Type: ApplicationFiled: December 17, 2004Publication date: May 12, 2005Applicant: University of WashingtonInventors: Donglok Kim, Stefan Berg, Weiyun Sun, Yongmin Kim
-
Patent number: 6888892Abstract: A method for efficiently padding a macroblock of a video object plane employs two new instructions. The instructions, PadToRight and PadToLeft, are applied in alternating sequence during a PadPass 1 operation and a PadPass 2 operation. The results of these two operations are then averaged to pad each transparent pixel in each row of a macroblock that includes at least one opaque pixel. A Shift_in register is used to temporarily store data to facilitate the operation implemented by these instructions. Once the transparent pixels in each row have been padded horizontally, pixels in rows having shape data equal to zero (indicating all pixels in the row are transparent) are padded in a pre-processing step, followed by an upward propagation step. The two instructions are preferably implemented using 2:1 multiplexers implemented with an arithmetic logic unit. The method is particularly useful in set-top boxes, games, and other video applications.Type: GrantFiled: December 10, 2001Date of Patent: May 3, 2005Assignee: University of WashingtonInventors: Chris Yoochang Chung, Kerem Karadayi, Rohit Garg, Donglok Kim, Yongmin Kim
-
Patent number: 6859870Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.Type: GrantFiled: March 7, 2000Date of Patent: February 22, 2005Assignee: University of WashingtonInventors: Donglok Kim, Stefan G. Berg, Weiyun Sun, Yongmin Kim
-
Publication number: 20050035968Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.Type: ApplicationFiled: August 17, 2004Publication date: February 17, 2005Applicant: University of WashingtonInventors: Rohit Garg, Chris Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
-
Patent number: 6842177Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.Type: GrantFiled: December 14, 2001Date of Patent: January 11, 2005Assignee: University of WashingtonInventors: Rohit Garg, Chris Yoochang Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
-
Publication number: 20040268051Abstract: Data are prefetched into a cache from a prefetch region of memory, based on a program instruction reference and on compile-time information that indicates the bounds of the prefetch region, a size of a prefetch block, and a location of the prefetch block. If the program reference address lies with the prefetch region, an offset distance is used to determine the address of the prefetch block. Prefetching is performed either from a continuous one-dimensional prefetch region, or an embedded multi-dimensional prefetch region. The prefetch block address is respectively determined in one dimension or multiple dimensions. Program-directed prefetching is implemented by a media processor or by a separate processing component in communication with the media processor. The primary components include a program-directed prefetch controller, a cache, a function unit, and a memory. Preferably, region registers store the compile-time information, and the prefetched data are stored in a cache prefetch buffer.Type: ApplicationFiled: July 20, 2004Publication date: December 30, 2004Applicant: University of WashingtonInventors: Stefan G. Berg, Donglok Kim, Yongmin Kim
-
Patent number: 6804771Abstract: A processor including a transposable register file. The register file allows normal row-wise access to data and also allows a transposed column-wise access to data stored in a column among registers of the register file. In transposed access mode, a data operand is accessed in a given partition of each of n registers. One register stores a first partition. An adjacent register stores the second partition, and so forth for each of n partitions of the operand. A queue-based transposable register file also is implemented. The queue-based transposable register file includes a head pointer and a tail pointer and has a virtual register. Data written into the virtual register is written into one of the registers as selected by the head pointer. Data read from the virtual register is read from one of the registers as selected by the tail pointer.Type: GrantFiled: July 25, 2000Date of Patent: October 12, 2004Assignee: University of WashingtonInventors: Yoochang Jung, Stefan G. Berg, Donglok Kim, Yongmin Kim
-
Patent number: 6785743Abstract: The template data transfer coprocessor (TDTP) offloads block data transfer operations from a mediaprocessor. A uni-block template, program-guided template, an indirect template and queue-based template are described. The TDTP includes a template interpreter that employs an event-driven control mechanism to set up a template and compute block information and block information for each template. The programming involved in defining block data transfers for video and image processing algorithms is substantially reduced by the use of these templates.Type: GrantFiled: March 22, 2000Date of Patent: August 31, 2004Assignee: University of WashingtonInventors: Weiyun Sun, Donglok Kim, Yongmin Kim
-
Patent number: 6782470Abstract: The register file of a processor includes embedded operand queues. The configuration of the register file into registers and operand queues is defined dynamically by a computer program. The programmer determines the trade-off between the number and size of the operand queue(s) versus the number of registers used for the program. The programmer partitions a portion of the registers into one or more operand queues. A given queue occupies a consecutive set of registers, although multiple queues need not occupy consecutive registers. An additional address bit is included to distinguish operand queue addresses from register addresses. Queue state logic tracks status information for each queue, including a header pointer, tail pointer, start address, end address and number of vacancies value. The program sets the locations and depth of a given operand queue within the register file.Type: GrantFiled: November 6, 2000Date of Patent: August 24, 2004Assignee: University of WashingtonInventors: Stefan G. Berg, Michael S. Grow, Weiyun Sun, Donglok Kim, Yongmin Kim