Patents by Inventor Donglok Kim

Donglok Kim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7777749
    Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.
    Type: Grant
    Filed: November 16, 2006
    Date of Patent: August 17, 2010
    Assignee: University of Washington
    Inventors: Chris Yoochang Chung, Donglok Kim, Yongmin Kim
  • Patent number: 7409530
    Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.
    Type: Grant
    Filed: December 17, 2004
    Date of Patent: August 5, 2008
    Assignee: University of Washington
    Inventors: Donglok Kim, Stefan G. Berg, Weiyun Sun, Yongmin Kim
  • Patent number: 7234040
    Abstract: Data are prefetched into a cache from a prefetch region of memory, based on a program instruction reference and on compile-time information that indicates the bounds of the prefetch region, a size of a prefetch block, and a location of the prefetch block. If the program reference address lies with the prefetch region, an offset distance is used to determine the address of the prefetch block. Prefetching is performed either from a continuous one-dimensional prefetch region, or an embedded multi-dimensional prefetch region. The prefetch block address is respectively determined in one dimension or multiple dimensions. Program-directed prefetching is implemented by a media processor or by a separate processing component in communication with the media processor. The primary components include a program-directed prefetch controller, a cache, a function unit, and a memory. Preferably, region registers store the compile-time information, and the prefetched data are stored in a cache prefetch buffer.
    Type: Grant
    Filed: July 20, 2004
    Date of Patent: June 19, 2007
    Assignee: University of Washington
    Inventors: Stefan G. Berg, Donglok Kim, Yongmin Kim
  • Patent number: 7233931
    Abstract: A feature regulation application method for hierarchical decision learning systems receives feature regulation training data and invokes a plurality of hierarchical decision learning to create feature subset information output. The method receives learning data and uses the feature subset information and the learning data to create feature subset learning data output. The hierarchical decision learning method uses the feature subset learning data to create hierarchical decision output. The feature regulation method also outputs feature ranking information that can be used to create hierarchical decision output. The invention provides a computationally feasible method for feature selection that considers the hierarchical decision learning systems used for decision making.
    Type: Grant
    Filed: December 26, 2003
    Date of Patent: June 19, 2007
    Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
  • Patent number: 7209141
    Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.
    Type: Grant
    Filed: August 17, 2004
    Date of Patent: April 24, 2007
    Assignee: University of Washington
    Inventors: Rohit Garg, Chris Yoochang Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
  • Publication number: 20070070079
    Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.
    Type: Application
    Filed: November 16, 2006
    Publication date: March 29, 2007
    Applicant: University of Washington
    Inventors: Chris Chung, Donglok Kim, Yongmin Kim
  • Patent number: 7158141
    Abstract: A programmable graphics pipeline and method for processing multiple partitioned multimedia data, such as graphics data, image data, video data, or audio data. A preferred embodiment of the programmable graphics pipeline includes an instruction cache, a register file, and a vector functional unit that perform partitioned instructions. In addition, an enhanced rasterization unit is used to generate inverse-mapped source coordinates in addition to destination output coordinates for graphics and other media processing. An enhanced texture address unit generates corresponding memory addresses of source texture data for graphics processing and source media data for media processing. Data retrieved from memory are stored in an enhanced texture cache for use by the vector functional unit. A vector output unit includes a blending unit for graphics data and an output buffer for wide media data.
    Type: Grant
    Filed: January 17, 2002
    Date of Patent: January 2, 2007
    Assignee: University of Washington
    Inventors: Chris Yoochang Chung, Donglok Kim, Yongmin Kim
  • Patent number: 7142718
    Abstract: An accumulation transformation method for fast pattern search accurately locates general patterns of interest. The method can be used for fast invariant search to match patterns of interest in images where the searched pattern varies in size or orientation or aspect ratio, when pattern appearance is degraded, when the pattern is partially occluded, where the searched image is large, multidimensional, or very high resolution, or where the pattern size is large. The accumulation transformations of the input image are determined based upon the searched projection directions. Projection profile result images are derived from the accumulation transformed input image and used for fast matching with template pattern projection profiles.
    Type: Grant
    Filed: October 28, 2002
    Date of Patent: November 28, 2006
    Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
  • Patent number: 7096207
    Abstract: A learning acceleration method is disclosed that can be applied to multiple types and stages of learning to enhance the learning efficiency and outcome. Artificially created training samples can improve representation of all classes in the training set, decrease the difficulty of obtaining sufficient training samples, and decrease the difficulty of unequal sample prevalence. Two specific embodiments of learning acceleration are disclosed: learning accelerated algorithm training and learning accelerated start-up learning. Three objects of interest implantation methods are disclosed: texture mapping of defects, parametric synthesis of negative samples, and manual image editing.
    Type: Grant
    Filed: March 22, 2002
    Date of Patent: August 22, 2006
    Inventors: Donglok Kim, Shih-Jong J. Lee, Seho Oh
  • Patent number: 7054492
    Abstract: An accumulation method for fast pattern search can accurately locate regular shaped patterns of interest. The method can be used for invariant search to match patterns of interest in images where the searched pattern varies in size or orientation or aspect ratio, when pattern appearance is degraded, and even when the pattern is partially occluded, where the searched image is large, multidimensional, or very high resolution, or where the pattern size is large. The computation requirement is independent of the size of the pattern region.
    Type: Grant
    Filed: September 24, 2002
    Date of Patent: May 30, 2006
    Inventors: Shih-Jong J. Lee, Seho Oh, Donglok Kim
  • Publication number: 20050144147
    Abstract: A feature regulation application method for hierarchical decision learning systems receives a feature regulation training data. A feature regulation method uses the feature regulation training data and invokes a plurality of the hierarchical decision learning to create feature subset information output. The feature regulation application method also receives a learning data. A feature sampling method uses the feature subset information and the learning data to create a feature subset learning data output. A hierarchical decision learning method uses the feature subset learning data to create a hierarchical decision system output. The feature regulation method also outputs feature ranking information. A feature regulated hierarchical decision learning method uses the feature subset learning data and the feature ranking information to create a hierarchical decision system output.
    Type: Application
    Filed: December 26, 2003
    Publication date: June 30, 2005
    Inventors: Shih-Jong Lee, Seho Oh, Donglok Kim
  • Publication number: 20050102489
    Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.
    Type: Application
    Filed: December 17, 2004
    Publication date: May 12, 2005
    Applicant: University of Washington
    Inventors: Donglok Kim, Stefan Berg, Weiyun Sun, Yongmin Kim
  • Patent number: 6888892
    Abstract: A method for efficiently padding a macroblock of a video object plane employs two new instructions. The instructions, PadToRight and PadToLeft, are applied in alternating sequence during a PadPass 1 operation and a PadPass 2 operation. The results of these two operations are then averaged to pad each transparent pixel in each row of a macroblock that includes at least one opaque pixel. A Shift_in register is used to temporarily store data to facilitate the operation implemented by these instructions. Once the transparent pixels in each row have been padded horizontally, pixels in rows having shape data equal to zero (indicating all pixels in the row are transparent) are padded in a pre-processing step, followed by an upward propagation step. The two instructions are preferably implemented using 2:1 multiplexers implemented with an arithmetic logic unit. The method is particularly useful in set-top boxes, games, and other video applications.
    Type: Grant
    Filed: December 10, 2001
    Date of Patent: May 3, 2005
    Assignee: University of Washington
    Inventors: Chris Yoochang Chung, Kerem Karadayi, Rohit Garg, Donglok Kim, Yongmin Kim
  • Patent number: 6859870
    Abstract: A VLIW instruction format is introduced having a set of control bits which identify subinstruction sharing conditions. At compilation the VLIW instruction is analyzed to identify subinstruction sharing opportunities. Such opportunities are encoded in the control bits of the instruction. Before the instruction is moved into the instruction cache, the instruction is compressed into the new format to delete select redundant occurrences of a subinstruction. Specifically, where a subinstruction is to be shared by corresponding functional processing units of respective clusters, the subinstruction need only appear in the instruction once. The redundant appearance is deleted. The control bits are decoded at instruction parsing time to route a shared subinstruction to the associated functional processing units.
    Type: Grant
    Filed: March 7, 2000
    Date of Patent: February 22, 2005
    Assignee: University of Washington
    Inventors: Donglok Kim, Stefan G. Berg, Weiyun Sun, Yongmin Kim
  • Publication number: 20050035968
    Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.
    Type: Application
    Filed: August 17, 2004
    Publication date: February 17, 2005
    Applicant: University of Washington
    Inventors: Rohit Garg, Chris Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
  • Patent number: 6842177
    Abstract: A boundary macroblock of a video object is padded without significant synchronization overhead between a host processor and an existing coprocessor. The host processor determines horizontal and vertical graphics primitives as a function of shape data stored in a host memory. The shape data determine whether a dot, a line, or a rectangle primitive should be used to pad transparent pixels in the macroblock. The host processor communicates the primitives to a coprocessor, which renders the primitives in an interleaved pipeline fashion to pad transparent pixels of the macroblock based on texture data stored in video memory. The flow of primitives is in one direction from the host processor to the graphics coprocessor, and the texture data is not transferred back and forth between the host processor and coprocessor. This technique is especially useful for enabling acceleration of MPEG-4 video decoding utilizing existing coprocessors capable of accelerating MPEG-1/2 video decoding.
    Type: Grant
    Filed: December 14, 2001
    Date of Patent: January 11, 2005
    Assignee: University of Washington
    Inventors: Rohit Garg, Chris Yoochang Chung, Coskun Mermer, Donglok Kim, Yongmin Kim
  • Publication number: 20040268051
    Abstract: Data are prefetched into a cache from a prefetch region of memory, based on a program instruction reference and on compile-time information that indicates the bounds of the prefetch region, a size of a prefetch block, and a location of the prefetch block. If the program reference address lies with the prefetch region, an offset distance is used to determine the address of the prefetch block. Prefetching is performed either from a continuous one-dimensional prefetch region, or an embedded multi-dimensional prefetch region. The prefetch block address is respectively determined in one dimension or multiple dimensions. Program-directed prefetching is implemented by a media processor or by a separate processing component in communication with the media processor. The primary components include a program-directed prefetch controller, a cache, a function unit, and a memory. Preferably, region registers store the compile-time information, and the prefetched data are stored in a cache prefetch buffer.
    Type: Application
    Filed: July 20, 2004
    Publication date: December 30, 2004
    Applicant: University of Washington
    Inventors: Stefan G. Berg, Donglok Kim, Yongmin Kim
  • Patent number: 6804771
    Abstract: A processor including a transposable register file. The register file allows normal row-wise access to data and also allows a transposed column-wise access to data stored in a column among registers of the register file. In transposed access mode, a data operand is accessed in a given partition of each of n registers. One register stores a first partition. An adjacent register stores the second partition, and so forth for each of n partitions of the operand. A queue-based transposable register file also is implemented. The queue-based transposable register file includes a head pointer and a tail pointer and has a virtual register. Data written into the virtual register is written into one of the registers as selected by the head pointer. Data read from the virtual register is read from one of the registers as selected by the tail pointer.
    Type: Grant
    Filed: July 25, 2000
    Date of Patent: October 12, 2004
    Assignee: University of Washington
    Inventors: Yoochang Jung, Stefan G. Berg, Donglok Kim, Yongmin Kim
  • Patent number: 6785743
    Abstract: The template data transfer coprocessor (TDTP) offloads block data transfer operations from a mediaprocessor. A uni-block template, program-guided template, an indirect template and queue-based template are described. The TDTP includes a template interpreter that employs an event-driven control mechanism to set up a template and compute block information and block information for each template. The programming involved in defining block data transfers for video and image processing algorithms is substantially reduced by the use of these templates.
    Type: Grant
    Filed: March 22, 2000
    Date of Patent: August 31, 2004
    Assignee: University of Washington
    Inventors: Weiyun Sun, Donglok Kim, Yongmin Kim
  • Patent number: 6782470
    Abstract: The register file of a processor includes embedded operand queues. The configuration of the register file into registers and operand queues is defined dynamically by a computer program. The programmer determines the trade-off between the number and size of the operand queue(s) versus the number of registers used for the program. The programmer partitions a portion of the registers into one or more operand queues. A given queue occupies a consecutive set of registers, although multiple queues need not occupy consecutive registers. An additional address bit is included to distinguish operand queue addresses from register addresses. Queue state logic tracks status information for each queue, including a header pointer, tail pointer, start address, end address and number of vacancies value. The program sets the locations and depth of a given operand queue within the register file.
    Type: Grant
    Filed: November 6, 2000
    Date of Patent: August 24, 2004
    Assignee: University of Washington
    Inventors: Stefan G. Berg, Michael S. Grow, Weiyun Sun, Donglok Kim, Yongmin Kim