Discrete Cosine Patents (Class 375/240.2)
  • Publication number: 20090238263
    Abstract: A programmable energy efficient codec system is provided for encoding and decoding a plurality of application environments. A camera Codec and control system for an HD camera is provided for encoding uncompressed HD-SDI video signals into an MPEG-2 transport stream. A stand-alone encoder decoder system is provided in a network configuration allowing for remote display and editing of HD-SDI video. At least one plurality of HD-SDI transport streams is generated from HD-Cameras encoded into MPEG-2 transport streams and output into a DVD-ASI signal and a TS/IP packet stream further provided is a decoder which accepts MPEG-2-TS/IP packet streams from a routed IP network which are decoded into an uncompressed HD-SDI transport stream for display. A set top box is provided for decoding audio and video HD-TV. A first HDMI interface into the decoder allows acceptance of an MPEG-2-TS from local storage media. Connection to an IP routed network is provided.
    Type: Application
    Filed: February 6, 2009
    Publication date: September 24, 2009
    Inventors: Pawan Jaggi, Sandeep Kumar, Xiaohui Wei
  • Publication number: 20090238265
    Abstract: Disclosed herein is a decoding apparatus including: a pixel group information storage section configured to include memory areas each of which stores, as pixel group information, information concerning a corresponding pixel group and is referred to with a horizontal address of the pixel group, and an additional memory area that is located at a left end and stores a value indicating invalid pixel group information; a decoding information generation section configured to acquire the pixel group information identified by an address of a pixel group to be decoded and an address to the left thereof, and generating decoding information; a decoding section configured to perform a decoding process based on the decoding information; and a pixel group information update section configured to generate pixel group information and allowing it to be stored in the memory area identified by the address of the pixel group.
    Type: Application
    Filed: March 17, 2009
    Publication date: September 24, 2009
    Applicant: Sony Corporation
    Inventor: Hiroshi Yoshikawa
  • Publication number: 20090213926
    Abstract: The present invention relates to a method for up-sampling/down-sampling data of a video block in a scalable video data encoding/decoding. The up-sampling method according to the present invention obtains a 2N×2N enlarged block by computing a converting matrix to data of a given N×N video block. The converting block has matrix elements leading data of the video block to resultant data that could be obtained by a converting process that applies DCT to the data, pads some zeros to coefficients, and applies IDCT to the coefficients including the padded zeros. The down-sampling method according to the present invention obtains an N×N reduced block by computing a converting block to a given 2N×2N video block. The converting block for reducing has matrix elements leading data of the 2N×2N video block to resultant data that could be obtained by a converting process that applies DCT to the data, removes some coefficients from transformed coefficients, and applies IDCT to the remaining coefficients.
    Type: Application
    Filed: February 24, 2006
    Publication date: August 27, 2009
    Applicant: LG ELECTRONICS INC.
    Inventors: IL-Hong Shin, Hyun Wook Park
  • Patent number: 7580843
    Abstract: A synthesis subband filter apparatus is provided. The apparatus is used for processing 18 sets of signals which each includes 32 subband sampling signals in accordance with a specification providing 512 window coefficients. The apparatus includes a processor for processing the 18 sets of signals in sequence. The processor further includes a converting module and a generating module. The converting module is used for converting the 32 subband sampling signals of the set of signals being processed into 32 converted vectors by use of 32-points discrete cosine transform (DCT), and writing the 32 converted vectors into 512 default vectors with a first-in, first-out queue. The generating module is used for generating 32 pulse code modulation (PCM) signals, relative to the set of signals being processed according to a set of synthesis formulae proposed in this invention.
    Type: Grant
    Filed: May 8, 2006
    Date of Patent: August 25, 2009
    Assignee: Quanta Computer, Inc.
    Inventors: Chih-Hsien Chang, Chih-Wei Hung, Hsien-Ming Tsai
  • Patent number: 7561620
    Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least a two layers—a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.
    Type: Grant
    Filed: August 3, 2004
    Date of Patent: July 14, 2009
    Assignee: Microsoft Corporation
    Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, III, Richard Szeliski, Sing Bing Kang
  • Patent number: 7558428
    Abstract: The systems and methods described herein are directed at accelerating video encoding using a graphics processing unit. In one aspect, a video encoding system uses both a central processing unit (CPU) and a graphics processing unit (GPU) to perform video encoding. The system implements a technique that enables the GPU to perform motion estimation for video encoding. The technique allows the GPU to perform a motion estimation process in parallel with the video encoding process performed by the CPU. The performance of video encoding using such a system is greatly accelerated as compared to encoding using just the CPU. In another aspect, data related to motion estimation is arranged and provided to the GPU in a way that utilizes the capabilities of the GPU. Data about video frames may be collocated to enable multiple channels of the GPU to process tasks in parallel. The depth buffer of the GPU may be used to consolidate repeated calculations and searching tasks during the motion estimation process.
    Type: Grant
    Filed: September 13, 2004
    Date of Patent: July 7, 2009
    Assignee: Microsoft Corporation
    Inventors: Guobin Shen, Shipeng Li, Guangping Gao
  • Patent number: 7555044
    Abstract: A method and system of transform-based encoding, decoding, and bitstream generation of digital video content is disclosed. The digital video content comprises a stream of pictures, slices, or macroblocks which can each be intra, predicted or bi-predicted pictures. The pictures, slices, or macroblocks comprise blocks of pixels. The method and system comprises scanning frequency coefficients from each of the block's two dimensional array of frequency coefficients in a manner that is vertically biased, producing a one dimensional array of frequency coefficients. The method and system also provide for scanning/assigning frequency coefficients from a one dimensional array of frequency coefficients, producing a two dimensional array of frequency coefficients.
    Type: Grant
    Filed: October 6, 2003
    Date of Patent: June 30, 2009
    Assignee: General Instrument Corporation
    Inventors: Limin Wang, Yue Yu
  • Patent number: 7548583
    Abstract: During Motion Picture Experts Group (MPEG) video encoding a two-dimensional discrete cosine transform (DCT) is performed on data representing an original image. The resulting coefficients are then quantized, which typically results in many zero coefficients. Because of the nature of most video data, most higher-order coefficients are typically zero and the lower-order coefficients (i.e., those grouped towards the upper left of the matrix) are more likely to be non-zero. To reduce the lengths of runs among the lower-order coefficients, the coefficients can be encoded in a zig-zag pattern. In one embodiment, the zig-zag pattern is maintained and one or more masks are generated based on the output of the quantization phase. The one or more masks are used to identify the coefficients within the matrix that are non-zero. This reduces the number of accesses to memory required to encode the non-zero coefficients and runs of zero coefficients.
    Type: Grant
    Filed: August 9, 2005
    Date of Patent: June 16, 2009
    Assignee: Apple Inc.
    Inventor: Jason Klivington
  • Publication number: 20090122861
    Abstract: The present invention is directed to video coding/decoding and discloses a method for transforming to/from transform coefficients and residual pixel data in moving pictures by a set of semi-orthonormal basis vectors. The basis vectors are derived from conventional DCT or KTL matrixes, but relaxes to some extend the requirements for orthogonality, norm equality and element size limitation. In this way the present invention provides improved coding efficiency and lower complexity compared to previously used integer transforms.
    Type: Application
    Filed: November 11, 2008
    Publication date: May 14, 2009
    Applicant: TANDBERG TELECOM AS
    Inventors: Gisle BJONTEGAARD, Arild Fuldseth
  • Publication number: 20090116554
    Abstract: Disclosed is an apparatus (500) for generating a second compressed video stream (550) having a second resolution, from a first compressed video stream (540) having a first resolution. The apparatus comprises means (513) for extracting transform domain luma data and spatial domain chroma data from the first compressed video stream (540), means (514-516) for applying a transform domain operation to the luma data to form reconstructed transform domain luma data, means (518, 519, 560) for applying a spatial domain operation to the chroma data to form reconstructed spatial domain chroma data, and means for scaling the reconstructed transform domain luma data and reconstructed spatial domain chroma data to generate the second compressed video stream.
    Type: Application
    Filed: October 29, 2008
    Publication date: May 7, 2009
    Applicant: CANON KABUSHIKI KAISHA
    Inventors: Zhonghua Ma, I-Jan Eric Wang
  • Publication number: 20090110059
    Abstract: A system, method and devices for embedding end-user access information, such as digital rights management (DRM) information and/or conditional access (CA) information, in multimedia content, such as broadcast video content, and transmitting or distributing the multimedia content, including the end-user access information, to one or more end user devices, such as multimedia content processing (set-top) devices and mobile devices. The methods, systems and devices also involve receiving and decoding the transmitted multimedia content, including extracting end-user access information from the multimedia content and using the end-user access information to allow or deny access to the multimedia content by the end user device. The end-user access information is embedded in the multimedia content in a manner that reduces or even eliminates the need to also transmit end-user access information separately or along with the associated multimedia content.
    Type: Application
    Filed: October 31, 2007
    Publication date: April 30, 2009
    Applicant: General Instrument Corporation
    Inventors: Theodore S. Rzeszewski, Eiraj Rezania
  • Patent number: 7526030
    Abstract: An inputted digital signal of a first format (DV video signal) is restored to a variable-length code by having its framing cancelled by a de-framing section 11, then decoded by a variable-length decoding (VLD) section 12, inversely quantized by an inverse quantizing (IQ) section 13, and inversely weighted by an inverse weighting (IW) section 14. Then, required resolution conversion in the orthogonal transform domain (frequency domain) is carried out on the inversely weighted video signal by a resolution converting section 16. After that, the video signal having the resolution converted is weighted by a weighting (W) section 18, then quantized by a quantizing (Q) section 19, coded by variable-length coding by a variable-length coding (VLC) section 20, and outputted as a digital signal of a second format (MPEG video signal).
    Type: Grant
    Filed: June 13, 2007
    Date of Patent: April 28, 2009
    Assignee: Sony Corporation
    Inventors: Naofumi Yanagihara, Nobuaki Izumi
  • Publication number: 20090103624
    Abstract: An input module obtains a media signal to be communicated to a destination system, after which an identification module identifies a plurality of segments within the media signal. A codec includes a selection module that automatically selects different compression methods to respectively compress at least two of the segments. The compression methods are automatically selected to produce a highest compression quality for the respective segments according to a set of criteria without exceeding a target data rate. A compression module within the codec then compresses the segments using the automatically-selected compression methods, after which an output module delivers the compressed segments to the destination system with an indication of which compression method was used to compress each segment.
    Type: Application
    Filed: November 25, 2008
    Publication date: April 23, 2009
    Applicant: BROADCAST INTERNATIONAL
    Inventors: Danny L. Mabey, Jodie L. Reynolds, Lawrence S. Reznick, John E. Wilkinson, Jack A. Prock
  • Publication number: 20090103608
    Abstract: A method and system are provided for encoding a plurality of integers with variable-length code tables constructed by combining a plurality of structured code tables. Each code table has an associated set of integer values; the sets are disjoint and exhaustive, so that every integer appears in exactly one set. An integer is encoded using the codebook associated with the set in which the integer appears.
    Type: Application
    Filed: December 30, 2008
    Publication date: April 23, 2009
    Applicant: APPLE INC.
    Inventors: Ken Kengkuan LIN, Mitchell Howard OSLICK
  • Publication number: 20090080515
    Abstract: This disclosure describes efficient transformation techniques that can be used in video coding. In particular, intermediate results of computations associated with transformation of a first block of video data are reused in the transformation of a second block of video data. The techniques may be used during a motion estimation process in which video blocks of a search space are transformed, but this disclosure is not necessarily limited in this respect. Pipelining techniques may be used to accelerate the efficient transformation techniques, and transposition memories can be implemented to facilitate efficient pipelining.
    Type: Application
    Filed: September 26, 2007
    Publication date: March 26, 2009
    Applicant: QUALCOMM Incorporated
    Inventors: Raghavendra C. Nagaraj, De Dzwo Hsu, Stephen Molloy
  • Patent number: 7499491
    Abstract: A system to process a signal sequence is described. A hybrid block matching and transform based N-Dimensional signal sequence encoder and decoder is disclosed. The encoder includes encoder side block matching predictor, which includes entropy based cost function which can be estimated from certain energy measure of the block matching difference; a fast block matching search method to learn the results from neighboring blocks and to perform large range search with only a small number of points to visit.
    Type: Grant
    Filed: June 23, 2004
    Date of Patent: March 3, 2009
    Assignee: ViChip Corp. Limited
    Inventors: Tsu-Chang Lee, Wen-Lung Chen, Yusong Huang, Li-Herng Yao, Hsi-Sheng Chen
  • Publication number: 20090052525
    Abstract: In a reproduction apparatus, for reproducing an original signal conveyed as main data by a data medium such as a recording disk or broadcasting system, with medium protection data which are specific to the data medium being conveyed together with the main data, the apparatus includes a section for generating apparatus protection data which are specific to the reproduction apparatus, a section for combining the apparatus protection data with the medium protection data to define a protection level, and a section for applying the protection level to restrict reproduction of the original signal, with stepwise variations in restriction occurring in accordance with changes in protection level. The medium protection data may include information for specifying restricted reproduction of portions of the original signal, such as by producing degraded resolution within specified regions of specified frames of a video signal.
    Type: Application
    Filed: October 10, 2008
    Publication date: February 26, 2009
    Applicant: Victor Company of Japan, Limited
    Inventor: Takayuki SUGAHARA
  • Patent number: 7492822
    Abstract: In a video encoder a prediction signal at a bit resolution of P is subtracted from an input signal received at a bit resolution of P+?, where P is typically 8 bits, and P+?, 10 bits. A transform is then performed on the result, the output of which transform being at a bit resolution of at least P+?. The transformed signal is quantised, and the quantised signal used to form the prediction signal. The quantised signal is then coded for downstream delivery. At a decoder, the process is inverted, preserving the extra ? bits input to the encoder.
    Type: Grant
    Filed: January 27, 2003
    Date of Patent: February 17, 2009
    Assignee: Snell & Wilcox Limited
    Inventors: Andrew David Raine Cotton, Michael James Knee
  • Patent number: 7486731
    Abstract: In a video encoder a prediction signal at a bit resolution of P is subtracted from an input signal received at a bit resolution of P+?, where P is typically 8 bits, and P+?, 10 bits. A transform is then performed on the result, the output of which transform being at a bit resolution of at least P+?. The transformed signal is quantized, and the quantized signal used to form the prediction signal. The quantized signal is then coded for downstream delivery. At a decoder, the process is inverted, preserving the extra ? bits input to the encoder.
    Type: Grant
    Filed: October 28, 2004
    Date of Patent: February 3, 2009
    Assignee: Snell & Wilcox Limited
    Inventors: Andrew David Raine Cotton, Michael James Knee
  • Patent number: 7474697
    Abstract: An apparatus for partitioning moving picture data comprises a first quantizing unit for first-quantizing a received video signal and outputting a first-quantized signal; and a second quantizing unit for second-quantizing the first-quantized signal and partitioning the first-quantized signal into a preceding part and a succeeding part.
    Type: Grant
    Filed: July 25, 2005
    Date of Patent: January 6, 2009
    Assignee: LG Electronics Inc.
    Inventor: Kyu Chan Roh
  • Publication number: 20080310503
    Abstract: Provided are a method and apparatus for entropy-encoding/entropy-decoding video data. The method of entropy-encoding video data includes binarizing coefficients of the frequency domain, which are generated by transforming a residual block of a current block into the frequency domain, using different binarization methods and performing binary arithmetic coding on the binarized coefficients. In this way, the coefficients are binarized adaptively according to whether the frequencies of the coefficients are high or low, thereby improving the compression efficiency of the video data.
    Type: Application
    Filed: April 24, 2008
    Publication date: December 18, 2008
    Applicants: Samsung Electronics Co., Ltd., INDUSTRY-ACADEMIA COOPERATION GROUP OF SEJONG UNIVERSITY
    Inventors: Yung-Lyul LEE, Dae-yeon KIM
  • Publication number: 20080298700
    Abstract: Down-sampling of an image may be performed in the DCT domain. A multiple layered network is used to select transform matrices for down-sampling a DCT image of size M×N to a DCT image of size I×J. A spatial domain down-sampling method is selected and applied to the DCT image to produce a down-sampled DCT reference image. A learning with forgetting algorithm is used to apply a decay to the elements of the transform matrix and select a transform matrices which solve an optimization problem. The optimization problem is a function of the visual quality of images obtained using the transform matrices and the computational complexity associated with using the transform matrices. The visual quality is a measure of the difference between the down-sampled DCT image obtained using the transform matrices and the visual quality of the DCT reference image obtained using a spatial domain down-sampling method.
    Type: Application
    Filed: June 4, 2008
    Publication date: December 4, 2008
    Inventors: Xiang Yu, En-hui Yang, Haiquan Wang
  • Publication number: 20080285644
    Abstract: Provided is an apparatus and method for encoding/decoding moving pictures based on adaptive scanning. The moving picture apparatus and method can increase a compression rate based on adaptive scanning by performing intra prediction onto blocks of a predetermined size, and scanning coefficients acquired from Discrete Cosine Transform (DCT) of a residue signal and quantization differently according to the intra prediction mode. The moving picture encoding apparatus includes: a mode selector for selecting and outputting a prediction mode; a predictor for predicting pixel values of pixels to be encoded of an input video based on the prediction mode to thereby output a residue signal block; a transform/quantization unit for performing DCT onto the residue signal block and quantizing the transformed residue signal block; and an encoder for adaptively scanning and encoding the quantized residue signal block based on the prediction mode.
    Type: Application
    Filed: October 19, 2006
    Publication date: November 20, 2008
    Inventors: Jeong-Il Seo, Wook-Joong Kim, Kyu-Heon Kim, Kyeong-Ok Kang, Jin-Woo Hong, Yung-Lyul Lee, Ki-Hun Han, Jae-Ho Hur, Dong-Gyu Sim, Seoung-Jun Oh
  • Publication number: 20080279274
    Abstract: An integrated circuit (102) and method computes fixed point vector dot products (424) and/or matrix vector products using a type of distributed architecture that loads bit planes (add00-add30) and uses the loaded bit planes to generate a plurality of partial products (416-422) directly, such as without a lookup table, and the plurality of partial products are computed in real time and are not read out of addressable memory. In one example, pixel coefficients and corresponding data are loaded such that, for example, a bit plane is loaded to generate partial product results on a per bit plane basis. The plurality of partial products are then summed (414) or accumulated to produce fixed point vector dot product data (424).
    Type: Application
    Filed: May 8, 2007
    Publication date: November 13, 2008
    Applicant: Freescale Semiconductor, Inc.
    Inventor: Nickolai J. Iliev
  • Publication number: 20080267294
    Abstract: Transcoding as from MPEG-2 SDTV to MPEG-4 CIF reuses motion vectors and downsamples in the frequency (DCT) domain with differing treatments of frame-DCT and field-DCT blocks, and alternatively uses de-interlacing IDCT with respect to the row dimension plus deferred column downsampling for reference frame blocks.
    Type: Application
    Filed: July 2, 2008
    Publication date: October 30, 2008
    Applicant: Texas Instruments Incorporated
    Inventor: Felix C. Fernandes
  • Patent number: 7440501
    Abstract: Adjacent blocks are identified in an image. Coding parameters for the adjacent blocks are identified. Deblock filtering between the identified adjacent blocks is skipped if the coding parameters for the identified adjacent blocks are similar and not skipped if the coding parameters for the identified adjacent blocks are substantially different.
    Type: Grant
    Filed: April 13, 2005
    Date of Patent: October 21, 2008
    Assignee: Sharp Laboratories of America, Inc.
    Inventors: Shijun Sun, Shawmin Lei
  • Publication number: 20080232475
    Abstract: A method for encoding video with a two-dimensional (2D) transform separable to two one-dimensional (1D) transforms. The method receives an array of values for a sub-section of an image, performs a first 1D-transform of the array, transposes the resulting array, and performs a second 1D-transform of the array resulting from the transpose. The method, without performing another transpose, generates a data stream using a transposed scan order based on the values of the array resulting from the second transform. A method for decoding video encoded by a 2D transform, which separable to two 1D transforms. The method receives a data stream containing encoded values for an image, parses out the values into an array using a transposed scan order, performs a first 1D-inverse transform on the array, transposes the resulting array, and performs a second 1D-inverse transform of the array resulting from the transpose to produce a decoded output.
    Type: Application
    Filed: April 28, 2008
    Publication date: September 25, 2008
    Inventors: Maynard Handley, Roger Kumar, Thomas Pun, Xiaochun Nie, Hsi-Jung Wu
  • Patent number: 7426311
    Abstract: An object-based coding apparatus and method for image signals, wherein upon scanning shape-adaptive transform coefficients of an input image signal transformed in accordance with a shape-adaptive transform, only segments containing such shape-adaptive transform coefficients are scanned. In the scanning operation, segments containing no transform coefficient are skipped, thereby reducing the quantity of data being encoded. An apparatus for and a method of object-based decoding of image signals are also disclosed which can decode bit streams generated using the coding method, thereby reproducing image signals.
    Type: Grant
    Filed: November 19, 1998
    Date of Patent: September 16, 2008
    Assignee: Hyundai Electronics Industries Co. Ltd.
    Inventors: Sung Moon Chun, Jin Hak Lee, Joo Hee Moon, Gwang Hoon Park, Jae Kyoon Kim, Jae-won Chung
  • Publication number: 20080219345
    Abstract: A streaming media codec may include a collection of media stream processing modules arranged into a processing graph. One or more of the modules may perform a Fourier-related transform, and a significant fraction of media stream processing may occur post-transform. The media stream may be considered as a sequence of processing blocks, and post-transform processing blocks contain transform coefficients. Such transform coefficients are amenable to classification into processing classes. Some processing classes may require significantly less processing effort than others by post-transform processing modules. Such transform coefficient classes may be efficiently specified, for example, with coefficient bounding rectangles, and the specification provided to one or more post-transform streaming media processing modules to enable the modules to allocate their processing resources more effectively.
    Type: Application
    Filed: March 6, 2007
    Publication date: September 11, 2008
    Applicant: Microsoft Corporation
    Inventors: Jiang Li, Hua Cai
  • Patent number: 7424057
    Abstract: The present invention provides an apparatus for converting image data, including a block extraction unit extracts a class tap from a composite signal. A pixel-location-mode output unit determines a pixel location mode from the extracted class tap, and outputs it to a coefficient memory. A coefficient calculation unit acquires a seed coefficient from a seed coefficient memory to determine a predictive coefficient based on a transform method selection signal input from a designation unit, and stores the result into the coefficient memory. The coefficient memory supplies a predictive coefficient corresponding to the pixel location mode to a predictive calculation unit. A block extraction unit extracts a predictive tap from the composite signal, and outputs the result to the predictive calculation unit. The predictive calculation unit outputs a component signal or a transformed component signal based on the predictive tap and the predictive coefficient.
    Type: Grant
    Filed: September 9, 2003
    Date of Patent: September 9, 2008
    Assignee: Sony Corporation
    Inventors: Tetsujiro Kondo, Takeharu Nishikata, Shizuo Chikaoka, Sakon Yamamoto
  • Patent number: 7424151
    Abstract: A system and method for classifying an image block of a printed image into contone, halftone, or error diffusion classes, includes scanning the printed image; selecting an n by n block of pixels from the scanned image in a luminance channel; calculating an array of DCT coefficients of the pixel block, wherein the calculated DCT coefficients are representative of spatial frequency and spatial orientation of the pixel block; comparing the array of DCT coefficients with an array of predetermined values, wherein the array of predetermined values are indicative of contone, halftone and error diffusion classes; and determining the image classification of the pixel block based on the comparison of the DCT coefficients with the array of predetermined values. If the image is determined to be halftone, then the method further evaluates the DCT coefficients to determine the frequency of the halftone screen.
    Type: Grant
    Filed: June 4, 2004
    Date of Patent: September 9, 2008
    Assignee: Xerox Corporation
    Inventors: Ying-Wei Lin, Shen-Ge Wang, Beilei Xu
  • Patent number: 7418134
    Abstract: The present invention relates to a method and system for foreground segmentation in which frames of a video sequence are analyzed in the transform domain to determine one or more features. The features are used to model the background. The background can be modeled as a single Gaussian model with a mean and variance of the features. A current frame is segmented by determining if one or more features of the current frame analyzed in the foreground domain satisfy a threshold between the background model. The threshold value can be based on the mean and/or variance of features. During the segmentation, the mean and variance can be updated based on previous corresponding values and current features to adaptively update the background model. In one embodiment, the frames are divided into a plurality of blocks. A transform is used to analyze the blocks in the transform domain. For example, the transform can be a discrete cosine transform (DCT).
    Type: Grant
    Filed: May 12, 2004
    Date of Patent: August 26, 2008
    Assignee: Princeton University
    Inventors: Stuart Schwartz, Juhua Zhu, Bede Liu
  • Patent number: 7408991
    Abstract: A method for decoding video data blocks using variable length codes, comprising transforming information about the spatial frequency distribution of a video data block into pixel values. Prior to said transformation, a first reference value (Xref) representing the abruptness of variations in information about spatial frequency distribution within the block is generated, after said transformation, a second reference value (?) representing the abruptness of variation in certain information between the block and at least one previously transformed video data block is generated. The first reference value (Xref) is compared to a first threshold value (TH1) and the second reference value (?) to a second threshold value (TH2); and as a response to either of the first (Xref) and second reference values (?) being greater than the first (TH1) and respectively the second threshold value (TH2), an error in the block is detected.
    Type: Grant
    Filed: October 28, 2003
    Date of Patent: August 5, 2008
    Assignee: Nokia Mobile Phones Limited
    Inventor: Ari Hourunranta
  • Publication number: 20080181296
    Abstract: A method, and apparatus, and logic encoded in one or more computer-readable tangible medium to carry out a method. The method is to code a plurality of ordered sequences of quantized transform coefficients of a corresponding plurality of blocks of image data using a hybrid coding method that includes determining a breakpoint location in the sequence between a low frequency region and a high frequency region; coding the low-frequency region using a selected low-frequency variable length coding method; coding the high-frequency region using a selected high-frequency variable length coding method; and coding the location of the breakpoint. The plurality of blocks include adjacent blocks and form a multi-block partition. The breakpoint location is determined as a function of the sequences of the multi-block partition such that different multi-block partitions can have different breakpoints.
    Type: Application
    Filed: March 15, 2007
    Publication date: July 31, 2008
    Inventors: Dihong Tian, Wen-hsiung Chen, Pi Sheng Chang, John A. Toebes
  • Publication number: 20080151995
    Abstract: In one embodiment, a first DCT block of coefficients is received from a transcoding process. The DCT block is of a first size, such as an 8×8 size, and in a first coding standard, such as MPEG-2. A distribution of zero and non-zero coefficients is then determined from the first DCT block. For example, areas where non-zero coefficients may be determined. This may simplify a corresponding sum of absolute difference (SAD) calculation. The SAD is then calculated for a second DCT block of a second size, such as a 4×4 size block, in a second coding standard, such as AVC. The SAD is determined from the coefficients in the first DCT block. Accordingly, the SAD is determined without performing an inverse DCT on the 8×8 block and then computing the SAD for a 4×4 block. It is then determined if a bypass of a DCT operation for the 4×4 block can be performed based on the SAD computed. For example, the SAD is compared to a threshold to determine if a bypass can be performed.
    Type: Application
    Filed: December 22, 2006
    Publication date: June 26, 2008
    Applicants: Sony Corporation, Sony Electronics Inc.
    Inventor: Jeongnam Youn
  • Publication number: 20080130743
    Abstract: When data is an HDTV signal, a buffer memory is used as two banks and pipeline processing is performed. When data is an SDTV signal, on the other hand, the buffer memory is used as a bankless buffer memory, and the pipeline processing is not performed.
    Type: Application
    Filed: November 28, 2007
    Publication date: June 5, 2008
    Applicants: SANYO ELECTRIC CO., LTD., SANYO SEMICONDUCTOR CO., LTD.
    Inventors: Tetsuo Kosuge, Kensuke Fujimura, Naoki Tanahashi
  • Patent number: 7379500
    Abstract: To encode image and video data using a transform having low computational complexity and high compression efficiency, all elements of the matrix can be expressed with power-of-2 elements, with a template of the matrix approximating a DCT matrix.
    Type: Grant
    Filed: September 30, 2003
    Date of Patent: May 27, 2008
    Assignee: Microsoft Corporation
    Inventors: Shan Lu, Keman Yu, Jiang Li, Shipeng Li
  • Patent number: 7376280
    Abstract: A method for encoding video with a two-dimensional (2D) transform separable to two one-dimensional (1D) transforms. The method receives an array of values for a sub-section of an image, performs a first 1D-transform of the array, transposes the resulting array, and performs a second 1D-transform of the array resulting from the transpose. The method, without performing another transpose, generates a data stream using a transposed scan order based on the values of the array resulting from the second transform. A method for decoding video encoded by a 2D transform, which separable to two 1D transforms. The method receives a data stream containing encoded values for an image, parses out the values into an array using a transposed scan order, performs a first 1D-inverse transform on the array, transposes the resulting array, and performs a second 1D-inverse transform of the array resulting from the transpose to produce a decoded output.
    Type: Grant
    Filed: April 30, 2003
    Date of Patent: May 20, 2008
    Assignee: Apple Inc
    Inventors: Maynard Handley, Roger Kumar, Thomas Pun, Xiaochun Nie, Hsi-Jung Wu
  • Patent number: 7372906
    Abstract: Data is discrete cosine transformed and streamed to a processor where quantized and inverse quantized blocks are generated. A second streaming data connection streams the inverse quantized blocks to an inverse discrete cosine transform block to generate reconstructed prediction error macroblocks. An addition circuit adds each reconstructed prediction error macroblock and its corresponding predictor macroblock to generate a respective reconstructed macroblock. The quantized macroblocks are zig-zag scanned, run level coded and variable length coded to generate and encoded bitstream.
    Type: Grant
    Filed: March 17, 2003
    Date of Patent: May 13, 2008
    Assignee: STMicroelectronics Limited
    Inventor: Martin Bolton
  • Patent number: 7369706
    Abstract: An image processing device for processing an original image divided into a plurality of blocks that are made up of a plurality of picture elements from encoded image-data which is obtained by encoding a spatial-frequency component of the block as a plurality of spatial-frequency coefficients, and producing reduced encoded image-data which is encoded data on a reduced image that is obtained by reducing the original image to a given reduction rate. If the block in the encoded image-data of the original image is in an area where the value of the function is small due to gentle change in brightness or color of the image, for example, only the direct-current component coefficient of is used for the block. On the other hand, if the block is in a area where the value of the function is large because the area is a boundary area, for example, such as an outline in the image, limited number of lower spatial frequency-coefficients are used for the block. The number is based on the reduction rate of the image.
    Type: Grant
    Filed: December 30, 2003
    Date of Patent: May 6, 2008
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventor: Nobukazu Kurauchi
  • Patent number: 7369613
    Abstract: A process for format conversion of DCT macroblocks in an MPEG video bitstream that are divided into blocks, each of which includes a plurality of microblocks. In each DCT block, the significant frequencies are identified and preserved, isolating a corresponding microblock preferably consisting of the microblock on the top left of each block and setting to zero the coefficients of the remaining microblocks. On the microblock thus isolated there is performed an inverse discrete cosine transform, and the microblock thus obtained is merged with the homologous microblocks obtained from the other blocks comprised in a respective starting macroblock, so as to give rise to a merging block. The merging block thus obtained undergoes a discrete cosine transform so as to obtain a final block, which can be assembled into a macroblock with converted format.
    Type: Grant
    Filed: September 12, 2002
    Date of Patent: May 6, 2008
    Assignee: STMicroelectronics S.r.l.
    Inventors: Davide Concion, Emiliano Piccinelli, Danilo Pau
  • Patent number: 7366240
    Abstract: A method of processing video frame data includes the steps of: receiving a video frame; partially decoding the video frame; fully decoding the video frame to produce macroblocks; determining video data parameters from the partially decoded video frame or both the partially and fully decoded video frame; and encoding the macroblocks based on the determined video data parameters to provide a compressed video frame for subsequent display.
    Type: Grant
    Filed: March 16, 2004
    Date of Patent: April 29, 2008
    Assignee: STMicroelectronics Asia Pacific Pte Ltd.
    Inventors: Patricia Chiang Wei Yin, Lucas Y. W. Hui, Jean-Michel Bard
  • Publication number: 20080089409
    Abstract: Systems and methods provide directional discrete cosine transformation (DCT) and motion compensated DCT. In one implementation, an exemplary system finds a directional property of an image, such as a visual trend, factorizes a DCT operation into primal operations, and applies the primal operations along a corresponding direction to perform the DCT. Motion compensated DCT applies the primal operations along a motion trajectory of a video sequence. When the directional DCT is applied blockwise, the directional coding modes for adjacent blocks can be optimized in view of each other using a weighted graph to represent the related coding mode decisions.
    Type: Application
    Filed: October 17, 2006
    Publication date: April 17, 2008
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Feng Wu, Hao Xu
  • Patent number: 7356191
    Abstract: In an image encoder is provided, including a to-be-encoded object prediction unit which counts quantization coefficients which are newly made significant, for example, when each bit plane is encoded, on the basis of quantization coefficient for each bit plane in all code blocks to provide a feature amount, estimates the number of generated codes on the basis of the feature amount, and adds the estimated number of codes in a predetermined order. When an initial target number of codes is reached, the to-be-encoded object prediction unit stops the addition, and supplies the entropy encoder with information on the bit planes having the quantization coefficients thereof been added as to-be-encoded object information. In this entropy encoder, only the quantization coefficient in the bit planes represented by the to-be-encoded object information, is processed by three types of encoding passes while truncating the quantization coefficients of the other bit planes.
    Type: Grant
    Filed: December 1, 2003
    Date of Patent: April 8, 2008
    Assignee: Sony Corporation
    Inventors: Kazuhisa Hosaka, Takahiro Fukuhara
  • Patent number: 7356082
    Abstract: A metadata extraction unit has a feature point selection and motion estimation unit 62 for extracting at least one feature point representing characteristics of the video/audio signals in a compressed domain of the video/audio signals. Thus, reduction of time or cost for processing can be realized and it makes it possible to process effectively.
    Type: Grant
    Filed: November 29, 1999
    Date of Patent: April 8, 2008
    Assignee: Sony Corporation
    Inventor: Peter M. Kuhn
  • Patent number: 7352811
    Abstract: A data encoding apparatus operable to encode a plurality of data blocks produces encoded data in accordance with at least one of a selectable target data quantity or a selectable target data quality.
    Type: Grant
    Filed: March 12, 2002
    Date of Patent: April 1, 2008
    Assignee: Sony United Kingdom Limited
    Inventors: Jonathan James Stone, Jason Charles Pelly, Nicholas Ian Saunders
  • Patent number: 7352812
    Abstract: Adjacent blocks are identified in an image. Coding parameters for the adjacent blocks are identified. Deblock filtering between the identified adjacent blocks is skipped if the coding parameters for the identified adjacent blocks are similar and not skipped if the coding parameters for the identified adjacent blocks are substantially different.
    Type: Grant
    Filed: March 29, 2002
    Date of Patent: April 1, 2008
    Assignee: Sharp Laboratories of America, Inc.
    Inventors: Shijun Sun, Shawmin Lei
  • Patent number: 7349476
    Abstract: A digital motion picture decoding apparatus comprising an input buffer memory for storing coded data to be decoded, a reproduced picture memory for storing the decoded picture data to be displayed, and a display picture deciding means for deciding a reproduced picture to be output from the picture data stored in the reproduced picture memory, further comprises: a program changing means for changing the type of the coded data to be decoded; a program change detecting means for detecting from the output of the program changing means that the type of the coded data to be decoded is changed; and a display state maintaining means for controlling the picture data output from the reproduced picture memory so as to maintain the display state of the reproduced picture which is currently displayed according to the decision of the display picture deciding means, when it is detected from the output of the program change detecting means that the change of the type of the coded data to be decoded makes the coded data not c
    Type: Grant
    Filed: March 28, 2005
    Date of Patent: March 25, 2008
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Nobuo Setoguchi, Akihiro Watabe
  • Patent number: 7342962
    Abstract: Transcoding as from MPEG-2 SDTV to MPEG-4 CIF reuses motion vectors and downsamples in the frequency (DCT) domain with differing treatments of frame-DCT and field-DCT blocks, and alternatively uses de-interlacing IDCT with respect to the row dimension plus deferred column downsampling for reference frame blocks.
    Type: Grant
    Filed: September 17, 2003
    Date of Patent: March 11, 2008
    Assignee: Texas Instruments Incorporated
    Inventor: Felix C. Fernandes
  • Publication number: 20080031318
    Abstract: Provided are a frame conversion apparatus and method and a frame type detecting apparatus and method. The frame conversion apparatus may include a detecting unit and an inverse telecine unit. The detecting unit may determine that each data frame is a telecine frame or a progressive frame according to the rate of discrete cosine transform (DCT) operation type performed on macro blocks of the frame or the number of the macro blocks on which each type of DCT operation is performed and output frame type information. The inverse telecine unit may convert the data frames into progressive frames and output the progressive frames in response to the frame type information.
    Type: Application
    Filed: September 22, 2006
    Publication date: February 7, 2008
    Inventors: Seong-Jong Ha, Nam-lk Cho, Sang-Hwa Lee