Discrete Cosine Patents (Class 375/240.2)
-
Publication number: 20090238263
Abstract: A programmable energy efficient codec system is provided for encoding and decoding a plurality of application environments. A camera Codec and control system for an HD camera is provided for encoding uncompressed HD-SDI video signals into an MPEG-2 transport stream. A stand-alone encoder decoder system is provided in a network configuration allowing for remote display and editing of HD-SDI video. At least one plurality of HD-SDI transport streams is generated from HD-Cameras, encoded into MPEG-2 transport streams, and output into a DVB-ASI signal and a TS/IP packet stream. Further provided is a decoder which accepts MPEG-2-TS/IP packet streams from a routed IP network, which are decoded into an uncompressed HD-SDI transport stream for display. A set top box is provided for decoding audio and video HD-TV. A first HDMI interface into the decoder allows acceptance of an MPEG-2-TS from local storage media. Connection to an IP routed network is provided.
Type: Application
Filed: February 6, 2009
Publication date: September 24, 2009
Inventors: Pawan Jaggi, Sandeep Kumar, Xiaohui Wei
-
Publication number: 20090238265
Abstract: Disclosed herein is a decoding apparatus including: a pixel group information storage section configured to include memory areas each of which stores, as pixel group information, information concerning a corresponding pixel group and is referred to with a horizontal address of the pixel group, and an additional memory area that is located at a left end and stores a value indicating invalid pixel group information; a decoding information generation section configured to acquire the pixel group information identified by an address of a pixel group to be decoded and an address to the left thereof, and to generate decoding information; a decoding section configured to perform a decoding process based on the decoding information; and a pixel group information update section configured to generate pixel group information and allow it to be stored in the memory area identified by the address of the pixel group.
Type: Application
Filed: March 17, 2009
Publication date: September 24, 2009
Applicant: Sony Corporation
Inventor: Hiroshi Yoshikawa
-
Publication number: 20090213926
Abstract: The present invention relates to a method for up-sampling/down-sampling data of a video block in scalable video data encoding/decoding. The up-sampling method according to the present invention obtains a 2N×2N enlarged block by applying a converting matrix to the data of a given N×N video block. The converting matrix has elements that lead the data of the video block to the resultant data that would be obtained by a converting process that applies DCT to the data, pads some zeros to the coefficients, and applies IDCT to the coefficients including the padded zeros. The down-sampling method according to the present invention obtains an N×N reduced block by applying a converting matrix to a given 2N×2N video block. The converting matrix for reducing has elements that lead the data of the 2N×2N video block to the resultant data that would be obtained by a converting process that applies DCT to the data, removes some coefficients from the transformed coefficients, and applies IDCT to the remaining coefficients.
Type: Application
Filed: February 24, 2006
Publication date: August 27, 2009
Applicant: LG ELECTRONICS INC.
Inventors: IL-Hong Shin, Hyun Wook Park
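The DCT, zero-pad, IDCT chain described in this abstract can be folded into a single matrix, which is the idea the abstract names. The sketch below is illustrative only and is not taken from the application: it assumes an orthonormal DCT-II, a gain of sqrt(2) per dimension so that a flat block keeps its level, and hypothetical helper names (`dct_matrix`, `upsampling_matrix`, `upsample_block`).

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II matrix: row k is the k-th cosine basis vector."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(r) for r in zip(*m)]

def upsampling_matrix(n):
    """2N x N matrix equal to: N-point DCT, zero-pad to 2N, 2N-point IDCT."""
    c_n = dct_matrix(n)
    # Only the first N rows (low frequencies) of the 2N-point DCT matter:
    # the padded high-frequency coefficients are zero and contribute nothing.
    low_rows = dct_matrix(2 * n)[:n]
    gain = math.sqrt(2.0)  # assumed scaling so a flat block keeps its level
    t = matmul(transpose(low_rows), c_n)
    return [[gain * v for v in row] for row in t]

def upsample_block(block):
    """Enlarge an N x N block to 2N x 2N as T * block * T^T (separable)."""
    t = upsampling_matrix(len(block))
    return matmul(matmul(t, block), transpose(t))
```

Because the matrix depends only on N, it can be computed once and reused for every block, which is where the saving over running a DCT and an IDCT per block would come from.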
-
Patent number: 7580843
Abstract: A synthesis subband filter apparatus is provided. The apparatus is used for processing 18 sets of signals, each of which includes 32 subband sampling signals, in accordance with a specification providing 512 window coefficients. The apparatus includes a processor for processing the 18 sets of signals in sequence. The processor further includes a converting module and a generating module. The converting module is used for converting the 32 subband sampling signals of the set of signals being processed into 32 converted vectors by use of a 32-point discrete cosine transform (DCT), and writing the 32 converted vectors into 512 default vectors with a first-in, first-out queue. The generating module is used for generating 32 pulse code modulation (PCM) signals relative to the set of signals being processed according to a set of synthesis formulae proposed in this invention.
Type: Grant
Filed: May 8, 2006
Date of Patent: August 25, 2009
Assignee: Quanta Computer, Inc.
Inventors: Chih-Hsien Chang, Chih-Wei Hung, Hsien-Ming Tsai
-
Patent number: 7561620
Abstract: A system and process for compressing and decompressing multiple video streams depicting substantially the same dynamic scene from different viewpoints. Each frame in each contemporaneous set of video frames of the multiple streams is represented by at least two layers: a main layer and a boundary layer. Compression of the main layers involves first designating one or more of these layers in each set of contemporaneous frames as keyframes. For each set of contemporaneous frames in time sequence order, the main layer of each keyframe is compressed using an inter-frame compression technique. In addition, the main layer of each non-keyframe within the frame set under consideration is compressed using a spatial prediction compression technique. Finally, the boundary layers of each frame in the current frame set are each compressed using an intra-frame compression technique. Decompression is generally the reverse of the compression process.
Type: Grant
Filed: August 3, 2004
Date of Patent: July 14, 2009
Assignee: Microsoft Corporation
Inventors: Simon Winder, Matthew Uyttendaele, Charles Zitnick, III, Richard Szeliski, Sing Bing Kang
-
Patent number: 7558428
Abstract: The systems and methods described herein are directed at accelerating video encoding using a graphics processing unit. In one aspect, a video encoding system uses both a central processing unit (CPU) and a graphics processing unit (GPU) to perform video encoding. The system implements a technique that enables the GPU to perform motion estimation for video encoding. The technique allows the GPU to perform a motion estimation process in parallel with the video encoding process performed by the CPU. The performance of video encoding using such a system is greatly accelerated as compared to encoding using just the CPU. In another aspect, data related to motion estimation is arranged and provided to the GPU in a way that utilizes the capabilities of the GPU. Data about video frames may be collocated to enable multiple channels of the GPU to process tasks in parallel. The depth buffer of the GPU may be used to consolidate repeated calculations and searching tasks during the motion estimation process.
Type: Grant
Filed: September 13, 2004
Date of Patent: July 7, 2009
Assignee: Microsoft Corporation
Inventors: Guobin Shen, Shipeng Li, Guangping Gao
-
Patent number: 7555044
Abstract: A method and system of transform-based encoding, decoding, and bitstream generation of digital video content is disclosed. The digital video content comprises a stream of pictures, slices, or macroblocks which can each be intra, predicted or bi-predicted pictures. The pictures, slices, or macroblocks comprise blocks of pixels. The method and system comprise scanning frequency coefficients from each block's two-dimensional array of frequency coefficients in a manner that is vertically biased, producing a one-dimensional array of frequency coefficients. The method and system also provide for scanning/assigning frequency coefficients from a one-dimensional array of frequency coefficients, producing a two-dimensional array of frequency coefficients.
Type: Grant
Filed: October 6, 2003
Date of Patent: June 30, 2009
Assignee: General Instrument Corporation
Inventors: Limin Wang, Yue Yu
-
Patent number: 7548583
Abstract: During Moving Picture Experts Group (MPEG) video encoding, a two-dimensional discrete cosine transform (DCT) is performed on data representing an original image. The resulting coefficients are then quantized, which typically results in many zero coefficients. Because of the nature of most video data, most higher-order coefficients are typically zero and the lower-order coefficients (i.e., those grouped towards the upper left of the matrix) are more likely to be non-zero. To reduce the lengths of runs among the lower-order coefficients, the coefficients can be encoded in a zig-zag pattern. In one embodiment, the zig-zag pattern is maintained and one or more masks are generated based on the output of the quantization phase. The one or more masks are used to identify the coefficients within the matrix that are non-zero. This reduces the number of accesses to memory required to encode the non-zero coefficients and runs of zero coefficients.
Type: Grant
Filed: August 9, 2005
Date of Patent: June 16, 2009
Assignee: Apple Inc.
Inventor: Jason Klivington
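The mask idea in this abstract can be illustrated with a short sketch. It is an assumption-laden illustration, not the patented implementation: the mask is a single Python integer with one bit per zig-zag position, and the function names are hypothetical. In the abstract the mask comes out of the quantization phase; here `nonzero_mask` builds it in a separate pass purely to keep the example self-contained.

```python
def zigzag_order(n=8):
    """Return (row, col) pairs in the conventional zig-zag order."""
    order = []
    for s in range(2 * n - 1):
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even anti-diagonals run bottom-left to top-right, odd ones reversed.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order

def nonzero_mask(block, order):
    """Bit i is set when the i-th coefficient in zig-zag order is non-zero."""
    mask = 0
    for i, (r, c) in enumerate(order):
        if block[r][c] != 0:
            mask |= 1 << i
    return mask

def run_level_pairs(block, order, mask):
    """Emit (zero_run, level) pairs by visiting only the set bits of the mask."""
    pairs = []
    prev = -1
    while mask:
        lowest = mask & -mask            # isolate the lowest set bit
        i = lowest.bit_length() - 1      # its zig-zag index
        r, c = order[i]
        pairs.append((i - prev - 1, block[r][c]))
        prev = i
        mask ^= lowest                   # clear that bit and continue
    return pairs
```

The encoder loop touches the coefficient array once per non-zero value; the lengths of the zero runs fall out of the bit positions without reading the zeros themselves.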
-
Publication number: 20090122861
Abstract: The present invention is directed to video coding/decoding and discloses a method for transforming to/from transform coefficients and residual pixel data in moving pictures by a set of semi-orthonormal basis vectors. The basis vectors are derived from conventional DCT or KLT matrices, but relax to some extent the requirements for orthogonality, norm equality and element size limitation. In this way the present invention provides improved coding efficiency and lower complexity compared to previously used integer transforms.
Type: Application
Filed: November 11, 2008
Publication date: May 14, 2009
Applicant: TANDBERG TELECOM AS
Inventors: Gisle BJONTEGAARD, Arild Fuldseth
-
Publication number: 20090116554
Abstract: Disclosed is an apparatus (500) for generating a second compressed video stream (550) having a second resolution, from a first compressed video stream (540) having a first resolution. The apparatus comprises means (513) for extracting transform domain luma data and spatial domain chroma data from the first compressed video stream (540), means (514-516) for applying a transform domain operation to the luma data to form reconstructed transform domain luma data, means (518, 519, 560) for applying a spatial domain operation to the chroma data to form reconstructed spatial domain chroma data, and means for scaling the reconstructed transform domain luma data and reconstructed spatial domain chroma data to generate the second compressed video stream.
Type: Application
Filed: October 29, 2008
Publication date: May 7, 2009
Applicant: CANON KABUSHIKI KAISHA
Inventors: Zhonghua Ma, I-Jan Eric Wang
-
Publication number: 20090110059
Abstract: A system, method and devices for embedding end-user access information, such as digital rights management (DRM) information and/or conditional access (CA) information, in multimedia content, such as broadcast video content, and transmitting or distributing the multimedia content, including the end-user access information, to one or more end user devices, such as multimedia content processing (set-top) devices and mobile devices. The methods, systems and devices also involve receiving and decoding the transmitted multimedia content, including extracting end-user access information from the multimedia content and using the end-user access information to allow or deny access to the multimedia content by the end user device. The end-user access information is embedded in the multimedia content in a manner that reduces or even eliminates the need to also transmit end-user access information separately or along with the associated multimedia content.
Type: Application
Filed: October 31, 2007
Publication date: April 30, 2009
Applicant: General Instrument Corporation
Inventors: Theodore S. Rzeszewski, Eiraj Rezania
-
Patent number: 7526030
Abstract: An inputted digital signal of a first format (DV video signal) is restored to a variable-length code by having its framing cancelled by a de-framing section 11, then decoded by a variable-length decoding (VLD) section 12, inversely quantized by an inverse quantizing (IQ) section 13, and inversely weighted by an inverse weighting (IW) section 14. Then, required resolution conversion in the orthogonal transform domain (frequency domain) is carried out on the inversely weighted video signal by a resolution converting section 16. After that, the video signal having the resolution converted is weighted by a weighting (W) section 18, then quantized by a quantizing (Q) section 19, coded by variable-length coding by a variable-length coding (VLC) section 20, and outputted as a digital signal of a second format (MPEG video signal).
Type: Grant
Filed: June 13, 2007
Date of Patent: April 28, 2009
Assignee: Sony Corporation
Inventors: Naofumi Yanagihara, Nobuaki Izumi
-
Publication number: 20090103624
Abstract: An input module obtains a media signal to be communicated to a destination system, after which an identification module identifies a plurality of segments within the media signal. A codec includes a selection module that automatically selects different compression methods to respectively compress at least two of the segments. The compression methods are automatically selected to produce a highest compression quality for the respective segments according to a set of criteria without exceeding a target data rate. A compression module within the codec then compresses the segments using the automatically-selected compression methods, after which an output module delivers the compressed segments to the destination system with an indication of which compression method was used to compress each segment.
Type: Application
Filed: November 25, 2008
Publication date: April 23, 2009
Applicant: BROADCAST INTERNATIONAL
Inventors: Danny L. Mabey, Jodie L. Reynolds, Lawrence S. Reznick, John E. Wilkinson, Jack A. Prock
-
Publication number: 20090103608
Abstract: A method and system are provided for encoding a plurality of integers with variable-length code tables constructed by combining a plurality of structured code tables. Each code table has an associated set of integer values; the sets are disjoint and exhaustive, so that every integer appears in exactly one set. An integer is encoded using the codebook associated with the set in which the integer appears.
Type: Application
Filed: December 30, 2008
Publication date: April 23, 2009
Applicant: APPLE INC.
Inventors: Ken Kengkuan LIN, Mitchell Howard OSLICK
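As a rough illustration of combining structured code tables over disjoint, exhaustive sets, the sketch below splits the non-negative integers into two sets. The split point, the choice of a unary code for the small set and an order-0 Exp-Golomb code behind a four-zero escape prefix for the large set, and all function names are assumptions made for this example; the application's actual tables are not given in the abstract.

```python
def exp_golomb(n):
    """Order-0 Exp-Golomb code of a non-negative integer, as a bit string."""
    bits = bin(n + 1)[2:]
    return "0" * (len(bits) - 1) + bits

def encode(value):
    """Pick the code table from the set that contains `value`."""
    if value < 0:
        raise ValueError("only non-negative integers are covered")
    if value < 4:
        # Set {0, 1, 2, 3}: unary code, at most three leading zeros.
        return "0" * value + "1"
    # Set {4, 5, ...}: escape prefix "0000", then Exp-Golomb of the offset.
    return "0000" + exp_golomb(value - 4)

def decode(bitstring):
    """Invert `encode` for a concatenation of code words."""
    values, pos = [], 0
    while pos < len(bitstring):
        zeros = 0
        while zeros < 4 and bitstring[pos + zeros] == "0":
            zeros += 1
        if zeros < 4:                      # unary table
            values.append(zeros)
            pos += zeros + 1
            continue
        pos += 4                           # skip the escape prefix
        lead = 0
        while bitstring[pos + lead] == "0":
            lead += 1
        pos += lead
        values.append(int(bitstring[pos:pos + lead + 1], 2) - 1 + 4)
        pos += lead + 1
    return values
```

Because every integer belongs to exactly one set and the escape prefix cannot be confused with a unary code word, the combined code stays prefix-free and the decoder never needs side information about which table was used.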
-
Publication number: 20090080515
Abstract: This disclosure describes efficient transformation techniques that can be used in video coding. In particular, intermediate results of computations associated with transformation of a first block of video data are reused in the transformation of a second block of video data. The techniques may be used during a motion estimation process in which video blocks of a search space are transformed, but this disclosure is not necessarily limited in this respect. Pipelining techniques may be used to accelerate the efficient transformation techniques, and transposition memories can be implemented to facilitate efficient pipelining.
Type: Application
Filed: September 26, 2007
Publication date: March 26, 2009
Applicant: QUALCOMM Incorporated
Inventors: Raghavendra C. Nagaraj, De Dzwo Hsu, Stephen Molloy
-
Patent number: 7499491
Abstract: A system to process a signal sequence is described. A hybrid block matching and transform based N-Dimensional signal sequence encoder and decoder is disclosed. The encoder includes an encoder-side block matching predictor, which includes an entropy-based cost function that can be estimated from a certain energy measure of the block matching difference, and a fast block matching search method that learns from the results of neighboring blocks and performs a large-range search while visiting only a small number of points.
Type: Grant
Filed: June 23, 2004
Date of Patent: March 3, 2009
Assignee: ViChip Corp. Limited
Inventors: Tsu-Chang Lee, Wen-Lung Chen, Yusong Huang, Li-Herng Yao, Hsi-Sheng Chen
-
Publication number: 20090052525
Abstract: In a reproduction apparatus, for reproducing an original signal conveyed as main data by a data medium such as a recording disk or broadcasting system, with medium protection data which are specific to the data medium being conveyed together with the main data, the apparatus includes a section for generating apparatus protection data which are specific to the reproduction apparatus, a section for combining the apparatus protection data with the medium protection data to define a protection level, and a section for applying the protection level to restrict reproduction of the original signal, with stepwise variations in restriction occurring in accordance with changes in protection level. The medium protection data may include information for specifying restricted reproduction of portions of the original signal, such as by producing degraded resolution within specified regions of specified frames of a video signal.
Type: Application
Filed: October 10, 2008
Publication date: February 26, 2009
Applicant: Victor Company of Japan, Limited
Inventor: Takayuki SUGAHARA
-
Patent number: 7492822
Abstract: In a video encoder a prediction signal at a bit resolution of P is subtracted from an input signal received at a bit resolution of P+Δ, where P is typically 8 bits, and P+Δ, 10 bits. A transform is then performed on the result, the output of which transform being at a bit resolution of at least P+Δ. The transformed signal is quantised, and the quantised signal used to form the prediction signal. The quantised signal is then coded for downstream delivery. At a decoder, the process is inverted, preserving the extra Δ bits input to the encoder.
Type: Grant
Filed: January 27, 2003
Date of Patent: February 17, 2009
Assignee: Snell & Wilcox Limited
Inventors: Andrew David Raine Cotton, Michael James Knee
-
Patent number: 7486731
Abstract: In a video encoder a prediction signal at a bit resolution of P is subtracted from an input signal received at a bit resolution of P+Δ, where P is typically 8 bits, and P+Δ, 10 bits. A transform is then performed on the result, the output of which transform being at a bit resolution of at least P+Δ. The transformed signal is quantized, and the quantized signal used to form the prediction signal. The quantized signal is then coded for downstream delivery. At a decoder, the process is inverted, preserving the extra Δ bits input to the encoder.
Type: Grant
Filed: October 28, 2004
Date of Patent: February 3, 2009
Assignee: Snell & Wilcox Limited
Inventors: Andrew David Raine Cotton, Michael James Knee
-
Patent number: 7474697
Abstract: An apparatus for partitioning moving picture data comprises a first quantizing unit for first-quantizing a received video signal and outputting a first-quantized signal; and a second quantizing unit for second-quantizing the first-quantized signal and partitioning the first-quantized signal into a preceding part and a succeeding part.
Type: Grant
Filed: July 25, 2005
Date of Patent: January 6, 2009
Assignee: LG Electronics Inc.
Inventor: Kyu Chan Roh
-
Publication number: 20080310503
Abstract: Provided are a method and apparatus for entropy-encoding/entropy-decoding video data. The method of entropy-encoding video data includes binarizing coefficients of the frequency domain, which are generated by transforming a residual block of a current block into the frequency domain, using different binarization methods and performing binary arithmetic coding on the binarized coefficients. In this way, the coefficients are binarized adaptively according to whether the frequencies of the coefficients are high or low, thereby improving the compression efficiency of the video data.
Type: Application
Filed: April 24, 2008
Publication date: December 18, 2008
Applicants: Samsung Electronics Co., Ltd., INDUSTRY-ACADEMIA COOPERATION GROUP OF SEJONG UNIVERSITY
Inventors: Yung-Lyul LEE, Dae-yeon KIM
-
Publication number: 20080298700
Abstract: Down-sampling of an image may be performed in the DCT domain. A multiple layered network is used to select transform matrices for down-sampling a DCT image of size M×N to a DCT image of size I×J. A spatial domain down-sampling method is selected and applied to the DCT image to produce a down-sampled DCT reference image. A learning with forgetting algorithm is used to apply a decay to the elements of the transform matrix and select transform matrices that solve an optimization problem. The optimization problem is a function of the visual quality of images obtained using the transform matrices and the computational complexity associated with using the transform matrices. The visual quality is a measure of the difference between the down-sampled DCT image obtained using the transform matrices and the visual quality of the DCT reference image obtained using a spatial domain down-sampling method.
Type: Application
Filed: June 4, 2008
Publication date: December 4, 2008
Inventors: Xiang Yu, En-hui Yang, Haiquan Wang
-
Publication number: 20080285644
Abstract: Provided is an apparatus and method for encoding/decoding moving pictures based on adaptive scanning. The moving picture apparatus and method can increase a compression rate based on adaptive scanning by performing intra prediction on blocks of a predetermined size, and scanning coefficients acquired from Discrete Cosine Transform (DCT) of a residue signal and quantization differently according to the intra prediction mode. The moving picture encoding apparatus includes: a mode selector for selecting and outputting a prediction mode; a predictor for predicting pixel values of pixels to be encoded of an input video based on the prediction mode to thereby output a residue signal block; a transform/quantization unit for performing DCT on the residue signal block and quantizing the transformed residue signal block; and an encoder for adaptively scanning and encoding the quantized residue signal block based on the prediction mode.
Type: Application
Filed: October 19, 2006
Publication date: November 20, 2008
Inventors: Jeong-Il Seo, Wook-Joong Kim, Kyu-Heon Kim, Kyeong-Ok Kang, Jin-Woo Hong, Yung-Lyul Lee, Ki-Hun Han, Jae-Ho Hur, Dong-Gyu Sim, Seoung-Jun Oh
-
Publication number: 20080279274
Abstract: An integrated circuit (102) and method computes fixed point vector dot products (424) and/or matrix vector products using a type of distributed architecture that loads bit planes (add00-add30) and uses the loaded bit planes to generate a plurality of partial products (416-422) directly, such as without a lookup table, and the plurality of partial products are computed in real time and are not read out of addressable memory. In one example, pixel coefficients and corresponding data are loaded such that, for example, a bit plane is loaded to generate partial product results on a per bit plane basis. The plurality of partial products are then summed (414) or accumulated to produce fixed point vector dot product data (424).
Type: Application
Filed: May 8, 2007
Publication date: November 13, 2008
Applicant: Freescale Semiconductor, Inc.
Inventor: Nickolai J. Iliev
-
Publication number: 20080267294
Abstract: Transcoding as from MPEG-2 SDTV to MPEG-4 CIF reuses motion vectors and downsamples in the frequency (DCT) domain with differing treatments of frame-DCT and field-DCT blocks, and alternatively uses de-interlacing IDCT with respect to the row dimension plus deferred column downsampling for reference frame blocks.
Type: Application
Filed: July 2, 2008
Publication date: October 30, 2008
Applicant: Texas Instruments Incorporated
Inventor: Felix C. Fernandes
-
Patent number: 7440501
Abstract: Adjacent blocks are identified in an image. Coding parameters for the adjacent blocks are identified. Deblock filtering between the identified adjacent blocks is skipped if the coding parameters for the identified adjacent blocks are similar and not skipped if the coding parameters for the identified adjacent blocks are substantially different.
Type: Grant
Filed: April 13, 2005
Date of Patent: October 21, 2008
Assignee: Sharp Laboratories of America, Inc.
Inventors: Shijun Sun, Shawmin Lei
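The skip decision described in this abstract amounts to a predicate over the coding parameters of two neighbouring blocks. The abstract does not say which parameters are compared, so the fields below (motion vector, reference frame, presence of residual coefficients), the tolerance, and the names `BlockParams` and `skip_deblocking` are all assumptions chosen for illustration.

```python
from dataclasses import dataclass

@dataclass
class BlockParams:
    """Hypothetical per-block coding parameters used for the comparison."""
    motion_vector: tuple      # (dx, dy) in whatever units the codec uses
    reference_frame: int      # index of the reference picture
    has_residual: bool        # True if any non-zero transform coefficient

def skip_deblocking(a, b, mv_tolerance=0):
    """Return True when the edge between blocks a and b can be left unfiltered."""
    same_reference = a.reference_frame == b.reference_frame
    similar_motion = all(abs(x - y) <= mv_tolerance
                         for x, y in zip(a.motion_vector, b.motion_vector))
    no_residual = not a.has_residual and not b.has_residual
    # Similar parameters suggest both blocks were predicted from the same
    # region, so no visible discontinuity is expected at their shared edge.
    return same_reference and similar_motion and no_residual
```

For example, two blocks that share a reference frame and motion vector and carry no residual would be skipped, while a change in any of those fields sends the edge to the filter.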
-
Publication number: 20080232475
Abstract: A method for encoding video with a two-dimensional (2D) transform separable into two one-dimensional (1D) transforms. The method receives an array of values for a sub-section of an image, performs a first 1D-transform of the array, transposes the resulting array, and performs a second 1D-transform of the array resulting from the transpose. The method, without performing another transpose, generates a data stream using a transposed scan order based on the values of the array resulting from the second transform. A method for decoding video encoded by a 2D transform, which is separable into two 1D transforms. The method receives a data stream containing encoded values for an image, parses out the values into an array using a transposed scan order, performs a first 1D-inverse transform on the array, transposes the resulting array, and performs a second 1D-inverse transform of the array resulting from the transpose to produce a decoded output.
Type: Application
Filed: April 28, 2008
Publication date: September 25, 2008
Inventors: Maynard Handley, Roger Kumar, Thomas Pun, Xiaochun Nie, Hsi-Jung Wu
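The encoding path in this abstract relies on the fact that row transform, transpose, row transform yields the transpose of the 2D result, so reading it out with a transposed scan order gives the same coefficient sequence as transposing once more and scanning normally. The sketch below checks that equivalence. The 4×4 integer matrix `CORE` (the well-known H.264-style core transform) and the zig-zag scan are stand-ins chosen for this example; the application does not name a specific transform or scan in the abstract.

```python
# Illustrative 4x4 integer transform matrix (an assumption for this sketch).
CORE = [[1, 1, 1, 1],
        [2, 1, -1, -2],
        [1, -1, -1, 1],
        [1, -2, 2, -1]]

def transform_rows(block):
    """Apply the 1D transform to every row, i.e. compute block * CORE^T."""
    return [[sum(c * x for c, x in zip(basis, row)) for basis in CORE]
            for row in block]

def transpose(m):
    return [list(r) for r in zip(*m)]

def zigzag_order(n):
    """(row, col) pairs in the conventional zig-zag order."""
    order = []
    for s in range(2 * n - 1):
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order

def encode_scan(block):
    """Two 1D passes and one transpose, then a transposed scan."""
    step = transform_rows(transpose(transform_rows(block)))
    # `step` holds the transpose of CORE * block * CORE^T, so position
    # (r, c) of the true result is found at (c, r).
    return [step[c][r] for r, c in zigzag_order(len(block))]

def reference_scan(block):
    """Same result the long way: transpose a second time, scan normally."""
    coeffs = transpose(transform_rows(transpose(transform_rows(block))))
    return [coeffs[r][c] for r, c in zigzag_order(len(block))]
```

The saving is one full pass over the coefficient array per block, since the scan table can be transposed once ahead of time.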
-
Patent number: 7426311
Abstract: An object-based coding apparatus and method for image signals, wherein upon scanning shape-adaptive transform coefficients of an input image signal transformed in accordance with a shape-adaptive transform, only segments containing such shape-adaptive transform coefficients are scanned. In the scanning operation, segments containing no transform coefficient are skipped, thereby reducing the quantity of data being encoded. An apparatus for and a method of object-based decoding of image signals are also disclosed which can decode bit streams generated using the coding method, thereby reproducing image signals.
Type: Grant
Filed: November 19, 1998
Date of Patent: September 16, 2008
Assignee: Hyundai Electronics Industries Co. Ltd.
Inventors: Sung Moon Chun, Jin Hak Lee, Joo Hee Moon, Gwang Hoon Park, Jae Kyoon Kim, Jae-won Chung
-
Publication number: 20080219345
Abstract: A streaming media codec may include a collection of media stream processing modules arranged into a processing graph. One or more of the modules may perform a Fourier-related transform, and a significant fraction of media stream processing may occur post-transform. The media stream may be considered as a sequence of processing blocks, and post-transform processing blocks contain transform coefficients. Such transform coefficients are amenable to classification into processing classes. Some processing classes may require significantly less processing effort than others by post-transform processing modules. Such transform coefficient classes may be efficiently specified, for example, with coefficient bounding rectangles, and the specification provided to one or more post-transform streaming media processing modules to enable the modules to allocate their processing resources more effectively.
Type: Application
Filed: March 6, 2007
Publication date: September 11, 2008
Applicant: Microsoft Corporation
Inventors: Jiang Li, Hua Cai
-
Patent number: 7424057
Abstract: The present invention provides an apparatus for converting image data, in which a block extraction unit extracts a class tap from a composite signal. A pixel-location-mode output unit determines a pixel location mode from the extracted class tap, and outputs it to a coefficient memory. A coefficient calculation unit acquires a seed coefficient from a seed coefficient memory to determine a predictive coefficient based on a transform method selection signal input from a designation unit, and stores the result into the coefficient memory. The coefficient memory supplies a predictive coefficient corresponding to the pixel location mode to a predictive calculation unit. A block extraction unit extracts a predictive tap from the composite signal, and outputs the result to the predictive calculation unit. The predictive calculation unit outputs a component signal or a transformed component signal based on the predictive tap and the predictive coefficient.
Type: Grant
Filed: September 9, 2003
Date of Patent: September 9, 2008
Assignee: Sony Corporation
Inventors: Tetsujiro Kondo, Takeharu Nishikata, Shizuo Chikaoka, Sakon Yamamoto
-
Patent number: 7424151
Abstract: A system and method for classifying an image block of a printed image into contone, halftone, or error diffusion classes, includes scanning the printed image; selecting an n by n block of pixels from the scanned image in a luminance channel; calculating an array of DCT coefficients of the pixel block, wherein the calculated DCT coefficients are representative of spatial frequency and spatial orientation of the pixel block; comparing the array of DCT coefficients with an array of predetermined values, wherein the array of predetermined values are indicative of contone, halftone and error diffusion classes; and determining the image classification of the pixel block based on the comparison of the DCT coefficients with the array of predetermined values. If the image is determined to be halftone, then the method further evaluates the DCT coefficients to determine the frequency of the halftone screen.
Type: Grant
Filed: June 4, 2004
Date of Patent: September 9, 2008
Assignee: Xerox Corporation
Inventors: Ying-Wei Lin, Shen-Ge Wang, Beilei Xu
-
Patent number: 7418134
Abstract: The present invention relates to a method and system for foreground segmentation in which frames of a video sequence are analyzed in the transform domain to determine one or more features. The features are used to model the background. The background can be modeled as a single Gaussian model with a mean and variance of the features. A current frame is segmented by determining if one or more features of the current frame analyzed in the transform domain satisfy a threshold relative to the background model. The threshold value can be based on the mean and/or variance of features. During the segmentation, the mean and variance can be updated based on previous corresponding values and current features to adaptively update the background model. In one embodiment, the frames are divided into a plurality of blocks. A transform is used to analyze the blocks in the transform domain. For example, the transform can be a discrete cosine transform (DCT).
Type: Grant
Filed: May 12, 2004
Date of Patent: August 26, 2008
Assignee: Princeton University
Inventors: Stuart Schwartz, Juhua Zhu, Bede Liu
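A single-Gaussian background model over a per-block transform-domain feature can be sketched in a few lines. Everything specific here is an assumption for illustration: the feature is the DC coefficient of an orthonormal 2D DCT (which for an N×N block equals the pixel sum divided by N), the threshold is k standard deviations, and the running update uses a fixed learning rate `alpha`. The patent's actual features and update rule are not stated in the abstract.

```python
import math

def dc_feature(block):
    """DC coefficient of an orthonormal 2D DCT of a square N x N block."""
    return sum(sum(row) for row in block) / len(block)

class BlockBackground:
    """Single-Gaussian background model for one block position."""

    def __init__(self, mean, var, alpha=0.05, k=2.5):
        self.mean = mean      # running mean of the feature
        self.var = var        # running variance of the feature
        self.alpha = alpha    # assumed learning rate for the update
        self.k = k            # assumed threshold in standard deviations

    def is_foreground(self, feature):
        """Foreground when the feature is far from the background mean."""
        return abs(feature - self.mean) > self.k * math.sqrt(self.var)

    def update(self, feature):
        """Blend the current feature into the mean and variance."""
        delta = feature - self.mean
        self.mean += self.alpha * delta
        self.var = (1 - self.alpha) * self.var + self.alpha * delta * delta
```

A typical loop would compute `dc_feature` for each block of the current frame, call `is_foreground`, and call `update` only for blocks classified as background so that moving objects do not pollute the model; that last policy is a common choice rather than something the abstract specifies.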
-
Patent number: 7408991
Abstract: A method for decoding video data blocks using variable length codes, comprising transforming information about the spatial frequency distribution of a video data block into pixel values. Prior to said transformation, a first reference value (Xref) representing the abruptness of variations in information about spatial frequency distribution within the block is generated; after said transformation, a second reference value (Δ) representing the abruptness of variation in certain information between the block and at least one previously transformed video data block is generated. The first reference value (Xref) is compared to a first threshold value (TH1) and the second reference value (Δ) to a second threshold value (TH2); and as a response to either of the first (Xref) and second reference values (Δ) being greater than the first (TH1) and respectively the second threshold value (TH2), an error in the block is detected.
Type: Grant
Filed: October 28, 2003
Date of Patent: August 5, 2008
Assignee: Nokia Mobile Phones Limited
Inventor: Ari Hourunranta
-
Publication number: 20080181296
Abstract: A method, an apparatus, and logic encoded in one or more computer-readable tangible media to carry out a method. The method is to code a plurality of ordered sequences of quantized transform coefficients of a corresponding plurality of blocks of image data using a hybrid coding method that includes determining a breakpoint location in the sequence between a low frequency region and a high frequency region; coding the low-frequency region using a selected low-frequency variable length coding method; coding the high-frequency region using a selected high-frequency variable length coding method; and coding the location of the breakpoint. The plurality of blocks include adjacent blocks and form a multi-block partition. The breakpoint location is determined as a function of the sequences of the multi-block partition such that different multi-block partitions can have different breakpoints.
Type: Application
Filed: March 15, 2007
Publication date: July 31, 2008
Inventors: Dihong Tian, Wen-hsiung Chen, Pi Sheng Chang, John A. Toebes
-
Publication number: 20080151995Abstract: In one embodiment, a first DCT block of coefficients is received from a transcoding process. The DCT block is of a first size, such as an 8×8 size, and in a first coding standard, such as MPEG-2. A distribution of zero and non-zero coefficients is then determined from the first DCT block. For example, areas where non-zero coefficients occur may be determined. This may simplify a corresponding sum of absolute difference (SAD) calculation. The SAD is then calculated for a second DCT block of a second size, such as a 4×4 size block, in a second coding standard, such as AVC. The SAD is determined from the coefficients in the first DCT block. Accordingly, the SAD is determined without performing an inverse DCT on the 8×8 block and then computing the SAD for a 4×4 block. It is then determined if a bypass of a DCT operation for the 4×4 block can be performed based on the SAD computed. For example, the SAD is compared to a threshold to determine if a bypass can be performed.Type: ApplicationFiled: December 22, 2006Publication date: June 26, 2008Applicants: Sony Corporation, Sony Electronics Inc.Inventor: Jeongnam Youn
-
Publication number: 20080130743Abstract: When data is an HDTV signal, a buffer memory is used as two banks and pipeline processing is performed. When data is an SDTV signal, on the other hand, the buffer memory is used as a bankless buffer memory, and the pipeline processing is not performed.Type: ApplicationFiled: November 28, 2007Publication date: June 5, 2008Applicants: SANYO ELECTRIC CO., LTD., SANYO SEMICONDUCTOR CO., LTD.Inventors: Tetsuo Kosuge, Kensuke Fujimura, Naoki Tanahashi
-
Patent number: 7379500Abstract: To encode image and video data using a transform having low computational complexity and high compression efficiency, all elements of the matrix can be expressed with power-of-2 elements, with a template of the matrix approximating a DCT matrix.Type: GrantFiled: September 30, 2003Date of Patent: May 27, 2008Assignee: Microsoft CorporationInventors: Shan Lu, Keman Yu, Jiang Li, Shipeng Li
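To illustrate what a transform matrix built from power-of-2 elements looks like, the sketch below uses the 4×4 integer core transform matrix of H.264/AVC as a stand-in. That choice is an assumption made for the example: the abstract above does not give the patent's own matrix. Every entry is ±1 or ±2 and the sign pattern of each row follows the 4-point DCT basis, so a matrix-vector product needs only additions, subtractions, and shifts.

```python
# Stand-in matrix (an assumption, not taken from the patent): all entries
# are +/-1 or +/-2, arranged like the rows of a 4-point DCT.
T = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def transform_1d(x):
    """Compute T times a 4-sample vector with adds, subtracts and shifts only."""
    s0, s1 = x[0] + x[3], x[1] + x[2]   # sums of mirrored samples
    d0, d1 = x[0] - x[3], x[1] - x[2]   # differences of mirrored samples
    return [s0 + s1, (d0 << 1) + d1, s0 - s1, d0 - (d1 << 1)]


def transform_1d_reference(x):
    """Plain matrix-vector product, used only to check the fast version."""
    return [sum(t * v for t, v in zip(row, x)) for row in T]


samples = [5, 11, 8, 10]
print(transform_1d(samples), transform_1d_reference(samples))
```

The rows of this matrix are mutually orthogonal but not of equal norm, so a codec using it has to fold the missing scale factors into quantization.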
-
Patent number: 7376280Abstract: A method for encoding video with a two-dimensional (2D) transform separable into two one-dimensional (1D) transforms. The method receives an array of values for a sub-section of an image, performs a first 1D-transform of the array, transposes the resulting array, and performs a second 1D-transform of the array resulting from the transpose. The method, without performing another transpose, generates a data stream using a transposed scan order based on the values of the array resulting from the second transform. A method for decoding video encoded by a 2D transform, which is separable into two 1D transforms. The method receives a data stream containing encoded values for an image, parses out the values into an array using a transposed scan order, performs a first 1D-inverse transform on the array, transposes the resulting array, and performs a second 1D-inverse transform of the array resulting from the transpose to produce a decoded output.Type: GrantFiled: April 30, 2003Date of Patent: May 20, 2008Assignee: Apple IncInventors: Maynard Handley, Roger Kumar, Thomas Pun, Xiaochun Nie, Hsi-Jung Wu
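The encoder side of the abstract above can be sketched as follows: after a row transform, a transpose, and a second row transform, the array holds the transpose of the true 2D result, so reading it with the scan positions swapped gives the same stream as transposing back and scanning normally. The 1D transform matrix `T` and the zig-zag scan are stand-ins chosen for the example, not details taken from the patent.

```python
# Stand-in 1D transform (an assumption): a small integer DCT-like matrix.
T = [[1, 1, 1, 1], [2, 1, -1, -2], [1, -1, -1, 1], [1, -2, 2, -1]]


def transform_rows(block):
    """Apply the 1D transform T to every row of the block."""
    return [[sum(t * v for t, v in zip(t_row, row)) for t_row in T]
            for row in block]


def transpose(block):
    return [list(col) for col in zip(*block)]


def zigzag_order(n):
    """Zig-zag scan positions (row, col) for an n x n block."""
    cells = [(r, c) for r in range(n) for c in range(n)]
    return sorted(cells, key=lambda rc: (rc[0] + rc[1],
                                         rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))


def encode(block):
    once = transform_rows(block)              # first 1D transform
    twice = transform_rows(transpose(once))   # transpose, then second 1D transform
    # `twice` is the transpose of the true 2D result, so swap row and column
    # in the scan instead of transposing the array a second time.
    return [twice[c][r] for r, c in zigzag_order(len(block))]


def encode_with_extra_transpose(block):
    """Conventional version with the second transpose, for comparison."""
    full = transpose(transform_rows(transpose(transform_rows(block))))
    return [full[r][c] for r, c in zigzag_order(len(block))]


block = [[5, 11, 8, 10], [9, 8, 4, 12], [1, 10, 11, 4], [19, 6, 15, 7]]
print(encode(block) == encode_with_extra_transpose(block))
```

The decoder described in the abstract mirrors this: it fills the array using the transposed scan order, which absorbs one transpose in the same way.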
-
Patent number: 7372906Abstract: Data is discrete cosine transformed and streamed to a processor where quantized and inverse quantized blocks are generated. A second streaming data connection streams the inverse quantized blocks to an inverse discrete cosine transform block to generate reconstructed prediction error macroblocks. An addition circuit adds each reconstructed prediction error macroblock and its corresponding predictor macroblock to generate a respective reconstructed macroblock. The quantized macroblocks are zig-zag scanned, run level coded and variable length coded to generate an encoded bitstream.Type: GrantFiled: March 17, 2003Date of Patent: May 13, 2008Assignee: STMicroelectronics LimitedInventor: Martin Bolton
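The last stage named in the abstract above (zig-zag scan followed by run-level coding) can be sketched in a few lines. This is a generic illustration of those two steps on a small 4×4 block, not the circuit the patent describes; the final variable length coding step is omitted, and the block values are arbitrary.

```python
def zigzag_scan(block):
    """Read a square block along anti-diagonals, alternating direction."""
    n = len(block)
    cells = [(r, c) for r in range(n) for c in range(n)]
    cells.sort(key=lambda rc: (rc[0] + rc[1],
                               rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))
    return [block[r][c] for r, c in cells]


def run_level(coeffs):
    """Turn a coefficient list into (zero_run, level) pairs.
    Trailing zeros produce no pair; an end-of-block code would cover them."""
    pairs, run = [], 0
    for value in coeffs:
        if value == 0:
            run += 1
        else:
            pairs.append((run, value))
            run = 0
    return pairs


quantized = [
    [12, 3, 5, 0],
    [-2, 0, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(run_level(zigzag_scan(quantized)))
# prints [(0, 12), (0, 3), (0, -2), (0, 1), (1, 5)]
```

The scan groups the low-frequency coefficients at the front, which is what makes the zero runs at the end long and cheap to code.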
-
Patent number: 7369706Abstract: An image processing device for processing an original image divided into a plurality of blocks that are made up of a plurality of picture elements from encoded image-data which is obtained by encoding a spatial-frequency component of the block as a plurality of spatial-frequency coefficients, and producing reduced encoded image-data which is encoded data on a reduced image that is obtained by reducing the original image to a given reduction rate. If the block in the encoded image-data of the original image is in an area where the value of the function is small due to gentle change in brightness or color of the image, for example, only the direct-current component coefficient is used for the block. On the other hand, if the block is in an area where the value of the function is large because the area is a boundary area, for example, such as an outline in the image, a limited number of lower spatial-frequency coefficients are used for the block. The number is based on the reduction rate of the image.Type: GrantFiled: December 30, 2003Date of Patent: May 6, 2008Assignee: Matsushita Electric Industrial Co., Ltd.Inventor: Nobukazu Kurauchi
-
Process for the format conversion of MPEG bitstreams, a system and computer program product therefor
Patent number: 7369613Abstract: A process for format conversion of DCT macroblocks in an MPEG video bitstream that are divided into blocks, each of which includes a plurality of microblocks. In each DCT block, the significant frequencies are identified and preserved, isolating a corresponding microblock preferably consisting of the microblock on the top left of each block and setting to zero the coefficients of the remaining microblocks. On the microblock thus isolated there is performed an inverse discrete cosine transform, and the microblock thus obtained is merged with the homologous microblocks obtained from the other blocks comprised in a respective starting macroblock, so as to give rise to a merging block. The merging block thus obtained undergoes a discrete cosine transform so as to obtain a final block, which can be assembled into a macroblock with converted format.Type: GrantFiled: September 12, 2002Date of Patent: May 6, 2008Assignee: STMicroelectronics S.r.l.Inventors: Davide Concion, Emiliano Piccinelli, Danilo Pau
-
Patent number: 7366240Abstract: A method of processing video frame data includes the steps of: receiving a video frame; partially decoding the video frame; fully decoding the video frame to produce macroblocks; determining video data parameters from the partially decoded video frame or both the partially and fully decoded video frame; and encoding the macroblocks based on the determined video data parameters to provide a compressed video frame for subsequent display.Type: GrantFiled: March 16, 2004Date of Patent: April 29, 2008Assignee: STMicroelectronics Asia Pacific Pte Ltd.Inventors: Patricia Chiang Wei Yin, Lucas Y. W. Hui, Jean-Michel Bard
-
Publication number: 20080089409Abstract: Systems and methods provide directional discrete cosine transformation (DCT) and motion compensated DCT. In one implementation, an exemplary system finds a directional property of an image, such as a visual trend, factorizes a DCT operation into primal operations, and applies the primal operations along a corresponding direction to perform the DCT. Motion compensated DCT applies the primal operations along a motion trajectory of a video sequence. When the directional DCT is applied blockwise, the directional coding modes for adjacent blocks can be optimized in view of each other using a weighted graph to represent the related coding mode decisions.Type: ApplicationFiled: October 17, 2006Publication date: April 17, 2008Applicant: Microsoft CorporationInventors: Jizheng Xu, Feng Wu, Hao Xu
-
Patent number: 7356191Abstract: An image encoder is provided, including a to-be-encoded object prediction unit which counts quantization coefficients which are newly made significant, for example, when each bit plane is encoded, on the basis of the quantization coefficients for each bit plane in all code blocks to provide a feature amount, estimates the number of generated codes on the basis of the feature amount, and adds the estimated number of codes in a predetermined order. When an initial target number of codes is reached, the to-be-encoded object prediction unit stops the addition, and supplies the entropy encoder with information on the bit planes whose quantization coefficients have been added as to-be-encoded object information. In this entropy encoder, only the quantization coefficients in the bit planes represented by the to-be-encoded object information are processed by three types of encoding passes while truncating the quantization coefficients of the other bit planes.Type: GrantFiled: December 1, 2003Date of Patent: April 8, 2008Assignee: Sony CorporationInventors: Kazuhisa Hosaka, Takahiro Fukuhara
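A simplified sketch of the prediction step in the abstract above: count the coefficients that first become significant in each bit plane, turn the counts into an estimated code size, and accumulate plane by plane until a target is reached. The single flat coefficient list, the most-significant-first order, and the fixed `bits_per_new_coeff` cost model are all assumptions made for the example; the abstract does not specify them.

```python
def newly_significant_counts(coeffs, num_planes):
    """For each bit plane, most significant first, count the coefficients
    whose highest set magnitude bit lies in that plane."""
    counts = []
    for plane in range(num_planes - 1, -1, -1):
        counts.append(sum(1 for c in coeffs
                          if abs(c).bit_length() - 1 == plane))
    return counts


def select_bit_planes(counts, target_bits, bits_per_new_coeff=8):
    """Add up estimated code sizes in order and stop once the target is
    reached. Returns the indices (0 = most significant plane) to encode.
    bits_per_new_coeff is a hypothetical cost model."""
    selected, total = [], 0
    for index, count in enumerate(counts):
        selected.append(index)
        total += count * bits_per_new_coeff
        if total >= target_bits:
            break
    return selected


coeffs = [37, -18, 5, 3, 0, -1, 9, 2]
counts = newly_significant_counts(coeffs, num_planes=6)
print(counts, select_bit_planes(counts, target_bits=24))
```

Bit planes left out of the returned list correspond to the ones the abstract says are truncated rather than passed through the encoding passes.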
-
Patent number: 7356082Abstract: A metadata extraction unit has a feature point selection and motion estimation unit 62 for extracting at least one feature point representing characteristics of the video/audio signals in a compressed domain of the video/audio signals. This reduces processing time and cost and makes efficient processing possible.Type: GrantFiled: November 29, 1999Date of Patent: April 8, 2008Assignee: Sony CorporationInventor: Peter M. Kuhn
-
Patent number: 7352811Abstract: A data encoding apparatus operable to encode a plurality of data blocks produces encoded data in accordance with at least one of a selectable target data quantity or a selectable target data quality.Type: GrantFiled: March 12, 2002Date of Patent: April 1, 2008Assignee: Sony United Kingdom LimitedInventors: Jonathan James Stone, Jason Charles Pelly, Nicholas Ian Saunders
-
Patent number: 7352812Abstract: Adjacent blocks are identified in an image. Coding parameters for the adjacent blocks are identified. Deblock filtering between the identified adjacent blocks is skipped if the coding parameters for the identified adjacent blocks are similar and not skipped if the coding parameters for the identified adjacent blocks are substantially different.Type: GrantFiled: March 29, 2002Date of Patent: April 1, 2008Assignee: Sharp Laboratories of America, Inc.Inventors: Shijun Sun, Shawmin Lei
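The skip decision in the abstract above amounts to a comparison of two blocks' coding parameters. The sketch below shows one way to express it; which parameters are compared (`motion_vector`, `reference_frame`, `has_residual`) and the use of strict equality as the test for "similar" are assumptions for illustration, since the abstract does not list them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockParams:
    """Hypothetical set of coding parameters for one block."""
    motion_vector: tuple      # (dx, dy)
    reference_frame: int
    has_residual: bool        # True if non-zero transform coefficients exist


def skip_deblocking(a, b):
    """Skip the filter across the shared edge when both blocks were
    predicted the same way and neither carries residual data."""
    return (a.motion_vector == b.motion_vector
            and a.reference_frame == b.reference_frame
            and not a.has_residual
            and not b.has_residual)


left = BlockParams(motion_vector=(2, -1), reference_frame=0, has_residual=False)
same = BlockParams(motion_vector=(2, -1), reference_frame=0, has_residual=False)
moved = BlockParams(motion_vector=(5, 3), reference_frame=0, has_residual=False)
print(skip_deblocking(left, same), skip_deblocking(left, moved))
```

When two neighbours copy adjacent pixels from the same reference picture with the same displacement, the edge between them was already continuous in the reference, which is why filtering it again can be skipped.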
-
Patent number: 7349476Abstract: A digital motion picture decoding apparatus comprising an input buffer memory for storing coded data to be decoded, a reproduced picture memory for storing the decoded picture data to be displayed, and a display picture deciding means for deciding a reproduced picture to be output from the picture data stored in the reproduced picture memory, further comprises: a program changing means for changing the type of the coded data to be decoded; a program change detecting means for detecting from the output of the program changing means that the type of the coded data to be decoded is changed; and a display state maintaining means for controlling the picture data output from the reproduced picture memory so as to maintain the display state of the reproduced picture which is currently displayed according to the decision of the display picture deciding means, when it is detected from the output of the program change detecting means that the change of the type of the coded data to be decoded makes the coded data not cType: GrantFiled: March 28, 2005Date of Patent: March 25, 2008Assignee: Matsushita Electric Industrial Co., Ltd.Inventors: Nobuo Setoguchi, Akihiro Watabe
-
Patent number: 7342962Abstract: Transcoding as from MPEG-2 SDTV to MPEG-4 CIF reuses motion vectors and downsamples in the frequency (DCT) domain with differing treatments of frame-DCT and field-DCT blocks, and alternatively uses de-interlacing IDCT with respect to the row dimension plus deferred column downsampling for reference frame blocks.Type: GrantFiled: September 17, 2003Date of Patent: March 11, 2008Assignee: Texas Instruments IncorporatedInventor: Felix C. Fernandes
-
Publication number: 20080031318Abstract: Provided are a frame conversion apparatus and method and a frame type detecting apparatus and method. The frame conversion apparatus may include a detecting unit and an inverse telecine unit. The detecting unit may determine whether each data frame is a telecine frame or a progressive frame according to the rate of discrete cosine transform (DCT) operation type performed on macro blocks of the frame or the number of the macro blocks on which each type of DCT operation is performed and output frame type information. The inverse telecine unit may convert the data frames into progressive frames and output the progressive frames in response to the frame type information.Type: ApplicationFiled: September 22, 2006Publication date: February 7, 2008Inventors: Seong-Jong Ha, Nam-lk Cho, Sang-Hwa Lee
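A minimal sketch of the detection step in the abstract above: classify a frame from the share of its macroblocks coded with each DCT type. The labels `"field"` and `"frame"`, the direction of the rule (a high share of field-DCT macroblocks suggests a telecine frame), and the 0.5 threshold are assumptions for illustration; the abstract only says the decision depends on the rate or count of each DCT type.

```python
def classify_frame(dct_types, field_ratio_threshold=0.5):
    """Return "telecine" or "progressive" for one frame.

    dct_types holds one entry per macroblock, either "field" or "frame".
    Assumes the list is non-empty; the threshold is a hypothetical value.
    """
    field_count = sum(1 for t in dct_types if t == "field")
    ratio = field_count / len(dct_types)
    return "telecine" if ratio > field_ratio_threshold else "progressive"


mostly_field = ["field"] * 7 + ["frame"] * 3
mostly_frame = ["frame"] * 9 + ["field"]
print(classify_frame(mostly_field), classify_frame(mostly_frame))
```

The DCT type is already present in the compressed stream, so a detector of this kind would not need to decode pixels before deciding whether to apply inverse telecine.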