Plural Patents (Class 375/240.14)
-
Patent number: 11051012Abstract: An image encoding device and a method, and an image decoding device and a method that are designed to improve encoding efficiency in IntraBC. A screen is divided into four slices (Slices #0 through #3). In a case where reference to a different slice is prohibited, the range that can be referred to from the current CTU in Slice #2 is only the decoded portion in Slice #2, and therefore, any block in Slice #1 cannot be referred to, for example. In the case of the present technology, on the other hand, decoded different slices (Slice #0 and Slice #1) are included in the referable range, and accordingly, a block in Slice #1 can be referred to from the current CTU in Slice #2.Type: GrantFiled: March 19, 2020Date of Patent: June 29, 2021Assignee: SONY CORPORATIONInventor: Ohji Nakagami
-
Patent number: 11039163Abstract: Aspects of the disclosure provide a method for video coding in merge mode or skip mode. The method can include receiving a prediction block (PB) of a picture, determining number and positions of merge candidates of the PB according to a size and/or a shape of the PB, and constructing a candidate list including motion data of a subset of the merge candidate positions.Type: GrantFiled: October 16, 2019Date of Patent: June 15, 2021Assignee: MEDIATEK INC.Inventors: Chun-Chia Chen, Chih-Wei Hsu, Yu-Wen Huang
-
Patent number: 11025950Abstract: A motion-field based reference frame is rendered that can be used to perform a prediction process for a first frame of a video sequence to be predicted. A first reference frame from the video sequence for forward inter prediction of the first frame is determined, as is a second reference frame from the video sequence for backward inter prediction of the first frame. A respective motion field determined for blocks of the first frame is used to determine parameters forming an affine homographic model. A current block of a reconstructed reference frame is rendered at a co-located position within a motion field-based reference frame by applying the affine homographic model to the current block. An affine homographic model may be determined for each reconstructed reference frame block to render the motion-field based reference frame.Type: GrantFiled: February 12, 2018Date of Patent: June 1, 2021Assignee: GOOGLE LLCInventors: Yaowu Xu, Jingning Han, Ching-Han Chiang
-
Patent number: 10999606Abstract: A method, system, medium, and article provide neural network loop filtering for video coding with multiple alternative neural networks.Type: GrantFiled: May 21, 2019Date of Patent: May 4, 2021Assignee: Intel CorporationInventors: Hujun Yin, Shoujiang Ma, Xiaoran Fang, Rongzhen Yang
-
Patent number: 10984545Abstract: Techniques for estimating depth for a video stream captured by a monocular image sensor are disclosed. A sequence of image frames are captured by the monocular image sensor. A first neural network is configured to process at least a portion of the sequence of image frames to generate a depth probability volume. The depth probability volume includes a plurality of probability maps corresponding to a number of discrete depth candidate locations over a range of depths defined for the scene. The depth probability volume can be updated using a second neural network that is configured to generate adaptive gain parameters to integrate the DPVs over time. A third neural network is configured to refine the updated depth probability volume from a lower resolution to a higher resolution that matches the original resolution of the sequence of image frames. A depth map can be calculated based on the depth probability volume.Type: GrantFiled: June 12, 2019Date of Patent: April 20, 2021Assignee: NVIDIA CorporationInventors: Jinwei Gu, Kihwan Kim, Chao Liu
-
Patent number: 10979663Abstract: The disclosure relates to optimizing and improving VR video resolution and bandwidth usage for VR videos. The processes and systems include receiving a video input and reassigning pixels within a viewport to maximize resolution around the center of the viewport. The process can include video frame stamping to provide a mechanism for a client-side player to determine characteristics of an optimized video frame display.Type: GrantFiled: March 26, 2018Date of Patent: April 13, 2021Assignee: YERBA BUENA VR, INC.Inventors: Sebastian Amengual Galdon, Victor Ernesto Garcia Sanchez
-
Patent number: 10970816Abstract: A neural network structure, namely a warped external recurrent neural network, is disclosed for reconstructing images with synthesized effects. The effects can include motion blur, depth of field reconstruction (e.g., simulating lens effects), and/or anti-aliasing (e.g., removing artifacts caused by sampling frequency). The warped external recurrent neural network is not recurrent at each layer inside the neural network. Instead, the external state output by the final layer of the neural network is warped and provided as a portion of the input to the neural network for the next image in a sequence of images. In contrast, in a conventional recurrent neural network, hidden state generated at each layer is provided as a feedback input to the generating layer. The neural network can be implemented, at least in part, on a processor. In an embodiment, the neural network is implemented on at least one parallel processing unit.Type: GrantFiled: May 24, 2019Date of Patent: April 6, 2021Assignee: Nvidia CorporationInventors: Carl Jacob Munkberg, Jon Niklas Theodor Hasselgren, Marco Salvi
-
Patent number: 10944982Abstract: Methods to switch between renditions of a video stream are generally described. In some examples, the methods may include encoding a video stream at a first image quality in a first rendition and a second, lower image quality in a second rendition. The methods may further include sending the first rendition to a recipient computing device. The methods may include receiving a request to switch from the first rendition to the second rendition. The methods may include determining that first indicator data of a first inter-coded frame indicates that the video stream can be switched to a lower image quality rendition at the first inter-coded frame. In some examples, the methods may further include sending the second rendition to the recipient computing device.Type: GrantFiled: November 15, 2019Date of Patent: March 9, 2021Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Qi Keith Wang, Yueshi Shen, Yongjun Wu, Tarek Amara, Naushirwan Navroze Patuck, Ilya Vladimirovich Brailovskiy
-
Patent number: 10931944Abstract: The present invention relates to an image processing device and method that enable generation of a highly precise prediction image using a small amount of control information. A motion compensation circuit specifies a macroblock corresponding to a prediction image in part of reference frames using a motion vector supplied from a prediction mode determination circuit, reads an image thereof from a frame memory, and extracts it as a motion compensation image. A motion prediction circuit reads, from a frame memory, at least one or more of the remaining reference frames, performs motion prediction of the motion compensation image supplied from the motion compensation circuit in each frame, reads an image of a macroblock that matches or is similar to the motion compensation image from the frame memory, and extracts it as a motion compensation image.Type: GrantFiled: May 9, 2019Date of Patent: February 23, 2021Assignee: SONY CORPORATIONInventors: Ohji Nakagami, Yoichi Yagasaki
-
Patent number: 10916041Abstract: A method for point cloud encoding includes generating, for a 3D point cloud, first and second frames representing the 3D point cloud at different depths, wherein the first and second frames each include a set of patches representing a cluster of points of the 3D point cloud. The method also includes encoding the first frame. After encoding the first frame, the method includes decoding the first frame. The method further includes generating a third frame representing a difference between corresponding points of the second frame and the decoded first frame. The method additionally includes encoding the third frame. The method also includes generating a compressed bitstream including the encoded first frame and the encoded third frame. The method further includes transmitting the compressed bitstream.Type: GrantFiled: March 18, 2019Date of Patent: February 9, 2021Assignee: Samsung Electronics Co., Ltd.Inventors: Rajan Laxman Joshi, Madhukar Budagavi
-
Patent number: 10904555Abstract: Aspects of the disclosure provide method and apparatus for video coding. In some examples, an apparatus includes processing circuitry for video decoding. In the method, a combined inter coding unit mode indicator is received. Inter-prediction information for one of a plurality of blocks in a current picture is received and the inter-prediction information includes motion information of the one of the plurality of blocks. Each of the plurality of blocks is reconstructed according to the inter-prediction information of the one of the plurality of blocks based on a determination that the combined inter coding unit indicator indicates that each of the plurality of blocks partitioned from a parent block is associated with the inter-prediction information of the one of the plurality of blocks.Type: GrantFiled: December 27, 2018Date of Patent: January 26, 2021Assignee: Tencent America LLCInventors: Shan Liu, Xiang Li, Xin Zhao, Meng Xu
-
Patent number: 10893294Abstract: A method for video decoding is disclosed. A directional intra prediction mode is decoded (910) for a current block of a picture in a video, said directional intra prediction mode having a direction. Based on said directional intra prediction mode, a first predictor for a sample is accessed (7010), the sample being within said current block. Based on said directional intra prediction mode, a second predictor for said sample, is accessed (7020) said first and second predictors being on a line at least approximating said direction. A sample value of said sample is predicted (7030) by using said first and second predictors; and said sample of said current block is reconstructed (965) based on said predicted sample value.Type: GrantFiled: May 29, 2018Date of Patent: January 12, 2021Assignee: InterDigital VC Holdings, Inc.Inventors: Gagan Rath, Fabrice Leleannec, Franck Galpin, Fabien Racape
-
Patent number: 10827172Abstract: An apparatus includes a memory and one or more processors. The processors code each block in a frame of moving images with use of a reference image. The processors generate a decoded image by decoding a coded block and calculate evaluation values for selection of each type of offset filtering. The processors also determine whether the block to be coded is a static region and make an adjustment in the evaluation value on the first offset filtering, based on a layer position of the reference image in a time direction upon a determination that the block to be coded is the static region. The processors also select the type of offset filtering based on the evaluation value on first offset filtering having undergone the adjustment and the evaluation values on second offset filtering and third offset filtering and carry out the selected offset filtering for the decoded image.Type: GrantFiled: September 6, 2018Date of Patent: November 3, 2020Assignee: FUJITSU LIMITEDInventor: Hidenobu Miyoshi
-
Patent number: 10810970Abstract: A luminance difference in an image displayed on an area of interest is easily recognized in a manner free from the effect of luminance of an area surrounding the area of interest. An HDR display (100) includes an image processing unit (14). If the number of digits of a difference between a luminance center value of a luminance distribution in the area of interest in a display unit (20) and a luminance center value of a luminance distribution in an entire display region is 2 digits or more, the image processing unit (14) performs a luminance conversion on the luminance value of each pixel in an entire screen in a manner such that the luminance center value of the luminance distribution in the entire display region approaches the luminance center value of the luminance distribution in the area of interest.Type: GrantFiled: March 22, 2018Date of Patent: October 20, 2020Assignee: SHARP KABUSHIKI KAISHAInventor: Hidekazu Miyata
-
Patent number: 10779001Abstract: According to one embodiment, an image encoding method includes selecting a motion reference block from an encoded pixel block to which an inter prediction is applied. The method includes selecting one or more available blocks from the motion reference block. The method includes selecting a selection block from the available blocks. The method includes generating a predicted image of the encoding target block using motion information of the selection block. The method includes encoding a prediction error between the predicted image and an original image. The method includes encoding selection information specifying the selection block by referring to a code table decided according to a number of the available blocks.Type: GrantFiled: January 17, 2019Date of Patent: September 15, 2020Assignee: KABUSHIKI KAISHA TOSHIBAInventors: Taichiro Shiodera, Saori Asaka, Akiyuki Tanizawa, Takeshi Chujoh
-
Patent number: 10755164Abstract: A dynamic time-evolution Boltzmann machine capable of learning is provided. Aspects include acquiring a time-series input data and supplying a plurality of input values of input data of the time-series input data at one time point to a plurality of nodes of the mode. Aspects also include computing, based on an input data sequence before the one time point in the time-series input data and a weight parameter between each of a plurality of input values of input data of the input data sequence and a corresponding one of the plurality of nodes of the model, a conditional probability of the input value at the one time point given that the input data sequence has occurred. Aspects further include adjusting the weight parameter so as to increase a conditional probability of occurrence of the input data at the one time point given that the input data sequence has occurred.Type: GrantFiled: September 14, 2015Date of Patent: August 25, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Takayuki Osogami, Makoto Otsuka
-
Patent number: 10721472Abstract: A video decoder, encoder, and corresponding methods for processing video data for an image block and a particular reference picture index to predict the image block are disclosed that utilize adaptive weighting of reference pictures to enhance video compression, where a decoder includes a reference picture weighting factor unit for determining a weighting factor corresponding to the particular reference picture index; an encoder includes a reference picture weighting factor assignor for assigning a weighting factor corresponding to the particular reference picture index; and a method for decoding includes receiving a reference picture index with the data that corresponds to the image block, determining a weighting factor for each received reference picture index, retrieving a reference picture for each index, motion compensating the retrieved reference picture, and multiplying the motion compensated reference picture by the corresponding weighting factor to form a weighted motion compensated reference pictureType: GrantFiled: May 3, 2019Date of Patent: July 21, 2020Assignee: InterDigital VC Holdings, Inc.Inventor: Jill MacDonald Boyce
-
Patent number: 10708584Abstract: A method for decoding image, includes generating a residual block; reconstructing an intra prediction mode group indicator and a prediction mode index of a current block, wherein the indicator indicates whether the intra prediction mode of the current block belongs to which intra prediction mode group among a first or second group, and the prediction mode index specifies intra prediction mode in an intra prediction mode group indicated by the indicator; constructing the first group including three intra prediction modes using valid intra prediction modes of left and top blocks of the current block; determining the intra prediction mode corresponding to the prediction mode index in the first group as the intra prediction mode of the current block when the indicator indicates the first group; generating a prediction block based on the determined intra prediction mode of the current block; generating a reconstructed block using the residual block and the prediction block; and performing deblocking filtering on aType: GrantFiled: June 3, 2019Date of Patent: July 7, 2020Assignee: INFOBRIDGE PTE. LTD.Inventor: Shin Ji Park
-
Patent number: 10692185Abstract: A method for training an algorithm to process at least a section of received visual data using a training dataset and reference dataset. The method comprises an iterative method with iterations comprising: generating a set of training data using the algorithm; comparing one or more characteristics of the training data to one or more characteristics of at least a section of the reference dataset; and modifying one or more parameters of the algorithm to optimise processed visual data based on the comparison between the characteristic of the training data and the characteristic of the reference dataset. The algorithm may output the processed visual data with the same content as the at least a section of received visual data. Some aspects and/or implementations provide for improved super-resolution of lower quality images to produce super-resolution images with improved characteristics (e.g. less blur, less undesired smoothing) compared to other super-resolution techniques.Type: GrantFiled: December 28, 2017Date of Patent: June 23, 2020Assignee: Magic Pony Technology LimitedInventors: Zehan Wang, Wenzhe Shi, Ferenc Huszar, Robert David Bishop
-
Patent number: 10687071Abstract: Aspects of the disclosure provide method and apparatus for video coding. In some examples, an apparatus includes processing circuitry for video decoding. The processing circuitry locates luma units of a luma coding unit that are co-located with chroma units of a chroma coding unit, and then determines a popularity of an intra block copy mode in the luma units. Further, the processing circuitry determines a block vector for a chroma unit in the chroma coding unit when the popularity meets a requirement, and reconstructs at least one sample of the chroma unit according to the block vector.Type: GrantFiled: November 28, 2018Date of Patent: June 16, 2020Assignee: Tencent America LLCInventors: Xiaozhong Xu, Xiang Li, Shan Liu
-
Patent number: 10674180Abstract: In one embodiment of the present invention, an encode validator identifies and classifies errors introduced during the parallel chunk-based translation of a source to a corresponding aggregate encode. In operation, upon receiving a source for encoding, a frame difference generator creates a frame difference file for the source. A parallel encoder then distributes per-chunk encoding operations across machines and creates an aggregate encode. The encode validator decodes the aggregate encode and creates a corresponding frame difference file. Subsequently, the encode validator performs phase correlation operations between the two frame difference files to detect errors generated by encoding process faults (i.e., dropping a frame, etc.) while suppressing discrepancies inherent in encoding, such as those attributable to low bit-rate encoding.Type: GrantFiled: February 13, 2015Date of Patent: June 2, 2020Assignee: NETFLIX, INC.Inventors: Anne Aaron, Zhonghua Ma
-
Patent number: 10659788Abstract: An optical flow reference frame portion (e.g., a block or an entire frame) is generated that can be used for inter prediction of blocks of a current frame in a video sequence. A forward reference frame and a backward reference frame are used in an optical flow estimation that produces a respective motion field for pixels of a current frame. The motion fields are used to warp some or all pixels of the reference frames to the pixels of the current frame. The warped reference frame pixels are blended to form the optical flow reference frame portion. The inter prediction may be performed as part of encoding or decoding portions of the current frame.Type: GrantFiled: November 20, 2017Date of Patent: May 19, 2020Assignee: GOOGLE LLCInventors: Yaowu Xu, Bohan Li, Jingning Han
-
Patent number: 10652567Abstract: Techniques are disclosed for coding and decoding video data using object recognition and object modeling as a basis of coding and error recovery. A video decoder may decode coded video data received from a channel. The video decoder may perform object recognition on decoded video data obtained therefrom, and, when an object is recognized in the decoded video data, the video decoder may generate a model representing the recognized object. It may store data representing the model locally. The video decoder may communicate the model data to an encoder, which may form a basis of error mitigation and recovery. The video decoder also may monitor deviation patterns in the object model and associated patterns in audio content; if/when video decoding is suspended due to operational errors, the video decoder may generate simulated video data by analyzing audio data received during the suspension period and developing video data from the data model and deviation(s) associated with patterns detected from the audio data.Type: GrantFiled: March 28, 2018Date of Patent: May 12, 2020Assignee: APPLE INC.Inventors: Xing Wen, Dazhong Zhang, Peikang Song, Xiaosong Zhou, Sudeng Hu, Hsi-Jung Wu, Jae Hoon Kim
-
Patent number: 10587887Abstract: A reference picture marking process and a reference picture list management process is handled in a unified reference picture marking and reference picture list management process. A new idle reference picture list may be used for handling reference pictures that are not used for reference in the current picture. Differential coding of picture order count may be used to increase coding efficiency. The reference picture management syntax structure may be sent in the picture parameter set for improved coding efficiency e.g. in regular GOP (group of pictures) arrangements.Type: GrantFiled: March 8, 2018Date of Patent: March 10, 2020Assignee: NOKIA TECHNOLOGIES OYInventors: Miska Matias Hannuksela, Srikanth Manchenahally Gopalakrishna
-
Patent number: 10587891Abstract: A format for use in encoding moving image data, comprising: a sequence of frames including plurality of the frames in which at least a region is encoded using motion estimation; a respective set of motion vector values representing motion vectors of the motion estimation for each respective one of these frames or each respective one of one or more regions within each of such frames; and at least one respective indicator associated with each of the respective frames or regions, indicating whether the respective motion vector values of the respective frame or region are encoded at a first resolution or a second resolution.Type: GrantFiled: September 21, 2017Date of Patent: March 10, 2020Assignee: Microsoft Technology Licensing, LLCInventors: You Zhou, Sergey Silkin, Sergey Sablin, Chih-Lung Lin, Ming-Chieh Lee, Gary J. Sullivan
-
Patent number: 10575112Abstract: An audio signal processing apparatus includes a receiver which receives an audio signal including audio data for at least a first audio object associated with a three dimensional image. The audio signal also includes depth position data indicative of a target depth position for the first audio object. A determiner determines a visual rendering depth range for a target three dimensional display for presenting the three dimensional image, and a mapper for mapping the target depth position to a rendering depth position for the audio object where the mapping is dependent on the visual rendering depth range. The visual rendering depth range may specifically be a depth range in which the three dimensional display can accurately render objects, and the mapper may amend the positions of audio object sound sources such that these match the depth positions of corresponding visual objects presented by the three dimensional display.Type: GrantFiled: August 30, 2016Date of Patent: February 25, 2020Assignee: Koninklijke Philips N.V.Inventor: Werner Paulus Josephus De Bruijn
-
Patent number: 10567799Abstract: This disclosure describes techniques for simplifying depth inter mode coding in a three-dimensional (3D) video coding process, such as 3D-HEVC. The techniques include generating a motion parameter candidate list, e.g., merging candidate list, for a current depth prediction unit (PU). In some examples, the described techniques include determining that a sub-PU motion parameter inheritance (MPI) motion parameter candidate is unavailable for inclusion in the motion parameter candidate list for the current depth PU if motion parameters of a co-located texture block to a representative block of the current depth PU are unavailable. In some examples, the described techniques include deriving a sub-PU MPI candidate for inclusion in the motion parameter candidate list for the current depth PU only if a partition mode of the current depth PU is 2N×2N.Type: GrantFiled: March 7, 2014Date of Patent: February 18, 2020Assignee: Qualcomm IncorporatedInventors: Hongbin Liu, Ying Chen
-
Patent number: 10506250Abstract: Provided are a method and apparatus for intra predicting an image, which generate a prediction value via linear interpolation in horizontal and vertical directions of a current prediction unit. The method includes: generating first and second virtual pixels by using at least one adjacent pixel located upper right and lower left to a current prediction unit; obtaining a first prediction value of a current pixel via linear interpolation using an adjacent left pixel located on the same line as the first virtual pixel and the current pixel; obtaining a second prediction value of the current pixel via linear interpolation using an adjacent upper pixel located on the same column as the second virtual pixel and the current pixel; and obtaining a prediction value of the current pixel by using the first and second prediction values.Type: GrantFiled: September 21, 2018Date of Patent: December 10, 2019Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Tammy Lee, Jianle Chen
-
Patent number: 10491917Abstract: A device for decoding video data determines a block of video data is coded in an inter prediction mode; implicitly determines that a decoder-side motion vector derivation (DMVD) mode is enabled for the block of video data; determines motion information for the block of video data; uses the motion information to determine a reference block in accordance with the DMVD mode; and generates a predictive block for the block of video data based on the reference block.Type: GrantFiled: March 21, 2018Date of Patent: November 26, 2019Assignee: QUALCOMM IncorporatedInventors: Yi-Wen Chen, Kai Zhang, Xiang Li, Jianle Chen, Wei-Jung Chien, Hsiao-Chiang Chuang, Marta Karczewicz
-
Patent number: 10484685Abstract: An exemplary method for intelligent compression defines a threshold value for a key performance indicator. Based on the key performance indicator value, data blocks generated by a producer component may be scaled down to reduce power and/or bandwidth consumption when being compressed according to a lossless compression module. The compressed data blocks are then stored in a memory component along with metadata that signals the scaling factor used prior to compression. Consumer components later retrieving the compressed data blocks from the memory component may decompress the data blocks and upscale, if required, based on the scaling factor signaled by the metadata.Type: GrantFiled: April 18, 2017Date of Patent: November 19, 2019Assignee: QUALCOMM IncorporatedInventors: Serag Gadelrab, Chinchuan Chiu, Moinul Khan, Kyle Ernewein, Tom Longo, Simon Booth, Meghal Varia, Milivoje Aleksic, King-Chung Lai
-
Patent number: 10460435Abstract: A pattern inspection method includes calculating a first coefficient of a filter function by using data of optical images of plural small regions selected and data of developed images of the plural small regions based on design data; calculating a second coefficient of a filter function by using data of an optical image of a reference small region selected and data of a developed image of the reference small region selected; and determining, for each pixel, whether there exists a pixel for which a difference, between a first temporary reference image to be compared with the optical image of the reference small region generated using the filter function in which the first coefficient is defined and a second temporary reference image to be compared with the optical image of the reference small region generated using the filter function in which the second coefficient is defined, is larger than a threshold.Type: GrantFiled: April 5, 2017Date of Patent: October 29, 2019Assignee: NuFlare Technology, Inc.Inventor: Takafumi Inoue
-
Patent number: 10440375Abstract: Methods of adaptive transform type based on transform unit (TU) size for enhancement layer (EL) coding and multiple motion candidates for EL coding based on corresponding base layer (BL) video data are provided. One method selects a transform type from multiple allowable transform types based on the TU size and applies the selected transform type to the transform units of the inter-layer prediction processed data. Another method derives multiple motion candidates for the EL video data coded in Merge mode or Inter mode based on motion information associated with the corresponding BL video data.Type: GrantFiled: March 12, 2018Date of Patent: October 8, 2019Assignee: MediaTek Inc.Inventors: Tzu-Der Chuang, Zhenzhong Chen, Shan Liu, Ching-Yeh Chen, Chih-Wei Hsu
-
Patent number: 10425650Abstract: A scalable video signal decoding method according to the present invention comprises determining a corresponding picture of a reference layer used for inter-layer prediction of a current picture using interlayer reference information on the current picture of a current layer, generating an interlayer reference picture by up-sampling the determined corresponding picture, generating a reference picture list including a temporal reference picture and the interlayer reference picture, and performing an inter prediction of the current picture on the basis of the reference picture list.Type: GrantFiled: August 29, 2014Date of Patent: September 24, 2019Assignee: KT CORPORATIONInventors: Bae Keun Lee, Joo Young Kim
-
Patent number: 10410350Abstract: A method of using a computer to semantically segment an image using a convolutional neural network system where a processor configured to convolve an input image with a plurality of filters and outputting a first output volume, pool the first output volume and creating a first activation map, determine the level of influence of the first activation map on the semantic segmentation, up-pool the first activation map to form an output image having a same number of pixels as the input image, output a probabilistic segmentation result, labeling each pixel's probability that it is a particular label, and the determination of the level of influence of the first activation map on the semantic segmentation is done using a gate layer that is positioned between a pooling layer and an up-pooling layer.Type: GrantFiled: October 30, 2017Date of Patent: September 10, 2019Assignee: Rakuten, Inc.Inventors: Jiu Xu, Bjorn Stenger
-
Patent number: 10402634Abstract: According to an embodiment, an information processing device includes a memory and processing circuitry. The processing circuitry is configured to receive an image and area information indicating a first area. The processing circuitry is configured to detect an object from the image. The processing circuitry is configured to, based on positional relationship between the first area and the object, estimate probability of occurrence of an event in the first area.Type: GrantFiled: August 30, 2017Date of Patent: September 3, 2019Assignee: Kabushiki Kaisha ToshibaInventor: Tatsuo Kozakaya
-
Patent number: 10390013Abstract: A method for decoding video includes receiving a frame of the video that includes at least one slice and at least one tile. Each of the at least one slice and the at least one tile are not all aligned with one another. Each of the at least one slice is characterized that it is decoded independently of the other the at least one slice. Each of the at least one tile is characterized that it is a rectangular region of the frame and having coding units for the decoding arranged in a raster scan order. The at least one tile of the frame are collectively arranged in a raster scan order of the frame.Type: GrantFiled: October 26, 2016Date of Patent: August 20, 2019Assignee: Velos Media, LLCInventors: Kiran Misra, Christopher A. Segall
-
Patent number: 10368086Abstract: The present application discloses an image coding/decoding method, device, and system, where an encoder performs singular vector decomposition on a prediction block corresponding to a to-be-coded image block, to obtain eigenvector matrices U and V of the prediction block; and performs coding processing on residual data according to the eigenvector matrices U and V of the prediction block, where the residual data is a difference between a pixel value of the to-be-coded image block and a pixel value of the corresponding prediction block. This can reduce identifier load of a coded bit stream.Type: GrantFiled: July 11, 2018Date of Patent: July 30, 2019Assignees: Huawei Technologies Co., Ltd., Tsinghua UniversityInventors: Xiaoran Cao, Yun He, Xiaozhen Zheng, Jianhua Zheng
-
Patent number: 10326993Abstract: A video decoder, encoder, and corresponding methods for processing video data for an image block and a particular reference picture index to predict the image block are disclosed that utilize adaptive weighting of reference pictures to enhance video compression, where a decoder includes a reference picture weighting factor unit for determining a weighting factor corresponding to the particular reference picture index; an encoder includes a reference picture weighting factor assignor for assigning a weighting factor corresponding to the particular reference picture index; and a method for decoding includes receiving a reference picture index with the data that corresponds to the image block, determining a weighting factor for each received reference picture index, retrieving a reference picture for each index, motion compensating the retrieved reference picture, and multiplying the motion compensated reference picture by the corresponding weighting factor to form a weighted motion compensated reference pictureType: GrantFiled: February 13, 2018Date of Patent: June 18, 2019Assignee: InterDigital VC Holdings, Inc.Inventor: Jill MacDonald Boyce
-
Patent number: 10306265Abstract: In general, this disclosure describes techniques for simplifying SDC coding of large intra-prediction blocks, such as 64×64 blocks, in a 3D video coding process, such as 3D-HEVC. In some examples, the techniques may include processing 64×64 intra-prediction blocks as four 32×32 intra-prediction blocks in intra SDC. Processing large intra-prediction blocks as multiple, smaller intra-prediction blocks in intra SDC may reduce maximum buffer size requirements in the intra SDC process.Type: GrantFiled: December 30, 2013Date of Patent: May 28, 2019Assignee: Qualcomm IncorporatedInventors: Hongbin Liu, Ying Chen
-
Patent number: 10291930Abstract: Methods and apparatus are provided for uni-prediction of self-derivation of motion estimation. An apparatus includes a video encoder (300) for encoding at least a portion of a picture. The video encoder includes a self derivation motion estimator (333) for performing self derivation motion estimation for the portion. The self derivation motion estimation is used for selectively performing uni-prediction or bi-prediction for the portion based on one or more criterion.Type: GrantFiled: October 18, 2016Date of Patent: May 14, 2019Assignee: INTERDIGITAL MADISON PATENT HOLDINGSInventors: Liwei Guo, Peng Yin, Yunfei Zheng, Xiaoan Lu, Qian Xu, Joel Sole
-
Patent number: 10264286Abstract: An apparatus configured to code video information in a bitstream includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a plurality of video layers in the bitstream, the plurality of video layers in the bitstream divided into a plurality of bitstream partitions, herein each bitstream partition contains at least one of the plurality of video layers. The processor is configured to process a bitstream conformance parameter associated with a first bitstream partition of the plurality of bitstream partitions, wherein the bitstream conformance parameter is applicable to the first bitstream partition but not to another portion of the bitstream not encompassed by the first bitstream partition. The processor may encode or decode the video information in the bitstream.Type: GrantFiled: June 24, 2015Date of Patent: April 16, 2019Assignee: QUALCOMM IncorporatedInventors: Adarsh Krishnan Ramasubramonian, Ye-Kui Wang
-
Patent number: 10225573Abstract: A current block of a video frame can be encoded or decoded using parameterized motion models. First and second parameterized motion models are identified. The first parameterized motion model corresponds to a first motion model type, and the second parameterized motion model corresponds to a second motion model type. The first and second parameterized motion models are associated with one or more reference frames. One of the first or second parameterized motion models is selected along with an associated reference frame, such as based on a lowest prediction error. A motion vector is generated between the current block and the selected reference frame by warping pixels of the current block to a warped patch of the selected reference frame according to the selected parameterized motion model. A prediction block is generated using the motion vector, and the current block is encoded or decoded using the prediction block.Type: GrantFiled: January 31, 2017Date of Patent: March 5, 2019Assignee: GOOGLE LLCInventors: Debargha Mukherjee, James Bankoski, Yue Chen, Sarah Parker
-
Mode-dependent coefficient scanning and directional transforms for different colour sampling formats
Patent number: 10205941Abstract: A method of coding 4:2:2 or 4:4:4 video data comprises predicting luminance and/or chrominance samples of an image from other respective reference samples derived from the same image according to a prediction mode associated with a sample to be predicted, the prediction mode being selected for each of a plurality of blocks of samples, from a set of two or more candidate prediction modes; detecting differences between the samples and the respective predicted samples; selecting a frequency-separation transform from two or more candidate frequency separation transforms according to the prediction mode associated with a current block of samples using a mapping between transform and prediction mode, the mapping between different, as between chrominance and luminance samples, for at least the 4:4:4 format; and encoding the detected differences by frequency-separating the differences, using the selected frequency-separation transform.Type: GrantFiled: June 14, 2017Date of Patent: February 12, 2019Assignee: Sony CorporationInventors: James Alexander Gamei, Nicholas Ian Saunders, Karl James Sharman, Paul James Silcock -
Patent number: 10187650Abstract: Video encoders and decoders and video encoding and decoding methods are provided. A video encoder includes an input buffer configured to receive a video data stream and to supply current frame data, a frame buffer configured to store reconstructed frame data, and an encoder circuit configured to read reference frame data from the frame buffer, to encode the current frame data received from the input buffer using the reference frame data and to write the reconstructed frame data to the frame buffer. The encoder circuit may be configured to write the reconstructed frame data by overwriting the reference frame data in the frame buffer.Type: GrantFiled: June 17, 2014Date of Patent: January 22, 2019Assignee: STMICROELECTRONICS INTERNATIONAL N.V.Inventors: Sumit Johar, SurinderPal Singh
-
Patent number: 10176090Abstract: Providing memory bandwidth compression using adaptive compression in central processing unit (CPU)-based systems is disclosed. In one aspect, a compressed memory controller (CMC) is configured to implement two compression mechanisms: a first compression mechanism for compressing small amounts of data (e.g., a single memory line), and a second compression mechanism for compressing large amounts of data (e.g., multiple associated memory lines). When performing a memory write operation using write data that includes multiple associated memory lines, the CMC compresses each of the memory lines separately using the first compression mechanism, and also compresses the memory lines together using the second compression mechanism. If the result of the second compression is smaller than the result of the first compression, the CMC stores the second compression result in the system memory. Otherwise, the first compression result is stored.Type: GrantFiled: September 15, 2016Date of Patent: January 8, 2019Assignee: QUALCOMM IncorporatedInventors: Colin Beaton Verrilli, Natarajan Vaidhyanathan, Mattheus Cornelis Antonius Adrianus Heddes
-
Patent number: 10158871Abstract: A video processing system provided with video encoding apparatus 1 and video decoding apparatus 2. The encoding apparatus 1 outputs a maximum delay time that is incurred by backward prediction, in addition to encoded data D1 resulting from encoding of video data D0. The decoding apparatus 2 effects input of the maximum delay time that is incurred by backward prediction, in addition to encoded data D1 from the encoding apparatus 1. Then the decoding apparatus 2 decodes the encoded data D1 with reference to the input maximum delay time to generate motion video data D2.Type: GrantFiled: November 17, 2017Date of Patent: December 18, 2018Assignee: NTT DOCOMO, INC.Inventors: Satoru Adachi, Choong Seng Boon, Sadaatsu Kato, Minoru Etoh, Thiow Keng Tan
-
Patent number: 10148967Abstract: An image coding method includes: generating a predicted block; calculating a residual block; calculating quantized coefficients by performing transform and quantization on the residual block; calculating a coded residual block by performing inverse quantization and inverse transform on the quantized coefficients; generating a temporary coded block; determining whether or not an offset process is required, to generate first flag information indicating a result of the determination; executing the offset process on the temporary coded block when it is determined that the offset process is required; and performing variable-length coding on the quantized coefficients and the first flag information.Type: GrantFiled: March 13, 2018Date of Patent: December 4, 2018Assignee: SUN PATENT TRUSTInventors: Toru Matsunobu, Takahiro Nishi, Youji Shibahara, Hisao Sasai, Kyoko Tanikawa, Toshiyasu Sugio
-
Patent number: 10123023Abstract: In a moving picture coding device that codes a moving picture using motion compensation prediction in units of blocks acquired by dividing each picture of the moving picture, a prediction mode determiner derives motion information of a coding target block. A motion compensation predictor changes the derived motion information for the coding target block having a size smaller than or equal to a certain size when the coding target block has the size smaller than or equal to the certain size. The motion compensation predictor constructs a prediction signal of the coding target block through the motion compensation prediction using the derived motion information when the coding target block has a size larger than the certain size or using the changed motion information when the coding target block has the size smaller than or equal to the certain size.Type: GrantFiled: July 16, 2018Date of Patent: November 6, 2018Assignee: JVC KENWOOD CorporationInventors: Shigeru Fukushima, Hideki Takehara, Motoharu Ueda
-
Patent number: 10116944Abstract: A video encoding device includes: first video encoding means for encoding an input image to generate first coded data; a buffer for storing the input image; coded data transcoding means for transcoding the first coded data generated by the first video encoding means, to generate second coded data; and second video encoding means for generating a prediction signal based on the second coded data supplied from the coded data transcoding means. The first video encoding means includes: dividing means for dividing the input image into a plurality of image areas; and at least one encoding means corresponding to the image areas each of which is made up of a plurality of blocks, and for performing encoding in units of blocks. The encoding means also encodes a block that is included in an image area adjacent with a dividing line in between and is located near the dividing line.Type: GrantFiled: January 29, 2015Date of Patent: October 30, 2018Assignee: NEC CORPORATIONInventors: Takayuki Ishida, Kensuke Shimofure, Naoya Tsuji, Suguru Nagayama, Keiichi Chono
-
Patent number: RE48074Abstract: An image encoding device includes a first prediction parameter determination section (53) for selecting, for each of prediction units belonging to a first group, a prediction parameter from a basic set; a second prediction parameter determination section (55) for selecting, for each of prediction units belonging to a second group, a prediction parameter from a reduced set (i) including at least a part of the prediction parameter(s) selected by the first prediction parameter determination section (53) and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of prediction parameters included in the basic set; and a prediction parameter encoding section (243) for encoding (i) information indicating which one of prediction parameters is selected by the first prediction parameter determination section (53) and (ii) information indicating which one of prediction parameters is selected by the second prediction parameter determination section (55).Type: GrantFiled: June 8, 2018Date of Patent: June 30, 2020Assignee: Velos Media, LLCInventor: Tomoyuki Yamamoto