Distinguishing Text From Other Regions Patents (Class 382/176)

Removing character from text in non-image form where location of character in image of text falls outside of valid content boundary

Patent number: 8682075

Abstract: Data representing an image of text is received, as is data representing the text in non-image form. A valid content boundary within the image of the text is determined. For each character within the text in the non-image form, a location of the character within the image of the text is determined. Where the location of the character within the image of the text falls outside the valid content boundary, the character is removed from the data representing the text in the non-image form.

Type: Grant

Filed: December 28, 2010

Date of Patent: March 25, 2014

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Prakash Reddy
Method for omnidirectional processing of 2D images including recognizable characters

Patent number: 8682077

Abstract: The invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image. An optical reader configured in accordance with the invention searches for pixel groupings in a starburst pattern, and subjects located pixel groupings to a preliminary edge crawling process which records the pixel position of the grouping's edge and records the count of edge pixels. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a centerline connecting the center points of the two similarly sized groupings. A reader according to the invention searches for additional recognizable characters within the rail area, and subjects each located pixel grouping within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge.

Type: Grant

Filed: June 11, 2010

Date of Patent: March 25, 2014

Assignee: Hand Held Products, Inc.

Inventor: Andrew Longacre, Jr.
SEGMENTATION CO-CLUSTERING

Publication number: 20140079316

Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.

Type: Application

Filed: September 17, 2013

Publication date: March 20, 2014

Applicant: Raytheon BBN Technologies Corp.

Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
Image processing method and apparatus, and document management server, performing character recognition on a difference image

Patent number: 8675260

Abstract: According to one embodiment, the image processing apparatus includes a printing control unit, an image reading unit, an extracting unit, a difference image extracting unit, and a determination unit. The printing control unit controls printing of a plurality of pages on one sheet of paper according to a print setting information which indicates a printing form, and printing of a code indicating the print setting information on the paper. The image reading unit read the paper. The extracting unit extracts the code from the read image. The difference image extracting unit extracts a difference image between the printed image and the read image.

Type: Grant

Filed: March 14, 2012

Date of Patent: March 18, 2014

Assignee: Toshiba Tec Kabushiki Kaisha

Inventors: Shigeo Uchida, Taira Ashikawa, Satoshi Oyama, Katsuhito Mochizuki
Method, system and computer program product for transmitting data from a document application to a data application

Patent number: 8667410

Abstract: In a method for computer-aided transfer of data from a document application into a data application having a set of data fields, a document is displayed in the document application opened on a computer with a display device, and wherein from the document data are to be transferred into the data application also opened on the computer. A name of a data field into which data are to be transferred is displayed on the display device. Via identification of a corresponding data value in the document on the display device, a character string representing the data value is automatically read out from the document and entered into the data field corresponding to the data field name in the data application via actuation of a predetermined button.

Type: Grant

Filed: July 4, 2006

Date of Patent: March 4, 2014

Assignee: Open Text S.A.

Inventor: Johannes Schacht
Compression of digital images of scanned documents

Patent number: 8666185

Abstract: A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.

Type: Grant

Filed: November 17, 2011

Date of Patent: March 4, 2014

Assignee: I.R.I.S.

Inventors: Michel Dauw, Pierre Demuelenaere
Image forming apparatus and image forming method determining correct pixel value of target pixel subject to interpolation

Patent number: 8659801

Abstract: An image forming apparatus includes an image interpolation unit to compute a correct pixel value of a target pixel subject to interpolation of a halftone image. The image interpolation unit includes a base pattern setting unit to set a base pattern including the target pixel, a reference pattern setting unit to set reference pattern in a region peripheral to the target pixel, an analogous pattern acquisition unit to acquire at least one analogous pattern analogous to the base pattern from the reference pattern, a high-resolution pattern creating unit to create a high-resolution pattern having a predetermined resolution or higher by synthesizing the acquired analogous pattern, a pixel value estimating unit to compute an estimated pixel value of the target pixel based on the created high-resolution pattern, and a pixel value determination unit to determine the correct pixel value of the target pixel based on the computed estimated pixel value.

Type: Grant

Filed: August 23, 2011

Date of Patent: February 25, 2014

Assignee: Ricoh Company, Ltd.

Inventor: Satoshi Nakamura
Document editing apparatus and method

Patent number: 8655074

Abstract: A method for storing a document recognition result is proposed. The method includes selecting a picture area from a document image, storing an image of the selected picture area in an image file format, removing the selected picture area, filling the removed picture area with a surrounding background color, and performing character recognition of a text area.

Type: Grant

Filed: February 2, 2011

Date of Patent: February 18, 2014

Assignee: Samsung Electronics Co., Ltd

Inventors: Ji-Hoon Kim, Sang-Ho Kim, Seong-Taek Hwang, Dong-Chang Lee
Data obfuscation of text data using entity detection and replacement

Patent number: 8649552

Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.

Type: Grant

Filed: April 3, 2008

Date of Patent: February 11, 2014

Assignee: International Business Machines Corporation

Inventors: Sreeram Viswanath Balakrishnan, Rema Ananthanarayanan, Souptik Datta
SYMBOL COMPRESSION USING CONDITIONAL ENTROPY ESTIMATION

Publication number: 20140037210

Abstract: The present disclosure includes a system and method for symbol compression using conditional entropy estimation. One method for symbol compression using conditional entropy estimation includes approximating a quantity of symbol encoding bits for a number of symbols using a conditional entropy estimation. Dictionary entries are generated from the number of symbols so as to minimize a total bit-stream quantity. The total bit-stream quantity includes at least the approximated quantity of symbol encoding bits and a quantity of dictionary entries encoding bits. The symbols are encoded using the dictionary entries as a reference.

Type: Application

Filed: July 31, 2012

Publication date: February 6, 2014

Inventors: Dejan Depalov, Peter Bauer, Charles A. Bouman, Jan Allebach, Yandong Guo
Image forming apparatus and image forming method including a first measuring unit that measures invisible toner

Patent number: 8643910

Abstract: An image forming apparatus includes an acquiring unit that acquires image data expressing an image region included in an image with a first value and a background region included in the image with a second value; a segmenting unit that segments the image region into multiple segments arranged in a fast scanning direction; a converting unit that converts a value of at least one of the segments into the second value; an output unit that generates an image signal on the basis of the image data and outputs the image signal; an exposure unit that exposes a charged image bearing member to light according to the output image signal by scanning the light thereto in the fast scanning direction so as to form a latent image; and a developing unit that forms the image by developing the latent image using an invisible toner that absorbs infrared light or ultraviolet light.

Type: Grant

Filed: July 27, 2011

Date of Patent: February 4, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventor: Junichi Ichikawa
Method and system for searching for information on a network in response to an image query sent by a user from a mobile communications device

Patent number: 8644610

Abstract: Present invention relates to a method and system for automatic searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using an OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information is sent to the user.

Type: Grant

Filed: August 9, 2012

Date of Patent: February 4, 2014

Assignee: A9.com, Inc.

Inventors: Gurumurthy D. Ramkumar, Raghavan Manmatha, Supratik Bhattacharyya, Gautam Bhargava, Mark A. Ruzon
Mobile terminal and control method thereof

Patent number: 8644881

Abstract: A method for controlling a mobile terminal, and which includes receiving, via an input unit, a selection signal indicating a selection of a predetermined button among multiple predetermined buttons on the mobile terminal, in which the multiple predetermined buttons corresponding to different preset functions executed on the mobile terminal; capturing, via a camera included on the mobile terminal, a preview image of an object upon receiving the selection signal; recognizing, via a controller included on the mobile terminal, a character string included in the captured preview image; and performing, via the controller, a preset function using the recognized character string and that corresponds to the selection of the predetermined button.

Type: Grant

Filed: June 9, 2010

Date of Patent: February 4, 2014

Assignee: LG Electronics Inc.

Inventors: Yoon-Ho Kim, Hye-Jin Oh
Orientation detection using image processing

Patent number: 8643741

Abstract: Devices, methods, and computer readable media for performing image orientation detection using image processing techniques are described. In one implementation, an image processing method is disclosed that obtains image data from a first image captured by an image sensor (e.g., from any image capture electronic device). Positional sensor data captured by the device and corresponding to the image data may also be acquired (e.g., through an accelerometer). If the orientation of the device is not reliably discernible from the positional sensor data, the method may attempt to use rotationally invariant character detection metrics to determine the most likely orientation of the image, e.g., by using a decision forest algorithm. Face detection information may be used in conjunction with, or as a substitute for, the character detection data based on one or more priority parameters. Image orientation information may then be included within the image's metadata.

Type: Grant

Filed: January 17, 2012

Date of Patent: February 4, 2014

Assignee: Apple Inc.

Inventor: Ralph Brunner
Whiteboard archiving and presentation method

Patent number: 8639032

Abstract: The present invention discloses methods of archiving and optimizing lectures, presentations and other captured video for playback, particularly for blind and low vision individuals. A digital imaging device captures a preselected field of view that is subject to periodic change such as a whiteboard in a classroom. A sequence of frames is captured. Frames associated with additions or erasures to the whiteboard are identified. The Cartesian coordinates of the regions of these alterations within the frame are identified. When the presentation is played back, the regions that are altered are enlarged or masked to assist the low vision user. In another embodiment of the invention, the timing of the alterations segments the recorded audio into chapters so that the blind user can skip forward and backward to different sections of the presentation.

Type: Grant

Filed: August 29, 2008

Date of Patent: January 28, 2014

Assignee: Freedom Scientific, Inc.

Inventors: Garald Lee Voorhees, Robert Anders Steinberger, Ralph Ernest Ocampo
AUTOMATIC CORRECTION OF SKEW IN NATURAL IMAGES AND VIDEO

Publication number: 20140022406

Abstract: An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.

Type: Application

Filed: March 14, 2013

Publication date: January 23, 2014

Applicant: QUALCOMM INCORPORATED

Inventors: Pawan Kumar Baheti, Kishor K. Barman, Hemanth P. Acharya
IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD AND STORAGE MEDIUM

Publication number: 20140023272

Abstract: Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.

Type: Application

Filed: September 24, 2013

Publication date: January 23, 2014

Applicant: CANON KABUSHIKI KAISHA

Inventors: Taeko Yamazaki, Tomotoshi Kanatsu, Makoto Enomoto, Kitahiro Kaneda
System and method for identifying pictures in documents

Patent number: 8634644

Abstract: A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.

Type: Grant

Filed: August 25, 2009

Date of Patent: January 21, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventors: Patrick Chiu, Francine Chen, Laurent Denoue
SYSTEM AND METHOD FOR SEPARATING IMAGE AND TEXT IN A DOCUMENT

Publication number: 20140009772

Abstract: The disclosed embodiment relates to system and method for separating background image from foreground text in one or more electronic pages. The one or more electronic pages are compared to check whether the background image in each of the one or more electronic pages are same. If it found that the one or more electronic pages have common background image, the common background image is subtracted from each of the one or more pages. The foreground text from each of the one or more electronic pages is recognized using an OCR. Finally, the recognized foreground text from each of the one or more electronic pages is consolidated in a file. The consolidated file can be printed or send to one or more recipients over an email.

Type: Application

Filed: July 9, 2012

Publication date: January 9, 2014

Applicant: XEROX CORPORATION

Inventor: Ying Gao
Image processing device and image forming device

Patent number: 8625150

Abstract: An image processing device includes an image data acquiring part that acquires the first and second image data, an edge characteristic extraction part that extracts first edges and second edges forming the shapes of the rectangular regions contained in the first and second image data, a rectangular characteristic calculating part that extracts both a first calculated rectangular region formed by the first edges and a second calculated rectangular region formed by the second edges, a position adjustment parameter calculating part that calculates parameters indicating a separation distance and a separation angle between the first calculated rectangular region and the second calculated rectangular region when the first image data and the second image data are overlapped, and an image data correction part that corrects at least one of the first image data and the second image data by shifting and rotating based upon the parameters.

Type: Grant

Filed: July 23, 2010

Date of Patent: January 7, 2014

Assignee: Oki Data Corporation

Inventor: Tomonori Kondo
Image forming apparatus that outputs image data to plural destinations

Patent number: 8625127

Abstract: An image forming apparatus includes a receiving unit that receives image data; an extracting unit that extracts specific information from the image data; a first recognizing unit that recognizes destination information from the specific information; and a control unit that outputs the image data, wherein, when the first recognition unit recognizes a plurality of destination information, the control unit outputs the image data to respective destinations corresponding to each of the plurality of the destination information.

Type: Grant

Filed: February 11, 2009

Date of Patent: January 7, 2014

Assignee: Brother Kogyo Kabushiki Kaisha

Inventor: Akihiro Yamada
GESTURE-BASED VISUAL SEARCH

Publication number: 20140003714

Abstract: A user may perform an image search on an object shown in an image. The user may use a mobile device to display an image. In response to displaying the image, the client device may send the image to a visual search system for image segmentation. Upon receiving a segmented image from the visual search system, the client device may display the segmented image to the user who may select one or more segments including an object of interest to instantiate a search. The visual search system may formulate a search query based on the one or more selected segments and perform a search using the search query. The visual search system may then return search results to the client device for display to the user.

Type: Application

Filed: September 5, 2013

Publication date: January 2, 2014

Applicant: Microsoft Corporation

Inventors: Tao Mei, Shipeng Li, Ying-Qing Xu, Ning Zhang, Zheng Chen, Jian-Tao Sun
Image processing apparatus, method, and storage medium for determining attributes

Patent number: 8620081

Abstract: An image processing apparatus determines an attribute of a block image based on the attribute of the block image determined based on a color distribution characteristic amount of the block image and the attribute of the block image determined based on an edge characteristic amount of the block image.

Type: Grant

Filed: November 7, 2011

Date of Patent: December 31, 2013

Assignee: Canon Kabushiki Kaisha

Inventors: Xiaoyan Dai, Taeko Yamazaki
Methods and systems for locating text in a digital image

Patent number: 8620080

Abstract: Aspects of the present invention relate to systems and methods for locating text in a digital image. According to a first aspect of the present invention, a multi-stage filtering technique may be used to progressively refine a set of candidate text components associated with a digital image. A first, refined set of candidate text components may be formed by filtering an initial set of candidate text components based on component properties. Text lines may reconstructed from the first, refined set of candidate text components. The first, refined set of candidate text components may be further filtered based on text-line properties measured on the reconstructed text lines.

Type: Grant

Filed: September 26, 2008

Date of Patent: December 31, 2013

Assignee: Sharp Laboratories of America, Inc.

Inventor: Ahmet Mufit Ferman
Utilizing subtitles in multiple languages to facilitate second-language learning

Patent number: 8620139

Abstract: Processing video for utilization in second language learning is described herein. A video file includes spoken words in a source language, subtitles in the source language, and subtitles in a native language of an end user (a target language). The subtitles in the source language are synchronized with the spoken words in the video, and the subtitles in the source language are mapped to the subtitles in the target language. Both sets of subtitles are displayed simultaneously as the video is played by the end user.

Type: Grant

Filed: April 29, 2011

Date of Patent: December 31, 2013

Assignee: Microsoft Corporation

Inventors: Chi Ho Li, Matthew Robert Scott
Text detection using multi-layer connected components with histograms

Patent number: 8611662

Abstract: A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.

Type: Grant

Filed: November 21, 2011

Date of Patent: December 17, 2013

Assignee: Nokia Corporation

Inventors: Shang-hsuan Tsai, Vasudev Parameswaran, Radek Grzeszczuk
OCR multi-resolution method and apparatus

Patent number: 8611661

Abstract: In some embodiments, provided are procedures for processing images that may have different font sizes. In some embodiments, it involves OCR'ing with multiple passes at different resolutions.

Type: Grant

Filed: December 26, 2007

Date of Patent: December 17, 2013

Assignee: Intel Corporation

Inventors: Oscar Nestares, Badusha Kalathiparambil
FINDING TEXT IN NATURAL SCENES

Publication number: 20130330004

Abstract: As set forth herein, systems and methods facilitate providing an efficient edge-detection and closed-contour based approach for finding text in natural scenes such as photographic images, digital, and/or electronic images, and the like. Edge information (e.g., edges of structures or objects in the images) is obtained via an edge detection technique. Edges from text characters form closed contours even in the presence of reasonable levels of noise. Closed contour linking and candidate text line formation are two additional features of the described approach. A candidate text line classifier is applied to further screen out false-positive text identifications. Candidate text regions for placement of text in the natural scene of the electronic image are highlighted and presented to a user.

Type: Application

Filed: June 12, 2012

Publication date: December 12, 2013

Applicant: XEROX CORPORATION

Inventors: Raja Bala, Zhigang Fan, Hengzhou Ding, Jan P. Allebach, Charles A. Bouman
ADAPTIVE THRESHOLDING FOR IMAGE RECOGNITION

Publication number: 20130330003

Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.

Type: Application

Filed: June 7, 2012

Publication date: December 12, 2013

Applicant: AMAZON TECHNOLOGIES, INC.

Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
Adaptive thresholding for image recognition

Patent number: 8606011

Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.

Type: Grant

Filed: June 7, 2012

Date of Patent: December 10, 2013

Assignee: Amazon Technologies, Inc.

Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
Identifying text pixels in scanned images

Patent number: 8606010

Abstract: A processor and method make use of multiple weak classifiers to construct a single strong classifier to identify regions that contain text within an input image document. The weak classifiers are grouped by their computing cost from low to median to high, and each weak classifier is assigned a weight value based on its ability to accurately identify text regions. A level 1 classifier is constructed by selecting weak classifiers from the low group, a level 2 classifier is constructed by selecting weak classifiers from the low and median groups, and a level 3 classifier is constructed by selecting weak classifiers from the low, median and high groups. Regions that the level 1 classifier identifies as containing text are submitted to the level 2 classifier, and regions that the level 2 classifier identifies as containing text are submitted to the level 3 classifier.

Type: Grant

Filed: March 18, 2011

Date of Patent: December 10, 2013

Assignee: Seiko Epson Corporation

Inventor: Jing Xiao
Method and system for printing a web page

Patent number: 8593666

Abstract: A method and system for printing a web page include converting the web page content to an image and segmenting the image into a plurality of regions. At least one of the regions is selected, and the selected region is printed.

Type: Grant

Filed: February 11, 2009

Date of Patent: November 26, 2013

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Jun Xiao
Adaptive partial character recognition

Patent number: 8594431

Abstract: A method and system for recognizing a character affected by a noise or an obstruction is disclosed. After receiving an image with characters, a character being affected by a noise or an obstruction is determined. Then, areas in the character where the noise or obstruction affected are precisely located. Templates representing every possible character in the image are updated by removing equivalent areas to the areas in the character being affected by the noise or obstruction. Then, the character is classified in a template among the updated templates by finding the template having the highest number of matching pixels with the character.

Type: Grant

Filed: September 14, 2012

Date of Patent: November 26, 2013

Assignee: International Business Machines Corporation

Inventors: Ami Ben-Horesh, Amir Geva, Eugeniusz Walach
Visualization program, visualization method and visualization apparatus for visualizing reading order of content

Patent number: 8588526

Abstract: A visualization program, method and apparatus for determining reading order of content in a structured document. The method includes generating, for each of a plurality of elements, a directed segment; storing, in the reading order, the generated directed segments of the elements into a storage device; reading from the storage device; linking together the directed segments for the elements in accordance with the reading order; and displaying the linked directed segments overlaid on the structured document which is displayed on the screen. A computer implemented program and an apparatus for carrying out the above method are also provided.

Type: Grant

Filed: November 15, 2012

Date of Patent: November 19, 2013

Assignee: International Business Machines Corporation

Inventor: Daisuke Sato
Compression of text contents for display remoting

Patent number: 8582886

Abstract: Embodiments of the invention compress an image that contains a representation of text. Embodiments take an image of graphical data and determines one or more portions of that image that have a high probability of containing text. Embodiments then take each such portion of the image and determines one or more rows of text within each portion (where text does, in fact, exist within the portion). The embodiments then traverse each vertical band of pixels of each row to determine sub-glyphs. Where a particular sub-glyph is encountered for the first time, the embodiments cache that sub-glyph, and send it (or a compressed representation thereof) to a client in a remote presentation session. Where a particular sub-glyph has been cached already, the embodiments send a reference to that cached vertical band to the client.

Type: Grant

Filed: May 19, 2011

Date of Patent: November 12, 2013

Assignee: Microsoft Corporation

Inventors: Nadim Y. Abdo, Voicu Anton Albu
Hybrid codec for compound image compression

Patent number: 8582876

Abstract: One or more portions of a compound image may be classified as picture portions and at least one remaining portion of the compound image may be classified as a non-picture portion. A first layer of a layered image may be generated based on the picture portions of the compound image. The first layer may be compliant with a first image format. A second layer of the layered image may be generated based on the non-picture portion. The second layer may be compliant with a second image format that is different from the first image format. The first layer and the second layer may be sent to a web browser. The first format and the second format may be supported by the web browser.

Type: Grant

Filed: November 15, 2011

Date of Patent: November 12, 2013

Assignee: Microsoft Corporation

Inventors: Huifeng Shen, Zhaotai Pan, Yan Lu, Shipeng Li
NOISE REMOVAL FROM IMAGES CONTAINING TEXT

Publication number: 20130294693

Abstract: The noise in an image having text is removed by convolving a shaped kernel centered on a pixel for each pixel in the image. The shaped kernel has a shape configured to identify pixels that are not part of the text. For example, the shaped kernel may be shaped with zeros in a center of the kernel to identify pixels that are not part of the text. A value for the pixel is set to erase the pixel when the resulting convolution value for the pixel is less than a threshold. The process may be repeated multiple times for differently shaped kernels, including kernels of different sizes and different configurations, such as having values greater than one in at least one of a row, column, and diagonal.

Type: Application

Filed: January 3, 2013

Publication date: November 7, 2013

Applicant: QUALCOMM Incorporated

Inventors: Ramin Rezaiifar, Serafin Diaz Spindola
CLUSTERING OF FORMS FROM LARGE-SCALE SCANNED-DOCUMENT COLLECTION

Publication number: 20130294690

Abstract: Techniques for identifying documents sharing common underlying structures in a large collection of documents and processing the documents using the identified structures are disclosed. Images of the document collection are processed to detect occurrences of a predetermined set of image features that are common or similar among forms. The images are then indexed in an image index based on the detected image features. A graph of nodes is built. Nodes in the graph represent images and are connected to nodes representing similar document images by edges. Documents sharing common underlying structures are identified by gathering strongly inter-connected nodes in the graph. The identified documents are processed based at least in part on the resulting clusters.

Type: Application

Filed: July 8, 2013

Publication date: November 7, 2013

Inventors: Shlomo Urbach, Eyal Fink, Tal Yadid, Yuval Netzer
Connected component labeling system and method

Patent number: 8577144

Abstract: Embodiments disclosed include methods for connected component labeling including labeling groups of raw data as one or more regions, the labeling including designating one or more data structures as containing information about the one or more regions; designating one or more of the regions as one or more subregions to expose a spatial distribution of one or more region features; and arranging at least one memory array with a 1:1 correspondence to a data array associated with the raw data to enable one or more data structures to include feature labels of the one or more subregions, the 1:1 correspondence enabling acquisition of the one or more region features with a controllable precision.

Type: Grant

Filed: August 27, 2012

Date of Patent: November 5, 2013

Assignee: eyeP, Inc.

Inventor: Craig Sullender
Enhanced screen capture for form manipulation

Patent number: 8571319

Abstract: According to one embodiment of the present invention, a method for processing forms based on an image is presented. A form is captured in an image, and a number of field values within the form in the image are detected. The number of field values is stored in the image metadata. In another illustrative embodiment, an access request for a form is detected. A determination is made as to whether the form corresponds to a stored image in a number of stored images. If the form corresponds to a stored image, metadata associated with the stored image is retrieved. The metadata includes a number of field values and associated textual data corresponding to the form. The form is populated with the number of field values and the associated textual data from the metadata associated with the stored image.

Type: Grant

Filed: July 28, 2009

Date of Patent: October 29, 2013

Assignee: International Business Machines Corporation

Inventors: Swaminathan Balasubramanian, Andrew R. Jones, Brian M. O'Connell, Keith R. Walker
Video thumbnail selection

Patent number: 8571330

Abstract: A method for selecting a video thumbnail includes generating a visual theme model for a sample set of images that are representative of textual information corresponding to a video file. Each of a set of candidate key frames is distinguished according to similarities shared between the candidate key frames and the visual theme model. A display is caused of a selected one of the distinguished candidate key frames as a video thumbnail for the video file.

Type: Grant

Filed: September 17, 2009

Date of Patent: October 29, 2013

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Yuli Gao, Tong Zhang, Jun Xiao
Image Processing Apparatus that Groups Objects Within Image

Publication number: 20130272610

Abstract: Provided is an image processing apparatus including: a grouping preference unit configured to register user preference information on a storage device based on a user operation, the user preference information indicating how objects within an image are to be classified into groups; an image analysis unit configured to detect the objects within the image; and a grouping unit configured to read the user preference information from the storage device and classify the objects detected within the image into the groups indicated in the read user preference information.

Type: Application

Filed: April 10, 2013

Publication date: October 17, 2013

Applicant: KYOCERA Document Solutions Inc.

Inventors: Ryosuke Ogishi, Yosuke Kashimoto, Masaaki Aiba, Takashi Murakami
METHOD OF PROVIDING ONLINE INFORMATION USING IMAGE INFORMATION

Publication number: 20130272612

Abstract: The present invention provides a method of providing online information using image, including separating each of a target image, received from a user terminal, and an original image, received from an information provider apparatus, into a text region and a graphic region; selecting an important text region from the text region; extracting features from the text region, the graphic region, and the important text region, respectively; searching for the original image corresponding to the target image using the features of the text region, the graphic region, and the important text region; and searching for supplementary information related to the retrieved original image and provided the retrieved supplementary information.

Type: Application

Filed: March 26, 2013

Publication date: October 17, 2013

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Jae Cheol SIM, Kang Yong LEE
Image frame processing method and device for displaying moving images to a variety of displays

Patent number: 8559798

Abstract: A rendering process for rendering an image frame and a postprocess for adapting the image frame to a display are separated. A rendering processing unit 42 generates an image frame sequence by performing rendering at a predetermined frame rate regardless of a condition that the image frame should meet for output to the display. A postprocessing unit 50 subjects the image frame sequence generated by the rendering processing unit to a merge process so as to generate and output an updated image frame sequence that meets the condition. Since the rendering process and the postprocess are separated, the image frame sequence can be generated regardless of the specification of the display such as resolution and frame rate of the display.

Type: Grant

Filed: May 19, 2005

Date of Patent: October 15, 2013

Assignees: Sony Corporation, Sony Computer Entertainment Inc.

Inventors: Sachiyo Aoki, Akio Ohba, Masaaki Oka, Nobuo Sasaki
System and method for processing data signals

Patent number: 8559688

Abstract: A signal processing method is presented. The method includes acquiring undersampled data corresponding to an object, initializing a first image solution and a second image solution, determining a linear combination solution based upon the first image solution and the second image solution, generating a plurality of selected coefficients by iteratively updating the first image solution, the second image solution and the linear combination solution and adaptively thresholding one or more transform coefficients utilizing the undersampled data, an updated first image solution, an updated second image solution and an updated linear combination solution, and reconstructing a data signal using the plurality of selected coefficients.

Type: Grant

Filed: June 30, 2010

Date of Patent: October 15, 2013

Assignee: General Electric Company

Inventors: Kedar Bhalchandra Khare, Kevin Franklin King, Luca Marinelli, Christopher Judson Hardy
Using a video processing and text extraction method to identify video segments of interest

Patent number: 8559720

Abstract: A video processing technique is performed on a video media service to identify video segments of interest. The video processing technique (200) is supplemented by a text extraction technique (245, 255, 270), as well. The resulting video segments of interest (289) can be stored as to produce a version of the media service which is shorter in time length than the original video media service.

Type: Grant

Filed: March 30, 2009

Date of Patent: October 15, 2013

Assignee: Thomson Licensing S.A.

Inventors: Ruiduo Yang, Ying Luo, Claire-Hélène Demarty, Lionel Oisel
Image forming apparatus to correct misaligned scanned document and method of controlling the same

Patent number: 8553239

Abstract: A sheet of a document is scanned, character objects are extracted from a scan image, the extracted character objects are divided line by line, and the direction of the document is set on the basis of a blank percentage determined according to start and end positions of lines. If the direction of the document is different from that of a previous document, an image processing unit rotates the scan image.

Type: Grant

Filed: April 16, 2008

Date of Patent: October 8, 2013

Assignee: Samsung Electronics Co., Ltd

Inventor: Hyung Soo Ohk
CONVERSION OF A DOCUMENT OF CAPTURED IMAGES INTO A FORMAT FOR OPTIMIZED DISPLAY ON A MOBILE DEVICE

Publication number: 20130259377

Abstract: Systems may be provided for recording a document with a camera-based mobile radio device and for converting textual information in the document into a format for suitable presentation on the mobile device. A document may be recorded by the mobile device in an image. A layout structure may be recognized with a text block in the image. Character text in the text block may be recognized by OCR. An order of the text blocks may be determined by taking into account the layout structure. A suitable format for presenting the character texts on the mobile device's display may be selected. The format may be adapted to a width of the display so that during reading of the character texts on the display, substantially only vertical scrolling is necessary. A file may be generated and displayed in the format with the character texts in the determined order of the text blocks.

Type: Application

Filed: March 28, 2013

Publication date: October 3, 2013

Applicant: Nuance Communications, Inc.

Inventor: Herr Cüneyt Göktekin
Information processing apparatus and information processing method

Patent number: 8548250

Abstract: An information processing apparatus is disclosed, including: a reading part reading vector information included in an electronic file; a first line segment extraction part extracting line segment parameter information of a line object from the vector information; a second line segment extraction part extracting polygon parameter information of a polygon object from the vector information and extracting the line segment parameter information of line segments forming the polygon object from the extracted polygon parameter information; a rectangle extraction part extracting rectangle parameter information based on the line segment parameter; a minimum rectangle determination part determining whether or not a rectangle formed based on the rectangle parameter information is a minimum rectangle which does not connote other rectangles; and a minimum rectangle output part outputting the minimum rectangle.

Type: Grant

Filed: November 5, 2008

Date of Patent: October 1, 2013

Assignee: Ricoh Company, Ltd.

Inventor: Kunio Okita
Method and user interface for performing a scan operation for a scanner coupled to a computer system

Patent number: RE44528

Abstract: A method and user interface is provided for use on a computer system coupled with a scanner for performing a scan operation on an original document, which allows the user to acquire scanned images in an easier and more user-friendly manner. The method allows the user to scan an original document without requiring the user to have learned knowledge background in the science of image processing, and also allows the scanner to perform only one scan operation on the original document. These features allow the use of the scanner to be easier and more user-friendly than the prior art. By the, method, the first step is to determine a set of image processing settings by a scanner driving program that are suited for optimal scan of the original document; and then the scanner is activated to perform a scan operation on the original document based on the image processing settings to thereby obtain a primitive scanned image.

Type: Grant

Filed: November 7, 2011

Date of Patent: October 8, 2013

Assignee: Intellectual Ventures I LLC

Inventors: Chuan-Yu Hsu, Jay Liu, T. J. Hsu

prev … 4 5 6 7 8 9 10 11 12 … next