Distinguishing Text From Other Regions Patents (Class 382/176)
  • Patent number: 8682075
    Abstract: Data representing an image of text is received, as is data representing the text in non-image form. A valid content boundary within the image of the text is determined. For each character within the text in the non-image form, a location of the character within the image of the text is determined. Where the location of the character within the image of the text falls outside the valid content boundary, the character is removed from the data representing the text in the non-image form.
    Type: Grant
    Filed: December 28, 2010
    Date of Patent: March 25, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Prakash Reddy
  • Patent number: 8682077
    Abstract: The invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image. An optical reader configured in accordance with the invention searches for pixel groupings in a starburst pattern, and subjects located pixel groupings to a preliminary edge crawling process which records the pixel position of the grouping's edge and records the count of edge pixels. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a centerline connecting the center points of the two similarly sized groupings. A reader according to the invention searches for additional recognizable characters within the rail area, and subjects each located pixel grouping within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge.
    Type: Grant
    Filed: June 11, 2010
    Date of Patent: March 25, 2014
    Assignee: Hand Held Products, Inc.
    Inventor: Andrew Longacre, Jr.
  • Publication number: 20140079316
    Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.
    Type: Application
    Filed: September 17, 2013
    Publication date: March 20, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
  • Patent number: 8675260
    Abstract: According to one embodiment, the image processing apparatus includes a printing control unit, an image reading unit, an extracting unit, a difference image extracting unit, and a determination unit. The printing control unit controls printing of a plurality of pages on one sheet of paper according to a print setting information which indicates a printing form, and printing of a code indicating the print setting information on the paper. The image reading unit read the paper. The extracting unit extracts the code from the read image. The difference image extracting unit extracts a difference image between the printed image and the read image.
    Type: Grant
    Filed: March 14, 2012
    Date of Patent: March 18, 2014
    Assignee: Toshiba Tec Kabushiki Kaisha
    Inventors: Shigeo Uchida, Taira Ashikawa, Satoshi Oyama, Katsuhito Mochizuki
  • Patent number: 8667410
    Abstract: In a method for computer-aided transfer of data from a document application into a data application having a set of data fields, a document is displayed in the document application opened on a computer with a display device, and wherein from the document data are to be transferred into the data application also opened on the computer. A name of a data field into which data are to be transferred is displayed on the display device. Via identification of a corresponding data value in the document on the display device, a character string representing the data value is automatically read out from the document and entered into the data field corresponding to the data field name in the data application via actuation of a predetermined button.
    Type: Grant
    Filed: July 4, 2006
    Date of Patent: March 4, 2014
    Assignee: Open Text S.A.
    Inventor: Johannes Schacht
  • Patent number: 8666185
    Abstract: A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.
    Type: Grant
    Filed: November 17, 2011
    Date of Patent: March 4, 2014
    Assignee: I.R.I.S.
    Inventors: Michel Dauw, Pierre Demuelenaere
  • Patent number: 8659801
    Abstract: An image forming apparatus includes an image interpolation unit to compute a correct pixel value of a target pixel subject to interpolation of a halftone image. The image interpolation unit includes a base pattern setting unit to set a base pattern including the target pixel, a reference pattern setting unit to set reference pattern in a region peripheral to the target pixel, an analogous pattern acquisition unit to acquire at least one analogous pattern analogous to the base pattern from the reference pattern, a high-resolution pattern creating unit to create a high-resolution pattern having a predetermined resolution or higher by synthesizing the acquired analogous pattern, a pixel value estimating unit to compute an estimated pixel value of the target pixel based on the created high-resolution pattern, and a pixel value determination unit to determine the correct pixel value of the target pixel based on the computed estimated pixel value.
    Type: Grant
    Filed: August 23, 2011
    Date of Patent: February 25, 2014
    Assignee: Ricoh Company, Ltd.
    Inventor: Satoshi Nakamura
  • Patent number: 8655074
    Abstract: A method for storing a document recognition result is proposed. The method includes selecting a picture area from a document image, storing an image of the selected picture area in an image file format, removing the selected picture area, filling the removed picture area with a surrounding background color, and performing character recognition of a text area.
    Type: Grant
    Filed: February 2, 2011
    Date of Patent: February 18, 2014
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Ji-Hoon Kim, Sang-Ho Kim, Seong-Taek Hwang, Dong-Chang Lee
  • Patent number: 8649552
    Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.
    Type: Grant
    Filed: April 3, 2008
    Date of Patent: February 11, 2014
    Assignee: International Business Machines Corporation
    Inventors: Sreeram Viswanath Balakrishnan, Rema Ananthanarayanan, Souptik Datta
  • Publication number: 20140037210
    Abstract: The present disclosure includes a system and method for symbol compression using conditional entropy estimation. One method for symbol compression using conditional entropy estimation includes approximating a quantity of symbol encoding bits for a number of symbols using a conditional entropy estimation. Dictionary entries are generated from the number of symbols so as to minimize a total bit-stream quantity. The total bit-stream quantity includes at least the approximated quantity of symbol encoding bits and a quantity of dictionary entries encoding bits. The symbols are encoded using the dictionary entries as a reference.
    Type: Application
    Filed: July 31, 2012
    Publication date: February 6, 2014
    Inventors: Dejan Depalov, Peter Bauer, Charles A. Bouman, Jan Allebach, Yandong Guo
  • Patent number: 8643910
    Abstract: An image forming apparatus includes an acquiring unit that acquires image data expressing an image region included in an image with a first value and a background region included in the image with a second value; a segmenting unit that segments the image region into multiple segments arranged in a fast scanning direction; a converting unit that converts a value of at least one of the segments into the second value; an output unit that generates an image signal on the basis of the image data and outputs the image signal; an exposure unit that exposes a charged image bearing member to light according to the output image signal by scanning the light thereto in the fast scanning direction so as to form a latent image; and a developing unit that forms the image by developing the latent image using an invisible toner that absorbs infrared light or ultraviolet light.
    Type: Grant
    Filed: July 27, 2011
    Date of Patent: February 4, 2014
    Assignee: Fuji Xerox Co., Ltd.
    Inventor: Junichi Ichikawa
  • Patent number: 8644610
    Abstract: Present invention relates to a method and system for automatic searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using an OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information is sent to the user.
    Type: Grant
    Filed: August 9, 2012
    Date of Patent: February 4, 2014
    Assignee: A9.com, Inc.
    Inventors: Gurumurthy D. Ramkumar, Raghavan Manmatha, Supratik Bhattacharyya, Gautam Bhargava, Mark A. Ruzon
  • Patent number: 8644881
    Abstract: A method for controlling a mobile terminal, and which includes receiving, via an input unit, a selection signal indicating a selection of a predetermined button among multiple predetermined buttons on the mobile terminal, in which the multiple predetermined buttons corresponding to different preset functions executed on the mobile terminal; capturing, via a camera included on the mobile terminal, a preview image of an object upon receiving the selection signal; recognizing, via a controller included on the mobile terminal, a character string included in the captured preview image; and performing, via the controller, a preset function using the recognized character string and that corresponds to the selection of the predetermined button.
    Type: Grant
    Filed: June 9, 2010
    Date of Patent: February 4, 2014
    Assignee: LG Electronics Inc.
    Inventors: Yoon-Ho Kim, Hye-Jin Oh
  • Patent number: 8643741
    Abstract: Devices, methods, and computer readable media for performing image orientation detection using image processing techniques are described. In one implementation, an image processing method is disclosed that obtains image data from a first image captured by an image sensor (e.g., from any image capture electronic device). Positional sensor data captured by the device and corresponding to the image data may also be acquired (e.g., through an accelerometer). If the orientation of the device is not reliably discernible from the positional sensor data, the method may attempt to use rotationally invariant character detection metrics to determine the most likely orientation of the image, e.g., by using a decision forest algorithm. Face detection information may be used in conjunction with, or as a substitute for, the character detection data based on one or more priority parameters. Image orientation information may then be included within the image's metadata.
    Type: Grant
    Filed: January 17, 2012
    Date of Patent: February 4, 2014
    Assignee: Apple Inc.
    Inventor: Ralph Brunner
  • Patent number: 8639032
    Abstract: The present invention discloses methods of archiving and optimizing lectures, presentations and other captured video for playback, particularly for blind and low vision individuals. A digital imaging device captures a preselected field of view that is subject to periodic change such as a whiteboard in a classroom. A sequence of frames is captured. Frames associated with additions or erasures to the whiteboard are identified. The Cartesian coordinates of the regions of these alterations within the frame are identified. When the presentation is played back, the regions that are altered are enlarged or masked to assist the low vision user. In another embodiment of the invention, the timing of the alterations segments the recorded audio into chapters so that the blind user can skip forward and backward to different sections of the presentation.
    Type: Grant
    Filed: August 29, 2008
    Date of Patent: January 28, 2014
    Assignee: Freedom Scientific, Inc.
    Inventors: Garald Lee Voorhees, Robert Anders Steinberger, Ralph Ernest Ocampo
  • Publication number: 20140022406
    Abstract: An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.
    Type: Application
    Filed: March 14, 2013
    Publication date: January 23, 2014
    Applicant: QUALCOMM INCORPORATED
    Inventors: Pawan Kumar Baheti, Kishor K. Barman, Hemanth P. Acharya
  • Publication number: 20140023272
    Abstract: Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.
    Type: Application
    Filed: September 24, 2013
    Publication date: January 23, 2014
    Applicant: CANON KABUSHIKI KAISHA
    Inventors: Taeko Yamazaki, Tomotoshi Kanatsu, Makoto Enomoto, Kitahiro Kaneda
  • Patent number: 8634644
    Abstract: A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.
    Type: Grant
    Filed: August 25, 2009
    Date of Patent: January 21, 2014
    Assignee: Fuji Xerox Co., Ltd.
    Inventors: Patrick Chiu, Francine Chen, Laurent Denoue
  • Publication number: 20140009772
    Abstract: The disclosed embodiment relates to system and method for separating background image from foreground text in one or more electronic pages. The one or more electronic pages are compared to check whether the background image in each of the one or more electronic pages are same. If it found that the one or more electronic pages have common background image, the common background image is subtracted from each of the one or more pages. The foreground text from each of the one or more electronic pages is recognized using an OCR. Finally, the recognized foreground text from each of the one or more electronic pages is consolidated in a file. The consolidated file can be printed or send to one or more recipients over an email.
    Type: Application
    Filed: July 9, 2012
    Publication date: January 9, 2014
    Applicant: XEROX CORPORATION
    Inventor: Ying Gao
  • Patent number: 8625150
    Abstract: An image processing device includes an image data acquiring part that acquires the first and second image data, an edge characteristic extraction part that extracts first edges and second edges forming the shapes of the rectangular regions contained in the first and second image data, a rectangular characteristic calculating part that extracts both a first calculated rectangular region formed by the first edges and a second calculated rectangular region formed by the second edges, a position adjustment parameter calculating part that calculates parameters indicating a separation distance and a separation angle between the first calculated rectangular region and the second calculated rectangular region when the first image data and the second image data are overlapped, and an image data correction part that corrects at least one of the first image data and the second image data by shifting and rotating based upon the parameters.
    Type: Grant
    Filed: July 23, 2010
    Date of Patent: January 7, 2014
    Assignee: Oki Data Corporation
    Inventor: Tomonori Kondo
  • Patent number: 8625127
    Abstract: An image forming apparatus includes a receiving unit that receives image data; an extracting unit that extracts specific information from the image data; a first recognizing unit that recognizes destination information from the specific information; and a control unit that outputs the image data, wherein, when the first recognition unit recognizes a plurality of destination information, the control unit outputs the image data to respective destinations corresponding to each of the plurality of the destination information.
    Type: Grant
    Filed: February 11, 2009
    Date of Patent: January 7, 2014
    Assignee: Brother Kogyo Kabushiki Kaisha
    Inventor: Akihiro Yamada
  • Publication number: 20140003714
    Abstract: A user may perform an image search on an object shown in an image. The user may use a mobile device to display an image. In response to displaying the image, the client device may send the image to a visual search system for image segmentation. Upon receiving a segmented image from the visual search system, the client device may display the segmented image to the user who may select one or more segments including an object of interest to instantiate a search. The visual search system may formulate a search query based on the one or more selected segments and perform a search using the search query. The visual search system may then return search results to the client device for display to the user.
    Type: Application
    Filed: September 5, 2013
    Publication date: January 2, 2014
    Applicant: Microsoft Corporation
    Inventors: Tao Mei, Shipeng Li, Ying-Qing Xu, Ning Zhang, Zheng Chen, Jian-Tao Sun
  • Patent number: 8620081
    Abstract: An image processing apparatus determines an attribute of a block image based on the attribute of the block image determined based on a color distribution characteristic amount of the block image and the attribute of the block image determined based on an edge characteristic amount of the block image.
    Type: Grant
    Filed: November 7, 2011
    Date of Patent: December 31, 2013
    Assignee: Canon Kabushiki Kaisha
    Inventors: Xiaoyan Dai, Taeko Yamazaki
  • Patent number: 8620080
    Abstract: Aspects of the present invention relate to systems and methods for locating text in a digital image. According to a first aspect of the present invention, a multi-stage filtering technique may be used to progressively refine a set of candidate text components associated with a digital image. A first, refined set of candidate text components may be formed by filtering an initial set of candidate text components based on component properties. Text lines may reconstructed from the first, refined set of candidate text components. The first, refined set of candidate text components may be further filtered based on text-line properties measured on the reconstructed text lines.
    Type: Grant
    Filed: September 26, 2008
    Date of Patent: December 31, 2013
    Assignee: Sharp Laboratories of America, Inc.
    Inventor: Ahmet Mufit Ferman
  • Patent number: 8620139
    Abstract: Processing video for utilization in second language learning is described herein. A video file includes spoken words in a source language, subtitles in the source language, and subtitles in a native language of an end user (a target language). The subtitles in the source language are synchronized with the spoken words in the video, and the subtitles in the source language are mapped to the subtitles in the target language. Both sets of subtitles are displayed simultaneously as the video is played by the end user.
    Type: Grant
    Filed: April 29, 2011
    Date of Patent: December 31, 2013
    Assignee: Microsoft Corporation
    Inventors: Chi Ho Li, Matthew Robert Scott
  • Patent number: 8611662
    Abstract: A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.
    Type: Grant
    Filed: November 21, 2011
    Date of Patent: December 17, 2013
    Assignee: Nokia Corporation
    Inventors: Shang-hsuan Tsai, Vasudev Parameswaran, Radek Grzeszczuk
  • Patent number: 8611661
    Abstract: In some embodiments, provided are procedures for processing images that may have different font sizes. In some embodiments, it involves OCR'ing with multiple passes at different resolutions.
    Type: Grant
    Filed: December 26, 2007
    Date of Patent: December 17, 2013
    Assignee: Intel Corporation
    Inventors: Oscar Nestares, Badusha Kalathiparambil
  • Publication number: 20130330004
    Abstract: As set forth herein, systems and methods facilitate providing an efficient edge-detection and closed-contour based approach for finding text in natural scenes such as photographic images, digital, and/or electronic images, and the like. Edge information (e.g., edges of structures or objects in the images) is obtained via an edge detection technique. Edges from text characters form closed contours even in the presence of reasonable levels of noise. Closed contour linking and candidate text line formation are two additional features of the described approach. A candidate text line classifier is applied to further screen out false-positive text identifications. Candidate text regions for placement of text in the natural scene of the electronic image are highlighted and presented to a user.
    Type: Application
    Filed: June 12, 2012
    Publication date: December 12, 2013
    Applicant: XEROX CORPORATION
    Inventors: Raja Bala, Zhigang Fan, Hengzhou Ding, Jan P. Allebach, Charles A. Bouman
  • Publication number: 20130330003
    Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.
    Type: Application
    Filed: June 7, 2012
    Publication date: December 12, 2013
    Applicant: AMAZON TECHNOLOGIES, INC.
    Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
  • Patent number: 8606011
    Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.
    Type: Grant
    Filed: June 7, 2012
    Date of Patent: December 10, 2013
    Assignee: Amazon Technologies, Inc.
    Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
  • Patent number: 8606010
    Abstract: A processor and method make use of multiple weak classifiers to construct a single strong classifier to identify regions that contain text within an input image document. The weak classifiers are grouped by their computing cost from low to median to high, and each weak classifier is assigned a weight value based on its ability to accurately identify text regions. A level 1 classifier is constructed by selecting weak classifiers from the low group, a level 2 classifier is constructed by selecting weak classifiers from the low and median groups, and a level 3 classifier is constructed by selecting weak classifiers from the low, median and high groups. Regions that the level 1 classifier identifies as containing text are submitted to the level 2 classifier, and regions that the level 2 classifier identifies as containing text are submitted to the level 3 classifier.
    Type: Grant
    Filed: March 18, 2011
    Date of Patent: December 10, 2013
    Assignee: Seiko Epson Corporation
    Inventor: Jing Xiao
  • Patent number: 8593666
    Abstract: A method and system for printing a web page include converting the web page content to an image and segmenting the image into a plurality of regions. At least one of the regions is selected, and the selected region is printed.
    Type: Grant
    Filed: February 11, 2009
    Date of Patent: November 26, 2013
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Jun Xiao
  • Patent number: 8594431
    Abstract: A method and system for recognizing a character affected by a noise or an obstruction is disclosed. After receiving an image with characters, a character being affected by a noise or an obstruction is determined. Then, areas in the character where the noise or obstruction affected are precisely located. Templates representing every possible character in the image are updated by removing equivalent areas to the areas in the character being affected by the noise or obstruction. Then, the character is classified in a template among the updated templates by finding the template having the highest number of matching pixels with the character.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: November 26, 2013
    Assignee: International Business Machines Corporation
    Inventors: Ami Ben-Horesh, Amir Geva, Eugeniusz Walach
  • Patent number: 8588526
    Abstract: A visualization program, method and apparatus for determining reading order of content in a structured document. The method includes generating, for each of a plurality of elements, a directed segment; storing, in the reading order, the generated directed segments of the elements into a storage device; reading from the storage device; linking together the directed segments for the elements in accordance with the reading order; and displaying the linked directed segments overlaid on the structured document which is displayed on the screen. A computer implemented program and an apparatus for carrying out the above method are also provided.
    Type: Grant
    Filed: November 15, 2012
    Date of Patent: November 19, 2013
    Assignee: International Business Machines Corporation
    Inventor: Daisuke Sato
  • Patent number: 8582886
    Abstract: Embodiments of the invention compress an image that contains a representation of text. Embodiments take an image of graphical data and determines one or more portions of that image that have a high probability of containing text. Embodiments then take each such portion of the image and determines one or more rows of text within each portion (where text does, in fact, exist within the portion). The embodiments then traverse each vertical band of pixels of each row to determine sub-glyphs. Where a particular sub-glyph is encountered for the first time, the embodiments cache that sub-glyph, and send it (or a compressed representation thereof) to a client in a remote presentation session. Where a particular sub-glyph has been cached already, the embodiments send a reference to that cached vertical band to the client.
    Type: Grant
    Filed: May 19, 2011
    Date of Patent: November 12, 2013
    Assignee: Microsoft Corporation
    Inventors: Nadim Y. Abdo, Voicu Anton Albu
  • Patent number: 8582876
    Abstract: One or more portions of a compound image may be classified as picture portions and at least one remaining portion of the compound image may be classified as a non-picture portion. A first layer of a layered image may be generated based on the picture portions of the compound image. The first layer may be compliant with a first image format. A second layer of the layered image may be generated based on the non-picture portion. The second layer may be compliant with a second image format that is different from the first image format. The first layer and the second layer may be sent to a web browser. The first format and the second format may be supported by the web browser.
    Type: Grant
    Filed: November 15, 2011
    Date of Patent: November 12, 2013
    Assignee: Microsoft Corporation
    Inventors: Huifeng Shen, Zhaotai Pan, Yan Lu, Shipeng Li
  • Publication number: 20130294693
    Abstract: The noise in an image having text is removed by convolving a shaped kernel centered on a pixel for each pixel in the image. The shaped kernel has a shape configured to identify pixels that are not part of the text. For example, the shaped kernel may be shaped with zeros in a center of the kernel to identify pixels that are not part of the text. A value for the pixel is set to erase the pixel when the resulting convolution value for the pixel is less than a threshold. The process may be repeated multiple times for differently shaped kernels, including kernels of different sizes and different configurations, such as having values greater than one in at least one of a row, column, and diagonal.
    Type: Application
    Filed: January 3, 2013
    Publication date: November 7, 2013
    Applicant: QUALCOMM Incorporated
    Inventors: Ramin Rezaiifar, Serafin Diaz Spindola
  • Publication number: 20130294690
    Abstract: Techniques for identifying documents sharing common underlying structures in a large collection of documents and processing the documents using the identified structures are disclosed. Images of the document collection are processed to detect occurrences of a predetermined set of image features that are common or similar among forms. The images are then indexed in an image index based on the detected image features. A graph of nodes is built. Nodes in the graph represent images and are connected to nodes representing similar document images by edges. Documents sharing common underlying structures are identified by gathering strongly inter-connected nodes in the graph. The identified documents are processed based at least in part on the resulting clusters.
    Type: Application
    Filed: July 8, 2013
    Publication date: November 7, 2013
    Inventors: Shlomo Urbach, Eyal Fink, Tal Yadid, Yuval Netzer
  • Patent number: 8577144
    Abstract: Embodiments disclosed include methods for connected component labeling including labeling groups of raw data as one or more regions, the labeling including designating one or more data structures as containing information about the one or more regions; designating one or more of the regions as one or more subregions to expose a spatial distribution of one or more region features; and arranging at least one memory array with a 1:1 correspondence to a data array associated with the raw data to enable one or more data structures to include feature labels of the one or more subregions, the 1:1 correspondence enabling acquisition of the one or more region features with a controllable precision.
    Type: Grant
    Filed: August 27, 2012
    Date of Patent: November 5, 2013
    Assignee: eyeP, Inc.
    Inventor: Craig Sullender
  • Patent number: 8571319
    Abstract: According to one embodiment of the present invention, a method for processing forms based on an image is presented. A form is captured in an image, and a number of field values within the form in the image are detected. The number of field values is stored in the image metadata. In another illustrative embodiment, an access request for a form is detected. A determination is made as to whether the form corresponds to a stored image in a number of stored images. If the form corresponds to a stored image, metadata associated with the stored image is retrieved. The metadata includes a number of field values and associated textual data corresponding to the form. The form is populated with the number of field values and the associated textual data from the metadata associated with the stored image.
    Type: Grant
    Filed: July 28, 2009
    Date of Patent: October 29, 2013
    Assignee: International Business Machines Corporation
    Inventors: Swaminathan Balasubramanian, Andrew R. Jones, Brian M. O'Connell, Keith R. Walker
  • Patent number: 8571330
    Abstract: A method for selecting a video thumbnail includes generating a visual theme model for a sample set of images that are representative of textual information corresponding to a video file. Each of a set of candidate key frames is distinguished according to similarities shared between the candidate key frames and the visual theme model. A display is caused of a selected one of the distinguished candidate key frames as a video thumbnail for the video file.
    Type: Grant
    Filed: September 17, 2009
    Date of Patent: October 29, 2013
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Yuli Gao, Tong Zhang, Jun Xiao
  • Publication number: 20130272610
    Abstract: Provided is an image processing apparatus including: a grouping preference unit configured to register user preference information on a storage device based on a user operation, the user preference information indicating how objects within an image are to be classified into groups; an image analysis unit configured to detect the objects within the image; and a grouping unit configured to read the user preference information from the storage device and classify the objects detected within the image into the groups indicated in the read user preference information.
    Type: Application
    Filed: April 10, 2013
    Publication date: October 17, 2013
    Applicant: KYOCERA Document Solutions Inc.
    Inventors: Ryosuke Ogishi, Yosuke Kashimoto, Masaaki Aiba, Takashi Murakami
  • Publication number: 20130272612
    Abstract: The present invention provides a method of providing online information using image, including separating each of a target image, received from a user terminal, and an original image, received from an information provider apparatus, into a text region and a graphic region; selecting an important text region from the text region; extracting features from the text region, the graphic region, and the important text region, respectively; searching for the original image corresponding to the target image using the features of the text region, the graphic region, and the important text region; and searching for supplementary information related to the retrieved original image and provided the retrieved supplementary information.
    Type: Application
    Filed: March 26, 2013
    Publication date: October 17, 2013
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Jae Cheol SIM, Kang Yong LEE
  • Patent number: 8559798
    Abstract: A rendering process for rendering an image frame and a postprocess for adapting the image frame to a display are separated. A rendering processing unit 42 generates an image frame sequence by performing rendering at a predetermined frame rate regardless of a condition that the image frame should meet for output to the display. A postprocessing unit 50 subjects the image frame sequence generated by the rendering processing unit to a merge process so as to generate and output an updated image frame sequence that meets the condition. Since the rendering process and the postprocess are separated, the image frame sequence can be generated regardless of the specification of the display such as resolution and frame rate of the display.
    Type: Grant
    Filed: May 19, 2005
    Date of Patent: October 15, 2013
    Assignees: Sony Corporation, Sony Computer Entertainment Inc.
    Inventors: Sachiyo Aoki, Akio Ohba, Masaaki Oka, Nobuo Sasaki
  • Patent number: 8559688
    Abstract: A signal processing method is presented. The method includes acquiring undersampled data corresponding to an object, initializing a first image solution and a second image solution, determining a linear combination solution based upon the first image solution and the second image solution, generating a plurality of selected coefficients by iteratively updating the first image solution, the second image solution and the linear combination solution and adaptively thresholding one or more transform coefficients utilizing the undersampled data, an updated first image solution, an updated second image solution and an updated linear combination solution, and reconstructing a data signal using the plurality of selected coefficients.
    Type: Grant
    Filed: June 30, 2010
    Date of Patent: October 15, 2013
    Assignee: General Electric Company
    Inventors: Kedar Bhalchandra Khare, Kevin Franklin King, Luca Marinelli, Christopher Judson Hardy
  • Patent number: 8559720
    Abstract: A video processing technique is performed on a video media service to identify video segments of interest. The video processing technique (200) is supplemented by a text extraction technique (245, 255, 270), as well. The resulting video segments of interest (289) can be stored as to produce a version of the media service which is shorter in time length than the original video media service.
    Type: Grant
    Filed: March 30, 2009
    Date of Patent: October 15, 2013
    Assignee: Thomson Licensing S.A.
    Inventors: Ruiduo Yang, Ying Luo, Claire-Hélène Demarty, Lionel Oisel
  • Patent number: 8553239
    Abstract: A sheet of a document is scanned, character objects are extracted from a scan image, the extracted character objects are divided line by line, and the direction of the document is set on the basis of a blank percentage determined according to start and end positions of lines. If the direction of the document is different from that of a previous document, an image processing unit rotates the scan image.
    Type: Grant
    Filed: April 16, 2008
    Date of Patent: October 8, 2013
    Assignee: Samsung Electronics Co., Ltd
    Inventor: Hyung Soo Ohk
  • Publication number: 20130259377
    Abstract: Systems may be provided for recording a document with a camera-based mobile radio device and for converting textual information in the document into a format for suitable presentation on the mobile device. A document may be recorded by the mobile device in an image. A layout structure may be recognized with a text block in the image. Character text in the text block may be recognized by OCR. An order of the text blocks may be determined by taking into account the layout structure. A suitable format for presenting the character texts on the mobile device's display may be selected. The format may be adapted to a width of the display so that during reading of the character texts on the display, substantially only vertical scrolling is necessary. A file may be generated and displayed in the format with the character texts in the determined order of the text blocks.
    Type: Application
    Filed: March 28, 2013
    Publication date: October 3, 2013
    Applicant: Nuance Communications, Inc.
    Inventor: Herr Cüneyt Göktekin
  • Patent number: 8548250
    Abstract: An information processing apparatus is disclosed, including: a reading part reading vector information included in an electronic file; a first line segment extraction part extracting line segment parameter information of a line object from the vector information; a second line segment extraction part extracting polygon parameter information of a polygon object from the vector information and extracting the line segment parameter information of line segments forming the polygon object from the extracted polygon parameter information; a rectangle extraction part extracting rectangle parameter information based on the line segment parameter; a minimum rectangle determination part determining whether or not a rectangle formed based on the rectangle parameter information is a minimum rectangle which does not connote other rectangles; and a minimum rectangle output part outputting the minimum rectangle.
    Type: Grant
    Filed: November 5, 2008
    Date of Patent: October 1, 2013
    Assignee: Ricoh Company, Ltd.
    Inventor: Kunio Okita
  • Patent number: RE44528
    Abstract: A method and user interface is provided for use on a computer system coupled with a scanner for performing a scan operation on an original document, which allows the user to acquire scanned images in an easier and more user-friendly manner. The method allows the user to scan an original document without requiring the user to have learned knowledge background in the science of image processing, and also allows the scanner to perform only one scan operation on the original document. These features allow the use of the scanner to be easier and more user-friendly than the prior art. By the, method, the first step is to determine a set of image processing settings by a scanner driving program that are suited for optimal scan of the original document; and then the scanner is activated to perform a scan operation on the original document based on the image processing settings to thereby obtain a primitive scanned image.
    Type: Grant
    Filed: November 7, 2011
    Date of Patent: October 8, 2013
    Assignee: Intellectual Ventures I LLC
    Inventors: Chuan-Yu Hsu, Jay Liu, T. J. Hsu