Distinguishing Text From Other Regions Patents (Class 382/176)
-
Patent number: 8682075Abstract: Data representing an image of text is received, as is data representing the text in non-image form. A valid content boundary within the image of the text is determined. For each character within the text in the non-image form, a location of the character within the image of the text is determined. Where the location of the character within the image of the text falls outside the valid content boundary, the character is removed from the data representing the text in the non-image form.Type: GrantFiled: December 28, 2010Date of Patent: March 25, 2014Assignee: Hewlett-Packard Development Company, L.P.Inventor: Prakash Reddy
-
Patent number: 8682077Abstract: The invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image. An optical reader configured in accordance with the invention searches for pixel groupings in a starburst pattern, and subjects located pixel groupings to a preliminary edge crawling process which records the pixel position of the grouping's edge and records the count of edge pixels. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a centerline connecting the center points of the two similarly sized groupings. A reader according to the invention searches for additional recognizable characters within the rail area, and subjects each located pixel grouping within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge.Type: GrantFiled: June 11, 2010Date of Patent: March 25, 2014Assignee: Hand Held Products, Inc.Inventor: Andrew Longacre, Jr.
-
Publication number: 20140079316Abstract: An approach to segmentation or clustering of a set of elements combines separate procedures and uses training data for those procedures on labeled data. This approach is applied to elements being components of an image of text (e.g., printed or handwritten). In some examples, the elements are connected sets of pixels. In images of text, the clusters can correspond to individual lines. The approach provides improved clustering performance as compared to any one of the procedures taken alone.Type: ApplicationFiled: September 17, 2013Publication date: March 20, 2014Applicant: Raytheon BBN Technologies Corp.Inventors: Shiv N. Vitaladevuni, Rohit Prasad, Premkumar Natarajan
-
Patent number: 8675260Abstract: According to one embodiment, the image processing apparatus includes a printing control unit, an image reading unit, an extracting unit, a difference image extracting unit, and a determination unit. The printing control unit controls printing of a plurality of pages on one sheet of paper according to a print setting information which indicates a printing form, and printing of a code indicating the print setting information on the paper. The image reading unit read the paper. The extracting unit extracts the code from the read image. The difference image extracting unit extracts a difference image between the printed image and the read image.Type: GrantFiled: March 14, 2012Date of Patent: March 18, 2014Assignee: Toshiba Tec Kabushiki KaishaInventors: Shigeo Uchida, Taira Ashikawa, Satoshi Oyama, Katsuhito Mochizuki
-
Patent number: 8667410Abstract: In a method for computer-aided transfer of data from a document application into a data application having a set of data fields, a document is displayed in the document application opened on a computer with a display device, and wherein from the document data are to be transferred into the data application also opened on the computer. A name of a data field into which data are to be transferred is displayed on the display device. Via identification of a corresponding data value in the document on the display device, a character string representing the data value is automatically read out from the document and entered into the data field corresponding to the data field name in the data application via actuation of a predetermined button.Type: GrantFiled: July 4, 2006Date of Patent: March 4, 2014Assignee: Open Text S.A.Inventor: Johannes Schacht
-
Patent number: 8666185Abstract: A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.Type: GrantFiled: November 17, 2011Date of Patent: March 4, 2014Assignee: I.R.I.S.Inventors: Michel Dauw, Pierre Demuelenaere
-
Patent number: 8659801Abstract: An image forming apparatus includes an image interpolation unit to compute a correct pixel value of a target pixel subject to interpolation of a halftone image. The image interpolation unit includes a base pattern setting unit to set a base pattern including the target pixel, a reference pattern setting unit to set reference pattern in a region peripheral to the target pixel, an analogous pattern acquisition unit to acquire at least one analogous pattern analogous to the base pattern from the reference pattern, a high-resolution pattern creating unit to create a high-resolution pattern having a predetermined resolution or higher by synthesizing the acquired analogous pattern, a pixel value estimating unit to compute an estimated pixel value of the target pixel based on the created high-resolution pattern, and a pixel value determination unit to determine the correct pixel value of the target pixel based on the computed estimated pixel value.Type: GrantFiled: August 23, 2011Date of Patent: February 25, 2014Assignee: Ricoh Company, Ltd.Inventor: Satoshi Nakamura
-
Patent number: 8655074Abstract: A method for storing a document recognition result is proposed. The method includes selecting a picture area from a document image, storing an image of the selected picture area in an image file format, removing the selected picture area, filling the removed picture area with a surrounding background color, and performing character recognition of a text area.Type: GrantFiled: February 2, 2011Date of Patent: February 18, 2014Assignee: Samsung Electronics Co., LtdInventors: Ji-Hoon Kim, Sang-Ho Kim, Seong-Taek Hwang, Dong-Chang Lee
-
Patent number: 8649552Abstract: A data obfuscation method, apparatus and computer program product are disclosed in which at least selected text entities such as words or abbreviations in a document are obfuscated to prevent the disclosure of private information if the document is disclosed. A user establishes various configuration parameters for selected text entities desired to obfuscated. The document is processed and text entities matching the configuration parameters are tagged for obfuscation. The tagged entities are then substituted in the document with obfuscating text. The obfuscating text can be derived from a hash table. The hash table may be used to provide a reverse obfuscation method by which original data can be restored to an obfuscated document.Type: GrantFiled: April 3, 2008Date of Patent: February 11, 2014Assignee: International Business Machines CorporationInventors: Sreeram Viswanath Balakrishnan, Rema Ananthanarayanan, Souptik Datta
-
Publication number: 20140037210Abstract: The present disclosure includes a system and method for symbol compression using conditional entropy estimation. One method for symbol compression using conditional entropy estimation includes approximating a quantity of symbol encoding bits for a number of symbols using a conditional entropy estimation. Dictionary entries are generated from the number of symbols so as to minimize a total bit-stream quantity. The total bit-stream quantity includes at least the approximated quantity of symbol encoding bits and a quantity of dictionary entries encoding bits. The symbols are encoded using the dictionary entries as a reference.Type: ApplicationFiled: July 31, 2012Publication date: February 6, 2014Inventors: Dejan Depalov, Peter Bauer, Charles A. Bouman, Jan Allebach, Yandong Guo
-
Patent number: 8643910Abstract: An image forming apparatus includes an acquiring unit that acquires image data expressing an image region included in an image with a first value and a background region included in the image with a second value; a segmenting unit that segments the image region into multiple segments arranged in a fast scanning direction; a converting unit that converts a value of at least one of the segments into the second value; an output unit that generates an image signal on the basis of the image data and outputs the image signal; an exposure unit that exposes a charged image bearing member to light according to the output image signal by scanning the light thereto in the fast scanning direction so as to form a latent image; and a developing unit that forms the image by developing the latent image using an invisible toner that absorbs infrared light or ultraviolet light.Type: GrantFiled: July 27, 2011Date of Patent: February 4, 2014Assignee: Fuji Xerox Co., Ltd.Inventor: Junichi Ichikawa
-
Patent number: 8644610Abstract: Present invention relates to a method and system for automatic searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using an OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information is sent to the user.Type: GrantFiled: August 9, 2012Date of Patent: February 4, 2014Assignee: A9.com, Inc.Inventors: Gurumurthy D. Ramkumar, Raghavan Manmatha, Supratik Bhattacharyya, Gautam Bhargava, Mark A. Ruzon
-
Patent number: 8644881Abstract: A method for controlling a mobile terminal, and which includes receiving, via an input unit, a selection signal indicating a selection of a predetermined button among multiple predetermined buttons on the mobile terminal, in which the multiple predetermined buttons corresponding to different preset functions executed on the mobile terminal; capturing, via a camera included on the mobile terminal, a preview image of an object upon receiving the selection signal; recognizing, via a controller included on the mobile terminal, a character string included in the captured preview image; and performing, via the controller, a preset function using the recognized character string and that corresponds to the selection of the predetermined button.Type: GrantFiled: June 9, 2010Date of Patent: February 4, 2014Assignee: LG Electronics Inc.Inventors: Yoon-Ho Kim, Hye-Jin Oh
-
Patent number: 8643741Abstract: Devices, methods, and computer readable media for performing image orientation detection using image processing techniques are described. In one implementation, an image processing method is disclosed that obtains image data from a first image captured by an image sensor (e.g., from any image capture electronic device). Positional sensor data captured by the device and corresponding to the image data may also be acquired (e.g., through an accelerometer). If the orientation of the device is not reliably discernible from the positional sensor data, the method may attempt to use rotationally invariant character detection metrics to determine the most likely orientation of the image, e.g., by using a decision forest algorithm. Face detection information may be used in conjunction with, or as a substitute for, the character detection data based on one or more priority parameters. Image orientation information may then be included within the image's metadata.Type: GrantFiled: January 17, 2012Date of Patent: February 4, 2014Assignee: Apple Inc.Inventor: Ralph Brunner
-
Patent number: 8639032Abstract: The present invention discloses methods of archiving and optimizing lectures, presentations and other captured video for playback, particularly for blind and low vision individuals. A digital imaging device captures a preselected field of view that is subject to periodic change such as a whiteboard in a classroom. A sequence of frames is captured. Frames associated with additions or erasures to the whiteboard are identified. The Cartesian coordinates of the regions of these alterations within the frame are identified. When the presentation is played back, the regions that are altered are enlarged or masked to assist the low vision user. In another embodiment of the invention, the timing of the alterations segments the recorded audio into chapters so that the blind user can skip forward and backward to different sections of the presentation.Type: GrantFiled: August 29, 2008Date of Patent: January 28, 2014Assignee: Freedom Scientific, Inc.Inventors: Garald Lee Voorhees, Robert Anders Steinberger, Ralph Ernest Ocampo
-
Publication number: 20140022406Abstract: An electronic device and method use a camera to capture an image of an environment outside followed by identification of regions therein. A subset of the regions is selected, based on attributes of the regions, such as aspect ratio, height, and variance in stroke width. Next, a number of angles that are candidates for use as skew of the image are determined (e.g. one angle is selected for each region. based on peakiness of a histogram of the region, evaluated at different angles). Then, an angle that is most common among these candidates is identified as the angle of skew of the image. The just-described identification of skew angle is performed prior to classification of any region as text or non-text. After skew identification, at least all regions in the subset are rotated by negative of the skew angle, to obtain skew-corrected regions for use in optical character recognition.Type: ApplicationFiled: March 14, 2013Publication date: January 23, 2014Applicant: QUALCOMM INCORPORATEDInventors: Pawan Kumar Baheti, Kishor K. Barman, Hemanth P. Acharya
-
Publication number: 20140023272Abstract: Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.Type: ApplicationFiled: September 24, 2013Publication date: January 23, 2014Applicant: CANON KABUSHIKI KAISHAInventors: Taeko Yamazaki, Tomotoshi Kanatsu, Makoto Enomoto, Kitahiro Kaneda
-
Patent number: 8634644Abstract: A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.Type: GrantFiled: August 25, 2009Date of Patent: January 21, 2014Assignee: Fuji Xerox Co., Ltd.Inventors: Patrick Chiu, Francine Chen, Laurent Denoue
-
Publication number: 20140009772Abstract: The disclosed embodiment relates to system and method for separating background image from foreground text in one or more electronic pages. The one or more electronic pages are compared to check whether the background image in each of the one or more electronic pages are same. If it found that the one or more electronic pages have common background image, the common background image is subtracted from each of the one or more pages. The foreground text from each of the one or more electronic pages is recognized using an OCR. Finally, the recognized foreground text from each of the one or more electronic pages is consolidated in a file. The consolidated file can be printed or send to one or more recipients over an email.Type: ApplicationFiled: July 9, 2012Publication date: January 9, 2014Applicant: XEROX CORPORATIONInventor: Ying Gao
-
Patent number: 8625150Abstract: An image processing device includes an image data acquiring part that acquires the first and second image data, an edge characteristic extraction part that extracts first edges and second edges forming the shapes of the rectangular regions contained in the first and second image data, a rectangular characteristic calculating part that extracts both a first calculated rectangular region formed by the first edges and a second calculated rectangular region formed by the second edges, a position adjustment parameter calculating part that calculates parameters indicating a separation distance and a separation angle between the first calculated rectangular region and the second calculated rectangular region when the first image data and the second image data are overlapped, and an image data correction part that corrects at least one of the first image data and the second image data by shifting and rotating based upon the parameters.Type: GrantFiled: July 23, 2010Date of Patent: January 7, 2014Assignee: Oki Data CorporationInventor: Tomonori Kondo
-
Patent number: 8625127Abstract: An image forming apparatus includes a receiving unit that receives image data; an extracting unit that extracts specific information from the image data; a first recognizing unit that recognizes destination information from the specific information; and a control unit that outputs the image data, wherein, when the first recognition unit recognizes a plurality of destination information, the control unit outputs the image data to respective destinations corresponding to each of the plurality of the destination information.Type: GrantFiled: February 11, 2009Date of Patent: January 7, 2014Assignee: Brother Kogyo Kabushiki KaishaInventor: Akihiro Yamada
-
Publication number: 20140003714Abstract: A user may perform an image search on an object shown in an image. The user may use a mobile device to display an image. In response to displaying the image, the client device may send the image to a visual search system for image segmentation. Upon receiving a segmented image from the visual search system, the client device may display the segmented image to the user who may select one or more segments including an object of interest to instantiate a search. The visual search system may formulate a search query based on the one or more selected segments and perform a search using the search query. The visual search system may then return search results to the client device for display to the user.Type: ApplicationFiled: September 5, 2013Publication date: January 2, 2014Applicant: Microsoft CorporationInventors: Tao Mei, Shipeng Li, Ying-Qing Xu, Ning Zhang, Zheng Chen, Jian-Tao Sun
-
Patent number: 8620081Abstract: An image processing apparatus determines an attribute of a block image based on the attribute of the block image determined based on a color distribution characteristic amount of the block image and the attribute of the block image determined based on an edge characteristic amount of the block image.Type: GrantFiled: November 7, 2011Date of Patent: December 31, 2013Assignee: Canon Kabushiki KaishaInventors: Xiaoyan Dai, Taeko Yamazaki
-
Patent number: 8620080Abstract: Aspects of the present invention relate to systems and methods for locating text in a digital image. According to a first aspect of the present invention, a multi-stage filtering technique may be used to progressively refine a set of candidate text components associated with a digital image. A first, refined set of candidate text components may be formed by filtering an initial set of candidate text components based on component properties. Text lines may reconstructed from the first, refined set of candidate text components. The first, refined set of candidate text components may be further filtered based on text-line properties measured on the reconstructed text lines.Type: GrantFiled: September 26, 2008Date of Patent: December 31, 2013Assignee: Sharp Laboratories of America, Inc.Inventor: Ahmet Mufit Ferman
-
Patent number: 8620139Abstract: Processing video for utilization in second language learning is described herein. A video file includes spoken words in a source language, subtitles in the source language, and subtitles in a native language of an end user (a target language). The subtitles in the source language are synchronized with the spoken words in the video, and the subtitles in the source language are mapped to the subtitles in the target language. Both sets of subtitles are displayed simultaneously as the video is played by the end user.Type: GrantFiled: April 29, 2011Date of Patent: December 31, 2013Assignee: Microsoft CorporationInventors: Chi Ho Li, Matthew Robert Scott
-
Patent number: 8611662Abstract: A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.Type: GrantFiled: November 21, 2011Date of Patent: December 17, 2013Assignee: Nokia CorporationInventors: Shang-hsuan Tsai, Vasudev Parameswaran, Radek Grzeszczuk
-
Patent number: 8611661Abstract: In some embodiments, provided are procedures for processing images that may have different font sizes. In some embodiments, it involves OCR'ing with multiple passes at different resolutions.Type: GrantFiled: December 26, 2007Date of Patent: December 17, 2013Assignee: Intel CorporationInventors: Oscar Nestares, Badusha Kalathiparambil
-
Publication number: 20130330004Abstract: As set forth herein, systems and methods facilitate providing an efficient edge-detection and closed-contour based approach for finding text in natural scenes such as photographic images, digital, and/or electronic images, and the like. Edge information (e.g., edges of structures or objects in the images) is obtained via an edge detection technique. Edges from text characters form closed contours even in the presence of reasonable levels of noise. Closed contour linking and candidate text line formation are two additional features of the described approach. A candidate text line classifier is applied to further screen out false-positive text identifications. Candidate text regions for placement of text in the natural scene of the electronic image are highlighted and presented to a user.Type: ApplicationFiled: June 12, 2012Publication date: December 12, 2013Applicant: XEROX CORPORATIONInventors: Raja Bala, Zhigang Fan, Hengzhou Ding, Jan P. Allebach, Charles A. Bouman
-
Publication number: 20130330003Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.Type: ApplicationFiled: June 7, 2012Publication date: December 12, 2013Applicant: AMAZON TECHNOLOGIES, INC.Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
-
Patent number: 8606011Abstract: Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.Type: GrantFiled: June 7, 2012Date of Patent: December 10, 2013Assignee: Amazon Technologies, Inc.Inventors: Volodymyr V. Ivanchenko, Geoffrey Scott Heller, Richard Howard Suplee, III, Daniel Bibireata
-
Patent number: 8606010Abstract: A processor and method make use of multiple weak classifiers to construct a single strong classifier to identify regions that contain text within an input image document. The weak classifiers are grouped by their computing cost from low to median to high, and each weak classifier is assigned a weight value based on its ability to accurately identify text regions. A level 1 classifier is constructed by selecting weak classifiers from the low group, a level 2 classifier is constructed by selecting weak classifiers from the low and median groups, and a level 3 classifier is constructed by selecting weak classifiers from the low, median and high groups. Regions that the level 1 classifier identifies as containing text are submitted to the level 2 classifier, and regions that the level 2 classifier identifies as containing text are submitted to the level 3 classifier.Type: GrantFiled: March 18, 2011Date of Patent: December 10, 2013Assignee: Seiko Epson CorporationInventor: Jing Xiao
-
Patent number: 8593666Abstract: A method and system for printing a web page include converting the web page content to an image and segmenting the image into a plurality of regions. At least one of the regions is selected, and the selected region is printed.Type: GrantFiled: February 11, 2009Date of Patent: November 26, 2013Assignee: Hewlett-Packard Development Company, L.P.Inventor: Jun Xiao
-
Patent number: 8594431Abstract: A method and system for recognizing a character affected by a noise or an obstruction is disclosed. After receiving an image with characters, a character being affected by a noise or an obstruction is determined. Then, areas in the character where the noise or obstruction affected are precisely located. Templates representing every possible character in the image are updated by removing equivalent areas to the areas in the character being affected by the noise or obstruction. Then, the character is classified in a template among the updated templates by finding the template having the highest number of matching pixels with the character.Type: GrantFiled: September 14, 2012Date of Patent: November 26, 2013Assignee: International Business Machines CorporationInventors: Ami Ben-Horesh, Amir Geva, Eugeniusz Walach
-
Patent number: 8588526Abstract: A visualization program, method and apparatus for determining reading order of content in a structured document. The method includes generating, for each of a plurality of elements, a directed segment; storing, in the reading order, the generated directed segments of the elements into a storage device; reading from the storage device; linking together the directed segments for the elements in accordance with the reading order; and displaying the linked directed segments overlaid on the structured document which is displayed on the screen. A computer implemented program and an apparatus for carrying out the above method are also provided.Type: GrantFiled: November 15, 2012Date of Patent: November 19, 2013Assignee: International Business Machines CorporationInventor: Daisuke Sato
-
Patent number: 8582886Abstract: Embodiments of the invention compress an image that contains a representation of text. Embodiments take an image of graphical data and determines one or more portions of that image that have a high probability of containing text. Embodiments then take each such portion of the image and determines one or more rows of text within each portion (where text does, in fact, exist within the portion). The embodiments then traverse each vertical band of pixels of each row to determine sub-glyphs. Where a particular sub-glyph is encountered for the first time, the embodiments cache that sub-glyph, and send it (or a compressed representation thereof) to a client in a remote presentation session. Where a particular sub-glyph has been cached already, the embodiments send a reference to that cached vertical band to the client.Type: GrantFiled: May 19, 2011Date of Patent: November 12, 2013Assignee: Microsoft CorporationInventors: Nadim Y. Abdo, Voicu Anton Albu
-
Patent number: 8582876Abstract: One or more portions of a compound image may be classified as picture portions and at least one remaining portion of the compound image may be classified as a non-picture portion. A first layer of a layered image may be generated based on the picture portions of the compound image. The first layer may be compliant with a first image format. A second layer of the layered image may be generated based on the non-picture portion. The second layer may be compliant with a second image format that is different from the first image format. The first layer and the second layer may be sent to a web browser. The first format and the second format may be supported by the web browser.Type: GrantFiled: November 15, 2011Date of Patent: November 12, 2013Assignee: Microsoft CorporationInventors: Huifeng Shen, Zhaotai Pan, Yan Lu, Shipeng Li
-
Publication number: 20130294693Abstract: The noise in an image having text is removed by convolving a shaped kernel centered on a pixel for each pixel in the image. The shaped kernel has a shape configured to identify pixels that are not part of the text. For example, the shaped kernel may be shaped with zeros in a center of the kernel to identify pixels that are not part of the text. A value for the pixel is set to erase the pixel when the resulting convolution value for the pixel is less than a threshold. The process may be repeated multiple times for differently shaped kernels, including kernels of different sizes and different configurations, such as having values greater than one in at least one of a row, column, and diagonal.Type: ApplicationFiled: January 3, 2013Publication date: November 7, 2013Applicant: QUALCOMM IncorporatedInventors: Ramin Rezaiifar, Serafin Diaz Spindola
-
Publication number: 20130294690Abstract: Techniques for identifying documents sharing common underlying structures in a large collection of documents and processing the documents using the identified structures are disclosed. Images of the document collection are processed to detect occurrences of a predetermined set of image features that are common or similar among forms. The images are then indexed in an image index based on the detected image features. A graph of nodes is built. Nodes in the graph represent images and are connected to nodes representing similar document images by edges. Documents sharing common underlying structures are identified by gathering strongly inter-connected nodes in the graph. The identified documents are processed based at least in part on the resulting clusters.Type: ApplicationFiled: July 8, 2013Publication date: November 7, 2013Inventors: Shlomo Urbach, Eyal Fink, Tal Yadid, Yuval Netzer
-
Patent number: 8577144Abstract: Embodiments disclosed include methods for connected component labeling including labeling groups of raw data as one or more regions, the labeling including designating one or more data structures as containing information about the one or more regions; designating one or more of the regions as one or more subregions to expose a spatial distribution of one or more region features; and arranging at least one memory array with a 1:1 correspondence to a data array associated with the raw data to enable one or more data structures to include feature labels of the one or more subregions, the 1:1 correspondence enabling acquisition of the one or more region features with a controllable precision.Type: GrantFiled: August 27, 2012Date of Patent: November 5, 2013Assignee: eyeP, Inc.Inventor: Craig Sullender
-
Patent number: 8571319Abstract: According to one embodiment of the present invention, a method for processing forms based on an image is presented. A form is captured in an image, and a number of field values within the form in the image are detected. The number of field values is stored in the image metadata. In another illustrative embodiment, an access request for a form is detected. A determination is made as to whether the form corresponds to a stored image in a number of stored images. If the form corresponds to a stored image, metadata associated with the stored image is retrieved. The metadata includes a number of field values and associated textual data corresponding to the form. The form is populated with the number of field values and the associated textual data from the metadata associated with the stored image.Type: GrantFiled: July 28, 2009Date of Patent: October 29, 2013Assignee: International Business Machines CorporationInventors: Swaminathan Balasubramanian, Andrew R. Jones, Brian M. O'Connell, Keith R. Walker
-
Patent number: 8571330Abstract: A method for selecting a video thumbnail includes generating a visual theme model for a sample set of images that are representative of textual information corresponding to a video file. Each of a set of candidate key frames is distinguished according to similarities shared between the candidate key frames and the visual theme model. A display is caused of a selected one of the distinguished candidate key frames as a video thumbnail for the video file.Type: GrantFiled: September 17, 2009Date of Patent: October 29, 2013Assignee: Hewlett-Packard Development Company, L.P.Inventors: Yuli Gao, Tong Zhang, Jun Xiao
-
Publication number: 20130272610Abstract: Provided is an image processing apparatus including: a grouping preference unit configured to register user preference information on a storage device based on a user operation, the user preference information indicating how objects within an image are to be classified into groups; an image analysis unit configured to detect the objects within the image; and a grouping unit configured to read the user preference information from the storage device and classify the objects detected within the image into the groups indicated in the read user preference information.Type: ApplicationFiled: April 10, 2013Publication date: October 17, 2013Applicant: KYOCERA Document Solutions Inc.Inventors: Ryosuke Ogishi, Yosuke Kashimoto, Masaaki Aiba, Takashi Murakami
-
Publication number: 20130272612Abstract: The present invention provides a method of providing online information using image, including separating each of a target image, received from a user terminal, and an original image, received from an information provider apparatus, into a text region and a graphic region; selecting an important text region from the text region; extracting features from the text region, the graphic region, and the important text region, respectively; searching for the original image corresponding to the target image using the features of the text region, the graphic region, and the important text region; and searching for supplementary information related to the retrieved original image and provided the retrieved supplementary information.Type: ApplicationFiled: March 26, 2013Publication date: October 17, 2013Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTEInventors: Jae Cheol SIM, Kang Yong LEE
-
Patent number: 8559798Abstract: A rendering process for rendering an image frame and a postprocess for adapting the image frame to a display are separated. A rendering processing unit 42 generates an image frame sequence by performing rendering at a predetermined frame rate regardless of a condition that the image frame should meet for output to the display. A postprocessing unit 50 subjects the image frame sequence generated by the rendering processing unit to a merge process so as to generate and output an updated image frame sequence that meets the condition. Since the rendering process and the postprocess are separated, the image frame sequence can be generated regardless of the specification of the display such as resolution and frame rate of the display.Type: GrantFiled: May 19, 2005Date of Patent: October 15, 2013Assignees: Sony Corporation, Sony Computer Entertainment Inc.Inventors: Sachiyo Aoki, Akio Ohba, Masaaki Oka, Nobuo Sasaki
-
Patent number: 8559688Abstract: A signal processing method is presented. The method includes acquiring undersampled data corresponding to an object, initializing a first image solution and a second image solution, determining a linear combination solution based upon the first image solution and the second image solution, generating a plurality of selected coefficients by iteratively updating the first image solution, the second image solution and the linear combination solution and adaptively thresholding one or more transform coefficients utilizing the undersampled data, an updated first image solution, an updated second image solution and an updated linear combination solution, and reconstructing a data signal using the plurality of selected coefficients.Type: GrantFiled: June 30, 2010Date of Patent: October 15, 2013Assignee: General Electric CompanyInventors: Kedar Bhalchandra Khare, Kevin Franklin King, Luca Marinelli, Christopher Judson Hardy
-
Patent number: 8559720Abstract: A video processing technique is performed on a video media service to identify video segments of interest. The video processing technique (200) is supplemented by a text extraction technique (245, 255, 270), as well. The resulting video segments of interest (289) can be stored as to produce a version of the media service which is shorter in time length than the original video media service.Type: GrantFiled: March 30, 2009Date of Patent: October 15, 2013Assignee: Thomson Licensing S.A.Inventors: Ruiduo Yang, Ying Luo, Claire-Hélène Demarty, Lionel Oisel
-
Patent number: 8553239Abstract: A sheet of a document is scanned, character objects are extracted from a scan image, the extracted character objects are divided line by line, and the direction of the document is set on the basis of a blank percentage determined according to start and end positions of lines. If the direction of the document is different from that of a previous document, an image processing unit rotates the scan image.Type: GrantFiled: April 16, 2008Date of Patent: October 8, 2013Assignee: Samsung Electronics Co., LtdInventor: Hyung Soo Ohk
-
Publication number: 20130259377Abstract: Systems may be provided for recording a document with a camera-based mobile radio device and for converting textual information in the document into a format for suitable presentation on the mobile device. A document may be recorded by the mobile device in an image. A layout structure may be recognized with a text block in the image. Character text in the text block may be recognized by OCR. An order of the text blocks may be determined by taking into account the layout structure. A suitable format for presenting the character texts on the mobile device's display may be selected. The format may be adapted to a width of the display so that during reading of the character texts on the display, substantially only vertical scrolling is necessary. A file may be generated and displayed in the format with the character texts in the determined order of the text blocks.Type: ApplicationFiled: March 28, 2013Publication date: October 3, 2013Applicant: Nuance Communications, Inc.Inventor: Herr Cüneyt Göktekin
-
Patent number: 8548250Abstract: An information processing apparatus is disclosed, including: a reading part reading vector information included in an electronic file; a first line segment extraction part extracting line segment parameter information of a line object from the vector information; a second line segment extraction part extracting polygon parameter information of a polygon object from the vector information and extracting the line segment parameter information of line segments forming the polygon object from the extracted polygon parameter information; a rectangle extraction part extracting rectangle parameter information based on the line segment parameter; a minimum rectangle determination part determining whether or not a rectangle formed based on the rectangle parameter information is a minimum rectangle which does not connote other rectangles; and a minimum rectangle output part outputting the minimum rectangle.Type: GrantFiled: November 5, 2008Date of Patent: October 1, 2013Assignee: Ricoh Company, Ltd.Inventor: Kunio Okita
-
Method and user interface for performing a scan operation for a scanner coupled to a computer system
Patent number: RE44528Abstract: A method and user interface is provided for use on a computer system coupled with a scanner for performing a scan operation on an original document, which allows the user to acquire scanned images in an easier and more user-friendly manner. The method allows the user to scan an original document without requiring the user to have learned knowledge background in the science of image processing, and also allows the scanner to perform only one scan operation on the original document. These features allow the use of the scanner to be easier and more user-friendly than the prior art. By the, method, the first step is to determine a set of image processing settings by a scanner driving program that are suited for optimal scan of the original document; and then the scanner is activated to perform a scan operation on the original document based on the image processing settings to thereby obtain a primitive scanned image.Type: GrantFiled: November 7, 2011Date of Patent: October 8, 2013Assignee: Intellectual Ventures I LLCInventors: Chuan-Yu Hsu, Jay Liu, T. J. Hsu