Distinguishing Text From Other Regions Patents (Class 382/176)
-
Patent number: 10896339Abstract: A method for image processing is disclosed. The method includes: obtaining an image including a check with a magnetic ink character recognition (MICR) code; generating a mask including a plurality of shapes based on the image and an estimated rotation angle of the check; generating a stroke width map (SWM) by applying a stroke width transform (SWT) to a plurality of regions in the image corresponding to the plurality of shapes; generating a first word line associated with a first region based on a plurality of words in the SWM; rotating a portion of the SWM associated with the first word line; and detecting, after rotating, the MICR code by applying a plurality of OCR processes to the portion of the SWM.Type: GrantFiled: March 7, 2019Date of Patent: January 19, 2021Assignee: Prosper Funding LLCInventor: Paul Golding
-
Patent number: 10867204Abstract: In some embodiments, a method detects a first set of frames in a video that include lines of text, the detecting performed at a frame level on each individual frame. A first representation is generated from the first set of frames and a second representation is generated from the first set of frames. The method filters the first representation based on a number of lines of text within a space in the space dimension to select a second set of frames and filters the second representation based on a number of frames within time intervals in the time dimension to select a third set of frames. Frames in both the second set of frames and the third set of frames are analyzed to determine whether the lines of text in both the second set of frames and the third set of frames are burned-in subtitles.Type: GrantFiled: April 30, 2019Date of Patent: December 15, 2020Assignee: HULU, LLCInventors: Yaqi Wang, Xiaohui Xie, Yunsheng Jiang
-
Patent number: 10855965Abstract: A segmented 3D multi-view image generator generates fewer multi-view view images for partitions having less salient features. Saliency values are calculated based on a depth map and image processing of the input image. The saliency values are compared to thresholds to map pixel locations to first, second, and third partitions. First, second, and third segmented images are created from the input image using a partition map. A multi-view generator uses the depth map and viewer eye locations to generates 28 view images from the first segmented image, 14 unique view images from the second segmented image that are replicated to 28 view images, and 7 unique view images from the third segmented image that are replicated to 28 view images. The view images for each segment are interlaced to generated interlaced segmented images that are then integrated together into a single 3D image that drives a 28-view autostereoscopic display.Type: GrantFiled: June 28, 2019Date of Patent: December 1, 2020Assignee: Hong Kong Applied Science and Technology Research Institute Company, LimitedInventors: Yuzhong Jiao, Man Chi Chan, Ping Chan Mok
-
Patent number: 10846550Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving user input for an object classification of interest for image data (e.g. single frame image, continuous video, etc.) from a user device, and for each object specified by identified object data to belong to the object classification of interest, displaying data that presents each object and/or text block of interest on a user device.Type: GrantFiled: June 28, 2018Date of Patent: November 24, 2020Assignee: Google LLCInventor: Jeffrey Palm
-
Patent number: 10839206Abstract: An information processing device performs processing on document image data including first image data to undergo character recognition processing and second image data not to undergo character recognition processing. The information processing device includes a detecting section which detects the first image data, an extracting section which extracts the first image data, and a processing section. The processing section includes a counting section which counts first images, a determining section which determines whether the number of the first images exceeds a threshold, a first performing section which performs first processing when the threshold is exceeded, and a second performing section which performs second processing when the threshold is not exceeded. Through the first processing, the second image is masked with a background color of the document image and character recognition is then performed on the document image.Type: GrantFiled: December 9, 2019Date of Patent: November 17, 2020Assignee: KYOCERA Document Solutions Inc.Inventor: Hironori Hayashi
-
Patent number: 10839573Abstract: Some embodiments of the present disclosure provide a content integration system. The content integration system is configured to retrieve a source digital content, retrieve a target digital content, identify a region within the target digital content for placing or integrating the source digital content, and place or integrate the target digital content onto the identified region of the source digital content. The content integration system can be configured to place the source digital content into the target digital content in an aesthetically-pleasing, unobtrusive, engaging, and/or otherwise favorable manner. The content integration system can be particularly useful for advertisements, enhanced expression, entertainment, information, or communication.Type: GrantFiled: March 22, 2017Date of Patent: November 17, 2020Assignee: ADOBE INC.Inventors: William L. Marino, Brunno Fidel Maciel Attore, Johan Adami
-
Patent number: 10832048Abstract: Disclosed is a new document processing solution that combines the powers of machine learning and deep learning and leverages the knowledge of a knowledge base. Textual information in an input image of a document can be converted to semantic information utilizing the knowledge base. A semantic image can then be generated utilizing the semantic information and geometries of the textual information. The semantic information can be coded by semantic type determined utilizing the knowledge base and positioned in the semantic image utilizing the geometries of the textual information. A region-based convolutional neural network (R-CNN) can be trained to extract regions from the semantic image utilizing the coded semantic information and the geometries. The regions can be mapped to the textual information for classification/data extraction. With semantic images, the number of samples and time needed to train the R-CNN for document processing can be significantly reduced.Type: GrantFiled: April 7, 2020Date of Patent: November 10, 2020Assignee: OPEN TEXT SA ULCInventor: Uwe Ast
-
Patent number: 10824854Abstract: Embodiments of the present disclosure pertain to systems and method for extracting data from an image. In one embodiment, a method of extracting data from an image comprises receiving, from an optical character recognition (OCR) system, OCR text in response to sending an image to the OCR system. The OCR text comprises a plurality of lines of text. Each line of text is classified as either a line item or not a line item using a machine learning algorithm, and a plurality of data fields are extracted from each line of text classified as a line item.Type: GrantFiled: June 18, 2018Date of Patent: November 3, 2020Assignee: SAP SEInventors: Everaldo Aguiar, Ravi Sharma, Shivani Patel, Jesper Lind, Michael Stark, Yongjian Bi
-
Patent number: 10824788Abstract: A method of collecting training data of a document component may be provided. The documents have a structure and are coded in the typesetting language TeX. The method comprise receiving a TeX source file, compiling it into a PDF file and a related sync file, analyzing the PDF file, thereby determining a non-text-only document component. The method comprises also determining first coordinates of the non-text-only document component and a corresponding page number, determining a typesetting command relating to a non-text-only document component and determining second coordinates of a bounding box and a corresponding page number from the sync file, determining text elements in the non-text-only document component of the PDF file for which the first coordinates and the second coordinates overlap, and combining the determined text elements and linking them to a type of a non-text document component determined in the non-text-only document component in the TeX source file.Type: GrantFiled: February 8, 2019Date of Patent: November 3, 2020Assignee: International Business Machines CorporationInventors: Peter Willem Jan Staar, Michele Dolfi, Christoph Auer, Aleksandros Sobczyk, Konstantinos Bekas
-
Patent number: 10803366Abstract: The present invention relates to a method for extracting an output data set, wherein the method includes the following steps receiving an input data set; wherein the input data set comprises at least one textual input data set and at least one visual input data set; processing the at least one textual input data set using natural language processing into at least one textual output data set; processing the at least one visual input data set using image processing into at least one visual output data set, and outputting the output data set, including the at least one textual output data set and/or the at least one visual output data set. Further, the present invention is related to a computer program product and system.Type: GrantFiled: May 17, 2018Date of Patent: October 13, 2020Assignees: SIEMENS AKTIENGESELLSCHAFT, SIEMENS CORPORATIONInventors: Dmitriy Fradkin, Volkmar Sterzing, Stefan Langer
-
Patent number: 10796429Abstract: The subject disclosure provides systems and methods for determination of Area of Interest (AOI) for different types of input slides. Slide thumbnails may be assigned into one of five different types, and separate algorithms for AOI detection executed depending on the slide type. Slide types include ThinPrep® slides, tissue micro-array (TMA) slides, control HER2 slides with 4 cores, smear slides, and a generic slide. The slide type may be assigned based on a user input. Customized AOI detection operations are provided for each slide type. If the user enters an incorrect slide type, operations include detecting the incorrect input and executing the appropriate method. The result of each AOI detection operations provides as its output a soft-weighted image having zero intensity values at pixels that are detected as not belonging to tissue, and higher intensity values assigned to pixels detected as likely belonging to tissue regions.Type: GrantFiled: July 26, 2017Date of Patent: October 6, 2020Assignee: Ventana Medical Systems, Inc.Inventors: Anindya Sarkar, Jim Martin
-
Patent number: 10769425Abstract: A method of determining a hierarchy of a blank template using an image of the blank template and using the determined hierarchy for providing labels and field values of text lines of a filled form document.Type: GrantFiled: August 13, 2018Date of Patent: September 8, 2020Assignee: International Business Machines CorporationInventors: Antonio Foncubierta Rodriguez, Maria Gabrani, Guillaume Jaume
-
Patent number: 10750359Abstract: A portable terminal device and a method for operating the same are provided. The portable terminal device includes a communicator configured to perform communication with an external device, a display configured to display a same image as an image displayed on the external device, an inputter configured to receive an input of a selection command, and a controller configured to perform an operation corresponding to an object included in the image at a time when the selection command is input.Type: GrantFiled: March 13, 2019Date of Patent: August 18, 2020Assignee: Samsung Electronics Co., Ltd.Inventors: Do-hyoung Kim, Sang-il Lee, Taik-heon Rhee, Seong-hoon Kang, Sung-bin Kuk, Dong-jin Eun, Min-kyu Jung
-
Patent number: 10706218Abstract: Much valuable information in documents is presented within tables. However, the information within tables is hard to extract automatically with high accuracy due to the wide variety and low quality of typical tables found in electronic documents. Information extraction technology can provide a method of extracting information from heterogeneous tables by recognizing tables, the header cells, and cells that are merged or should be merged, creating a richer representation of table structure and providing a convenient way of linking cells to their row and column headers. Use of this richer representation allows a few extraction patterns to successfully pull out information from a wide variety of differently formatted tables.Type: GrantFiled: May 15, 2017Date of Patent: July 7, 2020Assignee: Linguamatics Ltd.Inventors: David Richard Milward, Himanshu Agrawal, James Robert Walton Cormack, Francisco Nuno Quintiliano Mendonca Carapeto Costa
-
Patent number: 10701434Abstract: A seek content extraction system analyzes frames of video content and identifies locations in the frames where session information is displayed. This session information refers to information that is displayed as part of video content and that describes, for a particular location in the video content, what is currently happening in the video content at that particular location. This session information is extracted from each of multiple frames, and for a given frame the extracted session information is associated with the frame. While the user is seeking forward or backward through the video content, a thumbnail of the frame at a given location in the video content is displayed along with the extracted session information associated with the frame.Type: GrantFiled: January 21, 2019Date of Patent: June 30, 2020Assignee: Adobe Inc.Inventors: Amol Jindal, Ajay Bedi
-
Patent number: 10699381Abstract: Certain embodiments involve a model for enhancing text in electronic content. For example, a system obtains electronic content comprising input text and converts the electronic content into a grayscale image. The system also converts the grayscale image into a binary image using a grid-based grayscale-conversion filter, which can include: generating a grid of pixels on the grayscale image; determining a plurality of grid-pixel threshold values at intersection points in the grid of pixels; determining a plurality of estimated pixel threshold values based on the plurality of grid-pixel threshold values; and converting the grayscale image into the binary image using the plurality of grid-pixel threshold values and the plurality of estimated pixel threshold values. The system also generates an interpolated image based on the electronic content and the binary image. The interpolated image includes output text that is darker than the input text. The system can then output the interpolated image.Type: GrantFiled: May 24, 2018Date of Patent: June 30, 2020Assignee: Adobe Inc.Inventors: Ram Bhushan Agrawal, Ankit Pangasa, Abhishek Shah
-
Patent number: 10664211Abstract: An image forming apparatus for forming an image on a sheet includes a reading section, a determining section, a generation section, a searching section, and a placement section. The reading section reads a predetermined shape from an original document. The determining section determines whether or not the predetermined shape contains a first image. The generation section generates a first search condition based on the first image. The searching section searches for at least one second image fulfilling the first search condition from a storage apparatus storing a plurality of images. The placement section places the second image in a first area in which the predetermined shape is located.Type: GrantFiled: February 26, 2018Date of Patent: May 26, 2020Assignee: KYOCERA Document Solutions Inc.Inventor: Hikaru Miyaji
-
Patent number: 10628525Abstract: Detecting and incorporating formatting characteristics within natural language processing analytics. Source documents are ingested and the markup formatting language is identified by the program. Once identified, the markup language is parsed and examined for formatting characteristics, embedded notes, comments and other metadata. The formatting characteristics of the plain text are extracted, along with the plain text, and converted into a common analysis structure (CAS), or CAS-equivalent structure, which annotates the natural language text together with its respective formatting characteristics. The CAS or CAS-equivalent structures are stored and sent to a natural language processing pipeline for further analysis via complex algorithms and rules. The natural language processing results data are curated to reflect meaningful analysis of the extracted CAS or CAS-equivalent structure.Type: GrantFiled: May 17, 2017Date of Patent: April 21, 2020Assignee: International Business Machines CorporationInventors: Patrick W. Fink, Kristin E. McNeil, Philip E. Parker, David B. Werts
-
Patent number: 10623602Abstract: The image reading apparatus includes: an image reading unit which reads card-like document sheets; a card-image recognition part which recognizes card images corresponding to the document sheets; a circular-area setting part which sets circular areas each containing a card image; a positional-information setting part which sets positional information as to the circular areas; a deviational-angle computation part which determines deviational angles of the card images; a corrected-data acquisition part which acquires corrected image data by turning the circular areas; and an array processing part which generates arrayed image data in which the card images corrected in terms of deviational inclination are disposed in array.Type: GrantFiled: May 18, 2018Date of Patent: April 14, 2020Assignee: KYOCERA Document Solutions Inc.Inventor: Hiroyuki Nagahama
-
Patent number: 10621470Abstract: A method is provided for Optical Character Recognition (OCR). A plurality of OCR decoding results each having a plurality of positions is obtained from capturing and decoding a plurality of images of the same one or more OCR characters. A recognized character in each OCR decoding result is compared with the recognized character that occupies an identical position in each of the other OCR decoding results. A number of occurrences that each particular recognized character occupies the identical position in the plurality of OCR decoding results is calculated. An individual confidence score is assigned to each particular recognized character based on the number of occurrences, with a highest individual confidence score assigned to a particular recognized character having the greatest number of occurrences.Type: GrantFiled: September 29, 2017Date of Patent: April 14, 2020Assignee: DATAMAX-O'NEIL CORPORATIONInventor: H. Sprague Ackley
-
Patent number: 10582269Abstract: The present invention relates to a device and a method for transmitting and receiving a broadcast signal comprising a subtitling service. Provided in one embodiment of the present invention is a method for transmitting a broadcast signal, the method comprising the steps of: generating a broadcast signal comprising video data and subtitle data; and transmitting the generated broadcast signal. According to the embodiment of the present invention, a transport stream providing a digital broadcast subtitling service using an XML subtitle may be transmitted.Type: GrantFiled: July 10, 2015Date of Patent: March 3, 2020Assignee: LG ELECTRONICS INC.Inventors: Hyunmook Oh, Jongyeul Suh
-
Patent number: 10547768Abstract: A method and system for improving virtual display generation with respect to a visual obstruction is provided. The method includes generating code associated with determining and resolving a physical obstruction with respect to a visual presentation. Video retrieval devices are enabled for retrieving a first video stream of a first object and a second object being viewed by users and a second video stream of the users. A visual obstruction including a portion of the first object visually obstructing a portion of the second object is detected. A boundary and content type associated with the portion of the second object being visually obstructed is determined and and analyzed with respect to a threshold value and a resulting video stream presenting an entire view of the second object without being visually obstructed with respect to the first object is generated and presented.Type: GrantFiled: May 4, 2018Date of Patent: January 28, 2020Assignee: International Business Machines CorporationInventors: James E. Bostick, John M. Ganci, Jr., Martin G. Keen, Sarbajit K. Rakshit
-
Patent number: 10546403Abstract: Systems and methods are disclosed for controlling image annotation. One method includes acquiring a digital representation of image data and generating a set of image annotations for the digital representation of the image data. The method also may include determining an association between members of the set of image annotations and generating one or more groups of members based on the association. A representative annotation from the one or more groups may also be determined, presented for selection, and the selection may be recorded in memory.Type: GrantFiled: December 7, 2017Date of Patent: January 28, 2020Assignee: HeartFlow, Inc.Inventors: Leo Grady, Michiel Schaap
-
Patent number: 10503971Abstract: A device obtains image data associated with a document. Using a first machine learning model, the device determines, for the document, a first classification of one of a plurality of document types and a first confidence score associated with the first classification, and a second classification of one of the plurality of document types and a second confidence score associated with the second classification based on the image data. The device determines a difference between the first confidence score and the second confidence score, compares the difference and a threshold value, and accept the first classification of the document when the difference satisfies the threshold value.Type: GrantFiled: August 14, 2019Date of Patent: December 10, 2019Assignee: Capital One Services, LLCInventors: Steven Dang, Jason Gould, Jennifer Jiang, Christopher Akatsuka, Douglas Slattery, Vijaya Pasam
-
Patent number: 10497075Abstract: A system and method for optimizing healthcare remittance processing includes a networked computing device that provides a user interface and access to healthcare claims and remittance data prepared by the system. The user receives a claim file prepared by a healthcare provider and an EOB/EOP prepared by a healthcare payer in response to the claim file. A remittance file is generated from the received data and is validated using automatic and manual means and is indexed against the remitted data. EOB/EOP data is converted to computer readable data in a standardized remittance file format. This transaction information is stored within the database and access to the stored information is provided to a user over a network connected interface.Type: GrantFiled: July 22, 2011Date of Patent: December 3, 2019Assignee: SYSTEMWARE, INC.Inventor: Andrea Chiappe
-
Patent number: 10477128Abstract: Dehazed images are produced based on an atmospheric light image obtained form an input image as brightest pixels of a predetermined window and white map. The white map is median filtered, morphologically filtered and, in some examples, filtered with a guided filter, and the filtered image combined with the atmospheric light image to produce a dehazed image.Type: GrantFiled: January 8, 2018Date of Patent: November 12, 2019Assignee: Nikon CorporationInventors: Ripul Bhutani, Ping-Wei Chang, Bausan Yuan
-
Patent number: 10447882Abstract: An image reading apparatus includes a platen on which a document is to be placed; an image generating unit that performs scanning on the platen to generate a position detection image and an output image; a document position detecting unit that detects whether a document exists and a position of the document based on the generated position detection image; a document extracting unit that extract an area corresponding to the document from the generated output image; and a control unit that controls the image generating unit, the document position detecting unit, and the document extracting unit so as to output an image of the extracted document.Type: GrantFiled: February 6, 2018Date of Patent: October 15, 2019Assignee: SHARP KABUSHIKI KAISHAInventors: Kazuhiro Mizude, Kazuma Ogawa, Tatsuya Fujisaki, Sho Tsujimoto
-
Patent number: 10440305Abstract: Detecting the start of a credit roll within video program may allow for the automatic extension of video recordings among other functions. The start of the credit roll may be detected by determining the number of text blocks within a sequence of frames and identifying a point in the sequence of frames where a difference between the number of text blocks in frames occurring before the point and the number of text blocks in frames occurring after the point is greatest and exceeds a specified threshold. Text blocks may be identified within each frame by partitioning the frame into one or more segments and recording the segments having a pixel of a sufficiently high contrast. Contiguous segments may be merged or combined into single blocks, which may then be filtered to remove noise and false positives. Additional content may be inserted into the credit roll frames.Type: GrantFiled: November 9, 2017Date of Patent: October 8, 2019Assignee: Comcast Cable Communications, LLCInventors: Oliver Jojic, David F. Houghton
-
Patent number: 10430611Abstract: Within one or more instances of a computing environment where an instance is a self-contained architecture to provide at least one database with corresponding search and file system. User information from the one or more instances of the computing environment is organized as zones. A zone is based on one or more characteristics of corresponding user information that are different than the instance to which the user information belongs. User information is selectively obfuscated prior to transmitting blocks of data including the obfuscated user information. The selective obfuscation is based on zone information for one or more zones to which the user information belongs.Type: GrantFiled: May 19, 2017Date of Patent: October 1, 2019Assignee: salesforce.com, inc.Inventors: Olumayokun Obembe, Gregory Lapouchnian, Vijayanth Devadhar, Jason Woods, Karthikeyan Govindarajan, Ashwini Bijwe, Prasad Peddada
-
Patent number: 10368144Abstract: The present invention relates to a device and a method for transmitting and receiving a broadcast signal including a subtitle service. Provided in one embodiment of the present invention is a method for transmitting a broadcast signal, comprising the steps of: generating a broadcast signal including video data and subtitle data; and transmitting the generated broadcast signal. According to the embodiment of the present invention, a transmission stream providing a digital broadcast subtitle service using XML subtitles can be transmitted.Type: GrantFiled: July 10, 2015Date of Patent: July 30, 2019Assignee: LG ELECTRONICS INC.Inventors: Hyunmook Oh, Jongyeul Suh
-
Patent number: 10339657Abstract: According to one embodiment, a character detection apparatus includes a feature extractor, a determiner and an integrator. The feature extractor extracts a feature value of an image including character strings. The determiner determines each priority of a plurality of different character detection schemes in accordance with character detection accuracy with respect to an image region having a feature corresponding to the feature value. The integrator integrates text line candidates of the character detection schemes, and selects, as a text line, one of the text line candidates detected by the character detection scheme with the highest priority if a superimposition degree indicating a ratio of a superimposed region among the text line candidates is no less than a first threshold value.Type: GrantFiled: June 17, 2015Date of Patent: July 2, 2019Assignee: Kabushiki Kaisha ToshibaInventors: Yojiro Tonouchi, Kaoru Suzuki
-
Patent number: 10275712Abstract: A method providing an answer to at least one analytical question containing at least one table or at least one chart is provided. The method may include receiving an input question. The method may also include extracting a plurality of information from the input question based on a natural language analysis. The method may further include forming a well-defined sentence. The method may include extracting at least one table or at least one chart associated with the input question. The method may include forming at least one mathematical equation. The method may also include solving the at least one mathematical equation. The method may include determining the answer to the input question in natural language based on the solved at least one mathematical equation. The method may further include narrating the determined answer to the input question in natural language.Type: GrantFiled: June 21, 2016Date of Patent: April 30, 2019Assignee: International Business Machines CorporationInventors: Sandesh Bhat, Joy Mustafi
-
Patent number: 10275713Abstract: A method providing an answer to at least one analytical question containing at least one table or at least one chart is provided. The method may include receiving an input question. The method may also include extracting a plurality of information from the input question based on a natural language analysis. The method may further include forming a well-defined sentence. The method may include extracting at least one table or at least one chart associated with the input question. The method may include forming at least one mathematical equation. The method may also include solving the at least one mathematical equation. The method may include determining the answer to the input question in natural language based on the solved at least one mathematical equation. The method may further include narrating the determined answer to the input question in natural language.Type: GrantFiled: June 21, 2016Date of Patent: April 30, 2019Assignee: International Business Machines CorporationInventors: Sandesh Bhat, Joy Mustafi
-
Patent number: 10278064Abstract: A portable terminal device and a method for operating the same are provided. The portable terminal device includes a communicator configured to perform communication with an external device, a display configured to display a same image as an image displayed on the external device, an inputter configured to receive an input of a selection command, and a controller configured to perform an operation corresponding to an object included in the image at a time when the selection command is input.Type: GrantFiled: August 21, 2017Date of Patent: April 30, 2019Assignee: Samsung Electronics Co., Ltd.Inventors: Do-hyoung Kim, Sang-il Lee, Taik-heon Rhee, Seong-hoon Kang, Sung-bin Kuk, Dong-jin Eun, Min-kyu Jung
-
Patent number: 10248637Abstract: Document authoring that involves illustrating pen input in an authoring environment is herein improved to provide patterns with higher perceptibility for representing the pen input in a graphical user interface. Colors and patterns are provided as effects that are applied to the illustrated pen input so that multiple textures or colors may be applied to the illustrated pen input without requiring the user to manually signal a switch in texture or colors or using multiple objects to represent the pen input. In various aspects, the patterns used in effects are created with a greater perceptibility, so that users will more readily recognize the effect, with various layers of a contrast basis image imparting a perceptible pattern and a background color image imparting colors for an enhanced ink effect definition.Type: GrantFiled: October 11, 2016Date of Patent: April 2, 2019Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventor: Arianne Marie Taylor
-
Patent number: 10210148Abstract: The embodiments of the present invention provide a method and an apparatus for file processing. The method for file processing includes: obtaining a file; parsing the file to obtain a first character contained in the file; matching the first character with a preconfigured matching character library; obtaining an annotation corresponding to the first character when the first character satisfies a predetermined condition; and displaying the first character and the annotation. With the embodiments of the present invention, automatic annotation can be provided for a particular character in a file, such that the user's reading experience can be improved.Type: GrantFiled: August 1, 2011Date of Patent: February 19, 2019Assignees: LENOVO (BEIJING) LIMITED, BEIJING LENOVO SOFTWARE LTD.Inventors: Yaqiang Wu, Jianzhong Zhang, Zhepeng Wang, Chao Xu, Wei Wang
-
Patent number: 10185873Abstract: The tracking method comprises, for at least a first image of the text having at least a first line of characters: applying a prediction of a movement to which the text is subjected between the first image and a second image of the video stream, the movement prediction being applied to at least one second line of characters of the second image; determining at least one alignment hypothesis for aligning the first line with the second line after applying the movement prediction; estimating for each alignment hypothesis, a geometrical transformation between the first line and the second line resulting in that alignment; and evaluating a character match metric for each alignment hypothesis, the metric being evaluated from signatures calculated on the characters of at least one line of the first image and signatures calculated on the characters of at least one line of the second image put into correspondence with the characters of said at least one line of the first image after applying the geometrical transformType: GrantFiled: December 8, 2015Date of Patent: January 22, 2019Assignee: IDEMIA IDENTITY & SECURITYInventors: Alain Rouh, Jean Beaudet
-
Patent number: 10176397Abstract: A method of reading degraded symbols is described. A symbol has an initial shape, which is marked on an object. Marks, shapes, symbols, and object ID are managed separately. A symbol library with symbols and associated shapes is initially created. Shapes in the library are updated from shapes of marks as the marks degrade over time. A read mark is compared to all the shapes in the library to determine a most likely shape. A selection set is used to limit symbol selection, based on the comparison, to valid symbols. The symbol library and selection set may be customized to each usage of the method. Comparison methods use probability distributions. Confidence values are used to validate output, generate warnings, and to control updating of the library. Weighted averaging may be used at the level of shapes, comparison distributions, or selections. One application is reading tattooed marks on rodent tails in a vivarium.Type: GrantFiled: October 30, 2016Date of Patent: January 8, 2019Assignee: Vium, Inc.Inventors: Jonathan Betts-Lacroix, Daniel J. Ford
-
Patent number: 10176200Abstract: A system and method to detect similarities between images. The system and method allow comparisons between a query image and one or more catalog images in a manner that is resilient to scanning, scaling, rotating, cropping and other distortions of the query image. The system includes an image processing module that determines and/or calculates principle features of a catalog image and constructs a feature vector using one or more of the principle features. The system also includes a matching module that matches a query image to one or more catalog images. The system finds matches based on a distance measure of features present in the query image and features present in the catalog images.Type: GrantFiled: October 26, 2017Date of Patent: January 8, 2019Assignee: PicScout (Israel) LTD.Inventors: Uri Lavi, Eli Goz, Gregory Begelman
-
Patent number: 10169650Abstract: To identify emphasized text, bounding boxes are based on clusters resulting from horizontal compression and horizontal morphological dilation. The bounding boxes are processed to determine if any contain words or characters in bold. A bounding box is eliminated based on a comparison of its density and an average density across all bounding boxes. If its density is greater, text elements within the bounding box are evaluated to determine whether the text element is bold.Type: GrantFiled: June 30, 2017Date of Patent: January 1, 2019Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.Inventor: Wei Ming
-
Patent number: 10165149Abstract: A system of automatically naming an electronic document may include a scanning device. The system may receive a physical document that is to be converted into an electronic document, perform optical character recognition on at least a portion of the physical document to identify one or more terms that are present in the physical document, and store the identified terms in the data store associated with the scanning device. The system may receive input from a user that includes one or more first characters and corresponds to a title of the electronic document. The system may identify one or more terms from the data store that correspond to the one or more first characters by querying the data store using the received input, and cause the identified terms to be displayed to the user via a display device of the scanning device as suggested document names for the electronic document.Type: GrantFiled: September 16, 2016Date of Patent: December 25, 2018Assignee: Xerox CorporationInventors: John Washington, John Barry Poxon
-
Patent number: 10157326Abstract: A method and a device for area identification are provided in the disclosure. The method includes: binarizing a text area including a row of characters; calculating a histogram in a vertical direction of the binarized text area, wherein the histogram includes abscissas of pixels in each column and corresponding accumulated values of foreground color pixels of the pixels in each column; and identifying a character area of each of one or more characters in the text area according to distribution information of the accumulated values.Type: GrantFiled: October 20, 2016Date of Patent: December 18, 2018Assignee: Xiaomi Inc.Inventors: Fei Long, Tao Zhang, Zhijun Chen
-
Patent number: 10133518Abstract: Provided is an image forming apparatus that solves a problem of work related to copying becoming complicated when a test copy is performed. The image forming apparatus according to this disclosure includes a scanner unit, a printer-control unit, a storage device, a characteristic-extracting unit, and a system-control unit. The printer-control unit executes a printing process of image data. The storage device stores image data of a document having plural pages that is read by the scanner unit. The characteristic-extracting unit, based on an extraction instruction to extract characteristics of an object, extracts characteristics of the object by object recognition of image data for each of the pages.Type: GrantFiled: October 31, 2017Date of Patent: November 20, 2018Assignee: KYOCERA Document Solutions Inc.Inventor: Hiromi Yamagami
-
Patent number: 10115031Abstract: Identifying a page with content in a video frame that is part of a video stream of successive video frames includes receiving the video stream, detecting edge segments in the video frame, where each of the edge segments is a candidate for being at least a part of an edge of the page, filtering the edge segments to discard a first subset of the edge segments based on curvature and based on angles between the edge segments and standard axes of the video frame, and identifying the page with content within a portion of a second subset of the edge segments that remain after filtering in response to the portion having geometric closeness to a rectangle. Edge segments having angles that significantly deviate from coordinate angles of the video frame and edge segments with a relatively high curvature may be discarded. A Canny edge detection algorithm may be used.Type: GrantFiled: February 24, 2016Date of Patent: October 30, 2018Assignee: EVERNOTE CORPORATIONInventors: Alexander Pashintsev, Boris Gorbatov, Eugene Livshitz
-
Patent number: 10108879Abstract: The present disclosure includes techniques for selecting a candidate presentation style for individual documents for inclusion in an aggregate training data set for a document type that may be used to train an OCR processing engine prior to identifying text in an image of a document of the document type. In one embodiment, text input corresponding to a text sample in a document is received, and an image of the text sample in the document is received. For each of a plurality of candidate presentation styles, an OCR processing engine is trained using a training data set corresponding to the given candidate presentation style, and the OCR processing engine is used, as trained, to identify text in the received image. The OCR processing results for each candidate presentation style are compared to the received text input. A candidate presentation style for the document is selected based on the comparisons.Type: GrantFiled: September 21, 2016Date of Patent: October 23, 2018Assignee: Intuit inc.Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
-
Patent number: 10075742Abstract: A system for extracting and monitoring media tags within video content includes at least one server in communication with a plurality of content sources, the server receiving video content from the content sources, a recorder saving the video content, a detector receiving at least one frame of the video content, the detector detecting one or more unknown text within the frame and creating one or more images, each image associated with one of the one or more unknown text, the detector generating metadata associated with the one or more unknown text appearing in the frame, and an optical character recognition engine scanning the one or more images and converting the one or more images into one or more known text. The server further determines that the one or more known text is a media tag.Type: GrantFiled: September 20, 2016Date of Patent: September 11, 2018Assignee: TVEyes Inc.Inventors: David J. Ives, James H. Hayter, Maxim Oei, David B. Seltzer
-
Patent number: 10073543Abstract: An image segmentation method includes displaying, through a display component, an original designation region relative to an image; receiving a user input on the image, in which the user input is at least one stroke on the image; segmenting a regional area corresponding to the stroke to update the original designation region, in which the regional area at least partially overlaps with the original designation region.Type: GrantFiled: February 10, 2015Date of Patent: September 11, 2018Assignee: HTC CorporationInventors: Sheng-Jie Luo, Liang-Kang Huang, Tzu-Hao Kuo, Tung-Peng Wu
-
Patent number: 10067931Abstract: One or more computers receive input indicative of multiple files to be analyzed together, by performing one or more predetermined actions, using the contents (e.g. strings of text) of a corresponding one or more structures. The one or more structures are identified by the presence in each file, of corresponding names. The one or more structures are normally written into the files for use by an application program to layout the contents therein in a structured manner. The one or more computers are programmed to automatically parse each file, to identify therein the one or more layout structures e.g. based on the presence in each file of corresponding names of layout structures. After parsing, the one or more computer(s) perform the one or more predetermined actions, to obtain an output structure that holds the results based on the contents of each layout structure identified in each file.Type: GrantFiled: June 23, 2017Date of Patent: September 4, 2018Assignee: Oracle International CorporationInventors: Anish Desai, Lifang Yao, Sharad Bhardwaj
-
Patent number: 10051151Abstract: An image processing apparatus includes a generating unit and a display. The generating unit performs a reduction process on at least a part of document data to generate a reduced image. The display displays the reduced image.Type: GrantFiled: February 13, 2017Date of Patent: August 14, 2018Assignee: FUJI XEROX CO., LTD.Inventors: Tetsuya Hommi, Satoshi Maruyama, Yutaka Koda, Yasushi Ujigawa, Yohei Makino
-
Patent number: RE47889Abstract: Methods and systems of the present embodiment provide segmenting of connected components of markings found in document images. Segmenting includes detecting aligned text. From this detected material an aligned text mask is generated and used in processing of the images. The processing includes breaking connected components in the document images into smaller pieces or fragments by detecting and segregating the connected components and fragments thereof likely to belong to aligned text.Type: GrantFiled: July 1, 2016Date of Patent: March 3, 2020Assignee: III Holdings 6, LLCInventor: Eric Saund