Distinguishing Text From Other Regions Patents (Class 382/176)
  • Patent number: 10896339
    Abstract: A method for image processing is disclosed. The method includes: obtaining an image including a check with a magnetic ink character recognition (MICR) code; generating a mask including a plurality of shapes based on the image and an estimated rotation angle of the check; generating a stroke width map (SWM) by applying a stroke width transform (SWT) to a plurality of regions in the image corresponding to the plurality of shapes; generating a first word line associated with a first region based on a plurality of words in the SWM; rotating a portion of the SWM associated with the first word line; and detecting, after rotating, the MICR code by applying a plurality of OCR processes to the portion of the SWM.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: January 19, 2021
    Assignee: Prosper Funding LLC
    Inventor: Paul Golding
  • Patent number: 10867204
    Abstract: In some embodiments, a method detects a first set of frames in a video that include lines of text, the detecting performed at a frame level on each individual frame. A first representation is generated from the first set of frames and a second representation is generated from the first set of frames. The method filters the first representation based on a number of lines of text within a space in the space dimension to select a second set of frames and filters the second representation based on a number of frames within time intervals in the time dimension to select a third set of frames. Frames in both the second set of frames and the third set of frames are analyzed to determine whether the lines of text in both the second set of frames and the third set of frames are burned-in subtitles.
    Type: Grant
    Filed: April 30, 2019
    Date of Patent: December 15, 2020
    Assignee: HULU, LLC
    Inventors: Yaqi Wang, Xiaohui Xie, Yunsheng Jiang
  • Patent number: 10855965
    Abstract: A segmented 3D multi-view image generator generates fewer multi-view images for partitions having less salient features. Saliency values are calculated based on a depth map and image processing of the input image. The saliency values are compared to thresholds to map pixel locations to first, second, and third partitions. First, second, and third segmented images are created from the input image using a partition map. A multi-view generator uses the depth map and viewer eye locations to generate 28 view images from the first segmented image, 14 unique view images from the second segmented image that are replicated to 28 view images, and 7 unique view images from the third segmented image that are replicated to 28 view images. The view images for each segment are interlaced to generate interlaced segmented images that are then integrated together into a single 3D image that drives a 28-view autostereoscopic display.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: December 1, 2020
    Assignee: Hong Kong Applied Science and Technology Research Institute Company, Limited
    Inventors: Yuzhong Jiao, Man Chi Chan, Ping Chan Mok
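The partitioning and replication steps in the abstract above can be illustrated with a minimal sketch. The function names, the threshold parameters, and the choice to replicate each unique view into consecutive display slots are assumptions made for illustration; the abstract does not specify them.

```python
def partition_index(saliency, high_threshold, low_threshold):
    """Map a saliency value to partition 0 (most salient), 1, or 2.

    Assumption: higher saliency means a more detailed partition.
    """
    if saliency >= high_threshold:
        return 0
    if saliency >= low_threshold:
        return 1
    return 2


def replicate_views(unique_views, total_views=28):
    """Expand a shorter list of unique views to fill every display view slot.

    Assumption: each unique view is repeated in consecutive slots.
    """
    if not unique_views or total_views % len(unique_views) != 0:
        raise ValueError("total_views must be a multiple of the unique view count")
    repeat = total_views // len(unique_views)
    return [view for view in unique_views for _ in range(repeat)]


# 14 unique views become 28 slots (each used twice); 7 become 28 (each used four times).
second_segment = replicate_views(list(range(14)))
third_segment = replicate_views(list(range(7)))
```

Under these assumptions, the less salient partitions need only half or a quarter of the rendering work while still supplying all 28 slots of the display.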
  • Patent number: 10846550
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving user input for an object classification of interest for image data (e.g. single frame image, continuous video, etc.) from a user device, and for each object specified by identified object data to belong to the object classification of interest, displaying data that presents each object and/or text block of interest on a user device.
    Type: Grant
    Filed: June 28, 2018
    Date of Patent: November 24, 2020
    Assignee: Google LLC
    Inventor: Jeffrey Palm
  • Patent number: 10839206
    Abstract: An information processing device performs processing on document image data including first image data to undergo character recognition processing and second image data not to undergo character recognition processing. The information processing device includes a detecting section which detects the first image data, an extracting section which extracts the first image data, and a processing section. The processing section includes a counting section which counts first images, a determining section which determines whether the number of the first images exceeds a threshold, a first performing section which performs first processing when the threshold is exceeded, and a second performing section which performs second processing when the threshold is not exceeded. Through the first processing, the second image is masked with a background color of the document image and character recognition is then performed on the document image.
    Type: Grant
    Filed: December 9, 2019
    Date of Patent: November 17, 2020
    Assignee: KYOCERA Document Solutions Inc.
    Inventor: Hironori Hayashi
  • Patent number: 10839573
    Abstract: Some embodiments of the present disclosure provide a content integration system. The content integration system is configured to retrieve a source digital content, retrieve a target digital content, identify a region within the target digital content for placing or integrating the source digital content, and place or integrate the source digital content onto the identified region of the target digital content. The content integration system can be configured to place the source digital content into the target digital content in an aesthetically-pleasing, unobtrusive, engaging, and/or otherwise favorable manner. The content integration system can be particularly useful for advertisements, enhanced expression, entertainment, information, or communication.
    Type: Grant
    Filed: March 22, 2017
    Date of Patent: November 17, 2020
    Assignee: ADOBE INC.
    Inventors: William L. Marino, Brunno Fidel Maciel Attore, Johan Adami
  • Patent number: 10832048
    Abstract: Disclosed is a new document processing solution that combines the powers of machine learning and deep learning and leverages the knowledge of a knowledge base. Textual information in an input image of a document can be converted to semantic information utilizing the knowledge base. A semantic image can then be generated utilizing the semantic information and geometries of the textual information. The semantic information can be coded by semantic type determined utilizing the knowledge base and positioned in the semantic image utilizing the geometries of the textual information. A region-based convolutional neural network (R-CNN) can be trained to extract regions from the semantic image utilizing the coded semantic information and the geometries. The regions can be mapped to the textual information for classification/data extraction. With semantic images, the number of samples and time needed to train the R-CNN for document processing can be significantly reduced.
    Type: Grant
    Filed: April 7, 2020
    Date of Patent: November 10, 2020
    Assignee: OPEN TEXT SA ULC
    Inventor: Uwe Ast
  • Patent number: 10824854
    Abstract: Embodiments of the present disclosure pertain to systems and methods for extracting data from an image. In one embodiment, a method of extracting data from an image comprises receiving, from an optical character recognition (OCR) system, OCR text in response to sending an image to the OCR system. The OCR text comprises a plurality of lines of text. Each line of text is classified as either a line item or not a line item using a machine learning algorithm, and a plurality of data fields are extracted from each line of text classified as a line item.
    Type: Grant
    Filed: June 18, 2018
    Date of Patent: November 3, 2020
    Assignee: SAP SE
    Inventors: Everaldo Aguiar, Ravi Sharma, Shivani Patel, Jesper Lind, Michael Stark, Yongjian Bi
  • Patent number: 10824788
    Abstract: A method of collecting training data of a document component may be provided. The documents have a structure and are coded in the typesetting language TeX. The method comprises receiving a TeX source file, compiling it into a PDF file and a related sync file, and analyzing the PDF file, thereby determining a non-text-only document component. The method also comprises determining first coordinates of the non-text-only document component and a corresponding page number, determining a typesetting command relating to a non-text-only document component and determining second coordinates of a bounding box and a corresponding page number from the sync file, determining text elements in the non-text-only document component of the PDF file for which the first coordinates and the second coordinates overlap, and combining the determined text elements and linking them to a type of a non-text document component determined in the non-text-only document component in the TeX source file.
    Type: Grant
    Filed: February 8, 2019
    Date of Patent: November 3, 2020
    Assignee: International Business Machines Corporation
    Inventors: Peter Willem Jan Staar, Michele Dolfi, Christoph Auer, Aleksandros Sobczyk, Konstantinos Bekas
  • Patent number: 10803366
    Abstract: The present invention relates to a method for extracting an output data set, wherein the method includes the following steps: receiving an input data set, wherein the input data set comprises at least one textual input data set and at least one visual input data set; processing the at least one textual input data set using natural language processing into at least one textual output data set; processing the at least one visual input data set using image processing into at least one visual output data set; and outputting the output data set, including the at least one textual output data set and/or the at least one visual output data set. Further, the present invention is related to a computer program product and system.
    Type: Grant
    Filed: May 17, 2018
    Date of Patent: October 13, 2020
    Assignees: SIEMENS AKTIENGESELLSCHAFT, SIEMENS CORPORATION
    Inventors: Dmitriy Fradkin, Volkmar Sterzing, Stefan Langer
  • Patent number: 10796429
    Abstract: The subject disclosure provides systems and methods for determination of Area of Interest (AOI) for different types of input slides. Slide thumbnails may be assigned into one of five different types, and separate algorithms for AOI detection executed depending on the slide type. Slide types include ThinPrep® slides, tissue micro-array (TMA) slides, control HER2 slides with 4 cores, smear slides, and a generic slide. The slide type may be assigned based on a user input. Customized AOI detection operations are provided for each slide type. If the user enters an incorrect slide type, operations include detecting the incorrect input and executing the appropriate method. Each AOI detection operation provides as its output a soft-weighted image having zero intensity values at pixels that are detected as not belonging to tissue, and higher intensity values assigned to pixels detected as likely belonging to tissue regions.
    Type: Grant
    Filed: July 26, 2017
    Date of Patent: October 6, 2020
    Assignee: Ventana Medical Systems, Inc.
    Inventors: Anindya Sarkar, Jim Martin
  • Patent number: 10769425
    Abstract: A method of determining a hierarchy of a blank template using an image of the blank template and using the determined hierarchy for providing labels and field values of text lines of a filled form document.
    Type: Grant
    Filed: August 13, 2018
    Date of Patent: September 8, 2020
    Assignee: International Business Machines Corporation
    Inventors: Antonio Foncubierta Rodriguez, Maria Gabrani, Guillaume Jaume
  • Patent number: 10750359
    Abstract: A portable terminal device and a method for operating the same are provided. The portable terminal device includes a communicator configured to perform communication with an external device, a display configured to display a same image as an image displayed on the external device, an inputter configured to receive an input of a selection command, and a controller configured to perform an operation corresponding to an object included in the image at a time when the selection command is input.
    Type: Grant
    Filed: March 13, 2019
    Date of Patent: August 18, 2020
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Do-hyoung Kim, Sang-il Lee, Taik-heon Rhee, Seong-hoon Kang, Sung-bin Kuk, Dong-jin Eun, Min-kyu Jung
  • Patent number: 10706218
    Abstract: Much valuable information in documents is presented within tables. However, the information within tables is hard to extract automatically with high accuracy due to the wide variety and low quality of typical tables found in electronic documents. Information extraction technology can provide a method of extracting information from heterogeneous tables by recognizing tables, the header cells, and cells that are merged or should be merged, creating a richer representation of table structure and providing a convenient way of linking cells to their row and column headers. Use of this richer representation allows a few extraction patterns to successfully pull out information from a wide variety of differently formatted tables.
    Type: Grant
    Filed: May 15, 2017
    Date of Patent: July 7, 2020
    Assignee: Linguamatics Ltd.
    Inventors: David Richard Milward, Himanshu Agrawal, James Robert Walton Cormack, Francisco Nuno Quintiliano Mendonca Carapeto Costa
  • Patent number: 10701434
    Abstract: A seek content extraction system analyzes frames of video content and identifies locations in the frames where session information is displayed. This session information refers to information that is displayed as part of video content and that describes, for a particular location in the video content, what is currently happening in the video content at that particular location. This session information is extracted from each of multiple frames, and for a given frame the extracted session information is associated with the frame. While the user is seeking forward or backward through the video content, a thumbnail of the frame at a given location in the video content is displayed along with the extracted session information associated with the frame.
    Type: Grant
    Filed: January 21, 2019
    Date of Patent: June 30, 2020
    Assignee: Adobe Inc.
    Inventors: Amol Jindal, Ajay Bedi
  • Patent number: 10699381
    Abstract: Certain embodiments involve a model for enhancing text in electronic content. For example, a system obtains electronic content comprising input text and converts the electronic content into a grayscale image. The system also converts the grayscale image into a binary image using a grid-based grayscale-conversion filter, which can include: generating a grid of pixels on the grayscale image; determining a plurality of grid-pixel threshold values at intersection points in the grid of pixels; determining a plurality of estimated pixel threshold values based on the plurality of grid-pixel threshold values; and converting the grayscale image into the binary image using the plurality of grid-pixel threshold values and the plurality of estimated pixel threshold values. The system also generates an interpolated image based on the electronic content and the binary image. The interpolated image includes output text that is darker than the input text. The system can then output the interpolated image.
    Type: Grant
    Filed: May 24, 2018
    Date of Patent: June 30, 2020
    Assignee: Adobe Inc.
    Inventors: Ram Bhushan Agrawal, Ankit Pangasa, Abhishek Shah
  • Patent number: 10664211
    Abstract: An image forming apparatus for forming an image on a sheet includes a reading section, a determining section, a generation section, a searching section, and a placement section. The reading section reads a predetermined shape from an original document. The determining section determines whether or not the predetermined shape contains a first image. The generation section generates a first search condition based on the first image. The searching section searches for at least one second image fulfilling the first search condition from a storage apparatus storing a plurality of images. The placement section places the second image in a first area in which the predetermined shape is located.
    Type: Grant
    Filed: February 26, 2018
    Date of Patent: May 26, 2020
    Assignee: KYOCERA Document Solutions Inc.
    Inventor: Hikaru Miyaji
  • Patent number: 10628525
    Abstract: Detecting and incorporating formatting characteristics within natural language processing analytics. Source documents are ingested and the markup formatting language is identified by the program. Once identified, the markup language is parsed and examined for formatting characteristics, embedded notes, comments and other metadata. The formatting characteristics of the plain text are extracted, along with the plain text, and converted into a common analysis structure (CAS), or CAS-equivalent structure, which annotates the natural language text together with its respective formatting characteristics. The CAS or CAS-equivalent structures are stored and sent to a natural language processing pipeline for further analysis via complex algorithms and rules. The natural language processing results data are curated to reflect meaningful analysis of the extracted CAS or CAS-equivalent structure.
    Type: Grant
    Filed: May 17, 2017
    Date of Patent: April 21, 2020
    Assignee: International Business Machines Corporation
    Inventors: Patrick W. Fink, Kristin E. McNeil, Philip E. Parker, David B. Werts
  • Patent number: 10623602
    Abstract: The image reading apparatus includes: an image reading unit which reads card-like document sheets; a card-image recognition part which recognizes card images corresponding to the document sheets; a circular-area setting part which sets circular areas each containing a card image; a positional-information setting part which sets positional information as to the circular areas; a deviational-angle computation part which determines deviational angles of the card images; a corrected-data acquisition part which acquires corrected image data by turning the circular areas; and an array processing part which generates arrayed image data in which the card images corrected in terms of deviational inclination are disposed in array.
    Type: Grant
    Filed: May 18, 2018
    Date of Patent: April 14, 2020
    Assignee: KYOCERA Document Solutions Inc.
    Inventor: Hiroyuki Nagahama
  • Patent number: 10621470
    Abstract: A method is provided for Optical Character Recognition (OCR). A plurality of OCR decoding results each having a plurality of positions is obtained from capturing and decoding a plurality of images of the same one or more OCR characters. A recognized character in each OCR decoding result is compared with the recognized character that occupies an identical position in each of the other OCR decoding results. A number of occurrences that each particular recognized character occupies the identical position in the plurality of OCR decoding results is calculated. An individual confidence score is assigned to each particular recognized character based on the number of occurrences, with a highest individual confidence score assigned to a particular recognized character having the greatest number of occurrences.
    Type: Grant
    Filed: September 29, 2017
    Date of Patent: April 14, 2020
    Assignee: DATAMAX-O'NEIL CORPORATION
    Inventor: H. Sprague Ackley
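The position-wise voting described in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name is hypothetical, the decoding results are assumed to be equal-length strings, and the confidence score is assumed to be the fraction of results that agree at a position.

```python
from collections import Counter


def vote_ocr_results(decodes):
    """Pick the most frequent character at each position across OCR results.

    Returns the voted text and one confidence score per position, where the
    score is the share of results agreeing with the winner (an assumption).
    """
    if not decodes:
        return "", []
    length = len(decodes[0])
    if any(len(result) != length for result in decodes):
        raise ValueError("all decoding results must have the same number of positions")

    voted_chars = []
    scores = []
    for position in range(length):
        # Count how often each recognized character occupies this position.
        counts = Counter(result[position] for result in decodes)
        char, occurrences = counts.most_common(1)[0]
        voted_chars.append(char)
        scores.append(occurrences / len(decodes))
    return "".join(voted_chars), scores


# Four captures of the same characters, with two misreads.
text, confidence = vote_ocr_results(["A1B2", "A1B2", "A1B8", "4182"])
```

In this example the character with the greatest number of occurrences wins at every position, and positions where one capture disagreed receive a lower score than the position where all four agreed.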
  • Patent number: 10582269
    Abstract: The present invention relates to a device and a method for transmitting and receiving a broadcast signal comprising a subtitling service. Provided in one embodiment of the present invention is a method for transmitting a broadcast signal, the method comprising the steps of: generating a broadcast signal comprising video data and subtitle data; and transmitting the generated broadcast signal. According to the embodiment of the present invention, a transport stream providing a digital broadcast subtitling service using an XML subtitle may be transmitted.
    Type: Grant
    Filed: July 10, 2015
    Date of Patent: March 3, 2020
    Assignee: LG ELECTRONICS INC.
    Inventors: Hyunmook Oh, Jongyeul Suh
  • Patent number: 10547768
    Abstract: A method and system for improving virtual display generation with respect to a visual obstruction is provided. The method includes generating code associated with determining and resolving a physical obstruction with respect to a visual presentation. Video retrieval devices are enabled for retrieving a first video stream of a first object and a second object being viewed by users and a second video stream of the users. A visual obstruction including a portion of the first object visually obstructing a portion of the second object is detected. A boundary and content type associated with the portion of the second object being visually obstructed is determined and analyzed with respect to a threshold value, and a resulting video stream presenting an entire view of the second object without being visually obstructed with respect to the first object is generated and presented.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: January 28, 2020
    Assignee: International Business Machines Corporation
    Inventors: James E. Bostick, John M. Ganci, Jr., Martin G. Keen, Sarbajit K. Rakshit
  • Patent number: 10546403
    Abstract: Systems and methods are disclosed for controlling image annotation. One method includes acquiring a digital representation of image data and generating a set of image annotations for the digital representation of the image data. The method also may include determining an association between members of the set of image annotations and generating one or more groups of members based on the association. A representative annotation from the one or more groups may also be determined, presented for selection, and the selection may be recorded in memory.
    Type: Grant
    Filed: December 7, 2017
    Date of Patent: January 28, 2020
    Assignee: HeartFlow, Inc.
    Inventors: Leo Grady, Michiel Schaap
  • Patent number: 10503971
    Abstract: A device obtains image data associated with a document. Using a first machine learning model, the device determines, for the document, a first classification of one of a plurality of document types and a first confidence score associated with the first classification, and a second classification of one of the plurality of document types and a second confidence score associated with the second classification based on the image data. The device determines a difference between the first confidence score and the second confidence score, compares the difference and a threshold value, and accepts the first classification of the document when the difference satisfies the threshold value.
    Type: Grant
    Filed: August 14, 2019
    Date of Patent: December 10, 2019
    Assignee: Capital One Services, LLC
    Inventors: Steven Dang, Jason Gould, Jennifer Jiang, Christopher Akatsuka, Douglas Slattery, Vijaya Pasam
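The confidence-margin check in the abstract above can be sketched as follows. The function name and the dictionary input are hypothetical, and "satisfies the threshold" is assumed here to mean that the difference is greater than or equal to the threshold; the abstract does not define it.

```python
def accept_top_classification(scores, threshold):
    """Accept the best-scoring document type only if it clearly beats the runner-up.

    scores maps document type -> confidence score.
    Returns (accepted_label_or_None, margin).
    """
    if len(scores) < 2:
        raise ValueError("need at least two candidate classifications")
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    (first_label, first_score), (_, second_score) = ranked[0], ranked[1]
    margin = first_score - second_score
    # Assumption: the margin "satisfies" the threshold when it is at least as large.
    accepted = first_label if margin >= threshold else None
    return accepted, margin


clear_case = accept_top_classification({"invoice": 0.75, "receipt": 0.25, "check": 0.125}, 0.25)
ambiguous_case = accept_top_classification({"invoice": 0.5, "receipt": 0.4375}, 0.25)
```

A result of `None` signals that the two best classifications are too close to call, so a caller might fall back to another model or to manual review.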
  • Patent number: 10497075
    Abstract: A system and method for optimizing healthcare remittance processing includes a networked computing device that provides a user interface and access to healthcare claims and remittance data prepared by the system. The user receives a claim file prepared by a healthcare provider and an EOB/EOP prepared by a healthcare payer in response to the claim file. A remittance file is generated from the received data and is validated using automatic and manual means and is indexed against the remitted data. EOB/EOP data is converted to computer readable data in a standardized remittance file format. This transaction information is stored within the database and access to the stored information is provided to a user over a network connected interface.
    Type: Grant
    Filed: July 22, 2011
    Date of Patent: December 3, 2019
    Assignee: SYSTEMWARE, INC.
    Inventor: Andrea Chiappe
  • Patent number: 10477128
    Abstract: Dehazed images are produced based on an atmospheric light image obtained from an input image as brightest pixels of a predetermined window and white map. The white map is median filtered, morphologically filtered and, in some examples, filtered with a guided filter, and the filtered image combined with the atmospheric light image to produce a dehazed image.
    Type: Grant
    Filed: January 8, 2018
    Date of Patent: November 12, 2019
    Assignee: Nikon Corporation
    Inventors: Ripul Bhutani, Ping-Wei Chang, Bausan Yuan
  • Patent number: 10447882
    Abstract: An image reading apparatus includes a platen on which a document is to be placed; an image generating unit that performs scanning on the platen to generate a position detection image and an output image; a document position detecting unit that detects whether a document exists and a position of the document based on the generated position detection image; a document extracting unit that extracts an area corresponding to the document from the generated output image; and a control unit that controls the image generating unit, the document position detecting unit, and the document extracting unit so as to output an image of the extracted document.
    Type: Grant
    Filed: February 6, 2018
    Date of Patent: October 15, 2019
    Assignee: SHARP KABUSHIKI KAISHA
    Inventors: Kazuhiro Mizude, Kazuma Ogawa, Tatsuya Fujisaki, Sho Tsujimoto
  • Patent number: 10440305
    Abstract: Detecting the start of a credit roll within a video program may allow for the automatic extension of video recordings among other functions. The start of the credit roll may be detected by determining the number of text blocks within a sequence of frames and identifying a point in the sequence of frames where a difference between the number of text blocks in frames occurring before the point and the number of text blocks in frames occurring after the point is greatest and exceeds a specified threshold. Text blocks may be identified within each frame by partitioning the frame into one or more segments and recording the segments having a pixel of a sufficiently high contrast. Contiguous segments may be merged or combined into single blocks, which may then be filtered to remove noise and false positives. Additional content may be inserted into the credit roll frames.
    Type: Grant
    Filed: November 9, 2017
    Date of Patent: October 8, 2019
    Assignee: Comcast Cable Communications, LLC
    Inventors: Oliver Jojic, David F. Houghton
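The change-point search described in the abstract above can be sketched as follows. This is an illustration under stated assumptions: the per-frame text block counts are taken as already computed, the function name is hypothetical, and the comparison uses a fixed-size window of frames on each side of the candidate point, which the abstract does not specify.

```python
def find_credit_roll_start(block_counts, threshold, window=5):
    """Return the frame index where the text block count jumps the most.

    Compares the total text blocks in the `window` frames after each candidate
    point with the total in the `window` frames before it, and keeps the point
    with the greatest difference, provided it exceeds `threshold`.
    Returns None when no point exceeds the threshold.
    """
    best_index = None
    best_difference = threshold
    for index in range(window, len(block_counts) - window + 1):
        before = sum(block_counts[index - window:index])
        after = sum(block_counts[index:index + window])
        difference = after - before
        if difference > best_difference:
            best_index = index
            best_difference = difference
    return best_index


# Sparse on-screen text for six frames, then dense text as the credits begin.
counts = [0, 1, 0, 0, 1, 0, 6, 7, 8, 7, 9, 8]
start = find_credit_roll_start(counts, threshold=10, window=3)
```

With these sample counts the largest jump falls at frame index 6, the first frame with dense text; a sequence with no sustained jump returns `None`.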
  • Patent number: 10430611
    Abstract: Within one or more instances of a computing environment where an instance is a self-contained architecture to provide at least one database with corresponding search and file system. User information from the one or more instances of the computing environment is organized as zones. A zone is based on one or more characteristics of corresponding user information that are different than the instance to which the user information belongs. User information is selectively obfuscated prior to transmitting blocks of data including the obfuscated user information. The selective obfuscation is based on zone information for one or more zones to which the user information belongs.
    Type: Grant
    Filed: May 19, 2017
    Date of Patent: October 1, 2019
    Assignee: salesforce.com, inc.
    Inventors: Olumayokun Obembe, Gregory Lapouchnian, Vijayanth Devadhar, Jason Woods, Karthikeyan Govindarajan, Ashwini Bijwe, Prasad Peddada
  • Patent number: 10368144
    Abstract: The present invention relates to a device and a method for transmitting and receiving a broadcast signal including a subtitle service. Provided in one embodiment of the present invention is a method for transmitting a broadcast signal, comprising the steps of: generating a broadcast signal including video data and subtitle data; and transmitting the generated broadcast signal. According to the embodiment of the present invention, a transmission stream providing a digital broadcast subtitle service using XML subtitles can be transmitted.
    Type: Grant
    Filed: July 10, 2015
    Date of Patent: July 30, 2019
    Assignee: LG ELECTRONICS INC.
    Inventors: Hyunmook Oh, Jongyeul Suh
  • Patent number: 10339657
    Abstract: According to one embodiment, a character detection apparatus includes a feature extractor, a determiner and an integrator. The feature extractor extracts a feature value of an image including character strings. The determiner determines each priority of a plurality of different character detection schemes in accordance with character detection accuracy with respect to an image region having a feature corresponding to the feature value. The integrator integrates text line candidates of the character detection schemes, and selects, as a text line, one of the text line candidates detected by the character detection scheme with the highest priority if a superimposition degree indicating a ratio of a superimposed region among the text line candidates is no less than a first threshold value.
    Type: Grant
    Filed: June 17, 2015
    Date of Patent: July 2, 2019
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Yojiro Tonouchi, Kaoru Suzuki
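The integration step in the abstract above can be sketched as follows. The box format, the function names, the definition of the superimposition degree as intersection area divided by the smaller box area, and the decision to keep non-overlapping candidates from every scheme are all assumptions for illustration.

```python
def box_area(box):
    """Area of an axis-aligned box given as (x0, y0, x1, y1)."""
    return max(0, box[2] - box[0]) * max(0, box[3] - box[1])


def overlap_ratio(a, b):
    """Assumed superimposition degree: intersection area over the smaller box area."""
    width = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    height = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    smaller = min(box_area(a), box_area(b))
    return (width * height) / smaller if smaller else 0.0


def integrate_candidates(candidates, priority, threshold):
    """Merge text line candidates from several detection schemes.

    candidates: list of (scheme_name, box); priority: scheme_name -> rank,
    where a larger rank means higher priority. When two candidates overlap by
    at least `threshold`, only the one from the higher-priority scheme is kept.
    """
    ordered = sorted(candidates, key=lambda item: priority[item[0]], reverse=True)
    kept = []
    for scheme, box in ordered:
        if all(overlap_ratio(box, kept_box) < threshold for _, kept_box in kept):
            kept.append((scheme, box))
    return kept


candidates = [
    ("scheme_a", (0, 0, 100, 20)),
    ("scheme_b", (5, 0, 100, 20)),
    ("scheme_b", (0, 50, 80, 70)),
]
lines = integrate_candidates(candidates, {"scheme_a": 1, "scheme_b": 2}, threshold=0.5)
```

Here the two candidates covering the same line overlap almost completely, so only the one from the higher-priority scheme survives, while the separate line lower on the page is kept as is.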
  • Patent number: 10275712
    Abstract: A method providing an answer to at least one analytical question containing at least one table or at least one chart is provided. The method may include receiving an input question. The method may also include extracting a plurality of information from the input question based on a natural language analysis. The method may further include forming a well-defined sentence. The method may include extracting at least one table or at least one chart associated with the input question. The method may include forming at least one mathematical equation. The method may also include solving the at least one mathematical equation. The method may include determining the answer to the input question in natural language based on the solved at least one mathematical equation. The method may further include narrating the determined answer to the input question in natural language.
    Type: Grant
    Filed: June 21, 2016
    Date of Patent: April 30, 2019
    Assignee: International Business Machines Corporation
    Inventors: Sandesh Bhat, Joy Mustafi
  • Patent number: 10275713
    Abstract: A method providing an answer to at least one analytical question containing at least one table or at least one chart is provided. The method may include receiving an input question. The method may also include extracting a plurality of information from the input question based on a natural language analysis. The method may further include forming a well-defined sentence. The method may include extracting at least one table or at least one chart associated with the input question. The method may include forming at least one mathematical equation. The method may also include solving the at least one mathematical equation. The method may include determining the answer to the input question in natural language based on the solved at least one mathematical equation. The method may further include narrating the determined answer to the input question in natural language.
    Type: Grant
    Filed: June 21, 2016
    Date of Patent: April 30, 2019
    Assignee: International Business Machines Corporation
    Inventors: Sandesh Bhat, Joy Mustafi
  • Patent number: 10278064
    Abstract: A portable terminal device and a method for operating the same are provided. The portable terminal device includes a communicator configured to perform communication with an external device, a display configured to display a same image as an image displayed on the external device, an inputter configured to receive an input of a selection command, and a controller configured to perform an operation corresponding to an object included in the image at a time when the selection command is input.
    Type: Grant
    Filed: August 21, 2017
    Date of Patent: April 30, 2019
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Do-hyoung Kim, Sang-il Lee, Taik-heon Rhee, Seong-hoon Kang, Sung-bin Kuk, Dong-jin Eun, Min-kyu Jung
  • Patent number: 10248637
    Abstract: Document authoring that involves illustrating pen input in an authoring environment is herein improved to provide patterns with higher perceptibility for representing the pen input in a graphical user interface. Colors and patterns are provided as effects that are applied to the illustrated pen input so that multiple textures or colors may be applied to the illustrated pen input without requiring the user to manually signal a switch in texture or colors or using multiple objects to represent the pen input. In various aspects, the patterns used in effects are created with a greater perceptibility, so that users will more readily recognize the effect, with various layers of a contrast basis image imparting a perceptible pattern and a background color image imparting colors for an enhanced ink effect definition.
    Type: Grant
    Filed: October 11, 2016
    Date of Patent: April 2, 2019
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventor: Arianne Marie Taylor
  • Patent number: 10210148
    Abstract: The embodiments of the present invention provide a method and an apparatus for file processing. The method for file processing includes: obtaining a file; parsing the file to obtain a first character contained in the file; matching the first character with a preconfigured matching character library; obtaining an annotation corresponding to the first character when the first character satisfies a predetermined condition; and displaying the first character and the annotation. With the embodiments of the present invention, automatic annotation can be provided for a particular character in a file, such that the user's reading experience can be improved.
    Type: Grant
    Filed: August 1, 2011
    Date of Patent: February 19, 2019
    Assignees: LENOVO (BEIJING) LIMITED, BEIJING LENOVO SOFTWARE LTD.
    Inventors: Yaqiang Wu, Jianzhong Zhang, Zhepeng Wang, Chao Xu, Wei Wang
  • Patent number: 10185873
    Abstract: The tracking method comprises, for at least a first image of the text having at least a first line of characters: applying a prediction of a movement to which the text is subjected between the first image and a second image of the video stream, the movement prediction being applied to at least one second line of characters of the second image; determining at least one alignment hypothesis for aligning the first line with the second line after applying the movement prediction; estimating for each alignment hypothesis, a geometrical transformation between the first line and the second line resulting in that alignment; and evaluating a character match metric for each alignment hypothesis, the metric being evaluated from signatures calculated on the characters of at least one line of the first image and signatures calculated on the characters of at least one line of the second image put into correspondence with the characters of said at least one line of the first image after applying the geometrical transform
    Type: Grant
    Filed: December 8, 2015
    Date of Patent: January 22, 2019
    Assignee: IDEMIA IDENTITY & SECURITY
    Inventors: Alain Rouh, Jean Beaudet
  • Patent number: 10176397
    Abstract: A method of reading degraded symbols is described. A symbol has an initial shape, which is marked on an object. Marks, shapes, symbols, and object ID are managed separately. A symbol library with symbols and associated shapes is initially created. Shapes in the library are updated from shapes of marks as the marks degrade over time. A read mark is compared to all the shapes in the library to determine a most likely shape. A selection set is used to limit symbol selection, based on the comparison, to valid symbols. The symbol library and selection set may be customized to each usage of the method. Comparison methods use probability distributions. Confidence values are used to validate output, generate warnings, and to control updating of the library. Weighted averaging may be used at the level of shapes, comparison distributions, or selections. One application is reading tattooed marks on rodent tails in a vivarium.
    Type: Grant
    Filed: October 30, 2016
    Date of Patent: January 8, 2019
    Assignee: Vium, Inc.
    Inventors: Jonathan Betts-Lacroix, Daniel J. Ford
  • Patent number: 10176200
    Abstract: A system and method to detect similarities between images. The system and method allow comparisons between a query image and one or more catalog images in a manner that is resilient to scanning, scaling, rotating, cropping and other distortions of the query image. The system includes an image processing module that determines and/or calculates principal features of a catalog image and constructs a feature vector using one or more of the principal features. The system also includes a matching module that matches a query image to one or more catalog images. The system finds matches based on a distance measure of features present in the query image and features present in the catalog images.
    Type: Grant
    Filed: October 26, 2017
    Date of Patent: January 8, 2019
    Assignee: PicScout (Israel) LTD.
    Inventors: Uri Lavi, Eli Goz, Gregory Begelman
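The distance-based matching described in this abstract can be illustrated with a minimal sketch. It assumes each image has already been reduced to a fixed-length numeric feature vector and that Euclidean distance is the measure; the abstract specifies neither, and the names `best_matches` and `max_distance` are hypothetical.

```python
import math

def best_matches(query, catalog, max_distance):
    """Return catalog names ordered by distance to the query vector.

    Assumption: `catalog` maps an image name to a feature vector of the
    same length as `query`; Euclidean distance stands in for the
    unspecified "distance measure" in the abstract.
    """
    scored = [(math.dist(query, vector), name) for name, vector in catalog.items()]
    # Keep only candidates within the threshold, nearest first.
    return [name for distance, name in sorted(scored) if distance <= max_distance]


catalog = {"a": (0.0, 0.0), "b": (3.0, 4.0), "c": (10.0, 10.0)}
print(best_matches((0.0, 1.0), catalog, max_distance=5.0))
```

A threshold is used here so that a query with no close catalog image returns an empty list rather than a poor match.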
  • Patent number: 10169650
    Abstract: To identify emphasized text, bounding boxes are based on clusters resulting from horizontal compression and horizontal morphological dilation. The bounding boxes are processed to determine if any contain words or characters in bold. A bounding box is eliminated based on a comparison of its density and an average density across all bounding boxes. If its density is greater, text elements within the bounding box are evaluated to determine whether the text element is bold.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: January 1, 2019
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventor: Wei Ming
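The density comparison in this abstract might be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: density is taken to be foreground pixels divided by box area, and the function name and input layout are hypothetical.

```python
def candidate_bold_boxes(boxes):
    """Return indices of boxes whose ink density exceeds the average.

    Assumption: each box is (foreground_pixel_count, width, height) and
    density is foreground pixels per unit area. Boxes at or below the
    average density are eliminated; the rest would go on to a per-word
    or per-character bold check.
    """
    if not boxes:
        return []
    densities = [count / (width * height) for count, width, height in boxes]
    average = sum(densities) / len(densities)
    return [index for index, density in enumerate(densities) if density > average]


boxes = [(10, 10, 10), (50, 10, 10), (20, 10, 10)]
print(candidate_bold_boxes(boxes))
```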
  • Patent number: 10165149
    Abstract: A system of automatically naming an electronic document may include a scanning device. The system may receive a physical document that is to be converted into an electronic document, perform optical character recognition on at least a portion of the physical document to identify one or more terms that are present in the physical document, and store the identified terms in the data store associated with the scanning device. The system may receive input from a user that includes one or more first characters and corresponds to a title of the electronic document. The system may identify one or more terms from the data store that correspond to the one or more first characters by querying the data store using the received input, and cause the identified terms to be displayed to the user via a display device of the scanning device as suggested document names for the electronic document.
    Type: Grant
    Filed: September 16, 2016
    Date of Patent: December 25, 2018
    Assignee: Xerox Corporation
    Inventors: John Washington, John Barry Poxon
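The name-suggestion step in this abstract can be pictured as a prefix query over the stored OCR terms. The sketch below assumes the data store is a plain list of strings and that matching is case-insensitive; both are assumptions, and `suggest_names` is a hypothetical name.

```python
def suggest_names(stored_terms, typed, limit=5):
    """Return up to `limit` stored terms that start with the typed characters.

    Assumption: `stored_terms` holds terms recognized by OCR on the scanned
    document; comparison ignores case and duplicates are dropped.
    """
    prefix = typed.casefold()
    seen, suggestions = set(), []
    for term in stored_terms:
        key = term.casefold()
        if key.startswith(prefix) and key not in seen:
            seen.add(key)
            suggestions.append(term)
            if len(suggestions) == limit:
                break
    return suggestions


terms = ["Invoice", "Inventory", "invoice", "Receipt"]
print(suggest_names(terms, "inv"))
```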
  • Patent number: 10157326
    Abstract: A method and a device for area identification are provided in the disclosure. The method includes: binarizing a text area including a row of characters; calculating a histogram in a vertical direction of the binarized text area, wherein the histogram includes abscissas of pixels in each column and corresponding accumulated values of foreground color pixels of the pixels in each column; and identifying a character area of each of one or more characters in the text area according to distribution information of the accumulated values.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: December 18, 2018
    Assignee: Xiaomi Inc.
    Inventors: Fei Long, Tao Zhang, Zhijun Chen
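The column histogram described in this abstract is commonly called a vertical projection profile. A minimal sketch follows, assuming the binarized text row is a list of pixel rows in which nonzero means foreground, and assuming characters are separated by columns with no foreground pixels; the `min_count` threshold and the function name are hypothetical.

```python
def character_spans(binary_rows, min_count=1):
    """Return (start, end) column ranges, end exclusive, one per character.

    Assumption: `binary_rows` is a list of equal-length rows where a
    nonzero value marks a foreground pixel.
    """
    width = len(binary_rows[0]) if binary_rows else 0
    # Accumulated foreground pixels per column (the histogram).
    column_counts = [sum(1 for row in binary_rows if row[x]) for x in range(width)]

    spans, start = [], None
    for x, count in enumerate(column_counts):
        if count >= min_count and start is None:
            start = x                      # a run of inked columns begins
        elif count < min_count and start is not None:
            spans.append((start, x))       # the run ended at a gap column
            start = None
    if start is not None:
        spans.append((start, width))       # run reaches the right edge
    return spans


row_image = [
    [0, 1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 1],
]
print(character_spans(row_image))
```

Touching or overlapping characters would merge into one span under this simple gap rule, which is why the abstract refers more generally to the distribution of the accumulated values.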
  • Patent number: 10133518
    Abstract: Provided is an image forming apparatus that solves a problem of work related to copying becoming complicated when a test copy is performed. The image forming apparatus according to this disclosure includes a scanner unit, a printer-control unit, a storage device, a characteristic-extracting unit, and a system-control unit. The printer-control unit executes a printing process of image data. The storage device stores image data of a document having plural pages that is read by the scanner unit. The characteristic-extracting unit, based on an extraction instruction to extract characteristics of an object, extracts characteristics of the object by object recognition of image data for each of the pages.
    Type: Grant
    Filed: October 31, 2017
    Date of Patent: November 20, 2018
    Assignee: KYOCERA Document Solutions Inc.
    Inventor: Hiromi Yamagami
  • Patent number: 10115031
    Abstract: Identifying a page with content in a video frame that is part of a video stream of successive video frames includes receiving the video stream, detecting edge segments in the video frame, where each of the edge segments is a candidate for being at least a part of an edge of the page, filtering the edge segments to discard a first subset of the edge segments based on curvature and based on angles between the edge segments and standard axes of the video frame, and identifying the page with content within a portion of a second subset of the edge segments that remain after filtering in response to the portion having geometric closeness to a rectangle. Edge segments having angles that significantly deviate from coordinate angles of the video frame and edge segments with a relatively high curvature may be discarded. A Canny edge detection algorithm may be used.
    Type: Grant
    Filed: February 24, 2016
    Date of Patent: October 30, 2018
    Assignee: EVERNOTE CORPORATION
    Inventors: Alexander Pashintsev, Boris Gorbatov, Eugene Livshitz
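The angle-based part of the edge filtering in this abstract can be sketched as below. It assumes each edge segment is already a straight line given by two endpoints, so the curvature test is omitted; the 20-degree tolerance and the function names are hypothetical, not values from the patent.

```python
import math

def axis_deviation_deg(segment):
    """Return the angle in degrees between a segment and the nearest frame axis."""
    (x1, y1), (x2, y2) = segment
    # Fold the direction into [0, 90) so horizontal and vertical both map near 0.
    angle = math.degrees(math.atan2(y2 - y1, x2 - x1)) % 90.0
    return min(angle, 90.0 - angle)

def keep_axis_aligned(segments, max_deviation_deg=20.0):
    """Discard segments that deviate too far from the frame's axes."""
    return [s for s in segments if axis_deviation_deg(s) <= max_deviation_deg]


segments = [((0, 0), (10, 1)), ((0, 0), (10, 10)), ((0, 0), (0, 10))]
print(keep_axis_aligned(segments))
```

In practice the segments would come from an edge detector such as Canny, which the abstract mentions, followed by a line-fitting step.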
  • Patent number: 10108879
    Abstract: The present disclosure includes techniques for selecting a candidate presentation style for individual documents for inclusion in an aggregate training data set for a document type that may be used to train an OCR processing engine prior to identifying text in an image of a document of the document type. In one embodiment, text input corresponding to a text sample in a document is received, and an image of the text sample in the document is received. For each of a plurality of candidate presentation styles, an OCR processing engine is trained using a training data set corresponding to the given candidate presentation style, and the OCR processing engine is used, as trained, to identify text in the received image. The OCR processing results for each candidate presentation style are compared to the received text input. A candidate presentation style for the document is selected based on the comparisons.
    Type: Grant
    Filed: September 21, 2016
    Date of Patent: October 23, 2018
    Assignee: Intuit Inc.
    Inventors: Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
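The selection step in this abstract compares each candidate style's OCR output with the known text and picks the best. The sketch below assumes the per-style OCR outputs are already available as strings and uses `difflib.SequenceMatcher` from the Python standard library as the similarity measure; the abstract does not name a measure, and `select_style` is a hypothetical name.

```python
from difflib import SequenceMatcher

def select_style(known_text, ocr_results):
    """Return the style whose OCR output is most similar to the known text.

    Assumption: `ocr_results` maps a candidate presentation style name to
    the text an OCR engine produced after training on that style.
    """
    def similarity(style):
        return SequenceMatcher(None, known_text, ocr_results[style]).ratio()

    return max(ocr_results, key=similarity)


results = {"style_a": "Tota1 4Z.OO", "style_b": "Total 42.00"}
print(select_style("Total 42.00", results))
```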
  • Patent number: 10075742
    Abstract: A system for extracting and monitoring media tags within video content includes at least one server in communication with a plurality of content sources, the server receiving video content from the content sources, a recorder saving the video content, a detector receiving at least one frame of the video content, the detector detecting one or more unknown text within the frame and creating one or more images, each image associated with one of the one or more unknown text, the detector generating metadata associated with the one or more unknown text appearing in the frame, and an optical character recognition engine scanning the one or more images and converting the one or more images into one or more known text. The server further determines that the one or more known text is a media tag.
    Type: Grant
    Filed: September 20, 2016
    Date of Patent: September 11, 2018
    Assignee: TVEyes Inc.
    Inventors: David J. Ives, James H. Hayter, Maxim Oei, David B. Seltzer
  • Patent number: 10073543
    Abstract: An image segmentation method includes displaying, through a display component, an original designation region relative to an image; receiving a user input on the image, in which the user input is at least one stroke on the image; segmenting a regional area corresponding to the stroke to update the original designation region, in which the regional area at least partially overlaps with the original designation region.
    Type: Grant
    Filed: February 10, 2015
    Date of Patent: September 11, 2018
    Assignee: HTC Corporation
    Inventors: Sheng-Jie Luo, Liang-Kang Huang, Tzu-Hao Kuo, Tung-Peng Wu
  • Patent number: 10067931
    Abstract: One or more computers receive input indicative of multiple files to be analyzed together, by performing one or more predetermined actions, using the contents (e.g. strings of text) of a corresponding one or more structures. The one or more structures are identified by the presence in each file, of corresponding names. The one or more structures are normally written into the files for use by an application program to layout the contents therein in a structured manner. The one or more computers are programmed to automatically parse each file, to identify therein the one or more layout structures e.g. based on the presence in each file of corresponding names of layout structures. After parsing, the one or more computer(s) perform the one or more predetermined actions, to obtain an output structure that holds the results based on the contents of each layout structure identified in each file.
    Type: Grant
    Filed: June 23, 2017
    Date of Patent: September 4, 2018
    Assignee: Oracle International Corporation
    Inventors: Anish Desai, Lifang Yao, Sharad Bhardwaj
  • Patent number: 10051151
    Abstract: An image processing apparatus includes a generating unit and a display. The generating unit performs a reduction process on at least a part of document data to generate a reduced image. The display displays the reduced image.
    Type: Grant
    Filed: February 13, 2017
    Date of Patent: August 14, 2018
    Assignee: FUJI XEROX CO., LTD.
    Inventors: Tetsuya Hommi, Satoshi Maruyama, Yutaka Koda, Yasushi Ujigawa, Yohei Makino
  • Patent number: RE47889
    Abstract: Methods and systems of the present embodiment provide segmenting of connected components of markings found in document images. Segmenting includes detecting aligned text. From this detected material an aligned text mask is generated and used in processing of the images. The processing includes breaking connected components in the document images into smaller pieces or fragments by detecting and segregating the connected components and fragments thereof likely to belong to aligned text.
    Type: Grant
    Filed: July 1, 2016
    Date of Patent: March 3, 2020
    Assignee: III Holdings 6, LLC
    Inventor: Eric Saund