Abstract: The disclosed embodiments provide a system that manages synchronization between a profile store and a source of truth that is used to update the profile store via a real-time link. During operation, the system obtains a first set of attributes from a profile for an entity in the profile store. Next, the system obtains a second set of attributes for the entity from the source of truth. The system then compares the first and second sets of attributes to detect and correct mismatches between the first and second sets of attributes, wherein the corrected mismatches improve subsequent use of the profile in the profile store by the set of remote offerings. Finally, the system varies a rate of comparing the first and second sets of attributes for the mismatches based on one or more execution conditions associated with the source of truth.
Type:
Grant
Filed:
October 30, 2015
Date of Patent:
November 13, 2018
Assignee:
INTUIT INC.
Inventors:
Tobias Ullrich, Lars Pfannenschmidt, Frank Wisniewski, Thorsten Krüger, Shabbir Y. Rajgara, Guohan Gao
Abstract: A virtual asset testing environment is provided that is distinct from a production computing environment. A virtual asset creation template associated with a class of virtual assets to be verified is identified, each virtual asset of the class of virtual assets being created using the virtual asset creation template. A designated test virtual asset is generated using the virtual asset creation template that is deployed in the virtual asset testing environment. The designated test virtual asset is then analyzed in the virtual asset testing environment to identify any vulnerabilities in the designated test virtual asset. If a vulnerability is identified in the designated test virtual asset, a remedy to the vulnerability is applied to the virtual asset creation template, and/or virtual assets created by the virtual asset creation template deployed in the production environment.
Abstract: The present disclosure relates to language agnostic unsupervised removal of text from form images. According to one embodiment, a method comprises generating a spectral domain representation of an image by applying a two dimensional frequency domain transformation, where the image depicts form layout elements and text elements. Applying a first filter to the spectral domain representation to remove a portion of the frequency domain corresponding to the text element, and applying an inverse two dimensional frequency domain transformation to the filtered spectral domain representation of the image to generate a reconstructed image. The text elements are not depicted in the reconstructed image.
Abstract: Techniques are disclosed for restoring application services in a computer network following a service disruption. A faster service restoration (FSR) engine identifies one or more servers hosting a service. The service is associated with dependencies between the service and another one of the services. The FSR engine determines successive phases for restoring the application based on the dependencies. Each phase includes one or more clusters of servers, where each server in the cluster executes an instance one of the services. The FSR engine restores the application by each successive phase. In each phase, the restoration maintains a minimum availability of the instances of the services executed in each cluster.
Abstract: The disclosed embodiments provide a system for updating database records while maintaining accessible temporal history. The system operates by receiving a request, at a database, to select a specific instance of a record from the database at a specific point in time. In response to the request, the system reads an instance of the record from a snapshot of the database, wherein the snapshot of the database was made prior to the specific point in time. Next, the system loads one or more deltas associated with the record from the database, wherein each delta in the one or more deltas comprises the difference between a new state of the record and a prior state of the record. The system then chronologically applies the one or more deltas to the instance of the record to create the specific instance of the record. Finally, the system returns the specific instance of the record.
Type:
Grant
Filed:
October 30, 2015
Date of Patent:
October 30, 2018
Assignee:
INTUIT INC.
Inventors:
Tobias Ullrich, Lars Pfannenschmidt, Frank Wisniewski, Thorsten Krüger, Shabbir Y. Rajgara, Guohan Gao
Abstract: The present disclosure relates to processing read requests to retrieve data from a plurality of data sources. According to one embodiment, an example method includes determining a navigable path of nodes accessed to satisfy a read request based on a graph projection of an application programming interface (API). An API service generates a plurality of subqueries, each subqueries being associated with a node in the navigable path. While traversing the navigable path to satisfy the read request, the API service identifies data associated with lower level nodes that is cached at a data source associated with a current node, replaces subqueries directed to data stored at the current node and the identified data with a single subquery executed against the data source associated with the current node, and executes the single subquery at the current node. The API service returns data accessed during traversal of the navigable path.
Abstract: The present disclosure relates to building a software development project using a shared build module common to development projects targeting a specific software platform. According to one embodiment, a development system requests, from a centralized repository, version information for a latest version of a shared build module associated with a software platform targeted by the development project. The development system determines to replace a local copy of the shared build module with the latest version of the shared build module based, at least in part, on a comparison between version information for the local copy and the version information for the latest version of the shared build module. The development system builds the software development project using the local copy of the shared build module.
Abstract: Techniques are disclosed to identify a form document in an image using a digital fingerprint of the form document. To do so, the image is evaluated to detect features of the image and determine a polygon bounding each feature. For each polygon, pixels are plotted in a second image based on coordinates of a center of the polygon. The second image is the digital fingerprint of the form. To identify the form corresponding to the digital fingerprint, the digital fingerprint may be compared to digital fingerprints of known forms.
Type:
Grant
Filed:
August 1, 2018
Date of Patent:
October 30, 2018
Assignee:
Intuit inc.
Inventors:
Richard J. Becker, Greg Knoblauch, Pavlo Malynin, Anju Eappen
Abstract: The disclosed embodiments provide a system that describes a semi-structured document for the purpose of acquiring a set of data elements from the semi-structured document. During operation, the system obtains a physics model of a semi-structured document, wherein the physics model includes a set of relationships represented by physical objects that describe relative positions of a set of data elements in the semi-structured document. Next, the system applies the physics model to a representation of the semi-structured document to automatically extract a set of data from the representation. The system then provides the extracted set of data for use with one or more applications without requiring manual input of the data into the one or more applications.
Type:
Grant
Filed:
July 31, 2015
Date of Patent:
October 30, 2018
Assignee:
INTUIT INC.
Inventors:
Richard J. Becker, Carolina Simoes Gomes, Daniel L. Moise, Horace Chan
Abstract: During an image-analysis technique, the system calculates features by performing image analysis (such as optical character recognition) on a received image of a document. Using these features, as well as spatial and grammatical constraints, the system determines a layout of the document. For example, the layout may be determined using constraint-based optimization based on the spatial and the grammatical constraints. Note that the layout specifies locations of content in the document, and may be used to subsequently extract the content from the image and/or to allow a user to provide feedback on the extracted content by presenting the extracted content to the user in a context (i.e., the determined layout) that is familiar to the user.
Type:
Grant
Filed:
December 5, 2013
Date of Patent:
October 30, 2018
Assignee:
INTUIT INC.
Inventors:
Nankun Huang, Carol A. Howe, Christopher M. Dye, Robert Bamford
Abstract: A method may include identifying a candidate user based on a connection to an established user of a business management application (BMA). The candidate user may have an associated user identifier. The method may further include collecting, using the user identifier, social network data of the candidate user from an online social network, identifying, using the social network data of the candidate user, application programming interfaces (APIs) for collecting public data about the candidate user, retrieving, using the user identifier and an API, public data corresponding to the candidate user, generating, using the public data corresponding to the candidate user, an account creation request including the user identifier, and transmitting the account creation request to the BMA.
Abstract: The present disclosure includes techniques for selecting a candidate presentation style for individual documents for inclusion in an aggregate training data set for a document type that may be used to train an OCR processing engine prior to identifying text in an image of a document of the document type. In one embodiment, text input corresponding to a text sample in a document is received, and an image of the text sample in the document is received. For each of a plurality of candidate presentation styles, an OCR processing engine is trained using a training data set corresponding to the given candidate presentation style, and the OCR processing engine is used, as trained, to identify text in the received image. The OCR processing results for each candidate presentation style are compared to the received text input. A candidate presentation style for the document is selected based on the comparisons.
Type:
Grant
Filed:
September 21, 2016
Date of Patent:
October 23, 2018
Assignee:
Intuit inc.
Inventors:
Eugene Krivopaltsev, Sreeneel K. Maddika, Vijay S. Yellapragada
Abstract: Techniques are disclosed for performing optical character recognition (OCR) by assessing and improving quality of electronic documents to perform the OCR. For example a method for identifying information in an electronic document includes obtaining a reference image of the electronic document, distorting the reference image by adjusting different sets of one or more parameters associated with a quality of the reference image to generate a plurality of distorted images, analyzing each distorted image to detect the adjusted set of parameters and corresponding adjusted values, determining an accuracy of detection of the set of parameters and the adjusted values, and training a model based at least on the plurality of distorted images and the accuracy of the detection, wherein the trained model determines at least a first technique for adjusting a set of parameters in a second image to prepare the second image for optical character recognition.
Type:
Grant
Filed:
October 28, 2016
Date of Patent:
October 23, 2018
Assignee:
INTUIT INC.
Inventors:
Richard J. Becker, Rakesh Kandpal, Priya Kothari, Sheldon Porcina, Pavlo Malynin
Abstract: Embodiments of a computer system, a method and a computer-program product (e.g., software) for use with the computer system are described. These embodiments may allow users to create or maintain (including repair and validation), which collect user information from a web page. In particular, using a web browser extension, one or more users' actions while navigating or interacting with the web page are captured. This captured user-action information may specify the layout of the web page, including data locations and/or types of data. Then, using the captured user-action information, a new scraping script can be generated or, based at least in part on determined changes to the web page, an existing scraping script can be maintained.
Type:
Grant
Filed:
April 16, 2009
Date of Patent:
October 23, 2018
Assignee:
INTUIT INC.
Inventors:
Thirugnanam Subbiah, Raghunath Battula, Kenneth W. Hanscom, Charles R. Honton, Don E. Bernal, Gang Wang, David L. Henry
Abstract: A data management system manages secured data for a plurality of users. The data management system utilizes an access authorization system to authenticate users seeking access to the data management system. The access authorization system provides access tokens to authenticated users. The access tokens enable the authenticated users to access the data management system without again providing authentication data. The access authorization system includes, for each user, an access policy that governs whether the users can use the access tokens to access the data management system. The access tokens have a finite lifetime. If the users use the access tokens within the finite lifetime and if the users satisfy all of the access rules of the access policies, then the lifetime of the access tokens can be extended a finite number of times.