Abstract: According to certain aspects, a computer system may be configured to obtain information indicating a plurality of groupings of data stored in a data source, the information indicating a number of data items included in each of the plurality of groupings; determine a first grouping of the plurality of groupings including one or more data items that have changed by comparing a first number of data items included in the first grouping and a historical first number of data items included in a corresponding local version of the first grouping; access data items included in the first grouping from the data source; compare the data items included in the first grouping to data items of the corresponding local version of the first grouping to determine which data items have changed; extract the changed data items of the first grouping; and forward the extracted data items to a destination system.
Abstract: Techniques for automatically scheduling builds of derived datasets in a distributed database system that supports pipelined data transformations are described herein.
Type:
Grant
Filed:
September 7, 2017
Date of Patent:
January 8, 2019
Assignee:
Palantir Technologies, Inc.
Inventors:
Hao Dang, Gustav Brodman, Yi Xue, Stacey Milspaw, Yifei Huang, Yanran Lu
Abstract: Example embodiments relate to a network-based vulnerability detection system configured to access a database of customer transaction data corresponding to a set of card readers that includes transaction codes, receive an identification of a set of compromised card readers among the set of card readers, identify common transaction codes within the transaction data of the set of compromised card readers, and correlate the common transaction codes to one or more instances of fraud associated with the compromised set of card readers. In some example embodiments, the vulnerability detection system may be applied to monitor one or more card readers, receive transaction data corresponding to transaction conducted through the card readers, identify the common transaction codes correlated to the instances of fraud, and cause display of a notification that includes an indication of the instance of fraud at a client device.
Abstract: Systems and methods are disclosed for key phrase characterization of documents. In accordance with one implementation, a method is provided for key phrase characterization of documents. The method includes obtaining a first plurality of documents based at least on a user input, obtaining a statistical model based at least on the user input, and obtaining, from content of the first plurality of documents, a plurality of segments. The method also includes determining statistical significance of the plurality of segments based at least on the statistical model and the content, and providing for display a representative segment from the plurality of segments, the representative segment being determined based at least on the statistical significance.
Abstract: The systems and methods described herein provide highly dynamic and interactive data analysis user interfaces which enable data analysts to quickly and efficiently explore large volume data sources. In particular, a data analysis system, such as described herein, may provide features to enable the data analyst to investigate large volumes of data over many different paths of analysis while maintaining detailed and retraceable steps taken by the data analyst over the course of an investigation, as captured via the data analyst's queries and user interaction with the user interfaces provided by the data analysis system. Data analysis paths may involve exploration of high volume data sets, such as Internet proxy data, which may include trillions of rows of data. The data analyst may pursue a data analysis path that involves, among other things, applying filters, joining to other tables in a database, viewing interactive data visualizations, and so on.
Type:
Grant
Filed:
April 7, 2016
Date of Patent:
December 18, 2018
Assignee:
PALANTIR TECHNOLOGIES INC.
Inventors:
Mark Elliot, Benjamin Duffield, Adam Borochoff
Abstract: A system may determine one or more data sources which the system is configured to access. The system may provide a set of application programming interfaces (APIs) for interacting with data stored in one or more data sources. The APIs may be accessible to one or more web applications which the system is configured to serve. The system may determine that a web application has requested an operation for interacting with the data through one or more APIs in the set of APIs. The system may determine one or more responses based at least in part on the operation requested by the web application. The system may provide the response(s) to the web application.
Type:
Grant
Filed:
October 11, 2017
Date of Patent:
December 11, 2018
Assignee:
Palantir Technologies Inc.
Inventors:
Alexander Ryan, Allen Chang, William Bindi, Brian Lee, John Carrino, Julie Tibshirani, Timothy Wilson
Abstract: Computer-implemented systems and methods are disclosed for comparing and associating objects. In some embodiments, a method is provided for associating a first object with one or more objects within a plurality of objects, each object comprising a first plurality of properties, each property comprising data reflecting a characteristic of an entity represented by the object, the associated objects comprising matching data in corresponding properties for a second plurality of properties. The method may include executing, for each object within the plurality of objects and for the first object, the following: creating a slug for the object, the slug comprising the second plurality of properties from the object; and inputting the slug for the object into a Bloom filter. Further, the method may include creating for a bin within the Bloom filter corresponding to the slug for the first object, an association between objects whose slugs correspond to the bin if the slugs for those objects match.
Abstract: Systems and methods are provided for sharing maps in a collaborative environment using classification-based access control. The generation of and dissemination of maps and/or data within such maps can be governed by classification-based access control, where a user's classification level can determine whether or not maps and/or data within those maps can be seen. In scenarios whether a plurality of users wishes to collaborate on the same map, the systems and methods provided herein generate multiple versions or views of the same map in accordance with different classification levels. In this way, users with different classification levels can nevertheless see the same map and engage in collaborations regarding the same map, while maintaining control of sensitive data.
Type:
Grant
Filed:
March 29, 2018
Date of Patent:
December 4, 2018
Assignee:
Palantir Technologies Inc.
Inventors:
Peter Wilczynski, Andrew Elder, Andrew Moreland, John Carrino, Daniel Cervelli, John Garrod, Timothy Wilson
Abstract: Methods, systems and computer programs for verifying permissioning rights to one or more data resources associated with a data processing platform are disclosed, performed using one or more processors. In an example method, the method comprises receiving, from a client device, an assertion statement identifying a user, a data resource and an operation performable with respect to the data resource. Another operation comprises applying the received assertion statement to a network database storing an access control list defining, for each of a plurality of data resources associated with the data processing platform, one or more users having permission to perform one or more operations on the respective data resource, wherein applying the assertion statement is effective to determine if the received assertion statement is true or false in relation to said data resource identified in the assertion statement. In the event that the assertion is false, an error message may be generated for output.
Abstract: A technique for identifying related transaction records from a database storing transaction records for multiple entities includes grouping transaction records with a common attribute value into transaction record sets, receiving a selection of an exemplar record set and determining the probability the transaction record set stores transaction records associated with a first entity. Other operations include resolving the transaction record set as storing transaction records associated with the first entity. This improves the process of identifying related transaction records because related transaction records missed by string comparisons transaction record attributes are detected.
Abstract: An access control device configured to store a list of user identifiers and user attribute data, receive a set of access criteria specifying one or more attributes, receive and identify a user identifier via a data input component, determine an access status of the user identifier based on the access criteria, and present the access status in such a way as is perceivable by a user of the access control device.
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for a feature clustering of users, user correlation database access, and user interface generation system. The system can obtain information stored in different databases located across geographic regions, and determine unique users from the different information. The information can be included in unique records in the databases, with each record describing a particular user, and with each user described with imperfect identifying information. The system can analyze the different information utilizing machine learning models, and can associate each record with a particular unique user. The system can obtain identifications of items associated with each user, and determine the propensity of the user to disassociate with one or more items, or determine likelihoods of future association with different items not presently associated with the user.
Type:
Grant
Filed:
August 17, 2016
Date of Patent:
November 27, 2018
Assignee:
Palantir Technologies Inc.
Inventors:
Matthew Elkherj, Xavier Falco, Pierre Cholet, Giulio D'Ali' Aula, Andrew Ehrich
Abstract: Systems and methods are provided for providing an interface that includes a first region for displaying one or more charts that are generated from at least one data set and a second region through which at least some of the data set is accessible. A determination is made that at least one first subset of the data set is selected from the second region to be plotted along a first axis. A determination is made that at least one second subset of the data set is selected from the second region to be plotted along a second axis. A chart is displayed in the first region of the interface, the chart being generated based at least in part on the first subset of the data to be plotted along the first axis and the second subset of the data to be plotted along the second axis.
Abstract: In various example embodiments, a system and method for transforming instructions for collaborative updates are described herein. A group of instructions for an update of an element depicted in a client device version of a user interface are generated. The group of instructions is executed and the group or a subset of instructions are transmitted to a server. The server accepts or rejects the instructions. The server may execute the instructions to update a server version of the element. The server sends accepted instructions to the other or all client devices.
Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
Type:
Grant
Filed:
August 1, 2016
Date of Patent:
November 20, 2018
Assignee:
Palantir Technologies Inc.
Inventors:
Huw Pryce, James Neale, Robert Fink, Jared Newman, Graham Dennis, Viktor Nordling, Artur Jonkisz, Daniel Fox, Felix de Souza, Harkirat Singh, Mark Elliot
Abstract: Disclosed are data gathering and analysis systems, methods, and computer-readable storage media to facilitate an investigation process. The method includes accessing a data object representing an investigative issue. The method further includes causing presentation, on a display of a device, of a user interface configured to receive user search queries and present search results for each received search query. The method further includes tracking user activity including one or more user actions performed as part of an investigation of the investigatory issue, the one or more user actions including user interactions with the user interface. The method further includes creating a record of the user activity involving the investigatory issue, and linking the record of the user activity with the data object representing the investigative issue.
Type:
Grant
Filed:
July 19, 2017
Date of Patent:
November 20, 2018
Assignee:
Palantir Technologies Inc.
Inventors:
David Skiff, Allen Cai, Benjamin Lee, Christopher Yu, Hind Kraytem, Jason Ma, Myles Scolnick, Tarik Benabdallah, Zhixian Shen