Patents Assigned to VEDA Data Solutions, Inc.
  • Publication number: 20230409966
    Abstract: To train models, training data is needed. As personal data changes over time, the training data can get stale, obviating its usefulness in training the model. Embodiments deal with this by developing a database with a running log specifying how each person's data changes at the time. When data is ingested, it may not be normalized. To deal with this, embodiments clean the data to ensure the ingested data fields are normalized. Finally, the various tasks needed to train the model and solve for accuracy of personal data can quickly become cumbersome to a computing device. They can conflict with one another and compete inefficiently for computing resources, such as processor power and memory capacity. To deal with these issues, a scheduler is employed to queue the various tasks involved.
    Type: Application
    Filed: January 6, 2023
    Publication date: December 21, 2023
    Applicant: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond LINDER
  • Publication number: 20230273934
    Abstract: The present disclosure is directed to systems and methods for identifying demographic information in a data file. The method uses evidence for a label within the column itself. In addition, the method uses likelihoods that a first label may exist at a particular location in a data file with respect to a second label. Finally, the method uses likelihoods that a first label exists in the data file at a first frequency given that a second label exists in a data file in a second frequency. All based on these likelihoods, an overall likelihood of that the label configuration is correct is determined. Using that likelihood score, a nonlinear optimization algorithm is applied to identify a best fit between a group of labels and a group of columns in a data file.
    Type: Application
    Filed: February 24, 2023
    Publication date: August 31, 2023
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Robert Raymond LINDNER, Carlos VERA-CIRO
  • Publication number: 20230273848
    Abstract: The present disclosure is directed to methods and non-transitory program storage devices for identifying demographic information in an input data file and converting that data into autonomous data of an export entity file. In an embodiment, the received tabular data is sorted based on a lowest number of unique entities of a certain type in the revised data file, common data describing the lowest number of unique entities is extracted, labeled, and stored as an autonomous export entity file. In an embodiment a plurality of autonomous export entities are linked to create a master export entity comprising the plurality of autonomous export entities.
    Type: Application
    Filed: February 24, 2023
    Publication date: August 31, 2023
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos VERA-CIRO, Robert Raymond LINDNER
  • Publication number: 20230273900
    Abstract: The present disclosure is directed to methods and non-transitory program storage devices for identifying demographic information in an input data file even in the face of known errors that would otherwise prevent the method from operating. When a fault condition is detected, a fault handler may attempt to fix the faulty data, remove the faulty data from the input data file being processed, or provide the faulty data at a user interface so a human user can intervene. The method and storage devices may continue processing the data regardless of whether human input has been received because the system can bypass or remove the data, thereby keeping the method fault tolerant.
    Type: Application
    Filed: February 24, 2023
    Publication date: August 31, 2023
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos VERA-CIRO, Robert Raymond LINDNER
  • Publication number: 20220318274
    Abstract: Disclosed herein are system, method, and computer program product embodiments for linking data records in memory. The system, method, and computer program product includes accessing a first record stored in memory, the first record holding information describing a first person and accessing at least one additional record stored in memory, the additional records holding information describing additional persons. The method continues by parsing the information of the first record and additional record and assigning the parsed information to predefined categories within the respective records. After assigning the information into categories, a similarity score between categorical information in the first record and categorical information of additional records is determined. A category of an additional record is then modified based on the similarity score, so the additional record is associated with the first person.
    Type: Application
    Filed: March 21, 2022
    Publication date: October 6, 2022
    Applicant: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond LINDNER
  • Patent number: 11314782
    Abstract: Disclosed herein are system, method, and computer program product embodiments for determining a first processing demand required to clean a data set and identifying one or more processors to clean the data based on the determined demand. The system, method, and computer program product embodiment further monitors the data cleaning process to determine a second demand and decides whether to continue operating one or more of the processors, based on the second demand.
    Type: Grant
    Filed: December 31, 2019
    Date of Patent: April 26, 2022
    Assignee: VEDA DATA SOLUTIONS, INC.
    Inventor: Robert Raymond Lindner
  • Publication number: 20210174380
    Abstract: The present disclosure is directed to systems and methods for identifying demographic information in a data file. The method may include: receiving the data file containing a plurality of fields of demographic information from a third-party, the data file having inconsistent or mislabeled nomenclatures for one or more fields of the plurality of fields or spurious demographic information; analyzing the data file using a machine learning model trained according to other data files to distinguish between each of the plurality of fields of demographic information, the machine learning model being based on a plurality of machine learning algorithms to identify different types demographic information; generating a score indicating a probability that each of the plurality of fields of demographic information was identified correctly; and generating a revised data file labeling each of the plurality of fields of demographic information based on the identified type.
    Type: Application
    Filed: February 22, 2021
    Publication date: June 10, 2021
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos VERA-CIRO, Robert Raymond LINDNER
  • Publication number: 20210134407
    Abstract: The present disclosure is directed to systems and methods for extracting unstructured data from a data source in a structure manner. Embodiments provide ways to retrieve unstructured data along from data sources not optimized for automated retrieval. For example, embodiments may generate a branched tree for each data source that maps out paths to individual sites of, for example, a healthcare provider listing the unstructured data. Using this branched tree, tasks can be generated to navigate along a path with the data source to each site and extract the unstructured data from the data source. In this way, embodiments provide the ability to navigate through a site from a base site to a site that has the relevant data.
    Type: Application
    Filed: October 30, 2019
    Publication date: May 6, 2021
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos Vera-Ciro, Robert Raymond Lindner
  • Publication number: 20210133769
    Abstract: The present disclosure is directed to systems and methods for identifying demographic information in a data file. The method may include: receiving the data file containing a plurality of fields of demographic information from a third-party, the data file having inconsistent or mislabeled nomenclatures for one or more fields of the plurality of fields or spurious demographic information; analyzing the data file using a machine learning model trained according to other data files to distinguish between each of the plurality of fields of demographic information, the machine learning model being based on a plurality of machine learning algorithms to identify different types demographic information; generating a score indicating a probability that each of the plurality of fields of demographic information was identified correctly; and generating a revised data file labeling each of the plurality of fields of demographic information based on the identified type.
    Type: Application
    Filed: October 30, 2019
    Publication date: May 6, 2021
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos Vera-Ciro, Robert Raymond Lindner
  • Publication number: 20210133275
    Abstract: The present disclosure is directed to systems and methods for identifying demographic information in a marked up document. The method may include: detecting a plurality of fields representing demographic information in a marked up document; extracting a set of features based the detected demographic information; based at least in part on the set of features, determining whether the plurality of fields represent demographic information of a single provider using a machine learning model trained according to other marked up documents; and when the plurality of fields are determined to represent demographic information of a single provider, associating the plurality of fields to represent demographic information of the single provider.
    Type: Application
    Filed: October 30, 2019
    Publication date: May 6, 2021
    Applicant: VEDA Data Solutions, Inc.
    Inventors: Carlos Vera-Ciro, Robert Raymond Lindner
  • Patent number: 10990900
    Abstract: To train models, training data is needed. As personal data changes over time, the training data can get stale, obviating its usefulness in training the model. Embodiments deal with this by developing a database with a running log specifying how each person's data changes at the time. When data is ingested, it may not be normalized. To deal with this, embodiments clean the data to ensure the ingested data fields are normalized. Finally, the various tasks needed to train the model and solve for accuracy of personal data can quickly become cumbersome to a computing device. They can conflict with one another and compete inefficiently for computing resources, such as processor power and memory capacity. To deal with these issues, a scheduler is employed to queue the various tasks involved.
    Type: Grant
    Filed: April 9, 2018
    Date of Patent: April 27, 2021
    Assignee: Veda Data Solutions, Inc.
    Inventor: Robert Raymond Lindner
  • Publication number: 20210117419
    Abstract: Disclosed herein are system, method, and computer program product embodiments for performing a regression analysis on lawfully collected personal data records. The analysis enables discovery of individuals likely to perform certain actions based on their personal data records and the personal data records and actions of others. The disclosed system, method, and computer program product may process vast quantities of data, including personal data records with thousands of categories and lawfully stored databases with millions of personal data records. Through the regression analysis, the disclosed system, method, and computer program product learn the most relevant categories for predicting an individual's actions based on input data provided by a user. The analysis then analyzes the categories of personal data records stored in a lawfully stored database to predict actions of individuals associated with those records and outputs results to the user.
    Type: Application
    Filed: December 28, 2020
    Publication date: April 22, 2021
    Applicant: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond LINDNER
  • Patent number: 10878334
    Abstract: Disclosed herein are system, method, and computer program product embodiments for performing a regression analysis on lawfully collected personal data records. The analysis enables discovery of individuals likely to perform certain actions based on their personal data records and the personal data records and actions of others. The disclosed system, method, and computer program product may process vast quantities of data, including personal data records with thousands of categories and lawfully stored databases with millions of personal data records. Through the regression analysis, the disclosed system, method, and computer program product learn the most relevant categories for predicting an individual's actions based on input data provided by a user. The analysis then analyzes the categories of personal data records stored in a lawfully stored database to predict actions of individuals associated with those records and outputs results to the user.
    Type: Grant
    Filed: March 17, 2016
    Date of Patent: December 29, 2020
    Assignee: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond Lindner
  • Publication number: 20200210457
    Abstract: Disclosed herein are system, method, and computer program product embodiments for linking data records in memory. The system, method, and computer program product includes accessing a first record stored in memory, the first record holding information describing a first person and accessing at least one additional record stored in memory, the additional records holding information describing additional persons. The method continues by parsing the information of the first record and additional record and assigning the parsed information to predefined categories within the respective records. After assigning the information into categories, a similarity score between categorical information in the first record and categorical information of additional records is determined. A category of an additional record is then modified based on the similarity score, so the additional record is associated with the first person.
    Type: Application
    Filed: December 31, 2019
    Publication date: July 2, 2020
    Applicant: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond LINDNER
  • Patent number: 10521456
    Abstract: Disclosed herein are system, method, and computer program product embodiments for linking data records in memory. The system, method, and computer program product includes accessing a first record stored in memory, the first record holding information describing a first person and accessing at least one additional record stored in memory, the additional records holding information describing additional persons. The method continues by parsing the information of the first record and additional record and assigning the parsed information to predefined categories within the respective records. After assigning the information into categories, a similarity score between categorical information in the first record and categorical information of additional records is determined. A category of an additional record is then modified based on the similarity score, so the additional record is associated with the first person.
    Type: Grant
    Filed: March 16, 2016
    Date of Patent: December 31, 2019
    Assignee: VEDA Data Solutions, Inc.
    Inventor: Robert Raymond Lindner