Patents by Inventor Afrah Shafquat

Afrah Shafquat has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11977550
    Abstract: Generating a synthetic longitudinal dataset includes identifying subsequence patterns in records defining event sequences for patients. Feature vectors are determined, each characterizing a corresponding one of the records, based on the subsequence patterns. The feature vectors are embedded in a lower dimension space. A seed record is iteratively selected from among the records. In each iteration, subsequence patterns are identified in a subset of the records, and instances of subsequence patterns in the seed record are replaced with instances of similar subsequence patterns identified in the subset of the records to form a modified seed record. The iterations are repeated until all of the records have been selected as the seed record. The modified seed records are combined to form the synthetic dataset.
    Type: Grant
    Filed: April 12, 2023
    Date of Patent: May 7, 2024
    Assignee: Medidata Solutions, Inc.
    Inventors: Jacob Aptekar, Mandis S. Beigi, Pierre-Louis Bourlon, Jason Mezey, Afrah Shafquat, Jimeng Sun
  • Patent number: 11640446
    Abstract: A method for generating a synthetic dataset from an original dataset includes encoding categorical features of the original dataset, embedding the encoded dataset in a low-dimensional space, selecting a seed record from the embedded dataset, identifying a plurality of nearest neighbor records to the seed record, generating a new record by randomly selecting features from the plurality of nearest neighbor records, and concatenating the new record into the synthetic dataset. For a synthetic dataset that contains N records, which may be the same as or different from the number of records in the original dataset, the selecting, identifying, generating, and concatenating operations operate a total of N times on the records in the embedded dataset.
    Type: Grant
    Filed: August 19, 2021
    Date of Patent: May 2, 2023
    Assignee: Medidata Solutions, Inc.
    Inventors: Mandis Beigi, Jacob Aptekar, Afrah Shafquat, Jason Mezey
  • Publication number: 20230060848
    Abstract: A method for generating a synthetic dataset from an original dataset includes encoding categorical features of the original dataset, embedding the encoded dataset in a low-dimensional space, selecting a seed record from the embedded dataset, identifying a plurality of nearest neighbor records to the seed record, generating a new record by randomly selecting features from the plurality of nearest neighbor records, and concatenating the new record into the synthetic dataset. For a synthetic dataset that contains N records, which may be the same as or different from the number of records in the original dataset, the selecting, identifying, generating, and concatenating operations operate a total of N times on the records in the embedded dataset.
    Type: Application
    Filed: August 19, 2021
    Publication date: March 2, 2023
    Inventors: Mandis Beigi, Jacob Aptekar, Afrah Shafquat, Jason Mezey
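
The steps named in the abstract of patent 11640446 (encode, embed, select a seed, find nearest neighbors, sample features, concatenate) can be illustrated with a minimal Python sketch. This is not the patented implementation. The abstract does not specify an encoding, an embedding, or a distance metric, so the one-hot encoding, Gaussian random projection, Euclidean distance, exclusion of the seed from its own neighbors, and every function name, parameter, and data value below are assumptions made for illustration only.

```python
import math
import random


def one_hot_encode(records, categorical_keys):
    """Encode categorical fields as 0/1 indicators; numeric fields pass through unscaled."""
    levels = {k: sorted({r[k] for r in records}) for k in categorical_keys}
    encoded = []
    for r in records:
        row = []
        for key in sorted(r):
            if key in levels:
                row.extend(1.0 if r[key] == lv else 0.0 for lv in levels[key])
            else:
                row.append(float(r[key]))
        encoded.append(row)
    return encoded


def random_projection(rows, dim, rng):
    """Embed rows in `dim` dimensions using a Gaussian random matrix (assumed stand-in)."""
    width = len(rows[0])
    matrix = [[rng.gauss(0.0, 1.0) / math.sqrt(dim) for _ in range(width)]
              for _ in range(dim)]
    return [[sum(m * x for m, x in zip(mrow, row)) for mrow in matrix] for row in rows]


def nearest_neighbors(embedded, seed_index, k):
    """Indices of the k records closest to the seed in the embedded space.

    Excluding the seed itself is an assumption of this sketch.
    """
    seed = embedded[seed_index]
    others = [i for i in range(len(embedded)) if i != seed_index]
    others.sort(key=lambda i: math.dist(seed, embedded[i]))
    return others[:k]


def generate_synthetic(records, categorical_keys, n_records, k=3, dim=2, seed=0):
    """Build n_records synthetic records; n_records may differ from len(records)."""
    rng = random.Random(seed)
    embedded = random_projection(one_hot_encode(records, categorical_keys), dim, rng)
    synthetic = []
    for _ in range(n_records):
        seed_index = rng.randrange(len(records))            # select a seed record
        neighbors = nearest_neighbors(embedded, seed_index, k)
        # Draw each feature independently from a randomly chosen neighbor.
        new_record = {key: records[rng.choice(neighbors)][key] for key in records[0]}
        synthetic.append(new_record)                        # concatenate into the output
    return synthetic


# Hypothetical toy data, invented for this example.
patients = [
    {"sex": "F", "arm": "placebo", "age": 34},
    {"sex": "M", "arm": "treatment", "age": 51},
    {"sex": "F", "arm": "treatment", "age": 47},
    {"sex": "M", "arm": "placebo", "age": 29},
    {"sex": "F", "arm": "placebo", "age": 62},
    {"sex": "M", "arm": "treatment", "age": 40},
]
synthetic = generate_synthetic(patients, ["sex", "arm"], n_records=4)
```

Because each feature is drawn from a possibly different neighbor, a synthetic record can combine values that never appeared together in one original record, while every individual value still comes from the original data.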