Patents by Inventor Yuya Jeremy Ong

Yuya Jeremy Ong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240144027
    Abstract: A method, a computer program product, and a system of personalized training a machine learning model using federated learning with gradient boosted trees. The method includes training a global machine learning model using federated learning between a plurality of parties. The method also includes distributing the global machine learning model to each of the parties and receiving personalized model updates from each of the parties. The personalized model updates are generated from updated models boosted locally and produced by each of the parties using their respective local data. The method further includes fusing the personalized model updates to produce a boosted decision tree to update the global machine learning model. The method also includes training global machine learning model, iteratively, in this manner until a stopping criterion is achieved.
    Type: Application
    Filed: February 27, 2023
    Publication date: May 2, 2024
    Inventors: Yuya Jeremy Ong, Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo Angel
  • Publication number: 20240127084
    Abstract: Methods, systems, and computer program products for a joint prediction and improvement framework for machine learning models are provided herein. A method includes obtaining a machine learning model initialized with a set of parameters; identifying one or more actions based on test inputs corresponding to the machine learning model and historical actions related to a task, where the historical actions are dependent on respective historical outputs of the machine learning model; using the identified one or more actions to jointly compute: one or more first values corresponding to inference loss for the machine learning model; and one or more second values based at least in part on a computing cost function associated with the task; and updating the set of parameters of the machine learning model based on the one or more first values and the one or more second values.
    Type: Application
    Filed: September 29, 2022
    Publication date: April 18, 2024
    Inventors: Yuya Jeremy Ong, Aly Megahed, Mark S. Squillante, Yingdong Lu, Yitao Liang, Pravar Mahajan
  • Publication number: 20240005215
    Abstract: A method, system, and computer program product for training models for federated learning. The method determines, by a federated learning aggregator, a set of sample ratios for a set of participant systems. Each sample ratio is associated with a distinct participant system. A set of participant epsilon values are generated for the set of participant systems with each participant epsilon value being associated with a participant system of the set of participant systems. A set of surrogate data sets are received for the set of participant systems with each surrogate data set representing a data set of a participant system. The federated learning aggregator generates a set of local models. Each local model is generated based on a first global model. The method generates a second global model based on a prediction set generated by the set of participant systems using the set of local models.
    Type: Application
    Filed: June 29, 2022
    Publication date: January 4, 2024
    Inventors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo Angel
  • Publication number: 20230409959
    Abstract: According to one embodiment, a method, computer system, and computer program product for grouped federated learning is provided. The embodiment may include initializing a plurality of aggregation groups including a plurality of parties and a plurality of local aggregators. The embodiment may also include submitting a query to a first party from the plurality of parties. The embodiment may further include submitting an initial response to the query from the first party or a second party from the plurality of parties to a first local aggregator from the plurality of local aggregators. The embodiment may also include submitting a final response from the first local aggregator or a second local aggregator from the plurality of local aggregators to a global aggregator. The embodiment may further include building a machine learning model based on the final response.
    Type: Application
    Filed: June 21, 2022
    Publication date: December 21, 2023
    Inventors: Ali Anwar, Yi Zhou, NATHALIE BARACALDO ANGEL, Runhua Xu, YUYA JEREMY ONG, Annie K Abay, Heiko H. Ludwig, Gegi Thomas, Jayaram Kallapalayam Radhakrishnan, Laura Wynter
  • Patent number: 11768912
    Abstract: A computer-implemented method according to one embodiment includes receiving historical two-dimensional (2D) multivariate time series data; transforming the historical 2D multivariate time series data into a three-dimensional (3D) temporal tensor; training one or more deep volumetric 3D convolutional neural networks (CNNs), utilizing the 3D temporal tensor; and predicting future values for additional multivariate time series data, utilizing the one or more trained deep volumetric 3D CNNs.
    Type: Grant
    Filed: July 12, 2019
    Date of Patent: September 26, 2023
    Assignee: International Business Machines Corporation
    Inventors: Mu Qiao, Yuya Jeremy Ong, Divyesh Jadav
  • Publication number: 20230086103
    Abstract: In a method for determining anomalous behavior of a candidate taking an exam, a processor receives first exam interface input values captured during an exam session on a candidate testing device. A processor generates a first interaction vector from the first exam interface input values. A processor generates a first interaction timeline from the first interaction vector. A processor determines an anomalous behavior based on a relationship between the first interaction timeline and a selected classification cluster.
    Type: Application
    Filed: September 17, 2021
    Publication date: March 23, 2023
    Inventors: Nitin Ramchandani, Eric Kevin Butler, ROBERT ENGEL, ALY MEGAHED, YUYA JEREMY ONG
  • Publication number: 20230017500
    Abstract: One embodiment of the invention provides a method for federated learning (FL) comprising training a machine learning (ML) model collaboratively by initiating a round of FL across data parties. Each data party is allocated tokens to utilize during the training. The method further comprises maintaining, for each data party, a corresponding data usage profile indicative of an amount of data the data party consumed during the training and a corresponding participation profile indicative of an amount of data the data party provided during the training. The method further comprises selectively allocating new tokens to the data parties based on each participation profile maintained, selectively allocating additional new tokens to the data parties based on each data usage profile maintained, and reimbursing one or more tokens utilized during the training to the data parties based on one or more measurements of accuracy of the ML model.
    Type: Application
    Filed: July 12, 2021
    Publication date: January 19, 2023
    Inventors: Ali Anwar, Syed Amer Zawad, Yi Zhou, Nathalie Baracaldo Angel, Kamala Micaela Noelle Varma, Annie Abay, Ebube Chuba, Yuya Jeremy Ong, Heiko H. Ludwig
  • Publication number: 20220414531
    Abstract: An approach for providing prediction and optimization of an adversarial machine-learning model is disclosed. The approach can comprise of a training method for a defender that determines the optimal amount of adversarial training that would prevent the task optimization model from taking wrong decisions caused by an adversarial attack from the input into the model within the simultaneous predict and optimization framework. Essentially, the approach would train a robust model via adversarial training. Based on the robust training model, the user can mitigate against potential threats by (adversarial noise in the task-based optimization model) based on the given inputs from the machine learning prediction that was produced by an input.
    Type: Application
    Filed: June 25, 2021
    Publication date: December 29, 2022
    Inventors: YUYA JEREMY ONG, NATHALIE BARACALDO ANGEL, ALY MEGAHED, Ebube Chuba, Yi Zhou
  • Publication number: 20220374327
    Abstract: A method of using a computing device to compare performance of multiple algorithms. The method includes receiving, by a computing device, multiple algorithms to assess. The computing device further receives a total amount of resources to allocate to the multiple algorithms. The computing device additionally assigns a fair share of the total amount of resources to each of the multiple algorithms. The computing device still further executes each of the multiple algorithms using the assigned fair share of the total amount of resources. The computing device additionally compares the performance of each of the multiple based on at least one of multiple hardware relative utility metrics describing a hardware relative utility of any given resource allocation for each of the multiple algorithms.
    Type: Application
    Filed: April 29, 2021
    Publication date: November 24, 2022
    Inventors: Robert Engel, Aly Megahed, Eric Kevin Butler, Nitin Ramchandani, Yuya Jeremy Ong
  • Publication number: 20220318823
    Abstract: A system may include a memory and a processor in communication with the memory. The processor may be configured to perform operations that include analyzing interactions by a user within a network and generating a user profile for the user. The operations by the processor may further include identifying an attempt by the user to share a post via the network and prompting the user with a personalized alert to evaluate the post, wherein the personalized alert is generated based on the interactions, the user profile, and the properties of the post.
    Type: Application
    Filed: March 31, 2021
    Publication date: October 6, 2022
    Inventors: Marisa Affonso Vasconcelos, Mu Qiao, Nicholas Linck, YUYA JEREMY ONG, Claudio Santos Pinhanez, Rogerio Abreu de Paula
  • Publication number: 20220269938
    Abstract: To reduce misinformation consumption in the media, a computer-implemented method is described for presenting thought-provoking information about a media product that includes receiving media consumption data indicating a media product was consumed via a computing device user interface; determining claims for the media product; identifying a plurality of related media products based at least on a topic of the media product; determining positions for the plurality of related media products with respect to the one or more claims; determining a most contested claim as a claim that satisfies a condition corresponding to having a predetermined number of disagreeing related media products; generating a question based on the most contested claim and a paragraph including the most contested claim; generating an answer to the question based on the question and the related media product that disagrees with the most contested claim; and presenting the question and answer via the user interface.
    Type: Application
    Filed: February 25, 2021
    Publication date: August 25, 2022
    Inventors: Nicholas Linck, Mu Qiao, Yuya Jeremy Ong, Marisa Affonso Vasconcelos, Claudio Santos Pinhanez, Rogerio Abreu de Paula
  • Publication number: 20220207001
    Abstract: A method of using a computing device executing to interrelate two or more corpuses of dissimilar data that includes receiving input data from each of two or more corpuses of dissimilar data. The computing device computes a pass for each of the input data into two or more encoder-decoder models. The computing device further obtains a prediction of an identity mapping for each of different domains of knowledge from each of the two or more encoder-decoder models. The computing device additionally computes a distribution distance metric as an output from each of a low-dimensional embedding vector representation from each of the two or more encoder-decoder models. The computing device still further computes a function based on each of the predictions from each of the two or more encoder-decoder models and the distribution distance metrics. The computing device additionally updates the two or more encoder-decoder models.
    Type: Application
    Filed: December 31, 2020
    Publication date: June 30, 2022
    Inventors: Yuya Jeremy Ong, Eric Kevin Butler, Robert Engel, German H Flores, Aly Megahed, Nitin Ramchandani
  • Publication number: 20220197977
    Abstract: A computer-implemented method is provided for predicting future data values or target labels of multivariate time series data. The method includes receiving the multivariate time series data having present values, systematic missing values, and random missing values. The method further includes masking the present values, the systematic missing values, and the random missing values using triplet encodings. The method also includes determining time intervals between current missing values, from among the systematic missing values and the random missing values, and immediately preceding ones of the present values. The method additionally includes training, by a computing device, at least one recurrent neural network with the triplet encodings, the time intervals, and multivariate time series data to perform a feedforward pass on the recurrent neural network predicting the future data values or the target labels.
    Type: Application
    Filed: December 22, 2020
    Publication date: June 23, 2022
    Inventors: Mu Qiao, Yuya Jeremy Ong, Prithviraj Sen, Berthold Reinwald
  • Publication number: 20220147667
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate generalization of a quantum imaginary time evolution process and simulation by tensor networks are provided. According to an embodiment, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a universal component that generalizes a quantum imaginary time evolution process to provide a quantum imaginary time evolution ansatz process. The computer executable components can further comprise a simulation component that applies the quantum imaginary time evolution ansatz process to one or more general multi-body quantum systems.
    Type: Application
    Filed: November 6, 2020
    Publication date: May 12, 2022
    Inventors: Mario Motta, Yuya Jeremy Ong, Barbara Anne Jones, Joseph A. Latone
  • Publication number: 20220083906
    Abstract: A method, a computer program product, and a system of training a machine learning model using federated learning with extreme gradient boosting. The method includes computing an epsilon hyperparameter using training dataset sizes from a first party and a second party. The method also includes transmitting a machine learning model and the epsilon hyperparameter to the first party and the second party and receiving a first model update and a second model update from the first party and the second party respectively. The method further includes fusing the first model update and the second model update to produce a global histogram and determining at least one split candidate in a decision tree used by the machine learning model using the global histogram. The method also includes rebuilding the machine learning model by adding the split candidate to a decision tree of the machine learning model.
    Type: Application
    Filed: September 16, 2020
    Publication date: March 17, 2022
    Inventors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo Angel
  • Patent number: 11275597
    Abstract: Techniques for augmenting data visualizations based on user interactions to enhance user experience are provided. In one aspect, a method for providing real-time recommendations to a user includes: capturing user interactions with a data visualization, wherein the user interactions include images captured as the user interacts with the data visualization; building stacks of the user interactions, wherein the stacks of the user interactions are built from sequences of the user interactions captured over time; generating embeddings for the stacks of the user interactions; finding clusters of embeddings having similar properties; and making the real-time recommendations to the user based on the clusters of embeddings having the similar properties.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: March 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: German H Flores, Eric Kevin Butler, Robert Engel, Aly Megahed, Yuya Jeremy Ong, Nitin Ramchandani
  • Publication number: 20210012191
    Abstract: A computer-implemented method according to one embodiment includes receiving historical two-dimensional (2D) multivariate time series data; transforming the historical 2D multivariate time series data into a three-dimensional (3D) temporal tensor; training one or more deep volumetric 3D convolutional neural networks (CNNs), utilizing the 3D temporal tensor; and predicting future values for additional multivariate time series data, utilizing the one or more trained deep volumetric 3D CNNs.
    Type: Application
    Filed: July 12, 2019
    Publication date: January 14, 2021
    Inventors: Mu Qiao, Yuya Jeremy Ong, Divyesh Jadav