Patents by Inventor Yuya Jeremy Ong
Yuya Jeremy Ong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240144027Abstract: A method, a computer program product, and a system of personalized training a machine learning model using federated learning with gradient boosted trees. The method includes training a global machine learning model using federated learning between a plurality of parties. The method also includes distributing the global machine learning model to each of the parties and receiving personalized model updates from each of the parties. The personalized model updates are generated from updated models boosted locally and produced by each of the parties using their respective local data. The method further includes fusing the personalized model updates to produce a boosted decision tree to update the global machine learning model. The method also includes training global machine learning model, iteratively, in this manner until a stopping criterion is achieved.Type: ApplicationFiled: February 27, 2023Publication date: May 2, 2024Inventors: Yuya Jeremy Ong, Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo Angel
-
Publication number: 20240127084Abstract: Methods, systems, and computer program products for a joint prediction and improvement framework for machine learning models are provided herein. A method includes obtaining a machine learning model initialized with a set of parameters; identifying one or more actions based on test inputs corresponding to the machine learning model and historical actions related to a task, where the historical actions are dependent on respective historical outputs of the machine learning model; using the identified one or more actions to jointly compute: one or more first values corresponding to inference loss for the machine learning model; and one or more second values based at least in part on a computing cost function associated with the task; and updating the set of parameters of the machine learning model based on the one or more first values and the one or more second values.Type: ApplicationFiled: September 29, 2022Publication date: April 18, 2024Inventors: Yuya Jeremy Ong, Aly Megahed, Mark S. Squillante, Yingdong Lu, Yitao Liang, Pravar Mahajan
-
Publication number: 20240005215Abstract: A method, system, and computer program product for training models for federated learning. The method determines, by a federated learning aggregator, a set of sample ratios for a set of participant systems. Each sample ratio is associated with a distinct participant system. A set of participant epsilon values are generated for the set of participant systems with each participant epsilon value being associated with a participant system of the set of participant systems. A set of surrogate data sets are received for the set of participant systems with each surrogate data set representing a data set of a participant system. The federated learning aggregator generates a set of local models. Each local model is generated based on a first global model. The method generates a second global model based on a prediction set generated by the set of participant systems using the set of local models.Type: ApplicationFiled: June 29, 2022Publication date: January 4, 2024Inventors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo Angel
-
Publication number: 20230409959Abstract: According to one embodiment, a method, computer system, and computer program product for grouped federated learning is provided. The embodiment may include initializing a plurality of aggregation groups including a plurality of parties and a plurality of local aggregators. The embodiment may also include submitting a query to a first party from the plurality of parties. The embodiment may further include submitting an initial response to the query from the first party or a second party from the plurality of parties to a first local aggregator from the plurality of local aggregators. The embodiment may also include submitting a final response from the first local aggregator or a second local aggregator from the plurality of local aggregators to a global aggregator. The embodiment may further include building a machine learning model based on the final response.Type: ApplicationFiled: June 21, 2022Publication date: December 21, 2023Inventors: Ali Anwar, Yi Zhou, NATHALIE BARACALDO ANGEL, Runhua Xu, YUYA JEREMY ONG, Annie K Abay, Heiko H. Ludwig, Gegi Thomas, Jayaram Kallapalayam Radhakrishnan, Laura Wynter
-
Patent number: 11768912Abstract: A computer-implemented method according to one embodiment includes receiving historical two-dimensional (2D) multivariate time series data; transforming the historical 2D multivariate time series data into a three-dimensional (3D) temporal tensor; training one or more deep volumetric 3D convolutional neural networks (CNNs), utilizing the 3D temporal tensor; and predicting future values for additional multivariate time series data, utilizing the one or more trained deep volumetric 3D CNNs.Type: GrantFiled: July 12, 2019Date of Patent: September 26, 2023Assignee: International Business Machines CorporationInventors: Mu Qiao, Yuya Jeremy Ong, Divyesh Jadav
-
Publication number: 20230086103Abstract: In a method for determining anomalous behavior of a candidate taking an exam, a processor receives first exam interface input values captured during an exam session on a candidate testing device. A processor generates a first interaction vector from the first exam interface input values. A processor generates a first interaction timeline from the first interaction vector. A processor determines an anomalous behavior based on a relationship between the first interaction timeline and a selected classification cluster.Type: ApplicationFiled: September 17, 2021Publication date: March 23, 2023Inventors: Nitin Ramchandani, Eric Kevin Butler, ROBERT ENGEL, ALY MEGAHED, YUYA JEREMY ONG
-
Publication number: 20230017500Abstract: One embodiment of the invention provides a method for federated learning (FL) comprising training a machine learning (ML) model collaboratively by initiating a round of FL across data parties. Each data party is allocated tokens to utilize during the training. The method further comprises maintaining, for each data party, a corresponding data usage profile indicative of an amount of data the data party consumed during the training and a corresponding participation profile indicative of an amount of data the data party provided during the training. The method further comprises selectively allocating new tokens to the data parties based on each participation profile maintained, selectively allocating additional new tokens to the data parties based on each data usage profile maintained, and reimbursing one or more tokens utilized during the training to the data parties based on one or more measurements of accuracy of the ML model.Type: ApplicationFiled: July 12, 2021Publication date: January 19, 2023Inventors: Ali Anwar, Syed Amer Zawad, Yi Zhou, Nathalie Baracaldo Angel, Kamala Micaela Noelle Varma, Annie Abay, Ebube Chuba, Yuya Jeremy Ong, Heiko H. Ludwig
-
Publication number: 20220414531Abstract: An approach for providing prediction and optimization of an adversarial machine-learning model is disclosed. The approach can comprise of a training method for a defender that determines the optimal amount of adversarial training that would prevent the task optimization model from taking wrong decisions caused by an adversarial attack from the input into the model within the simultaneous predict and optimization framework. Essentially, the approach would train a robust model via adversarial training. Based on the robust training model, the user can mitigate against potential threats by (adversarial noise in the task-based optimization model) based on the given inputs from the machine learning prediction that was produced by an input.Type: ApplicationFiled: June 25, 2021Publication date: December 29, 2022Inventors: YUYA JEREMY ONG, NATHALIE BARACALDO ANGEL, ALY MEGAHED, Ebube Chuba, Yi Zhou
-
Publication number: 20220374327Abstract: A method of using a computing device to compare performance of multiple algorithms. The method includes receiving, by a computing device, multiple algorithms to assess. The computing device further receives a total amount of resources to allocate to the multiple algorithms. The computing device additionally assigns a fair share of the total amount of resources to each of the multiple algorithms. The computing device still further executes each of the multiple algorithms using the assigned fair share of the total amount of resources. The computing device additionally compares the performance of each of the multiple based on at least one of multiple hardware relative utility metrics describing a hardware relative utility of any given resource allocation for each of the multiple algorithms.Type: ApplicationFiled: April 29, 2021Publication date: November 24, 2022Inventors: Robert Engel, Aly Megahed, Eric Kevin Butler, Nitin Ramchandani, Yuya Jeremy Ong
-
Publication number: 20220318823Abstract: A system may include a memory and a processor in communication with the memory. The processor may be configured to perform operations that include analyzing interactions by a user within a network and generating a user profile for the user. The operations by the processor may further include identifying an attempt by the user to share a post via the network and prompting the user with a personalized alert to evaluate the post, wherein the personalized alert is generated based on the interactions, the user profile, and the properties of the post.Type: ApplicationFiled: March 31, 2021Publication date: October 6, 2022Inventors: Marisa Affonso Vasconcelos, Mu Qiao, Nicholas Linck, YUYA JEREMY ONG, Claudio Santos Pinhanez, Rogerio Abreu de Paula
-
Publication number: 20220269938Abstract: To reduce misinformation consumption in the media, a computer-implemented method is described for presenting thought-provoking information about a media product that includes receiving media consumption data indicating a media product was consumed via a computing device user interface; determining claims for the media product; identifying a plurality of related media products based at least on a topic of the media product; determining positions for the plurality of related media products with respect to the one or more claims; determining a most contested claim as a claim that satisfies a condition corresponding to having a predetermined number of disagreeing related media products; generating a question based on the most contested claim and a paragraph including the most contested claim; generating an answer to the question based on the question and the related media product that disagrees with the most contested claim; and presenting the question and answer via the user interface.Type: ApplicationFiled: February 25, 2021Publication date: August 25, 2022Inventors: Nicholas Linck, Mu Qiao, Yuya Jeremy Ong, Marisa Affonso Vasconcelos, Claudio Santos Pinhanez, Rogerio Abreu de Paula
-
Publication number: 20220207001Abstract: A method of using a computing device executing to interrelate two or more corpuses of dissimilar data that includes receiving input data from each of two or more corpuses of dissimilar data. The computing device computes a pass for each of the input data into two or more encoder-decoder models. The computing device further obtains a prediction of an identity mapping for each of different domains of knowledge from each of the two or more encoder-decoder models. The computing device additionally computes a distribution distance metric as an output from each of a low-dimensional embedding vector representation from each of the two or more encoder-decoder models. The computing device still further computes a function based on each of the predictions from each of the two or more encoder-decoder models and the distribution distance metrics. The computing device additionally updates the two or more encoder-decoder models.Type: ApplicationFiled: December 31, 2020Publication date: June 30, 2022Inventors: Yuya Jeremy Ong, Eric Kevin Butler, Robert Engel, German H Flores, Aly Megahed, Nitin Ramchandani
-
Publication number: 20220197977Abstract: A computer-implemented method is provided for predicting future data values or target labels of multivariate time series data. The method includes receiving the multivariate time series data having present values, systematic missing values, and random missing values. The method further includes masking the present values, the systematic missing values, and the random missing values using triplet encodings. The method also includes determining time intervals between current missing values, from among the systematic missing values and the random missing values, and immediately preceding ones of the present values. The method additionally includes training, by a computing device, at least one recurrent neural network with the triplet encodings, the time intervals, and multivariate time series data to perform a feedforward pass on the recurrent neural network predicting the future data values or the target labels.Type: ApplicationFiled: December 22, 2020Publication date: June 23, 2022Inventors: Mu Qiao, Yuya Jeremy Ong, Prithviraj Sen, Berthold Reinwald
-
Publication number: 20220147667Abstract: Systems, computer-implemented methods, and computer program products to facilitate generalization of a quantum imaginary time evolution process and simulation by tensor networks are provided. According to an embodiment, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a universal component that generalizes a quantum imaginary time evolution process to provide a quantum imaginary time evolution ansatz process. The computer executable components can further comprise a simulation component that applies the quantum imaginary time evolution ansatz process to one or more general multi-body quantum systems.Type: ApplicationFiled: November 6, 2020Publication date: May 12, 2022Inventors: Mario Motta, Yuya Jeremy Ong, Barbara Anne Jones, Joseph A. Latone
-
Publication number: 20220083906Abstract: A method, a computer program product, and a system of training a machine learning model using federated learning with extreme gradient boosting. The method includes computing an epsilon hyperparameter using training dataset sizes from a first party and a second party. The method also includes transmitting a machine learning model and the epsilon hyperparameter to the first party and the second party and receiving a first model update and a second model update from the first party and the second party respectively. The method further includes fusing the first model update and the second model update to produce a global histogram and determining at least one split candidate in a decision tree used by the machine learning model using the global histogram. The method also includes rebuilding the machine learning model by adding the split candidate to a decision tree of the machine learning model.Type: ApplicationFiled: September 16, 2020Publication date: March 17, 2022Inventors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo Angel
-
Patent number: 11275597Abstract: Techniques for augmenting data visualizations based on user interactions to enhance user experience are provided. In one aspect, a method for providing real-time recommendations to a user includes: capturing user interactions with a data visualization, wherein the user interactions include images captured as the user interacts with the data visualization; building stacks of the user interactions, wherein the stacks of the user interactions are built from sequences of the user interactions captured over time; generating embeddings for the stacks of the user interactions; finding clusters of embeddings having similar properties; and making the real-time recommendations to the user based on the clusters of embeddings having the similar properties.Type: GrantFiled: January 29, 2021Date of Patent: March 15, 2022Assignee: International Business Machines CorporationInventors: German H Flores, Eric Kevin Butler, Robert Engel, Aly Megahed, Yuya Jeremy Ong, Nitin Ramchandani
-
Publication number: 20210012191Abstract: A computer-implemented method according to one embodiment includes receiving historical two-dimensional (2D) multivariate time series data; transforming the historical 2D multivariate time series data into a three-dimensional (3D) temporal tensor; training one or more deep volumetric 3D convolutional neural networks (CNNs), utilizing the 3D temporal tensor; and predicting future values for additional multivariate time series data, utilizing the one or more trained deep volumetric 3D CNNs.Type: ApplicationFiled: July 12, 2019Publication date: January 14, 2021Inventors: Mu Qiao, Yuya Jeremy Ong, Divyesh Jadav