Patents by Inventor Mridul Jain
Mridul Jain has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12360797Abstract: A distributed system including multiple processors associated with non-transitory computer-readable media storing computing instructions. The computing instructions, when collectively executed on the multiple processors, cause the multiple processors collectively to perform certain acts.Type: GrantFiled: January 12, 2023Date of Patent: July 15, 2025Assignee: WALMART APOLLO, LLCInventors: Saigopal Thota, Mridul Jain, Albin Kuruvilla, Pruthvi Raj Eranti, Antriksh Shah
-
Patent number: 12333338Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform ingesting streaming events for processing by multiple models; mapping each of the streaming events to a model of the multiple models; storing each of the streaming events in a respective queue in a respective sequence store, such that a respective one of the multiple models retrieves (i) a respective one of the streaming events in the respective sequence store associated with the respective one of the multiple models and (ii) a respective key corresponding to the respective one of the streaming events from a leaf store, to asynchronously perform the respective machine-learning inferencing, wherein the multiple models run independently and in parallel on multi-tenant threads. Other embodiments are disclosed.Type: GrantFiled: October 22, 2021Date of Patent: June 17, 2025Assignee: WALMART APOLLO, LLCInventors: Saigopal Thota, Mridul Jain, Navinder Pal Singh Brar, Pragya Jain, Giridhar Addepalli, Gajendra Alias Nishad Kamat, Santos Kumar Das
-
Patent number: 12111806Abstract: This application relates to apparatus and methods for automatically associating customer data to a corresponding customer. A computing device may receive linking data identifying a plurality of links, where each like associates at least two nodes that each represent customer data. The computing device may partition the linking data into multiple partitions, and cause a union find algorithm to be executed for each partition in parallel to associate each node with a parent ID. The computing device may iteratively execute a global shuffle algorithm to place all same nodes in a same partition, and may assign a same parent ID to the same nodes. The computing device may iteratively execute a path compression algorithm across all partitions to generate a graph output that associates all child nodes of a same parent node with the same parent ID.Type: GrantFiled: January 31, 2019Date of Patent: October 8, 2024Assignee: Walmart Apollo, LLCInventors: Mridul Jain, Saigopal Thota, Xun Luan, Gajendra Alias Nishad Kamat
-
Publication number: 20240256878Abstract: A system comprising one or more processors and one or more non-transitory computer-readable media storing computing instructions that, when executed on the one or more processors, cause the one or more processors to perform functions comprising: generating pairs of identities from a plurality of sources; for each respective pair of identities of the pairs of identities: determining a match probability for the respective pair of identities using a deep-learning transformer-based binary classification model; and linking the respective pair of identities as nodes on a graph when the match probability meets a predetermined threshold, wherein a linkage between the nodes represents a match for the respective pair of identities; generating, using a connected component algorithm, clusters each containing identities representing a respective user; and generating a respective user profile for the respective user for each cluster. Other embodiments are disclosed.Type: ApplicationFiled: January 31, 2023Publication date: August 1, 2024Applicant: Walmart Apollo, LLCInventors: Neil Palleti, Antriksh Shah, Ashraful Arefeen, Saigopal Thota, Sreenaadh Sreekumar, Mridul Jain, Nishad Kamat, Rijul Magu
-
Publication number: 20240232921Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions that, when executed on the one or more processors, cause the one or more processors to perform operations: creating links between nodes to form connected components based on linkage scores exceeding a predetermined threshold; generating, by using a relaxed blocking criteria, an initial labeled set, wherein the relaxed blocking criteria is configured to prevent loss of data signals; tuning the relaxed blocking criteria by relaxing and tightening a precision threshold associated with the connected components; and generating a quality score for the connected components, wherein the quality score comprises an objective metric to identify a true graph of each user. Other embodiments are described.Type: ApplicationFiled: March 15, 2024Publication date: July 11, 2024Applicant: WALMART APOLLO, LLCInventors: Mridul Jain, Saigopal Thota, Ashraful Arefeen, Antriksh Akshesh Shah, Albin Kuruvilla, Gajendra Aliaas Nishad Kamat, Rijul Magu, Neil Mohan Reddy Palleti
-
Patent number: 11948164Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform: generating linkage scores between nodes at least based on a machine learning model; creating links between the nodes to form connected components based on the linkage scores exceeding a predetermined threshold; generating an actual matching linkage set of the nodes linked in the connected components by using a relaxed blocking criteria; and generating a quality score for the connected components. Other embodiments are disclosed.Type: GrantFiled: November 12, 2021Date of Patent: April 2, 2024Assignee: WALMART APOLLO, LLCInventors: Mridul Jain, Saigopal Thota, Ashraful Arefeen, Antriksh Akshesh Shah, Albin Kuruvilla, Gajendra Alias Nishad Kamat, Rijul Magu, Neil Mohan Reddy Palleti
-
Publication number: 20240070128Abstract: In some examples, a system may, obtain constraint data and customer profile data of a plurality of customers associated with the system. Moreover, for each customer of the plurality of customers, the system may, based on the customer profile data of the customer and the constraint data, generate a score associated with one or more constraints of the plurality of constraints, based on the score of each of the one or more constraints, generate an overall score, and associate the overall score with a customer profile of the customer. Further, the system may, implement operations that generate a clean dataset based on the overall score associated with a customer profile of each of the plurality of customers.Type: ApplicationFiled: August 30, 2022Publication date: February 29, 2024Inventors: Neil Mohan Reddy Palleti, Mridul Jain, Saigopal Thota, Rijul Magu, Puja Maniktala
-
Patent number: 11860867Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform: bundling multiple registered queries of a dataset using a scheduling technique, wherein the dataset is homogenous in schema; running a single table scan of the dataset to process the multiple registered queries of the dataset in parallel; and generating a respective output responsive to each of the multiple registered queries. Other embodiments are disclosed.Type: GrantFiled: August 25, 2021Date of Patent: January 2, 2024Assignee: WALMART APOLLO, LLCInventors: Mridul Jain, Saigopal Thota, Rewati Mahendra Ovalekar, Sébastien Jean-Maurice Olivier Péhu, Saumya Agarwal, Sai Kiran Reddy Malikireddy, Gajendra Alias Nishad Kamat, Mitesh Sinha
-
Patent number: 11734700Abstract: This application relates to apparatus and methods for determining confidence levels in associated data using machine learning algorithms. In some examples, a computing device may generate training graph data where each training graph connects at least two nodes by an edge, and each node represents data. The computing device may train a machine learning algorithm based on the generated training data. The computing device may then receive linked data, which associates at least two nodes, each representing data, with each other. The computing device may generate graph data based on the linking data, to provide to the machine learning algorithm as input. The computing device may then execute the machine learning algorithm on the generated graph data to generate values for each of its edges. The values may identify, for each edge, a confidence level in the connection between the two nodes for that edge.Type: GrantFiled: January 19, 2023Date of Patent: August 22, 2023Assignee: Walmart Apollo, LLCInventors: Mridul Jain, Saigopal Thota, Xun Luan, Gajendra Alias Nishad Kamat
-
Publication number: 20230153848Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform: generating linkage scores between nodes at least based on a machine learning model; creating links between the nodes to form connected components based on the linkage scores exceeding a predetermined threshold; generating an actual matching linkage set of the nodes linked in the connected components by using a relaxed blocking criteria; and generating a quality score for the connected components. Other embodiments are disclosed.Type: ApplicationFiled: November 12, 2021Publication date: May 18, 2023Applicant: Walmart Apollo, LLCInventors: Mridul Jain, Saigopal Thota, Ashraful Arefeen, Antriksh Akshesh Shah, Albin Kuruvilla, Gajendra Alias Nishad Kamat, Rijul Magu, Neil Mohan Reddy Palleti
-
Publication number: 20230153841Abstract: This application relates to apparatus and methods for determining confidence levels in associated data using machine learning algorithms. In some examples, a computing device may generate training graph data where each training graph connects at least two nodes by an edge, and each node represents data. The computing device may train a machine learning algorithm based on the generated training data. The computing device may then receive linked data, which associates at least two nodes, each representing data, with each other. The computing device may generate graph data based on the linking data, to provide to the machine learning algorithm as input. The computing device may then execute the machine learning algorithm on the generated graph data to generate values for each of its edges. The values may identify, for each edge, a confidence level in the connection between the two nodes for that edge.Type: ApplicationFiled: January 19, 2023Publication date: May 18, 2023Inventors: Mridul Jain, Saigopal Thota, Xun Luan, Gajendra Alias Nishad Kamat
-
Publication number: 20230145505Abstract: A distributed system including multiple processors associated with non-transitory computer-readable media storing computing instructions. The computing instructions, when collectively executed on the multiple processors, cause the multiple processors collectively to perform certain acts.Type: ApplicationFiled: January 12, 2023Publication date: May 11, 2023Applicant: Walmart Apollo, LLCInventors: Saigopal Thota, Mridul Jain, Albin Kuruvilla, Pruthvi Raj Eranti, Antriksh Shah
-
Publication number: 20230128987Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform ingesting streaming events for processing by multiple models; mapping each of the streaming events to a model of the multiple models; storing each of the streaming events in a respective queue in a respective sequence store, such that a respective one of the multiple models retrieves (i) a respective one of the streaming events in the respective sequence store associated with the respective one of the multiple models and (ii) a respective key corresponding to the respective one of the streaming events from a leaf store, to asynchronously perform the respective machine-learning inferencing, wherein the multiple models run independently and in parallel on multi-tenant threads. Other embodiments are disclosed.Type: ApplicationFiled: October 22, 2021Publication date: April 27, 2023Applicant: Walmart Apollo, LLCInventors: Saigopal Thota, Mridul Jain, Navinder Pal Singh Brar, Pragya Jain, Giridhar Addepalli, Gajendra Alias Nishad Kamat, Santos Kumar Das
-
Patent number: 11604942Abstract: This application relates to apparatus and methods for determining confidence levels in associated data using machine learning algorithms. In some examples, a computing device may generate training graph data where each training graph connects at least two nodes by an edge, and each node represents data. The computing device may train a machine learning algorithm based on the generated training data. The computing device may then receive linked data, which associates at least two nodes, each representing data, with each other. The computing device may generate graph data based on the linking data, to provide to the machine learning algorithm as input. The computing device may then execute the machine learning algorithm on the generated graph data to generate values for each of its edges. The values may identify, for each edge, a confidence level in the connection between the two nodes for that edge.Type: GrantFiled: January 31, 2019Date of Patent: March 14, 2023Assignee: Walmart Apollo, LLCInventors: Mridul Jain, Saigopal Thota, Xun Luan, Gajendra Alias Nishad Kamat
-
Publication number: 20230068831Abstract: A system including one or more processors and one or more non-transitory computer-readable media storing computing instructions configured to run on the one or more processors and perform: bundling multiple registered queries of a dataset using a scheduling technique, wherein the dataset is homogenous in schema; running a single table scan of the dataset to process the multiple registered queries of the dataset in parallel; and generating a respective output responsive to each of the multiple registered queries. Other embodiments are disclosed.Type: ApplicationFiled: August 25, 2021Publication date: March 2, 2023Applicant: Walmart Apollo, LLCInventors: Mridul Jain, Saigopal Thota, Rewati Mahendra Ovalekar, Sébastien Jean-Maurice Olivier Péhu, Saumya Agarwal, Sai Kiran Reddy Malikireddy, Gajendra Alias Nishad Kamat, Mitesh Sinha
-
Patent number: 11556370Abstract: A distributed system including multiple processing nodes. The distributed system can perform certain acts. The acts can include receiving a set of input nodes and a set of criteria. The acts can include obtaining an adjacency list representing a large connected component. The large connected component can include nodes, edges, and edge metadata. A quantity of the nodes of the large connected component can exceed 1 billion. The adjacency list can be distributed across the multiple processing nodes. The nodes of the large connected component can include the input nodes. The acts also can include performing one or more iterations of traversing the large connected component until a stopping condition is satisfied.Type: GrantFiled: January 30, 2020Date of Patent: January 17, 2023Assignee: WALMART APOLLO, LLCInventors: Saigopal Thota, Mridul Jain, Albin Kuruvilla, Pruthvi Raj Eranti, Antriksh Shah
-
Patent number: 11501185Abstract: Systems and methods of real-time modeling pipeline inferencing are disclosed. At least one model configured to calculate at least one metric from one or more features is deployed. A model inferencing pipeline configured to extract the one or more features from a customer-specific data pipeline is implemented for the at least one mode. The model inferencing pipeline is generated using a training data set extracted from a cross-customer data pipeline. The at least one metric is calculated using the one or more features extracted from the customer-specific data pipeline.Type: GrantFiled: January 30, 2019Date of Patent: November 15, 2022Assignee: Walmart Apollo, LLCInventors: Mridul Jain, Gajendra Alias Nishad Kamat, Pawan Gupta, Saurabh Agrawal
-
Patent number: 11281657Abstract: A distributed system including multiple processing nodes. The distributed system can perform certain acts. The acts can include receiving a first conflation event identifying a first node and a second node. The first node can be part of a first set. The first set can include a sole parent node stored at a first processing node of the multiple processing nodes. The second node can be part of a second set. The second set can include a sole parent node stored at a second processing node of the multiple processing nodes. The first and second sets can be disjoint sets. The first conflation event can be received at an event-driven stream application at one of the multiple processing nodes. The acts also can include conflating the first set and the second set into a conflated set. The conflated set can include the first and second nodes. The conflated set can include a sole parent node.Type: GrantFiled: January 30, 2020Date of Patent: March 22, 2022Assignee: WALMART APOLLO, LLCInventors: Deepak Goyal, Giridhar Addepalli, Sebastien Jean-Maurice Olivier Pehu, Saigopal Thota, Mridul Jain, Navinder Pal Singh Brar
-
Patent number: 11238365Abstract: The present teaching relates to a method and system for validating labels of training data. A first group of data records associated with the training data are received, wherein each of the first group of data records includes a vector having at least one feature and a first label. For each of the first group of data records, a second label is determined based on the at least one feature in accordance with a first model. Thereafter, a loss based on the first label associated with the data record and the second label is obtained, and the data record having an incorrect first label is classified when the loss meets a pre-determined criterion. Upon classifying the data records, a sub-group of the first group of data records is generated, wherein each of the data records included in the sub-group has the incorrect first label.Type: GrantFiled: December 29, 2017Date of Patent: February 1, 2022Assignee: VERIZON MEDIA INC.Inventors: Francis Hsu, Mridul Jain, Saurabh Tewari
-
Publication number: 20210342747Abstract: The present teaching relates to distributed deep machine learning on a cluster. In one example, a request is received for estimating one or more parameters associated with a machine learning model on a cluster including a plurality of nodes. A set of data is obtained to be used for estimating the one or more parameters. The set of data is divided into a plurality of sub-sets of data, each of which corresponds to one of the plurality of nodes. Each sub-set of data is allocated to a corresponding node for estimating values of the one or more parameters based on the sub-set of data. Estimated values of the one or more parameters obtained based on a corresponding sub-set of data allocated to the node, are received from each of the plurality of nodes. The one or more parameters of the machine learning model are estimated based on the estimated values of the one or more parameters generated by at least some of the plurality of nodes.Type: ApplicationFiled: July 15, 2021Publication date: November 4, 2021Inventors: Andrew Feng, Jun Shi, Mridul Jain, Peter Cnudde