Patents by Inventor Ashwin Krishnan

Ashwin Krishnan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND SYSTEM FOR DEPLOYMENT OF LARGE LANGUAGE MODELS (LLM) IN CLOUD INSTANCES

Publication number: 20260099706

Abstract: Existing model deployment approaches have the disadvantage that they do not consider the feasibility of cloud instances for hosting a given LLM. Embodiments disclosed herein provide a method and system for deployment of LLMs in a plurality of cloud instances. The system checks the feasibility of the plurality of cloud instances for hosting an LLM, based on the size of the LLM and the storage space in each of the cloud instances. Further, a latency value for a plurality of batch sizes is determined for a plurality of LLM-accelerator pairs, in each of the plurality of cloud instances identified as feasible based on the feasibility check, using a performance model. Furthermore, a recommendation of one of the plurality of cloud instances identified as feasible is generated, based on the determined latency, a measured cost of deployment, a user workload, an application type, a plurality of latency constraints, and an evaluated performance.

Type: Application

Filed: September 15, 2025

Publication date: April 9, 2026

Applicant: Tata Consultancy Services Limited

Inventors: Ashwin KRISHNAN, Venkatesh PASUMARTI, Samarth Sudarshan INAMDAR, Arghyajoy MONDAL, Manoj Karunakaran NAMBIAR, Rekha SINGHAL
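The two-stage flow described in the abstract of the entry above (a storage-based feasibility check, then a recommendation among the feasible instances) can be illustrated with a minimal sketch. This is not the patented method: the `Instance` fields, the example prices and latencies, and the "cheapest instance that meets the latency constraint" rule are all assumptions made for illustration, and the latency table stands in for whatever performance model the application actually uses.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    name: str             # hypothetical instance label
    storage_gb: float     # storage available for model weights
    cost_per_hour: float  # assumed hourly price
    latency_ms: dict      # batch size -> latency from some performance model


def feasible(instances, model_size_gb):
    """Keep only the instances whose storage can hold the model."""
    return [i for i in instances if i.storage_gb >= model_size_gb]


def recommend(instances, model_size_gb, batch_size, max_latency_ms):
    """Return the cheapest feasible instance meeting the latency bound, or None."""
    candidates = [
        i
        for i in feasible(instances, model_size_gb)
        if batch_size in i.latency_ms
        and i.latency_ms[batch_size] <= max_latency_ms
    ]
    return min(candidates, key=lambda i: i.cost_per_hour, default=None)


# Made-up example data, not taken from the application.
instances = [
    Instance("small", 16, 1.0, {1: 40.0, 8: 120.0}),
    Instance("medium", 48, 2.5, {1: 30.0, 8: 90.0}),
    Instance("large", 96, 6.0, {1: 20.0, 8: 60.0}),
]
choice = recommend(instances, model_size_gb=26, batch_size=8, max_latency_ms=100.0)
```

With these made-up numbers, "small" is filtered out by the storage check, and the cheaper of the two remaining instances is selected. The abstract lists further inputs (user workload, application type, evaluated performance) that this sketch leaves out.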
DEMAND FORECASTING SYSTEM

Publication number: 20260080422

Abstract: Aspects of the present disclosure relate to a demand forecasting system. The demand forecasting system may include components for developing forecasting models, generating demand forecasts, and handling outputs of demand forecasting models. In some embodiments, the demand forecasting system may include a model training system and one or more components that can be used by the model training system to improve model performance.

Type: Application

Filed: April 7, 2025

Publication date: March 19, 2026

Inventors: RAJESH REDDY KUNDUR, ASHWIN KRISHNAN, BLAKE ROBINSON, SAIBAL BHATTACHARYA, ADAM MORGAN, AKSHITA RAINA, ALEXANDRE SANTORO, SUBRAMANIAN IYER, PETER KIM, SAAD AHMED, CHRIS GILLIS, MARIO ESCOBEDO, ADAM RIGGALL
Field programmable gate array (FPGA) based online 3D bin packing

Patent number: 12511590

Abstract: The disclosure generally relates to FPGA-based online 3D bin packing. Online 3D bin packing is the process of packing boxes into larger bins, known as Long Distance Containers (LDCs), such that the space inside each LDC is used to the maximum extent. The use of deep reinforcement learning (Deep RL) for this process is effective and popular. However, since the existing processor-based implementations are limited by the von Neumann architecture and take a long time to evaluate each alignment for a box, only a few potential alignments are considered, resulting in sub-optimal packing efficiency. This disclosure describes an architecture for bin packing which leverages pipelining and parallel processing on FPGA for faster and exhaustive evaluation of all alignments for each box, resulting in increased efficiency. In addition, a suitable general-purpose processor is employed to train the neural network within the algorithm to make the disclosed techniques computationally light, faster and efficient.

Type: Grant

Filed: August 25, 2023

Date of Patent: December 30, 2025

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Ashwin Krishnan, Harshad Khadilkar, Rekha Singhal, Ansuma Basumatary, Manoj Karunakaran Nambiar, Arijit Mukherjee, Kavya Borra
Pre-optimizer and optimizer based framework for optimal deployment of embedding tables across heterogeneous memory architecture

Patent number: 12393514

Abstract: High-performance deployment of DNN recommendation models (RMs) relies heavily on embedding tables, and the performance bottleneck lies in the latency of embedding access. To optimize the deployment of RMs, a method and system are disclosed which leverage heterogeneous memory types on FPGAs to improve the overall performance by maximizing the availability of frequently accessed data in faster memory. The system, using an optimizer, dynamically allocates table partitions of the embedding tables based on the history of input accesses. A pre-optimizer block determines whether smaller tables should be partitioned or placed entirely in smaller memories, improving overall efficiency. The performance of the RM is improved through a lower average embedding fetch latency and, in effect, a lower inference latency via a modified Round Trip computation.

Type: Grant

Filed: August 14, 2024

Date of Patent: August 19, 2025

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Ashwin Krishnan, Manoj Karunakaran Nambiar, Rekha Singhal
OPTIMAL DEPLOYMENT OF TRANSFORMER MODELS FOR HIGH PERFORMANCE INFERENCE ON FIELD PROGRAMMABLE GATE ARRAY (FPGA)

Publication number: 20250190757

Abstract: Existing techniques fail to deploy transformer models by optimally allocating Field Programmable Gate Array resources to each fundamental block of transformer-based models for maximum performance in terms of low latency and high throughput. This disclosure relates to a system and method which constructs a plurality of parameterized transformer model templates from input parameters comprising templates, one or more transformer-based models, one or more data types corresponding to the one or more transformer-based models, a table comprising one or more latency values and one or more resource utilization values, and a feedback mode. One or more values are assigned to each of a plurality of parameters comprised in the plurality of parameterized transformer model templates to obtain a plurality of optimal parameters. An optimal template is obtained from the final template selector for each of the plurality of parameterized transformer model templates, having maximum performance in terms of low latency and maximum throughput.

Type: Application

Filed: November 25, 2024

Publication date: June 12, 2025

Applicant: Tata Consultancy Services Limited

Inventors: Ashwin KRISHNAN, Manoj Karunakaran NAMBIAR, Madan Yelandur NANJUNDASWAMY
PRE-OPTIMIZER AND OPTIMIZER BASED FRAMEWORK FOR OPTIMAL DEPLOYMENT OF EMBEDDING TABLES ACROSS HETEROGENEOUS MEMORY ARCHITECTURE

Publication number: 20250086111

Abstract: High-performance deployment of DNN recommendation models (RMs) relies heavily on embedding tables, and the performance bottleneck lies in the latency of embedding access. To optimize the deployment of RMs, a method and system are disclosed which leverage heterogeneous memory types on FPGAs to improve the overall performance by maximizing the availability of frequently accessed data in faster memory. The system, using an optimizer, dynamically allocates table partitions of the embedding tables based on the history of input accesses. A pre-optimizer block determines whether smaller tables should be partitioned or placed entirely in smaller memories, improving overall efficiency. The performance of the RM is improved through a lower average embedding fetch latency and, in effect, a lower inference latency via a modified Round Trip computation.

Type: Application

Filed: August 14, 2024

Publication date: March 13, 2025

Applicant: Tata Consultancy Services Limited

Inventors: ASHWIN KRISHNAN, MANOJ KARUNAKARAN NAMBIAR, REKHA SINGHAL
Optimal deployment of embeddings tables across heterogeneous memory architecture for high-speed recommendations inference

Patent number: 12182029

Abstract: Works in the literature fail to leverage embedding access patterns and memory units' access/storage capabilities, which when combined can yield high-speed heterogeneous systems by dynamically re-organizing embedding table partitions across hardware during inference. A method and system for optimal deployment of embeddings tables across heterogeneous memory architecture for high-speed recommendations inference is disclosed, which dynamically partitions and organizes embedding tables across fast memory architectures to reduce access time. Partitions are chosen to take advantage of the past access patterns of those tables to ensure that frequently accessed data is available in the fast memory most of the time. Partitioning and replication are used to co-optimize memory access time and resources.

Type: Grant

Filed: August 25, 2023

Date of Patent: December 31, 2024

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Ashwin Krishnan, Manoj Karunakaran Nambiar, Chinmay Narendra Mahajan, Rekha Singhal
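The idea in the abstract of the entry above, keeping frequently accessed embedding rows in fast memory based on past access patterns, can be sketched roughly as follows. This is an illustrative simplification and not the patented algorithm: it uses a plain frequency count with a greedy fill, ignores replication and multiple memory tiers, and the function names `place_rows` and `hit_rate` are hypothetical.

```python
from collections import Counter


def place_rows(access_log, fast_capacity):
    """Split row ids into (fast, slow) sets by past access frequency."""
    counts = Counter(access_log)
    # most_common(n) returns the n most frequently seen row ids.
    fast = {row for row, _ in counts.most_common(fast_capacity)}
    slow = set(counts) - fast
    return fast, slow


def hit_rate(access_log, fast):
    """Fraction of accesses that would be served from fast memory."""
    if not access_log:
        return 0.0
    return sum(1 for row in access_log if row in fast) / len(access_log)


# Made-up access history: row ids looked up during past inference requests.
log = [3, 7, 3, 1, 3, 7, 9, 3, 7, 2]
fast, slow = place_rows(log, fast_capacity=2)
rate = hit_rate(log, fast)
```

In this toy history, rows 3 and 7 account for seven of the ten lookups, so placing just those two rows in fast memory would serve most accesses from it. Here the hit rate is measured against the same history used for placement; a real system would evaluate placement against future traffic.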
OPTIMAL DEPLOYMENT OF EMBEDDINGS TABLES ACROSS HETEROGENEOUS MEMORY ARCHITECTURE FOR HIGH-SPEED RECOMMENDATIONS INFERENCE

Publication number: 20240119008

Abstract: Works in the literature fail to leverage embedding access patterns and memory units' access/storage capabilities, which when combined can yield high-speed heterogeneous systems by dynamically re-organizing embedding table partitions across hardware during inference. A method and system for optimal deployment of embeddings tables across heterogeneous memory architecture for high-speed recommendations inference is disclosed, which dynamically partitions and organizes embedding tables across fast memory architectures to reduce access time. Partitions are chosen to take advantage of the past access patterns of those tables to ensure that frequently accessed data is available in the fast memory most of the time. Partitioning and replication are used to co-optimize memory access time and resources.

Type: Application

Filed: August 25, 2023

Publication date: April 11, 2024

Applicant: Tata Consultancy Services Limited

Inventors: Ashwin KRISHNAN, Manoj Karunakaran Nambiar, Chinmay Narendra Mahajan, Rekha Singhal
FIELD PROGRAMMABLE GATE ARRAY (FPGA) BASED ONLINE 3D BIN PACKING

Publication number: 20240112095

Abstract: The disclosure generally relates to FPGA-based online 3D bin packing. Online 3D bin packing is the process of packing boxes into larger bins, known as Long Distance Containers (LDCs), such that the space inside each LDC is used to the maximum extent. The use of deep reinforcement learning (Deep RL) for this process is effective and popular. However, since the existing processor-based implementations are limited by the von Neumann architecture and take a long time to evaluate each alignment for a box, only a few potential alignments are considered, resulting in sub-optimal packing efficiency. This disclosure describes an architecture for bin packing which leverages pipelining and parallel processing on FPGA for faster and exhaustive evaluation of all alignments for each box, resulting in increased efficiency. In addition, a suitable general-purpose processor is employed to train the neural network within the algorithm to make the disclosed techniques computationally light, faster and efficient.

Type: Application

Filed: August 25, 2023

Publication date: April 4, 2024

Applicant: Tata Consultancy Services Limited

Inventors: ASHWIN KRISHNAN, HARSHAD KHADILKAR, REKHA SINGHAL, ANSUMA BASUMATARY, MANOJ KARUNAKARAN NAMBIAR, ARIJIT MUKHERJEE, KAVYA BORRA
METHOD AND SYSTEM TO ESTIMATE PERFORMANCE OF SESSION BASED RECOMMENDATION MODEL LAYERS ON FPGA

Publication number: 20230325647

Abstract: This disclosure relates generally to a method and system to estimate the performance of session based recommendation model layers on an FPGA. Profiling is easy to perform on software based platforms such as a CPU and a GPU, which have development frameworks and tool sets, but on systems such as an FPGA, implementation risks are higher and it is important to model the performance prior to implementation. The disclosed method analyses the layers of a session based recommendation (SBR) model for performance estimation. Further, a network bandwidth is determined to process each layer of the SBR model based on dimensions. Performance of each layer of the SBR model is estimated at a predefined frequency by creating a layer profile comprising a throughput and a latency in one or more batches. Further, the method deploys an optimal layer on at least one of a heterogeneous hardware based on the estimated performance of each layer profile on the FPGA.

Type: Application

Filed: January 9, 2023

Publication date: October 12, 2023

Applicant: Tata Consultancy Services Limited

Inventors: ASHWIN KRISHNAN, MANOJ KARUNAKARAN NAMBIAR, NUPUR SUMEET
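The per-layer profile mentioned in the abstract of the entry above (a latency and a throughput per batch at a predefined clock frequency) can be illustrated with a back-of-the-envelope sketch. The formulas are an assumption rather than the patent's model: latency is taken as cycles per batch divided by clock frequency, and throughput as batch size divided by latency. The layer names and cycle counts are made up.

```python
from dataclasses import dataclass


@dataclass
class LayerProfile:
    name: str
    latency_s: float   # time to process one batch, in seconds
    throughput: float  # samples processed per second


def profile_layer(name, cycles_per_batch, batch_size, clock_hz):
    """Estimate latency and throughput of one layer at a fixed clock."""
    latency = cycles_per_batch / clock_hz
    return LayerProfile(name, latency, batch_size / latency)


# Hypothetical layers and cycle counts at an assumed 200 MHz clock.
profiles = [
    profile_layer("layer_a", 2_000, 32, 200e6),
    profile_layer("layer_b", 50_000, 32, 200e6),
]
bottleneck = max(profiles, key=lambda p: p.latency_s)
```

Comparing profiles like these before implementation is one way to spot the layer that dominates latency, which is the kind of decision the abstract says the estimate is meant to support.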
Crystalline forms and processes for the preparation of phenyl-pyrazoles useful as modulators of the 5-HT2A serotonin receptor

Patent number: 9783502

Abstract: The present invention relates to processes for preparing phenyl-pyrazoles of Formula (I) and salts and pharmaceutical compositions thereof, useful as modulators of 5-HT2A serotonin receptor activity. The present invention also relates to intermediates used in the processes, and their preparation. The present invention also relates to crystalline forms of 5-HT2A serotonin receptor modulators, compositions thereof and methods of using the same.

Type: Grant

Filed: October 23, 2015

Date of Patent: October 10, 2017

Assignee: Arena Pharmaceuticals, Inc.

Inventors: Tawfik Gharbaoui, Dipanjan Sengupta, Ashwin Krishnan, Nainesh Shah, Ryan M. Hart, Mark Macias, Edward A. Lally
CRYSTALLINE FORMS AND PROCESSES FOR THE PREPARATION OF PHENYL-PYRAZOLES USEFUL AS MODULATORS OF THE 5-HT2A SEROTONIN RECEPTOR

Publication number: 20160272591

Abstract: The present invention relates to processes for preparing phenyl-pyrazoles of Formula (I) and salts and pharmaceutical compositions thereof, useful as modulators of 5-HT2A serotonin receptor activity. The present invention also relates to intermediates used in the processes, and their preparation. The present invention also relates to crystalline forms of 5-HT2A serotonin receptor modulators, compositions thereof and methods of using the same.

Type: Application

Filed: October 23, 2015

Publication date: September 22, 2016

Inventors: Tawfik Gharbaoui, Dipanjan Sengupta, Ashwin Krishnan, Nainesh Shah, Ryan M. Hart, Mark Macias, Edward A. Lally
Processes for preparing aromatic ethers

Publication number: 20060155129

Abstract: The present invention relates to processes for preparing aromatic ether compounds that are modulators of glucose metabolism and therefore useful in the treatment of metabolic disorders such as diabetes and obesity.

Type: Application

Filed: January 9, 2006

Publication date: July 13, 2006

Inventors: Tawfik Gharbaoui, John Fritch, Ashwin Krishnan, Beverly Throop, Naomi Kato
Water soluble fluorinated fatty acid sulfonate derivatives useful as magnetic resonance imaging agents

Patent number: 5660815

Abstract: The present invention relates to improved imaging agents useful for ¹⁹F magnetic resonance imaging (MRI). More particularly, the present invention relates to fluorinated fatty acid sulfonate derivatives, such as those derived by reaction of fluorinated fatty acids with sulfonated compounds such as taurine analogs or isethionic acid analogs, which offer improved water solubility and biocompatibility.

Type: Grant

Filed: April 28, 1995

Date of Patent: August 26, 1997

Assignee: Molecular Biosystems, Inc.

Inventors: Rolf Lohrmann, Ashwin Krishnan
Perfluoro-1H,-1H-neopentyl containing contrast agents and method to use same

Patent number: 5401493

Abstract: Organic compounds for diagnostic imaging which contain at least one aryl group which has been derivatized to contain at least one perfluoro-1H,1H-neopentyl moiety are provided. The perfluoro-1H,1H-neopentyl groups produce a single magnetic resonance to ensure a maximum signal to noise ratio. One compound disclosed is 2-O-oleoylglycerol 1,3-bis(7'-{3",5"-di[2"',2"'-di(trifluoromethyl)3"', 3"',3"'-trifluoropropyl]phenyl}heptanoate). In the preferred embodiment, a lipid emulsion is provided as a carrier vehicle to deliver the derivatized analog to a mammalian recipient. Methods to use these compounds in MRI and computerized tomography are provided.

Type: Grant

Filed: March 26, 1993

Date of Patent: March 28, 1995

Assignee: Molecular Biosystems, Inc.

Inventors: Rolf Lohrmann, Ashwin Krishnan