Patents by Inventor Ashwin Krishnan
Ashwin Krishnan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260099706Abstract: Existing model deployment approaches have the disadvantage that they do not consider feasibility of cloud instances for hosting a given LLM model. Embodiments disclosed herein provide a method and system for deployment of LLMs in a plurality of cloud instances. The system checks feasibility of the plurality of cloud instances for hosting an LLM, based on size of the LLM and storage space in each of the cloud instances. Further, a latency value for a plurality of batch sizes is determined for a plurality of LLM-accelerator pairs, in each of the plurality of cloud instances identified as feasible based on the feasibility check, using a performance model. Furthermore, a recommendation of one of the plurality of cloud instances identified as feasible is generated, based on the determined latency, a measured cost of deployment, a user workload, an application type, a plurality of latency constraints, and an evaluated performance.Type: ApplicationFiled: September 15, 2025Publication date: April 9, 2026Applicant: Tata Consultancy Services LimitedInventors: Ashwin KRISHNAN, Venkatesh PASUMARTI, Samarth Sudarshan INAMDAR, Arghyajoy MONDAL, Manoj Karunakaran NAMBIAR, Rekha SINGHAL
-
Publication number: 20260080422Abstract: Aspects of the present disclosure relate to a demand forecasting system. The demand forecasting system may include components for developing forecasting models, generating demand forecasts, and handling outputs of demand forecasting models. In some embodiments, the demand forecasting system may include a model training system and one or more components that can be used by the model training system to improve model performance.Type: ApplicationFiled: April 7, 2025Publication date: March 19, 2026Inventors: RAJESH REDDY KUNDUR, ASHWIN KRISHNAN, BLAKE ROBINSON, SAIBAL BHATTACHARYA, ADAM MORGAN, AKSHITA RAINA, ALEXANDRE SANTORO, SUBRAMANIAN IYER, PETER KIM, SAAD AHMED, CHRIS GILLIS, MARIO ESCOBEDO, ADAM RIGGALL
-
Patent number: 12511590Abstract: The disclosure generally relates to an FPGA-based online 3D bin packing. Online 3D bin packing is the process of packing boxes into larger bins-Long Distance Containers (LDCs) such that the space inside each LDC is used to the maximum extent. The use of deep reinforcement learning (Deep RL) for this process is effective and popular. However, since the existing processor-based implementations are limited by Von-Neumann architecture and take a long time to evaluate each alignment for a box, only a few potential alignments are considered, resulting in sub-optimal packing efficiency. This disclosure describes an architecture for bin packing which leverages pipelining and parallel processing on FPGA for faster and exhaustive evaluation of all alignments for each box resulting in increased efficiency. In addition, a suitable generic purpose processor is employed to train the neural network within the algorithm to make the disclosed techniques computationally light, faster and efficient.Type: GrantFiled: August 25, 2023Date of Patent: December 30, 2025Assignee: TATA CONSULTANCY SERVICES LIMITEDInventors: Ashwin Krishnan, Harshad Khadilkar, Rekha Singhal, Ansuma Basumatary, Manoj Karunakaran Nambiar, Arijit Mukherjee, Kavya Borra
-
Patent number: 12393514Abstract: High-performance deployment of DNN recommendation models heavily rely on embedding tables, and their performance bottleneck lies in the latency of embedding access. To optimize the deployment of RMs, the method and system is disclosed, which leverages heterogeneous memory types on FPGAs to improve the overall performance by maximizing the availability of frequently accessed data in faster memory. The system, using a optimizer dynamically allocates table partitions of the embedding tables based on history of input access history. A pre-optimizer block disclosed determines whether smaller tables should be partitioned or placed entirely in smaller memories, improving overall efficiency. The performance of RM is improved with improvement in average embedding fetch latency and effectively inference latency via modified Round Trip computation.Type: GrantFiled: August 14, 2024Date of Patent: August 19, 2025Assignee: TATA CONSULTANCY SERVICES LIMITEDInventors: Ashwin Krishnan, Manoj Karunakaran Nambiar, Rekha Singhal
-
Publication number: 20250190757Abstract: Existing techniques fail to deploy transformer models by optimally allocating Field Programmable Gate Array resources to each fundamental block of transformer-based models for maximum performance in terms of low latency and high throughput. This disclosure relates to a system and method which constructs a plurality of parameterized transformer model templates from input parameters comprising templates, one or more transformer-based models, one or more data types corresponding to one or more transformer-based models, table comprising one or more latency values and one or more resource utilization values and feedback mode. One or more values are assigned to each plurality of parameters comprised in a plurality of parameterized transformer model templates to obtain plurality of optimal parameters. An optimal template is obtained from the final template selector for each of the plurality of parameterized transformer model templates, having maximum performance in terms of low latency and maximum throughput.Type: ApplicationFiled: November 25, 2024Publication date: June 12, 2025Applicant: Tata Consultancy Services LimitedInventors: Ashwin KRISHNAN, Manoj Karunakaran NAMBIAR, Madan Yelandur NANJUNDASWAMY
-
Publication number: 20250086111Abstract: High-performance deployment of DNN recommendation models heavily rely on embedding tables, and their performance bottleneck lies in the latency of embedding access. To optimize the deployment of RMs, the method and system is disclosed, which leverages heterogeneous memory types on FPGAs to improve the overall performance by maximizing the availability of frequently accessed data in faster memory. The system, using a optimizer dynamically allocates table partitions of the embedding tables based on history of input access history. A pre-optimizer block disclosed determines whether smaller tables should be partitioned or placed entirely in smaller memories, improving overall efficiency. The performance of RM is improved with improvement in average embedding fetch latency and effectively inference latency via modified Round Trip computation.Type: ApplicationFiled: August 14, 2024Publication date: March 13, 2025Applicant: Tata Consultancy Services LimitedInventors: ASHWIN KRISHNAN, MANOJ KARUNAKARAN NAMBIAR, REKHA SINGHAL
-
Patent number: 12182029Abstract: Works in the literature fail to leverage embedding access patterns and memory units' access/storage capabilities, which when combined can yield high-speed heterogeneous systems by dynamically re-organizing embedding tables partitions across hardware during inference. A method and system for optimal deployment of embeddings tables across heterogeneous memory architecture for high-speed recommendations inference is disclosed, which dynamically partitions and organizes embedding tables across fast memory architectures to reduce access time. Partitions are chosen to take advantage of the past access patterns of those tables to ensure that frequently accessed data is available in the fast memory most of the time. Partition and replication is used to co-optimize memory access time and resources.Type: GrantFiled: August 25, 2023Date of Patent: December 31, 2024Assignee: TATA CONSULTANCY SERVICES LIMITEDInventors: Ashwin Krishnan, Manoj Karunakaran Nambiar, Chinmay Narendra Mahajan, Rekha Singhal
-
Publication number: 20240119008Abstract: Works in the literature fail to leverage embedding access patterns and memory units' access/storage capabilities, which when combined can yield high-speed heterogeneous systems by dynamically re-organizing embedding tables partitions across hardware during inference. A method and system for optimal deployment of embeddings tables across heterogeneous memory architecture for high-speed recommendations inference is disclosed, which dynamically partitions and organizes embedding tables across fast memory architectures to reduce access time. Partitions are chosen to take advantage of the past access patterns of those tables to ensure that frequently accessed data is available in the fast memory most of the time. Partition and replication is used to co-optimize memory access time and resources.Type: ApplicationFiled: August 25, 2023Publication date: April 11, 2024Applicant: Tata Consultancy Services LimitedInventors: Ashwin KRISHNAN, Manoj Karunakaran Nambiar, Chinmay Narendra Mahajan, Rekha Singhal
-
Publication number: 20240112095Abstract: The disclosure generally relates to an FPGA-based online 3D bin packing. Online 3D bin packing is the process of packing boxes into larger bins-Long Distance Containers (LDCs) such that the space inside each LDC is used to the maximum extent. The use of deep reinforcement learning (Deep RL) for this process is effective and popular. However, since the existing processor-based implementations are limited by Von-Neumann architecture and take a long time to evaluate each alignment for a box, only a few potential alignments are considered, resulting in sub-optimal packing efficiency. This disclosure describes an architecture for bin packing which leverages pipelining and parallel processing on FPGA for faster and exhaustive evaluation of all alignments for each box resulting in increased efficiency. In addition, a suitable generic purpose processor is employed to train the neural network within the algorithm to make the disclosed techniques computationally light, faster and efficient.Type: ApplicationFiled: August 25, 2023Publication date: April 4, 2024Applicant: Tata Consultancy Services LimitedInventors: ASHWIN KRISHNAN, HARSHAD KHADILKAR, REKHA SINGHAL, ANSUMA BASUMATARY, MANOJ KARUNAKARAN NAMBIAR, ARIJIT MUKHERJEE, KAVYA BORRA
-
Publication number: 20230325647Abstract: This disclosure relates generally to method and system to estimate performance of session based recommendation model layers on FPGA. Profiling is easy to perform on software based platforms such as a CPU and a GPU which have development frameworks and tool sets but on systems such as a FPGA, implementation risks are higher and important to model the performance prior to implementation. The disclosed method analyses a session based recommendation (SBR) model layers for performance estimation. Further, a network bandwidth is determined to process each layer of the SBR model based on dimensions. Performance of each layer of the SBR model is estimated at a predefined frequency by creating a layer profile comprising a throughput and a latency in one or more batches. Further, the method deploys an optimal layer on at least one of a heterogeneous hardware based on the estimated performance of each layer profile on the FPGA.Type: ApplicationFiled: January 9, 2023Publication date: October 12, 2023Applicant: Tata Consultancy Services LimitedInventors: ASHWIN KRISHNAN, MANOJ KARUNAKARAN NAMBIAR, NUPUR SUMEET
-
Patent number: 9783502Abstract: The present invention relates to processes for preparing phenyl-pyrazoles of Formula (I) and salts and pharmaceutical compositions thereof, useful as modulators of 5-HT2A serotonin receptor activity. The present invention also relates to intermediates used in the processes, and their preparation. The present invention also relates to crystalline forms of 5-HT2A serotonin receptor modulators, compositions thereof and methods of using the same.Type: GrantFiled: October 23, 2015Date of Patent: October 10, 2017Assignee: Arena Pharmaceuticals, Inc.Inventors: Tawfik Gharbaoui, Dipanjan Sengupta, Ashwin Krishnan, Nainesh Shah, Ryan M. Hart, Mark Macias, Edward A. Lally
-
Publication number: 20160272591Abstract: The present invention relates to processes for preparing phenyl-pyrazoles of Formula (I) and salts and pharmaceutical compositions thereof, useful as modulators of 5-HT2A serotonin receptor activity. The present invention also relates to intermediates used in the processes, and their preparation. The present invention also relates to crystalline forms of 5-HT2A serotonin receptor modulators, compositions thereof and methods of using the same.Type: ApplicationFiled: October 23, 2015Publication date: September 22, 2016Inventors: Tawfik Gharbaoui, Dipanjan Sengupta, Ashwin Krishnan, Nainesh Shah, Ryan M. Hart, Mark Macias, Edward A. Lally
-
Publication number: 20060155129Abstract: The present invention relates to processes for preparing aromatic ether compounds that are modulators of glucose metabolism and therefore useful in the treatment of metabolic disorders such as diabetes and obesity.Type: ApplicationFiled: January 9, 2006Publication date: July 13, 2006Inventors: Tawfik Gharbaoui, John Fritch, Ashwin Krishnan, Beverly Throop, Naomi Kato
-
Patent number: 5660815Abstract: The present invention relates to improved imaging agents useful for .sup.19 F magnetic resonance imaging (MRI). More particularly, the present invention relates to fluorinated fatty acid sulfonate derivatives, such as those derived by reaction of fluorinated fatty acids with sulfonated compounds such as taurine analogs or isethionic acid analogs, which offer improved water solubility and biocompatibility.Type: GrantFiled: April 28, 1995Date of Patent: August 26, 1997Assignee: Molecular Biosystems, Inc.Inventors: Rolf Lohrmann, Ashwin Krishnan
-
Patent number: 5401493Abstract: Organic compounds for diagnostic imaging which contain at least one aryl group which has been derivatized to contain at least one perfluoro-1H,1H-neopentyl moiety are provided. The perfluoro-1H,1H-neopentyl groups produce a single magnetic resonance to insure a maximum signal to noise ratio. One compound disclosed is 2-O-oleoylglycerol 1,3-bis(7'-{3",5"-di[2"',2"'-di(trifluoromethyl)3"', 3"',3"'-trifluoropropyl]phenyl}heptanoate). In the preferred embodiment, a lipid emulsion is provided as a carrier vehicle to deliver the derivitized analog to a mammalian recipient. Methods to use these compounds in MRI and computerized tomography are provided.Type: GrantFiled: March 26, 1993Date of Patent: March 28, 1995Assignee: Molecular Biosystems, Inc.Inventors: Rolf Lohrmann, Ashwin Krishnan