Patents by Inventor Kshitiz Malik
Kshitiz Malik has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12592017Abstract: In one embodiment, a method includes receiving a voice input having acoustic features from a first client system associated with a first user, determining emotions associated with the voice input based on one or more of the acoustic features by machine-learning models, determining facial features for a first extended-reality (XR) avatar representing the first user based on the emotions, and sending instructions for rendering the first XR avatar representing the first user to a second client system associated with a second user, wherein the first XR avatar is rendered with the determined facial features.Type: GrantFiled: July 17, 2023Date of Patent: March 31, 2026Assignee: Meta Platforms, Inc.Inventors: Abhay Suresh Harpale, Meryem Pinar Donmez Ediz, Kshitiz Malik, Omer Muzaffar, Mridul Gupta, Vijay Manikandan Janakiraman
-
Patent number: 12475170Abstract: In one embodiment, a method includes receiving a first input by a user from a client system associated with the user, wherein the first input is in a voice modality, analyzing the first input to generate one or more candidate hypotheses, determining one or more modalities for presenting output generated by the one or more computing systems to the user at the client system, and sending instructions to the client system for presenting one or more suggested auto-completions corresponding to one or more of the candidate hypotheses, respectively, wherein each suggested auto-completion comprises the corresponding candidate hypothesis, and wherein the one or more suggested auto-completions are presented in the one or more determined modalities.Type: GrantFiled: December 11, 2020Date of Patent: November 18, 2025Assignee: Meta Platforms, Inc.Inventors: Fadi Botros, Nanshu Wang, Fan Wang, Meryem Pinar Donmez Ediz, Omer Muzaffar, Kshitiz Malik, Vikas Seshagiri Rao Bhardwaj, Anuj Kumar, Shreyan Bakshi
-
Publication number: 20250148811Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.Type: ApplicationFiled: October 21, 2024Publication date: May 8, 2025Inventors: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Ruchir Srivastava, Kshitiz Malik
-
Publication number: 20250029305Abstract: In one embodiment, a method includes receiving a voice input having acoustic features from a first client system associated with a first user, determining emotions associated with the voice input based on one or more of the acoustic features by machine-learning models, determining facial features for a first extended-reality (XR) avatar representing the first user based on the emotions, and sending instructions for rendering the first XR avatar representing the first user to a second client system associated with a second user, wherein the first XR avatar is rendered with the determined facial features.Type: ApplicationFiled: July 17, 2023Publication date: January 23, 2025Inventors: Abhay Suresh Harpale, Meryem Pinar Donmez Ediz, Kshitiz Malik, Omer Muzaffar, Mridul Gupta, Vijay Manikandan Janakiraman
-
Patent number: 12125297Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.Type: GrantFiled: August 4, 2021Date of Patent: October 22, 2024Assignee: Meta Platforms, Inc.Inventors: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Kshitiz Malik, Ruchir Srivastava
-
Publication number: 20240112008Abstract: In one embodiment, a method includes receiving, by a first client system, from one or more remote servers, a current version of a neural network model including multiple model parameters, training the neural network model on multiple examples retrieved from a local data store to generate multiple updated model parameters, wherein each of the examples includes one or more features and one or more labels, calculating a user valuation associated with the first client system, wherein the user valuation represents a measure of utility of training the neural network model on the multiple examples, and sending, to one or more of the remote servers, the trained neural network model and the user valuation, wherein the user valuation is associated with a likelihood of the first client system being selected for a subsequent training of the neural network model.Type: ApplicationFiled: March 11, 2020Publication date: April 4, 2024Inventors: Kshitiz Malik, Seungwhan Moon, Honglei Liu, Anuj Kumar, Hongyuan Zhan, Ahmed Aly
-
Publication number: 20230245654Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.Type: ApplicationFiled: January 20, 2023Publication date: August 3, 2023Inventors: Akshat Shrivastava, Shrey Desai, Anchit Gupta, Ali Elkahky, Aleksandr Livshits, Alexander Kolmykov-Zotov, Ahmed Aly, Jinsong Yu, Manali Anand Naik, Shuhui Yang, Baiyang Liu, Surya Teja Appini, Tarun Vir Singh, Hang Su, Jiedan Zhu, Fuchun Peng, Shoubhik Bhattacharya, Kshitiz Malik, Shreyan Bakshi, Akash Bharadwaj, Harish Srinivas, Xiao Yang, Zhuangqun Huang, Gil Keren, Duc Hoang Le, Ahmed Kamal Atwa Mohamed, Zhe Liu, Pranab Mohanty
-
Publication number: 20220374645Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.Type: ApplicationFiled: August 4, 2021Publication date: November 24, 2022Inventors: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Kshitiz Malik, Ruchir Srivastava
-
Patent number: 11501081Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.Type: GrantFiled: December 31, 2019Date of Patent: November 15, 2022Assignee: Meta Platforms, Inc.Inventors: Prince Gill, Honglei Liu, Wenhai Yang, Kshitiz Malik, Nanshu Wang, David Reiss
-
Patent number: 11455555Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.Type: GrantFiled: December 31, 2019Date of Patent: September 27, 2022Assignee: META PLATFORMS, INC.Inventors: Prince Gill, Honglei Liu, Wenhai Yang, Kshitiz Malik, Nanshu Wang, David Reiss
-
Publication number: 20220188361Abstract: In one embodiment, a method includes receiving a first input by a user from a client system associated with the user, wherein the first input is in a voice modality, analyzing the first input to generate one or more candidate hypotheses, determining one or more modalities for presenting output generated by the one or more computing systems to the user at the client system, and sending instructions to the client system for presenting one or more suggested auto-completions corresponding to one or more of the candidate hypotheses, respectively, wherein each suggested auto-completion comprises the corresponding candidate hypothesis, and wherein the one or more suggested auto-completions are presented in the one or more determined modalities.Type: ApplicationFiled: December 11, 2020Publication date: June 16, 2022Inventors: Fadi Botros, Nanshu Wang, Fan Wang, Meryem Pinar Donmez Ediz, Omer Muzaffar, Kshitiz Malik, Vikas Seshagiri Rao Bhardwaj, Anuj Kumar, Shreyan Bakshi
-
Patent number: 11227122Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.Type: GrantFiled: December 31, 2019Date of Patent: January 18, 2022Assignee: FACEBOOK, INC.Inventors: Prince Gill, Honglei Liu, Wenhai Yang, Kshitiz Malik, Nanshu Wang, David Reiss
-
Publication number: 20210117780Abstract: In one embodiment, a method includes receiving, by a first client system, from one or more remote servers, a current version of a global neural network model including multiple federated model parameters, accessing, from a local data store, multiple examples and a local personalization model including multiple local model parameters, wherein each of the examples includes one or more features and one or more labels, training the global neural network model and the local personalization model together on the examples to generate multiple updated federated model parameters and multiple updated local model parameters, storing, in the local data store, the trained local personalization model including the updated local model parameters, and sending, to one or more of the remote servers, the trained global neural network model including the updated federated model parameters.Type: ApplicationFiled: March 11, 2020Publication date: April 22, 2021Inventors: Kshitiz Malik, Seungwhan Moon, Honglei Liu, Anuj Kumar, Hongyuan Zhan, Ahmed Aly