Patents by Inventor Kunal Mukerjee
Kunal Mukerjee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11960864Abstract: The disclosure herein describes generating an application from input content. Input content of a content type is obtained, such as an image file, digital document file, or the like. A content data extractor is selected from a set of content data extractors based on the content type. A set of content entities is extracted from the obtained input content using the selected content data extractor, such as text labels, text boxes, buttons, or the like. The set of content entities are normalized according to a standard interface schema and an application template is generated using the normalized set of content entities, whereby an application can be developed using the generated application template. The disclosure enables application interfaces to be designed using a variety of methods and for those different types of designs to be efficiently converted to a functional application.Type: GrantFiled: September 27, 2021Date of Patent: April 16, 2024Assignee: Microsoft Technology Licensing, LLC.Inventors: PalaniRaj Kaliyaperumal, Lily Xinchen Liu, Kuralmani Elango, Kunal Mukerjee
-
Publication number: 20230119613Abstract: Examples described herein generate training data for machine learning (ML) for natural language (NL) processing (such as semantic parsing for translating NL). A formula tree is generated based on sampling both a formula grammar and NL templates. Using the formula tree, an ML training data instance pair is generated comprising a formula example and an NL example. A context example may also be used during instantiation of the formula tree. An ML model is trained with training data including the ML training data instance pair, and ML output is generated from NL input. The ML output includes, for example, a machine-interpretable formula, a database querying language command, or a general programming language instruction. Some examples support context-free grammar, probabilistic context-free grammar, and/or non-context-free production rules.Type: ApplicationFiled: October 19, 2021Publication date: April 20, 2023Inventors: Zeqi LIN, Yu HU, Haiyuan CAO, Yi LIU, Jian-Guang LOU, Kuralmani ELANGO, PalaniRaj KALIYAPERUMAL, Weizhu CHEN, Kunal MUKERJEE
-
Publication number: 20230095089Abstract: The disclosure herein describes generating an application from input content. Input content of a content type is obtained, such as an image file, digital document file, or the like. A content data extractor is selected from a set of content data extractors based on the content type. A set of content entities is extracted from the obtained input content using the selected content data extractor, such as text labels, text boxes, buttons, or the like. The set of content entities are normalized according to a standard interface schema and an application template is generated using the normalized set of content entities, whereby an application can be developed using the generated application template. The disclosure enables application interfaces to be designed using a variety of methods and for those different types of designs to be efficiently converted to a functional application.Type: ApplicationFiled: September 27, 2021Publication date: March 30, 2023Inventors: PalaniRaj KALIYAPERUMAL, Lily Xinchen LIU, Kuralmani ELANGO, Kunal MUKERJEE
-
Patent number: 10635681Abstract: The present invention extends to methods, systems, and computer program products for forming data responsive to a query. Responding to a query can include selecting data views on top of data sources. A data catalog stores data elements from different data sources and stores links between data elements from the different data sources. A view can be selected for a query based on one or more of: classification of data included in the data catalog or prior usage of data included in the data catalog. A selected data view is populated with data elements from the data catalog using the links to determine that data elements spanning multiple data sources satisfy the query. The data catalog can be used to identify relationships between data elements from different data sources that would not otherwise be identified.Type: GrantFiled: May 25, 2017Date of Patent: April 28, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Christian Liensberger, Ziv Kasperski, Stéphane Nyombayire, Venkatesh Krishnan, Santosh Balasubramanian, Kunal Mukerjee, Shankar L. Regunathan
-
Patent number: 10140366Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.Type: GrantFiled: March 16, 2015Date of Patent: November 27, 2018Assignee: Microsoft Technology Licensing, LLCInventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
-
Publication number: 20170262510Abstract: The present invention extends to methods, systems, and computer program products for forming data responsive to a query. Responding to a query can include selecting data views on top of data sources. A data catalog stores data elements from different data sources and stores links between data elements from the different data sources. A view can be selected for a query based on one or more of: classification of data included in the data catalog or prior usage of data included in the data catalog. A selected data view is populated with data elements from the data catalog using the links to determine that data elements spanning multiple data sources satisfy the query. The data catalog can be used to identify relationships between data elements from different data sources that would not otherwise be identified.Type: ApplicationFiled: May 25, 2017Publication date: September 14, 2017Inventors: Christian Liensberger, Ziv Kasperski, Stéphane Nyombayire, Venkatesh Krishnan, Santosh Balasubramanian, Kunal Mukerjee, Shankar L. Regunathan
-
Patent number: 9672256Abstract: The present invention extends to methods, systems, and computer program products for creating data views. Embodiments of the invention allow automatic creation of data views/projections on top of data sources. A view can be created on one data source (by looking at one or multiple data elements within the data source) or by looking at multiple data sources (of the same or different data types) and combining elements into a view. Data sources can be any kind of data source, including but not limited to: databases, files, and Web services. Views can be picked from a set of predefined views, picked from views of another tenant (either statically or using fuzzy logic), selected based on data element classifications and/or data element relationship classifications, selected based on data usage (e.g., by looking at logs), etc.Type: GrantFiled: February 4, 2014Date of Patent: June 6, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Christian Liensberger, Ziv Kasperski, Stéphane Nyombayire, Venkatesh Krishnan, Santosh Balasubramanian, Kunal Mukerjee, Shankar L. Regunathan
-
Patent number: 9148668Abstract: Techniques and tools for encoding and decoding motion vector information for video images are described. For example, a video encoder yields an extended motion vector code by jointly coding, for a set of pixels, a switch code, motion vector information, and a terminal symbol indicating whether subsequent data is encoded for the set of pixels. In another aspect, an encoder/decoder selects motion vector predictors for macroblocks. In another aspect, a video encoder/decoder uses hybrid motion vector prediction. In another aspect, a video encoder/decoder signals a motion vector mode for a predicted image. In another aspect, a video decoder decodes a set of pixels by receiving an extended motion vector code, which reflects joint encoding of motion information together with intra/inter-coding information and a terminal symbol. The decoder determines whether subsequent data exists for the set of pixels based on e.g., the terminal symbol.Type: GrantFiled: February 13, 2014Date of Patent: September 29, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Sridhar Srinivasan, Pohsiang Hsu, Thomas W. Holcomb, Kunal Mukerjee, Bruce Chih-Lung Lin
-
Publication number: 20150220598Abstract: The present invention extends to methods, systems, and computer program products for creating data views. Embodiments of the invention allow automatic creation of data views/projections on top of data sources. A view can be created on one data source (by looking at one or multiple data elements within the data source) or by looking at multiple data sources (of the same or different data types) and combining elements into a view. Data sources can be any kind of data source, including but not limited to: databases, files, and Web services. Views can be picked from a set of predefined views, picked from views of another tenant (either statically or using fuzzy logic), selected based on data element classifications and/or data element relationship classifications, selected based on data usage (e.g., by looking at logs), etc.Type: ApplicationFiled: February 4, 2014Publication date: August 6, 2015Applicant: Microsoft CorporationInventors: Christian Liensberger, Ziv Kasperski, Stéphane Nyombayire, Venkatesh Krishnan, Santosh Balasubramanian, Kunal Mukerjee, Shankar L. Regunathan
-
Publication number: 20150193533Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.Type: ApplicationFiled: March 16, 2015Publication date: July 9, 2015Applicant: Microsoft Technology Licensing, LLCInventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
-
Patent number: 8983954Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.Type: GrantFiled: April 10, 2012Date of Patent: March 17, 2015Assignee: Microsoft Technology Licensing, LLCInventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
-
Patent number: 8917768Abstract: Techniques and tools for encoding and decoding motion vector information for video images are described. For example, a video encoder yields an extended motion vector code by jointly coding, for a set of pixels, a switch code, motion vector information, and a terminal symbol indicating whether subsequent data is encoded for the set of pixels. In another aspect, an encoder/decoder selects motion vector predictors for macroblocks. In another aspect, a video encoder/decoder uses hybrid motion vector prediction. In another aspect, a video encoder/decoder signals a motion vector mode for a predicted image. In another aspect, a video decoder decodes a set of pixels by receiving an extended motion vector code, which reflects joint encoding of motion information together with intra/inter-coding information and a terminal symbol. The decoder determines whether subsequent data exists for the set of pixels based on e.g., the terminal symbol.Type: GrantFiled: November 21, 2008Date of Patent: December 23, 2014Assignee: Microsoft CorporationInventors: Sridhar Srinivasan, Pohsiang Hsu, Thomas W. Holcomb, Kunal Mukerjee, Bruce Chih-Lung Lin
-
Publication number: 20140317103Abstract: A method includes receiving a concept hierarchy at a computing device. The concept hierarchy identifies concepts associated with a document corpus. An interface based on the concept hierarchy is generated. The interface is operable to navigate and search the concept hierarchy.Type: ApplicationFiled: July 3, 2014Publication date: October 23, 2014Inventors: Kunal Mukerjee, Naveen Garg
-
Patent number: 8793242Abstract: Methods, systems, and computer program products are provided for deriving and updating document similarity indices for a plurality of documents. The number of maintained similarities can be controlled to conserve CPU and storage resources.Type: GrantFiled: June 19, 2013Date of Patent: July 29, 2014Assignee: Microsoft CorporationInventors: Sorin Gherman, Kunal Mukerjee, Adam Prout
-
Patent number: 8775426Abstract: A method includes receiving a concept hierarchy at a computing device. The concept hierarchy identifies concepts associated with a document corpus. An interface based on the concept hierarchy is generated. The interface is operable to navigate, search, and modify the concept hierarchy. The method includes transmitting the interface for display to a display device.Type: GrantFiled: September 14, 2010Date of Patent: July 8, 2014Assignee: Microsoft CorporationInventors: Kunal Mukerjee, Naveen Garg
-
Publication number: 20140161191Abstract: Techniques and tools for encoding and decoding motion vector information for video images are described. For example, a video encoder yields an extended motion vector code by jointly coding, for a set of pixels, a switch code, motion vector information, and a terminal symbol indicating whether subsequent data is encoded for the set of pixels. In another aspect, an encoder/decoder selects motion vector predictors for macroblocks. In another aspect, a video encoder/decoder uses hybrid motion vector prediction. In another aspect, a video encoder/decoder signals a motion vector mode for a predicted image. In another aspect, a video decoder decodes a set of pixels by receiving an extended motion vector code, which reflects joint encoding of motion information together with intra/inter-coding information and a terminal symbol. The decoder determines whether subsequent data exists for the set of pixels based on e.g., the terminal symbol.Type: ApplicationFiled: February 13, 2014Publication date: June 12, 2014Applicant: MICROSOFT CORPORATIONInventors: Sridhar Srinivasan, Pohsiang Hsu, Thomas W. Holcomb, Kunal Mukerjee, Bruce Chih-Lung Lin
-
Patent number: 8687697Abstract: Techniques and tools for encoding and decoding motion vector information for video images are described. For example, a video encoder yields an extended motion vector code by jointly coding, for a set of pixels, a switch code, motion vector information, and a terminal symbol indicating whether subsequent data is encoded for the set of pixels. In another aspect, an encoder/decoder selects motion vector predictors for macroblocks. In another aspect, a video encoder/decoder uses hybrid motion vector prediction. In another aspect, a video encoder/decoder signals a motion vector mode for a predicted image. In another aspect, a video decoder decodes a set of pixels by receiving an extended motion vector code, which reflects joint encoding of motion information together with intra/inter-coding information and a terminal symbol. The decoder determines whether subsequent data exists for the set of pixels based on e.g., the terminal symbol.Type: GrantFiled: April 24, 2012Date of Patent: April 1, 2014Assignee: Microsoft CorporationInventors: Sridhar Srinivasan, Pohsiang Hsu, Thomas W. Holcomb, Kunal Mukerjee, Bruce Chih-Lung Lin
-
Publication number: 20130282730Abstract: Methods, systems, and computer program products are provided for deriving and updating document similarity indices for a plurality of documents. The number of maintained similarities can be controlled to conserve CPU and storage resources.Type: ApplicationFiled: June 19, 2013Publication date: October 24, 2013Inventors: Sorin Gherman, Kunal Mukerjee, Adam Prout
-
Publication number: 20130268531Abstract: In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.Type: ApplicationFiled: April 10, 2012Publication date: October 10, 2013Applicant: Microsoft CorporationInventors: John C. Platt, Surajit Chaudhuri, Lev Novik, Henricus Johannes Maria Meijer, Efim Hudis, Kunal Mukerjee, Christopher Alan Hays
-
Publication number: 20130246386Abstract: Systems are used for identifying key phrases within documents. These systems utilize a tags and a tag index to determine what a document primarily relates to. For example, an integrated data flow and extract-transform-load pipeline, crawls, parses and word breaks large corpuses of documents in database tables. Documents can be broken into tuples. The tuples can be sent to a heuristically based algorithm that uses statistical language models and weight plus cross-entropy threshold functions to summarize the document into its “top N” most statistically significant phrases. These systems can scale efficiently (e.g., linearly) and (potentially large numbers of) documents can be characterized by salient and relevant key phrases (tags).Type: ApplicationFiled: March 11, 2013Publication date: September 19, 2013Applicant: MICROSOFT CORPORATIONInventors: Sorin Gherman, Kunal Mukerjee