Patents by Inventor Vu Minh Le

Vu Minh Le has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10691655
    Abstract: Various technologies pertaining to extracting data encoded in a tree-structured document and generating a table based upon the extracted data are described herein. In a first embodiment, the table is generated without requiring input from a data cleaner. In a second embodiment, the table is generated based upon examples set forth by a data cleaner.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: June 23, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Vu Minh Le, Sumit Gulwani, Ranvijay Kumar, Chiu Ying Cheung
  • Publication number: 20190384851
    Abstract: Systems and techniques for structured query language (SQL) query formatting by examples are described herein. In an example, a SQL query formatter is adapted to receive a SQL statement, such that the SQL statement is formed using SQL tokens. The SQL query formatter may be further adapted to use a SQL formatter model to format the SQL statement, which results in a formatted SQL statement. The SQL formatter model may be trained using at least one previously executed SQL statement. The SQL formatter model may include format definitions for SQL tokens. The SQL query formatter may be further adapted to output the formatted SQL statement.
    Type: Application
    Filed: June 13, 2018
    Publication date: December 19, 2019
    Inventors: Vu Minh Le, Gustavo Araujo Soares, Sumit Gulwani
  • Publication number: 20190347267
    Abstract: Described herein are various technologies pertaining to identifying boundaries of a substring in a log file, wherein the substring is to be extracted from the log file and used to construct a table. An indication is received that a user has selected a beginning boundary of the substring, and the ending boundary of the substring is automatically identified.
    Type: Application
    Filed: July 24, 2019
    Publication date: November 14, 2019
    Inventors: Vu Minh Le, Sumit Gulwani
  • Patent number: 10387441
    Abstract: Described herein are various technologies pertaining to identifying boundaries of a substring in a log file, wherein the substring is to be extracted from the log file and used to construct a table. An indication is received that a user has selected a beginning boundary of the substring, and the ending boundary of the substring is automatically identified.
    Type: Grant
    Filed: November 30, 2016
    Date of Patent: August 20, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Vu Minh Le, Sumit Gulwani
  • Publication number: 20180150500
    Abstract: Described herein are various technologies pertaining to identifying boundaries of a substring in a log file, wherein the substring is to be extracted from the log file and used to construct a table. An indication is received that a user has selected a beginning boundary of the substring, and the ending boundary of the substring is automatically identified.
    Type: Application
    Filed: November 30, 2016
    Publication date: May 31, 2018
    Inventors: Vu Minh Le, Sumit Gulwani
  • Publication number: 20180150554
    Abstract: Described herein are various technologies pertaining to processing a log file, such that the processed log file is well-suited for analysis by a programming by example (PBE) system. When processing the log file, header lines and comment lines in the log file can be automatically identified and filtered from the log file, such that the PBE system does not contemplate the header lines and comment lines. Further, when processing the log file, boundaries of records in the log file can be identified without requiring user input. The PBE system constructs a table based upon the identified boundaries of the records in the log file, wherein the table includes rows that correspond to the records, and further wherein the table fails to include rows that correspond to header lines and comment lines.
    Type: Application
    Filed: November 30, 2016
    Publication date: May 31, 2018
    Inventors: Vu Minh Le, Sumit Gulwani
  • Publication number: 20180113887
    Abstract: Various technologies pertaining to extracting data encoded in a tree-structured document and generating a table based upon the extracted data are described herein. In a first embodiment, the table is generated without requiring input from a data cleaner. In a second embodiment, the table is generated based upon examples set forth by a data cleaner.
    Type: Application
    Filed: October 20, 2016
    Publication date: April 26, 2018
    Inventors: Vu Minh Le, Sumit Gulwani, Ranvijay Kumar, Chiu Ying Cheung
  • Patent number: 9891895
    Abstract: Systems and methods for increasing user confidence in results that are produced by one or more programs that are generated by an underlying Programming-By-Example (PBE) system based on user input examples. A plurality of generated programs that have been generated using one or more user input examples that are indicative of an output that should be achieved to comply with a user determined result are received. The generated programs are narrowed based on one or more sub-expressions of the programs that are likely to cause the resultant program to comply with the user determined result. The one or more sub-expressions are exposed. Input that selects at least one of the one or more exposed sub-expressions to thereby identify the one of the generated programs that will result in the user determined result is received.
    Type: Grant
    Filed: September 14, 2015
    Date of Patent: February 13, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sumit Gulwani, Benjamin Goth Zorn, Rishabh Singh, Mark Marron, Oleksandr Polozov, Vu Minh Le, Mikael Mayer, Gustavo Araujo Soares, Maxim Grechkin
  • Publication number: 20170091589
    Abstract: Various technologies described herein pertain to controlling automated programming for extracting data from an input document. Examples indicative of the data to extract from the input document can be received. The examples can include highlighted regions on the input document. Moreover, the input document can be a semi-structured document (e.g. a text file, a log file, a word processor document, a semi-structured spreadsheet, a webpage, a fixed-layout document, an image file, etc.). Further, an extraction program for extracting the data from the input document can be synthesized based on the examples. The extraction program can be synthesized in a domain specific language (DSL) for a type of the input document. Moreover, the extraction program can be executed on the input document to extract an instance of an output data schema.
    Type: Application
    Filed: December 12, 2016
    Publication date: March 30, 2017
    Inventors: Sumit Gulwani, Vu Minh Le
  • Publication number: 20170075661
    Abstract: Systems and methods for increasing user confidence in results that are produced by one or more programs that are generated by an underlying Programming-By-Example (PBE) system based on user input examples. A plurality of generated programs that have been generated using one or more user input examples that are indicative of an output that should be achieved to comply with a user determined result are received. The generated programs are narrowed based on one or more sub-expressions of the programs that are likely to cause the resultant program to comply with the user determined result. The one or more sub-expressions are exposed. Input that selects at least one of the one or more exposed sub-expressions to thereby identify the one of the generated programs that will result in the user determined result is received.
    Type: Application
    Filed: September 14, 2015
    Publication date: March 16, 2017
    Inventors: Sumit Gulwani, Benjamin Goth Zorn, Rishabh Singh, Mark Marron, Oleksandr Polozov, Vu Minh Le, Mikael Mayer, Gustavo Araujo Soares, Maxim Grechkin
  • Patent number: 9542622
    Abstract: Various technologies described herein pertain to controlling automated programming for extracting data from an input document. Examples indicative of the data to extract from the input document can be received. The examples can include highlighted regions on the input document. Moreover, the input document can be a semi-structured document (e.g. a text file, a log file, a word processor document, a semi-structured spreadsheet, a webpage, a fixed-layout document, an image file, etc.). Further, an extraction program for extracting the data from the input document can be synthesized based on the examples. The extraction program can be synthesized in a domain specific language (DSL) for a type of the input document. Moreover, the extraction program can be executed on the input document to extract an instance of an output data schema.
    Type: Grant
    Filed: March 3, 2015
    Date of Patent: January 10, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sumit Gulwani, Vu Minh Le
  • Patent number: 9448772
    Abstract: A program development framework (PDF) is described herein which allows a user to produce a program in piecemeal fashion by successively specifying program fragments. The PDF creates a new program fragment by receiving keyword information from the user that describes a new program fragment, and then identifies context information that pertains to a programmatic context in which the new program fragment appears within the overall program being created. The PDF then generates a set of candidate program fragments that satisfy the keyword information and the context information, and ranks those candidate program fragments based on ranking information. At least part of the ranking information may be based on statistical information that is produced by analyzing a corpus of previous programs produced by one or more users. The PDF then provides the ranked program fragments to the user using various user-friendly presentation strategies.
    Type: Grant
    Filed: June 28, 2013
    Date of Patent: September 20, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sumit Gulwani, Jonathan Paul de Halleux, Nikolai Tillmann, Vu Minh Le, Zhendong Su
  • Publication number: 20150254211
    Abstract: Various technologies described herein pertain to controlling programming for manipulating an input document based on example(s) and/or natural language input(s). A data manipulation system includes an interface component configured to receive an input document, which is semi-structured or unstructured. The data manipulation system further includes an extraction component configured to synthesize, based on a first input, a first program for parsing data of the input document. The extraction component is configured to execute the first program on the input document to form structured data. The data manipulation system also includes an operation component configured to synthesize, based on a second input, a second program for performing an operation on the structured data. The operation component is configured to execute the second program on the structured data to generate a result of the operation, which is output by the data manipulation system.
    Type: Application
    Filed: February 13, 2015
    Publication date: September 10, 2015
    Inventors: Sumit Gulwani, Edward C. Hart, JR., Vu Minh Le, Henrique S. Malvar, Mark Marron, James D. McCaffrey, Gustavo Araujo Soares, Benjamin G. Zorn
  • Publication number: 20150254530
    Abstract: Various technologies described herein pertain to controlling automated programming for extracting data from an input document. Examples indicative of the data to extract from the input document can be received. The examples can include highlighted regions on the input document. Moreover, the input document can be a semi-structured document (e.g. a text file, a log file, a word processor document, a semi-structured spreadsheet, a webpage, a fixed-layout document, an image file, etc.). Further, an extraction program for extracting the data from the input document can be synthesized based on the examples. The extraction program can be synthesized in a domain specific language (DSL) for a type of the input document. Moreover, the extraction program can be executed on the input document to extract an instance of an output data schema.
    Type: Application
    Filed: March 3, 2015
    Publication date: September 10, 2015
    Inventors: Sumit Gulwani, Vu Minh Le
  • Publication number: 20140282375
    Abstract: A program development framework (PDF) is described herein which allows a user to produce a program in piecemeal fashion by successively specifying program fragments. The PDF creates a new program fragment by receiving keyword information from the user that describes a new program fragment, and then identifies context information that pertains to a programmatic context in which the new program fragment appears within the overall program being created. The PDF then generates a set of candidate program fragments that satisfy the keyword information and the context information, and ranks those candidate program fragments based on ranking information. At least part of the ranking information may be based on statistical information that is produced by analyzing a corpus of previous programs produced by one or more users. The PDF then provides the ranked program fragments to the user using various user-friendly presentation strategies.
    Type: Application
    Filed: June 28, 2013
    Publication date: September 18, 2014
    Inventors: Sumit Gulwani, Jonathan Paul de Halleux, Nikolai Tillmann, Vu Minh Le, Zhendong Su
  • Patent number: 7146350
    Abstract: An assessments framework includes a framework batch processor and a partner batch processor. The framework batch processor performs static assessments based on information obtained from a task definition database and a policy database. The partner batch processor outputs dynamic task definitions and dynamic policy information, to be input to the framework batch processor for executing an assessment when a triggering event occurs. The triggering event may be a schedule-based trigger or an event-based trigger.
    Type: Grant
    Filed: January 29, 2003
    Date of Patent: December 5, 2006
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Paul Edward Holland, Mark William McDowell, Paul Laurence Dineen, Vu An Minh Le