Patents by Inventor Yifung Lin

Yifung Lin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

NOTEBOOKS WITH CACHED RESULTS

Publication number: 20250036644

Abstract: A data platform for running a subset of cells in a notebook is provided. The data platform receives a run cells message from a notebook user interface (UI) application specifying the subset of cells to run. For each cell in the subset, the data platform runs the cell to generate a set of results, generates a cell execution stream using the results, stores the stream, and transmits the stream to the notebook UI application. The notebook UI application generates a display for the user using the cell execution stream. The data platform provides an efficient way to run specific cells in a notebook and display the results to the user.

Type: Application

Filed: October 16, 2024

Publication date: January 30, 2025

Inventors: Annissa Al-Alusi, Istvan Cseri, Yifung Lin, Jue Liu, Michael Joseph Papale, William A. Pugh, Jeffrey Shaw, Wei Song, Thiago Teixeira
Notebooks with cached results

Patent number: 12147445

Abstract: A data platform for running a subset of cells in a notebook is provided. The data platform receives a run cells message from a notebook user interface (UI) application specifying the subset of cells to run. For each cell in the subset, the data platform runs the cell to generate a set of results, generates a cell execution stream using the results, stores the stream, and transmits the stream to the notebook UI application. The notebook UI application generates a display for the user using the cell execution stream. The data platform provides an efficient way to run specific cells in a notebook and display the results to the user.

Type: Grant

Filed: November 28, 2023

Date of Patent: November 19, 2024

Assignee: Snowflake Inc.

Inventors: Annissa Al-Alusi, Istvan Cseri, Yifung Lin, Jue Liu, Michael Joseph Papale, William A. Pugh, Jeffrey Shaw, Wei Song, Thiago Teixeira
Notebooks with predictable behavior

Patent number: 11880381

Abstract: A data platform for running a subset of cells in a notebook is provided. The data platform receives a run cells message from a notebook user interface (UI) application specifying the subset of cells to run. For each cell in the subset, the data platform runs the cell to generate a set of results, generates a cell execution stream using the results, stores the stream, and transmits the stream to the notebook UI application. The notebook UI application generates a display for the user using the cell execution stream. The data platform provides an efficient way to run specific cells in a notebook and display the results to the user.

Type: Grant

Filed: July 13, 2023

Date of Patent: January 23, 2024

Assignee: Snowflake Inc.

Inventors: Annissa Al-Alusi, Istvan Cseri, Yifung Lin, Jue Liu, Michael Joseph Papale, William A. Pugh, Jeffrey Shaw, Wei Song, Thiago Teixeira
Generation of data flow from syntax tree

Patent number: 10942922

Abstract: A computerized mechanism to formulate a data flow representation from a syntax tree. The syntax tree may be, for instance, an abstract syntax tree (AST) that is formulated by compiling query script, such as big data query script. Each node in the syntax tree is composed of one or more tokens (script portions) that are taken from the query script, which relationships between the tokens being syntactically represented by links between those tokens. Accordingly, the data flow representation may also be a data flow representation of the original query script itself. In order to formulate the data flow representation from the syntax tree, the data types of the various inputs and outputs of the syntax tree nodes are identified and bound to a corresponding data flow. This may be performed whilst honoring dependencies between the nodes. The data flow may be visualized to an author or reviewer of the script.

Type: Grant

Filed: June 28, 2016

Date of Patent: March 9, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: David Joseph Cummings, Zhaoji Chen, Yifung Lin, Dian Zhang
Data skew finding and analysis

Patent number: 10545965

Abstract: Reducing data skew when performing computational jobs (such as responding to complex queries or queries on big data sets). To gather performance information for a vertex in a distributed stage, the distributed stage is processed and the resulting performance information being generated. The system then identifies a performance outlier of the vertices, which varies from a performance norm of other vertices. A trigger instruction is then generated that is structured to trigger the performance outlier vertex to gather data distribution information of the data that the performance outlier vertex previously processed. The trigger instruction may thus be executed to cause the performance outlier vertex to gather the data distribution information for evaluation for data skew.

Type: Grant

Filed: June 30, 2016

Date of Patent: January 28, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jin Sun, Omid Afnan, Yifung Lin
Modeling resource usage for a job

Patent number: 10509683

Abstract: Simulation of a computational job using various sets of resources, and potentially also the automated or semi-automated allocation of an appropriate set of resources for accomplishing a computational job comprising multiple vertices. For each of multiple potential sets of resources, a simulation module simulates processing of the computational job. While the simulation does not actually perform the vertex on each processing node, the simulation does use dependencies between vertices, and historical data regarding the processing of instances of such vertices, in order to determine the efficacy of processing of each vertex, and to put the estimations together into an overall simulation result.

Type: Grant

Filed: May 12, 2016

Date of Patent: December 17, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Michael Furman, Yifung Lin, Deyang Song
Debugging remote vertex code on test machine

Patent number: 10185647

Abstract: The supporting of debugging of failed vertex code on a test machine. This debugging is made possible even though the vertex code failed while on a remote processing node, and is but one of multiple, and potentially innumerable vertices that run in a distributed environment. This represents a vast technical improvement over prior ad hoc methods for trying to debug a large distributed application, since time is not wasted on vertices that operated properly, but rather debugging is focused on the problem vertex. Even reproducing the failure is a huge technical step forward.

Type: Grant

Filed: May 12, 2016

Date of Patent: January 22, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Na Gao, Yifung Lin, Omid Afnan, Dian Zhang
Critical Path Detection in Job

Publication number: 20170249762

Abstract: Identifying a path for a distributed job. A method includes dynamically collecting timing and relationship information for vertices in stages of a running job. The method further includes identifying a particular vertex. The method further includes iteratively identifying a single path parent-child sequence path for the identified vertex. The method further includes displaying a Gantt chart showing the identified path, wherein the Gantt chart shows timing relationships and execution progress information.

Type: Application

Filed: February 29, 2016

Publication date: August 31, 2017

Inventors: Yifung Lin, Omid Afnan, Dian Zhang
LOCATION CORRELATION BETWEEN QUERY SCRIPT AND DATA FLOW

Publication number: 20170091317

Abstract: A computerized mechanism to automatically correlate positions of query script to portions of a data flow representation of the query script. When parsing the query script to generate the tokens, at least some of the tokens have an associated script location marker that identifies a location in the query script where the token originated from. The syntax tree of multiple nodes is then formulated, each node comprising one or more of the tokens parsed from the query script. Accordingly, the syntax tree retains the script location markers. A data flow representation of the query script is then formulated into a data flow representation. That data flow representation might, for instance, be based on the syntax tree, but augmented with data types of the various data flows. Nevertheless, the location marker is retained within the data flow representation.

Type: Application

Filed: June 28, 2016

Publication date: March 30, 2017

Inventors: David Joseph Cummings, Zhaoji Chen, Yifung Lin, Dian Zhang
DEBUGGING REMOTE VERTEX CODE ON TEST MACHINE

Publication number: 20170091076

Abstract: The supporting of debugging of failed vertex code on a test machine. This debugging is made possible even though the vertex code failed while on a remote processing node, and is but one of multiple, and potentially innumerable vertices that run in a distributed environment. This represents a vast technical improvement over prior ad hoc methods for trying to debug a large distributed application, since time is not wasted on vertices that operated properly, but rather debugging is focused on the problem vertex. Even reproducing the failure is a huge technical step forward.

Type: Application

Filed: May 12, 2016

Publication date: March 30, 2017

Inventors: Na Gao, Yifung Lin, Omid Afnan, Dian Zhang
MODELING RESOURCE USAGE FOR A JOB

Publication number: 20170090990

Abstract: Simulation of a computational job using various sets of resources, and potentially also the automated or semi-automated allocation of an appropriate set of resources for accomplishing a computational job comprising multiple vertices. For each of multiple potential sets of resources, a simulation module simulates processing of the computational job. While the simulation does not actually perform the vertex on each processing node, the simulation does use dependencies between vertices, and historical data regarding the processing of instances of such vertices, in order to determine the efficacy of processing of each vertex, and to put the estimations together into an overall simulation result.

Type: Application

Filed: May 12, 2016

Publication date: March 30, 2017

Inventors: Michael Furman, Yifung Lin, Deyang Song
DATA SKEW FINDING AND ANALYSIS

Publication number: 20170091185

Abstract: Reducing data skew when performing computational jobs (such as responding to complex queries or queries on big data sets). To gather performance information for a vertex in a distributed stage, the distributed stage is processed and the resulting performance information being generated. The system then identifies a performance outlier of the vertices, which varies from a performance norm of other vertices. A trigger instruction is then generated that is structured to trigger the performance outlier vertex to gather data distribution information of the data that the performance outlier vertex previously processed. The trigger instruction may thus be executed to cause the performance outlier vertex to gather the data distribution information for evaluation for data skew.

Type: Application

Filed: June 30, 2016

Publication date: March 30, 2017

Inventors: Jin Sun, Omid Afnan, Yifung Lin
GENERATION OF DATA FLOW FROM SYNTAX TREE

Publication number: 20170091293

Abstract: A computerized mechanism to formulate a data flow representation from a syntax tree. The syntax tree may be, for instance, an abstract syntax tree (AST) that is formulated by compiling query script, such as big data query script. Each node in the syntax tree is composed of one or more tokens (script portions) that are taken from the query script, which relationships between the tokens being syntactically represented by links between those tokens. Accordingly, the data flow representation may also be a data flow representation of the original query script itself. In order to formulate the data flow representation from the syntax tree, the data types of the various inputs and outputs of the syntax tree nodes are identified and bound to a corresponding data flow. This may be performed whilst honoring dependencies between the nodes. The data flow may be visualized to an author or reviewer of the script.

Type: Application

Filed: June 28, 2016

Publication date: March 30, 2017

Inventors: David Joseph Cummings, Zhaoji Chen, Yifung Lin, Dian Zhang
DataGrid user interface control with row details

Patent number: 8166387

Abstract: A DataGrid user interface (UI) control is presented that is configured to display a table that includes columns corresponding to fields in a data source and rows corresponding to records in the data source. The DataGrid UI control is also configured to display a details section associated with each row in the table. One or all of the details sections can be displayed or hidden based upon properties specified by a developer or input made by a user. A property may also be set to cause the details section for a row to be displayed only when a user interface cursor is hovering over the row associated with the details section. A property may also be set to cause a details section for a row to be displayed only when the associated row is selected.

Type: Grant

Filed: June 20, 2008

Date of Patent: April 24, 2012

Assignee: Microsoft Corporation

Inventors: Scott Morrison, Régis Lucien Francis Brid, Yifung Lin
SMOOTH PIXEL-BASED SCROLLING IN A USER INTERFACE CONTROL

Publication number: 20100162161

Abstract: Described is a technology in which a user interface element (a control, e.g., a datagrid) provides smoother scrolling by computing and updating a row height estimate. For example, the row height estimate along with the total number of rows may be used to configure a scrollbar. A control outputs visible rows according to user scrolling interaction based on a current row index and current pixel values corresponding to that current row index. The estimate is obtained based upon the current row index and the current pixel value for the row index. Whenever a row index is encountered that is larger than any previously-encountered row index, the row height estimate is updated, e.g., to reconfigure the scrollbar. Also described is outputting partially visible rows based on a number of pixels available in a display area below a last fully visible row, and/or above a first fully visible row.

Type: Application

Filed: December 23, 2008

Publication date: June 24, 2010

Applicant: Microsoft Corporation

Inventors: Yifung Lin, Scott Morrison, Brian Christopher Braeckel
RECYCLING OF VIEW COMPONENTS IN A USER INTERFACE

Publication number: 20100037164

Abstract: Architecture that facilitates the recycling of view components (e.g., item controls, visual containers, etc.) in a user interface rather than recreating the view components when needed again in response to a view change such as a scrolling operation or other user navigation operation such as a paging control. In the context of a datagrid, the datagrid can now be more performant when scrolling through a large number of rows. The architecture includes a recycling component that recycles the visual containers by collecting the visual containers when rows are scrolled off the view, and reusing the visual containers with different data when rows are scrolled back into the view. Thus, all the containers stay the same—new data is simply applied to the recycled container.

Type: Application

Filed: August 11, 2008

Publication date: February 11, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Scott Morrison, Régis Lucien Francis Brid, Yifung Lin
DataGrid User Interface Control With Row Details

Publication number: 20090319882

Abstract: A DataGrid user interface (UI) control is presented that is configured to display a table that includes columns corresponding to fields in a data source and rows corresponding to records in the data source. The DataGrid UI control is also configured to display a details section associated with each row in the table. One or all of the details sections can be displayed or hidden based upon properties specified by a developer or input made by a user. A property may also be set to cause the details section for a row to be displayed only when a user interface cursor is hovering over the row associated with the details section. A property may also be set to cause a details section for a row to be displayed only when the associated row is selected.

Type: Application

Filed: June 20, 2008

Publication date: December 24, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Scott Morrison, Regis Lucien Francis Brid, Yifung Lin
VIEW SELECTION AND SWITCHING

Publication number: 20090132915

Abstract: Application logic and user interfaces are separated to allow more than one interface or view to be easily employed for given application logic. A particular interface can be utilized to display data and/or facilitate interaction simply by identifying the interface. Available interfaces are identified to users for selection thereof. The specific interfaces and manner of identification can be filtered or otherwise controlled or arranged. Upon selection of an interface, data can be rendered accordingly. A transition can also be applied between switched interfaces.

Type: Application

Filed: November 20, 2007

Publication date: May 21, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Michael Scott McCraw, Yifung Lin, David Paul Hill, Kathy K. Carper, Aliaksandr Bulankou