Patents by Inventor Yat On Lau
Yat On Lau has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11966381Abstract: Embodiments maintain a data pool that includes heterogeneous data sets, and receiving a first data batch of a data set from a data source into the data pool. Embodiments determine a current state of the data set based on a data set state diagram including a plurality of data set states, and identify a condition of the first data batch. Embodiments further set a data batch state for the first data batch, based on a data batch state diagram, and update the data batch state of a prior data batch received before the first data batch, based on the condition of the first data batch. Embodiments additionally transition the data set state diagram, based on the condition of the first data batch, to an updated data set state. Embodiments maintain a data state repository storing the data set state for each of the plurality of heterogeneous data sets.Type: GrantFiled: November 9, 2021Date of Patent: April 23, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Liangzhao Zeng, Ting Yu Cliff Leung, Yat On Lau, Jimmy Hong, Chuang Yao, Yen-Ting Liu, Ting-Kuan Wu
-
Patent number: 11860853Abstract: Embodiments of the system in include a memory that stores a metamodel including a plurality of predefined characteristics for data sets. A data repository stores a plurality of heterogeneous data sets, each of the plurality of data sets comprising a plurality of data batches received over time. An interface receives a new data set for storage into the data repository, and data health reasoner to retrieve the stored metamodel from the memory, the stored metamodel including a plurality of predefined characteristics. The data health reasoner determines measured values of a subset of the plurality of predefined characteristics identified based on the stored metamodel, and determines a set of data health metrics for the data set based on the measured values of the subset of the set of the predefined characteristics. The data health reasoner formulates a plurality of data validation assertions for the data set and apply the plurality of data validation assertions to each instance of the data set.Type: GrantFiled: November 9, 2021Date of Patent: January 2, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Liangzhao Zeng, Ting Yu Cliff Leung, Jimmy Hong, Yat On Lau
-
Publication number: 20230147939Abstract: Embodiments maintain a data pool that includes heterogeneous data sets, and receiving a first data batch of a data set from a data source into the data pool. Embodiments determine a current state of the data set based on a data set state diagram including a plurality of data set states, and identify a condition of the first data batch. Embodiments further set a data batch state for the first data batch, based on a data batch state diagram, and update the data batch state of a prior data batch received before the first data batch, based on the condition of the first data batch. Embodiments additionally transition the data set state diagram, based on the condition of the first data batch, to an updated data set state. Embodiments maintain a data state repository storing the data set state for each of the plurality of heterogeneous data sets.Type: ApplicationFiled: November 9, 2021Publication date: May 11, 2023Inventors: Liangzhao ZENG, Ting Yu Cliff LEUNG, Yat On LAU, Jimmy HONG, Chuang YAO, Yen-Ting LIU, Ting-Kuan WU
-
Publication number: 20230145069Abstract: Embodiments of the system in include a memory that stores a metamodel including a plurality of predefined characteristics for data sets. A data repository stores a plurality of heterogeneous data sets, each of the plurality of data sets comprising a plurality of data batches received over time. An interface receives a new data set for storage into the data repository, and data health reasoner to retrieve the stored metamodel from the memory, the stored metamodel including a plurality of predefined characteristics. The data health reasoner determines measured values of a subset of the plurality of predefined characteristics identified based on the stored metamodel, and determines a set of data health metrics for the data set based on the measured values of the subset of the set of the predefined characteristics. The data health reasoner formulates a plurality of data validation assertions for the data set and apply the plurality of data validation assertions to each instance of the data set.Type: ApplicationFiled: November 9, 2021Publication date: May 11, 2023Inventors: Liangzhao ZENG, Ting Yu Cliff LEUNG, Jimmy HONG, Yat On LAU
-
Patent number: 11455217Abstract: Transaction consistency query support is available for replicated data from recovery log to external data stores. An external data store is populated with records using entries of a change data table. The change data table has entries for each transaction that has committed and is to be replicated, and each of the entries stores information for each log entry in a recovery log from a database management system. Each log entry identifies a transactional change of data and a transaction completion indicator of one of commit and abort. In response to receiving a query about a transaction of the transactions, a set of records are retrieved from the external data store for the transaction. From the set of records, records whose sequence identifier values are larger than a maximum transaction commit sequence identifier are removed. From the set of records, remaining records having transaction consistency are returned.Type: GrantFiled: December 21, 2017Date of Patent: September 27, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Serge Bourbonnais, Austin F. M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf, Kan Zhang
-
Patent number: 11442823Abstract: Transaction consistency query support is available for replicated data from recovery log to external data stores. An external data store is populated with records using entries of a change data table. The change data table has entries for each transaction that has committed and is to be replicated, and each of the entries stores information for each log entry in a recovery log from a database management system. Each log entry identifies a transactional change of data and a transaction completion indicator of one of commit and abort. In response to receiving a query about a transaction of the transactions, a set of records are retrieved from the external data store for the transaction. From the set of records, records whose sequence identifier values are larger than a maximum transaction commit sequence identifier are removed. From the set of records, remaining records having transaction consistency are returned.Type: GrantFiled: June 3, 2016Date of Patent: September 13, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Serge Bourbonnais, Austin F. M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf, Kan Zhang
-
Patent number: 10216582Abstract: Provided are techniques for replicating relational transactional log data to a big data platform. Change records contained in change data tables are fetched. A relational change history with transaction snapshot consistency is rebuilt to generate consistent change records by joining the change data tables and a unit of work table based on a commit sequence identifier. The consistent change records are stored on the big data platform, and queries are answered on the big data platform using the consistent change records.Type: GrantFiled: August 15, 2016Date of Patent: February 26, 2019Assignee: International Business Machines CorporationInventors: Serge Bourbonnais, Austin F. M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf
-
Patent number: 10216584Abstract: Provided are techniques for replicating relational transactional log data to a big data platform. Change records contained in change data tables are fetched. A relational change history with transaction snapshot consistency is rebuilt to generate consistent change records by joining the change data tables and a unit of work table based on a commit sequence identifier. The consistent change records are stored on the big data platform, and queries are answered on the big data platform using the consistent change records.Type: GrantFiled: December 21, 2017Date of Patent: February 26, 2019Assignee: International Business Machines CorporationInventors: Serge Bourbonnais, Austin F. M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf
-
Publication number: 20180113771Abstract: Provided are techniques for transaction consistency query support for replicated data from recovery log to external data stores. An external data store is populated with records using entries of a change data table. The change data table has entries for each transaction that has committed and is to be replicated, and each of the entries stores information for each log entry in a recovery log from a database management system. Each log entry identifies a transactional change of data and a transaction completion indicator of one of commit and abort. In response to receiving a query about a transaction of the transactions, a set of records are retrieved from the external data store for the transaction. From the set of records, records whose sequence identifier values are larger than a maximum transaction commit sequence identifier are removed. From the set of records, remaining records having transaction consistency are returned.Type: ApplicationFiled: December 21, 2017Publication date: April 26, 2018Inventors: Serge Bourbonnais, Austin F.M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf, Kan Zhang
-
Publication number: 20180113766Abstract: Provided are techniques for replicating relational transactional log data to a big data platform. Change records contained in change data tables are fetched. A relational change history with transaction snapshot consistency is rebuilt to generate consistent change records by joining the change data tables and a unit of work table based on a commit sequence identifier. The consistent change records are stored on the big data platform, and queries are answered on the big data platform using the consistent change records.Type: ApplicationFiled: December 21, 2017Publication date: April 26, 2018Inventors: Serge Bourbonnais, Austin F.M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf
-
Patent number: 9928281Abstract: A system, method and computer program product for enabling light weight table comparison with high-accuracy (high confidence) of tables where one is a copy of the other, which copy may be maintained synchronized by replication. The method performs database comparison using a sample-based, statistics-based, or materialized query tables-based approaches. The method first identifies a block comprising a sub-set of rows of data of a source database table and a corresponding block from a target database table, and obtains a statistical value associated with each block. Then the statistical values for the corresponding source and target block are compared and a consistency evaluation of source and target database is determined based on comparing results. Further methods enable a determination of the data as being persistent or not in manner that accounts for real-time data modifications to underlying source and target database tables while identified blocks are being compared.Type: GrantFiled: March 20, 2015Date of Patent: March 27, 2018Assignee: International Business Machines CorporationInventors: Serge Bourbonnais, Yat On Lau, Xiao Li, Hong Min, Jonathan Wierenga, Xiang Wei Zhou
-
Publication number: 20180046551Abstract: Provided are techniques for replicating relational transactional log data to a big data platform. Change records contained in change data tables are fetched. A relational change history with transaction snapshot consistency is rebuilt to generate consistent change records by joining the change data tables and a unit of work table based on a commit sequence identifier. The consistent change records are stored on the big data platform, and queries are answered on the big data platform using the consistent change records.Type: ApplicationFiled: August 15, 2016Publication date: February 15, 2018Inventors: Serge Bourbonnais, Austin F.M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf
-
Publication number: 20170351585Abstract: Provided are techniques for transaction consistency query support for replicated data from recovery log to external data stores. An external data store is populated with records using entries of a change data table. The change data table has entries for each transaction that has committed and is to be replicated, and each of the entries stores information for each log entry in a recovery log from a database management system. Each log entry identifies a transactional change of data and a transaction completion indicator of one of commit and abort. In response to receiving a query about a transaction of the transactions, a set of records are retrieved from the external data store for the transaction. From the set of records, records whose sequence identifier values are larger than a maximum transaction commit sequence identifier are removed. From the set of records, remaining records having transaction consistency are returned.Type: ApplicationFiled: June 3, 2016Publication date: December 7, 2017Inventors: Serge Bourbonnais, Austin F.M. D'Costa, Yat On Lau, Xiao Li, Hong Min, Gong Su, Jonathan W. Wierenga, Christian Zentgraf, Kan Zhang
-
Patent number: 9727625Abstract: According to one embodiment of the present invention, a system uses parallel transaction messages for database replication. The system receives transaction messages from a source system via a plurality of parallel send queues in a receive queue. Each transaction message includes a message identifier indicating a commit order for that transaction. The system reads transaction messages in the receive queue in order in which they were committed against a source database based on the message identifier, and applies changes described by the transaction messages to a target database. Two or more transaction messages in the receive queue are applied to the target database in parallel. The system deletes transaction messages from the receive queue asynchronously to applying changes described by those transaction messages to the target database. Embodiments of the present invention further include a method and computer program product for parallel transaction messages in substantially the same manners described above.Type: GrantFiled: January 16, 2014Date of Patent: August 8, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Serge Bourbonnais, Austin F. M. D'costa, Somil Kulkarni, Yat On Lau, Jonathan W. Wierenga
-
Publication number: 20160275150Abstract: A system, method and computer program product for enabling light weight table comparison with high-accuracy (high confidence) of tables where one is a copy of the other, which copy may be maintained synchronized by replication. The method performs database comparison using a sample-based, statistics-based, or materialized query tables-based approaches. The method first identifies a block comprising a sub-set of rows of data of a source database table and a corresponding block from a target database table, and obtains a statistical value associated with each block. Then the statistical values for the corresponding source and target block are compared and a consistency evaluation of source and target database is determined based on comparing results. Further methods enable a determination of the data as being persistent or not in manner that accounts for real-time data modifications to underlying source and target database tables while identified blocks are being compared.Type: ApplicationFiled: March 20, 2015Publication date: September 22, 2016Inventors: Serge Bournonnais, Yat On Lau, Xiao Li, Hong Min, Jonathan Wierenga, Xiang Wei Zhou
-
Publication number: 20150199415Abstract: According to one embodiment of the present invention, a system uses parallel transaction messages for database replication. The system receives transaction messages from a source system via a plurality of parallel send queues in a receive queue. Each transaction message includes a message identifier indicating a commit order for that transaction. The system reads transaction messages in the receive queue in order in which they were committed against a source database based on the message identifier, and applies changes described by the transaction messages to a target database. Two or more transaction messages in the receive queue are applied to the target database in parallel. The system deletes transaction messages from the receive queue asynchronously to applying changes described by those transaction messages to the target database. Embodiments of the present invention further include a method and computer program product for parallel transaction messages in substantially the same manners described above.Type: ApplicationFiled: January 16, 2014Publication date: July 16, 2015Applicant: International Business Machines CorporationInventors: Serge Bourbonnais, Austin F.M. D'costa, Somil Kulkarni, Yat On Lau, Jonathan W. Wierenga
-
Patent number: 7627854Abstract: A computer implemented method, data processing system, and computer usable code are provided for automatically generating object setup scripts. A form is presented to a user in a graphical user interface. The user provides input in a set of fields within the form, which is received and validated. A set of object setup scripts are then generated based on the user input to the form. To generate the set of object setup scripts based on the user input an analysis is performed on the set of fields as a set of objects, and at least one variable is set to hold the value of the set of objects. String manipulations are used to obtain the values in the set of fields, and the values are entered into the set of object setup scripts.Type: GrantFiled: January 12, 2006Date of Patent: December 1, 2009Assignee: International Business Machines CorporationInventors: Dell Blair Burner, Brett Austin Johnson, Yat On Lau, Michael Wei Mao, Daina Edvina Pupons Wickham
-
Publication number: 20070162891Abstract: A computer implemented method, data processing system, and computer usable code are provided for automatically generating object setup scripts. A form is presented to a user in a graphical user interface. The user provides input in a set of fields within the form, which is received and validated. A set of object setup scripts are then generated based on the user input to the form. To generate the set of object setup scripts based on the user input an analysis is performed on the set of fields as a set of objects, and at least one variable is set to hold the value of the set of objects. String manipulations are used to obtain the values in the set of fields, and the values are entered into the set of object setup scripts.Type: ApplicationFiled: January 12, 2006Publication date: July 12, 2007Applicant: International Business Machines CorporationInventors: Dell Burner, Brett Johnson, Yat Lau, Michael Mao, Daina Wickham
-
Patent number: 7203687Abstract: A method and system for member initialization to and deactivation from an asynchronous data replication group in a database system is disclosed. The method and system allows new members to be added to the replication group or existing members to be removed from the replication group, without requiring the halting of the asynchronous replication of data. The performance advantages of asynchronous replication are still realized during member initialization or deactivation.Type: GrantFiled: February 26, 2004Date of Patent: April 10, 2007Assignee: International Business Machines CorporationInventors: Nicolas G. Adiba, Jaime F. Anaya, Elizabeth B. Hamel, Yat On Lau, Siqun Li, Bruce G. Lindsay
-
Publication number: 20050193037Abstract: A method and system for member initialization to and deactivation from an asynchronous data replication group in a database system is disclosed. The method and system allows new members to be added to the replication group or existing members to be removed from the replication group, without requiring the halting of the asynchronous replication of data. The performance advantages of asynchronous replication are still realized during member initialization or deactivation.Type: ApplicationFiled: February 26, 2004Publication date: September 1, 2005Inventors: Nicolas Adiba, Jaime Anaya, Elizabeth Hamel, Yat Lau, Siqun Li, Bruce Lindsay