Patents by Inventor Xin Ying Yang

Xin Ying Yang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20170124087
    Abstract: Embodiments of the present invention provide efficient systems and methods for searching a keyword. Embodiments of the present invention can be used to more quickly and comprehensively search a document for a keyword or phrase, based on the preferences and behaviors of a user, so that the returned results to the user are ranked in order of the preferences of the user. By factoring in user data such as time spent on certain content and highlighted content, the system is able to more efficiently return preferred results of a user of a keyword search.
    Type: Application
    Filed: February 22, 2016
    Publication date: May 4, 2017
    Inventors: Xiao Di Deng, Shuo Li, Meng Wan, Xin Ying Yang
  • Publication number: 20170124081
    Abstract: Embodiments of the present invention provide efficient systems and methods for searching a keyword. Embodiments of the present invention can be used to more quickly and comprehensively search a document for a keyword or phrase, based on the preferences and behaviors of a user, so that the returned results to the user are ranked in order of the preferences of the user. By factoring in user data such as time spent on certain content and highlighted content, the system is able to more efficiently return preferred results of a user of a keyword search.
    Type: Application
    Filed: November 2, 2015
    Publication date: May 4, 2017
    Inventors: Xiao Di Deng, Shuo Li, Meng Wan, Xin Ying Yang
  • Publication number: 20170124134
    Abstract: A computer maps a literal in a database query to a digital representation, wherein the database query comprises a predicate, the literal is a part of the predicate, and the digital representation is predetermined based at least in part on external statistical data. The computer estimates a filter factor for the predicate based at least in part on the digital representation and compressed statistical data, wherein the compressed statistical data are prepared at least in part from the external statistical data.
    Type: Application
    Filed: November 3, 2015
    Publication date: May 4, 2017
    Inventors: Ke Wei Wei, Maryela E. Weihrauch, Hao Wu, Xin Ying Yang, Miao Zheng
  • Publication number: 20170124149
    Abstract: A computer program product to optimize a database is provided. A processor determines a frequency of search for a plurality of columns in a table. A processor determines a length of the plurality of columns. A processor determines a rank for the plurality of columns based, at least in part, on both the frequency of search and the length of the plurality of columns. A processor generates a first set of combinations of the plurality of columns. A processor determines an optimization of the first set of combinations of the plurality of columns. A processor selects an optimal combination of the first set of combinations, wherein the optimization of the optimal combination is greater than the optimization for all other combinations in the first set of combinations. A processor moves the plurality of columns excluded from the optimal combination.
    Type: Application
    Filed: January 17, 2017
    Publication date: May 4, 2017
    Inventors: Shuo Li, Shu Hua Liu, Xin Ying Yang, Jian W. Zhang
  • Patent number: 9633059
    Abstract: A computer program product to optimize a database is provided. A processor determines a frequency of search for a plurality of columns in a table. A processor determines a length of the plurality of columns. A processor determines a rank for the plurality of columns based, at least in part, on both the frequency of search and the length of the plurality of columns. A processor generates a first set of combinations of the plurality of columns. A processor determines an optimization of the first set of combinations of the plurality of columns. A processor selects an optimal combination of the first set of combinations, wherein the optimization of the optimal combination is greater than the optimization for all other combinations in the first set of combinations. A processor moves the plurality of columns excluded from the optimal combination.
    Type: Grant
    Filed: August 11, 2016
    Date of Patent: April 25, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shuo Li, Shu Hua Liu, Xin Ying Yang, Jian W. Zhang
  • Patent number: 9613075
    Abstract: Embodiments of the present invention provide a method, computer program product, and computer system for masking indexes. The method may include masking the index if the index contention exceeds a defined threshold. The method may also include configuring the mask type for the index to insert the index without masking. In response to determining that the mask type for the index is configured to insert the index without masking, the index is masked according to a set of rules.
    Type: Grant
    Filed: March 10, 2016
    Date of Patent: April 4, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shuo Li, Ping Liang, Xin Ying Yang, Jian Wei Zhang
  • Publication number: 20170093876
    Abstract: In an embodiment of the present invention, a command for performing a database operation with respect to a table of a database is received, whereby the table has a visible key column for identification information and one or more invisible token columns for token information. The user's role from the command is obtained. A record is identified in the table based on a combination of the identification information and the token information corresponding to the user's role. The database operation is performed on the identified record.
    Type: Application
    Filed: September 29, 2015
    Publication date: March 30, 2017
    Inventors: Hao Feng, Shuo Li, Shengyan Sun, Xin Ying Yang
  • Publication number: 20170091248
    Abstract: Embodiments of the present invention provide a method, computer program product, and computer system for masking indexes. The method may include masking the index if the index contention exceeds a defined threshold. The method may also include configuring the mask type for the index to insert the index without masking. In response to determining that the mask type for the index is configured to insert the index without masking, the index is masked according to a set of rules.
    Type: Application
    Filed: December 16, 2016
    Publication date: March 30, 2017
    Inventors: Shuo Li, Ping Liang, Xin Ying Yang, Jian Wei Zhang
  • Publication number: 20170091247
    Abstract: Embodiments of the present invention provide a method, computer program product, and computer system for masking indexes. The method may include masking the index if the index contention exceeds a defined threshold. The method may also include configuring the mask type for the index to insert the index without masking. In response to determining that the mask type for the index is configured to insert the index without masking, the index is masked according to a set of rules.
    Type: Application
    Filed: December 15, 2016
    Publication date: March 30, 2017
    Inventors: Shuo Li, Ping Liang, Xin Ying Yang, Jian Wei Zhang
  • Publication number: 20170083548
    Abstract: A computer-implemented method includes identifying one or more database modification statements and identifying one or more operational unit indicators. The one or more operation unit indicators are caused to be generated by the one or more database modification statements. An anticipated operational size is determined. The anticipated operational size is an estimated total number of the one or more operational unit indicators. An anticipated operational throughput rate is determined. The anticipated operational throughput rate is a rate at which the operational unit indicators are expected to be generated. An anticipated total execution time of the one or more database modification statements is determined based on the anticipated operational size and the anticipated operational throughput rate. A corresponding computer program product and computer system are also disclosed.
    Type: Application
    Filed: September 17, 2015
    Publication date: March 23, 2017
    Inventors: Ting Xu Guan, Shuo Li, Ping Liang, Ke Wei Wei, Xin Ying Yang
  • Publication number: 20170083576
    Abstract: A computer-implemented method includes identifying one or more database modification statements and identifying one or more operational unit indicators. The one or more operation unit indicators are caused to be generated by the one or more database modification statements. An anticipated operational size is determined. The anticipated operational size is an estimated total number of the one or more operational unit indicators. An anticipated operational throughput rate is determined. The anticipated operational throughput rate is a rate at which the operational unit indicators are expected to be generated. An anticipated total execution time of the one or more database modification statements is determined based on the anticipated operational size and the anticipated operational throughput rate. A corresponding computer program product and computer system are also disclosed.
    Type: Application
    Filed: November 8, 2016
    Publication date: March 23, 2017
    Inventors: Ting Xu Guan, Shuo Li, Ping Liang, Ke Wei Wei, Xin Ying Yang
  • Publication number: 20170075957
    Abstract: A computer-implemented method includes receiving a batch processing job. The method identifies multiple access paths for the batch processing job. The method determines one or more runtime costs for each of the multiple access paths. The method determines whether the batch processing job includes comparable access paths. The method responds to the batch processing job not including comparable access paths by receiving a query and selecting a desired access, based on the one or more runtime costs. The method responds to the batch processing job including comparable access paths by determining one or more context costs for each of the one or more access paths, receiving a query, identifying a context for the query, and selecting a desired access path for the query. The desired access path is based on the one or more runtime costs, the one or more context costs, and the context for the query.
    Type: Application
    Filed: November 7, 2016
    Publication date: March 16, 2017
    Inventors: Shuo Li, Li Yang, Ke Wei Wei, Xin Ying Yang, Jian Wei Zhang
  • Publication number: 20170075956
    Abstract: A computer-implemented method includes receiving a batch processing job. The method identifies multiple access paths for the batch processing job. The method determines one or more runtime costs for each of the multiple access paths. The method determines whether the batch processing job includes comparable access paths. The method responds to the batch processing job not including comparable access paths by receiving a query and selecting a desired access, based on the one or more runtime costs. The method responds to the batch processing job including comparable access paths by determining one or more context costs for each of the one or more access paths, receiving a query, identifying a context for the query, and selecting a desired access path for the query. The desired access path is based on the one or more runtime costs, the one or more context costs, and the context for the query.
    Type: Application
    Filed: September 15, 2015
    Publication date: March 16, 2017
    Inventors: Shuo Li, Ke Wei Wei, Li Yang, Xin Ying Yang, Jian Wei Zhang
  • Publication number: 20170068704
    Abstract: In an approach for calculating one or more access paths during bind time, a computer receives a query. The computer identifies one or more access paths for processing the received query, wherein the one or more access paths include steps associated with retrieving data from a database based on the received query. The computer calculates resource costs associated with processing the received query on the one or more identified access paths based on one of more of: resources utilized to perform steps associated with processing the received query, and system statistics associated with the one or more identified access paths.
    Type: Application
    Filed: November 18, 2016
    Publication date: March 9, 2017
    Inventors: Hao Feng, Shuo Li, ShengYan Sun, Xin Ying Yang
  • Publication number: 20170060946
    Abstract: A computer-implemented method includes identifying a query, including one or more predicates and one or more branches, wherein one or more branches includes one or more legs. The computer-implemented method further includes, for each branch, in parallel: determining a risk, determining a return row threshold, estimating a number of return rows; terminating access if the return rows exceed the threshold. The computer-implemented method further includes, for each leg, in parallel: determining a leg return row threshold; accessing the leg; fetching one or more return rows into one or more leg return row pages; terminating access if the return rows exceed the threshold; intersecting one or more leg return row pages into one or more intersected leg return row pages; and applying the one or more predicates to the one or more intersected leg return row pages. The method may be embodied in a corresponding computer system or computer program product.
    Type: Application
    Filed: August 27, 2015
    Publication date: March 2, 2017
    Inventors: Shuo Li, Heng Liu, Ke Wei Wei, Xin Ying Yang
  • Publication number: 20170060949
    Abstract: A computer-implemented method includes identifying a query, including one or more predicates and one or more branches, wherein one or more branches includes one or more legs. The computer-implemented method further includes, for each branch, in parallel: determining a risk, determining a return row threshold, estimating a number of return rows; terminating access if the return rows exceed the threshold. The computer-implemented method further includes, for each leg, in parallel: determining a leg return row threshold; accessing the leg; fetching one or more return rows into one or more leg return row pages; terminating access if the return rows exceed the threshold; intersecting one or more leg return row pages into one or more intersected leg return row pages; and applying the one or more predicates to the one or more intersected leg return row pages. The method may be embodied in a corresponding computer system or computer program product.
    Type: Application
    Filed: September 12, 2016
    Publication date: March 2, 2017
    Inventors: Shuo Li, Heng Liu, Ke Wei Wei, Xin Ying Yang
  • Patent number: 9576004
    Abstract: Managing free space in a relational database. A data statistics history for the relational database is received, including a distribution of index key columns for a clustering index. Based on the data statistics history, increasing index key columns are identified. A rate of increase of the increasing index key columns is measured. Based on the measured rate of increase and on an estimate of database reorganization frequency, free space for the increasing index key columns is reserved.
    Type: Grant
    Filed: January 25, 2016
    Date of Patent: February 21, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shuo Li, Heng Liu, Ke Wei Wei, Li Yang, Xin Ying Yang, Jian Wei Zhang
  • Patent number: 9575916
    Abstract: A system identifies a performance bottleneck status in a parallel data processing environment by examining data flow associated with the parallel data processing environment to identify at least one operator, where an operator type is associated with at least one operator, at least one buffer, and a relationship that the buffer has with the operator, where the relationship is associated with the operator type. The system monitors the buffer to determine a buffer status associated with the buffer. The system applies a set of rules to identify an operator bottleneck status associated with the operator. The set of rules is applied to the operator, based on the operator type, the buffer status, and relationship that the buffer has with the operator. The system then determines a performance bottleneck status associated with the parallel data processing environment, based on the operator bottleneck status.
    Type: Grant
    Filed: January 6, 2014
    Date of Patent: February 21, 2017
    Assignee: International Business Machines Corporation
    Inventors: Brian K. Caufield, Ron E. Liu, DongJie Wei, Xin Ying Yang
  • Patent number: 9569496
    Abstract: A tool for combining common processes shared by at least two or more sub-queries within a query is provided. The tool determines the query with the at least two or more sub-queries. The tool determines whether one or more sub set relationships are shared between the at least two or more sub-queries. Responsive to a determination that one or more sub set relationships are shared between the at least two or more sub-queries, the tool determines an order class for the at least two or more sub-queries based on the one or more sub set relationships. The tool determines an access path for the query. The tool executes the access path during run-time for data accessing.
    Type: Grant
    Filed: March 11, 2016
    Date of Patent: February 14, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shuo Li, Meng Wan, Xiaobo Wang, Xin Ying Yang
  • Patent number: 9569470
    Abstract: Managing a sharing relationship of tables. A super schema is formed incorporating a plurality of tables in a database according to data types of respective columns of the plurality of tables. A free storage capacity of the super schema is evaluated according to holes not occupied by any table in the super schema. A sharing relationship of the plurality of tables is determined according to the free storage capacity of the super schema. The sharing relationship of a large number of tables in a database can be managed effectively, so as to design or optimize the sharing solution among the tables.
    Type: Grant
    Filed: August 13, 2014
    Date of Patent: February 14, 2017
    Assignee: International Business Machines Corporation
    Inventors: Shuo Li, Xin Ying Yang, Mai Zeng, Xiang Zhou