Patents by Inventor Bolin Ding

Bolin Ding has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230133683
    Abstract: An interactive feature generation system may receive a plurality of distinct features that are associated with an application, and associate a plurality of nodes in a feature graph of a first order to the plurality of distinct features. The interactive feature generation system may iteratively generate interactive features of a higher order from interactive features of a lower order to form a plurality of feature graphs of different orders. The interactive feature generation system may then propagate respective interactive features of the plurality of feature graphs of the different orders to a neural network to determine a number of interactive features of one or more orders, the determined number of interactive features of the one or more orders being used for training a predictive model to make inferences for the application.
    Type: Application
    Filed: July 14, 2020
    Publication date: May 4, 2023
    Inventors: Yuexiang XIE, Zhen Wang, Bolin Ding, Yaliang Li, Jun Huang, Weidan Kong, Jingren Zhou, Wei Lin
  • Publication number: 20210357752
    Abstract: A method, an apparatus, a storage medium, and a processor for model processing are disclosed. The method includes: obtaining an original language model; determining a task that needs to be processed by the original language model; and converting the original language model based on features of the task to obtain a target language model for processing the task. The present disclosure solves the technical problem of the difficulty of effectively using a model.
    Type: Application
    Filed: May 6, 2021
    Publication date: November 18, 2021
    Inventors: Daoyuan Chen, Yaliang Li, Minghui Qiu, Zhen Wang, Bofang Li, Bolin Ding, Hongbo Deng, Jun Huang, Wei Lin, Jingren Zhou
  • Patent number: 10977384
    Abstract: This document relates to hardware protection of differential privacy techniques. One example obtains multiple instances of encrypted telemetry data within a secure enclave and processes the encrypted telemetry data to obtain multiple instances of unencrypted telemetry data. The example also processes, within the secure enclave, the multiple instances of unencrypted telemetry data to obtain a perturbed aggregate. The example also releases the perturbed aggregate from the secure enclave.
    Type: Grant
    Filed: November 16, 2017
    Date of Patent: April 13, 2021
    Assignee: Microsoft Technoogy Licensing, LLC
    Inventors: Josh D. Benaloh, Harsha P. Nori, Janardhan Dattatreya Kulkarni, Joshua Stanley Allen, Jacob Rubin Lorch, Melissa Erin Chase, Olga Ohrimenko, Sergey Yekhanin, Srinath T. V. Setty, Bolin Ding
  • Patent number: 10949438
    Abstract: Methods, systems, and computer programs are presented for obtaining histogram data from a database utilizing an interface with histogram-related options. One method includes an operation for providing, by a server, an application programming interface (API), to access the database, which includes a histogram request, to obtain histogram data from the database, with histogram options. The server receives, from a client device, a first histogram request including histogram options. Additionally, the method includes an operation for identifying bins for the histogram based on the one or more histogram options. For each bin, the server accesses the database to obtain data for each bin. The server returns, to the client device, the histogram data for the histogram as a table containing bin values for all the bins, where the client device is configured to present the histogram to a user based on the histogram data.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: March 16, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bolin Ding, Chi Wang, Danyel A. Fisher, Robyn Dominik Moritz
  • Patent number: 10902149
    Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.
    Type: Grant
    Filed: March 22, 2018
    Date of Patent: January 26, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen
  • Patent number: 10824592
    Abstract: Generally discussed herein are devices, systems, and methods for database management. A method may include determining a first hyperloglog (HLL) sketch of a first column of data, determining a second HLL sketch of a second column of data, estimating an inclusion coefficient based on the first and second HLL sketches, and performing operations on the first column of data or the second column of data in response to determining the inclusion coefficient is greater than, or equal to, a specified threshold.
    Type: Grant
    Filed: June 14, 2018
    Date of Patent: November 3, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Azade Nazi, Bolin Ding, Vivek R Narasayya, Surajit Chaudhuri
  • Patent number: 10776242
    Abstract: A facility for reporting on original values on behalf of each of a plurality of users—each falling within a domain partitioned into a plurality of segments of uniform size—is provided. For each of the plurality of users, the facility (a) randomly selects an upward-rounding window size that is smaller than the segment size; (b) for each source value in a domain, randomly determines a mapping of each segment of the domain to a segment identifier value; (c) determines an original value for the user; (d) adds the upward-rounding window size to the user's original value to obtain a window-augmented original value; (e) identifies a segment containing the window-augmented original value; (f) identifies a segment identifier value mapped-to from the identified segment using the determined mapping; and (g) transmits a reporting communication on the user's behalf reporting a value based on the identified segment identifier value.
    Type: Grant
    Filed: July 31, 2017
    Date of Patent: September 15, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Janardhan Dattatreya Kulkarni, Bolin Ding, Sergey Yekhanin, Joshua Allen, Alexander Norton Meade
  • Patent number: 10769123
    Abstract: Provided are methods and systems for generating physical database design tuning recommendations. Given a workload, the system analyzes the workload to identify and recommend a set of rowstore and columnstore indexes optimal for the performance of the workload. The system is designed to estimate the size of the columnstore index (at the granularity of each column) without actually building the index, estimate the improvement in query performance that each columnstore index would result in when built, and automatically derive the workload used for the physical design tuning task by analyzing stored query execution history data. This automatic workload derivation is orthogonal to columnstores and can be used even when columnstore indexes are not being used.
    Type: Grant
    Filed: June 28, 2017
    Date of Patent: September 8, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Sudipto Das, Bolin Ding, Vivek R. Narasayya, Manoj A. Syamala, Jingjing Wang, Gaoxiang Xu
  • Patent number: 10740328
    Abstract: A processing unit can determine a first subset of a data set including data records selected based on measure values thereof. The processing unit can determine an index mapping a predicate to data records associated with that predicate and approximation values of the records. The processing unit can process a query against the first subset to provide a first result and a first accuracy value, determine that the first accuracy value does not satisfy an accuracy criterion, and process the query against the index. In some examples, the processing unit can process the query against a second subset including data records satisfying a predetermined predicate. In some examples, the processing unit can receive data records and determine the first subset. Data records can include respective measure values. Data records with higher measure values can occur in the first subset more frequently than data records with lower measure values.
    Type: Grant
    Filed: June 24, 2016
    Date of Patent: August 11, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bolin Ding, Silu Huang, Chi Wang, Kaushik Chakrabarti, Surajit Chaudhuri
  • Publication number: 20200065712
    Abstract: In automated machine learning, an approximate best configuration can be selected among multiple candidate machine-learning configurations by progressively sampling training and test datasets for the iterative training and testing of the configurations while progressively pruning the set of candidate configurations based on associated estimated confidence intervals for their respective performance.
    Type: Application
    Filed: August 23, 2018
    Publication date: February 27, 2020
    Inventors: Chi Wang, Silu Huang, Surajit Chaudhuri, Bolin Ding
  • Patent number: 10552435
    Abstract: Techniques for presenting data visualizations are described. Consistent with some embodiments of the present invention, a data visualization system includes an approximate query engine to derive approximate query results and present approximate data visualizations, in interactive time. The data visualization system also includes a precise query engine, thereby enabling a data analyst to selectively request that a precise query result be generated for a specific query, so that the data analyst can confirm a data observation made when viewing the approximate visualization for that specific query. The precise visualization is presented in a manner that allows the data analyst to view, and thus confirm or disprove, a data observation recorded when viewing the approximate visualization, and also compare the precise visualization with the previously generated and presented approximate visualization.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: February 4, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Chi Wang, Bolin Ding, Danyel A. Fisher, Robyn Dominik Moritz
  • Publication number: 20190384830
    Abstract: Generally discussed herein are devices, systems, and methods for database management. A method may include determining a first hyperloglog (HLL) sketch of a first column of data, determining a second HLL sketch of a second column of data, estimating an inclusion coefficient based on the first and second HLL sketches, and performing operations on the first column of data or the second column of data in response to determining the inclusion coefficient is greater than, or equal to, a specified threshold.
    Type: Application
    Filed: June 14, 2018
    Publication date: December 19, 2019
    Inventors: Azade Nazi, Bolin Ding, Vivek R. Narasayya, Surajit Chaudhuri
  • Patent number: 10496643
    Abstract: One or more approximations of query output in a data analytics platform are controlled. The one or more approximations are controlled by generating values of error metrics associated with placements of samplers in one or more query execution plans associated with the query, and injecting a plurality of samplers into the query execution plans, using the determined values of the error metrics, in lieu of storing samples of input to the query prior to execution of the query.
    Type: Grant
    Filed: February 8, 2016
    Date of Patent: December 3, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Srikanth Kandula, Surajit Chaudhuri, Bolin Ding, Anil Atmanand Shanbhag, Aleksandar Vitorović, Matthaios Olma, Robert Grandl
  • Patent number: 10453228
    Abstract: A visualization system suitable for visualizing differences between data sets is provided. The system determines a first data set and a second data set along with a difference between the first and second data set. A value corresponding to each unit of display of the second data set is plotted on at least one graph. The visualization system determines, for each unit of display of the second data set, whether the first data set has a value for the unit of display. In response to the first data set having the value, the visualization system plots a difference of the value of the first data set on the at least one graph. In response to the first data set not having the value, the visualization system plots a missing structure element on the at least one graph. The at least one graph is displayed to a user.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: October 22, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bolin Ding, Chi Wang, Danyel A. Fisher, Robyn Dominik Moritz
  • Publication number: 20190236306
    Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.
    Type: Application
    Filed: March 22, 2018
    Publication date: August 1, 2019
    Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen
  • Publication number: 20190147188
    Abstract: This document relates to hardware protection of differential privacy techniques. One example obtains multiple instances of encrypted telemetry data within a secure enclave and processes the encrypted telemetry data to obtain multiple instances of unencrypted telemetry data. The example also processes, within the secure enclave, the multiple instances of unencrypted telemetry data to obtain a perturbed aggregate. The example also releases the perturbed aggregate from the secure enclave.
    Type: Application
    Filed: November 16, 2017
    Publication date: May 16, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Josh D. BENALOH, Harsha P. NORI, Janardhan Dattatreya KULKARNI, Joshua Stanley ALLEN, Jacob Rubin LORCH, Melissa Erin CHASE, Olga OHRIMENKO, Sergey YEKHANIN, Srinath T.V. SETTY, Bolin DING
  • Publication number: 20180260451
    Abstract: Methods, systems, and computer programs are presented for obtaining histogram data from a database utilizing an interface with histogram-related options. One method includes an operation for providing, by a server, an application programming interface (API), to access the database, which includes a histogram request, to obtain histogram data from the database, with histogram options. The server receives, from a client device, a first histogram request including histogram options. Additionally, the method includes an operation for identifying bins for the histogram based on the one or more histogram options. For each bin, the server accesses the database to obtain data for each bin. The server returns, to the client device, the histogram data for the histogram as a table containing bin values for all the bins, where the client device is configured to present the histogram to a user based on the histogram data.
    Type: Application
    Filed: March 8, 2017
    Publication date: September 13, 2018
    Inventors: Bolin Ding, Chi Wang, Danyel A. Fisher, Robyn Dominik Moritz
  • Publication number: 20180260450
    Abstract: Techniques for presenting data visualizations are described. Consistent with some embodiments of the present invention, a data visualization system includes an approximate query engine to derive approximate query results and present approximate data visualizations, in interactive time. The data visualization system also includes a precise query engine, thereby enabling a data analyst to selectively request that a precise query result be generated for a specific query, so that the data analyst can confirm a data observation made when viewing the approximate visualization for that specific query. The precise visualization is presented in a manner that allows the data analyst to view, and thus confirm or disprove, a data observation recorded when viewing the approximate visualization, and also compare the precise visualization with the previously generated and presented approximate visualization.
    Type: Application
    Filed: March 8, 2017
    Publication date: September 13, 2018
    Inventors: Chi Wang, Bolin Ding, Danyel A. Fisher, Robyn Dominik Moritz
  • Publication number: 20180260985
    Abstract: A visualization system suitable for visualizing differences between data sets is provided. The system determines a first data set and a second data set along with a difference between the first and second data set. A value corresponding to each unit of display of the second data set is plotted on at least one graph. The visualization system determines, for each unit of display of the second data set, whether the first data set has a value for the unit of display. In response to the first data set having the value, the visualization system plots a difference of the value of the first data set on the at least one graph. In response to the first data set not having the value, the visualization system plots a missing structure element on the at least one graph. The at least one graph is displayed to a user.
    Type: Application
    Filed: March 8, 2017
    Publication date: September 13, 2018
    Inventors: Bolin Ding, Chi Wang, Danyel A. Fisher, Robyn Dominik Moritz
  • Publication number: 20180189164
    Abstract: A facility for reporting on original values on behalf of each of a plurality of users—each falling within a domain partitioned into a plurality of segments of uniform size—is provided. For each of the plurality of users, the facility (a) randomly selects an upward-rounding window size that is smaller than the segment size; (b) for each source value in a domain, randomly determines a mapping of each segment of the domain to a segment identifier value; (c) determines an original value for the user; (d) adds the upward-rounding window size to the user's original value to obtain a window-augmented original value; (e) identifies a segment containing the window-augmented original value; (f) identifies a segment identifier value mapped-to from the identified segment using the determined mapping; and (g) transmits a reporting communication on the user's behalf reporting a value based on the identified segment identifier value.
    Type: Application
    Filed: July 31, 2017
    Publication date: July 5, 2018
    Inventors: Janardhan Dattatreya KULKARNI, Bolin DING, Sergey YEKHANIN, Joshua ALLEN, Alexander Norton MEADE