Patents by Inventor Bee-Chung Chen

Bee-Chung Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11704566
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a training dataset containing a first set of records associated with a first set of identifier (ID) values and an evaluation dataset containing a second set of records associated with a second set of ID values. Next, the system selects a random subset of ID values from the second set of ID values. The system then generates a sampled evaluation dataset comprising a first subset of records associated with the random subset of ID values in the second set of records. The system also generates a sampled training dataset comprising a second subset of records associated with the random subset of ID values in the first set of records. Finally, the system outputs the sampled training dataset and the sampled evaluation dataset for use in training and evaluating a machine learning model.
    Type: Grant
    Filed: June 20, 2019
    Date of Patent: July 18, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yiming Ma, Menglin L. Brown, Bee-Chung Chen, Sheng Wu, Jun Jia, Bo Long
  • Patent number: 11704370
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a feature configuration for a feature. Next, the system obtains, from the feature configuration, an anchor containing metadata for accessing the feature in an environment. The system then uses one or more attributes of the anchor to retrieve one or more feature values of the feature from the environment. Finally, the system provides the one or more feature values for use with one or more machine-learning models.
    Type: Grant
    Filed: April 20, 2018
    Date of Patent: July 18, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Paul T. Ogilvie, Bee-Chung Chen, Shaunak Chatterjee, Priyanka Gariba, Ke Wu, Grace W. Tang, Yangchun Luo, Boyi Chen, Amit Yadav, Ruoyang Wang, Divya Gadde, Wenxuan Gao, Amit Chandak, Varnit Agnihotri, Wei Zhuang, Joel D. Young, Weidong Zhang
  • Publication number: 20230124258
    Abstract: Methods, systems, and computer programs are presented for determining parameters of neural networks and selecting embedding dimensions for the feature fields. One method includes an operation for initializing parameters of a neural network and weights for embedding sizes for each feature associated with the neural network. The parameters of the neural network and the weights are iteratively optimized. Each optimization iteration comprises training the neural network with current parameters of the neural network to optimize a value of the weights, and training the neural network with current values of the weights to optimize the parameters of the neural network. Further, the method includes operations for selecting embedding sizes for the features based on the optimized values of the weights, and for training the neural network based on the selected embedding sizes for the features to obtain an estimator model. A prediction is generated utilizing the estimator model.
    Type: Application
    Filed: October 19, 2021
    Publication date: April 20, 2023
    Inventors: Xiangyu Zhao, Sida Wang, Huiji Gao, Bo Long, Bee-Chung Chen, Weiwei Guo, Jun Shi
  • Patent number: 11481627
    Abstract: Computer-implemented techniques for learning composite machine learned models are disclosed. Benefits to implementors of the disclosed techniques include allowing non-machine learning experts to use the techniques for learning a composite machine learned model based on a learning dataset, reducing or eliminating the explorative trial and error process of manually tuning architectural parameters and hyperparameters, and reducing the computing resource requirements and model learning time for learning composite machine learned models. The techniques improve the operation of distributed learning computing systems by reducing or eliminating straggler effects and by reducing or minimizing synchronization latency when executing a composite model search algorithm for learning a composite machine learned model.
    Type: Grant
    Filed: October 30, 2019
    Date of Patent: October 25, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yuwei Qiu, Chengming Jiang, Huiji Gao, Bee-Chung Chen, Bo Long
  • Publication number: 20220180241
    Abstract: Embodiments of the disclosed technologies provide tree-based transfer learning of hyperparameters of a machine learning model or tunable parameters of a black box system. A similar reference task tree is selected from a set of reference task trees. Data is transferred from the similar reference task tree to a target task tree.
    Type: Application
    Filed: December 4, 2020
    Publication date: June 9, 2022
    Inventors: QINGQUAN SONG, CHENGMING JIANG, YUNBO OUYANG, JUN JIA, HUIJI GAO, BO LONG, BEE-CHUNG CHEN, XIA HU
  • Publication number: 20220100756
    Abstract: The disclosed technologies include a navigation agent for a search interface. In an embodiment, the navigation agent uses reinforcement learning to dynamically generate and select navigation options for presentation to a user during a search session. The navigation agent selects navigation options based on reward scores, which are computed using implicit and/or explicit user feedback received in response to presentations of navigation options.
    Type: Application
    Filed: September 30, 2020
    Publication date: March 31, 2022
    Inventors: PRAVEEN KUMAR BODIGUTLA, BEE-CHUNG CHEN, BO LONG, MIAO CHENG, QIANG XIAO, TANVI SUDARSHAN MOTWANI, WENXIANG CHEN, SAI KRISHNA BOLLAM
  • Patent number: 11106982
    Abstract: In an example embodiment, a warm-start training solution is used to dramatically reduce the computational resources needed to train when retraining a generalized additive mixed-effect (GAME) model. The problem of retraining time is particularly applicable to GAME models, since these models take much longer to train as the data grows. In the past, the strategy to reduce computational resources during retraining was to use less training data, but this affects the model quality, especially for GAME models, which rely on fine-grained sub-models at, for example, member or item levels. The present solution addresses the computational resources issues without sacrificing GAME model accuracy.
    Type: Grant
    Filed: August 22, 2018
    Date of Patent: August 31, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yiming Ma, Alex Shelkovnykov, Josh Fleming, Bee-Chung Chen, Bo Long
  • Publication number: 20210133555
    Abstract: Computer-implemented techniques for learning composite machine learned models are disclosed. Benefits to implementors of the disclosed techniques include allowing non-machine learning experts to use the techniques for learning a composite machine learned model based on a learning dataset, reducing or eliminating the explorative trial and error process of manually tuning architectural parameters and hyperparameters, and reducing the computing resource requirements and model learning time for learning composite machine learned models. The techniques improve the operation of distributed learning computing systems by reducing or eliminating straggler effects and by reducing or minimizing synchronization latency when executing a composite model search algorithm for learning a composite machine learned model.
    Type: Application
    Filed: October 30, 2019
    Publication date: May 6, 2021
    Inventors: Yuwei Qiu, Chengming Jiang, Huiji Gao, Bee-Chung Chen, Bo Long
  • Publication number: 20200401948
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a training dataset containing a first set of records associated with a first set of identifier (ID) values and an evaluation dataset containing a second set of records associated with a second set of ID values. Next, the system selects a random subset of ID values from the second set of ID values. The system then generates a sampled evaluation dataset comprising a first subset of records associated with the random subset of ID values in the second set of records. The system also generates a sampled training dataset comprising a second subset of records associated with the random subset of ID values in the first set of records. Finally, the system outputs the sampled training dataset and the sampled evaluation dataset for use in training and evaluating a machine learning model.
    Type: Application
    Filed: June 20, 2019
    Publication date: December 24, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yiming Ma, Menglin L. Brown, Bee-Chung Chen, Sheng Wu, Jun Jia, Bo Long
  • Publication number: 20200311613
    Abstract: Herein are techniques for configuring, integrating, and operating trainable tensor transformers that each encapsulate an ensemble of trainable machine learning (ML) models. In an embodiment, a computer-implemented trainable tensor transformer uses underlying ML models and additional mechanisms to assemble and convert data tensors as needed to generate output records based on input records and inferencing. The transformer processes each input record as follows. Input tensors of the input record are converted into converted tensors. Each converted tensor represents a respective feature of many features that are capable of being processed by the underlying trainable models. The trainable models are applied to respective subsets of converted tensors to generate an inference for the input record. The inference is converted into a prediction tensor. The prediction tensor and input tensors are stored as output tensors of a respective output record for the input record.
    Type: Application
    Filed: March 29, 2019
    Publication date: October 1, 2020
    Inventors: Yiming Ma, Jun Jia, Yi Wu, Xuhong Zhang, Leon Gao, Baolei Li, Bee-Chung Chen, Bo Long
  • Patent number: 10726093
    Abstract: A system and method for intermediate landing page rerouting are provided. In example embodiments, determine whether a webpage associated with a hyperlink has corresponding social network activities. Extract content from the webpage determined to have corresponding social network activities. In response to a selection of the hyperlink, reroute a web browser to an intermediate landing page. Cause presentation, at a user interface, of the extracted content and the corresponding social network activities.
    Type: Grant
    Filed: June 30, 2016
    Date of Patent: July 28, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Shaunak Chatterjee, Ravi Kiran Holur Vijay, Romer E. Rosales, Mohamed Gamal Mohamed Mahmoud, Zheng Li, Kwei-you Tao, Bee-Chung Chen, Deepak Agarwal
  • Patent number: 10673965
    Abstract: A system and method of adjusting an affinity score between an entity pair in a social network is disclosed. The method may include determining, with a processor, whether a first member of the entity pair is a heavy user member. The method further includes if the first member is the heavy user member, determining, with the processor, an affinity adjustment factor between the first member and the second member, and adjusting, with the processor, the affinity score between the first member and the second member of the entity pair in accordance with the adjustment factor to determine an adjusted affinity score. The method may include determining, with the processor, whether a number of interactions on content items indicates that the first member is the heavy user member. The second member is associated with a content item that is being considered for display to the first member.
    Type: Grant
    Filed: August 28, 2015
    Date of Patent: June 2, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Mikhail Obukhov, Qi He, Bee-Chung Chen, Deepak Agarwal
  • Publication number: 20200065678
    Abstract: In an example embodiment, a warm-start training solution is used to dramatically reduce the computational resources needed to train when retraining a generalized additive mixed-effect (GAME) model. The problem of retraining time is particularly applicable to GAME models, since these models take much longer to train as the data grows. In the past, the strategy to reduce computational resources during retraining was to use less training data, but this affects the model quality, especially for GAME models, which rely on fine-grained sub-models at, for example, member or item levels. The present solution addresses the computational resources issues without sacrificing GAME model accuracy.
    Type: Application
    Filed: August 22, 2018
    Publication date: February 27, 2020
    Inventors: Yiming Ma, Alex Shelkovnykov, Josh Fleming, Bee-Chung Chen, Bo Long
  • Publication number: 20190332569
    Abstract: In an example embodiment, knowledge discovery using deep learning is combined with the scalability and personalization capabilities of generalized additive mixed effect (GAME) modeling. Specifically, features learned in a last fully connected layer of a deep learning model may be used to augment features used in a fixed or random effects training portion of a GAME model.
    Type: Application
    Filed: April 27, 2018
    Publication date: October 31, 2019
    Inventors: Yiming Ma, Wei Lu, Jun Jia, Bee-Chung Chen, Bo Long
  • Publication number: 20190325258
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains feature configurations for a set of features and a command for inspecting a data set that is produced using the feature configurations. Next, the system obtains, from the feature configurations, one or more anchors containing metadata for accessing the set of features in an environment and a join configuration for joining a feature with one or more additional features. The system then uses the anchors to retrieve feature values of the features and zips the feature values according to the join configuration without matching entity keys associated with the feature values. Finally, the system outputs the zipped feature values in response to the command.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Ke Wu, Priyanka Gariba, Grace W. Tang, Yangchun Luo, Songxiang Gu, Bee-Chung Chen
  • Publication number: 20190325085
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a feature configuration for a feature. Next, the system obtains, from the feature configuration, an anchor containing metadata for accessing the feature in an environment. The system then uses one or more attributes of the anchor to retrieve one or more feature values of the feature from the environment. Finally, the system provides the one or more feature values for use with one or more machine-learning models.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Paul T. Ogilvie, Bee-Chung Chen, Shaunak Chatterjee, Priyanka Gariba, Ke Wu, Grace W. Tang, Yangchun Luo, Boyi Chen, Amit Yadav, Ruoyang Wang, Divya Gadde, Wenxuan Gao, Amit Chandak, Varnit Agnihotri, Wei Zhuang, Joel D. Young, Weidong Zhang
  • Publication number: 20190325262
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains feature configurations for a set of features. Next, the system obtains, from the feature configurations, an anchor containing metadata for accessing a first feature in an environment and a feature derivation for generating a second feature from the first feature. The system then uses the anchor to retrieve feature values of the first feature from the environment and uses the feature derivation to generate additional feature values of the second feature from the feature values of the first feature. Finally, the system provides the additional feature values for use with one or more machine learning models.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Paul T. Ogilvie, Bee-Chung Chen, Ke Wu, Grace W. Tang, Priyanka Gariba, Yangchun Luo, Boyi Chen, Jian Qiao, Benjamin Hoan Le, Joel D. Young, Wei Zhuang
  • Publication number: 20190325351
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system selects a set of entity keys associated with reference feature values used with one or more machine learning models, wherein the reference feature values are generated in a first environment. Next, the system matches the set of entity keys to feature values from a second environment. The system then compares the feature values and the reference feature values to assess a consistency of a feature across the first and second environments. Finally, the system outputs a result of the assessed consistency for use in managing the feature in the first and second environments.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Ruoyang Wang, Ke Wu, Bee-Chung Chen, Priyanka Gariba
  • Publication number: 20190324767
    Abstract: The disclosed embodiments provide a system for sharing features in a feature management framework. During operation, the system creates a repository of feature configurations for a set of features that are accessed across multiple environments. Next, the system identifies dependencies of the repository. The system then copies shared feature configurations from other repositories represented by the dependencies. Finally, the system combines the shared feature configurations with existing feature configurations in the repository for use in retrieving feature values for one or more machine learning models.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: David J. Stein, Lei Li, Ke Wu, Bee-Chung Chen, Priyanka Gariba
  • Patent number: 10380624
    Abstract: This disclosure relates to systems and methods that include a member activity database including data indicative of interactions with content items on a social network by a population of users of the social network. A processor is configured to obtain an optimization criterion based on at least two constraints related to a performance of the social network, obtain, for a subset of the population of users, at least some of the data indicative of interactions with content items from the member activity database, determine, based on the at least some of the data as obtained, an operating condition for the social network that is estimated to meet the optimization criterion, and provide, to at least some of the user devices via the network interface, the social network based, at least in part, on the operating condition.
    Type: Grant
    Filed: December 30, 2014
    Date of Patent: August 13, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Deepak Agarwal, Shaunak Chatterjee, Liang Zhang, Bee-Chung Chen, Yang Yang