Patents by Inventor Chi-Yi Kuan
Chi-Yi Kuan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11463461Abstract: Techniques for performing unequal sampling are provided. In one technique, multiple scores generated by a prediction model are identified, each score corresponding to a different entity of multiple entities. Multiple buckets are determined, each bucket corresponding to a different range of scores. Each entity is assigned to a bucket based on the score corresponding to the entity. A probability distribution function is generated based on the scores and a number of scores belonging to each bucket. For each entity, a probability of sampling the entity is determined based on the probability distribution function and a score corresponding to the entity. A subset of the entities are sampled based on the probability determined for each entity.Type: GrantFiled: May 29, 2019Date of Patent: October 4, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Wenqian Li, Zhou Jin, Rui Zhao, Xiaosu Huang, Chi-Yi Kuan
-
Patent number: 11238087Abstract: A machine may be configured to generate an enhanced user interface for displaying social analytics based on viral mentions and threading. For example, the machine accesses a plurality of items of digital content. The machine extracts, for each of the plurality of items of digital content, a title that describes a particular item of digital content. The machine generates a group of items of digital content based on the extracted titles associated with the plurality of items of digital content. The machine identifies, from the group of items of digital content, an original item of digital content and one or more subsequent items of digital content. The machine determines a strength value associated with the original item of digital content. The machine generates and causes a display of an enhanced user interface that displays the title and the strength value associated with the original item of digital content.Type: GrantFiled: December 21, 2017Date of Patent: February 1, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Weiqin Ma, Weidong Zhang, Yongzheng Zhang, Chi-Yi Kuan
-
Patent number: 11238352Abstract: Techniques are provided for predicting talent flow to and/or from a geographical region. In one technique, multiple entity profiles are stored and analyzed to generate training data that is labeled indicating whether a corresponding entity has moved to or moved from a region. A machine-learned prediction model is generated or trained based on the training data. Using the machine-learned prediction model, a prediction is made whether, for each entity corresponding to another entity profile, that entity will move to or move from a particular geographic region. Based on multiple predictions, a number of entities that are predicted to move to or move from the particular geographic region is determined. Talent flow data that is based on the number of entities is presented on a computer display.Type: GrantFiled: March 30, 2018Date of Patent: February 1, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Chi-Yi Kuan, Shen Huang, Yu Wang, Yongzheng Zhang, Paul Ko, Shady Elasra, Fanbin Bu
-
Patent number: 11036700Abstract: Methods, systems, and computer programs are presented for selecting features for a machine-learning model configured to detect anomalies in the evolution of data over time. One method includes an operation for identifying one or more key fields and value fields from the fields in a relational database. The method also includes grouping data of the value fields based on values of the one or more key fields and calculating one or more statistical values for each group of data of the value fields. The method further includes operations for monitoring an evolution of the one or more statistical values over time, and for selecting, based on the evolution of the one or more statistical values over time, features to be used by a machine-learning model to detect anomalies in content of the relational database over time. The method also includes executing the machine-learning model to detect the anomalies.Type: GrantFiled: December 31, 2018Date of Patent: June 15, 2021Assignee: Microsoft Technology Licensing, LLCInventors: Shen Huang, Huan Van Hoang, Yongzheng Zhang, Chi-Yi Kuan
-
Patent number: 11037251Abstract: A system, a machine-readable storage medium storing instructions, and a computer-implemented method are described herein are directed to a Key Feature Engine receives a request for a desired number of key features to be identified from a data set in at least a portion of a database. The Key Feature Engine executes instances of multiple types of machine learning data models on the data set to calculate respective regression coefficients, in each machine learning data model instance, for each feature in a plurality of features defined by a plurality of data categories of a social network service. The Key Feature Engine identifies at least one key feature, of one or more instances of the multiple types of machine learning data models, based on a value of a corresponding regression coefficient.Type: GrantFiled: March 1, 2017Date of Patent: June 15, 2021Assignee: Microsoft Technology Licensing, LLCInventors: Burcu Baran, Chi-Yi Kuan, Huan Van Hoang, Yue Li, Yan Liu, Vibhu Prakash Saxena
-
Publication number: 20210097425Abstract: The disclosed embodiments provide a system for processing data. During operation, the system determines output of a machine learning model, which includes a score generated by the model based on features inputted into the model and feature importance metrics representing effects of the features on the score. Next, the system maps the features to elements in a feature hierarchy that groups the features under a first level of parent features. The system also generates a ranking of the first level of parent features based on the feature importance metrics. The system then combines, based on the ranking, feature values of the mapped features with a set of insight templates to produce a list of narrative insights, wherein each narrative insight includes a natural language description of a factor that contributes to the model's output. Finally, the system outputs the list of narrative insights in a user interface.Type: ApplicationFiled: September 26, 2019Publication date: April 1, 2021Inventors: Jilei Yang, Yongzheng Zhang, Shen Huang, Burcu Baran, Chi-Yi Kuan
-
Publication number: 20200387517Abstract: Disclosed are systems, methods, and non-transitory computer-readable media for search result page ranking optimization. When generating a search result page, a search result optimization system generates links to other related search result pages and adds them to the generates search result page. Adding the generates links enables internet search engines to discover additional search results pages, as well as increases their search ranking. The search result optimization system generates the links based on entity values extracted from the search results page. For example, the search result optimization system extracts entity values based on filter categories available for the search results included in the search result page as well as historical search behavior indicating a number of times that users have selected to filter search results based on the identified filter categories.Type: ApplicationFiled: June 4, 2019Publication date: December 10, 2020Inventors: Shen Huang, Huan Van Hoang, Yongzheng Zhang, Chi-Yi Kuan
-
Publication number: 20200382530Abstract: Techniques for performing unequal sampling are provided. In one technique, multiple scores generated by a prediction model are identified, each score corresponding to a different entity of multiple entities. Multiple buckets are determined, each bucket corresponding to a different range of scores. Each entity is assigned to a bucket based on the score corresponding to the entity. A probability distribution function is generated based on the scores and a number of scores belonging to each bucket. For each entity, a probability of sampling the entity is determined based on the probability distribution function and a score corresponding to the entity. A subset of the entities are sampled based on the probability determined for each entity.Type: ApplicationFiled: May 29, 2019Publication date: December 3, 2020Inventors: Wenqian Li, Zhou Jin, Rui Zhao, Xiaosu Huang, Chi-Yi Kuan
-
Patent number: 10839406Abstract: The disclosed embodiments provide a system for performing A/B testing for search engine optimization (SEO). During operation, the system divides a set of web pages into a control group, an inbound treatment group, an outbound treatment group, and a full treatment group. Next, the system generates a first set of links from the outbound treatment group to the inbound treatment group and a second set of links within the full treatment group. The system then applies an A/B test to pairs of groups selected from the control group, the inbound treatment group, the outbound treatment group, and the full treatment group. Finally, the system outputs a result of the A/B test for use in assessing an effect of the first and second sets of links on search engine results associated with the set of web pages.Type: GrantFiled: June 28, 2018Date of Patent: November 17, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Quan Wang, Huan V. Hoang, Shen Huang, Yongzheng Zhang, Chi-Yi Kuan
-
Patent number: 10733572Abstract: Techniques for delaying the transmission of a message to one or more recipients using an alert in order to provide data protection and security with respect to data included in the message are disclosed herein. In some embodiments, a computer-implemented method comprises: receiving a request to transmit a message from a computing device of a user to a recipient, the request comprising content of the message; detecting an issue with the request using at least one classifier to classify the request as having the issue; generating an alert based on the detecting of the issue; and prior to transmitting the message to a destination associated with the recipient, causing the generated alert to be displayed on the computing device of the user, the alert indicating the issue with the message.Type: GrantFiled: December 22, 2017Date of Patent: August 4, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Shen Huang, Yongzheng Zhang, Chi-Yi Kuan
-
Patent number: 10733221Abstract: A system and method for identifying trending topics in a document corpus are provided. First, multiple topics are identified, some of which topics may be filtered or removed based on co-occurrence. Then, for each remaining topic, a frequency of the topic in the document corpus is determined, one or more frequencies of the topic in one or more other document corpora are determined, a trending score of the topic is generated based on the determined frequencies. Lastly, the remaining topics are ranked based on the generated trending scores.Type: GrantFiled: March 30, 2016Date of Patent: August 4, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Yongzheng Zhang, Rui Zhao, Chi-Yi Kuan, Yi Zheng
-
Publication number: 20200210390Abstract: Methods, systems, and computer programs are presented for selecting features for a machine-learning model configured to detect anomalies in the evolution of data over time. One method includes an operation for identifying one or more key fields and value fields from the fields in a relational database. The method also includes grouping data of the value fields based on values of the one or more key fields and calculating one or more statistical values for each group of data of the value fields. The method further includes operations for monitoring an evolution of the one or more statistical values over time, and for selecting, based on the evolution of the one or more statistical values over time, features to be used by a machine-learning model to detect anomalies in content of the relational database over time. The method also includes executing the machine-learning model to detect the anomalies.Type: ApplicationFiled: December 31, 2018Publication date: July 2, 2020Inventors: Shen Huang, Huan Van Hoang, Yongzheng Zhang, Chi-Yi Kuan
-
Patent number: 10678829Abstract: Among other things, embodiments of the present disclosure discussed herein help identify trending topics and generate customized data feeds that present trending topics to a user based on information in the user's social network profile. In some embodiments, users may select topics of interest to the user and the system can identity and present trending articles in the selected topic to the user via the user's newsfeed.Type: GrantFiled: April 11, 2017Date of Patent: June 9, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Sang Hyun Park, Sui Yan, Yongzheng Zhang, Ho Joon Park, Weidong Zhang, Chi-Yi Kuan, Yi Zheng
-
Publication number: 20200089786Abstract: Techniques for automatically creating geographic region groups are provided. Attribute data about a plurality of regions is stored. Based on the attribute data, each region is classified as belonging to a tier of multiple tiers. A first set of region groups is generated, where each region group includes at least two regions assigned to different tiers. For each region group, group attribute data for that region group is generated. A comparison of first group attribute data of a first region group is performed with group attribute data of each other region group. Based on results of the comparison, first arrangement data that associates a second region group with the first region group is stored.Type: ApplicationFiled: September 19, 2018Publication date: March 19, 2020Inventors: Xiang Cheng, Chen Wang, Michael J. Tambe, Megh Mehta, Xiaojing Dong, Chi-Yi Kuan
-
Publication number: 20200005335Abstract: The disclosed embodiments provide a system for performing A/B testing for search engine optimization (SEO). During operation, the system divides a set of web pages into a control group, an inbound treatment group, an outbound treatment group, and a full treatment group. Next, the system generates a first set of links from the outbound treatment group to the inbound treatment group and a second set of links within the full treatment group. The system then applies an A/B test to pairs of groups selected from the control group, the inbound treatment group, the outbound treatment group, and the full treatment group. Finally, the system outputs a result of the A/B test for use in assessing an effect of the first and second sets of links on search engine results associated with the set of web pages.Type: ApplicationFiled: June 28, 2018Publication date: January 2, 2020Applicant: Microsoft Technology Licensing, LLCInventors: Quan Wang, Huan V. Hoang, Shen Huang, Yongzheng Zhang, Chi-Yi Kuan
-
Publication number: 20200005244Abstract: Techniques for discovering related organizations through different types of online connections are provided. In one technique, connection data is stored that identifies, for each user in a first set of users, one or more other users with which that user has a connection. Job change data is stored that identifies, for each user of a second set of users, multiple organizations for which that user has worked or had sought an employment relationship. Based on the connection data, a number of connections between employees of a first organization and employees of a second organization is identified. Based on the job change data, a number of users that listed, in their respective online profiles, the first organization as an employer is identified. Based on the number of connections and the number of users, a determination of whether the first organization and the second organization are related is made.Type: ApplicationFiled: June 28, 2018Publication date: January 2, 2020Inventors: Shen Huang, Quan Wang, Huan Hoang, Yongzheng Zhang, Chi-Yi Kuan
-
Publication number: 20190362003Abstract: Techniques in a search system for improving the precision and recall of search results returned for search queries including long tail queries submitted against a vertical search corpus are disclosed. The techniques include mapping a user query submitted to the system to a more representative query and executing the more representative query against the vertical search corpus. Documents identified as matching the more representative query may be returned in a search result as an answer to the user query in addition to or instead of documents identified as matching the user query. By including the documents identified as matching the more representative query in the search result, the search result may be relevant to the user than a search result that includes just documents matching the user query.Type: ApplicationFiled: May 24, 2018Publication date: November 28, 2019Inventors: Yongzheng Zhang, Rui Zhao, Zhou Jin, Chi-Yi Kuan
-
Publication number: 20190303773Abstract: Techniques are provided for predicting talent flow to and/or from a geographical region. In one technique, multiple entity profiles are stored and analyzed to generate training data that is labeled indicating whether a corresponding entity has moved to or moved from a region. A machine-learned prediction model is generated or trained based on the training data. Using the machine-learned prediction model, a prediction is made whether, for each entity corresponding to another entity profile, that entity will move to or move from a particular geographic region. Based on multiple predictions, a number of entities that are predicted to move to or move from the particular geographic region is determined. Talent flow data that is based on the number of entities is presented on a computer display.Type: ApplicationFiled: March 30, 2018Publication date: October 3, 2019Inventors: Chi-Yi Kuan, Shen Huang, Yu Wang, Yongzheng Zhang, Paul Ko, Shady Elasra, Fanbin Bu
-
Patent number: 10410136Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains validated training data containing a first set of content items and a first set of classification tags for the first set of content items. Next, the system uses the validated training data to produce a statistical model for classifying content using a set of dimensions represented by the first set of classification tags. The system then uses the statistical model to generate a second set of classification tags for a second set of content items. Finally, the system outputs one or more groupings of the second set of content items by the second set of classification tags to improve understanding of content related to the set of dimensions without requiring a user to manually analyze the second set of content items.Type: GrantFiled: September 16, 2015Date of Patent: September 10, 2019Assignee: Microsoft Technology Licensing, LLCInventors: Yongzheng Zhang, Chi-Yi Kuan, Yi Zheng
-
Patent number: 10380150Abstract: Method and system for identifying user expectations in question answering in an on-line social network system are described. The automated support system is configured to address the technical problem of optimization of the processing of user input submitted to a computer in the form of a natural language. The automated support system uses machine learning algorithms to automatically extract, from the user input, information indicative of the user's expectations and obtain data relevant to the input based on said information indicative of the user's expectations.Type: GrantFiled: December 20, 2017Date of Patent: August 13, 2019Assignee: Microsoft Technology Licensing, LLCInventors: Shen Huang, Yongzheng Zhang, Chi-Yi Kuan, Hu Wang, Rui Zhao, Zhou Jin