Patents by Inventor Mingkuan Liu
Mingkuan Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230306047
Abstract: Technology for the improved processing of search queries is provided. In one embodiment, methods may return semantically relevant search results for a search query. During an offline pre-computing process, an inventory semantic index may be generated and may include inventory binary hashing signatures that are associated with inventory listings, such as goods or services for sale, and the index may be partitioned by categories and shards. When a search query is received, relevant categories are determined using a relevant category recognition service, and a search query binary hashing signature may be generated for the search query. The relevant categories are searched to determine Hamming distances between the inventory binary hashing signatures and the search query binary hashing signature, where the Hamming distance indicates semantic relevance.
Type: Application
Filed: June 1, 2023
Publication date: September 28, 2023
Inventor: Mingkuan LIU
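The lookup step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the toy index layout, the 8-bit signatures, and the names `hamming_distance` and `search` are assumptions made for illustration, and the sketch omits sharding and the category recognition service.

```python
from typing import Dict, List, Tuple

# Hypothetical toy index: category name -> list of (listing_id, binary signature).
Index = Dict[str, List[Tuple[str, int]]]


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two signatures differ."""
    return bin(a ^ b).count("1")


def search(index: Index, categories: List[str], query_sig: int, k: int = 3) -> List[Tuple[str, int]]:
    """Scan only the given categories and return the k nearest listings."""
    scored = [
        (listing_id, hamming_distance(sig, query_sig))
        for category in categories
        for listing_id, sig in index.get(category, [])
    ]
    # A smaller distance is treated here as higher semantic relevance.
    return sorted(scored, key=lambda pair: pair[1])[:k]


toy_index: Index = {
    "shoes": [("s1", 0b10110010), ("s2", 0b10110011)],
    "phones": [("p1", 0b01001100)],
}
results = search(toy_index, ["shoes"], 0b10110011, k=2)
```

Restricting the scan to the relevant categories is what keeps the comparison cheap: listings in "phones" are never touched for a query routed to "shoes".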
-
Patent number: 11698921
Abstract: Technology for the improved processing of search queries is provided. In one embodiment, methods may return semantically relevant search results for a search query. During an offline pre-computing process, an inventory semantic index may be generated and may include inventory binary hashing signatures that are associated with inventory listings, such as goods or services for sale, and the index may be partitioned by categories and shards. When a search query is received, relevant categories are determined using a relevant category recognition service, and a search query binary hashing signature may be generated for the search query. The relevant categories are searched to determine Hamming distances between the inventory binary hashing signatures and the search query binary hashing signature, where the Hamming distance indicates semantic relevance.
Type: Grant
Filed: September 17, 2018
Date of Patent: July 11, 2023
Assignee: EBAY INC.
Inventor: Mingkuan Liu
-
Patent number: 11573985
Abstract: In an example, one or more leaf-category-specific unsupervised statistical language models (SLMs) are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Type: Grant
Filed: April 16, 2021
Date of Patent: February 7, 2023
Assignee: eBay Inc.
Inventor: Mingkuan Liu
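As a rough illustration of the perplexity deviation signal, the sketch below scores a title against a toy unigram model and normalizes the gap from the category's expected perplexity by its standard deviation. The abstract states only that the signal depends on the difference and on the standard deviation; treating it as a z-score, the unigram model, the probability floor, and the function names are all assumptions. The GBM fusion step is not shown.

```python
import math
from typing import Dict, List


def perplexity(tokens: List[str], probs: Dict[str, float], floor: float = 1e-6) -> float:
    """Perplexity of a title under a toy unigram model (assumed for illustration)."""
    if not tokens:
        raise ValueError("title must contain at least one token")
    # Unseen tokens fall back to a small floor probability.
    log_sum = sum(math.log(probs.get(token, floor)) for token in tokens)
    return math.exp(-log_sum / len(tokens))


def perplexity_deviation(title_ppl: float, expected_ppl: float, std_ppl: float) -> float:
    """Assumed form of the signal: a z-score of the title's perplexity."""
    if std_ppl <= 0:
        raise ValueError("standard deviation must be positive")
    return (title_ppl - expected_ppl) / std_ppl


# Hypothetical leaf-category model and statistics.
category_probs = {"running": 0.5, "shoes": 0.5}
title_ppl = perplexity(["running", "shoes"], category_probs)
signal = perplexity_deviation(title_ppl, expected_ppl=1.5, std_ppl=0.25)
```

A large positive value would suggest the title is unusually surprising for its leaf category, which is the kind of evidence a downstream classifier could weigh alongside other signals.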
-
Patent number: 11227004
Abstract: In accordance with an example embodiment, large scale category classification based on sequence semantic embedding and parallel learning is described. In one example, one or more closest matches are identified by comparison between (i) a publication semantic vector that corresponds to at least part of the publication, the publication semantic vector based on a first machine-learned model that projects the at least part of the publication into a semantic vector space, and (ii) a plurality of category vectors corresponding to respective categories from a plurality of categories.
Type: Grant
Filed: January 6, 2020
Date of Patent: January 18, 2022
Assignee: eBay Inc.
Inventor: Mingkuan Liu
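The closest-match comparison can be sketched as a nearest-neighbor lookup over category vectors. The abstract does not name a similarity measure, so cosine similarity is an assumption here, as are the two-dimensional toy vectors and the function names; the machine-learned model that produces the vectors is out of scope.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def closest_categories(
    publication_vec: Sequence[float],
    category_vecs: Dict[str, Sequence[float]],
    k: int = 1,
) -> List[Tuple[str, float]]:
    """Return the k categories whose vectors are most similar to the publication vector."""
    scored = [
        (name, cosine_similarity(publication_vec, vec))
        for name, vec in category_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


# Hypothetical category vectors in a 2-dimensional semantic space.
toy_categories = {"shoes": [1.0, 0.0], "phones": [0.0, 1.0]}
best = closest_categories([0.9, 0.1], toy_categories, k=1)
```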
-
Publication number: 20210232606
Abstract: In an example, one or more leaf-category-specific unsupervised statistical language models (SLMs) are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Type: Application
Filed: April 16, 2021
Publication date: July 29, 2021
Inventor: Mingkuan Liu
-
Patent number: 10984023
Abstract: In an example, one or more leaf-category-specific unsupervised statistical language models (SLMs) are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Type: Grant
Filed: September 21, 2018
Date of Patent: April 20, 2021
Assignee: eBay Inc.
Inventor: Mingkuan Liu
-
Publication number: 20200218750
Abstract: In accordance with an example embodiment, large scale category classification based on sequence semantic embedding and parallel learning is described. In one example, one or more closest matches are identified by comparison between (i) a publication semantic vector that corresponds to at least part of the publication, the publication semantic vector based on a first machine-learned model that projects the at least part of the publication into a semantic vector space, and (ii) a plurality of category vectors corresponding to respective categories from a plurality of categories.
Type: Application
Filed: January 6, 2020
Publication date: July 9, 2020
Inventor: Mingkuan Liu
-
Patent number: 10635727
Abstract: Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments identify one or more closest matches between (i) a search semantic vector that corresponds to a search query, the search semantic vector based on a first machine-learned model that projects the search query into a semantic vector space, and (ii) a plurality of publication vectors corresponding to respective publications in the publication corpus, the plurality of publication vectors based on a second machine-learned model that projects the plurality of publication vectors into the semantic vector space.
Type: Grant
Filed: February 22, 2017
Date of Patent: April 28, 2020
Assignee: eBay Inc.
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Patent number: 10606873
Abstract: Embodiments of the present disclosure relate generally to index trimming to improve search results of a large corpus. Some embodiments, prior to receiving, from a user device, a search query of one or more keywords searching for a relevant set of publications in a publication corpus, trim candidate publications from a plurality of candidate publications to generate a trimmed plurality of candidate publications.
Type: Grant
Filed: February 22, 2017
Date of Patent: March 31, 2020
Assignee: EBAY INC.
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Patent number: 10599701
Abstract: In accordance with an example embodiment, large scale category classification based on sequence semantic embedding and parallel learning is described. In one example, one or more closest matches are identified by comparison between (i) a publication semantic vector that corresponds to at least part of the publication, the publication semantic vector based on a first machine-learned model that projects the at least part of the publication into a semantic vector space, and (ii) a plurality of category vectors corresponding to respective categories from a plurality of categories.
Type: Grant
Filed: February 10, 2017
Date of Patent: March 24, 2020
Assignee: EBAY INC.
Inventor: Mingkuan Liu
-
Publication number: 20200089808
Abstract: Technology for the improved processing of search queries is provided. In one embodiment, methods may return semantically relevant search results for a search query. During an offline pre-computing process, an inventory semantic index may be generated and may include inventory binary hashing signatures that are associated with inventory listings, such as goods or services for sale, and the index may be partitioned by categories and shards. When a search query is received, relevant categories are determined using a relevant category recognition service, and a search query binary hashing signature may be generated for the search query. The relevant categories are searched to determine Hamming distances between the inventory binary hashing signatures and the search query binary hashing signature, where the Hamming distance indicates semantic relevance.
Type: Application
Filed: September 17, 2018
Publication date: March 19, 2020
Inventor: Mingkuan Liu
-
Patent number: 10558696
Abstract: In accordance with an example embodiment, large scale category classification based on sequence semantic embedding and parallel learning is described. In one example, one or more closest matches are identified by comparison between (i) a publication semantic vector that corresponds to at least part of the publication, the publication semantic vector based on a first machine-learned model that projects the at least part of the publication into a semantic vector space, and (ii) a plurality of category vectors corresponding to respective categories from a plurality of categories.
Type: Grant
Filed: February 10, 2017
Date of Patent: February 11, 2020
Assignee: EBAY INC.
Inventor: Mingkuan Liu
-
Patent number: 10430446
Abstract: Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments, with at least one of the keywords of the search query encoded by a semantic vector in a semantic vector space, identify a plurality of candidate publications in the publication corpus, the plurality of candidate publications encoded by a cluster of a plurality of semantic vectors in the semantic vector space, the identifying based on proximity in the semantic vector space between the at least one of the keywords of the search query and keywords in the plurality of candidate publications, the proximity based on a first machine-learned model that projects the at least one keyword in the search query and the keywords in the plurality of candidate publications into the semantic vector space.
Type: Grant
Filed: February 22, 2017
Date of Patent: October 1, 2019
Assignee: eBay Inc.
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Patent number: 10268752
Abstract: In accordance with an example embodiment, an automated taxonomy mapping system that uses sequence semantic embedding techniques is described. Sequence semantic embedding models are used to generate the sequence vectors. The sequence semantic embedding models are trained offline and can be shared across different systems having different taxonomies and various versions of a category taxonomy.
Type: Grant
Filed: September 2, 2016
Date of Patent: April 23, 2019
Assignee: eBay Inc.
Inventor: Mingkuan Liu
-
Publication number: 20190026356
Abstract: In an example, one or more leaf-category-specific unsupervised statistical language models (SLMs) are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Type: Application
Filed: September 21, 2018
Publication date: January 24, 2019
Inventor: Mingkuan Liu
-
Patent number: 10095770
Abstract: In an example, one or more leaf-category-specific unsupervised statistical language models (SLMs) are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Type: Grant
Filed: September 22, 2015
Date of Patent: October 9, 2018
Assignee: eBay Inc.
Inventor: Mingkuan Liu
-
Publication number: 20180052908
Abstract: Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments, with at least one of the keywords of the search query encoded by a semantic vector in a semantic vector space, identify a plurality of candidate publications in the publication corpus, the plurality of candidate publications encoded by a cluster of a plurality of semantic vectors in the semantic vector space, the identifying based on proximity in the semantic vector space between the at least one of the keywords of the search query and keywords in the plurality of candidate publications, the proximity based on a first machine-learned model that projects the at least one keyword in the search query and the keywords in the plurality of candidate publications into the semantic vector space.
Type: Application
Filed: February 22, 2017
Publication date: February 22, 2018
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Publication number: 20180052929
Abstract: Embodiments of the present disclosure relate generally to indexing with multiple algorithms to improve search results of a large corpus.
Type: Application
Filed: February 22, 2017
Publication date: February 22, 2018
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Publication number: 20180052876
Abstract: Embodiments of the present disclosure relate generally to index trimming to improve search results of a large corpus. Some embodiments, prior to receiving, from a user device, a search query of one or more keywords searching for a relevant set of publications in a publication corpus, trim candidate publications from a plurality of candidate publications to generate a trimmed plurality of candidate publications.
Type: Application
Filed: February 22, 2017
Publication date: February 22, 2018
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Publication number: 20180052928
Abstract: Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments identify one or more closest matches between (i) a search semantic vector that corresponds to a search query, the search semantic vector based on a first machine-learned model that projects the search query into a semantic vector space, and (ii) a plurality of publication vectors corresponding to respective publications in the publication corpus, the plurality of publication vectors based on a second machine-learned model that projects the plurality of publication vectors into the semantic vector space.
Type: Application
Filed: February 22, 2017
Publication date: February 22, 2018
Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu