Patents by Inventor Haixun Wang

Haixun Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Syntactic models for parsing search queries on online social networks

Patent number: 10185763

Abstract: In one embodiment, a method includes receiving, from a client system of a user of an online social network, a query inputted by the user, wherein the query comprises multiple n-grams; determining one or more head-terms and one or more modifier-terms of the n-grams based on a syntactic model; identifying one or more objects matching at least a portion of the query; ranking each identified object based on a quality of matching of the object to the determined head-terms and modifier-terms; and sending, to the client system in response to the query, a search-results interface for display, wherein the search-results interface includes one or more search results corresponding to one or more of the identified objects, respectively, each identified object corresponding to a search result having a rank greater than a threshold rank.

Type: Grant

Filed: November 30, 2016

Date of Patent: January 22, 2019

Assignee: Facebook, Inc.

Inventor: Haixun Wang
Semantic query by example

Patent number: 10176245

Abstract: A computer-implemented method, system, and computer program product for producing a semantic query by example are provided. The method includes receiving examples of potential results from querying a database table with an associated ontology, and extracting features from the database table and the examples based on the associated ontology. The method further includes training a classifier based on the examples and the extracted features, and applying the classifier to the database table to obtain a semantic query result. The method also includes outputting the semantic query result to a user interface, and requesting user feedback of satisfaction with the semantic query result. The method additionally includes updating the classifier and the semantic query result iteratively in response to the user feedback.

Type: Grant

Filed: September 25, 2009

Date of Patent: January 8, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
Embedding-based parsing of search queries on online social networks

Patent number: 10162886

Abstract: In one embodiment, a method includes receiving a query including multiple n-grams; parsing the query to identify a subset of n-grams; generating, for each identified n-gram, an embedding of the n-gram; determining, for each identified n-gram, one or more word senses; calculating, for each word sense for each identified n-gram, a relatedness-score for the word sense based similarity metrics of the embedding of the word sense and the embeddings of each of the other word senses corresponding to the other identified n-grams; selecting, for each identified n-gram, one of the word senses determined for the identified n-gram having a highest relatedness-score; identifying objects matching at least a portion of the query; ranking each identified object based on a quality of matching of the object to selected word senses; and sending search results corresponding to one or more of the identified objects and having a rank greater than a threshold rank.

Type: Grant

Filed: November 30, 2016

Date of Patent: December 25, 2018

Assignee: Facebook, Inc.

Inventors: Haixun Wang, Bo Zeng
Syntactic Models for Parsing Search Queries on Online Social Networks

Publication number: 20180150552

Abstract: In one embodiment, a method includes receiving, from a client system of a user of an online social network, a query inputted by the user, wherein the query comprises multiple n-grams; determining one or more head-terms and one or more modifier-terms of the n-grams based on a syntactic model; identifying one or more objects matching at least a portion of the query; ranking each identified object based on a quality of matching of the object to the determined head-terms and modifier-terms; and sending, to the client system in response to the query, a search-results interface for display, wherein the search-results interface includes one or more search results corresponding to one or more of the identified objects, respectively, each identified object corresponding to a search result having a rank greater than a threshold rank.

Type: Application

Filed: November 30, 2016

Publication date: May 31, 2018

Inventor: Haixun Wang
Embedding-based Parsing of Search Queries on Online Social Networks

Publication number: 20180150551

Abstract: In one embodiment, a method includes receiving a query including multiple n-grams; parsing the query to identify a subset of n-grams; generating, for each identified n-gram, an embedding of the n-gram; determining, for each identified n-gram, one or more word senses; calculating, for each word sense for each identified n-gram, a relatedness-score for the word sense based similarity metrics of the embedding of the word sense and the embeddings of each of the other word senses corresponding to the other identified n-grams; selecting, for each identified n-gram, one of the word senses determined for the identified n-gram having a highest relatedness-score; identifying objects matching at least a portion of the query; ranking each identified object based on a quality of matching of the object to selected word senses; and sending search results corresponding to one or more of the identified objects and having a rank greater than a threshold rank.

Type: Application

Filed: November 30, 2016

Publication date: May 31, 2018

Inventors: Haixun Wang, Bo Zeng
Method and apparatus for low cost 3D video making

Patent number: 9578308

Abstract: Video capture devices may be held in a frame having at least two openings and a plurality of slots on a first side and a plurality of tabs on a second side opposite the first side, the slots and tabs positioned such that when the frame is folded by folding the first side toward the second side, the tabs may be folded toward the first side, inserted into the slots and then folded parallel to the first side to secure the frame in a closed position. An application on a first device in the frame may establish a connection with a second device in the frame, synchronize a clock in the first device with a clock in the second device, initiating simultaneous video capture, record data indicating changes in orientation of the devices, and transmit the captured videos and the data indicating changes in orientation to a video processing tool.

Type: Grant

Filed: December 17, 2014

Date of Patent: February 21, 2017

Assignee: GOOGLE INC.

Inventors: Charles Perng, Haixun Wang, Hao Yang, Zhen Wen
Automatic taxonomy construction from keywords

Patent number: 9501569

Abstract: A system, method or computer readable storage device to derive a taxonomy from keywords is described herein. A domain-dependent taxonomy from a set of keywords may be automatically derived by leveraging both a general knowledgebase and keyword search. For example, concepts may be deduced with the technique of conceptualization, and context information may be extracted from a search engine. Then, the taxonomy may be constructed using a tree algorithm.

Type: Grant

Filed: April 23, 2013

Date of Patent: November 22, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yangqiu Song, Shixia Liu, Haixun Wang, Zhongyuan Wang, Hongsong Li
Method and apparatus for low cost 3D video making

Publication number: 20160182888

Abstract: Video capture devices may be held in a frame having at least two openings and a plurality of slots on a first side and a plurality of tabs on a second side opposite the first side, the slots and tabs positioned such that when the frame is folded by folding the first side toward the second side, the tabs may be folded toward the first side, inserted into the slots and then folded parallel to the first side to secure the frame in a closed position. An application on a first device in the frame may establish a connection with a second device in the frame, synchronize a clock in the first device with a clock in the second device, initiating simultaneous video capture, record data indicating changes in orientation of the devices, and transmit the captured videos and the data indicating changes in orientation to a video processing tool.

Type: Application

Filed: December 17, 2014

Publication date: June 23, 2016

Inventors: Charles Perng, Haixun Wang, Hao Yang, Zhen Wen
Answering relational database queries using graph exploration

Patent number: 9317557

Abstract: Embodiments are directed to processing queries using schema graph traversal and to establishing a schema graph that allows queries to be answered by traversing graph nodes. In one scenario, a computer system receives a query which specifies relational tables and corresponding relationships that are to be retrieved from a relational database. The computer system accesses a schema graph that includes graph nodes representing relational tables, as well as edges that identify relationships between the relational tables. The graph nodes include relational data that was loaded from one storage area (e.g. a non-volatile storage area), and the schema graph is stored in a second storage area (e.g. a volatile storage area). The computer system then traverses the schema graph, beginning at a set of graph nodes and continuing along the edges to other graph nodes until the query has been satisfied, and then reports the results of the graph traversal.

Type: Grant

Filed: October 24, 2013

Date of Patent: April 19, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Bin Shao, Haixun Wang, Wei Fang
ANSWERING RELATIONAL DATABASE QUERIES USING GRAPH EXPLORATION

Publication number: 20150120775

Abstract: Embodiments are directed to processing queries using schema graph traversal and to establishing a schema graph that allows queries to be answered by traversing graph nodes. In one scenario, a computer system receives a query which specifies relational tables and corresponding relationships that are to be retrieved from a relational database. The computer system accesses a schema graph that includes graph nodes representing relational tables, as well as edges that identify relationships between the relational tables. The graph nodes include relational data that was loaded from one storage area (e.g. a non-volatile storage area), and the schema graph is stored in a second storage area (e.g. a volatile storage area). The computer system then traverses the schema graph, beginning at a set of graph nodes and continuing along the edges to other graph nodes until the query has been satisfied, and then reports the results of the graph traversal.

Type: Application

Filed: October 24, 2013

Publication date: April 30, 2015

Applicant: Microsoft Corporation

Inventors: Bin Shao, Haixun Wang, Wei Fang
System and method for indexing weighted-sequences in large databases

Patent number: 9009176

Abstract: The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data.

Type: Grant

Filed: August 26, 2008

Date of Patent: April 14, 2015

Assignee: SAP AG

Inventors: Wei Fan, Chang-Shing Perng, Haixun Wang, Philip Shi-Lung Yu
Automatic Taxonomy Construction From Keywords

Publication number: 20140317074

Abstract: A system, method or computer readable storage device to derive a taxonomy from keywords is described herein. A domain-dependent taxonomy from a set of keywords may be automatically derived by leveraging both a general knowledgebase and keyword search. For example, concepts may be deduced with the technique of conceptualization, and context information may be extracted from a search engine. Then, the taxonomy may be constructed using a tree algorithm.

Type: Application

Filed: April 23, 2013

Publication date: October 23, 2014

Applicant: Microsoft Corporation

Inventors: Yangqiu Song, Shixia Liu, Haixun Wang, Zhongyuan Wang, Hongsong Li
Processing queries on hierarchical markup data using shared hierarchical markup trees

Patent number: 8635242

Abstract: Disclosed are a method, information processing system, and computer readable medium for processing queries. The method includes receiving a data query for a set of hierarchical markup documents. At least one query path expression is extracted from the data query. The query path is processed against at least one shared hierarchical markup document in a plurality of shared hierarchical markup documents. The plurality of shared hierarchical documents is associated with the set of hierarchical markup documents. In response to the shared hierarchical markup document completely matching the query path expression, a query result for the data query is generated. The query result is based on the processing of the query path expression against at least one of the shared hierarchical markup document and the difference hierarchical markup document.

Type: Grant

Filed: October 11, 2006

Date of Patent: January 21, 2014

Assignee: International Business Machines Corporation

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
System and method for adaptive pruning

Patent number: 8301584

Abstract: Disclosed in a method and structure for searching data in databases using an ensemble of models. First the invention performs training. This training orders models within the ensemble in order of prediction accuracy and joins different numbers of models together to form sub-ensembles. The models are joined together in the sub-ensemble in the order of prediction accuracy. Next in the training process, the invention calculates confidence values of each of the sub-ensembles. The confidence is a measure of how closely results form the sub-ensemble will match results from the ensemble. The size of each of the sub-ensembles is variable depending upon the level of confidence, while, to the contrary, the size of the ensemble is fixed. After the training, the invention can make a prediction. First, the invention selects a sub-ensemble that meets a given level of confidence.

Type: Grant

Filed: December 16, 2003

Date of Patent: October 30, 2012

Assignee: International Business Machines Corporation

Inventors: Wei Fan, Haixun Wang, Philip S. Yu
Identifying and annotating shared hierarchical markup document trees

Patent number: 8108765

Abstract: Disclosed are a method, information processing system, and a computer readable medium for managing documents. The method includes analyzing a plurality of hierarchical markup documents, wherein each hierarchical markup document is representable by a hierarchical tree structure. A shared hierarchical markup document associated with the plurality of hierarchical markup documents is generated based on the analyzing. Each hierarchical markup document in the plurality of hierarchical markup documents is compared with the shared hierarchical document. A plurality of difference hierarchical markup documents is generated based on the comparing.

Type: Grant

Filed: October 11, 2006

Date of Patent: January 31, 2012

Assignee: International Business Machines Corporation

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
System and method for load shedding in data mining and knowledge discovery from stream data

Patent number: 8060461

Abstract: Load shedding schemes for mining data streams. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. In the context of not knowing the exact feature values of a data stream, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.

Type: Grant

Filed: February 17, 2009

Date of Patent: November 15, 2011

Assignee: International Business Machines Corporation

Inventors: Yun Chi, Haixun Wang, Philip S. Yu
SEMANTIC QUERY BY EXAMPLE

Publication number: 20110078187

Abstract: A computer-implemented method, system, and computer program product for producing a semantic query by example are provided. The method includes receiving examples of potential results from querying a database table with an associated ontology, and extracting features from the database table and the examples based on the associated ontology. The method further includes training a classifier based on the examples and the extracted features, and applying the classifier to the database table to obtain a semantic query result. The method also includes outputting the semantic query result to a user interface, and requesting user feedback of satisfaction with the semantic query result. The method additionally includes updating the classifier and the semantic query result iteratively in response to the user feedback.

Type: Application

Filed: September 25, 2009

Publication date: March 31, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
System and method for scalable cost-sensitive learning

Patent number: 7904397

Abstract: A method (and structure) for processing an inductive learning model for a dataset of examples, includes dividing the dataset of examples into a plurality of subsets of data and generating, using a processor on a computer, a learning model using examples of a first subset of data of the plurality of subsets of data. The learning model being generated for the first subset comprises an initial stage of an evolving aggregate learning model (ensemble model) for an entirety of the dataset, the ensemble model thereby providing an evolving estimated learning model for the entirety of the dataset if all the subsets were to be processed. The generating of the learning model using data from a subset includes calculating a value for at least one parameter that provides an objective indication of an adequacy of a current stage of the ensemble model.

Type: Grant

Filed: January 20, 2010

Date of Patent: March 8, 2011

Assignee: International Business Machines Corporation

Inventors: Wei Fan, Haixun Wang, Philip S. Yu
System and method for scalable processing of multi-way data stream correlations

Patent number: 7890649

Abstract: A computer implemented method, apparatus, and computer usable program code for processing multi-way stream correlations. Stream data are received for correlation. A task is formed for continuously partitioning a multi-way stream correlation workload into smaller workload pieces. Each of the smaller workload pieces may be processed by a single host. The stream data are sent to different hosts for correlation processing.

Type: Grant

Filed: June 4, 2009

Date of Patent: February 15, 2011

Assignee: International Business Machines Corporation

Inventors: Xiaohui Gu, Haixun Wang, Philip Shi-lung Yu
Index structure for supporting structural XML queries

Patent number: 7890471

Abstract: The present invention provides a ViST (or “virtual suffix tree”), which is a novel index structure for searching XML documents. By representing both XML documents and XML queries in structure-encoded sequences, it is shown that querying XML data is equivalent to finding (non-contiguous) subsequence matches. A variety of XML queries, including those with branches, or wild-cards (‘*’ and ‘//’), can be expressed by structure-encoded sequences. Unlike index methods that disassemble a query into multiple sub-queries, and then join the results of these sub-queries to provide the final answers, ViST uses tree structures as the basic unit of query to avoid expensive join operations. Furthermore, ViST provides a unified index on both content and structure of the XML documents, hence it has a performance advantage over methods indexing either just content or structure.

Type: Grant

Filed: July 19, 2007

Date of Patent: February 15, 2011

Assignee: International Business Machines Corporation

Inventors: Wei Fan, Haixun Wang, Philip Shi-Lung Yu

prev 1 2 3 4 5 6 next