Using Probabilistic Model (epo) Patents (Class 707/E17.079)

Database query for histograms

Patent number: 10949438

Abstract: Methods, systems, and computer programs are presented for obtaining histogram data from a database utilizing an interface with histogram-related options. One method includes an operation for providing, by a server, an application programming interface (API), to access the database, which includes a histogram request, to obtain histogram data from the database, with histogram options. The server receives, from a client device, a first histogram request including histogram options. Additionally, the method includes an operation for identifying bins for the histogram based on the one or more histogram options. For each bin, the server accesses the database to obtain data for each bin. The server returns, to the client device, the histogram data for the histogram as a table containing bin values for all the bins, where the client device is configured to present the histogram to a user based on the histogram data.

Type: Grant

Filed: March 8, 2017

Date of Patent: March 16, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Bolin Ding, Chi Wang, Danyel A. Fisher, Robyn Dominik Moritz
Systems and methods for merging electronic data collections

Patent number: 10467276

Abstract: The present disclosure, in some embodiments, describes a system for classifying members of a collection of texts into clusters to generate merged data collections. A member text can range from a single document to the contents of a column in a database table. The classification may indicate and/or provide an estimation as to which documents or columns are most closely similar to each other, without making any assertion about the actual contents of the document or column. In some embodiments, a system may include counting some characteristic of the text. The characteristic may be chosen such that each text produces a set of counts. A statistical measure is then applied to determine the similarity of sets of counts associated with each pair of texts.

Type: Grant

Filed: January 27, 2017

Date of Patent: November 5, 2019

Assignee: CEEQ IT CORPORATION

Inventor: Gaston Henry Gonnet
ANALYSIS AND PROPOSAL CREATION FOR MANAGEMENT OF PERSONAL ELECTRONICALLY ENCODED ITEMS

Publication number: 20140067878

Abstract: A method performed on an electronic device for creating a proposal to a user. The proposal is created from an analysis of one or more personal electronically encoded items associated with one or more previously created perspectives unique to a user with each of the previously created perspectives representative of user interest and is based on similarities determined from the analysis. The proposal includes the creation of a new perspective, a new album, or a new perspective and a new album associated with the new perspective and the association therewith of one or more of the one or more analyzed personal electronically encoded items. Responsive to user acceptance of the proposal, the system creates the new perspective, the new album, or the new perspective and the new album associated with the new perspective and associates therewith one or more of the one or more analyzed personal electronically encoded items.

Type: Application

Filed: August 31, 2012

Publication date: March 6, 2014

Applicant: RESEARCH IN MOTION LIMITED

Inventors: Anand Ravindra OKA, Sean Bartholomew SIMMONS, Christopher Harris SNOW, Steven Michael HANOV, Ghasem NADDAFZADEH SHIRAZI
METHOD AND APPARATUS FOR SEEDED USER INTEREST MODELING

Publication number: 20130013644

Abstract: Methods and apparatuses are provided for user interest modeling. A method may include receiving an input from a user for specifying one or more topics from among a predetermined hierarchy of topics and subtopics. The method may additionally include retrieving one or more documents associated with the user and extracting language tokens from the documents based, at least in part, on the specified topics. Corresponding apparatuses are also provided.

Type: Application

Filed: March 29, 2010

Publication date: January 10, 2013

Applicant: NOKIA CORPORATION

Inventors: Sailesh Sathish, Jilei Tian, Rile Hu
DATABASE SYSTEMS AND METHODS FOR LINKING RECORDS AND ENTITY REPRESENTATIONS WITH SUFFICIENTLY HIGH CONFIDENCE

Publication number: 20120278340

Abstract: Disclosed are a system for, and method of, determining whether records correspond to the same individual. The system and method provide such a determination with a known minimum level of confidence. That is, the system and method provide an indication that records correspond to the same individual along with an associated confidence level. The system and method may be used to link records in a database that correspond to the same individuals, creating entity representations in the database.

Type: Application

Filed: July 11, 2012

Publication date: November 1, 2012

Applicant: LexisNexis Risk & Information Analytics Group Inc.

Inventor: David Alan Bayliss
Noise Tolerant Graphical Ranking Model

Publication number: 20120271821

Abstract: The relevance of an object, such as a document resulting from a query, may be determined automatically. A graphical model-based technique is applied to determine the relevance of the object. The graphical model may represent relationships between actual and observed labels for the object, based on features of the object. The graphical model may take into account an assumption of noisy training data by modeling the noise.

Type: Application

Filed: April 20, 2011

Publication date: October 25, 2012

Applicant: Microsoft Corporation

Inventors: Tao Qin, Tie-Yan Liu, Xiubo Geng
ONLINE SPELLING CORRECTION/PHRASE COMPLETION SYSTEM

Publication number: 20120246133

Abstract: Online spelling correction/phrase completion is described herein. A computer-executable application receives a phrase prefix from a user, wherein the phrase prefix includes a first character sequence. A transformation probability is retrieved responsive to receipt of the phrase prefix, wherein the transformation probability indicates a probability that a second character sequence has been transformed into a first character sequence. A search is then executed over a trie to locate a most probable phrase completion based at least in part upon the transformation probability.

Type: Application

Filed: March 23, 2011

Publication date: September 27, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Bo-June Hsu, Kuansan Wang, Huizhong Duan
Budget driven purchase monitor

Patent number: 8271344

Abstract: The method for conducting a purchase involves obtaining a target item specification for purchasing a target item, wherein the target item specification comprises a target item description, a target purchase date, and a target price, obtaining, using a central processing unit (CPU), candidate item information related to a candidate item found on a merchant site consistent with the target item specification, wherein the candidate item information comprises a candidate item description and a candidate item price, populating a categorized entry in a user budget using the candidate item price and the target purchase date, wherein the candidate item price is no more than the target price, updating, using the CPU, the budget based on user transactions, generating a result by analyzing, using the CPU, the user budget with respect to the categorized entry, and submitting the result to a user for formulating a decision regarding whether to purchase the candidate item.

Type: Grant

Filed: October 28, 2009

Date of Patent: September 18, 2012

Assignee: Intuit Inc.

Inventors: Girish Mallenahally Channakeshava, Arien C. Ferrell
Infrastructure and architecture for development and execution of predictive models

Patent number: 8229973

Abstract: A system that enables development and execution of predictive models comprises a centralized data management system, a data extraction tool a model validation tool and a model execution tool. In embodiments, a data management system includes a data management server that can be accessed via a web browser that stores data. An extraction tool includes a data filter adapted to filter data based on, for example, a population criteria, a sample size, and a date range criteria. A model validation tool validates the model. A model execution tool allows a user to score the model.

Type: Grant

Filed: April 19, 2011

Date of Patent: July 24, 2012

Assignee: American Express Travel Related Services Company, Inc.

Inventors: Sanjay S. Agrawal, Sastry V S M Durvasula, Narasimha Murthy, Sandeep Sacheti, Deep Thomas, Karl Von Wolff
TOPIC-ORIENTED DIVERSIFIED ITEM RECOMMENDATION

Publication number: 20120089621

Abstract: A content recommendation system and method are provided in which content semantic topic analysis, user interest identification and per interest recommendations are used to deliver relevant and diversified content recommendations to the user. Semantic topic analysis is used to infer underlying topics in content items; for each content item, a topic distribution vector is derived with components that represent relevance of the content item to specific underlying topics. A user's long term and short term user interests are identified using the user's browsing history. Long term user interest(s) can be obtained by a weighted aggregation of topic distribution vectors of content items the user accessed. Short term interest can be represented by the topic distribution vector corresponding to a current content item. Using identified user's interests, relevant content items are selected for recommendations for the user.

Type: Application

Filed: October 11, 2010

Publication date: April 12, 2012

Inventors: Peng Liu, Xianyu Zhao, Wei Li
METHODS AND APPARATUS TO ANALYZE AND ADJUST DEMOGRAPHIC INFORMATION

Publication number: 20120072469

Abstract: Methods, apparatus, and articles of manufacture to analyze and adjust demographic information of audience members are disclosed. An example method involves generating a first model based on reference demographic data corresponding to panelists and based on second demographic data and behavioral data from a database proprietor. The second demographic data and the behavioral data correspond to ones of the panelists having user accounts with the database proprietor. The method also involves using the first model to partition the second demographic data into a plurality of nodes, each node representing a respective subset of the second demographic data. The method also involves redistributing at least some of the second demographic data between at least some of the nodes to generate a second model.

Type: Application

Filed: August 12, 2011

Publication date: March 22, 2012

Inventors: Albert R. Perez, Josh Gaunt
DETERMINING QUERY INTENT

Publication number: 20110314012

Abstract: A tree structure has a node associated with each category of a hierarchy of item categories. Child nodes of the tree are associated with sub-categories of the categories associated with parent nodes. Training data including received queries and indicators of a selected item category for each received query is combined with the tree structure by associating each query with the node corresponding to the selected category of the query. When a query is received, a classifier is applied to the nodes to generate a probability that the query is intended to match an item of the category associated with the node. The classifier is applied until the probability is below a threshold. One or more categories associated with the nodes that are closest to the intent of the received query are selected and indicators of items of those categories that match the received query are output.

Type: Application

Filed: June 16, 2010

Publication date: December 22, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Krishnaram N. G. Kenthapadi, Panayiotis Tsaparas, Sreenivas Gollapudi, Rakesh Agrawal
Media Item Recommendation

Publication number: 20110314039

Abstract: Media item recommendation is described. In one example, a statistical model of media consumption is applied to media session consumption data from a community of users to infer parameters of the model. The model comprises a first probability distribution for each user defining a likelihood of the user having a latent characteristic for a session, and a second probability distribution for each latent characteristic defining a likelihood of a user selecting a media item given the latent characteristic. In another example, the inferred parameters are provided to a recommendation engine arranged to recommend media items. The recommendation engine uses the model with inferred parameters and data describing media items newly consumed by a user to infer a current latent characteristic for a current session of the user, and uses them to generate recommended media items for the user in the current session based on the current latent characteristic.

Type: Application

Filed: June 18, 2010

Publication date: December 22, 2011

Applicant: Microsoft Corporation

Inventors: Elena Zheleva, John Guiver, Natasa Milic-Frayling, Eduarda Mendes Rodrigues
RESEARCH MISSION IDENTIFICATION

Publication number: 20110295776

Abstract: A system and method is described herein that automatically determines if a user of a search engine is conducting a research mission and then provides one or more research tools, one or more specialized searches, one or more directed ads, and/or one or more marketplace events responsive to determining that the research mission is being conducted. The automatic provision of various events and/or tools responsive to determination of the research mission can advantageously improve the experience of the user conducting the research mission.

Type: Application

Filed: May 31, 2010

Publication date: December 1, 2011

Applicant: Yahoo! Inc.

Inventors: Debora Donato, Francesco Bonchi, Liang-Yu Chi
QUERY CORRECTION PROBABILITY BASED ON QUERY-CORRECTION PAIRS

Publication number: 20110295897

Abstract: Query-correction pairs can be extracted from search log data. Each query-correction pair can include an original query and a follow-up query, where the follow-up query meets one or more criteria for being identified as a correction of the original query, such as an indication of user input indicating the follow-up query is a correction for the original query. The query-correction pairs can be segmented to identify bi-phrases in the query-correction pairs. Probabilities of corrections between the bi-phrases can be estimated based on frequencies of matches in the query-correction pairs. Identifications of the bi-phrases and representations of the probabilities of those bi-phrases can be stored in a probabilistic model data structure.

Type: Application

Filed: June 1, 2010

Publication date: December 1, 2011

Applicant: Microsoft Corporation

Inventors: Jianfeng Gao, Christopher B. Quirk, Daniel Micol Ponce, Andreas Bode, Xu Sun
PROVIDING SEARCH RESULTS IN RESPONSE TO A SEARCH QUERY

Publication number: 20110270849

Abstract: A method for providing search results in response to a search query is provided. The method includes receiving the search query from a user and generating a plurality of results in response to the search query. The plurality of results may be ranked according to an original relevancy score. The method further includes generating a click relevancy score for each of the plurality of results and re-ranking the plurality of results according to the click relevancy score.

Type: Application

Filed: April 30, 2010

Publication date: November 3, 2011

Applicant: Microsoft Corporation

Inventors: Manik Varma, Vidit Jain
MODEL FOR MARKET IMPACT ANALYSIS OF PART REMOVAL FROM COMPLEX PRODUCTS

Publication number: 20110251877

Abstract: A model for impact analysis determines impact of part removal from a product. An entity is identifies that includes a plurality of sub-components. One or more performance measures associated with the entity are identified. One or more of the sub-components to be removed from the entity are identified. A substitution impact function is defined. Impact on said one or more performance measures is determined using the substitution impact function.

Type: Application

Filed: April 7, 2010

Publication date: October 13, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Richard D. Lawrence, Claudia Perlich
WEB-SCALE ENTITY RELATIONSHIP EXTRACTION

Publication number: 20110251984

Abstract: Methods and systems for Web-scale entity relationship extraction are usable to build large-scale entity relationship graphs from any data corpora stored on a computer-readable medium or accessible through a network. Such entity relationship graphs may be used to navigate previously undiscoverable relationships among entities within data corpora. Additionally, the entity relationship extraction may be configured to utilize discriminative models to jointly model correlated data found within the selected corpora.

Type: Application

Filed: April 9, 2010

Publication date: October 13, 2011

Applicant: Microsoft Corporation

Inventors: Zaiqing Nie, Xiaojiang Liu, Jun Zhu, Ji-Rong Wen
RANKING EXPERT RESPONSES AND FINDING EXPERTS BASED ON RANK

Publication number: 20110119264

Abstract: In a computerized social network, expert and user chat sessions are stored and rated probabilistically. Later user requests for information are met with an expert ranking, based on a balance of similarities between expert profile and questions; similarity between expert profile and prior chat sessions, and dynamically updated chat session ratings. New sessions can be rated automatically with reference to keywords distilled from past sessions responsive to user ratings—and based on session length.

Type: Application

Filed: November 18, 2009

Publication date: May 19, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Jianying Hu, Jennifer Lai, Aleksandra Mojsilovic, Vikas Sindhwani, Kevin Singley
Phonetic Variation Model Building Apparatus and Method and Phonetic Recognition System and Method Thereof

Publication number: 20110119051

Abstract: A phonetic variation model building apparatus, having a phoneme database for recording at least a standard phonetic model of a language and a plurality of non-standardized phonemes of the language is provided. A phonetic variation identifier identifies a plurality of phonetic variations between the non-standardized phonemes and the standard phonetic model. A phonetic transformation calculator calculates a plurality of coefficients of a phonetic transformation function based on the phonetic variations and the phonetic transformation function. A phonetic variation model generator generates at least a phonetic variation model based on the standard phonetic model, the phonetic transformation function and the coefficients thereof.

Type: Application

Filed: December 15, 2009

Publication date: May 19, 2011

Applicant: INSTITUTE FOR INFORMATION INDUSTRY

Inventors: Huan-Chung Li, Chung-Hsien Wu, Han-Ping Shen, Chun-Kai Wang, Chia-Hsin Hsieh
HOTSPOT ANALYSIS SYSTEMS AND METHODS

Publication number: 20100257168

Abstract: Hotspot analysis systems and methods are provided. The hotspot analysis system includes a database, a filtering module, a clustering module, and an analysis module. The database includes a plurality of records, each including context information having at least time information and position information. The filtering module filters the records according to current context information to obtain a plurality of filtered records. The clustering module clusters the filtered records into at least one hotspot cluster according to the position information of the filtered records, and generates a hotspot area for each hotspot cluster. The analysis module calculates integral hotness for each hotspot cluster according to the number of the filtered records in the hotspot cluster and the size of the hotspot area of the hotspot cluster, and generates at least hotspot area information according to the integral hotness and the hotspot area of each hotspot cluster.

Type: Application

Filed: June 26, 2009

Publication date: October 7, 2010

Inventors: Jacob GUO, Hanwen Chang, Yu-Chin Tai, Hsiaowei Chen, Jane Yung-jen Hsu
Sampling analysis of search queries

Publication number: 20100185619

Abstract: Sampling analysis includes classifying a plurality of query keywords into a plurality of query keyword subsets according to page view (PV) values associated with the plurality of query keywords, the plurality of query keywords being submitted by a plurality of users; determining a respective plurality of sample rates of a respective plurality of query keywords in a respective one of the plurality of query keyword subsets; and sampling query data in the respective one of the plurality of query keyword subsets according to the respective plurality of sample rates.

Type: Application

Filed: January 20, 2010

Publication date: July 22, 2010

Inventors: Junlin Zhang, Jian Sun, Lei Hou, Qin Zhang
METHOD AND SYSTEM FOR DEVELOPING A META-MODEL SCHEMA

Publication number: 20100153473

Abstract: The present invention provides a method, system and computer program product for developing a meta-model schema on the basis of one or more requirements associated with an enterprise process. The method includes defining various sets of meta-models based on the requirements and a predefined ontology. Each set of meta-models includes at least one meta-model that has been defined based on at least one other meta-model of the set of meta-models. Thereafter, the sets of meta-models defined for the corresponding requirements are integrated to develop the meta-model schema.

Type: Application

Filed: December 10, 2009

Publication date: June 17, 2010

Applicant: INFOSYS TECHNOLOGIES LIMITED

Inventor: Kishore GOPALAN
Method, device and software for querying and presenting search results

Publication number: 20100131484

Abstract: There is disclosed a method, device, and software for presenting search results in a response to an end-user query. Search results are combined from results from a plurality of indexes, each of the search results having an associated key field. Index entries of each of the plurality of indexes are queried using an index-specific search algorithm to obtain a set of matching search results for each index, each matching search result having a quality of match specific to its index. A relative priority is determined for each of the plurality of indexes and the matching search results from the plurality of indexes are combined into a merged list of ordered search results based on the determined priority. A search result from a lower priority index is discarded in favor of any matching search result from a higher priority index.

Type: Application

Filed: September 8, 2009

Publication date: May 27, 2010

Inventors: David B. Gosse, Tym D. Feindel, Jungho Kim, Justin R. Nutzman, Michael T. Winters, Jennifer L. Gosse
Computer Implemented Method and Program for Fast Estimation of Matrix Characteristic Values

Publication number: 20100082643

Abstract: A term-by-document (or part-by-collection) matrix can be used to index documents (or collections) for information retrieval applications. Reducing the rank of the indexing matrix can further reduce the complexity of information retrieval. A method for index matrix rank reduction can involve computing a singular value decomposition and then retaining singular values based on the singular values corresponding to singular values of multiple topics. The expected singular values corresponding to a topic can be determined using the roots of a specially formed characteristic polynomial. The coefficients of the special characteristic polynomial can be based on computing the determinants of a Gram matrix of term (or part) probabilities, a method of recursion, or a method of recursion further weighted by the probability of document (or collection) lengths.

Type: Application

Filed: December 7, 2009

Publication date: April 1, 2010

Applicant: Selective, Inc.

Inventors: Jacob Gilmore Martin, Earl Rodney Canfield
SYSTEM AND METHOD FOR INFERRING AND VISUALIZING CORRELATIONS OF DIFFERENT BUSINESS ASPECTS FOR BUSINESS TRANSFORMATION

Publication number: 20100082696

Abstract: A system and method for inferring and visualizing correlations of different business aspects for business transformation are provided. Business models, for instance, that may include business component model, business process model, value drivers and metrics model, application model, organization model, and solutions model are organized into a model topology data schema, and qualitative relationships and quantitative relationships may be configured among the entities or components of the business models. Correlations are inferred and visualized based on those relationships.

Type: Application

Filed: October 1, 2008

Publication date: April 1, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Rong Zeng Cao, Wei Ding, Shun Jiang, Juhnyoung Lee, Chun Hua Tian
TEXT BASED SCHEMA DISCOVERY AND INFORMATION EXTRACTION

Publication number: 20090300043

Abstract: Various technologies and techniques are disclosed for text based schema discovery and information extraction. Documents are analyzed to identify sections of the documents and a relationship between the sections. Statistics are stored regarding occurrences of items in the documents. A probabilistic model is generated based on the stored statistics. A database schema is generated with a plurality of tables based upon the probabilistic model. The documents are analyzed against the probabilistic model to determine how the documents map to the tables generated from the database schema. The tables are populated from the documents based on a result of the analysis against the probabilistic model.

Type: Application

Filed: May 27, 2008

Publication date: December 3, 2009

Applicant: MICROSOFT CORPORATION

Inventor: C. James MacLennan
TEXT EXPLANATION FOR ON-LINE ANALYTIC PROCESSING EVENTS

Publication number: 20070282830

Abstract: A method and structure for analyzing a database having non-text data in data fields and text in text fields. The invention first selects a subset of the database based upon criteria. The subset includes data field(s) and associated text field(s). The invention searches for data matching the criteria within structured data fields of the database. If the invention searches multiple databases, the invention creates shared dimensions for databases that do not share common attributes. The invention automatically selects a relatively short text phrase from the text fields that helps to explain the underlying meaning (i.e. unique text content) of a data subset selected using the non-text data fields.

Type: Application

Filed: August 20, 2007

Publication date: December 6, 2007

Inventors: William Cody, Vikas Krishna, Justin Lessler, William Spangler, Jeffrey Kreulen