Patents by Inventor Sanjeev Katariya

Sanjeev Katariya has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Componentized slot-filling architecture

Publication number: 20070094185

Abstract: The subject disclosure pertains to systems and methods for performing natural language processing in which tokens are mapped to task slots. The system includes a mapper component that generates a lattice representing possible interpretations of the tokens, a decoder component that creates a ranked list of paths traversing the lattice, a scorer component that generates scores used to rank paths and post-processing components that format the paths for use by other software. Each of these components may be independent, such that the component may be modified or replaced without affecting the remaining components. This allows a variety of different mathematical models and algorithms to be tested or deployed without requiring changes to the remainder of the system.

Type: Application

Filed: October 7, 2005

Publication date: April 26, 2007

Applicant: Microsoft Corporation

Inventors: William Ramsey, Jianfeng Gao, Sanjeev Katariya
Client-server word-breaking framework

Publication number: 20070088677

Abstract: Word-breaking of a query from a client machine in a client-server environment includes determining whether to use a first word breaking module operable with a client machine in the client-server environment and/or a second word breaking module operable with a server in the client-server environment.

Type: Application

Filed: October 13, 2005

Publication date: April 19, 2007

Applicant: Microsoft Corporation

Inventors: Sanjeev Katariya, William Ramsey
Distributed named entity recognition architecture

Publication number: 20070088696

Abstract: Named entity recognizers are distributed throughout a client-server system based on where the data for named entity recognition resides.

Type: Application

Filed: October 13, 2005

Publication date: April 19, 2007

Applicant: Microsoft Corporation

Inventors: Sanjeev Katariya, William Ramsey
Intelligent search results blending

Publication number: 20060287980

Abstract: The subject invention relates to systems and methods that automatically combine or interleave received search results from across knowledge databases in a uniform and consistent manner. In one aspect, an automated search results blending system is provided. The system includes a search component that directs a query to at least two databases. A learning component is employed to rank or score search results that are received from the databases in response to the query. A blending component automatically interleaves or combines the results according to the rank in order to provide a consistent ranking system across differing knowledge sources and search tools.

Type: Application

Filed: June 21, 2005

Publication date: December 21, 2006

Applicant: Microsoft Corporation

Inventors: Jun Liu, Adwait Ratnaparkhi, Qi Yao, Sanjeev Katariya
High scale adaptive search systems and methods

Publication number: 20060287993

Abstract: The subject invention relates to systems and methods that employ automated learning techniques to database and information retrieval systems in order to facilitate knowledge capabilities for users and systems. In one aspect, an adaptive information retrieval system is provided. The system includes a database component to store structured and unstructured data values. A search component queries the data values from the database, wherein a learning component associated with the search component or the database component is provided to facilitate retrieval of desired information.

Type: Application

Filed: June 21, 2005

Publication date: December 21, 2006

Applicant: Microsoft Corporation

Inventors: Qi Yao, Jun Liu, Sanjeev Katariya
Adaptive customer assistance system for software products

Publication number: 20060265232

Abstract: An adaptive customer assistance system that can serve as an integrated online and offline help platform for a suite of software products is provided. The assistance system includes a customer-interaction interface and a data management component and a download management component for distributed customer interaction. The data management component includes an authoring component, a download component, a runtime component and an analysis component. The runtime component, which includes a customer assistance model, is configured to receive a user-formulated question from the customer-interaction interface. The runtime component provides an answer to the user-formulated question based on information included in the customer assistance model. The analysis component automatically analyzes, in substantially real-time, the user-formulated question and the corresponding answer, and provides an analysis output for use in improving a quality of customer assistance.

Type: Application

Filed: May 20, 2005

Publication date: November 23, 2006

Applicant: Microsoft Corporation

Inventors: Sanjeev Katariya, Hsiao-Wuen Hon
Performant relevance improvements in search query results

Publication number: 20060253428

Abstract: Property store information and an aggregation of a plurality of ranking mechanisms, including a learning mechanism, are leveraged to provide performant query results with increased user relevancy. The learning mechanism permits query feedback to be accepted to facilitate in optimizing user relevance. This mechanism can also be incorporated with traditional Information Retrieval (IR) components, each supplying independent ranking to a relevance aggregation function that determines relevancy at a high level. This precludes diminishing the value of query feedback that occurs when the data is fed into traditional IR algorithms. By allowing the query feedback to maintain its proper weighting and utilizing scope and bias capabilities of the property store information, relevance increases in a highly performant manner.

Type: Application

Filed: May 6, 2005

Publication date: November 9, 2006

Applicant: Microsoft Corporation

Inventors: Sanjeev Katariya, Qi Yao, Jun Liu, Adwait Ratnaparkhi, Bradley Green
Adaptive systems and methods for making software easy to use via software usage mining

Publication number: 20060242638

Abstract: A system for dynamically updating user accessible features of a software application on a client computer has a user interface, a local usage data file, and a data mining engine. The user interface is adapted to receive operator inputs. The local usage data file is adapted to store usage information corresponding to the operator inputs. The data mining engine is adapted to process the stored usage information and to generate local adjustments to a user interface of the software application based on the operator inputs. In one embodiment, a server is adapted to receive usage data from a plurality of application instances on a plurality of client computers and to generate global adjustments based on the received usage data. In one embodiment, the system has a merge feature adapted to blend and resolve conflicts between local and global adjustments to generate an interface adjustment for the user interface.

Type: Application

Filed: April 22, 2005

Publication date: October 26, 2006

Applicant: Microsoft Corporation

Inventors: Sin Shyh Lew, Pyungchul Kim, Sanjeev Katariya, Zijian Zheng
Creating a summary having sentences with the highest weight, and lowest length

Patent number: 6789230

Abstract: A method and system for generating a summary of a document. The summary generating system generates the summary from the sentences that form the document. The summary generating system calculates a weight for each of the sentences in the document. The weight indicates the importance of the sentence to the document. The summary generating system then selects sentences based on their calculated weights. The summary generating system creates a summary of the selected sentences such that selected sentences are ordered in the created summary in the same relative order as in the document. In one embodiment, the summary generating system identifies sets of sentences whose total length of the sentences in the set is less than a maximum length. The summary generating system then selects an identified set of sentences whose total of the calculated weights of the sentences is greatest as the generated summary. The length of a sentence may be measured in characters or words.

Type: Grant

Filed: December 18, 1998

Date of Patent: September 7, 2004

Assignee: Microsoft Corporation

Inventors: Sanjeev Katariya, William P. Jones
Scalable computing system for presenting customized aggregation of information

Patent number: 6564251

Abstract: A method and computing system for providing a highly reliable web site that is capable of handling millions of Internet users per day. The computing system is customizable on a per user basis yet seamlessly aggregates data from a number of information providers, such as news services, weather services, stock services, personal email, and even a community specific service. The computing system includes a plurality of personalization stores that maintain personalization data for a corresponding Internet user. The computing system is based on a federated model in which a single entry point spans to a plurality of web servers. Each web server has an aggregation module, such as an Internet Server API (ISAPI) filter, that retrieves user-specific data “clips” from the personalization store and passes the data to one or more provider objects.

Type: Grant

Filed: December 3, 1998

Date of Patent: May 13, 2003

Assignee: Microsoft Corporation

Inventors: Sanjeev Katariya, Alok K. Sinha
METHOD AND SYSTEM FOR GENERATING A DOCUMENT SUMMARY

Publication number: 20030079185

Abstract: A method and system for generating a summary of a document. The summary generating system generates the summary from the sentences that form the document. The summary generating system calculates a weight for each of the sentences in the document. The weight indicates the importance of the sentence to the document. The summary generating system then selects sentences based on their calculated weights. The summary generating system creates a summary of the selected sentences such that selected sentences are ordered in the created summary in the same relative order as in the document. In one embodiment, the summary generating system identifies sets of sentences whose total length of the sentences in the set is less than a maximum length. The summary generating system then selects an identified set of sentences whose total of the calculated weights of the sentences is greatest as the generated summary. The length of a sentence may be measured in characters or words.

Type: Application

Filed: December 18, 1998

Publication date: April 24, 2003

Inventors: SANJEEV KATARIYA, WILLIAM P. JONES
Method and system for calculating phrase-document importance

Patent number: 6549897

Abstract: A method and system for generating a weight for phrases within each document in a collection of documents. Each document has terms such as words and numbers. Each phrase comprises component terms. Each term frequency represents the number of occurrences of a term in a document, and the phrase frequency represents the number of occurrences of a phrase in a document. To generate the weight, the weighting system first estimates a document frequency for the phrase by multiplying an estimated phrase probability of the phrase times the number of documents that contain each component term. The estimated phrase probability is an estimation of the probability that any phrase in documents that contain each component term is the phrase whose weight is to be estimated. The document frequency is the number of the documents that contain the phrase. The weighting system then estimates a total phrase frequency for the phrase as the average phrase frequency for the phrase times the estimated document frequency for the phrase.

Type: Grant

Filed: December 17, 1998

Date of Patent: April 15, 2003

Assignee: Microsoft Corporation

Inventors: Sanjeev Katariya, William P. Jones
Method and system for calculating term-document importance

Patent number: 6473753

Abstract: A weighting system for calculating the term-document importance for each term within each document that is part of a collection of documents (i.e., a corpus). The weighting system calculates the importance of a term within a document based on a computed normalized term frequency and a computed inverse document frequency. The computed normalized term frequency is a function, referred to as the “computed term frequency function” (“A”), of a normalized term frequency. The normalized term frequency is the term frequency, which is the number of times that the term occurs in the document, normalized by the total term frequency of the term within all documents, which is the total number of times that the term occurs in all the documents. The weighting system normalizes the term frequency by dividing the term frequency by a function, referred to as the “normalizing term frequency function” (“&Ggr;”), of the total term frequency.

Type: Grant

Filed: December 18, 1998

Date of Patent: October 29, 2002

Assignee: Microsoft Corporation

Inventors: Sanjeev Katariya, William P. Jones
Distributed data cache with memory allocation model

Patent number: 6453404

Abstract: A cache system allocates memory for storage of data items by defining a series of small blocks that are uniform in size. The system allocates one or more blocks from memory and assigns them for storage of a data item. If the data item exceeds the predetermined block size, more blocks are assigned for storage. If a residual portion of the data item less than the predetermined the block size remains, the operating system may allocate an additional small storage block to store the residual portion of the data item. The operating system allocates a large number of small blocks with a plurality of blocks having different sizes where a first plurality of blocks has one block size and a second plurality of blocks has a different block size. The large number of blocks allocated by the operating system avoid contention from multiple users requesting access to the data cache. The predetermined blocks are allocated by the data cache software itself and are not allocated by the operating system.

Type: Grant

Filed: May 27, 1999

Date of Patent: September 17, 2002

Assignee: Microsoft Corporation

Inventors: Alexandre Bereznyi, Sanjeev Katariya
Data cache using plural lists to indicate sequence of data storage

Patent number: 6449695

Abstract: A cache system controls the insertion and deletion of data items using a plurality of utilization lists. When a data item is stored within the data cache, a corresponding data pointer, or other indicator, is stored within the utilization list in a manner indicative of the sequence in which data items were stored in the data cache. When a data item is subsequently retrieved from the data cache, the corresponding data pointer may be altered or moved to indicate that the data item has recently been retrieved. The data pointers corresponding to data items that have never been retrieved will indicate the sequence with which the data items were stored in the cache such that data items may be identified as least recently used (LRU) data items. The data pointers corresponding to data items that have been retrieved provide an indication of the sequence with which the data items have been retrieved such that the most recently retrieved data item is considered the most recently used (MRU) data item.

Type: Grant

Filed: May 27, 1999

Date of Patent: September 10, 2002

Assignee: Microsoft Corporation

Inventors: Alexandre Bereznyi, Sanjeev Katariya
SCALABLE COMPUTING SYSTEM FOR PRESENTING CUSTOMIZED AGGREGATION OF INFORMATION

Publication number: 20020091789

Abstract: A method and computing system for providing a highly reliable web site that is capable of handling millions of Internet users per day. The computing system is customizable on a per user basis yet seamlessly aggregates data from a number of information providers, such as news services, weather services, stock services, personal email, and even a community specific service. The computing system includes a plurality of personalization stores that maintain personalization data for a corresponding Internet user. The computing system is based on a federated model in which a single entry point spans to a plurality of web servers. Each web server has an aggregation module, such as an Internet Server API (ISAPI) filter, that retrieves user-specific data “clips” from the personalization store and passes the data to one or more provider objects.

Type: Application

Filed: December 3, 1998

Publication date: July 11, 2002

Inventors: SANJEEV KATARIYA, ALOK K. SINHA
Layered query management

Patent number: 6226635

Abstract: The present invention provides a distributed software facility for tailoring an information retrieval query issued by a user based upon the position of that user within an organization of users. In a query client, the facility generates an information retrieval query on behalf of an identified user. The identified user is a member both of first logical subset of the organization and of a second logical subset of the organization. In a first proxy server corresponding to the first logical subset of the organization, the facility modifies the generated query based upon query modification criteria specified for members of the first logical subset. In a second proxy server corresponding to a second logical subset of the organization, the facility modifies the generated query based upon query modification criteria specified for members of the second logical subset. The facility then processes the query as modified by the first and second proxy servers to retrieve information responsive to the modified query.

Type: Grant

Filed: August 14, 1998

Date of Patent: May 1, 2001

Assignee: Microsoft Corporation

Inventor: Sanjeev Katariya

prev 1 2 3