Patents Assigned to Verity, Inc.
-
Patent number: 7461085Abstract: A method of parametric group processing includes forming a parametric index from an indexed database. A first parametric group and a second parametric group corresponding to elements in the parametric index are specified. The first parametric group and the second parametric group are merged to produce a merged parametric group. A parametric result is extracted from the merged parametric group, where the parametric result specifies a set of documents.Type: GrantFiled: November 23, 2005Date of Patent: December 2, 2008Assignee: Verity, Inc.Inventors: Neil Latarche, John Wang
-
Patent number: 7085771Abstract: The invention is a method, system and computer program for automatically discovering concepts from a corpus of documents and automatically generating a labeled concept hierarchy. The method involves extraction of signatures from the corpus of documents. The similarity between signatures is computed using a statistical measure. The frequency distribution of signatures is refined to alleviate any inaccuracy in the similarity measure. The signatures are also disambiguated to address the polysemy problem. The similarity measure is recomputed based on the refined frequency distribution and disambiguated signatures. The recomputed similarity measure reflects actual similarity between signatures. The recomputed similarity measure is then used for clustering related signatures. The signatures are clustered to generate concepts and concepts are arranged in a concept hierarchy. The concept hierarchy automatically generates query for a particular concept and retrieves relevant documents associated with the concept.Type: GrantFiled: May 17, 2002Date of Patent: August 1, 2006Assignee: Verity, IncInventors: Christina Yip Chung, Jinhui Liu, Alpha Luk, Jianchang Mao, Sumit Taank, Vamsi Vutukuru
-
Patent number: 7031909Abstract: The present invention provides a method, system and computer program for naming a cluster, or a hierarchy of clusters, of words and phrases that have been extracted from a set of documents. The invention takes these clusters as the input and generates appropriate labels for the clusters using a lexical database. Naming involves first finding out all possible word senses for all the words in the cluster, using the lexical database; and then augmenting each word sense with words that are semantically similar to that word sense to form respective definition vectors. Thereafter, word sense disambiguation is done to find out the most relevant sense for each word. Definition vectors are clustered into groups. Each group represents a concept. These concepts are thereafter ranked based on their support. Finally, a pre-specified number of words and phrases from the definition vectors of the dominant concepts are selected as labels, based on their generality in the lexical database.Type: GrantFiled: March 12, 2002Date of Patent: April 18, 2006Assignee: Verity, Inc.Inventors: Jianchang Mao, Sumit Taank, Christina Chung, Alpha Luk
-
Patent number: 6999971Abstract: A method of parametric group processing includes forming a parametric index from an indexed database. A first parametric group and a second parametric group corresponding to elements in the parametric index are specified. The first parametric group and the second parametric group are merged to produce a merged parametric group. A parametric result is extracted from the merged parametric group, where the parametric result specifies a set of documents.Type: GrantFiled: May 8, 2001Date of Patent: February 14, 2006Assignee: Verity, Inc.Inventors: Neil Latarche, John Wang
-
Patent number: 6910026Abstract: A method of identifying features for a classifier includes identifying a set of elements that share a common characteristic, and then identifying a subset of elements within that set which share a second characteristic. Features are then selected that are more commonly possessed by the elements in the subset than the elements in the set but excluding the subset, and that are more commonly possessed by the elements in the set but excluding the subset, as compared to the elements outside the set. A further method of identifying features for a classifier includes defining a list of features, selecting a first feature from that list, identifying a set of elements that possess that first feature, and then identifying a subset of elements within that set which possess any other feature.Type: GrantFiled: August 27, 2001Date of Patent: June 21, 2005Assignee: Verity, Inc.Inventor: Alpha Kamchiu Luk
-
Patent number: 6792419Abstract: A system and method for ranking hyperlinked documents, such as web pages, is provided wherein a stochastic backoff process is used to rank those hyperlinked documents. In more detail, the stochastic process is derived from a random walk through the pages of the web. First, a directed graph may be generated from a crawl wherein the nodes are documents in the crawl and a directed edge from one node A to another node B indicates the presence of a hyperlink from the corresponding document docA to document docB. Using a stochastic backoff process on this graph, a weight between 0 and 1 is assigned to each document so that the documents may be ranked according to the weights.Type: GrantFiled: October 30, 2000Date of Patent: September 14, 2004Assignee: Verity, Inc.Inventor: Prabhakar Raghavan
-
Patent number: 6754647Abstract: Method and apparatus are disclosed for the development and implementation of virtual robot's (bot's) directed natural language interaction with computer users. Bots employing the present invention base natural language interaction on a predefined universe of discourse that is decomposed hierarchically into domains. A data structure provides a storage area for each domain. The data structure may reflect the hierarchical decomposition. Domain topics containing program code directing the bot's interaction are placed in domain storage areas. Pattern lists associate words expected to be “heard” by the bot with particular domain topics. Domain topics are provided, as appropriate, to direct a user's attention toward the instant domain's parent, siblings, or children, with lower topics in the hierarchy getting higher preference. Domain censoring and domain tiebreakers improve usability.Type: GrantFiled: September 26, 2000Date of Patent: June 22, 2004Assignee: Verity, Inc.Inventors: Walter Tackett, John B. Hodges, Scott Benson, D. Patrick Blair, Kate Boynton, Ray Dillinger, Martin Eggenberger, Tom Schofield
-
Patent number: 6738764Abstract: A method of ranking search results includes producing a relevance score for a document in view of a query. A similarity score is calculated for the query utilizing a feature vector that characterizes attributes and query words associated with the document. A rank value is assigned to the document based upon the relevance score and the similarity score.Type: GrantFiled: May 8, 2001Date of Patent: May 18, 2004Assignee: Verity, Inc.Inventors: Jianchang Mao, Mani Abrol, Rajat Mukherjee, Michel Tourn, Prabhakar Raghavan
-
Patent number: 6728704Abstract: This invention includes the step of transmitting a query to a set of search engines. Any result lists returned from these search engines is received, and a subset of entries in each result list is selected. Each entry in this subset is assigned a scoring value according to a scoring function, and each result list is then assigned a representative value according to the scoring values assigned to its entries. A merged list of entries is produced based upon the representative value assigned to each result list.Type: GrantFiled: August 27, 2001Date of Patent: April 27, 2004Assignee: Verity, Inc.Inventors: Jianchang Mao, Rajat Mukherjee, Prabhakar Raghavan, Panayiotis Tsaparas
-
Publication number: 20030217335Abstract: The invention is a method, system and computer program for automatically discovering concepts from a corpus of documents and automatically generating a labeled concept hierarchy. The method involves extraction of signatures from the corpus of documents. The similarity between signatures is computed using a statistical measure. The frequency distribution of signatures is refined to alleviate any inaccuracy in the similarity measure. The signatures are also disambiguated to address the polysemy problem. The similarity measure is recomputed based on the refined frequency distribution and disambiguated signatures. The recomputed similarity measure reflects actual similarity between signatures. The recomputed similarity measure is then used for clustering related signatures. The signatures are clustered to generate concepts and concepts are arranged in a concept hierarchy. The concept hierarchy automatically generates query for a particular concept and retrieves relevant documents associated with the concept.Type: ApplicationFiled: May 17, 2002Publication date: November 20, 2003Applicant: Verity, Inc.Inventors: Christina Yip Chung, Jinhui Liu, Alpha Luk, Jianchang Mao, Sumit Taank, Vamsi Vutukuru
-
Publication number: 20030177000Abstract: The present invention provides a method, system and computer program for naming a cluster, or a hierarchy of clusters, of words and phrases that have been extracted from a set of documents. The invention takes these clusters as the input and generates appropriate labels for the clusters using a lexical database. Naming involves first finding out all possible word senses for all the words in the cluster, using the lexical database; and then augmenting each word sense with words that are semantically similar to that word sense to form respective definition vectors. Thereafter, word sense disambiguation is done to find out the most relevant sense for each word. Definition vectors are clustered into groups. Each group represents a concept. These concepts are thereafter ranked based on their support. Finally, a pre-specified number of words and phrases from the definition vectors of the dominant concepts are selected as labels, based on their generality in the lexical database.Type: ApplicationFiled: March 12, 2002Publication date: September 18, 2003Applicant: Verity, Inc.Inventors: Jianchang Mao, Sumit Taank, Christina Chung, Alpha Luk
-
Publication number: 20030167295Abstract: The present invention provides a method, system and computer program to balance the computational and network load in networked computers using self-replicating programs, referred to as symbionts. The method presented here reduces hotspots by encapsulating a resource in a symbiont, and having a user access that symbiont through programs that host symbionts, referred to as hosts. When a host accesses a symbiont, it may replicate a copy of that symbiont resource on itself or may be redirected to some other replicate of the same symbiont. The host then offers the replicated resource on the network to alleviate the load experienced by the original symbiont's computer. If the load on a symbiont falls below a threshold, it is removed from the host on which it was hosted.Type: ApplicationFiled: March 1, 2002Publication date: September 4, 2003Applicant: Verity, Inc.Inventor: Kiam Choo
-
Patent number: 6567103Abstract: A system and method of creating a graphical presentation, such as a video, based on surfing the results of a web search. The graphical presentation may be constructed from the results of a search wherein each search result represents a URL and each URL is rendered as a graphical image of a web page (a frame) and stored in a file. When the file is viewed, it is displayed in a sequence of rendered frames wherein each frame is displayed for a variable, predetermined amount of time based on the relevance of the particular search result.Type: GrantFiled: August 2, 2000Date of Patent: May 20, 2003Assignee: Verity, Inc.Inventor: Abdul Chaudhry
-
Patent number: 6457047Abstract: An application caching system and method are provided wherein one or more applications may be cached throughout a distributed computer network. The system may include a central cache directory server, one or more distributed master application servers and one or more distributed application cache servers. The system may permit a service, such as a search, to be provided to the user more quickly.Type: GrantFiled: May 8, 2000Date of Patent: September 24, 2002Assignee: Verity, Inc.Inventors: Ashok Chandra, Neil LaTarche, Jianchang Mao, Prabhakar Raghavan
-
Patent number: 5778364Abstract: The invention enables evaluation of the content of a set of data to determine whether the data set satisfies one or more queries. The invention enables rapid evaluation of large numbers of data sets much more rapidly than has previously been possible, even when the number of queries is large and/or the queries are complex. The queries are evaluated using an execution plan of query terms that is constructed from one or more specified queries by translating each query term of each query into one or more evidence descriptors and one or more combination operators, and operably relating each of the combination operators to at least one of the evidence descriptors or other combination operators, such that each query is defined by one or more of the evidence descriptors and one or more of the combination operators that are operably related to each other. Preferably, none of the evidence descriptors or combination operators are duplicated in the execution plan.Type: GrantFiled: January 2, 1996Date of Patent: July 7, 1998Assignee: Verity, Inc.Inventor: Philip C. Nelson