Selection Or Weighting Of Terms For Indexing (EPO) Patents (Class 707/E17.084)
  • Publication number: 20130132401
    Abstract: Methods, systems, and computer programs are presented for providing internet content, such as related news articles. One method includes an operation for defining a plurality of candidates based on a seed. For each candidate, scores are calculated for relevance, novelty, connection clarity, and transition smoothness. The score for connection clarity is based on a relevance score of the intersection between the words in the seed and the words in each of the candidates. Further, the score for transition smoothness measures the interest in reading each candidate when transitioning from the seed to the candidate. For each candidate, a relatedness score is calculated based on the calculated scores for relevance, novelty, connection clarity, and transition smoothness. In addition, at least one of the candidates is selected based on their relatedness scores for presentation to the user.
    Type: Application
    Filed: November 17, 2011
    Publication date: May 23, 2013
    Applicant: Yahoo! Inc.
    Inventors: Taesup Moon, Zhaohui Zheng, Yi Chang, Pranam Kolari, Xuanhui Wang, Yuanhua Lv
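    A minimal sketch of how the four per-candidate scores in the abstract above might be combined into a relatedness score. The abstract does not say how the scores are combined, so the weighted sum, the weight values, and the names Candidate, WEIGHTS, relatedness, and select_related are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    title: str
    relevance: float
    novelty: float
    connection_clarity: float
    transition_smoothness: float


# Hypothetical weights: the abstract names the four scores but not their combination.
WEIGHTS = {
    "relevance": 0.4,
    "novelty": 0.2,
    "connection_clarity": 0.2,
    "transition_smoothness": 0.2,
}


def relatedness(candidate: Candidate) -> float:
    """Combine the four scores into one relatedness score (assumed weighted sum)."""
    return sum(weight * getattr(candidate, name) for name, weight in WEIGHTS.items())


def select_related(candidates, k=1):
    """Return the k candidates with the highest relatedness score."""
    return sorted(candidates, key=relatedness, reverse=True)[:k]
```

    Calling select_related(candidates, k=3) would return the three highest-scoring candidates for presentation.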
  • Publication number: 20130132407
    Abstract: Various embodiments of methods and apparatus for fitting a surface to a data set are disclosed. A frequency distribution of an input data set is determined. Determining the frequency distribution includes assigning each data point of the input data set to a category representing a value of a variable for the respective data point. Responsive to identifying one or more discontinuities of the frequency distribution, a continuous section of the frequency distribution is identified as a first data set. A first equation is fit to the first data set.
    Type: Application
    Filed: February 25, 2011
    Publication date: May 23, 2013
    Inventors: Balaji Krishnmurthy, Anubha Rastogi
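    The steps in the abstract above can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed method: a discontinuity is taken to be an empty bin, the "continuous section" is the longest run of consecutive non-empty bins, and the "first equation" is a least-squares line. The function names and the bin_width parameter are hypothetical.

```python
from collections import Counter


def frequency_distribution(values, bin_width=1.0):
    """Assign each data point to a category (bin index) and count the points per bin."""
    return Counter(int(v // bin_width) for v in values)


def longest_continuous_section(freq):
    """Return the longest run of consecutive non-empty bins as (bin, count) pairs.

    Assumption: an empty bin marks a discontinuity in the distribution.
    """
    best, run = [], []
    for b in sorted(freq):
        if run and b != run[-1] + 1:
            run = []  # a gap ends the current run
        run.append(b)
        if len(run) > len(best):
            best = list(run)
    return [(b, freq[b]) for b in best]


def fit_line(points):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx if sxx else 0.0
    return slope, mean_y - slope * mean_x
```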
  • Publication number: 20130117246
    Abstract: Examples of the present invention provide methods relating to identification of a portion of text data common with reference text data, the method including obtaining the text data and the reference text data, the text data and the reference text data comprising a number of lines of text, identifying one or more lines of text of the text data that are common to the lines of text of the reference text data, and for one or more further lines of text of the text data that are not common to the lines of text of the reference text data, comparing the line of text of the text data with a corresponding line of text of the reference data to identify one or more common characters of the line of text.
    Type: Application
    Filed: November 3, 2011
    Publication date: May 9, 2013
    Inventors: Sebastien Cabaniols, Nathalie Cabaniols-Viollet, Clément Poulain
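    A rough sketch of the two-stage comparison described in the abstract above: whole lines shared with the reference are found first, and each remaining line is compared character by character with its corresponding reference line. Treating "corresponding" as "same line index" and using Python's difflib.SequenceMatcher for the character comparison are assumptions; the function name common_portions is hypothetical.

```python
from difflib import SequenceMatcher


def common_portions(text, reference):
    """Return (line, kind, shared) tuples for each line of text.

    kind is "line" when the whole line also occurs in the reference, "chars"
    when only characters shared with the same-index reference line were found,
    and "none" when the reference has no line at that index.
    """
    text_lines = text.splitlines()
    ref_lines = reference.splitlines()
    ref_set = set(ref_lines)
    result = []
    for i, line in enumerate(text_lines):
        if line in ref_set:
            result.append((line, "line", line))
        elif i < len(ref_lines):
            matcher = SequenceMatcher(None, line, ref_lines[i], autojunk=False)
            shared = "".join(
                line[m.a:m.a + m.size] for m in matcher.get_matching_blocks()
            )
            result.append((line, "chars", shared))
        else:
            result.append((line, "none", ""))
    return result
```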
  • Publication number: 20130110849
    Abstract: In a general aspect, an approach to query specification includes processing a query by identifying one or more terms and validating the terms using a first corpus of media elements. The result of the validation is used to form a modified query. In some examples, identifying the one or more terms is based on a second corpus of media elements, which may include a different type of media than the first corpus. In some examples, the validating of the terms includes accepting input from a user according to parts of the elements of the first corpus presented to the user.
    Type: Application
    Filed: November 1, 2011
    Publication date: May 2, 2013
    Applicant: Nexidia Inc.
    Inventors: Robert W. Morris, Neeraj Singh Verma, John Willcutts, Marsal Gavalda
  • Publication number: 20130110847
    Abstract: A system and method determines ambiguous or missing information about map features, generates questions to address the ambiguity or the missing information and determines users from whom to request feedback to clarify the ambiguity or supply the missing information.
    Type: Application
    Filed: October 31, 2011
    Publication date: May 2, 2013
    Applicant: GOOGLE INC.
    Inventors: Arnaud Sahuguet, Bernhard Seefeld
  • Publication number: 20130110824
    Abstract: A custom search ranking model is configured using a base ranking model that is combined with one or more additional ranking features. A base ranking model that has already been configured and tuned is selected that serves as the base ranking model for a custom search ranking model. The additional ranking feature(s) to combine with the base ranking model may be manually/automatically identified. For example, a feature selection algorithm may be used to automatically identify ranking features that are likely to have a positive impact on results provided by the base search ranking model. A user may also know of the ranking feature(s) that they would like to add to the base ranking model. The custom search ranking model may also be evaluated by automatically creating a set of virtual queries for evaluation.
    Type: Application
    Filed: November 1, 2011
    Publication date: May 2, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Pedro Dantas DeRose, Vishwa Vinay, Dmitriy Meyerzon
  • Publication number: 20130097180
    Abstract: In one embodiment, a system includes one or more computing systems that implement a social networking environment and are operable to access stored information including a plurality of nodes including a first set of user nodes that each correspond to a respective user and a second set of concept nodes that each correspond to a respective concept. The system may generate a match coefficient for the user and concept, representing the degree of relevance of a particular concept node to a particular user node.
    Type: Application
    Filed: October 18, 2011
    Publication date: April 18, 2013
    Inventor: Erick Tseng
  • Publication number: 20130086084
    Abstract: A method of patent mapping comprises maintaining a database of patent portfolios and a database of patents, each patent stored in the database of patents associated with one or more of the patent portfolios; receiving a search query associated with a first patent portfolio; searching the first portfolio as a function of the search query; generating search results, the search results including one or more patent claims associated with the search query; generating a claim similarity index for at least one patent claim or portion thereof included in the search results, based on its similarity to at least one other patent claim or portion thereof in the search results; identifying, based on the similarity index, one or more patent claims included in the search results as primary targets to map a patent scope to; and mapping the one or more patent claims to the patent concept.
    Type: Application
    Filed: December 2, 2011
    Publication date: April 4, 2013
    Inventor: Steven W. Lundberg
  • Publication number: 20130086050
    Abstract: The present inventive subject matter relates to prior art analysis. Various embodiments of the present inventive subject matter include systems and methods for analyzing prior art in a patent portfolio and annuity management system. In an example embodiment, a method comprises maintaining a patent matter database and a database of prior art documents including data about the prior art documents such as the priority or publication dates of the documents. A keyword analysis is performed on a given patent matter and associated prior art documents to identify keywords occurring uniquely in the first patent matter as potential claim elements differentiating the patent matter over the disclosures contained in the one or more prior art documents.
    Type: Application
    Filed: October 5, 2011
    Publication date: April 4, 2013
    Inventor: Steven W. Lundberg
  • Publication number: 20130086076
    Abstract: Techniques are described for refining the manual classification of assets classified or categorized using the terms of a business glossary. A semantic refinement mechanism is used to refine the manual classification of such assets, as well as subsequently evaluate the refined asset classifications. Further, the refined asset classifications may be used as a training set for a machine learning classifier. That is, should the classification of an asset contributing to a refinement change, the refinement based on that classification may be undone, at least in some cases.
    Type: Application
    Filed: September 30, 2011
    Publication date: April 4, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: SUSHAIN PANDIT, CHARLES K. SHANK, CHARLES D. WOLFSON
  • Publication number: 20130080197
    Abstract: Various embodiments of systems and methods for evaluating a trust value for a report are disclosed herein. The method includes obtaining (110) one or more reports 270 by the computer 260, where the reports 270 are formed of one or more fields of data. An end-to-end lineage for the data is determined to trace the data back to the data source system 210, 211, and/or 212 from which the data had originated initially. Further, the method includes validating each of the multiple data source systems 210, 211, and 212 including intermediate tables, and determining (130) a data quality score for each of the multiple data source systems 210, 211, and 212. A trust value for the report 270 is calculated (140) based on the data quality scores for the one or more data source systems 210, 211, and 212 and intermediate tables, and rendered along with the report.
    Type: Application
    Filed: September 22, 2011
    Publication date: March 28, 2013
    Inventors: DAVID KUNG, Marc Maillart, Suryanarayana Mangipudi, Aun-Khuan Tan
  • Publication number: 20130080450
    Abstract: A method and system for providing a research relation service are provided. The method includes the steps of (a) acquiring research technologies of a specific conduct agent; (b) acquiring comparison conduct agents for each of the acquired research technologies; (c) obtaining a research relation between a comparison conduct agent and the specific conduct agent; and (d) sorting and providing the comparison conduct agents based on the research relation. Therefore, according to the present invention, research relations such as research similarity, research cooperativeness and research competitiveness between conduct agents performing researches can be provided.
    Type: Application
    Filed: June 9, 2012
    Publication date: March 28, 2013
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Han Min JUNG, Mi Kyoung LEE, Pyung KIM, Seung Woo LEE, Dong Min SEO, Jin Hyung KIM, Jin Hee LEE, Won Kyung SUNG
  • Publication number: 20130080444
    Abstract: Chart recommendations may be provided. First, a summary of a dataset may be determined and each column and row in the dataset, based on the summary, may be classified into classifications. Next, based upon the classifications of each column and row in the dataset, the dataset may be mapped to a plurality of chart types. Each of the plurality of chart types may then be ranked.
    Type: Application
    Filed: September 26, 2011
    Publication date: March 28, 2013
    Applicant: Microsoft Corporation
    Inventors: Robin Wakefield, Nick Chiang
  • Publication number: 20130073358
    Abstract: Systems, methods, and computer-readable and executable instructions are provided for recommending print locations. Recommending print locations can include receiving recommendation, user, and geographic information for a first mobile print location (MPL) and a second MPL. Recommending print locations can also include indexing the recommendation, user, and geographic information for each of the first MPL and the second MPL as an automatically created uniform resource indicator (URI) on an MPL system. Furthermore, recommending print locations can include ranking the first MPL and the second MPL based on the recommendation, user, and geographic information and presenting a list, via a user interface, of the ranked first MPL and the second MPL.
    Type: Application
    Filed: September 15, 2011
    Publication date: March 21, 2013
    Inventors: Thomas E. Sandholm, Jose Diego Ferreira Martins, Deny Joao Correa Azzolin
  • Publication number: 20130073561
    Abstract: Described herein are methods, systems, apparatuses and products for random sampling from distributed streams. An aspect provides a method for distributed sampling on a network with a plurality of sites and a coordinator, including: receiving at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; comparing the weight of the data element received with a global value stored at the coordinator; and performing one of: updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites. Other embodiments are disclosed.
    Type: Application
    Filed: September 16, 2011
    Publication date: March 21, 2013
    Applicants: IOWA STATE UNIVERSITY RESEARCH FOUNDATION, INC., INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: David P. Woodruff, Srikanta N. Tirthapura
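    The message exchange in the abstract above can be sketched for a sample of size one. This is a simplified illustration, assuming weights drawn uniformly from [0, 1) and assuming the element with the smallest weight is kept; the class names Coordinator and Site and the messages_sent counter are hypothetical.

```python
import random


class Coordinator:
    def __init__(self):
        self.global_weight = 1.0  # assumed: smallest weight seen so far
        self.sample = None

    def receive(self, element, weight):
        """Compare the reported weight with the global value.

        Either the global value is updated to the new weight, or the unchanged
        global value is sent back so the site can refresh its stale local copy.
        """
        if weight < self.global_weight:
            self.global_weight = weight
            self.sample = element
        return self.global_weight


class Site:
    def __init__(self, coordinator, rng=None):
        self.coordinator = coordinator
        self.local_weight = 1.0  # locally stored (possibly stale) global value
        self.rng = rng or random.Random()
        self.messages_sent = 0

    def observe(self, element):
        weight = self.rng.random()  # weight randomly associated with the element
        if weight < self.local_weight:  # deemed reportable at the site
            self.messages_sent += 1
            self.local_weight = self.coordinator.receive(element, weight)
```

    Because a site only reports elements that beat its local copy of the global value, most elements never leave the site once the global value has become small.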
  • Publication number: 20130066885
    Abstract: A system and method for generating a Popularity Score for content objects in computer information systems based at least on user input. The system and method is functional in both binary (likes/dislikes) and ranked (numbered, star) rating systems. The Popularity Score utilizes the percentage of users that expressed a favorable opinion of the content object, as well as the total number of expressed user opinions to provide a more meaningful measure of the overall user likeability or appeal of the content object than systems that simply utilize user “likes” and “dislikes”. The system and method also generate Popularity Score Trends over various flexible time ranges that allow users to search, sort, and/or list content objects based on popularity over the selected date ranges.
    Type: Application
    Filed: July 16, 2012
    Publication date: March 14, 2013
    Applicant: BINGE, LLC
    Inventor: Christopher S. Komuves
  • Publication number: 20130060763
    Abstract: A request can be received and a request reading level representation for the request can be inferred. In response to the request, the request reading level representation can be compared with one or more reading difficulty level representations for one or more response items. Also in response to the request, one or more indications of results of comparing the request reading level representation with one or more reading difficulty level representations for the one or more response items can be returned. The indication(s) may include a ranking of the response items. The ranking can be based at least in part on a request reading level representation for the query and reading difficulty level representations for the response items. The response item(s) may also be returned.
    Type: Application
    Filed: September 6, 2011
    Publication date: March 7, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Sebastian de la Chica, Kevyn B. Collins-Thompson, Paul N. Bennett, David Alexander Sontag, Ryen W. White
  • Publication number: 20130060744
    Abstract: Aspects of the subject matter described herein relate to social event searching. In aspects, a search engine may receive a query regarding a social event. The search engine may obtain static data that indicates a ranking of the event based on overall popularity and may change the ranking based on social data that is particular to the user issuing the query. The search results may be ordered by the ranking and returned together with other social data to a search component such as a Web browser. The Web browser may then display the results together with the other social data. The Web browser may receive additional input from the user regarding preferences and may provide the input to a backend system for use to satisfy subsequent social event queries.
    Type: Application
    Filed: September 7, 2011
    Publication date: March 7, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Subrata Roychoudhuri, Sarabjit Singh Seera, Phanindra Kanumuri, Suresh Iyengar, Pratik Stephen
  • Publication number: 20130054618
    Abstract: Systems and methods are provided for maintaining a dynamic profile slice of a user profile of a user. In one embodiment, a real-time user-generated context of the user is monitored over time to accumulate keywords in the dynamic profile slice of the user that are representative of dynamic interests of the user. Weights are assigned to the keywords in the dynamic profile slice using a time and/or location weighting function.
    Type: Application
    Filed: October 30, 2012
    Publication date: February 28, 2013
    Applicant: WALDECK TECHNOLOGY, LLC
    Inventor: Waldeck Technology, LLC
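    One possible time weighting function for the keywords in a dynamic profile slice is exponential decay, sketched below. The abstract does not specify the function, so the decay form, the half_life parameter, and the name keyword_weights are assumptions; a location term could be added in the same way.

```python
def keyword_weights(observations, now, half_life=3600.0):
    """Sum exponentially decayed contributions per keyword.

    observations is an iterable of (keyword, timestamp) pairs, with timestamps
    and half_life in the same unit (seconds here). An observation made exactly
    one half-life ago contributes 0.5; one made right now contributes 1.0.
    """
    weights = {}
    for keyword, timestamp in observations:
        age = max(0.0, now - timestamp)
        weights[keyword] = weights.get(keyword, 0.0) + 0.5 ** (age / half_life)
    return weights
```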
  • Publication number: 20130054615
    Abstract: A system and method for providing automatic high-value listing feeds for online computer users is disclosed. A particular embodiment includes obtaining publisher information corresponding to a plurality of publisher content items from a plurality of publisher sites; obtaining merchant information including value information corresponding to the plurality of publisher content items; using a processor, the publisher information, and the merchant information to generate a set of high-value feeds for transfer to the plurality of publisher sites, the set of high-value feeds each being ranked corresponding to a quality score computed for each listing item of each high-value feed; and transferring the set of high-value feeds to corresponding publisher sites of the plurality of publisher sites.
    Type: Application
    Filed: August 25, 2011
    Publication date: February 28, 2013
    Applicant: eBay Inc.
    Inventors: Shaobo Liu, Tom Tang
  • Publication number: 20130054620
    Abstract: A system for organizing a plurality of candidates based on the relative similarity of a first candidate with respect to the remaining plurality of candidates is disclosed. The system includes a controller in communication with a storage device configured to receive and accessibly store a generated plurality of candidate images. The controller is operable to analyze each of the plurality of candidate images to determine a numeric thumbnail based on a number of identified features in each of a plurality of grid elements of an array and a sum total number of all identified features in the array; calculate a similarity score between one of the plurality of determined numeric thumbnails and each of the remaining plurality of determined numeric thumbnails; and generate a logical group image order as a function of the highest similarity score between the one of the plurality of determined numeric thumbnails and each of the remaining plurality of determined numeric thumbnails.
    Type: Application
    Filed: August 29, 2011
    Publication date: February 28, 2013
    Applicant: DST Technologies, Inc.
    Inventor: Kevin W. Stokes
  • Publication number: 20130046771
    Abstract: Systems and methods (e.g., utilities) for use in providing automated, lightweight collection of online, open source data which may be content-based to reduce website source bias. In one aspect, a utility is disclosed for use in extracting content of interest from at least one website or other online data source (e.g., where the extracted content can be used in a subsequent search query). In other aspects, utilities are disclosed that are operable to perform various types of analyses on such extracted content and present graphical representations of such analyses on a display of a client device.
    Type: Application
    Filed: August 15, 2011
    Publication date: February 21, 2013
    Applicant: Lockheed Martin Corporation
    Inventors: Abha Moitra, David Brian Bracewell, Steven Matt Gustafson, T. Michael Baylor, Tina H. Chau
  • Publication number: 20130024464
    Abstract: Embodiments of the invention relate to a computer-implemented method for generating explanatory data from a personalized recommendations process for a primary user based at least on stored data about the primary user. The method comprises a server computer obtaining data related to one or more users who are relevant to the primary user, then determining at least one group of users relevant to the primary user. The server computer also obtains data related to one or more entities, determines one or more entities relevant to the primary user, and associates the at least one relevant group of users with the one or more relevant entities. One or more potential candidate factors are generated. A set of factors are selected from the one or more potential candidate factors, wherein the potential candidate factors are used as explanatory data to determine recommendations to the primary user.
    Type: Application
    Filed: July 19, 2012
    Publication date: January 24, 2013
    Applicant: Ness Computing, Inc.
    Inventors: Christopher Eric Shogo Berner, Jeremy Ryan Schiff, Corey Layne Reese, Paul Kenneth Twohey
  • Publication number: 20130018871
    Abstract: The most common automated search methods produce less-than-ideal results when searching online resumes, profiles, and the like (“biographies”) for the identities of people with a searcher-selected qualification (“candidates”). Keywords, their proximities, and their repetitions are less informative in biographies than in other informational documents. Similarly, chains of social connection (“referral paths”) do not always reveal the likelihood or ease of a searcher's introduction to a candidate. In both cases, the display order of results may be unrelated to any estimate of merit. To answer the question “Whom do I need and how do I reach them?” a classifier system uses heuristics or algorithms adapted to match the reactions of human experts on the selected qualifications. Terms in biographies, regardless of structure, are standardized and disambiguated for accurate comparisons, meaningful context is preserved, and biographies and referral paths are scored based on expected usefulness to the searcher.
    Type: Application
    Filed: July 13, 2011
    Publication date: January 17, 2013
    Applicant: NIMBLECAT, INC.
    Inventors: Sunil Mehta, David Meyer, Poonam Murgai
  • Publication number: 20130013613
    Abstract: A requirements management tool, where each requirement is defined by one or more design element values. Each of the design element values is a unique value and is a member of a group of design element values defined for the project. As each requirement is created, the design element values for the requirement are selected from the group of design element values, or alternatively, a new design element value may be entered by a user, and the new design element value will be added to the group of design element values. Each design element value corresponds to a category into which each of the requirements is broken down. Design element values in the created requirement are compared to design element values in existing requirements, and results of this duplication check are presented to a user of the requirements management tool.
    Type: Application
    Filed: July 5, 2011
    Publication date: January 10, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Maneesh Kumar Sharma
  • Publication number: 20130007011
    Abstract: An approach is provided for providing an ordering metric for a multi-dimensional contextual query. An ordering platform determines a multi-dimensional query associated with at least one user device, wherein the multi-dimensional query specifies, at least in part, one or more personas, one or more contexts, or a combination thereof associated with the at least one user device. The ordering platform further causes, at least in part, an execution of the multi-dimensional query on at least one context-sensitive database to generate one or more results. The ordering platform further determines at least one ordering metric for the one or more results based, at least in part, on one or more user contextual attributes of the at least one user device.
    Type: Application
    Filed: June 29, 2011
    Publication date: January 3, 2013
    Applicant: Nokia Corporation
    Inventors: Vidya Setlur, Agathe Battestini
  • Publication number: 20120330977
    Abstract: Techniques provide for searching pieces of document data using a search keyword. The technique includes: calculating, as a first vector, respective first scores at which or respective probabilities that each of the pieces of document data belongs to clusters or classes; calculating, as a second vector, respective second scores at which or respective probabilities that the search keyword or a relevant keyword associated with the search keyword belongs to the clusters or the classes; calculating an inner product of each of the first vectors and the second vector, the calculated inner product being a third score of the corresponding piece of document data regarding the search keyword; and acquiring a correlation value from document data containing each keyword in a classification keyword set and document data with the third score that is equal to or more than a predetermined threshold or is included in a predetermined high-ranking proportion.
    Type: Application
    Filed: September 6, 2012
    Publication date: December 27, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Takeshi INAGAKI
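    The third score in the abstract above is an inner product of two membership vectors, which can be sketched as follows. The function names, the dictionary layout, and the use of a fixed threshold (rather than a high-ranking proportion) are illustrative assumptions.

```python
def inner_product(u, v):
    """Inner product of two equal-length score or probability vectors."""
    return sum(a * b for a, b in zip(u, v))


def score_documents(doc_vectors, keyword_vector, threshold):
    """Score each document against the keyword and keep those at or above threshold.

    doc_vectors maps a document id to its first vector (cluster or class
    membership scores); keyword_vector is the second vector for the keyword.
    """
    scores = {
        doc_id: inner_product(vector, keyword_vector)
        for doc_id, vector in doc_vectors.items()
    }
    return {doc_id: s for doc_id, s in scores.items() if s >= threshold}
```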
  • Publication number: 20120330976
    Abstract: A relationship information expansion apparatus capable of acquiring a new relationship based on a relationship information piece including two or more language expressions having a semantic relationship is provided. The relationship information expansion apparatus generates a candidate expanded relationship information piece in which at least one language expression included in the relationship information piece was replaced with a similar language expression, and acquires a score that indicates a probability that the candidate expanded relationship information piece has a semantic relationship. The relationship information expansion apparatus selects an expanded relationship information piece, which is a candidate expanded relationship information piece having a high score among candidate expanded relationship information pieces.
    Type: Application
    Filed: January 5, 2011
    Publication date: December 27, 2012
    Applicant: National Institute of Information and Communications Technology
    Inventors: Masaaki Tsuchida, Stijn De Saeger, Kentaro Torisawa, Masaki Murata, Junichi Kazama, Kow Kuroda
  • Publication number: 20120323929
    Abstract: A plurality of indicators representing a plurality of respective candidate database configurations may be obtained, each of the candidate database configurations including a plurality of database queries and a plurality of candidate database indexes associated with a database table. A portion of the candidate database indexes included in the plurality of database indexes may be selected based on skyline selection. An enumeration of the portion of the plurality of the candidate database indexes may be determined based on a greedy algorithm.
    Type: Application
    Filed: June 17, 2011
    Publication date: December 20, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Hideaki Kimura, Vivek Narasayya, Manoj Syamala
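    A loose sketch of the two techniques named in the abstract above: skyline selection followed by a greedy choice. The abstract gives no criteria, so the dominance rule (size and benefit), the storage budget, the benefit-per-size ordering, and the names skyline and greedy_pick are all assumptions for illustration.

```python
def skyline(indexes):
    """Keep the candidate indexes that no other candidate dominates.

    Assumed dominance rule: a dominates b when a is no larger, gives no less
    benefit, and is strictly better on at least one of the two.
    """
    def dominates(a, b):
        return (
            a["size"] <= b["size"]
            and a["benefit"] >= b["benefit"]
            and (a["size"] < b["size"] or a["benefit"] > b["benefit"])
        )

    return [c for c in indexes if not any(dominates(o, c) for o in indexes)]


def greedy_pick(indexes, budget):
    """Greedily pick indexes by benefit per unit size until the budget is used up."""
    chosen, used = [], 0
    ranked = sorted(indexes, key=lambda i: i["benefit"] / i["size"], reverse=True)
    for idx in ranked:
        if used + idx["size"] <= budget:
            chosen.append(idx["name"])
            used += idx["size"]
    return chosen
```

    A typical call under these assumptions would be greedy_pick(skyline(candidates), budget), so the greedy step only considers non-dominated candidates.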
  • Publication number: 20120317125
    Abstract: A method for identifier retrieval. The method can include the steps of: extracting candidate identifiers from a data source according to a source identifier; obtaining a profile of the source identifier and profiles of the candidate identifiers from the data source; and selecting a target identifier associated with the source identifier from the candidate identifiers according to the profile of the source identifier and the profiles of the candidate identifiers. The method may efficiently, accurately and rapidly find a target identifier associated with a source identifier.
    Type: Application
    Filed: August 21, 2012
    Publication date: December 13, 2012
    Applicant: International Business Machines Corporation
    Inventors: Sheng Hua Bao, HongLei Guo, Zhong Su, Jian Yao, Li Zhang, Shuo Zhang, Hui Jia Zhu
  • Publication number: 20120317087
    Abstract: A training system is described for generating at least one ranking model using features derived, in part, from region information. The region information encodes characteristics about regions which are associated with queries in search log data. A query processing system is also described for applying the ranking model generated by the training system to process queries in real time. In one implementation, the training system can also generate plural ranking models corresponding to plural respective map areas. The training system can also generate a mapping model which correlates each region with a ranking model to be applied when processing queries that originate from that region. The query processing system can process a query by determining a region associated with the query and then identifying and applying a ranking model which corresponds to the region.
    Type: Application
    Filed: June 7, 2011
    Publication date: December 13, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Dimitrios Lymberopoulos, Arnd C. Konig, Peixiang Zhao, Klaus L. Berberich, Jie Liu
  • Publication number: 20120317008
    Abstract: Systems and methods for storing transaction data associated with transactions of disparate types are provided. Transaction data is received describing a transaction that has occurred, the transaction being performed by a customer of a particular customer type and the transaction being performed using a channel of a particular channel type. Transaction data about the customer is stored in a customer segment according to one of a plurality of customer templates, the one of the plurality of customer templates being selected according to the customer type. Transaction data about the channel is stored in a channel segment according to one of a plurality of channel templates, the one of the plurality of channel templates being selected according to the channel type. Data from the customer segment, the activity segment, and the channel segment for the transaction is extracted and scored by a predictive model.
    Type: Application
    Filed: June 13, 2011
    Publication date: December 13, 2012
    Inventors: Revathi Subramanian, Ho Ming Luk, Brian Lee Duke, Paul C. Dulany
  • Publication number: 20120317101
    Abstract: A method and system for qualifying keyword(s) or phrase(s) to formulate a query string for submitting a search request when the query string contains one or more keywords that may have multiple meanings associated therewith. Database information containing keywords and associated meanings or forms of the keywords is maintained and a requester is prompted to identify one or more of the meanings of a keyword when building the query string. One or more advertisements pertaining to the associated meanings or forms of the keywords in the query string is presented to the requester submitting a search request.
    Type: Application
    Filed: July 25, 2012
    Publication date: December 13, 2012
    Applicant: CHACHA SEARCH, INC.
    Inventors: Scott A. Jones, Thomas E. Cooper
  • Publication number: 20120317124
    Abstract: A database may be virtually partitioned into virtual partitions. The virtual partitions are mapped to physical databases of a database. Data records added to the database are each assigned to a virtual partition and stored in the physical database mapped to the assigned virtual partition. The identifier generated for a data record includes an identifier of the assigned virtual partition. When additional databases are created, virtual partitions are remapped to the larger space of physical databases.
    Type: Application
    Filed: August 17, 2012
    Publication date: December 13, 2012
    Applicant: GOOGLE INC.
    Inventors: David L. Butcher, Dan Moisa, Wendy Tobagus, Sunil Kosalge
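The virtual-partition scheme in the abstract above can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration, not the applicant's implementation: the class name `VirtualPartitionedStore`, the round-robin assignment of records to virtual partitions, the `"<partition>-<sequence>"` identifier format, the modulo mapping, and the choice to migrate records when a partition is remapped.

```python
class VirtualPartitionedStore:
    """Toy store: records go to virtual partitions, which map to physical databases."""

    def __init__(self, num_virtual, num_physical):
        self.num_virtual = num_virtual
        # Each physical database is modeled as a plain dict (assumption).
        self.databases = [dict() for _ in range(num_physical)]
        # Virtual partition -> index of the physical database that holds it.
        self.mapping = {vp: vp % num_physical for vp in range(num_virtual)}
        self._next_seq = 0

    def add(self, record):
        """Assign the record to a virtual partition and store it; return its id."""
        seq = self._next_seq
        self._next_seq += 1
        vp = seq % self.num_virtual      # round-robin assignment (assumption)
        record_id = f"{vp}-{seq}"        # the id embeds the virtual partition
        self.databases[self.mapping[vp]][record_id] = record
        return record_id

    def get(self, record_id):
        """Locate a record using only the partition encoded in its id."""
        vp = int(record_id.split("-", 1)[0])
        return self.databases[self.mapping[vp]][record_id]

    def add_database(self):
        """Create one more physical database and remap virtual partitions onto it."""
        self.databases.append({})
        count = len(self.databases)
        for vp in range(self.num_virtual):
            old_db, new_db = self.mapping[vp], vp % count
            if new_db == old_db:
                continue
            prefix = f"{vp}-"
            moved = {k: v for k, v in self.databases[old_db].items()
                     if k.startswith(prefix)}
            for key in moved:
                del self.databases[old_db][key]
            self.databases[new_db].update(moved)
            self.mapping[vp] = new_db
```

Because each identifier carries its virtual partition rather than a physical location, identifiers issued before `add_database()` remain valid afterwards; only the partition-to-database mapping changes.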
  • Publication number: 20120310930
    Abstract: Given a set of documents relevant to a litigation hold and a seed set of keywords, a second set of keywords can be generated and suggested to a user. Each document in a training set of documents is given an indication of relevance. Based on the indication of relevance, a set of further keywords relevant to the litigation is extracted from the documents and suggested to a user. The suggested set of keywords may or may not include keywords in the seed set. Additionally, the suggested set of keywords may be related to the seed set of keywords.
    Type: Application
    Filed: August 29, 2011
    Publication date: December 6, 2012
    Applicant: Google Inc.
    Inventors: Shailesh Kumar, Mahesh Chhaparia
  • Publication number: 20120310929
    Abstract: In one embodiment, a computing device may access a search query provided by a user; identify a set of search results in response to the search query, wherein one or more search results in the set are associated with a feature of a social-networking system; rank the set of search results based on one or more factors; boost one or more ranks of the one or more search results associated with the feature to bring the feature to the user's attention; and present the set of search results to the user in order of its ranking.
    Type: Application
    Filed: June 3, 2011
    Publication date: December 6, 2012
    Inventors: Ryan Patterson, Michael Dudley Johnson, Erick Tseng
  • Publication number: 20120296918
    Abstract: The subject disclosure is directed towards using credibility-related data in conjunction with servicing a web request such as a search query or a request for page content. The credibility-related data may be used to convey information to a user indicative of a level of credibility, such as to view credibility information with each search result, or in association with returned web page content. The credibility-related data may be used to rank, re-rank and/or filter search results. Also described is extracting credibility-related feature data from search-related data and web pages, and using the feature data with a dataset of credibility-rated pages to learn/train relative feature weights in a credibility model used by the search engine.
    Type: Application
    Filed: May 18, 2011
    Publication date: November 22, 2012
    Applicant: Microsoft Corporation
    Inventors: Meredith June Morris, Julia Schwarz
  • Publication number: 20120284270
    Abstract: A method for detecting similar documents includes extracting an entity from each of a first web document and a second web document; determining an importance contribution element corresponding to each of the web documents; calculating, using the processor, weights for each entity based on the determined importance contribution elements; and determining whether the web documents are similar documents based on the calculated weights. A device to detect similar documents includes a storage device; an entity extractor stored on the storage device and configured to extract an entity from a first web document and a second web document and to determine an importance contribution element from each of the web documents; a weight calculator configured to calculate weights of each entity based on the determined importance contribution elements; and a similar document detection unit configured to determine whether the web documents are similar documents based on the calculated weights.
    Type: Application
    Filed: May 2, 2012
    Publication date: November 8, 2012
    Applicant: NHN CORPORATION
    Inventors: Chae Hyun LEE, Dong Yun SIM
  • Publication number: 20120278318
    Abstract: In accordance with some embodiments, processes and interfaces provide for enhancing search results of a group research project. For example, members of a group may be provided with information regarding other group members' search activities and/or be restricted from viewing certain search results (e.g., search results that are most popular with the public for a given search term, that are most popular with the group for the given search term or for the project, and/or search results that are restricted by a group manager).
    Type: Application
    Filed: December 15, 2011
    Publication date: November 1, 2012
    Inventor: Alan M. Reznik
  • Publication number: 20120278314
    Abstract: Queries submitted by users when interacting with a network-based system may be analyzed by a query mining machine that determines a theme common to the submitted queries. The machine accesses the submitted queries and identifies a portion of the submitted queries as corresponding to the theme. Identification of the portion may include determining a strength score of a submitted query, where the strength score indicates a degree of influence that the submitted query is to have on the identifying of the portion. The machine generates a thematic query based on the identified portion and obtains search results by executing the thematic query. The search results correspond to a group of items. The machine presents at least some of the group of items as a collection that corresponds to the theme (e.g., within a temporary electronic storefront).
    Type: Application
    Filed: April 26, 2011
    Publication date: November 1, 2012
    Applicant: eBay Inc.
    Inventors: Neelakantan Sundaresan, Karin Maugé
  • Publication number: 20120265772
    Abstract: Technologies for recommending relevant tags for the tagging of media based on one or more initial tags provided for the media and based on a large quantity of other tagged media. Sample media as candidates for recommendation are provided by a set of weak rankers based on corresponding relevance measures in semantic and visual domains. The various samples provided by the weak rankers are then ranked based on relative order to provide a list of recommended tags for the media. The weak rankers provide sample tags based on relevance measures including tag co-occurrence, tag content correlation, and image-conditioned tag correlation.
    Type: Application
    Filed: June 29, 2012
    Publication date: October 18, 2012
    Applicant: Microsoft Corporation
    Inventors: Linjun Yang, Lei Wu, Xian-Sheng Hua
  • Publication number: 20120259864
    Abstract: Embodiments of the invention provide for optimizing work site utilization within a business entity or the like. Work site optimization is realized by ranking a predetermined group of leased work sites relative to their feasibility for exiting and making decisions based on the rankings. The ranking is based on an automated and qualitative scoring of the work sites. Additionally, embodiments of the invention account for the potential use of off-work site employees and, as such, the optimization rankings that are provided serve to predict future work site needs.
    Type: Application
    Filed: April 6, 2011
    Publication date: October 11, 2012
    Applicant: BANK OF AMERICA CORPORATION
    Inventors: William C. Jetton, Thomas R. Kurtz, Glenn McNairy, Benjamin T. Teal
  • Publication number: 20120254166
    Abstract: In an electronic discovery search tool, non-substantive information, such as signatures in e-mail, can bias a search tool and add processing time. A method and system for identifying recurring non-substantive text in documents has been developed so that non-substantive text may be processed or ignored by the search tool, as needed.
    Type: Application
    Filed: August 29, 2011
    Publication date: October 4, 2012
    Applicant: Google Inc.
    Inventors: Gaurav AGARWAL, Shailesh Kumar
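One simple way to picture the idea in the abstract above is to treat a line as non-substantive when it recurs across many documents. The following Python sketch does this under stated assumptions: the function names `recurring_lines` and `strip_recurring`, the line-level granularity, and the `min_fraction` threshold are hypothetical choices for illustration and are not taken from the application.

```python
from collections import Counter


def recurring_lines(documents, min_fraction=0.5):
    """Return the lines that occur in at least min_fraction of the documents."""
    doc_freq = Counter()
    for doc in documents:
        # Count each distinct non-empty line once per document.
        lines = {line.strip() for line in doc.splitlines() if line.strip()}
        doc_freq.update(lines)
    threshold = min_fraction * len(documents)
    return {line for line, n in doc_freq.items() if n >= threshold}


def strip_recurring(doc, recurring):
    """Drop the recurring lines from a single document before indexing it."""
    return "\n".join(
        line for line in doc.splitlines() if line.strip() not in recurring
    )


emails = [
    "Hi Bob\nSee attached.\n--\nJane Doe\nAcme Corp",
    "Status update\n--\nJane Doe\nAcme Corp",
    "Lunch?\n--\nJane Doe\nAcme Corp",
]
signature = recurring_lines(emails)
cleaned = strip_recurring(emails[0], signature)
```

In this toy run the three signature lines recur in every message, so `cleaned` keeps only the body of the first e-mail. A search tool could equally keep the recurring text and merely down-weight it, which matches the abstract's "processed or ignored ... as needed".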
  • Publication number: 20120254191
    Abstract: A method and a system for summarizing a concept are provided. A query corresponding to a concept is received from a user. A plurality of images and corresponding descriptive information may be collected based on the query. The plurality of images and the descriptive information may be processed to form feature vectors and processed descriptive information respectively. Further, one or more topics may be identified for the plurality of images. Each of the plurality of images may be assigned one or more topic distribution values corresponding to the one or more topics. The one or more topics correspond to the processed descriptive information. A sparse set of images may be determined based on the feature vectors and the assigned topic distribution values, to summarize the concept. Also, a target summary may be built from the summarized concept, by regularizing one or more distribution constraints.
    Type: Application
    Filed: April 1, 2011
    Publication date: October 4, 2012
    Applicant: Yahoo! Inc.
    Inventors: Subhajit SANYAL, Dhruv Kumar Mahajan, Sundararajan Sellamanickam
  • Publication number: 20120254148
    Abstract: Multiple search indexes can be served from a common set of resources. Instead of requiring a processor to be dedicated to serving a single search index, a processor can provide responsive documents for search queries that are based on different ranking algorithms and/or different sets of documents.
    Type: Application
    Filed: March 28, 2011
    Publication date: October 4, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: JIANYONG XIAO, YI LI, YANBIAO ZHAO, XUN KANG, PIN LU, ASHISH CONSUL
  • Publication number: 20120246175
    Abstract: Methods and systems for determining schema element types are shown that include pooling potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring the pool of potential annotations according to relevancy using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating the element of the unlabeled schema using the most relevant potential annotations.
    Type: Application
    Filed: March 23, 2011
    Publication date: September 27, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Songyun Duan, Achille B. Fokoue-Nkoutche, Oktie Hassanzadeh, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J. Ward
  • Publication number: 20120226700
    Abstract: User data and a plurality of micro-segment definitions are received. Each micro-segment definition in the plurality of micro-segment definitions corresponds to one or more offers in an offer provider campaign. Further, a micro-segment parser parses each micro-segment definition from the plurality of micro-segment definitions into a plurality of parsed expression segments that indicate a plurality of micro-segment condition rules. In addition, a compiler compiles the plurality of parsed expression segments into an executable object that indicates a plurality of instructions to determine if the user data matches the plurality of micro-segment definitions. Each micro-segment definition is also serially processed, with a sequential evaluation engine, to apply the plurality of micro-segment condition rules to the user data to determine a match of a user belonging to a micro-segment. Further, the sequential evaluation engine assigns a score to indicate the strength of each match.
    Type: Application
    Filed: March 2, 2011
    Publication date: September 6, 2012
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventors: Walter Chang, Geoff Baum
  • Publication number: 20120226698
    Abstract: A system and method for providing an information repository that optimizes profiles of sensory characteristics of food or drink products. The system receives user preferences or search criteria of similar sensory characteristics to match against food or drink products in a database with a very high degree of certainty or accuracy. The system and method also provide personalization to users, i.e. personal recommendations based on personal preferences, as well as product matching processes.
    Type: Application
    Filed: March 3, 2011
    Publication date: September 6, 2012
    Inventors: OLIVIER SILVESTRE, Pierre Huguet
  • Publication number: 20120221572
    Abstract: Systems and methods are disclosed to search for a query image, by detecting local invariant features and local descriptors; retrieving best matching images by quantizing the local descriptors with a vocabulary tree; and reordering retrieved images with results from the vocabulary tree quantization.
    Type: Application
    Filed: December 28, 2011
    Publication date: August 30, 2012
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Xiaoyu Wang, Ming Yang, Timothee Cour, Shenghuo Zhu, Kai Yu
  • Publication number: 20120215791
    Abstract: Systems and techniques for exploring relationships among entities are disclosed. The systems and techniques provide an entity-based information analysis and content aggregation platform that uses heterogeneous data sources to construct and maintain an ecosystem around tangible and logical entities. Entities are represented as vertices in a directed graph, and edges are generated using entity co-occurrences in unstructured documents and supervised information from structured data sources. Significance scores for the edges are computed using a method that combines supervised, unsupervised and temporal factors into a single score. Important entity attributes from the structured content and the entity neighborhood in the graph are automatically summarized as the entity fingerprint. Entities may be compared to one another based on similarity of their entity fingerprints. An interactive user interface is also disclosed that provides exploratory access to the graph and supports decision support processes.
    Type: Application
    Filed: August 19, 2011
    Publication date: August 23, 2012
    Inventors: Hassan H. Malik, Mans Olof-Ors, Ian MacGillivray