Based On Record Similarity And Relevance Patents (Class 707/749)
-
Patent number: 8775441Abstract: In one aspect, in general, a method is described for managing an archive. The archive is used for determining approximate matches associated with strings occurring in records. The method includes processing records to determine a set of string representations that correspond to strings occurring in the records. The method also includes generating, for each of at least some of the string representations in the set, a plurality of close representations that are each generated from at least some of the same characters in the string. The method also includes storing entries in the archive. Each stored entry represents a potential approximate match between at least two strings based on their respective close representations.Type: GrantFiled: January 16, 2008Date of Patent: July 8, 2014Assignee: Ab Initio Technology LLCInventor: Arlen Anderson
-
Publication number: 20140188902Abstract: A method and system for ranking an object that contains one or more keywords is disclosed. All linking objects are retrieved that contain a keyword of interest. Each linking object links to the object to be ranked. The locations of each link in each linking object are determined relative to the keywords in the linking objects. A drop off rate is computed for the keyword in each of the linking objects. A perceived importance of the keyword in the each of the linking objects is computed. A partial link rating for each of the linking objects is computed. A total link rating is computed for the at least one keyword across all linking objects. The link ratings each keyword is stored in the object to be ranked.Type: ApplicationFiled: December 30, 2013Publication date: July 3, 2014Inventor: Charles J. Reed
-
Publication number: 20140188904Abstract: A matching system and method for providing a degree of matching between a plurality of entities. There are: first, second and additional attribute receiving modules each configured to receive attribute information from an entity; an electronic storage device including a cascading condition module configured to define: a cascading condition; and/or a plurality of cascading information requests associated with the cascading condition; a request module configured to provide the plurality of cascading information requests from the additional attribute receiving module as the cascading request condition is satisfied; and a degree module in communication with the first attribute information receiving module, the second attribute information receiving module, and the additional attribute information receiving module and configured to compare the first attribute information and second attribute information in light of additional attribute information and configured to return degree of matching information.Type: ApplicationFiled: March 6, 2014Publication date: July 3, 2014Inventor: David Sciuk
-
Publication number: 20140188900Abstract: An approach for generating a pattern-based database includes accessing a log specifying one or more strings representing data having a dynamic portion and a static portion, and generating a pattern-based database, including one or more records representing compression of the data, by determining the dynamic portions and the static portions of the strings, and assigning pattern values to the strings based on the determined dynamic portions and the static portions, wherein the pattern values are used to provide compression of the static portions within the records of the pattern-based database.Type: ApplicationFiled: January 2, 2013Publication date: July 3, 2014Applicant: VERIZON PATENT AND LICENSING INC.Inventors: Anand N. Sankaran, Anierutha X. CHANDHIRAMOWULI, SyedTalat IQBAL, Rajesh NARAYANAN, Jubish C. PARAMBATH, Anil K. GUNTUPALLI, Lisa A. CAPUTO
-
Publication number: 20140188901Abstract: A method, system and computer program product for efficiently identifying images, videos, audio files or documents relevant to a user using binary search trees in attribute space for guiding relevance feedback. A binary tree is constructed for each relative attribute of interest. A “pivot exemplar” (at a node of the binary tree) is set for each relative attribute's binary tree as corresponding to the database image, video, audio file or document with a median relative attribute value among that subtree's child examples. A pivot exemplar out of the available current pivot exemplars that has the highest expected information gain is selected to be provided to the user. Comparative attribute feedback is then received from the user regarding whether a degree of the attribute in the user's target image, video, audio file or document is more, less or equal with the attribute displayed in the selected pivot exemplar.Type: ApplicationFiled: August 13, 2013Publication date: July 3, 2014Applicant: Board of Regents, The University of Texas SystemInventors: Kristen Grauman, Adriana Kovashka
-
Publication number: 20140188905Abstract: An item authority system is provided. The item authority system uses rules to identify item definitions that match or potentially match an item description. When a unique match is found, then the item authority system may indicate that the item description describes the same item as the item definition. If multiple matches or only potential matches are identified, then the item authority system may allow a user to manually indicate which item definition matches.Type: ApplicationFiled: March 7, 2014Publication date: July 3, 2014Applicant: AMAZON TECHNOLOGIES, INC.Inventors: NICHOLAS BICKNELL, SHAWN BOHN, ANMOL PARALKAR, ANUVRATA ARORA
-
Publication number: 20140188903Abstract: Systems and methods consistent with the invention relate to matching user attributes. In one exemplary implementation, the system and methods may store predetermined general attribute descriptors reflecting attributes of users generally, receive personal attribute descriptors selected from the predetermined general attribute descriptors as corresponding to attributes of a first user and a second user, receive a rating associated with each received personal attribute descriptor, compare at least one personal attribute descriptor associated with the first user with at least one personal attribute descriptor associated with the second user to determine a descriptor match, and calculate a match score based on the determined descriptor match and the received ratings. In addition, first and second display points may be displayed and may be separated by a one-dimensional display distance that is a function of the calculated match score.Type: ApplicationFiled: February 10, 2014Publication date: July 3, 2014Applicant: ACCENTURE GLOBAL SERVICES GMBHInventors: James Edward MARSHALL, Marcus Wilfrid BUCKINGHAM, Darren Joseph RAYMOND
-
Publication number: 20140188899Abstract: In one embodiment, a method includes accessing a social graph that includes a plurality of nodes and edges, receiving a structured query that includes references to selected nodes and edges, and generating one or more query modification for the structured query, where each query modification includes references to modified nodes or modified edges from the plurality of nodes and edges.Type: ApplicationFiled: December 31, 2012Publication date: July 3, 2014Inventors: Thomas S. Whitnah, Olivier Chatot, Erik N. Vee, William R. Maschmeyer, Keith L. Peiris, Alex Langenfeld
-
Patent number: 8768923Abstract: Methods and systems to generate derivative information sources, from original information sources, use an ontology that provides a logic-based representation formalism of each of a number of original information sources, the original information sources having heterogeneous representation formalisms. The original information sources are transformed to the ontology. A number of derivative information sources, corresponding to the original information sources, may be automatically generated from the ontology.Type: GrantFiled: July 29, 2008Date of Patent: July 1, 2014Assignee: SAP AGInventors: Christian Drumm, Jens Lemcke, Daniel Oberle, Ganapathy Subramanian, Vivek Krishnamurthy Dornal
-
Patent number: 8768936Abstract: A method and an apparatus for recommending information to users within a social network. The method builds a recommendation list with at least one two-tuple, where each two-tuple comprises a target user name and an information item and ranks the recommendation list by using two-tuples in the recommendation list as a basic unit. By selecting a two-tuple in the recommendation list, the user can recommend a corresponding information item to a user represented by a target user name. An apparatus is also provided by using a builder for building for a user a recommendation list comprising at least one two-tuple and a sorter for ranking the recommendation list by using two-tuples in the recommendation list as a basic unit, such that, by selecting a two-tuple in the recommendation list.Type: GrantFiled: June 15, 2011Date of Patent: July 1, 2014Assignee: International Business Machines CorporationInventors: Shenghua Bao, Jian Chen, Cheng En Lu, Rui Ma, Zhong Su
-
Publication number: 20140181125Abstract: Systems and methods (e.g., utilities) for use in providing automated, lightweight collection of online, open source data which may be content-based to reduce website source bias. In one aspect, a utility is disclosed for use in extracting content of interest from at least one website or other online data source (e.g., where the extracted content can be used in a subsequent search query). In other aspects, utilities are disclosed that are operable to perform various types of analyses on such extracted content and present graphical representations of such analyses on a display of a client device.Type: ApplicationFiled: January 16, 2014Publication date: June 26, 2014Applicant: LOCKHEED MARTIN CORPORATIONInventors: Abha Moitra, David Brian Bracewell, Steven Matt Gustafson, T. Michael Baylor, Tina H. Chau
-
Publication number: 20140181122Abstract: In various embodiments, systems and methods are provided for generating and using a customized index. In embodiments, an index structure is constructed to efficiently utilize machines containing index portions. In this regard, the index structure for a particular application is customizable such that a number of virtual index units for a particular index type and/or a number of machines associated with the virtual index units for the particular index type can be optimized for machine and/or system performance and efficiency. Utilizing the constructed index structure, documents can be distributed to various index units, virtual index units, and/or machines in real-time or near real-time. Further, the customized index structure can be used to efficiently serve search results in response to search queries.Type: ApplicationFiled: December 20, 2012Publication date: June 26, 2014Applicant: MICROSOFT CORPORATIONInventors: UTKARSH JAIN, FAN WANG, MARTIN IRMAN, ANDRIJA ANTONIJEVIC, XINGTAO WEI, SYED JAWAD
-
Publication number: 20140181124Abstract: A method determines a measure of similarity between a first document and a second document, in which a vector space model which takes into account word frequencies and coordinates is determined for the first document and for the second document. A measure of the similarity between the first document and the second document is determined using the vector space model. An apparatus, a computer program product and a storage medium are configured to execute the method.Type: ApplicationFiled: December 23, 2013Publication date: June 26, 2014Applicant: DOCUWARE GMBHInventors: Andreas HOFMEIER, Christoph WEIDLING, Michael BERGER
-
Publication number: 20140181123Abstract: A content recommendation method for use in a portable electronic device is provided. The method includes the steps of fetching current context information from the portable electronic device; calculating a relevant ranking value of each item within each type of media files stored in the portable electronic device based on the context information; sorting the relevant ranking value of each item within each type of the media files; highlighting at least one of the items of a first user interface of the portable electronic device according to the sorted ranking values.Type: ApplicationFiled: May 28, 2013Publication date: June 26, 2014Inventors: Augustin TUFFET BLAISE, Ya-Chu YANG
-
Patent number: 8762327Abstract: Embodiments of the present invention provide a way to combing websites that can be edited over the Internet using distributed revision control. This also makes it possible to use writable web sites while not being connected to the Internet. In some embodiments, the present invention is applied to wikis. When a wiki reconnects, differences are automatically sent over and changes from other wikis are merged automatically. Wikis may also be synchronized on a periodic or event driven basis. Embodiments of the present invention may also be used for load balancing between wikis, or to share information with users who can only occasionally connect to the Internet.Type: GrantFiled: February 28, 2007Date of Patent: June 24, 2014Assignee: Red Hat, Inc.Inventor: Henri Han Van Riel
-
Patent number: 8762396Abstract: A system may include an address manager configured to map a data item including a plurality of attributes to a blocked Bloom filter (BBF) of a plurality of blocked Bloom filters. The system also may include a blocked Bloom filter (BBF) generator configured to map each attribute of the plurality of attributes to a corresponding block of the blocked Bloom filter.Type: GrantFiled: December 22, 2011Date of Patent: June 24, 2014Assignee: SAP AGInventors: Benoit Hudzia, Eoghan O'Neill
-
Patent number: 8762391Abstract: Techniques for sorting search results using user characteristic data are described. These techniques may include receiving a query from a user device. A search may be performed based on the query to obtain multiple results. User responses corresponding to the multiple results may be obtained and then grouped to determine multiple users based on similarities among the multiple users. Based on user responses associated with the multiple users, the multiple results may then be ranked.Type: GrantFiled: November 27, 2012Date of Patent: June 24, 2014Assignee: Alibaba Group Holding LimitedInventors: Xu Zhang, Qing-Yan Liu, Peng-Song Wu, Yi-Huo Ye
-
Patent number: 8762392Abstract: Methods, systems, and apparatus, including computer program products, for presenting search query suggestions. In an aspect, content of a resource that is determined to be responsive to a search query is received, and a candidate set of search query suggestions for the search query is suggested based, in part, on search history data associated with the search query. A final set of search query suggestions based on the search history data and the content of the resource and provided for display on a client device.Type: GrantFiled: February 22, 2013Date of Patent: June 24, 2014Assignee: Google Inc.Inventor: Tomoaki Yamauchi
-
Publication number: 20140172882Abstract: System, method, and computer program product to reduce an amount of processing required to generate a response to a first case by a deep question answering system, by, determining that a similarity score, of the first case relative to a second case, exceeds a similarity threshold, identifying a first feature of the second case having a first relevance score exceeding a relevance threshold, identifying a first candidate answer for the first case that does not have the first feature, and refraining from analyzing the first candidate answer in generating the response to the first case, thereby reducing the amount of processing of the deep question answering system.Type: ApplicationFiled: December 17, 2012Publication date: June 19, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Adam T. Clark, Mark G. Megerian, John E. Petri, Richard J. Stevens
-
Publication number: 20140172883Abstract: System, method, and computer program product to reduce an amount of processing required to generate a response to a first case by a deep question answering system, by, determining that a similarity score, of the first case relative to a second case, exceeds a similarity threshold, identifying a first feature of the second case having a first relevance score exceeding a relevance threshold, identifying a first candidate answer for the first case that does not have the first feature, and refraining from analyzing the first candidate answer in generating the response to the first case, thereby reducing the amount of processing of the deep question answering system.Type: ApplicationFiled: March 11, 2013Publication date: June 19, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Adam T. Clark, Mark G. Megerian, John E. Petri, Richard J. Stevens
-
Publication number: 20140172859Abstract: The subject matter discloses a method for trade interaction chain reconstruction comprising: identifying a swap deal, the swap deal includes two or more of the received interactions and involves two or more participants; selecting a first interaction of the received interactions, said first interaction involves at least two participants of the two or more participants, said first interaction is stored on a computerized device; obtaining a first plurality of interactions of the received interactions that involve the at least two participants of the two or more participants; determining a first plurality of relevance scores between the first plurality of interactions and the first interaction; and associating interactions of the first plurality of interactions to be relevant to the swap deal according to the determined first plurality of relevance scores.Type: ApplicationFiled: December 13, 2012Publication date: June 19, 2014Applicant: NICE-SYSTEMS LTDInventors: Gudmundur KRISTJANSSON, Daniël te WINKEL, Moshe WASSERBLAT, Cromwell FRASER, Steve LOGALBO, Bastiaan SCHÖNHAGE, Bram NACHTEGAAL, Yaron MORGENSTERN, Jeroen VINK, Oren PEREG
-
Publication number: 20140172884Abstract: Methods, systems, and apparatus, include computer programs encoded on a computer-readable storage medium, for determining keywords for an image that supports an overlay content item. A method includes identifying, using one or more processors, an image that is to support an overlay content item, the image being presented on a web site and including a portion that is designated as being enabled to receive and display the overlay content item; evaluating pixel data associated with the image including determining one or more labels that are associated with content included within the image; and determining one or more keywords for the image based at least in part on the one or more labels.Type: ApplicationFiled: March 14, 2013Publication date: June 19, 2014Applicant: Google Inc.Inventors: Jingbin Wang, Xiangrong Chen, Charles J. Rosenberg
-
Patent number: 8756241Abstract: Methods, systems, and apparatus, including computer program products, for determining rewrite source-rewrite target similarity scores. In one aspect the method includes receiving a rewrite source-rewrite target pair; identifying first queries that include the rewrite source and second queries that include the rewrite target; identifying a first web document referenced by a first search result responsive to the first query; identifying third queries for which the first web document was referenced by a third search result responsive to the third query; identifying a second web document that was referenced by a second search result responsive to the second query; identifying one or more fourth queries for which the second web document was referenced by a fourth search result responsive to the fourth query; and determining a similarity score for the rewrite source-rewrite target pair based on a measure of matching terms between third query terms and fourth query terms.Type: GrantFiled: August 6, 2012Date of Patent: June 17, 2014Assignee: Google Inc.Inventors: Shripad V. Thite, Dandapani Sivakumar
-
Patent number: 8756240Abstract: System, methods, and apparatus for attribute-based rating of authors and content. In some methods, first content authored by a first author having an attribute that is common to other authors is received. A second author having the attribute is identified as well as content authored by the second author. A user feedback base rating that is assigned to the second author is identified. An initial rating for the first content is generated based on the user feedback based rating that is assigned to the second author, and the initial rating is assigned to the first content.Type: GrantFiled: November 13, 2008Date of Patent: June 17, 2014Assignee: Google Inc.Inventor: Michal Cierniak
-
Publication number: 20140164399Abstract: Method, system, and computer program product to improve a coverage of a plurality of classifications between a plurality of terms in a glossary and a set of values in a reference data management system, by identifying a first classification, of the plurality of classifications in the glossary, between a first term in the glossary and a first set of values in the reference data management system, detecting a relationship between the first set of values and a second set of values in the reference data management system, and upon determining that a relevance score for a relevant value from the second set of values exceeds a predefined threshold, identifying the relevant value to be classified with the term in the glossary, wherein the glossary is configured to create a second classification between the first term and the relevant value.Type: ApplicationFiled: December 7, 2012Publication date: June 12, 2014Applicant: International Business Machines CorporationInventors: Dan J. Mandelstein, Ivan M. Milman, Sushain Pandit
-
Publication number: 20140164400Abstract: Technologies related to personal assistant context building are generally described. In some examples, network service communications, such as network traffic resulting from the use of mobile applications or “apps” on a mobile device, may be captured, parsed, and included in personal assistant context databases for use in configuring automated personal assistant user interaction operations. In some examples, parsing services may be provided to parse forwarded network service communications and generate converted data for inclusion in personal assistant context databases.Type: ApplicationFiled: December 7, 2012Publication date: June 12, 2014Applicant: EMPIRE TECHNOLOGY DEVELOPMENT LLCInventor: Ezekiel Kruglick
-
Patent number: 8751497Abstract: A Multi-Shot Scheduling System chooses from multiple candidate playlists of positions to select a broadcast playlist. Candidate playlists are generated based upon scoring and selecting content items for the positions through the use of index values. Various embodiments of the Multi-Shot Scheduling System can select broadcast playlists for multiple groups of content and can provide different methods of controlling scheduling performance by restricting the range of candidate playlists from which the best playlist can be selected.Type: GrantFiled: October 7, 2011Date of Patent: June 10, 2014Assignee: Clear Channel Management Services, Inc.Inventors: Nigel Attwell, Chris Bean
-
Patent number: 8751488Abstract: A computer implemented method and system for identifying one or more part numbers stored in a digital memory comprises parsing of each part number into its primary and secondary components and assigning a relevance score to each; parsing a query part number into one or more primary and secondary components and assigning a relevance score to each query component; identifying each stored part number that has at least one component that matches a query component; calculating for each identified part number a first sum equal to the sum of the relevance scores of the query components that match a component of the identified stored part number; and a second sum equal to the sum of the relevance scores of the components of the identified stored part number that match a query component; and sorting the identified stored part numbers as a function of said first and second sums.Type: GrantFiled: January 18, 2012Date of Patent: June 10, 2014Assignee: WayPart, Inc.Inventors: Hisham Said Tawfick, Mohamed Sherif Danish
-
Patent number: 8751511Abstract: An information retrieval system is described herein that monitors a microblog data stream that includes microblog posts to discover and index fresh resources for searching by a search engine. The information retrieval system also uses data from the microblog data stream as well as data obtained from a microblog subscription system to compute novel and effective features for ranking fresh resources which would otherwise have impoverished representations. An embodiment of the present invention advantageously enables a search engine to produce a fresher set of resources and to rank such resources for both relevancy and freshness in a more accurate manner.Type: GrantFiled: March 30, 2010Date of Patent: June 10, 2014Assignee: Yahoo! Inc.Inventors: Anlei Dong, Pranam Kolari, Ruiqiang Zhang, Jing Bai, Yi Chang, Zhaohui Zheng
-
Publication number: 20140156679Abstract: Methods of securing the calculation of pairwise molecular similarity coefficients between molecules, from similarity measures that are based on 3-dimensional or 2-dimensional molecular properties and/or physicochemical properties, condensed into a fingerprint or bit-string representation, in such a way that a third party cannot deduce information about the underlying molecular structures. The apparatus and process are particularly applicable to generating secured or anonymized databases of bit-strings, so that the anonymized databases can be stored remotely from a corporation's computer system, or shared securely and confidentially with another company. The mapping key that permits the anonymized bit strings to be converted back to their original form need not be disclosed outside of the corporation. The methods also permit two companies to exchange molecular structure data securely and in a manner that permits similarity calculations to be performed within as well as between the respective databases.Type: ApplicationFiled: June 17, 2013Publication date: June 5, 2014Applicant: OpenEye Scientific Software, Inc.Inventors: Robert W. Tolbert, Joseph J. Corkery, Anthony Nicholls, Kevin Schmidt, Brian Kelley
-
Publication number: 20140156593Abstract: The present technology relates to an information processing apparatus, an information processing method, and a program allowing a user to access a reference document or the like written inside an electronic document by only clicking on a description of the reference document. A storing unit that stores information of an electronic document, an extraction unit that extracts a sentence including the information stored in the storing unit from a predetermined electronic document, and a generation unit that generates a link to the information stored in the storing unit from the sentence extracted by the extraction unit are provided. Even in a case where the electronic document is a document that is formed as the electronic document through scanning, when the degree of matching between the sentence included in the electronic document and the information stored in the storing unit is high, the sentence and the information are associated with each other, and a link is established.Type: ApplicationFiled: July 11, 2012Publication date: June 5, 2014Applicant: SONY CORPORATIONInventor: Kensuke Oonuma
-
Publication number: 20140156680Abstract: A user-interface method of selecting and presenting a collection of content items based on user navigation and selection actions associated with the content is provided. The method includes associating a relevance weight on a per user basis with content items to indicate a relative measure of likelihood that the user desires the content item. The method includes receiving a user's navigation and selections actions for identifying desired content items, and in response, adjusting the associated relevance weight of the selected content item and group of content items containing the selected item. The method includes, in response to subsequent user input, selecting and presenting a subset of content items and content groups to the user ordered by the adjusted associated relevance weights assigned to the content items and content groups.Type: ApplicationFiled: February 7, 2014Publication date: June 5, 2014Applicant: VEVEO, INC.Inventors: Murali ARAVAMUDAN, Kajamalai G. RAMAKRISHNAN, Rakesh BARVE, Sashikumar VENKATARAMAN, Ajit RAJASEKHARAN
-
Patent number: 8745055Abstract: In order to clustering documents, document vectors are formed for each of a plurality of documents of a corpus and plurality of reference vectors is generated. The document vectors are then compared to the reference vectors to generate similarity values for each of the document vectors. The document vectors are then sorted based on the similarity values for the document vectors to form a sorted list. Clusters are then formed based on the similarity between adjacent document vectors in the sorted list.Type: GrantFiled: September 28, 2006Date of Patent: June 3, 2014Assignee: Symantec Operating CorporationInventor: Eduardo Suarez
-
Patent number: 8745067Abstract: A system may include one or more databases to store comments relating to documents, the comments originating from first and second sources, where the comments from the first source include comments received from users via commenting functionality associated with browsers installed on client devices, and the comments from the second source include comments received from users independent of the commenting functionality associated with the browsers installed on the client devices. The system may also include one or more server devices to receive a request for comments relating to a particular document, search at least one of the one or more databases to identify comments relating to the particular document, and provide the identified comments for presentation in connection with the particular document.Type: GrantFiled: August 12, 2009Date of Patent: June 3, 2014Assignee: Google Inc.Inventors: Michal Cierniak, Donn Denman, Tony Hsieh, Derek Prothro, Marc Pawliger
-
Method for visual asset replacement accounting for cost, copyright, and confidentiality requirements
Patent number: 8745068Abstract: Systems and methods of replacing digital assets within a multimedia document are provided. The systems and methods include a user workstation that can receive a selection from a user for an original asset in the document to be replaced. Alternative assets can be retrieved that have a level of appropriateness with the selected original asset. Constraints on use of the alternative assets can be determined and a fitness value of each of the alternative assets can be calculated based on the appropriateness and the constraints on use. The alternative assets with the highest fitness values can be presented to the user for the user to select to replace the original asset.Type: GrantFiled: October 13, 2009Date of Patent: June 3, 2014Assignee: Xerox CorporationInventors: Tommaso Colombino, Robert John Rolleston, Luca Marchesotti -
Patent number: 8744839Abstract: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of text segments to form a text data combination set; determining an intersection of the candidate word set and the text data combination set, the intersection comprising a plurality of text data combinations; determining a plurality of designated characteristic values for the plurality of text data combinations; based at least in part on the plurality of designated characteristic values and according to at least a criterion, recognizing among the plurality of text data combinations target words whose characteristic values fulfill the criterion.Type: GrantFiled: September 22, 2011Date of Patent: June 3, 2014Assignee: Alibaba Group Holding LimitedInventors: Haibo Sun, Yang Yang, Yining Chen
-
Patent number: 8745059Abstract: Aspects of the subject matter described herein relate to functions used for retrieving image results based on search queries. More specifically, image search queries can be pre-grouped or classified based on visual and semantic similarity. For example, a pairwise image similarity value for a pair of queries can be computed based on one or more of the sum of all of the overlapping the image results, the sum of the image distances between all of the pairs of images in the image results, and the rank of each of the images in the image results. The pairwise image similarity values can then be used to generate image query clusters. Each image query clusters can include a set of queries with high pairwise image similarity values. In some examples, a distance function can be determined for each image query cluster. This data can be used to provide image results.Type: GrantFiled: May 29, 2012Date of Patent: June 3, 2014Assignee: Google Inc.Inventors: Yushi Jing, Michele Covell, Stephen Conor Holiday
-
Publication number: 20140149427Abstract: A system and methods are provided for scoring assets for display in a tapestry interface. In one embodiment, a method includes identifying assets for display in a tapestry interface presentation, wherein the tapestry interface provides a presentation for a plurality of assets having relevance based sizing, arrangement of the assets based at least in part on a grid pattern, receiving data for identified assets of the tapestry presentation, and scoring assets based on the received data to determine presentation characteristics for the assets in the tapestry interface. The method may also include updating the presentation of the tapestry interface on a device based on the presentation characteristics.Type: ApplicationFiled: November 26, 2012Publication date: May 29, 2014Inventors: Brad WILDER, Bradley James BRIZENDINE, Martin A. STEIN, Andre Wilhelm RABOLD, Farhang M. ZARRINKELK
-
Publication number: 20140149429Abstract: A computer-implemented method and system for Web search ranking are provided herein. The method includes generating a number of training samples from clickthrough data, wherein the training samples include positive query-document pairs and negative query-document pairs. The method also includes discriminatively training a translation model based on the training samples and ranking a number of documents for a Web search based on the translation model.Type: ApplicationFiled: November 29, 2012Publication date: May 29, 2014Applicant: MICROSOFT CORPORATIONInventors: Jianfeng Gao, Zhonghua Qu, Gu Xu
-
Publication number: 20140149431Abstract: Embodiments relate to relevance-based information processing. An aspect includes storing a history of display operations performed by a user on a first electronic file. Another aspect includes inputting display operations into the stored history performed by a user on the first electronic file. Another aspect includes calculating, using a plurality of calculating methods, a plurality of degrees of relevance of a second electronic file to the first electronic file based on the stored history. Another aspect includes calculating a synthesized degree of relevance by synthesizing the plurality of degrees of relevance of the second electronic file to the first electronic file. Another aspect includes displaying an input region on a display for inputting display operations on the first electronic file, and automatically displaying the second electronic file based on the synthesized degree of relevance of the second electronic file to the first electronic file exceeding a predetermined threshold.Type: ApplicationFiled: October 10, 2013Publication date: May 29, 2014Applicant: International Business Machines Corporation.Inventors: Tomoka Mochizuki, Wen Lianzi
-
Publication number: 20140149378Abstract: A method for determining the significance of a web page, or a portion thereof, is disclosed. Accordingly, a search engine or some other application analyzes user-selected content portions (as well as user-provided comments associated with the portions) of a document to determine a document relevance score (e.g. Content Selection Rank) for the document containing the user-selected content portions. The particular algorithm for determining the document relevance score will vary depending upon the particular implementation, but may generally be based upon an analysis of the number and quality of user-selected portions, associated comments, the ratings of the user making the selections and the ratings of users contributing to interactions (such as sharing) with the portions. Based on this analysis, the document is assigned a document relevance score, which is used for processing the document in accordance with instructions associated with a search query.Type: ApplicationFiled: January 30, 2014Publication date: May 29, 2014Inventor: Rohit Chandra
-
Publication number: 20140149428Abstract: A computerized method for identifying a document. A signature may be determined for a first document and compared with a signature for each of one or more additional documents. A document similarity score may be determined and one or more similar documents may be identified based on the document similarity score.Type: ApplicationFiled: November 28, 2012Publication date: May 29, 2014Applicant: SAP AGInventors: Godfrey Hobbs, Stefanie Rupp, Axel Gustav
-
Publication number: 20140149430Abstract: A method of detecting an overlapping community in a network including nodes and links between the nodes, includes calculating a similarity between the links, and generating a line graph of the network. The method further includes detecting one or more cores in the line graph, and growing a cluster for each of the one or more cores. The method further includes converting the cluster into a cluster of nodes of a node graph.Type: ApplicationFiled: June 28, 2013Publication date: May 29, 2014Inventors: Seungwoo RYU, Sejeong KWON, Jae-Gil LEE, Sungsu LIM
-
Publication number: 20140149432Abstract: Described herein are systems and methods for selection-based contextual help retrieval. One example method involves (a) receiving first-query data including contextual data, the contextual data indicating a user-interface element type, a user-interface element location, and user-interface element text; (b) determining at least one first-query response based on at least the contextual data; and (c) causing an indication of the determined at least one first-query response to be provided via an output device.Type: ApplicationFiled: June 18, 2012Publication date: May 29, 2014Applicant: UNIVERSITY OF WASHINGTON THROUGH ITS CENTER FOR COMMERCIALIZATIONInventors: Parmit K. Chilana, Andrew J. Ko, Jacob O. Wobbrock
-
Patent number: 8738636Abstract: The present invention relates to computer implemented methods and system for determining correspondences between terms in two or more ontologies. The methods and systems are designed to accept as inputs ontologies in Web Ontology Language (OWL) syntax or any other ontology syntax, to calculate a similarity measure between terms in the ontologies, extract an alignment based on this similarity measure, and verify this alignment according to the semantics contained in the ontologies. This process is designed to be executed iteratively until the similarity measures converge, or until another suitable finalization condition is met. The result of these methods and of the systems implementing these methods is an alignment between two or more ontologies establishing semantic correspondences between the terms in the ontologies.Type: GrantFiled: September 17, 2009Date of Patent: May 27, 2014Inventors: Yves Reginald Jean-Mary, Mansur Kabuka
-
Patent number: 8739032Abstract: A document analysis system receives multiple concepts along with multiple reference documents and generates sensory indicators that assist a researcher in assessing the relevance of each of the documents to the concepts. In one exemplary aspect, the document analysis system displays a table of keywords separated into blocks, each block of keywords corresponding to one of the concepts. Each block is colored according to the prevalence of any keyword within a given keyword group. The color of a block thus indicates the relative presence of a concept in the document. The document analysis system also determines a unique color for each block of keywords for highlighting in the text of the document. In this manner a researcher can quickly identify passages that contain multiple concepts. Additionally, the researcher is provided the ability to quickly locate reference characters, figure numbers and patent numbers in the document.Type: GrantFiled: October 12, 2010Date of Patent: May 27, 2014Inventor: Patrick Sander Walsh
-
Patent number: 8738635Abstract: Embodiments are directed to ranking search results using a junk profile. For a given corpus of documents, one or more junk profiles may be created and maintained. The junk profile provides reference metrics to represent known junk documents. For example, a junk profile may comprise a dictionary of document data that is automatically inserted into documents created using a particular system or template. A junk profile may also comprise one or more representations (e.g., histograms) of a distribution of a particular junk variable for known junk documents. The junk profile provides a usable representation of known junk documents, and the present systems and methods employ the junk profile to predict the likelihood that documents in the corpus are junk. In embodiments, junk scores are calculated and used to rank such documents higher or lower in response to a search query.Type: GrantFiled: June 1, 2010Date of Patent: May 27, 2014Assignee: Microsoft CorporationInventors: Vladimir Tankovich, Dmitriy Meyerzon, Victor Poznanski
-
Patent number: 8732177Abstract: A search query is received. One or more listings is identified responsive to the search query. For each of the one or more of listings, the following are determined: a relevancy score based on one or more parameters in the search query, an expected click through rate, and at least one of a content density boost that is based on one or more fields that are included in or excluded from the listing and a geography type boost that is based on a comparison of one or more geography parameters of the query to one or more geography parameters of the listing. For each of the one or more listings, a performance score is calculated based on the relevancy score, the expected click through rate, and at least one of the content density boost and the geography type boost.Type: GrantFiled: April 26, 2010Date of Patent: May 20, 2014Assignee: JPMorgan Chase Bank, N.A.Inventors: Sivakumar Chinnasamy, Jeesmon Jacob, Tsu-Jung Kung, Than Kim-Thi Nguyen
-
Publication number: 20140136543Abstract: A system that provides secure autocomplete searching receives an autocomplete query from a user, the autocomplete query including a prefix of a search phrase, and retrieves security information of the user. The system searches one or more prefix indexes to find a set of matching objects, where the matching objects each include associated object security information. The system excludes matching objects that the user is not authorized to access from the set of matching objects based on the object security information and the user security information. The system then returns the set of matching objects to the user.Type: ApplicationFiled: November 13, 2012Publication date: May 15, 2014Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Kurt FRIEDEN, Don L. HAYLER, Michael RICHARDS, Vasif SHAIKH
-
Publication number: 20140136550Abstract: A vehicle identification number (VIN) decoder (VDC) implementing a unique VIN decoding method may, for a given VIN, shorten the VIN and form a stem and a leaf therefrom. Utilizing the stem, the VDC may operate to find matching leaf values, if any, from a set of look up tables. Depending upon a match outcome, one or more trim identification code (TIC) values can be assigned to the VIN and a candidate list can be constructed utilizing the assigned TIC value(s). The candidate list, which can be optimized, may contain one or more candidate trims for the VIN. For each candidate trim, a confidence score and a match probability can be generated. The VDC may provide decoded information containing trim data associated with at least one of the one or more candidate trims for the VIN to a client device over a network connection.Type: ApplicationFiled: January 17, 2014Publication date: May 15, 2014Applicant: TrueCar, Inc.Inventors: Thomas J. Sullivan, Michael D. Swinson