Based On Record Similarity And Relevance Patents (Class 707/749)
  • Patent number: 8775441
    Abstract: In one aspect, in general, a method is described for managing an archive. The archive is used for determining approximate matches associated with strings occurring in records. The method includes processing records to determine a set of string representations that correspond to strings occurring in the records. The method also includes generating, for each of at least some of the string representations in the set, a plurality of close representations that are each generated from at least some of the same characters in the string. The method also includes storing entries in the archive. Each stored entry represents a potential approximate match between at least two strings based on their respective close representations.
    Type: Grant
    Filed: January 16, 2008
    Date of Patent: July 8, 2014
    Assignee: Ab Initio Technology LLC
    Inventor: Arlen Anderson
  • Publication number: 20140188902
    Abstract: A method and system for ranking an object that contains one or more keywords is disclosed. All linking objects are retrieved that contain a keyword of interest. Each linking object links to the object to be ranked. The locations of each link in each linking object are determined relative to the keywords in the linking objects. A drop off rate is computed for the keyword in each of the linking objects. A perceived importance of the keyword in the each of the linking objects is computed. A partial link rating for each of the linking objects is computed. A total link rating is computed for the at least one keyword across all linking objects. The link ratings each keyword is stored in the object to be ranked.
    Type: Application
    Filed: December 30, 2013
    Publication date: July 3, 2014
    Inventor: Charles J. Reed
  • Publication number: 20140188904
    Abstract: A matching system and method for providing a degree of matching between a plurality of entities. There are: first, second and additional attribute receiving modules each configured to receive attribute information from an entity; an electronic storage device including a cascading condition module configured to define: a cascading condition; and/or a plurality of cascading information requests associated with the cascading condition; a request module configured to provide the plurality of cascading information requests from the additional attribute receiving module as the cascading request condition is satisfied; and a degree module in communication with the first attribute information receiving module, the second attribute information receiving module, and the additional attribute information receiving module and configured to compare the first attribute information and second attribute information in light of additional attribute information and configured to return degree of matching information.
    Type: Application
    Filed: March 6, 2014
    Publication date: July 3, 2014
    Inventor: David Sciuk
  • Publication number: 20140188900
    Abstract: An approach for generating a pattern-based database includes accessing a log specifying one or more strings representing data having a dynamic portion and a static portion, and generating a pattern-based database, including one or more records representing compression of the data, by determining the dynamic portions and the static portions of the strings, and assigning pattern values to the strings based on the determined dynamic portions and the static portions, wherein the pattern values are used to provide compression of the static portions within the records of the pattern-based database.
    Type: Application
    Filed: January 2, 2013
    Publication date: July 3, 2014
    Applicant: VERIZON PATENT AND LICENSING INC.
    Inventors: Anand N. Sankaran, Anierutha X. CHANDHIRAMOWULI, SyedTalat IQBAL, Rajesh NARAYANAN, Jubish C. PARAMBATH, Anil K. GUNTUPALLI, Lisa A. CAPUTO
  • Publication number: 20140188901
    Abstract: A method, system and computer program product for efficiently identifying images, videos, audio files or documents relevant to a user using binary search trees in attribute space for guiding relevance feedback. A binary tree is constructed for each relative attribute of interest. A “pivot exemplar” (at a node of the binary tree) is set for each relative attribute's binary tree as corresponding to the database image, video, audio file or document with a median relative attribute value among that subtree's child examples. A pivot exemplar out of the available current pivot exemplars that has the highest expected information gain is selected to be provided to the user. Comparative attribute feedback is then received from the user regarding whether a degree of the attribute in the user's target image, video, audio file or document is more, less or equal with the attribute displayed in the selected pivot exemplar.
    Type: Application
    Filed: August 13, 2013
    Publication date: July 3, 2014
    Applicant: Board of Regents, The University of Texas System
    Inventors: Kristen Grauman, Adriana Kovashka
  • Publication number: 20140188905
    Abstract: An item authority system is provided. The item authority system uses rules to identify item definitions that match or potentially match an item description. When a unique match is found, then the item authority system may indicate that the item description describes the same item as the item definition. If multiple matches or only potential matches are identified, then the item authority system may allow a user to manually indicate which item definition matches.
    Type: Application
    Filed: March 7, 2014
    Publication date: July 3, 2014
    Applicant: AMAZON TECHNOLOGIES, INC.
    Inventors: NICHOLAS BICKNELL, SHAWN BOHN, ANMOL PARALKAR, ANUVRATA ARORA
  • Publication number: 20140188903
    Abstract: Systems and methods consistent with the invention relate to matching user attributes. In one exemplary implementation, the system and methods may store predetermined general attribute descriptors reflecting attributes of users generally, receive personal attribute descriptors selected from the predetermined general attribute descriptors as corresponding to attributes of a first user and a second user, receive a rating associated with each received personal attribute descriptor, compare at least one personal attribute descriptor associated with the first user with at least one personal attribute descriptor associated with the second user to determine a descriptor match, and calculate a match score based on the determined descriptor match and the received ratings. In addition, first and second display points may be displayed and may be separated by a one-dimensional display distance that is a function of the calculated match score.
    Type: Application
    Filed: February 10, 2014
    Publication date: July 3, 2014
    Applicant: ACCENTURE GLOBAL SERVICES GMBH
    Inventors: James Edward MARSHALL, Marcus Wilfrid BUCKINGHAM, Darren Joseph RAYMOND
  • Publication number: 20140188899
    Abstract: In one embodiment, a method includes accessing a social graph that includes a plurality of nodes and edges, receiving a structured query that includes references to selected nodes and edges, and generating one or more query modification for the structured query, where each query modification includes references to modified nodes or modified edges from the plurality of nodes and edges.
    Type: Application
    Filed: December 31, 2012
    Publication date: July 3, 2014
    Inventors: Thomas S. Whitnah, Olivier Chatot, Erik N. Vee, William R. Maschmeyer, Keith L. Peiris, Alex Langenfeld
  • Patent number: 8768923
    Abstract: Methods and systems to generate derivative information sources, from original information sources, use an ontology that provides a logic-based representation formalism of each of a number of original information sources, the original information sources having heterogeneous representation formalisms. The original information sources are transformed to the ontology. A number of derivative information sources, corresponding to the original information sources, may be automatically generated from the ontology.
    Type: Grant
    Filed: July 29, 2008
    Date of Patent: July 1, 2014
    Assignee: SAP AG
    Inventors: Christian Drumm, Jens Lemcke, Daniel Oberle, Ganapathy Subramanian, Vivek Krishnamurthy Dornal
  • Patent number: 8768936
    Abstract: A method and an apparatus for recommending information to users within a social network. The method builds a recommendation list with at least one two-tuple, where each two-tuple comprises a target user name and an information item and ranks the recommendation list by using two-tuples in the recommendation list as a basic unit. By selecting a two-tuple in the recommendation list, the user can recommend a corresponding information item to a user represented by a target user name. An apparatus is also provided by using a builder for building for a user a recommendation list comprising at least one two-tuple and a sorter for ranking the recommendation list by using two-tuples in the recommendation list as a basic unit, such that, by selecting a two-tuple in the recommendation list.
    Type: Grant
    Filed: June 15, 2011
    Date of Patent: July 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Jian Chen, Cheng En Lu, Rui Ma, Zhong Su
  • Publication number: 20140181125
    Abstract: Systems and methods (e.g., utilities) for use in providing automated, lightweight collection of online, open source data which may be content-based to reduce website source bias. In one aspect, a utility is disclosed for use in extracting content of interest from at least one website or other online data source (e.g., where the extracted content can be used in a subsequent search query). In other aspects, utilities are disclosed that are operable to perform various types of analyses on such extracted content and present graphical representations of such analyses on a display of a client device.
    Type: Application
    Filed: January 16, 2014
    Publication date: June 26, 2014
    Applicant: LOCKHEED MARTIN CORPORATION
    Inventors: Abha Moitra, David Brian Bracewell, Steven Matt Gustafson, T. Michael Baylor, Tina H. Chau
  • Publication number: 20140181122
    Abstract: In various embodiments, systems and methods are provided for generating and using a customized index. In embodiments, an index structure is constructed to efficiently utilize machines containing index portions. In this regard, the index structure for a particular application is customizable such that a number of virtual index units for a particular index type and/or a number of machines associated with the virtual index units for the particular index type can be optimized for machine and/or system performance and efficiency. Utilizing the constructed index structure, documents can be distributed to various index units, virtual index units, and/or machines in real-time or near real-time. Further, the customized index structure can be used to efficiently serve search results in response to search queries.
    Type: Application
    Filed: December 20, 2012
    Publication date: June 26, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: UTKARSH JAIN, FAN WANG, MARTIN IRMAN, ANDRIJA ANTONIJEVIC, XINGTAO WEI, SYED JAWAD
  • Publication number: 20140181124
    Abstract: A method determines a measure of similarity between a first document and a second document, in which a vector space model which takes into account word frequencies and coordinates is determined for the first document and for the second document. A measure of the similarity between the first document and the second document is determined using the vector space model. An apparatus, a computer program product and a storage medium are configured to execute the method.
    Type: Application
    Filed: December 23, 2013
    Publication date: June 26, 2014
    Applicant: DOCUWARE GMBH
    Inventors: Andreas HOFMEIER, Christoph WEIDLING, Michael BERGER
  • Publication number: 20140181123
    Abstract: A content recommendation method for use in a portable electronic device is provided. The method includes the steps of fetching current context information from the portable electronic device; calculating a relevant ranking value of each item within each type of media files stored in the portable electronic device based on the context information; sorting the relevant ranking value of each item within each type of the media files; highlighting at least one of the items of a first user interface of the portable electronic device according to the sorted ranking values.
    Type: Application
    Filed: May 28, 2013
    Publication date: June 26, 2014
    Inventors: Augustin TUFFET BLAISE, Ya-Chu YANG
  • Patent number: 8762327
    Abstract: Embodiments of the present invention provide a way to combing websites that can be edited over the Internet using distributed revision control. This also makes it possible to use writable web sites while not being connected to the Internet. In some embodiments, the present invention is applied to wikis. When a wiki reconnects, differences are automatically sent over and changes from other wikis are merged automatically. Wikis may also be synchronized on a periodic or event driven basis. Embodiments of the present invention may also be used for load balancing between wikis, or to share information with users who can only occasionally connect to the Internet.
    Type: Grant
    Filed: February 28, 2007
    Date of Patent: June 24, 2014
    Assignee: Red Hat, Inc.
    Inventor: Henri Han Van Riel
  • Patent number: 8762396
    Abstract: A system may include an address manager configured to map a data item including a plurality of attributes to a blocked Bloom filter (BBF) of a plurality of blocked Bloom filters. The system also may include a blocked Bloom filter (BBF) generator configured to map each attribute of the plurality of attributes to a corresponding block of the blocked Bloom filter.
    Type: Grant
    Filed: December 22, 2011
    Date of Patent: June 24, 2014
    Assignee: SAP AG
    Inventors: Benoit Hudzia, Eoghan O'Neill
  • Patent number: 8762391
    Abstract: Techniques for sorting search results using user characteristic data are described. These techniques may include receiving a query from a user device. A search may be performed based on the query to obtain multiple results. User responses corresponding to the multiple results may be obtained and then grouped to determine multiple users based on similarities among the multiple users. Based on user responses associated with the multiple users, the multiple results may then be ranked.
    Type: Grant
    Filed: November 27, 2012
    Date of Patent: June 24, 2014
    Assignee: Alibaba Group Holding Limited
    Inventors: Xu Zhang, Qing-Yan Liu, Peng-Song Wu, Yi-Huo Ye
  • Patent number: 8762392
    Abstract: Methods, systems, and apparatus, including computer program products, for presenting search query suggestions. In an aspect, content of a resource that is determined to be responsive to a search query is received, and a candidate set of search query suggestions for the search query is suggested based, in part, on search history data associated with the search query. A final set of search query suggestions based on the search history data and the content of the resource and provided for display on a client device.
    Type: Grant
    Filed: February 22, 2013
    Date of Patent: June 24, 2014
    Assignee: Google Inc.
    Inventor: Tomoaki Yamauchi
  • Publication number: 20140172882
    Abstract: System, method, and computer program product to reduce an amount of processing required to generate a response to a first case by a deep question answering system, by, determining that a similarity score, of the first case relative to a second case, exceeds a similarity threshold, identifying a first feature of the second case having a first relevance score exceeding a relevance threshold, identifying a first candidate answer for the first case that does not have the first feature, and refraining from analyzing the first candidate answer in generating the response to the first case, thereby reducing the amount of processing of the deep question answering system.
    Type: Application
    Filed: December 17, 2012
    Publication date: June 19, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Adam T. Clark, Mark G. Megerian, John E. Petri, Richard J. Stevens
  • Publication number: 20140172883
    Abstract: System, method, and computer program product to reduce an amount of processing required to generate a response to a first case by a deep question answering system, by, determining that a similarity score, of the first case relative to a second case, exceeds a similarity threshold, identifying a first feature of the second case having a first relevance score exceeding a relevance threshold, identifying a first candidate answer for the first case that does not have the first feature, and refraining from analyzing the first candidate answer in generating the response to the first case, thereby reducing the amount of processing of the deep question answering system.
    Type: Application
    Filed: March 11, 2013
    Publication date: June 19, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Adam T. Clark, Mark G. Megerian, John E. Petri, Richard J. Stevens
  • Publication number: 20140172859
    Abstract: The subject matter discloses a method for trade interaction chain reconstruction comprising: identifying a swap deal, the swap deal includes two or more of the received interactions and involves two or more participants; selecting a first interaction of the received interactions, said first interaction involves at least two participants of the two or more participants, said first interaction is stored on a computerized device; obtaining a first plurality of interactions of the received interactions that involve the at least two participants of the two or more participants; determining a first plurality of relevance scores between the first plurality of interactions and the first interaction; and associating interactions of the first plurality of interactions to be relevant to the swap deal according to the determined first plurality of relevance scores.
    Type: Application
    Filed: December 13, 2012
    Publication date: June 19, 2014
    Applicant: NICE-SYSTEMS LTD
    Inventors: Gudmundur KRISTJANSSON, Daniël te WINKEL, Moshe WASSERBLAT, Cromwell FRASER, Steve LOGALBO, Bastiaan SCHÖNHAGE, Bram NACHTEGAAL, Yaron MORGENSTERN, Jeroen VINK, Oren PEREG
  • Publication number: 20140172884
    Abstract: Methods, systems, and apparatus, include computer programs encoded on a computer-readable storage medium, for determining keywords for an image that supports an overlay content item. A method includes identifying, using one or more processors, an image that is to support an overlay content item, the image being presented on a web site and including a portion that is designated as being enabled to receive and display the overlay content item; evaluating pixel data associated with the image including determining one or more labels that are associated with content included within the image; and determining one or more keywords for the image based at least in part on the one or more labels.
    Type: Application
    Filed: March 14, 2013
    Publication date: June 19, 2014
    Applicant: Google Inc.
    Inventors: Jingbin Wang, Xiangrong Chen, Charles J. Rosenberg
  • Patent number: 8756241
    Abstract: Methods, systems, and apparatus, including computer program products, for determining rewrite source-rewrite target similarity scores. In one aspect the method includes receiving a rewrite source-rewrite target pair; identifying first queries that include the rewrite source and second queries that include the rewrite target; identifying a first web document referenced by a first search result responsive to the first query; identifying third queries for which the first web document was referenced by a third search result responsive to the third query; identifying a second web document that was referenced by a second search result responsive to the second query; identifying one or more fourth queries for which the second web document was referenced by a fourth search result responsive to the fourth query; and determining a similarity score for the rewrite source-rewrite target pair based on a measure of matching terms between third query terms and fourth query terms.
    Type: Grant
    Filed: August 6, 2012
    Date of Patent: June 17, 2014
    Assignee: Google Inc.
    Inventors: Shripad V. Thite, Dandapani Sivakumar
  • Patent number: 8756240
    Abstract: System, methods, and apparatus for attribute-based rating of authors and content. In some methods, first content authored by a first author having an attribute that is common to other authors is received. A second author having the attribute is identified as well as content authored by the second author. A user feedback base rating that is assigned to the second author is identified. An initial rating for the first content is generated based on the user feedback based rating that is assigned to the second author, and the initial rating is assigned to the first content.
    Type: Grant
    Filed: November 13, 2008
    Date of Patent: June 17, 2014
    Assignee: Google Inc.
    Inventor: Michal Cierniak
  • Publication number: 20140164399
    Abstract: Method, system, and computer program product to improve a coverage of a plurality of classifications between a plurality of terms in a glossary and a set of values in a reference data management system, by identifying a first classification, of the plurality of classifications in the glossary, between a first term in the glossary and a first set of values in the reference data management system, detecting a relationship between the first set of values and a second set of values in the reference data management system, and upon determining that a relevance score for a relevant value from the second set of values exceeds a predefined threshold, identifying the relevant value to be classified with the term in the glossary, wherein the glossary is configured to create a second classification between the first term and the relevant value.
    Type: Application
    Filed: December 7, 2012
    Publication date: June 12, 2014
    Applicant: International Business Machines Corporation
    Inventors: Dan J. Mandelstein, Ivan M. Milman, Sushain Pandit
  • Publication number: 20140164400
    Abstract: Technologies related to personal assistant context building are generally described. In some examples, network service communications, such as network traffic resulting from the use of mobile applications or “apps” on a mobile device, may be captured, parsed, and included in personal assistant context databases for use in configuring automated personal assistant user interaction operations. In some examples, parsing services may be provided to parse forwarded network service communications and generate converted data for inclusion in personal assistant context databases.
    Type: Application
    Filed: December 7, 2012
    Publication date: June 12, 2014
    Applicant: EMPIRE TECHNOLOGY DEVELOPMENT LLC
    Inventor: Ezekiel Kruglick
  • Patent number: 8751497
    Abstract: A Multi-Shot Scheduling System chooses from multiple candidate playlists of positions to select a broadcast playlist. Candidate playlists are generated based upon scoring and selecting content items for the positions through the use of index values. Various embodiments of the Multi-Shot Scheduling System can select broadcast playlists for multiple groups of content and can provide different methods of controlling scheduling performance by restricting the range of candidate playlists from which the best playlist can be selected.
    Type: Grant
    Filed: October 7, 2011
    Date of Patent: June 10, 2014
    Assignee: Clear Channel Management Services, Inc.
    Inventors: Nigel Attwell, Chris Bean
  • Patent number: 8751488
    Abstract: A computer implemented method and system for identifying one or more part numbers stored in a digital memory comprises parsing of each part number into its primary and secondary components and assigning a relevance score to each; parsing a query part number into one or more primary and secondary components and assigning a relevance score to each query component; identifying each stored part number that has at least one component that matches a query component; calculating for each identified part number a first sum equal to the sum of the relevance scores of the query components that match a component of the identified stored part number; and a second sum equal to the sum of the relevance scores of the components of the identified stored part number that match a query component; and sorting the identified stored part numbers as a function of said first and second sums.
    Type: Grant
    Filed: January 18, 2012
    Date of Patent: June 10, 2014
    Assignee: WayPart, Inc.
    Inventors: Hisham Said Tawfick, Mohamed Sherif Danish
  • Patent number: 8751511
    Abstract: An information retrieval system is described herein that monitors a microblog data stream that includes microblog posts to discover and index fresh resources for searching by a search engine. The information retrieval system also uses data from the microblog data stream as well as data obtained from a microblog subscription system to compute novel and effective features for ranking fresh resources which would otherwise have impoverished representations. An embodiment of the present invention advantageously enables a search engine to produce a fresher set of resources and to rank such resources for both relevancy and freshness in a more accurate manner.
    Type: Grant
    Filed: March 30, 2010
    Date of Patent: June 10, 2014
    Assignee: Yahoo! Inc.
    Inventors: Anlei Dong, Pranam Kolari, Ruiqiang Zhang, Jing Bai, Yi Chang, Zhaohui Zheng
  • Publication number: 20140156679
    Abstract: Methods of securing the calculation of pairwise molecular similarity coefficients between molecules, from similarity measures that are based on 3-dimensional or 2-dimensional molecular properties and/or physicochemical properties, condensed into a fingerprint or bit-string representation, in such a way that a third party cannot deduce information about the underlying molecular structures. The apparatus and process are particularly applicable to generating secured or anonymized databases of bit-strings, so that the anonymized databases can be stored remotely from a corporation's computer system, or shared securely and confidentially with another company. The mapping key that permits the anonymized bit strings to be converted back to their original form need not be disclosed outside of the corporation. The methods also permit two companies to exchange molecular structure data securely and in a manner that permits similarity calculations to be performed within as well as between the respective databases.
    Type: Application
    Filed: June 17, 2013
    Publication date: June 5, 2014
    Applicant: OpenEye Scientific Software, Inc.
    Inventors: Robert W. Tolbert, Joseph J. Corkery, Anthony Nicholls, Kevin Schmidt, Brian Kelley
  • Publication number: 20140156593
    Abstract: The present technology relates to an information processing apparatus, an information processing method, and a program allowing a user to access a reference document or the like written inside an electronic document by only clicking on a description of the reference document. A storing unit that stores information of an electronic document, an extraction unit that extracts a sentence including the information stored in the storing unit from a predetermined electronic document, and a generation unit that generates a link to the information stored in the storing unit from the sentence extracted by the extraction unit are provided. Even in a case where the electronic document is a document that is formed as the electronic document through scanning, when the degree of matching between the sentence included in the electronic document and the information stored in the storing unit is high, the sentence and the information are associated with each other, and a link is established.
    Type: Application
    Filed: July 11, 2012
    Publication date: June 5, 2014
    Applicant: SONY CORPORATION
    Inventor: Kensuke Oonuma
  • Publication number: 20140156680
    Abstract: A user-interface method of selecting and presenting a collection of content items based on user navigation and selection actions associated with the content is provided. The method includes associating a relevance weight on a per user basis with content items to indicate a relative measure of likelihood that the user desires the content item. The method includes receiving a user's navigation and selections actions for identifying desired content items, and in response, adjusting the associated relevance weight of the selected content item and group of content items containing the selected item. The method includes, in response to subsequent user input, selecting and presenting a subset of content items and content groups to the user ordered by the adjusted associated relevance weights assigned to the content items and content groups.
    Type: Application
    Filed: February 7, 2014
    Publication date: June 5, 2014
    Applicant: VEVEO, INC.
    Inventors: Murali ARAVAMUDAN, Kajamalai G. RAMAKRISHNAN, Rakesh BARVE, Sashikumar VENKATARAMAN, Ajit RAJASEKHARAN
  • Patent number: 8745055
    Abstract: In order to clustering documents, document vectors are formed for each of a plurality of documents of a corpus and plurality of reference vectors is generated. The document vectors are then compared to the reference vectors to generate similarity values for each of the document vectors. The document vectors are then sorted based on the similarity values for the document vectors to form a sorted list. Clusters are then formed based on the similarity between adjacent document vectors in the sorted list.
    Type: Grant
    Filed: September 28, 2006
    Date of Patent: June 3, 2014
    Assignee: Symantec Operating Corporation
    Inventor: Eduardo Suarez
  • Patent number: 8745067
    Abstract: A system may include one or more databases to store comments relating to documents, the comments originating from first and second sources, where the comments from the first source include comments received from users via commenting functionality associated with browsers installed on client devices, and the comments from the second source include comments received from users independent of the commenting functionality associated with the browsers installed on the client devices. The system may also include one or more server devices to receive a request for comments relating to a particular document, search at least one of the one or more databases to identify comments relating to the particular document, and provide the identified comments for presentation in connection with the particular document.
    Type: Grant
    Filed: August 12, 2009
    Date of Patent: June 3, 2014
    Assignee: Google Inc.
    Inventors: Michal Cierniak, Donn Denman, Tony Hsieh, Derek Prothro, Marc Pawliger
  • Patent number: 8745068
    Abstract: Systems and methods of replacing digital assets within a multimedia document are provided. The systems and methods include a user workstation that can receive a selection from a user for an original asset in the document to be replaced. Alternative assets can be retrieved that have a level of appropriateness with the selected original asset. Constraints on use of the alternative assets can be determined and a fitness value of each of the alternative assets can be calculated based on the appropriateness and the constraints on use. The alternative assets with the highest fitness values can be presented to the user for the user to select to replace the original asset.
    Type: Grant
    Filed: October 13, 2009
    Date of Patent: June 3, 2014
    Assignee: Xerox Corporation
    Inventors: Tommaso Colombino, Robert John Rolleston, Luca Marchesotti
  • Patent number: 8744839
    Abstract: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of text segments to form a text data combination set; determining an intersection of the candidate word set and the text data combination set, the intersection comprising a plurality of text data combinations; determining a plurality of designated characteristic values for the plurality of text data combinations; based at least in part on the plurality of designated characteristic values and according to at least a criterion, recognizing among the plurality of text data combinations target words whose characteristic values fulfill the criterion.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: June 3, 2014
    Assignee: Alibaba Group Holding Limited
    Inventors: Haibo Sun, Yang Yang, Yining Chen
  • Patent number: 8745059
    Abstract: Aspects of the subject matter described herein relate to functions used for retrieving image results based on search queries. More specifically, image search queries can be pre-grouped or classified based on visual and semantic similarity. For example, a pairwise image similarity value for a pair of queries can be computed based on one or more of the sum of all of the overlapping the image results, the sum of the image distances between all of the pairs of images in the image results, and the rank of each of the images in the image results. The pairwise image similarity values can then be used to generate image query clusters. Each image query clusters can include a set of queries with high pairwise image similarity values. In some examples, a distance function can be determined for each image query cluster. This data can be used to provide image results.
    Type: Grant
    Filed: May 29, 2012
    Date of Patent: June 3, 2014
    Assignee: Google Inc.
    Inventors: Yushi Jing, Michele Covell, Stephen Conor Holiday
  • Publication number: 20140149427
    Abstract: A system and methods are provided for scoring assets for display in a tapestry interface. In one embodiment, a method includes identifying assets for display in a tapestry interface presentation, wherein the tapestry interface provides a presentation for a plurality of assets having relevance based sizing, arrangement of the assets based at least in part on a grid pattern, receiving data for identified assets of the tapestry presentation, and scoring assets based on the received data to determine presentation characteristics for the assets in the tapestry interface. The method may also include updating the presentation of the tapestry interface on a device based on the presentation characteristics.
    Type: Application
    Filed: November 26, 2012
    Publication date: May 29, 2014
    Inventors: Brad WILDER, Bradley James BRIZENDINE, Martin A. STEIN, Andre Wilhelm RABOLD, Farhang M. ZARRINKELK
  • Publication number: 20140149429
    Abstract: A computer-implemented method and system for Web search ranking are provided herein. The method includes generating a number of training samples from clickthrough data, wherein the training samples include positive query-document pairs and negative query-document pairs. The method also includes discriminatively training a translation model based on the training samples and ranking a number of documents for a Web search based on the translation model.
    Type: Application
    Filed: November 29, 2012
    Publication date: May 29, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Jianfeng Gao, Zhonghua Qu, Gu Xu
  • Publication number: 20140149431
    Abstract: Embodiments relate to relevance-based information processing. An aspect includes storing a history of display operations performed by a user on a first electronic file. Another aspect includes inputting display operations into the stored history performed by a user on the first electronic file. Another aspect includes calculating, using a plurality of calculating methods, a plurality of degrees of relevance of a second electronic file to the first electronic file based on the stored history. Another aspect includes calculating a synthesized degree of relevance by synthesizing the plurality of degrees of relevance of the second electronic file to the first electronic file. Another aspect includes displaying an input region on a display for inputting display operations on the first electronic file, and automatically displaying the second electronic file based on the synthesized degree of relevance of the second electronic file to the first electronic file exceeding a predetermined threshold.
    Type: Application
    Filed: October 10, 2013
    Publication date: May 29, 2014
    Applicant: International Business Machines Corporation.
    Inventors: Tomoka Mochizuki, Wen Lianzi
  • Publication number: 20140149378
    Abstract: A method for determining the significance of a web page, or a portion thereof, is disclosed. Accordingly, a search engine or some other application analyzes user-selected content portions (as well as user-provided comments associated with the portions) of a document to determine a document relevance score (e.g. Content Selection Rank) for the document containing the user-selected content portions. The particular algorithm for determining the document relevance score will vary depending upon the particular implementation, but may generally be based upon an analysis of the number and quality of user-selected portions, associated comments, the ratings of the user making the selections and the ratings of users contributing to interactions (such as sharing) with the portions. Based on this analysis, the document is assigned a document relevance score, which is used for processing the document in accordance with instructions associated with a search query.
    Type: Application
    Filed: January 30, 2014
    Publication date: May 29, 2014
    Inventor: Rohit Chandra
  • Publication number: 20140149428
    Abstract: A computerized method for identifying a document. A signature may be determined for a first document and compared with a signature for each of one or more additional documents. A document similarity score may be determined and one or more similar documents may be identified based on the document similarity score.
    Type: Application
    Filed: November 28, 2012
    Publication date: May 29, 2014
    Applicant: SAP AG
    Inventors: Godfrey Hobbs, Stefanie Rupp, Axel Gustav
  • Publication number: 20140149430
    Abstract: A method of detecting an overlapping community in a network including nodes and links between the nodes, includes calculating a similarity between the links, and generating a line graph of the network. The method further includes detecting one or more cores in the line graph, and growing a cluster for each of the one or more cores. The method further includes converting the cluster into a cluster of nodes of a node graph.
    Type: Application
    Filed: June 28, 2013
    Publication date: May 29, 2014
    Inventors: Seungwoo RYU, Sejeong KWON, Jae-Gil LEE, Sungsu LIM
  • Publication number: 20140149432
    Abstract: Described herein are systems and methods for selection-based contextual help retrieval. One example method involves (a) receiving first-query data including contextual data, the contextual data indicating a user-interface element type, a user-interface element location, and user-interface element text; (b) determining at least one first-query response based on at least the contextual data; and (c) causing an indication of the determined at least one first-query response to be provided via an output device.
    Type: Application
    Filed: June 18, 2012
    Publication date: May 29, 2014
    Applicant: UNIVERSITY OF WASHINGTON THROUGH ITS CENTER FOR COMMERCIALIZATION
    Inventors: Parmit K. Chilana, Andrew J. Ko, Jacob O. Wobbrock
  • Patent number: 8738636
    Abstract: The present invention relates to computer implemented methods and system for determining correspondences between terms in two or more ontologies. The methods and systems are designed to accept as inputs ontologies in Web Ontology Language (OWL) syntax or any other ontology syntax, to calculate a similarity measure between terms in the ontologies, extract an alignment based on this similarity measure, and verify this alignment according to the semantics contained in the ontologies. This process is designed to be executed iteratively until the similarity measures converge, or until another suitable finalization condition is met. The result of these methods and of the systems implementing these methods is an alignment between two or more ontologies establishing semantic correspondences between the terms in the ontologies.
    Type: Grant
    Filed: September 17, 2009
    Date of Patent: May 27, 2014
    Inventors: Yves Reginald Jean-Mary, Mansur Kabuka
  • Patent number: 8739032
    Abstract: A document analysis system receives multiple concepts along with multiple reference documents and generates sensory indicators that assist a researcher in assessing the relevance of each of the documents to the concepts. In one exemplary aspect, the document analysis system displays a table of keywords separated into blocks, each block of keywords corresponding to one of the concepts. Each block is colored according to the prevalence of any keyword within a given keyword group. The color of a block thus indicates the relative presence of a concept in the document. The document analysis system also determines a unique color for each block of keywords for highlighting in the text of the document. In this manner a researcher can quickly identify passages that contain multiple concepts. Additionally, the researcher is provided the ability to quickly locate reference characters, figure numbers and patent numbers in the document.
    Type: Grant
    Filed: October 12, 2010
    Date of Patent: May 27, 2014
    Inventor: Patrick Sander Walsh
  • Patent number: 8738635
    Abstract: Embodiments are directed to ranking search results using a junk profile. For a given corpus of documents, one or more junk profiles may be created and maintained. The junk profile provides reference metrics to represent known junk documents. For example, a junk profile may comprise a dictionary of document data that is automatically inserted into documents created using a particular system or template. A junk profile may also comprise one or more representations (e.g., histograms) of a distribution of a particular junk variable for known junk documents. The junk profile provides a usable representation of known junk documents, and the present systems and methods employ the junk profile to predict the likelihood that documents in the corpus are junk. In embodiments, junk scores are calculated and used to rank such documents higher or lower in response to a search query.
    Type: Grant
    Filed: June 1, 2010
    Date of Patent: May 27, 2014
    Assignee: Microsoft Corporation
    Inventors: Vladimir Tankovich, Dmitriy Meyerzon, Victor Poznanski
  • Patent number: 8732177
    Abstract: A search query is received. One or more listings is identified responsive to the search query. For each of the one or more of listings, the following are determined: a relevancy score based on one or more parameters in the search query, an expected click through rate, and at least one of a content density boost that is based on one or more fields that are included in or excluded from the listing and a geography type boost that is based on a comparison of one or more geography parameters of the query to one or more geography parameters of the listing. For each of the one or more listings, a performance score is calculated based on the relevancy score, the expected click through rate, and at least one of the content density boost and the geography type boost.
    Type: Grant
    Filed: April 26, 2010
    Date of Patent: May 20, 2014
    Assignee: JPMorgan Chase Bank, N.A.
    Inventors: Sivakumar Chinnasamy, Jeesmon Jacob, Tsu-Jung Kung, Than Kim-Thi Nguyen
  • Publication number: 20140136543
    Abstract: A system that provides secure autocomplete searching receives an autocomplete query from a user, the autocomplete query including a prefix of a search phrase, and retrieves security information of the user. The system searches one or more prefix indexes to find a set of matching objects, where the matching objects each include associated object security information. The system excludes matching objects that the user is not authorized to access from the set of matching objects based on the object security information and the user security information. The system then returns the set of matching objects to the user.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 15, 2014
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Kurt FRIEDEN, Don L. HAYLER, Michael RICHARDS, Vasif SHAIKH
  • Publication number: 20140136550
    Abstract: A vehicle identification number (VIN) decoder (VDC) implementing a unique VIN decoding method may, for a given VIN, shorten the VIN and form a stem and a leaf therefrom. Utilizing the stem, the VDC may operate to find matching leaf values, if any, from a set of look up tables. Depending upon a match outcome, one or more trim identification code (TIC) values can be assigned to the VIN and a candidate list can be constructed utilizing the assigned TIC value(s). The candidate list, which can be optimized, may contain one or more candidate trims for the VIN. For each candidate trim, a confidence score and a match probability can be generated. The VDC may provide decoded information containing trim data associated with at least one of the one or more candidate trims for the VIN to a client device over a network connection.
    Type: Application
    Filed: January 17, 2014
    Publication date: May 15, 2014
    Applicant: TrueCar, Inc.
    Inventors: Thomas J. Sullivan, Michael D. Swinson