Fuzzy Searching And Comparisons Patents (Class 707/780)
  • Patent number: 8799318
    Abstract: A function module allows fuzzy searching of data stored in an environment lacking inherent fuzzy search capability, by leveraging a native fuzzy search capability of an underlying database. The function module receives the data that is to be searched, as well as search terms/parameters. The function module creates a temporary table in the database, populates the table with the corresponding data, and executes the fuzzy search on the table according to the terms/parameters utilizing the database layer's native fuzzy search capability. After returning the fuzzy search result, the function module removes the table from the database. One embodiment implements the function module with the Advanced Business Application Program (ABAP) environment available from SAP AG, which lacks an inherent ability to perform fuzzy searching. That function module leverages native fuzzy search capability of an underlying in-memory HANA database architecture database available from SAP AG.
    Type: Grant
    Filed: July 27, 2012
    Date of Patent: August 5, 2014
    Assignee: SAP AG
    Inventor: Daniel Jakobs
  • Patent number: 8799316
    Abstract: A method for context-based query formulation and information retrieval and aggregation is described. The method includes modeling one or more workflow activities utilized to perform work tasks, preparing at least one meta-querying template, to generate queries that utilize the modeled workflow activities, retrieving information relevant to the work task as determined utilizing the at least one meta-querying template, and aggregating the retrieved information for presentation to the user.
    Type: Grant
    Filed: May 24, 2011
    Date of Patent: August 5, 2014
    Assignee: The Boeing Company
    Inventors: Ali Bahrami, Jun Yuan
  • Publication number: 20140214816
    Abstract: A system and method of discovering provider contact data is provided. Provider connectivity data can be built and maintained in a data-store. Once a query is received for identifying a provider contact's data, connectivity data for the querying data is also built. Then the provider connectivity data is searched on the basis of the seeker connectivity data. One or more provider contacts are identified based on query criteria. The provider connectivity data includes a first level provider association data and a first level provider relations data. The seeker connectivity data includes a first level seeker association data and a first level seeker relations data. The connectivity data can be obtained from a variety of sources including contacts databases and third party service providers. Third party service providers include social networks and information aggregators.
    Type: Application
    Filed: March 14, 2013
    Publication date: July 31, 2014
    Applicant: 2306748 ONTARIO INC.
    Inventors: Paul L.C. CHEN, Christopher Lawrence TRUDEAU
  • Patent number: 8793277
    Abstract: Embodiments of the inventive concept can extract digital document information related with a specific individual to achieve a work load reduction associated with evidentiary material preparation for litigation. Recorded digital information can be displayed and user-specifying information can be set for each of a plurality of document files. The user-specifying information shows which user contained in user information one or more document files is related with. A recording unit can record the set user-specifying information. At least one user is selected, and a document file where user-specifying information which corresponds to the selected user was set is searched. Additional information showing whether or not the searched document file is related with the litigation is set via a display unit. A document file which is related with litigation is outputted based on the additional information.
    Type: Grant
    Filed: March 24, 2011
    Date of Patent: July 29, 2014
    Assignee: UBIC, Inc.
    Inventors: Masahiro Morimoto, Yoshikatsu Shirai, Hideki Takeda
  • Patent number: 8782082
    Abstract: One embodiment relates to a computer-implemented method for multiple-keyword matching performed using a computer including at least a processor, data storage, and computer-readable instructions. A keyword set and a text input to be searched are obtained. The keyword set is processed to create a reverse trie. A search procedure which starts from the end of the text is then applied using the reverse trie to find keyword occurrences in the text input. Other embodiments, aspects, and features are also disclosed.
    Type: Grant
    Filed: November 7, 2011
    Date of Patent: July 15, 2014
    Assignee: Trend Micro Incorporated
    Inventors: Qiuer Xu, Liwei Ren
  • Patent number: 8782083
    Abstract: Dynamic sourcing, in which a data request that is associated with a query is received and a parameter of data needed for satisfaction of the query is identified. Parameter information defining data available in at least one cube stored in a cache is accessed and the parameter is compared with the parameter information. Based on comparison results, it is determined whether one or more cubes in the cache include sufficient data to satisfy the query. In response to a determination that one or more cubes include sufficient data to satisfy the query, a response to the data request is generated by executing the query against the one or more cubes. In response to a determination that the cubes do not include sufficient data to satisfy the query, a response to the data request is generated by executing at least a portion of the query against a database system.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: July 15, 2014
    Assignee: MicroStrategy Incorporated
    Inventors: Scott Cappiello, Xun Feng, Yuliyan Kiryakov, Jun Yuan
  • Publication number: 20140195548
    Abstract: Methods and systems to identify video content based on video fingerprint matching are described. In some example embodiments, the methods and systems generate a query fingerprint of a frame of video content captured at a client device, query a database of reference fingerprints, determine the query fingerprint of the frame of captured video content matches a reference fingerprint, and identify the video content based on the match of fingerprints.
    Type: Application
    Filed: January 7, 2013
    Publication date: July 10, 2014
    Inventor: Wilson Harron
  • Patent number: 8775467
    Abstract: A method and a mobile device comprising an address linking module assess a segment of text as comprising an address and create a link. The method comprises: searching a text for a segment of text having at least two character strings satisfying a proximity constraint, each character string being of a different predefined address indicator type; assessing whether or not the segment comprises an address; displaying at least a portion of the text comprising the segment on a display of a mobile device; and if the segment is assessed as comprising an address, including a link for display, the link pointing to at least one application.
    Type: Grant
    Filed: April 29, 2009
    Date of Patent: July 8, 2014
    Assignee: Blackberry Limited
    Inventors: Ronald Anthony Dicke, Michael Majid, Ngoc Bich Ngo, Hartmuth Gutsche, Xiaming Xi
  • Publication number: 20140188936
    Abstract: Methods and systems for knowledge discovery and organization employ a relational meta model and domain context-based knowledge inference engine to produce answers to queries that involve inferences among items stored as knowledge in a knowledgebase.
    Type: Application
    Filed: March 4, 2014
    Publication date: July 3, 2014
    Inventors: Ajay Manoj Rambhia, Henri P. Wiazowski, Reginald L. Bravo
  • Publication number: 20140188898
    Abstract: Apparatus and method searching data in a multi record data structure with at least first and second criteria where criteria are selected for search to preferentially select criteria for speeding search based on criteria from other records.
    Type: Application
    Filed: December 31, 2013
    Publication date: July 3, 2014
    Inventor: Daniel Esbensen
  • Patent number: 8768933
    Abstract: The subject application is directed to a system and method for type-ahead address lookup employing historically weighted address placement. A prompt is generated on a display for commencement of a new search operation and search data of text entries is received via a user interface. Entries are stored in an associated database, each entry having at least one searchable text field. At least a first character of a new search received via the user interface is tested against the entries relative to the searchable field. A display is generated corresponding to a subset of the entries based upon a testing output. Selection data is received corresponding to a selected entry from the displayed subset and weighting data is generated corresponding to received selection data. Displayed entries are ordered corresponding to the subset of database entries upon subsequent re-entry of the at least a first character during a subsequent search operation.
    Type: Grant
    Filed: February 5, 2009
    Date of Patent: July 1, 2014
    Assignees: Kabushiki Kaisha Toshiba, Toshiba Tec Kabushiki Kaisha
    Inventors: Michael Yeung, Hongfeng (Jason) Wei
  • Publication number: 20140181147
    Abstract: A method of searching a computer database containing a plurality of stored DNA profiles is provided. The method involves generating a search profile formed of two or more allele identities for each of one or more loci, at least one of the allele identities having a limited range of values, with the search profile being compared against the one or more stored DNA profiles from a database to establish matches between the search and stored profile.
    Type: Application
    Filed: October 7, 2013
    Publication date: June 26, 2014
    Applicant: Forensic Science Service Limited
    Inventor: Martin BILL
  • Patent number: 8762297
    Abstract: Architecture introduces a new pattern operator referred to as called an augmented transition network (ATN), which is a streaming adaptation of non-reentrant, fixed-state ATNs for dynamic patterns. Additional user-defined information is associated with automaton states and is accessible to transitions during execution. ATNs are created that directly model complex pattern continuous queries with arbitrary cycles in a transition graph. The architecture can express the desire to ignore some events during pattern detection, and can also detect the absence of data as part of a pattern. The architecture facilitates efficient support for negation, ignorable events, and state cleanup based on predicate punctuations.
    Type: Grant
    Filed: May 17, 2010
    Date of Patent: June 24, 2014
    Assignee: Microsoft Corporation
    Inventors: Badrish Chandramouli, Jonathan D. Goldstein, David Maier, Mohamed H. Ali, Roman Schindlauer
  • Patent number: 8756249
    Abstract: Techniques for searching data in a storage system are described herein. In one embodiment, in response to a request for searching target data in a storage system, first representative data for the target data being searched are generated by applying a predetermined algorithm to at least a portion of the target data. The first representative data are searched and compared with second representative data representing one or more data sets stored in the storage system. It is indicated a likelihood that the target data or similar content has been found in the storage system based on the search and comparison.
    Type: Grant
    Filed: August 23, 2011
    Date of Patent: June 17, 2014
    Assignee: EMC Corporation
    Inventors: Grant Wallace, Philip N. Shilane, Frederick Douglis
  • Publication number: 20140164434
    Abstract: When processing data tuples, operators of a streaming application may identify certain tuples as being relevant. To determine relevant tuples, the operators may, for example, process the received tuples and determine if they meet certain thresholds. If so, the tuples are deemed relevant, but if not they are characterized as irrelevant. The streaming application may use a pattern detector to parse the relevant data tuples to identify a pattern, such as a shared trait between the tuples. Based on this commonality, the pattern detector may generate filtering criteria that may be used to process subsequently received tuples. In one embodiment, the filtering criteria identified by one operator is transmitted to a second operator to be used to process tuples received there. Thus, once one of the operators determines a pattern, the operator generates filtering criteria that another, related operator uses for filtering received tuples.
    Type: Application
    Filed: December 10, 2012
    Publication date: June 12, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Michael J. Branson, John M. Santosuosso
  • Patent number: 8745086
    Abstract: A computer implemented method of analyzing and graphically representing the correlation of a plurality of transaction items, the method comprising the steps of: retrieving data associated with groups of the transaction items, correlating a plurality of groups of transaction items in a dimensionally reduced manner, creating a tree hierarchy which classifies the groups of transaction items in a hierarchy according to a defined user understandable factor, wherein the tree hierarchy is linked to the groups of transaction items, and graphically representing the correlated groups of transaction items and tree hierarchy to enable interaction between the correlated groups of transaction items and the linked tree hierarchy.
    Type: Grant
    Filed: December 4, 2009
    Date of Patent: June 3, 2014
    Assignee: New BIS Safe Luxco S.รก.r.l.
    Inventors: Andrew John Cardno, Peter Stewart Ingham, Bart Andrew Lewin, Ashok Kumar Singh
  • Patent number: 8738653
    Abstract: The present invention is and includes a device, system and method for providing an image enhancement widget. The device, system and method include a javascript component that, upon execution, obtains at least one subject of primary content on a networked page, and at least one permission for enhancement of the primary content, ones of a plurality of content produced remotely from the javascript component and according to the javascript component, and an enhancement widget. The enhancement widget may be a flash widget.
    Type: Grant
    Filed: July 11, 2011
    Date of Patent: May 27, 2014
    Assignee: Brand Affinity Technologies, Inc.
    Inventors: Chad Steelberg, Ryan Steelberg
  • Patent number: 8738651
    Abstract: A technique for cataloging documents based on user activity includes assigning documents to a relevant document list based on activity of a user of a device. In this case, at least two of the documents are associated with different applications. The technique then provides the relevant document list to the user.
    Type: Grant
    Filed: March 6, 2008
    Date of Patent: May 27, 2014
    Assignee: Lenovo (Singapore) Pte Ltd
    Inventors: Jennifer G. Zawacki, David C. Challener, Justin T. Dubs, James J. Thrasher
  • Patent number: 8732186
    Abstract: A computer-implemented method and system for enabling communication between networked users based on search queries and common characteristics is disclosed. Particular embodiments relate to receiving a search query from a first user and establishing a communication link between the first user and a second user based on the first user's search query. Particular embodiments relate to receiving a first search query from a first user, receiving a second search query from a second user, determining if the first user and the second user fit within match criteria, and establishing a communication link between the first user and the second user if the first user and the second user fit within match criteria.
    Type: Grant
    Filed: October 26, 2006
    Date of Patent: May 20, 2014
    Inventor: Peter Warren
  • Patent number: 8725766
    Abstract: A method and a system are provided for searching content (e.g., text, metadata and/or a fingerprint, etc.). In one example, the system receives content and a query for matching the content. The content includes computer readable data. The system generates a feature vector for the content. Generating the feature vector comprises generating a signal from the content, generating a spectrogram from the signal, and generating the feature vector from the spectrogram. The system searches for at least one feature vector that matches the feature vector for the content.
    Type: Grant
    Filed: March 25, 2010
    Date of Patent: May 13, 2014
    Assignee: Rovi Technologies Corporation
    Inventors: Joonas Asikainen, Kenneth Olson
  • Patent number: 8725841
    Abstract: Data indicates characteristics of a user's multiple media files. The multiple media files are associated with a media library. At least one of the multiple media files matches content in a master media file. The content in the matching media file is of a quality that is lower than the quality of the master media file. The user can provide payment for access to the master media file and, if the user does so, the master media file is associated with the media library and the user is provided with access to the master media file.
    Type: Grant
    Filed: September 30, 2011
    Date of Patent: May 13, 2014
    Assignee: Google Inc.
    Inventor: David L. Sparks
  • Publication number: 20140122532
    Abstract: A computer-implemented method and computing system for receiving, on a computing device, a tag associated with a first user concerning a first image within a social network. The method may be further configured to scan the first image to identify whether a human face is present in the first image. If the human face is identified, the method may be configured to compare the human face to that of the first user. If the human face is determined to be that of the first user, the method may be configured to allow the first image to be displayed in a social networking application associated with a second user, wherein the first user and second user are connected within the social network.
    Type: Application
    Filed: October 31, 2013
    Publication date: May 1, 2014
    Applicant: Google Inc.
    Inventors: Tomasz Charytoniuk, Doug Sherrets
  • Publication number: 20140122531
    Abstract: A computer-implemented method and computing system for comparing, on a computing device, data concerning a first image within a social network to data concerning a plurality of images within the social network. A subset of similar images is identified, chosen from the plurality of images, based, at least in part, upon the comparison. At least a portion of the subset is presented to a computing device associated with a user.
    Type: Application
    Filed: October 30, 2013
    Publication date: May 1, 2014
    Applicant: Google Inc.
    Inventors: SCOTT ZUCCARINO, Doug Sherrets, Yumio Saneyoshi
  • Patent number: 8712977
    Abstract: A computer-readable recording medium stores therein an information retrieval program that causes a computer to execute a retrieval process in which files to be retrieved are narrowed down by using a bit string for each character in the files to find characters making up a retrieval keyword to retrieve a keyword identical to or related to the retrieval keyword in the files to be retrieved. The bit strings indicate the presence of the characters in the files. The information retrieval program causes the computer to execute extracting, from among the bit strings, a bit string of an arbitrary character; and compressing the extracted bit string, by using a special Huffman tree having leaves of plural types of symbol strings covering patterns represented by a predetermined number of bits and a special symbol string having a number of bits greater than the predetermined number of bits.
    Type: Grant
    Filed: November 20, 2009
    Date of Patent: April 29, 2014
    Assignee: Fujitsu Limited
    Inventors: Masahiro Kataoka, Masahiro Kurishima, Takashi Tsubokura, Ryouta Komatsu
  • Patent number: 8713030
    Abstract: A video editing apparatus 100 includes a registering unit 91 configured to register a key candidate having a feature vector of a sound signal which is determined to be registered on the basis of a co-occurrence score to a managing unit 51 as a search key, and a cutting out unit 71 configured to obtain an integration score in each of the blocks from the degree of similarity of the registered search key in each of the blocks and cut out a group of blocks exceeding an integration threshold value from among the integration scores as one video scene.
    Type: Grant
    Filed: June 5, 2009
    Date of Patent: April 29, 2014
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Kazunori Imoto, Makoto Hirohata, Hisashi Aoki
  • Patent number: 8713541
    Abstract: Implementations of the present disclosure include methods, systems, and computer-readable storage mediums for identifying matching elements between a source model and a target model comprising receiving a source model and a target model, the source model and the target model each being stored in computer-readable memory; processing the source model and the target model to generate a plurality of similarity values, each similarity value being associated with an element of the source model and an element of the target model; generating a similarity value construct based on the plurality of similarity values and elements of the source model and the target model; and identifying matching elements between the source model and the target model based on the similarity value construct.
    Type: Grant
    Filed: December 29, 2011
    Date of Patent: April 29, 2014
    Assignee: SAP AG
    Inventors: Birgit Grammel, Stefan Kastenholz
  • Patent number: 8713008
    Abstract: An information processing apparatus includes the following elements. A feature amount extraction unit extracts feature amounts from a content block. An extraction unit extracts predetermined scenes from the content block using the feature amounts extracted by the feature amount extraction unit. An acquisition unit acquires information for retrieving the content block. A retrieval unit retrieves a scene that meets the information acquired by the acquisition unit from among the scenes extracted by the extraction unit. A presentation unit presents the content block including the scene retrieved by the retrieval unit as a result of retrieval.
    Type: Grant
    Filed: September 29, 2008
    Date of Patent: April 29, 2014
    Assignee: Sony Corporation
    Inventor: Daisuke Negi
  • Patent number: 8706758
    Abstract: Disclosed are improvements to a method for account reconciliation comprising improved, extended, and more flexible algorithms for (1) automatically determining what transaction features are best candidates for matching diverse datasets; (2) automatically determining how logically to subdivide accounting datasets prior to reconciliation; (3) matching groups of transactions (allowing one-to-many, many-to-one, and many-to-many matches instead of just one-to-one matches); (4) making use of more types of transaction feature, including transaction dates (where proximity of two transactions in date may be significant even if the dates do not exactly match). The improved method is, therefore, better able to perform its intended function of identifying matching transactions. It is applicable to a wider class of problems while still saving significant costs and labor, and still retaining flexibility in not requiring source data in a particular format, and not being domain-dependent or requiring extensive user setup.
    Type: Grant
    Filed: February 20, 2012
    Date of Patent: April 22, 2014
    Assignee: Galisteo Consulting Group, Inc.
    Inventor: Peter A. Chew
  • Patent number: 8682927
    Abstract: Embodiments of the inventive concept can extract digital document information related with a specific individual to achieve a work load reduction associated with evidentiary material preparation for litigation. Recorded digital information can be displayed and user-specifying information can be set for each of a plurality of document files. The user-specifying information shows which user contained in user information one or more document files is related with. A recording unit can record the set user-specifying information. At least one user is selected, and a document file where user-specifying information which corresponds to the selected user was set is searched. Additional information showing whether or not the searched document file is related with the litigation is set via a display unit. A document file which is related with litigation is outputted based on the additional information.
    Type: Grant
    Filed: March 24, 2011
    Date of Patent: March 25, 2014
    Assignee: UBIC, Inc.
    Inventors: Masahiro Morimoto, Yoshikatsu Shirai, Hideki Takeda
  • Patent number: 8676841
    Abstract: Techniques for detecting recurring non-occurrences of an event. In one embodiment, techniques are provided for detecting the non-occurrence of an event within each of a series of time periods following the occurrence of another event. Language extensions are provided that enable queries to be formulated for detecting recurring non-occurrence of an event following occurrence of a triggering event.
    Type: Grant
    Filed: August 26, 2009
    Date of Patent: March 18, 2014
    Assignee: Oracle International Corporation
    Inventors: Anand Srinivasan, Rakesh Komuravelli, Shailendra Mishra
  • Patent number: 8676813
    Abstract: A method for selecting a subset of information to communicate to others from a set of information comprising a plurality of content items. In accordance with the method, the set of information is stored in a user retrievable format, a relative priority is assigned to each of the plurality of content items, and the subset of information is automatically generated by selecting a predetermined number of the plurality of content items from the set of information based on the relative priorities of each of the plurality of content items. The predetermined number is less than the number of said plurality of content items and the subset of information is a prioritized subset of the set of information. A system, and a computer readable medium carrying computer readable instructions for carrying out the method are also disclosed.
    Type: Grant
    Filed: September 14, 2011
    Date of Patent: March 18, 2014
    Inventor: Denis J. Alarie
  • Patent number: 8671112
    Abstract: A system for automated classification of an image of an electronic document such as a facsimile document. The image is converted to a textual representation, and at least some of the terms in the textual representation may be associated with one or more predefined classification types, thereby enabling the document to be classified, and for multi-page documents, determining boundaries used to split the document into sections. The development of associations between terms and classification types may result from providing, to the system, a training set of manually-classified documents. A training module analyzes the training set to calculate probabilities that particular terms may appear in documents of a particular classification type. Probabilities established during training are used during automated document processing to assign a classification type to the document. A confidence score associated with the assigned classification type provides a metric for assessing the accuracy of the automated process.
    Type: Grant
    Filed: June 12, 2008
    Date of Patent: March 11, 2014
    Assignee: athenahealth, Inc.
    Inventors: Anshul Amar, JoRel Sallaska Nye
  • Patent number: 8671109
    Abstract: A method to detect video copying based on content. The method comprises providing a set of reference data elements derived from a set of reference video frames in a reference video stream; providing a set of query data elements derived from a set of query video frames in a query video stream, each of the query data elements having a corresponding query data element identifier; associating with each of the reference data elements a fingerprint selected from among the query data element identifiers; and determining a similarity measure for the query video stream relative to the reference video stream by a comparison of the query data element identifiers to the fingerprints.
    Type: Grant
    Filed: December 2, 2011
    Date of Patent: March 11, 2014
    Assignee: CRIM (Centre de Recherche Informatique de Montreal)
    Inventors: Vishwa N. Gupta, Parisa Darvish Zadeh Varcheie
  • Publication number: 20140067863
    Abstract: To identify a media item from a database of media items that have common content, a region of interest is defined to include a plurality of frames of a test fingerprint that correspond to different segments of a media item. A media identification system queries a database of reference fingerprints to identify candidate reference fingerprints that contain a frame that matches a frame of the test fingerprint. When a candidate reference fingerprint is found, additional matching frames are determined and the region of interest is reduced to eliminate the matched frames of the test fingerprint. This continues until the region of interest is empty or there are no further matching candidates. Once the set of candidate reference fingerprints are identified, the media identification system compares the test fingerprint to the candidates to determine a closest match, thereby identifying the media item associated with the test fingerprint.
    Type: Application
    Filed: November 4, 2013
    Publication date: March 6, 2014
    Applicant: Yahoo! Inc.
    Inventor: Sergiy Bilobrov
  • Publication number: 20140067862
    Abstract: This specification describes technologies relating to fixed width encoding/decoding of document posting lists. In general, one aspect of the subject matter described in this specification can be embodied in apparatuses that include a server obtaining a list of one or more of document identification numbers, each of the document identification numbers uniquely identifying a document; an encoding device operatively connected to the server, the encoding device generating a sequence of deltas from the sequential list of one or more of the document identification numbers, and encoding each delta in the sequence of deltas using a fixed-width encoding scheme.
    Type: Application
    Filed: June 3, 2011
    Publication date: March 6, 2014
    Inventors: Priyendra DESHWAL, Srdjan PETROVIC, Asim SHANKAR
  • Patent number: 8667015
    Abstract: Disclosed is a method of automatically extracting data from a target web page, comprising selecting (302) data in a source web page; determining (304) the respective DOM (document object model) trees of the source and target web page, and identifying the one or more nodes comprising the selected data in the source web page DOM tree; determining (306) matching paths in the respective DOM trees; for selected data in a node of an unmatched branch of the source web page DOM tree, identifying (308) the nearest matched path in the source web page; identifying (310) the unmatched branch nearest to the corresponding matched path in the target web page; determining (312) if said identified unmatched branch in the target web page DOM tree comprises a target node matching the selected data node; and if so: extracting (322) data from the target node if the mismatch between the respective unmatched branches does not exceed a predefined threshold.
    Type: Grant
    Filed: November 25, 2009
    Date of Patent: March 4, 2014
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Li-Mei Jiao, Yuhong Xiong
  • Patent number: 8666998
    Abstract: A method, system and computer program product provides a first characteristic associated with a first data set and a single data value, and a second characteristic associated with a second data set; and calculates at least one of: 1) the similarity of the first data set with the second data set based on the first and second characteristics, 2) the similarity of the first data set with the single data value based on the first characteristic and the single data value, 3) confidence indicating how well the first characteristic reflects properties of the first data set based on the first characteristic, and 4) confidence indicating how well the similarity of the first data set with the single data value reflects properties of the single data value based on the first characteristic and the single data value.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: March 4, 2014
    Assignee: International Business Machines Corporation
    Inventors: Sebastian Nelke, Martin A Oberhofer, Yannick Saillet, Jens Seifert
  • Publication number: 20140059079
    Abstract: A file search apparatus having a setting unit configured to set, as search conditions for specifying a file to be searched for, a plurality of pieces of attribute information and relationship information about a relationship between files, a first search unit configured to search for a file having at least one of the pieces of attribute information set by the setting unit, a second search unit configured to search for a plurality of files, among the files found by the first search unit, which satisfy a condition based on the relationship information set by the setting unit, and an output unit configured to output, as a search result, the plurality of files found by the second search unit.
    Type: Application
    Filed: August 7, 2013
    Publication date: February 27, 2014
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Hiroto Oka
  • Patent number: 8655913
    Abstract: The subject technology discloses techniques for locating an element in a document object model (DOM) tree structure based on fuzzy matching of attributes of the element and the relative positioning of other elements in the DOM tree structure. For instance, different attributes for searching an element in a DOM tree structure are received. The subject technology determines a location of an element in a DOM tree structure based on the plurality of attributes. A relative location of the element in the DOM tree structure is then determined if determining the location of the element is unsuccessful based on the plurality of attributes. In one example, the relative location of the element is based on fuzzy matching according to a predetermined percentage of one or more matching attributes and based on respective positions of one or more elements in the DOM tree structure.
    Type: Grant
    Filed: March 26, 2012
    Date of Patent: February 18, 2014
    Assignee: Google Inc.
    Inventors: Tejas Arvindkumar Shah, Po Hu
  • Patent number: 8655912
    Abstract: A computer-implemented method and system for combining keywords into logical clusters that share a similar behavior with respect to a considered dimension are disclosed. Various embodiments are operable to order a list of keywords from high activity to low activity, partition the list into at least two sets, a head partition including keywords with an activity level above a predefined threshold, a tail partition including the remainder of the keywords in the list, model the keywords in the head partition based on a set of variables, score the keywords in the head partition based on the modeling, and cluster head partition keywords with tail partition keywords having at least one common variable into at least one keyword cluster.
    Type: Grant
    Filed: August 20, 2010
    Date of Patent: February 18, 2014
    Assignee: eBay, Inc.
    Inventors: Xiaofeng Tang, Salvador Duran, Joel R. Minton
  • Publication number: 20140046967
    Abstract: Apparatus and methods are described herein for recognizing repeated patterns in audio data. An audio input is compared to search terms to determine whether a match can be found. Repeated machine-generated audio presented in telephone calls that support Integrated Voice Response (IVR) applications are identified and used to analyze the performance of IVR systems.
    Type: Application
    Filed: November 22, 2011
    Publication date: February 13, 2014
    Applicant: Listening Methods, LLC
    Inventors: Jim Nash-Walker, Greg Borton
  • Publication number: 20140040313
    Abstract: A system and method of record matching using regular expressions and finite state representations. In this manner, the time (or computational effort) involved in record matching is reduced.
    Type: Application
    Filed: August 2, 2012
    Publication date: February 6, 2014
    Applicant: SAP AG
    Inventors: Mohammad Shami, Kevin Wright
  • Patent number: 8645418
    Abstract: A method and an apparatus for word quality mining and evaluating are disclosed. The method includes: calculating a Document Frequency (DF) of a word in mass categorized data; evaluating the word in multiple single-aspects according to the DF of the word; and evaluating the word in multiple aspects according to the multiple single aspect evaluations to obtain an importance weight of the word. According to the solution of the present invention, the importance of the word in the mass categorized data may be evaluated, and words with high quality may be obtained through an integrated evaluation.
    Type: Grant
    Filed: May 7, 2012
    Date of Patent: February 4, 2014
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Huaijun Liu, Zhongbo Jiang, Gaolin Fang
  • Patent number: 8645404
    Abstract: A split data word including a portion of each of two word-aligned data words stored at two word-aligned address boundaries within a memory is read from a displaced-read memory address relative to the word-aligned address boundaries within the memory. The portions of each of the two word-aligned data words within the split data word are compared with corresponding portions of a word-aligned search pattern. A determination is made that a potential complete match for the word-aligned search pattern exists within at least one of the two word-aligned data words based upon an identified match of at least one of the portions of the two word-aligned data words within the split data word with a corresponding at least one portion of the word-aligned search pattern.
    Type: Grant
    Filed: October 21, 2011
    Date of Patent: February 4, 2014
    Assignee: International Business Machines Corporation
    Inventors: K. S. Sadananda Aithal, Ajay K. Sami
  • Publication number: 20140032598
    Abstract: A function module allows fuzzy searching of data stored in an environment lacking inherent fuzzy search capability, by leveraging a native fuzzy search capability of an underlying database. The function module receives the data that is to be searched, as well as search terms/parameters. The function module creates a temporary table in the database, populates the table with the corresponding data, and executes the fuzzy search on the table according to the terms/parameters utilizing the database layer's native fuzzy search capability. After returning the fuzzy search result, the function module removes the table from the database. One embodiment implements the function module with the Advanced Business Application Program (ABAP) environment available from SAP AG, which lacks an inherent ability to perform fuzzy searching. That function module leverages native fuzzy search capability of an underlying in-memory HANA database architecture database available from SAP AG.
    Type: Application
    Filed: July 27, 2012
    Publication date: January 30, 2014
    Applicant: SAP AG
    Inventor: Daniel Jakobs
  • Publication number: 20140019486
    Abstract: The embodiments herein relate to multi pattern searching and, more particularly, to multi pattern search or multi pattern matching using logic content processing. The input pattern is type cast to a Boolean alphabet and is then processed to create a corresponding signature set. Further, the signature set is divided into subsets and a Boolean logic function representing each signature subset is created. Further, the values of each subset are simultaneously compared with windows of an input data steam or data file to find a match. If a match is found, the system returns a hit, else a miss. Parallel stages may be added to enhance performance of the system, as multiple inputs may be processed at a time.
    Type: Application
    Filed: June 19, 2013
    Publication date: January 16, 2014
    Inventor: Amitava Majumdar
  • Patent number: 8631035
    Abstract: A method to support efficient, interactive, and fuzzy search on text data includes an interactive, fuzzy search on structured data used in applications such as query relaxation, autocomplete, and spell checking, where inconsistencies and errors exist in user queries as well as data. It utilizes techniques to efficiently and interactively answer fuzzy queries on structured data to allow users to efficiently search for information interactively, and they can find records and documents even if these records and documents are slightly different from the user keywords.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: January 14, 2014
    Assignee: The Regents of the University of California
    Inventors: Chen Li, Shengyue Ji, Guoliang Li, Jiannan Wang, Jianhua Feng
  • Patent number: 8631036
    Abstract: This invention relates to an advertisement machine which provides advertisements to a user searching for desired information within a data network. The machine receives, from a user, a search request including a search argument corresponding to the desired information and searches, based upon the received search argument, a first database having data network related information to generate search results. It also correlates the received search argument to a particular advertisement in a second database having advertisement related information. The search results together with the particular advertisement are provided by the machine to the user.
    Type: Grant
    Filed: December 21, 2012
    Date of Patent: January 14, 2014
    Assignee: Rockstar Consortium US LP
    Inventors: Richard Prescott Skillen, Frederick Caldwell Livermore
  • Publication number: 20140012880
    Abstract: A method and system for image search, the method comprising: receiving an indication regarding at least one feature of at least one image from a collection of images; creating an updated search algorithm according to the indication; and providing an updated collection of images by using the updated search algorithm.
    Type: Application
    Filed: September 12, 2013
    Publication date: January 9, 2014
    Applicant: A.L.D SOFTWARE LTD
    Inventors: Zigmund Bluvband, Sergey Porotsky, Alexander Dubinsky
  • Publication number: 20140012879
    Abstract: A DBMS is configured to identify a Boolean expression and conditional expressions from a search query, and to extract for each of the identified conditional expressions the value of a record ID conforming to a conditional expression. The DBMS is configured to change a conformity result value corresponding to the extracted record ID and conditional expression to a first value signifying conformity, in Boolean expression determination information, which is information that includes a determination set having a record ID value and a plurality of conformity result values respectively corresponding to a plurality of conditional expressions, and, on the basis of the Boolean expression, to perform logical operations on the plurality of conformity result values of a determination set, for each determination set in the Boolean expression determination information.
    Type: Application
    Filed: June 3, 2011
    Publication date: January 9, 2014
    Applicant: Hitachi, Ltd.
    Inventors: Shinsuke Hamada, Yasuhiro Tahara, Kouji Kimura