Fuzzy Searching And Comparisons Patents (Class 707/780)

Method and apparatus for processing electronically stored information for electronic discovery

Patent number: 8626737

Abstract: Method and apparatus for processing electronically stored information (ESI) for electronic discovery are described. In some examples, an electronic analysis of documents in the ESI indicated as being responsive to a search query of the ESI is performed. Results of the electronic analysis are stored in a database to provide a repository of case knowledge. Search parameters for at least one additional search query are generated automatically based on the case knowledge. The search parameters are provided as output.

Type: Grant

Filed: December 30, 2011

Date of Patent: January 7, 2014

Assignee: Symantec Corporation

Inventor: Sunil Sharad Panse
Double weighted correlation scheme

Patent number: 8626772

Abstract: A method is provided for determining a correlation between a reference user and another user on the basis of two sets of ratings, where each rating is associated with a respective user. In response to a trigger, user ratings associated with the reference user and user ratings associated with the other user are collected, and all co-rated items of these two sets are correlated on the basis of an adjusted cosine correlation function which is weighted by a first and a second weighting function. The correlation is then stored and may be repeated for a plurality of users. The stored correlations may be used, e.g., for ranking purposes.

Type: Grant

Filed: September 29, 2008

Date of Patent: January 7, 2014

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventors: Mattias Lidstrom, Jonas Bjork, Joakim Soderberg
RECIPROCAL DATA FILE PUBLISHING AND MATCHING SYSTEM

Publication number: 20140006448

Abstract: A method of reciprocally publishing and matching data files by facilitating the creation of Category A and Category B data files having respective sets of characteristics defined utilizing a common language architecture, associating enhancement objects with each data file, identifying characteristics in each data file which must be met by other data files, weighting characteristics in each data file differently from other characteristics in the same data file, enabling the selection of participation levels for each data file in marketspaces, and matching Category A data files participating in particular marketspaces to Category B data files participating in the same marketspaces using the common language architecture, the enhancement objects, the weights and the required characteristics.

Type: Application

Filed: September 9, 2013

Publication date: January 2, 2014

Inventor: Danny A. McCALL
Retrieving configuration records from a configuration management database

Patent number: 8620908

Abstract: A method is provided of retrieving a configuration record from a configuration management database pertaining to an IT-infrastructure including host devices, a configuration record having attributes and assigned attribute values, the IT-infrastructure being managed by an operation management system providing messages indicating events in the IT-infrastructure. The method includes providing a message including a set of unidentified attribute values and an identified host value, retrieving from the configuration management database one or more configuration records containing the identified host value, and selecting from the one or more retrieved configuration records a configuration record depending on the number of matches between the unidentified attribute values and the attribute values of the retrieved configuration records.

Type: Grant

Filed: October 15, 2008

Date of Patent: December 31, 2013

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Jörn Schimmelpfeng
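
The selection step described in the abstract for patent 8620908 can be illustrated with a minimal sketch. It assumes records are flat dictionaries with a `host` attribute and that the record sharing the most values with the message wins; the field names, the sample data, and the `select_record` helper are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Optional

ConfigRecord = Dict[str, str]


def select_record(
    records: List[ConfigRecord],
    host: str,
    unidentified_values: List[str],
) -> Optional[ConfigRecord]:
    """Pick the record for `host` whose attribute values overlap most with the message."""
    # Step 1: retrieve only the records that contain the identified host value.
    candidates = [r for r in records if r.get("host") == host]
    if not candidates:
        return None
    wanted = set(unidentified_values)

    # Step 2: count how many unidentified values appear among a record's values.
    def match_count(record: ConfigRecord) -> int:
        return len(wanted & set(record.values()))

    return max(candidates, key=match_count)


# Hypothetical CMDB contents and event message.
records = [
    {"host": "db01", "service": "oracle", "port": "1521", "owner": "dba-team"},
    {"host": "db01", "service": "backup", "port": "10000", "owner": "ops-team"},
    {"host": "web01", "service": "nginx", "port": "443", "owner": "web-team"},
]
best = select_record(records, "db01", ["oracle", "1521", "timeout"])
print(best)
```

Here the first `db01` record matches two of the three unidentified values and the second matches none, so the first is selected. Ties are resolved by list order in this sketch; the abstract does not say how they should be handled.
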
Ontology for use with a system, method, and computer readable medium for retrieving information and response to a query

Patent number: 8612208

Abstract: A system, method, and computer readable medium for applying one or more information retrieval technologies is disclosed for resolving a query. In one embodiment, an exemplary system generates a response using a language analysis module configured to parse a query into elements. This system can also include a rules engine configured to compare a condition of a rule against the elements, where the rule is configured to perform an action to retrieve information. Further, a response generator coupled to said rules engine receives the rule and retrieves the information for presentation to a querier in a portion of a display that adjusts proportionately to the degree of importance of said information.

Type: Grant

Filed: April 7, 2004

Date of Patent: December 17, 2013

Assignee: Oracle OTC Subsidiary LLC

Inventors: Edwin Riley Cooper, Gann Bierner, Laurel Kathleen Graham, Deniz Yuret, James Charles Williams, Filippo Beghelli
Calendar-based suggestion of a travel option

Patent number: 8606801

Abstract: Calendar-based suggestion of travel options involves suggesting a travel option based on events stored in a calendar of a user. For example, a user in San Francisco may be scheduled for a business trip to New York, and the user's calendar may show a dinner reservation in San Francisco on Saturday, September 3 and a business meeting in New York on Monday, September 5. A suggestion machine accesses calendar data of the user and travel data for several available travel options (e.g., flights from San Francisco to New York). Based on the accessed calendar data and travel data, the suggestion machine determines that one or more travel options (e.g., a flight on Sunday, September 4) are compatible with the dinner reservation and the business meeting on the user's calendar. The suggestion machine presents these compatible travel options to the user.

Type: Grant

Filed: August 16, 2011

Date of Patent: December 10, 2013

Assignee: Hipmunk, Inc.

Inventors: Adam Julian Goldstein, Steven Ladd Huffman
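
The compatibility check in the abstract for patent 8606801 might look roughly like the sketch below. It treats a flight as compatible when it leaves after the earlier event ends and lands before the later event starts. The `Event` and `Flight` types, the flight codes, the year 2011, and the clock times are all assumptions made for illustration, and time zones are ignored for brevity.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class Event:
    name: str
    city: str
    start: datetime
    end: datetime


@dataclass
class Flight:
    code: str
    origin: str
    destination: str
    departs: datetime
    arrives: datetime


def compatible_flights(flights: List[Flight], before: Event, after: Event) -> List[Flight]:
    """Keep flights that fit between the end of `before` and the start of `after`."""
    return [
        f
        for f in flights
        if f.origin == before.city
        and f.destination == after.city
        and f.departs >= before.end
        and f.arrives <= after.start
    ]


# Hypothetical calendar entries and travel options (naive datetimes, no time zones).
dinner = Event("Dinner", "SFO", datetime(2011, 9, 3, 19, 0), datetime(2011, 9, 3, 21, 0))
meeting = Event("Meeting", "NYC", datetime(2011, 9, 5, 10, 0), datetime(2011, 9, 5, 11, 0))
flights = [
    Flight("HYP1", "SFO", "NYC", datetime(2011, 9, 3, 18, 0), datetime(2011, 9, 4, 2, 30)),
    Flight("HYP2", "SFO", "NYC", datetime(2011, 9, 4, 9, 0), datetime(2011, 9, 4, 17, 30)),
    Flight("HYP3", "SFO", "NYC", datetime(2011, 9, 5, 8, 0), datetime(2011, 9, 5, 16, 30)),
]
result = compatible_flights(flights, dinner, meeting)
print([f.code for f in result])
```

Only the Sunday flight survives the filter: the Saturday flight departs before the dinner ends, and the Monday flight lands after the meeting has started. A fuller version would also allow for travel time to and from the airport.
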
COMPUTER PRODUCT, RETRIEVING APPARATUS, AND RETRIEVAL METHOD

Publication number: 20130318124

Abstract: A retrieving apparatus includes a processor that specifies in each tier of synonym dictionary data, classification codes of a search word in a search character string and those of a comparison word in character strings for comparison; extracts from among the specified classification codes, classification codes in a specific tier; judges for each character string for comparison, whether the extracted classification code of the search word and that of the comparison word match; counts for the specific tier, matching classification codes; determines based on the count, whether a character string is to be excluded whose classification code of the comparison word for the specific tier does not match that of the search word; calculates based on the specified classification code of the search word and that of the comparison word in the character string not to be excluded, similarity between the two character strings; and outputs a calculation result.

Type: Application

Filed: August 7, 2013

Publication date: November 28, 2013

Applicant: FUJITSU LIMITED

Inventors: Takafumi Ohta, Masahiro Kataoka
Methods for identifying video segments and displaying contextual targeted content on a connected television

Patent number: 8595781

Abstract: Systems and methods for identifying which video segment is being displayed on a screen of a television system. The video segment is identified by deriving data from the television signals, the derived data being indicative of the video segment being displayed on the screen. This feature can be used to extract a viewer's reaction (such as changing the channel) to a specific video segment (such as an advertisement) and reporting the extracted information as metrics. The systems and methods may further provide contextually targeted content to the television system. The contextual targeting is based on not only identification of the video segment being displayed, but also a determination concerning the playing time or offset time of the particular portion of the video segment being currently displayed.

Type: Grant

Filed: May 27, 2010

Date of Patent: November 26, 2013

Assignee: Cognitive Media Networks, Inc.

Inventors: Zeev Neumeier, Edo Liberty
Techniques for performing regular expression-based pattern matching in data streams

Patent number: 8589436

Abstract: Techniques for detecting patterns in one or more data streams. A pattern to be detected may be specified using a regular expression. Events received in a data stream are processed during runtime to detect occurrences of the specified pattern in the data stream.

Type: Grant

Filed: August 26, 2009

Date of Patent: November 19, 2013

Assignee: Oracle International Corporation

Inventors: Anand Srinivasan, Rakesh Komuravelli, Namit Jain, Shailendra Mishra
Token stitcher for a content search system having pipelined engines

Patent number: 8589405

Abstract: A content search system includes multiple pipelined search engines that implement different portions of a regular expression search operation. For some embodiments, the search pipeline includes a DFA engine, an NFA engine, and a token stitcher that combines partial match results generated by the DFA and NFA engines. The token stitcher can be configured to implement unbounded sub-expressions without utilizing resources of the DFA or NFA engines. A token stitcher may comprise an input line for receiving tokens that indicate a partial match between an input string and a regular expression, a flag bank that stores flags which, when activated, identify one or more of the sub-expressions that match the input string, a program memory that stores programs that each comprises instructions for processing tokens, and an engine configured to identify programs that are associated with a newly received token.

Type: Grant

Filed: September 17, 2010

Date of Patent: November 19, 2013

Assignee: NetLogic Microsystems, Inc.

Inventor: Cristian Estan
Route searching apparatus and route searching method

Patent number: 8583661

Abstract: A route searching apparatus includes a travel route accumulation unit, an algorithm selection unit to select route searching algorithms to meet the preference of a user based on the accumulated travel routes, and a route searching unit to perform a route search up to a destination by using the selected algorithms. Preferably, the algorithm selection unit searches for each of the travel routes accumulated in the accumulation unit by means of several algorithms, calculates, for each algorithm, a match rate between the searched route and the actual travel route, and selects an algorithm of the highest match rate. Also, it is preferable that a condition common to roads avoided by the user is extracted from the accumulated travel routes, and a route search is performed by adjusting search costs so as to make such roads less prone to be selected.

Type: Grant

Filed: June 24, 2009

Date of Patent: November 12, 2013

Assignee: Toyota Jidosha Kabushiki Kaisha

Inventors: Daigo Fujii, Toshiyuki Namba, Hiroaki Sekiyama, Keisuke Okamoto
Large graph measurement

Patent number: 8583667

Abstract: As provided herein, a pairwise distance between nodes in a large graph can be determined efficiently. URL-sketches are generated for respective nodes in an index by extracting labels from respective nodes, which provide a reference to a link between the nodes, aggregating the labels into sets for respective nodes, and storing the sets of labels as URL-sketches. Neighborhood-sketches are generated for the respective nodes in the index using the URL-sketches, by determining a neighborhood for a node and generating a sketch using labels that are associated with the respective neighboring nodes. A distance between two nodes is determined by computing an approximate number of paths and an approximate path length between the two nodes, using the neighborhood sketches for the two nodes.

Type: Grant

Filed: December 8, 2011

Date of Patent: November 12, 2013

Assignee: Microsoft Corporation

Inventor: Sreenivas Gollapudi
Method and recording medium

Patent number: 8577920

Abstract: A method executed by a computer includes searching a design subject having similar attribute information from attribute information on a design subject designed in the past and histories of descriptions of improvements based on input information on a design subject; obtaining a description of an improvement on the searched design subject having similar attribute information from the histories, obtaining a design subject the description of an improvement of which has an effect of the improvement, and outputting improved input information, wherein the attribute information contributing to the improvement of the design subject is obtained and reflected in the input information, and the history of the improved input information is stored.

Type: Grant

Filed: February 8, 2008

Date of Patent: November 5, 2013

Assignee: Fujitsu Limited

Inventor: Hirooki Hayashi
Generation of annotation tags based on multimodal metadata and structured semantic descriptors

Patent number: 8572086

Abstract: In one embodiment, a method of generating annotation tags (28) for a digital image (22) includes maintaining a library (16) of human-meaningful words or phrases organized as category entries (72) according to a number of defined image description categories (70), and receiving context metadata (20) associated with the capture of a given digital image (22). The method further includes selecting particular category entries (72-1, 72-2) as vocabulary metadata (24) for the digital image (22) by mapping the context metadata (20) into the library (16), and generating annotation tags (28) for the digital image (22) by logically combining the vocabulary metadata (24) according to a defined set of deductive logic rules (30) that are predicated on the defined image description categories (70). In another embodiment, a processing apparatus (12), such as a digital processor (18, 26) and supporting memory (14), etc., is configured to carry out the above method, or to carry out variations of the above method.

Type: Grant

Filed: January 21, 2009

Date of Patent: October 29, 2013

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Joakim Soderberg, Jonas Bjork, Andreas Fasbender
Memory management in a token stitcher for a content search system having pipelined engines

Patent number: 8572106

Abstract: A content search system includes multiple pipelined search engines that implement different portions of a regular expression search operation. For some embodiments, the search pipeline includes a DFA engine, an NFA engine, and a token stitcher that combines partial match results generated by the DFA and NFA engines. The token stitcher can be configured to implement unbounded sub-expressions without utilizing resources of the DFA or NFA engines. The token stitcher may comprise a flag bank for storing a number of flags. Each flag may identify a sub-expression that matches the input string. The flag bank may be configured to discard one or more flags upon satisfaction of a predetermined condition for purposes of recapturing hardware resources to provide a certain level of performance.

Type: Grant

Filed: November 15, 2010

Date of Patent: October 29, 2013

Assignee: NetLogic Microsystems, Inc.

Inventor: Cristian Estan
DISCRIMINATIVE CLASSIFICATION USING INDEX-BASED RANKING OF LARGE MULTIMEDIA ARCHIVES

Publication number: 20130282721

Abstract: Devices, systems, and methods of performing feature detection on a set of multimedia files are disclosed. One method of organization includes identifying a feature from each multimedia file within the set of multimedia files wherein each file has one feature, organizing the features based on their similarities wherein similar features are grouped based upon a proximity in a feature space and a representative feature is identified for each group, receiving a detection model having one or more detection criteria the detection model having previously been trained for detection using the organized features, and using the representative features to apply the detection model in a decreasing order of detection probability in order to detect the files satisfying the detection criteria within the set of multimedia files.

Type: Application

Filed: February 25, 2013

Publication date: October 24, 2013

Applicant: HONEYWELL INTERNATIONAL INC.

Inventors: Scott McCloskey, Ben A. Miller
Apparatus and Method for Random Database Sampling with Repeatable Results

Publication number: 20130282760

Abstract: A method of sampling data in a database includes designating permanent read locations in a database. The database is populated with randomly loaded data. The permanent read locations in the database are sampled to form sampled repeatable results attributable to the permanent read locations and the randomly loaded data.

Type: Application

Filed: April 19, 2012

Publication date: October 24, 2013

Applicant: SOLERA NETWORKS INC.

Inventors: Bhargav Madhav Pendse, Matthew Scott Wood, Neil Richard Terry
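
One way to picture the idea in the abstract for publication 20130282760 is the sketch below: the randomness comes from the order in which data is loaded, while the read locations stay fixed, so repeated reads return the same sample. The in-memory list standing in for a database, the `step` spacing of the read locations, and all function names are assumptions for illustration only.

```python
import random
from typing import List, Sequence


def load_randomly(rows: Sequence[str], seed: int) -> List[str]:
    """Populate the store in random order (stands in for randomly loaded data)."""
    store = list(rows)
    random.Random(seed).shuffle(store)
    return store


def permanent_read_locations(size: int, step: int) -> List[int]:
    """Designate fixed slots that every sampling pass will read."""
    return list(range(0, size, step))


def sample(store: Sequence[str], locations: Sequence[int]) -> List[str]:
    """Read only the designated slots; no randomness is used at read time."""
    return [store[i] for i in locations]


rows = [f"row-{i}" for i in range(100)]
store = load_randomly(rows, seed=7)
locations = permanent_read_locations(len(store), step=10)

first = sample(store, locations)
second = sample(store, locations)
print(first == second)
```

Because the shuffle happens once at load time, the fixed slots hold a random subset of the rows, and reading them again reproduces that subset exactly.
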
System and program for generating boolean search formulas

Patent number: 8566351

Abstract: To provide a technique for accurately and efficiently generating Boolean search formulas that serve as a basis of similarity search. A Boolean search formula generation apparatus according to the present invention establishes Boolean search formulas expressed in a standard sum of products form connecting products of search terms by a sum and evaluates the Boolean search formulas based on a recall and a precision. The Boolean search formula generation apparatus repeats connecting the products of the search terms with maximum evaluation values by a sum to establish a Boolean search formula.

Type: Grant

Filed: September 7, 2011

Date of Patent: October 22, 2013

Assignee: Hitachi, Ltd.

Inventor: Makoto Iwayama
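
The greedy construction described in the abstract for patent 8566351 can be sketched as follows. A formula is a list of products (AND-ed term sets) joined by OR, and in each round the product that most improves the evaluation value is appended. The abstract says only that evaluation is based on recall and precision; using the F1 score, limiting products to two terms, and the toy document set are assumptions of this sketch.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Set

Product = FrozenSet[str]  # terms joined by AND


def hits(formula: List[Product], docs: Dict[str, Set[str]]) -> Set[str]:
    """Documents matched by the OR of all products in the formula."""
    return {d for d, terms in docs.items() if any(p <= terms for p in formula)}


def f1(formula: List[Product], docs: Dict[str, Set[str]], relevant: Set[str]) -> float:
    """Combine precision and recall into one evaluation value (assumed metric)."""
    found = hits(formula, docs)
    true_pos = len(found & relevant)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(found)
    recall = true_pos / len(relevant)
    return 2 * precision * recall / (precision + recall)


def build_formula(docs: Dict[str, Set[str]], relevant: Set[str], max_terms: int = 2) -> List[Product]:
    """Greedily OR together the products that raise the evaluation value most."""
    vocabulary = sorted(set().union(*(docs[d] for d in relevant)))
    candidates = [
        frozenset(c) for k in range(1, max_terms + 1) for c in combinations(vocabulary, k)
    ]
    formula: List[Product] = []
    current = 0.0
    while True:
        best_score, best_product = current, None
        for product in candidates:
            score = f1(formula + [product], docs, relevant)
            if score > best_score:  # strict improvement only
                best_score, best_product = score, product
        if best_product is None:
            return formula
        formula.append(best_product)
        current = best_score


# Hypothetical labeled collection: document id -> set of terms.
docs = {
    "d1": {"fuzzy", "search", "index"},
    "d2": {"fuzzy", "match", "string"},
    "d3": {"search", "engine", "ranking"},
    "d4": {"string", "format"},
    "d5": {"approximate", "match", "string"},
}
relevant = {"d1", "d2", "d5"}
formula = build_formula(docs, relevant)
print(" OR ".join("(" + " AND ".join(sorted(p)) + ")" for p in formula))
```

The loop stops as soon as no candidate product improves the score, which keeps the formula short. A production system would need a smarter candidate generator, since enumerating all term combinations grows quickly with vocabulary size.
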
Handwritten document categorizer and method of training

Patent number: 8566349

Abstract: A method and an apparatus for training a handwritten document categorizer are disclosed. For each category in a set into which handwritten documents are to be categorized, discriminative words are identified from the OCR output of a training set of typed documents labeled by category. A group of keywords is established including some of the discriminative words identified for each category. Samples of each of the keywords in the group are synthesized using a plurality of different type fonts. A keyword model is then generated for each keyword, parameters of the model being estimated, at least initially, based on features extracted from the synthesized samples. Keyword statistics for each of a set of scanned handwritten documents labeled by category are generated by applying the generated keyword models to word images extracted from the scanned handwritten documents. The categorizer is trained with the keyword statistics and respective handwritten document labels.

Type: Grant

Filed: September 28, 2009

Date of Patent: October 22, 2013

Assignee: Xerox Corporation

Inventors: Francois Ragnet, Florent C. Perronnin, Thierry Lehoux
Method and apparatus for facilitating document sanitization

Patent number: 8566350

Abstract: One embodiment provides a system for facilitating sanitizing a modified version of a document relative to one or more sensitive topics. During operation, the system determines a privacy risk for a term in the modified version relative to the sensitive topics, wherein the privacy risk measures the extent to which the sensitive topic(s) can be inferred based on the term. Next, the system determines an information utility and privacy loss or gain for the modified version, where the information utility reflects the extent to which the modified version has changed and the privacy loss or gain reflects the extent to which the modified version is reduced in sensitivity.

Type: Grant

Filed: November 2, 2009

Date of Patent: October 22, 2013

Assignee: Palo Alto Research Center Incorporated

Inventors: Richard Chow, Jessica N. Staddon, Ian S. Oberst
Automated propagation of non-conflicting queries in distributed databases

Patent number: 8560570

Abstract: A logical table definition is received as part of a database query within a distributed database system. A determination is made as to whether the received logical table definition matches a local logical table definition. The query is processed in response to determining that the received logical table definition matches the local logical table definition. The query is forwarded to at least one other distributed database node.

Type: Grant

Filed: February 2, 2012

Date of Patent: October 15, 2013

Assignee: International Business Machines Corporation

Inventors: Graham A. Bent, Patrick Dantressangle, David R. Vyvyan
Content based load balancer

Patent number: 8554790

Abstract: A content based load balancing system receives a request for data provided by a resource. The content based load balancing system searches a content history cache for a content history cache entry corresponding to the requested data. The content based load balancing system then selects a resource node to service the request based on the content history cache entry corresponding to the data.

Type: Grant

Filed: December 18, 2007

Date of Patent: October 8, 2013

Assignee: Red Hat, Inc.

Inventor: Mladen Turk
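
A minimal sketch of the routing idea in the abstract for patent 8554790 follows. It assumes the content history cache is a dictionary from a content key to the node that last served it, with round-robin as the fallback when no entry exists. The `ContentBasedBalancer` class, the node names, and the fallback policy are hypothetical; the abstract does not specify them.

```python
from itertools import cycle
from typing import Dict, Iterable


class ContentBasedBalancer:
    """Route repeat requests for the same content to the node that served it before."""

    def __init__(self, nodes: Iterable[str]) -> None:
        self._fallback = cycle(list(nodes))   # round-robin for unseen content
        self._history: Dict[str, str] = {}    # content key -> node

    def select_node(self, content_key: str) -> str:
        node = self._history.get(content_key)
        if node is None:
            # No history entry yet: pick the next node and remember the choice.
            node = next(self._fallback)
            self._history[content_key] = node
        return node


balancer = ContentBasedBalancer(["node-a", "node-b"])
first = balancer.select_node("/reports/q3.pdf")
other = balancer.select_node("/images/logo.png")
again = balancer.select_node("/reports/q3.pdf")
print(first, other, again)
```

Sending repeat requests to the same node lets that node reuse whatever it already holds for the content. A real balancer would also need to expire entries and handle node failures.
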
Protecting privacy in audience data

Patent number: 8554779

Abstract: Methods, systems and apparatus, including computer program products, for generating respective random weights for identified audience members matching an audience specification in order to mask the behavior of individuals. The mean value of the random weights for large audience results approaches one so that a sum of the weights approximates a count of the matching records.

Type: Grant

Filed: April 24, 2009

Date of Patent: October 8, 2013

Assignee: Google Inc.

Inventor: John D. Batali
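
The masking idea in the abstract for patent 8554779 can be illustrated with a short sketch: each matching audience member gets a random weight drawn from a distribution with mean one, so an individual's contribution is obscured while the sum still approximates the count for large audiences. Drawing from a uniform distribution on [0, 2] is an assumption of this sketch; the abstract does not name a distribution.

```python
import random
from typing import List


def masked_count(matching_ids: List[int], rng: random.Random) -> float:
    """Sum one random weight per matching member; each weight has expected value 1."""
    weights = [rng.uniform(0.0, 2.0) for _ in matching_ids]
    return sum(weights)


rng = random.Random(42)
audience = list(range(10_000))  # hypothetical ids of matching audience members
estimate = masked_count(audience, rng)
relative_error = abs(estimate - len(audience)) / len(audience)
print(f"estimate={estimate:.1f}, relative error={relative_error:.4f}")
```

The relative error shrinks as the audience grows, roughly in proportion to one over the square root of the audience size. For very small audiences the estimate is noisy, which is what masks individual behavior.
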
Method and system for unified information representation and applications thereof

Patent number: 8548951

Abstract: Method, system, and programs for information search and retrieval. A query is received and is processed to generate a feature-based vector that characterizes the query. A unified representation is then created based on the feature-based vector, that integrates semantic and feature based characterizations of the query. Information relevant to the query is then retrieved from an information archive based on the unified representation of the query. A query response is generated based on the retrieved information relevant to the query and is then transmitted to respond to the query.

Type: Grant

Filed: March 10, 2011

Date of Patent: October 1, 2013

Assignee: Textwise LLC

Inventors: Robert Solmer, Wen Ruan
Forensic system, forensic method, and forensic program

Patent number: 8549037

Abstract: Embodiments of the inventive concept reduce the burden of creating litigant sources of evidence or other evidentiary materials in connection with litigation in a court of law. Designation of at least one document file included in digital document information is accepted and designation of a language into which the designated document file is translated is accepted. The document file, the designation of which is accepted, is translated into the language the designation of which is accepted. A common document file representing the same content as that of the designated document file is extracted from digital document information recorded in a recording unit. Translation-related information representing that the extracted common document file is translated by invoking a translated content of the translated document file is generated, and, based on the translation-related information, a litigant-related document file is output.

Type: Grant

Filed: July 27, 2011

Date of Patent: October 1, 2013

Assignee: UBIC, Inc.

Inventors: Masahiro Morimoto, Yoshikatsu Shirai, Hideki Takeda, Masatoshi Ohno
CONTENT COLLECTION SEARCH WITH ROBUST CONTENT MATCHING

Publication number: 20130254235

Abstract: Systems and approaches for searching a content collection corresponding to query content are provided. In particular, false positive match rates between the query content and the content collection may be reduced with a minimum content region test and/or a minimum features per scale test. For example, by correlating content descriptors of a content piece in the content collection with query descriptors of the query content, the content piece can be determined to match the query content when a particular region of the content piece and/or a particular region of a query descriptor have a proportionate size meeting or exceeding a specified minimum. Alternatively, or in addition, the false positive match rate between query content and a content piece can be reduced by comparing content descriptors and query descriptors of features at a plurality of scales. A content piece can be determined to match the query content according to descriptor proportion quotas for the plurality of scales.

Type: Application

Filed: September 15, 2012

Publication date: September 26, 2013

Applicant: A9.com, Inc.

Inventors: Arnab S. Dhua, Sunil Ramesh, Max Delgadillo, Raghavan Manmatha
Semantic object characterization and search

Patent number: 8543598

Abstract: Semantic object characterization and its use in indexing and searching a database directory is presented. In general, a first binary hash code is generated to represent a first representation or view of a semantic object which when compared to a characterized version of a second representation or view of the same semantic object in the form of a second binary hash code, the first and second binary hash codes exhibit a degree of similarity indicative of the objects being the same object. In one implementation the semantic objects correspond to peoples' names and the first and second representations or views correspond to two different languages. Thus, a user can search a database of information in one language with a search query in another language.

Type: Grant

Filed: March 1, 2010

Date of Patent: September 24, 2013

Assignee: Microsoft Corporation

Inventors: Uppinakuduru Raghavendra Udupa, Shaishav Kumar
Assigning weights to parts of a document

Patent number: 8538989

Abstract: Methods, systems, and apparatus, including computer program products, for assigning weights to parts of a document. In one aspect, a document object model is generated for a collection of documents. Weights for the respective portions of the document object model are generated, and a document is ranked in a search engine index, the ranking being based at least in part on the generated weights.

Type: Grant

Filed: February 8, 2008

Date of Patent: September 17, 2013

Assignee: Google Inc.

Inventors: Mayur Datar, Kedar Dhamdhere, Ashutosh Garg
METHODS AND SYSTEMS FOR MATCHING EXPRESSIONS

Publication number: 20130232172

Abstract: Methods and systems are provided for matching expressions to data items. One exemplary method involves identifying a subset of expressions that match data items collectively and then identifying individual data items that match expressions of the subset. In one embodiment, the data items are partitioned into data item subsets, and further subsets of expressions collectively matching the data item subsets are identified. Data items of a respective data item subset are then individually matched to expressions of the respective expression subset that collectively matched that respective data item subset.

Type: Application

Filed: August 23, 2012

Publication date: September 5, 2013

Applicant: salesforce.com, inc.

Inventor: Brendan Wood
Text-based fuzzy search

Patent number: 8521759

Abstract: An input feature vector is computed from an input text record, the input feature vector comprising one or more features, each feature including a subsequence of characters and a frequency of occurrence of the associated subsequence. An input fingerprint is generated out of the input feature vector by choosing one or more features with non-zero frequencies and alphabetizing the features chosen. One or more input indices are generated by alphabetizing features in the input fingerprint and concatenating features occurring in subsequent locations of the input fingerprint. The input text record is matched against a target text record if (1) one or more of the input indices match a target index corresponding to the target text record and (2) the corresponding input fingerprint matches a target fingerprint corresponding to the target text record. The target text record is outputted as a search result if it matches the input text record.

Type: Grant

Filed: May 23, 2011

Date of Patent: August 27, 2013

Assignee: Rovi Technologies Corporation

Inventors: Joonas Asikainen, Brian Kenneth Vogel
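
The pipeline in the abstract for patent 8521759 (feature vector, fingerprint, indices, two-part match) can be sketched roughly as below. Several details are assumptions rather than the patented method: features are character bigrams of the lowercased alphanumeric text, indices join two adjacent fingerprint features, and fingerprints "match" when their Jaccard similarity reaches a threshold of 0.6.

```python
from collections import Counter
from typing import List, Set


def feature_vector(text: str, n: int = 2) -> Counter:
    """Count character n-grams of the normalized text."""
    normalized = "".join(ch for ch in text.lower() if ch.isalnum())
    return Counter(normalized[i:i + n] for i in range(len(normalized) - n + 1))


def fingerprint(features: Counter) -> List[str]:
    """Keep features with non-zero frequency, in alphabetical order."""
    return sorted(f for f, count in features.items() if count > 0)


def indices(fp: List[str], width: int = 2) -> Set[str]:
    """Concatenate features at adjacent positions of the fingerprint."""
    return {"".join(fp[i:i + width]) for i in range(len(fp) - width + 1)}


def fingerprints_match(a: List[str], b: List[str], threshold: float = 0.6) -> bool:
    """Assumed similarity test: Jaccard overlap of the two feature sets."""
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return False
    return len(sa & sb) / len(sa | sb) >= threshold


def matches(query: str, target: str) -> bool:
    fq = fingerprint(feature_vector(query))
    ft = fingerprint(feature_vector(target))
    # (1) at least one shared index, then (2) the fingerprints themselves agree.
    return bool(indices(fq) & indices(ft)) and fingerprints_match(fq, ft)


print(matches("The Beatles", "Beatles, The"))
print(matches("The Beatles", "Rolling Stones"))
```

The shared-index test is cheap and works as a lookup key, so in a real system it would narrow the candidates before the more expensive fingerprint comparison runs. Alphabetizing the features is what makes reordered text such as "Beatles, The" land near "The Beatles".
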
Annotations for three-dimensional (3D) object data models

Patent number: 8515982

Abstract: Methods and systems for interacting with multiple three-dimensional (3D) object data models are provided. An example method may involve receiving an annotated template of a first three-dimensional (3D) object data model. The annotated template may be associated with a given category of objects and may include one or more annotations to one or more aspects of an object described by the first 3D object data model. The method may also include determining matching correspondences between the first 3D object data model and a second 3D object data model that is a given object in the given category. For the one or more matching correspondences, an annotation that is associated with a correspondence point of the first 3D object data model may be applied as a call-out to a matching correspondence point of the second 3D object data model.

Type: Grant

Filed: October 4, 2012

Date of Patent: August 20, 2013

Assignee: Google Inc.

Inventors: Ryan Hickman, James J. Kuffner, Jr., Anthony Gerald Francis, Jr., Arshan Poursohi, James R. Bruce, Thor Lewis, Chaitanya Gharpure
RECORD LINKAGE BASED ON A TRAINED BLOCKING SCHEME

Publication number: 20130212103

Abstract: Some implementations disclosed herein provide techniques and arrangements to train a blocking scheme using both labeled data and unlabeled data. For example, training the blocking scheme may include iteratively: learning a conjunction, identifying first matches in the labeled data and the unlabeled data that are uncovered by the conjunction, and identifying second matches in the labeled data and the unlabeled data that are covered by the conjunction. The conjunctions learned in each iteration may be combined using a disjunction. A search engine may use the blocking scheme when searching for records that match an entity.

Type: Application

Filed: February 13, 2012

Publication date: August 15, 2013

Applicant: Microsoft Corporation

Inventors: Yunbo Cao, Chin-Yew Lin, Pei Yue, Zhiyuan Chen
COMBINED WORD TREE TEXT VISUALIZATION SYSTEM

Publication number: 20130204897

Abstract: An example method includes identifying instances of a search term in a source text, and identifying prefix strings and suffix strings comprising tokens preceding and subsequent to the search term in the source text. The method further includes displaying a visualization interface that includes the search term within a combined tree diagram that includes a prefix tree and a suffix tree extending to either side of the search term. The prefix tree displays the prefix strings and the suffix tree displays the suffix strings, and both have user-selectable branches. The method further includes providing, in response to a user input selecting a branch in the prefix tree or in the suffix tree, a visual indication of a prefix string and a suffix string in the visualization interface that form a matching sequence that originate together in the source text, for matching sequences that are connected through the selected branch.

Type: Application

Filed: February 3, 2012

Publication date: August 8, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Steven R. McDougall
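The word tree abstract above rests on two data steps: collecting the tokens before and after each occurrence of the search term, and folding them into a prefix tree and a suffix tree. A rough sketch of those two steps follows. The display and branch-selection parts are omitted, and the function names, whitespace tokenization, and `width` parameter are assumptions made for illustration.

```python
def prefix_suffix_strings(text, term, width=3):
    """Collect up to `width` tokens before and after each occurrence of term."""
    tokens = text.split()
    contexts = []
    for i, tok in enumerate(tokens):
        if tok.lower() == term.lower():
            prefix = tokens[max(0, i - width):i]
            suffix = tokens[i + 1:i + 1 + width]
            contexts.append((prefix, suffix))
    return contexts

def build_tree(sequences):
    """Nested-dict tree; each path from the root spells one token sequence."""
    root = {}
    for seq in sequences:
        node = root
        for tok in seq:
            node = node.setdefault(tok, {})
    return root

text = "the cat sat on the mat and the cat ran to the door"
contexts = prefix_suffix_strings(text, "cat", width=2)

# The prefix tree grows leftward from the term, so prefixes are reversed
# to put the token nearest the term at the root of the tree.
prefix_tree = build_tree([list(reversed(p)) for p, _ in contexts])
suffix_tree = build_tree([s for _, s in contexts])
print(prefix_tree)
print(suffix_tree)
```

Keeping the `(prefix, suffix)` pairs in `contexts` is what would let an interface highlight the suffix that originates together with a selected prefix branch.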
ASCRIBING ACTIONABLE ATTRIBUTES TO DATA THAT DESCRIBES A PERSONAL IDENTITY

Publication number: 20130204900

Abstract: There is provided a method that includes (a) receiving an inquiry to initiate a search for data for a specific individual, (b) determining, based on the inquiry, a strategy and flexible predictiveness equations to search a reference database, (c) searching the reference database, in accordance with the strategy, for a match to the inquiry; and (d) outputting the match. The method may also output flexible feedback related to the match that reflects inferred quality of the match experience which can be used by an end-user to determine the degree to which the matched entity meets that end-user's quality-based criteria. There is also provided a system that performs the method, and a storage medium that contains instructions that control a processor to perform the method.

Type: Application

Filed: March 15, 2013

Publication date: August 8, 2013

Applicant: THE DUN & BRADSTREET CORPORATION

Inventor: The Dun & Bradstreet Corporation
TEXT SEARCH APPARATUS AND TEXT SEARCH METHOD

Publication number: 20130204898

Abstract: The text search apparatus has an information storage that stores plural transposed indexes associating characters or character strings appearing in a document to be searched with the appearance positions of the characters or character strings. The transposed indexes were generated for a document in which beginning marks are added in front of texts to be subject to forward matching search. The incremental searcher of the text search apparatus adds a beginning mark in front of a search keyword and executes a forward matching search using a set of transposed indexes. The main searcher executes a partial match search using the same set of transposed indexes.

Type: Application

Filed: January 4, 2013

Publication date: August 8, 2013

Applicant: CASIO COMPUTER CO., LTD.

Inventor: CASIO COMPUTER CO., LTD.
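The text search abstract above uses one set of positional indexes for both forward matching (prefix) search and partial match search by adding a beginning mark in front of each text. A small sketch of that idea with a character bigram index follows. The mark character, the bigram size, and the function names are assumptions, and the keyword is assumed to be at least `n` characters long.

```python
from collections import defaultdict

BEGIN = "\x02"  # hypothetical beginning mark, assumed never to occur in the texts

def build_index(texts, n=2):
    """Map each n-gram to (text_id, position) pairs; every text is prefixed with BEGIN."""
    index = defaultdict(set)
    for tid, text in enumerate(texts):
        marked = BEGIN + text
        for pos in range(len(marked) - n + 1):
            index[marked[pos:pos + n]].add((tid, pos))
    return index

def search(index, keyword, n=2):
    """Return ids of texts whose marked form contains keyword (len(keyword) >= n)."""
    grams = [keyword[i:i + n] for i in range(len(keyword) - n + 1)]
    starts = index.get(grams[0], set())
    for offset, gram in enumerate(grams[1:], start=1):
        postings = index.get(gram, set())
        # Keep a start position only if the next gram appears at the right offset.
        starts = {(tid, pos) for tid, pos in starts if (tid, pos + offset) in postings}
    return {tid for tid, _ in starts}

def partial_match(index, keyword):
    return search(index, keyword)

def forward_match(index, keyword):
    # Prepending the mark forces the match to start at the beginning of a text.
    return search(index, BEGIN + keyword)

texts = ["tokyo tower", "kyoto", "north tokyo"]
index = build_index(texts)
print(partial_match(index, "kyo"), forward_match(index, "kyo"))
```

Because the mark is indexed like any other character, the forward matching search needs no separate index; it is just a partial match on a longer keyword.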
Method and device for monitoring and analyzing signals

Publication number: 20130204899

Abstract: A method and system for monitoring and analyzing at least one signal are disclosed. An abstract of at least one reference signal is generated and stored in a reference database. An abstract of a query signal to be analyzed is then generated so that the abstract of the query signal can be compared to the abstracts stored in the reference database for a match. The method and system may optionally be used to record information about the query signals, the number of matches recorded, and other useful information about the query signals. Moreover, the method by which abstracts are generated can be programmable based upon selectable criteria. The system can also be programmed with error control software so as to avoid the re-occurrence of a query signal that matches more than one signal stored in the reference database.

Type: Application

Filed: March 13, 2013

Publication date: August 8, 2013

Applicant: BLUE SPIKE LLC

Inventor: Blue Spike LLC
Breast Cancer Biomarker Signatures for Invasiveness and Prognosis

Publication number: 20130190386

Abstract: MicroRNA profiles of the transition from normal breast to ductal carcinoma in situ and of the transition to invasive ductal carcinoma (IDC), and methods of use thereof, are described. Methods of diagnosis and prognosis using microRNA signatures to differentiate invasive from in situ carcinoma are described. Also described is the use of microRNA expression for predicting overall survival and time to metastasis.

Type: Application

Filed: January 22, 2013

Publication date: July 25, 2013

Applicant: The Ohio State University

Inventor: The Ohio State University
Defining a data structure for pattern matching

Patent number: 8495101

Abstract: An information processing method for defining a data structure for pattern matching, the method executed by an information processing apparatus, comprises generating, by the apparatus, an ordered tree structure by defining transition edges between nodes using, as transition conditions, respective constraints from one or more constraint patterns each including plural constraints; searching, by the apparatus, for a second substructure similar to a first substructure from a root node by determining a set relation between transition conditions of respective transition edges; and defining, by the apparatus, an additional transition link from a tail node of the second substructure to a child node at a tail end of the first substructure, the additional transition link adding a constraint to be met by an indeterminant identified from the set relation.

Type: Grant

Filed: February 29, 2012

Date of Patent: July 23, 2013

Assignee: International Business Machines Corporation

Inventors: Teruo Koyanagi, Yuta Tsuboi
Decision tree induction that is sensitive to attribute computational complexity

Patent number: 8495096

Abstract: A decision tree for classifying computer files is constructed. Computational complexities of a set of candidate attributes are determined. A set of attribute vectors are created for a set of training files with known classification. A node is created to represent the set. A weighted impurity reduction score is calculated for each candidate attribute based on the computational complexity of the attribute. If a stopping criterion is satisfied then the node is set as a leaf node. Otherwise the node is set as a branch node and the attribute with the highest weighted impurity reduction score is selected as the splitting attribute for the branch node. The set of attribute vectors are split into subsets based on their attribute values of the splitting attribute. The above process is repeated for each subset. The tree is then pruned based on the computational complexities of the splitting attributes.

Type: Grant

Filed: April 18, 2012

Date of Patent: July 23, 2013

Assignee: Symantec Corporation

Inventors: Shane Pereira, Zulfikar Ramzan, Sourabh Satish
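The decision tree abstract above selects the splitting attribute by an impurity reduction score weighted by the attribute's computational complexity. The abstract does not give the weighting formula, so the sketch below assumes one plausible choice, Gini impurity reduction divided by a per-attribute cost. The attribute names, costs, and training rows are hypothetical.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in Counter(labels).values())

def impurity_reduction(rows, labels, attr):
    """Impurity of the parent minus the size-weighted impurity of the children."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    weighted = sum(len(g) / len(labels) * gini(g) for g in groups.values())
    return gini(labels) - weighted

def best_attribute(rows, labels, costs):
    """Pick the attribute with the highest reduction per unit cost (assumed weighting)."""
    scores = {a: impurity_reduction(rows, labels, a) / costs[a] for a in costs}
    return max(scores, key=scores.get), scores

rows = [
    {"has_packer": 1, "emulated_behavior": 1},
    {"has_packer": 1, "emulated_behavior": 1},
    {"has_packer": 1, "emulated_behavior": 0},
    {"has_packer": 0, "emulated_behavior": 0},
]
labels = ["malicious", "malicious", "benign", "benign"]
costs = {"has_packer": 1.0, "emulated_behavior": 10.0}  # hypothetical relative costs

best, scores = best_attribute(rows, labels, costs)
print(best, scores)
```

Here the expensive attribute separates the classes perfectly, yet the cheap attribute wins the split because its reduction per unit cost is higher, which is the trade-off the abstract describes.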
Entity Matching Using Machine Learning

Publication number: 20130185306

Abstract: Techniques for information retrieval include receiving a plurality of data records, each data record including data fields associated with a business enterprise, the data fields including a name of the business enterprise; updating a plurality of database records associated with the received plurality of data records stored in a database, each database record including attributes including the name of the business enterprise and an alias associated with the name of the business enterprise; receiving a query for a particular database record, the query including at least one of the name of the business enterprise or the alias associated with the name of the business enterprise; and preparing for display, in response to the query, one or more of the database records based on at least one of the name of the business enterprise or the alias associated with the name of the business enterprise.

Type: Application

Filed: January 13, 2012

Publication date: July 18, 2013

Applicant: Business Objects Software Ltd.

Inventor: Sherif Botros
Fuzzy data operations

Patent number: 8484215

Abstract: A method for clustering data elements stored in a data storage system includes reading data elements from the data storage system. Clusters of data elements are formed with each data element being a member of at least one cluster. At least one data element is associated with two or more clusters. Membership of the data element belonging to respective ones of the two or more clusters is represented by a measure of ambiguity. Information is stored in the data storage system to represent the formed clusters.

Type: Grant

Filed: October 23, 2009

Date of Patent: July 9, 2013

Assignee: Ab Initio Technology LLC

Inventor: Arlen Anderson
Model Matching for Trace Link Generation

Publication number: 20130174129

Abstract: Implementations of the present disclosure include methods, systems, and computer-readable storage mediums for identifying matching elements between a source model and a target model comprising receiving a source model and a target model, the source model and the target model each being stored in computer-readable memory; processing the source model and the target model to generate a plurality of similarity values, each similarity value being associated with an element of the source model and an element of the target model; generating a similarity value construct based on the plurality of similarity values and elements of the source model and the target model; and identifying matching elements between the source model and the target model based on the similarity value construct.

Type: Application

Filed: December 29, 2011

Publication date: July 4, 2013

Applicant: SAP AG

Inventors: Birgit Grammel, Stefan Kastenholz
Information processing apparatus, information processing method and program

Patent number: 8478781

Abstract: The present invention relates to an information processing apparatus, an information processing method and an information processing program that are capable of assigning an optimum keyword to a content. The information processing apparatus includes a content characteristic quantity extraction unit 52, a label information extraction unit 54, an associating information computation unit 56, a content characteristic quantity-latent space associating information storage unit 57 and a label information-latent space associating information storage unit 58. The content characteristic quantity extraction unit 52 extracts characteristic quantities related to a content whereas the label information extraction unit 54 extracts label information showing attributes of a content.

Type: Grant

Filed: December 4, 2006

Date of Patent: July 2, 2013

Assignee: Sony Corporation

Inventor: Kei Tateno
System And Method For Generating, Updating, And Using Meaningful Tags

Publication number: 20130159306

Abstract: A system and method for generating tag glossaries and use thereof is provided. A set of tags is accessed. Each tag is associated with a glossary that includes one or more terms and definitions for the terms. A new tag is generated and a new glossary is generated for the new tag based on the glossaries associated with the set of tags. The tag glossaries can be used to provide context for documents associated with the tags, to determine appropriate tags for untagged documents, to help in search for other documents, and to build indices for documents or collections of documents.

Type: Application

Filed: December 19, 2011

Publication date: June 20, 2013

Applicant: PALO ALTO RESEARCH CENTER INCORPORATED

Inventors: William C. Janssen, Jr., Lauri J. Karttunen
FUZZY FULL TEXT SEARCH

Publication number: 20130151561

Abstract: A method and system for fuzzy full text search is disclosed. The system includes an inverted index where tokens are organized in a Next Valid Character (NVC) tree. Each path through the tree defines a token. The document lists for the tokens are stored at leaf nodes. When performing a fuzzy full text search, the system uses an edit distance greater than zero. After receiving search tokens, the system traverses the NVC tree to generate document error lists. The system then compares the error lists to identify relevant document identifiers and provides a result set.

Type: Application

Filed: December 8, 2011

Publication date: June 13, 2013

Inventor: Martin Pfeifle
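The fuzzy full text search abstract above traverses a character tree of tokens with an edit distance greater than zero. The sketch below approximates that with a plain trie and the standard row-by-row Levenshtein computation, pruning branches that can no longer stay within the allowed distance. It is not the application's NVC tree: document sets are stored wherever a token ends rather than strictly at leaf nodes, and the class and function names are assumptions.

```python
class Node:
    def __init__(self):
        self.children = {}
        self.docs = None  # set of document ids when a token ends at this node

def insert(root, token, doc_id):
    node = root
    for ch in token:
        node = node.children.setdefault(ch, Node())
    if node.docs is None:
        node.docs = set()
    node.docs.add(doc_id)

def fuzzy_lookup(root, query, max_dist):
    """Return {token: (distance, docs)} for tokens within max_dist edits of query."""
    results = {}

    def walk(node, prefix, prev_row):
        # prev_row[-1] is the edit distance between prefix and the whole query.
        if node.docs is not None and prev_row[-1] <= max_dist:
            results[prefix] = (prev_row[-1], node.docs)
        for ch, child in node.children.items():
            row = [prev_row[0] + 1]
            for i in range(1, len(query) + 1):
                cost = 0 if query[i - 1] == ch else 1
                row.append(min(row[i - 1] + 1, prev_row[i] + 1, prev_row[i - 1] + cost))
            # If every cell exceeds max_dist, no extension of this path can match.
            if min(row) <= max_dist:
                walk(child, prefix + ch, row)

    walk(root, "", list(range(len(query) + 1)))
    return results

root = Node()
documents = {1: "berlin hotel", 2: "bern hostel", 3: "paris hotel"}
for doc_id, text in documents.items():
    for token in text.split():
        insert(root, token, doc_id)

print(fuzzy_lookup(root, "hotl", 1))
```

For a multi-token query, the per-token results could then be intersected or compared by document id, which roughly corresponds to the comparison of error lists mentioned in the abstract.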
OPTIMALLY RANKED NEAREST NEIGHBOR FUZZY FULL TEXT SEARCH

Publication number: 20130151503

Abstract: A method and system for fuzzy full text search is disclosed. The system includes an inverted index where tokens are organized in a Next Valid Character (NVC) tree. Each path through the tree defines a token. The document sets for the tokens are stored at leaf nodes. After receiving search tokens, the system traverses the NVC tree in a lazy manner by exploring paths through nodes associated with the smallest edit distance. The system provides a result set per edit distance. For example, the system provides a result set of document identifiers associated with documents having the least amount of error first, followed by result sets of document identifiers associated with documents that have progressively more error.

Type: Application

Filed: December 8, 2011

Publication date: June 13, 2013

Inventor: Martin Pfeifle
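The ranked variant described above explores the tree lazily, following the paths with the smallest edit distance first and returning one result set per edit distance. One way to sketch this is a best-first traversal driven by a priority queue, where each pending node is keyed by a lower bound on the distance of any token beneath it. The dict-based trie, the `END` key, and the function names are assumptions, not the application's data structure.

```python
import heapq
from itertools import count, groupby

END = "$docs"  # hypothetical multi-character key marking where a token ends

def build_trie(doc_tokens):
    root = {}
    for doc_id, tokens in doc_tokens.items():
        for token in tokens:
            node = root
            for ch in token:
                node = node.setdefault(ch, {})
            node.setdefault(END, set()).add(doc_id)
    return root

def ranked_matches(root, query, max_dist):
    """Lazily yield (distance, token, docs) in non-decreasing distance order."""
    tie = count()  # unique tie-breaker so the heap never compares dicts
    heap = [(0, next(tie), "node", "", root, list(range(len(query) + 1)))]
    while heap:
        bound, _, kind, prefix, node, row = heapq.heappop(heap)
        if kind == "token":
            yield bound, prefix, node
            continue
        if END in node and row[-1] <= max_dist:
            heapq.heappush(heap, (row[-1], next(tie), "token", prefix, node[END], None))
        for ch, child in node.items():
            if ch == END:
                continue
            new_row = [row[0] + 1]
            for i in range(1, len(query) + 1):
                cost = 0 if query[i - 1] == ch else 1
                new_row.append(min(new_row[i - 1] + 1, row[i] + 1, row[i - 1] + cost))
            # min(new_row) is a lower bound for every token below this child.
            if min(new_row) <= max_dist:
                heapq.heappush(heap, (min(new_row), next(tie), "node", prefix + ch, child, new_row))

def result_sets(root, query, max_dist):
    """Yield (distance, document ids), least error first."""
    for dist, group in groupby(ranked_matches(root, query, max_dist), key=lambda m: m[0]):
        docs = set()
        for _, _, token_docs in group:
            docs |= token_docs
        yield dist, docs

root = build_trie({1: ["berlin", "hotel"], 2: ["bern", "hostel"], 3: ["paris", "hotel"]})
print(list(result_sets(root, "hotl", 2)))
```

Because `result_sets` is a generator, a caller that only needs the best matches can stop after the first result set without the remaining, more distant paths being explored.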
METHOD OF CALCULATING FEATURE-AMOUNT OF DIGITAL SEQUENCE, AND APPARATUS FOR CALCULATING FEATURE-AMOUNT OF DIGITAL SEQUENCE

Publication number: 20130151562

Abstract: The use of the known feature amounts results in a low level of accuracy in finding similar files. To deal with this problem, the similar file determination process of this invention involves dividing a file, while at the same time changing a file dividing condition, and repeating this dividing operation until the total number of divided pieces of data exceeds a predetermined number. For each of the divided pieces of data thus obtained a hash value is calculated and all the hash values are output.

Type: Application

Filed: February 2, 2011

Publication date: June 13, 2013

Applicant: HITACHI, LTD.

Inventors: Yasuhiro Fujii, Susumu Serita, Satoshi Kai, Takao Murakami, Takahiro Nakano
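The feature-amount abstract above divides a file while changing the dividing condition until the number of pieces exceeds a predetermined number, then hashes every piece. The sketch below is one possible reading of that loop. The boundary rule (sum of a small byte window divisible by a divisor), the halving of the divisor, and the use of SHA-1 are all assumptions chosen for illustration and are not stated in the abstract.

```python
import hashlib

def split(data, divisor, window=4):
    """Cut after each position where the sum of the last `window` bytes is divisible by divisor."""
    pieces, start = [], 0
    for i in range(window, len(data) + 1):
        if sum(data[i - window:i]) % divisor == 0:
            pieces.append(data[start:i])
            start = i
    if start < len(data):
        pieces.append(data[start:])
    return pieces

def feature_hashes(data, min_pieces, divisor=64):
    """Relax the dividing condition until more than min_pieces pieces are produced."""
    while True:
        pieces = split(data, divisor)
        # divisor == 1 cuts at every position, so the loop always terminates.
        if len(pieces) > min_pieces or divisor == 1:
            return [hashlib.sha1(p).hexdigest() for p in pieces]
        divisor //= 2  # a smaller divisor makes boundaries more frequent

data = bytes(range(256)) * 4
hashes = feature_hashes(data, min_pieces=10)
print(len(hashes))
```

Two files could then be compared by the overlap of their hash sets, since content-dependent boundaries tend to keep unchanged regions producing the same hashes.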
Centralized enterprise security policy framework

Patent number: 8463819

Abstract: A centralized enterprise security and provisioning policy framework is described. Enterprise wide security and provisioning is stored in a hierarchical fashion in a centralized LDAP based Directory server. Each policy and user maps directly to a unique entry in the directory. Policy entries can be created at specific administrative points in the Directory Information Tree instead of having to duplicate these policies as attributes of every user entry in the directory. The policies can be classified into provisioning, authentication, and authorization policies.

Type: Grant

Filed: March 17, 2005

Date of Patent: June 11, 2013

Assignee: Oracle International Corporation

Inventors: Gurudatt Shashikumar, Hari V. N. Sastry
Mapping product identification information to a product

Patent number: 8463805

Abstract: Mapping product identification data to product information. Generating classifiers operative to classify products referred to in product identification data. Receiving client product identification data. Applying at least one generated classifier on the received product identification data. Using at least one classification determined by the applied classifier(s) and a subset of the client product identification data to query the product database. Qualifying the results of the query using at least one classification.

Type: Grant

Filed: March 25, 2011

Date of Patent: June 11, 2013

Assignee: CBS Interactive, Inc.

Inventors: Kevin Jordan Osborn, Christophe Henri Edouard Laudon
METHOD AND SYSTEM FOR OPTIMIZING THE ARRANGMENT OF SPATIAL ELEMENTS

Publication number: 20130144910

Abstract: In a data visualization system, a method of analysing and representing spatial data sets to optimize the arrangement of spatial elements, the method including the steps of: retrieving data from a data storage module that is in communication with the data visualization system, determining lift values for a plurality of predefined spatial areas from the retrieved data based on a set of fuzzy association rules applied to the predefined spatial areas, determining spatial performance values for the predefined spatial areas, and calculating a weighted spatial relationship between the determined lift values and spatial performance values.

Type: Application

Filed: December 14, 2010

Publication date: June 6, 2013

Applicant: Business Intelligence Solutions Safe B. V.

Inventor: Andrew John Cardno
