Inverted Index Patents (Class 707/742)

Sequential chain registry

Patent number: 8819026

Abstract: Systems and methods are disclosed for tracking an object as it traverses a sequential chain. The relationships between the object, its movement through space and time, and the entities associated with the object at a discrete point of time are captured by a sequential chain. A unique identifier may be created that is continuously modified as the object traverses the sequential chain. The unique identifier may be used to capture relationship information between the object and its related entities and movements.

Type: Grant

Filed: August 25, 2011

Date of Patent: August 26, 2014

Assignee: SCR Technologies, Inc.

Inventor: Randal B. Fischer
Techniques for representing keywords in an encrypted search index to prevent histogram-based attacks

Patent number: 8819451

Abstract: A method and system for cryptographically indexing, searching for, and retrieving documents is provided. In some embodiments, an encryption system is provided that generates a document index that allows users to retrieve documents by performing encrypted queries for keywords associated with the documents. In some embodiments, each keyword maps to the same number of encrypted document identifiers. In some embodiments, an extractor graph is employed to map an indication of each keyword to a number of buckets storing encrypted document identifiers. In some embodiments, an order-preserving encryption system is provided. The encryption system uses an ordered index that maps encrypted instances of ordered attribute values to documents that are associated with those values. The ordered index enables queries containing query operators that rely on order, such as less than (“<”) or greater than (“>”), to be successfully performed on encrypted attribute values.

Type: Grant

Filed: May 28, 2009

Date of Patent: August 26, 2014

Assignee: Microsoft Corporation

Inventors: Satyanarayana V. Lokam, Ajay Manchepalli, Balasubramanyan Ashok, Sandeep P. Karanth, Raghav Bhaskar
Updating An Inverted Index In A Real Time Fashion

Publication number: 20140236962

Abstract: Systems and methods for regularly updating portions of a merged index are provided. Initially, upon receiving an indication that modifications have occurred to content of web-based documents, dynamic update of index (DUI) objects that identify the documents and expose the modified content are composed by ascertaining relative positions of the modified content within the documents, and packaging identifiers of the documents, the relative positions, and metadata underlying the modified content into a message. The DUI objects are applied to an overloading index that maintains structured records of recent modifications. In particular, portions of the overloading index are targeted utilizing the document identifiers and the relative positions specified by the DUI object, thereby updating the targeted portions within the overloading index corresponding to the modified content without rewriting the entire overloading index.

Type: Application

Filed: May 2, 2014

Publication date: August 21, 2014

Applicant: Microsoft Corporation

Inventors: Abhas Kumar, Pratibha Permandla, Gaurav Sareen, Anna Timasheva, Deepak Shankar
String and sub-string searching using inverted indexes

Patent number: 8805808

Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

Type: Grant

Filed: June 25, 2013

Date of Patent: August 12, 2014

Assignee: SAP AG

Inventors: Frederik Transier, Franz Faerber
SEQUENTIAL CHAIN REGISTRY

Publication number: 20140214853

Abstract: Systems and methods are disclosed for tracking an object as it traverses a sequential chain. The relationships between the object, its movement through space and time, and the entities associated with the object at a discrete point of time are captured by a sequential chain. A unique identifier may be created that is continuously modified as the object traverses the sequential chain. The unique identifier may be used to capture relationship information between the object and its related entities and movements.

Type: Application

Filed: April 1, 2014

Publication date: July 31, 2014

Applicant: SCR Technologies, Inc.

Inventor: Randal B. Fischer
Ordered index

Patent number: 8775435

Abstract: Systems and methods for processing an index are described. A postings list of items containing a particular term is ordered in a desired retrieval order, e.g., most recent first. The ordered items are inserted into an inverted index in the desired retrieval order, resulting in an ordered inverted index from which items may be efficiently retrieved in the desired retrieval order. During retrieval, items may first be retrieved from a live index, and the retrieved items from the live and ordered indexes may be merged. The retrieved items may also be filtered in accordance with the items' file grouping parameters.

Type: Grant

Filed: September 13, 2011

Date of Patent: July 8, 2014

Assignee: Apple Inc.

Inventors: Wayne Loofbourrow, John Martin Hoernkvist, Eric Richard Koebler, Yan Arrouye
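The retrieval flow described in the abstract above (postings kept in the desired order, then merged with a live index at query time) can be illustrated with a minimal sketch. This is not the patented method; the names build_ordered_index and retrieve, the (timestamp, item_id) posting layout, and the "newest first" ordering are assumptions made for illustration only.

```python
import heapq
import itertools
from collections import defaultdict


def build_ordered_index(items):
    """Build term -> postings, each postings list sorted newest first.

    `items` is an iterable of (item_id, timestamp, terms) tuples
    (a hypothetical record layout chosen for this sketch).
    """
    index = defaultdict(list)
    for item_id, timestamp, terms in items:
        for term in set(terms):
            index[term].append((timestamp, item_id))
    for postings in index.values():
        postings.sort(reverse=True)  # desired retrieval order: most recent first
    return dict(index)


def retrieve(term, live_index, ordered_index, limit=10):
    """Merge the live and ordered postings for `term`, newest first."""
    live = live_index.get(term, [])
    ordered = ordered_index.get(term, [])
    # Both inputs are already sorted descending, so a streaming merge suffices.
    merged = heapq.merge(live, ordered, reverse=True)
    return [item_id for _, item_id in itertools.islice(merged, limit)]
```

Because both lists are stored in retrieval order, the first `limit` results can be produced without sorting the full postings at query time.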
Inverted indexes for accelerating analytics queries

Patent number: 8762387

Abstract: The disclosed embodiments provide a system that processes data. During operation, the system obtains a set of records, wherein each of the records comprises one or more metrics and at least one dimension associated with the one or more metrics. Next, the system creates, in a data segment comprising the records, an inverted index for a column in the records based on a cardinality of the column. Finally, the system compresses the inverted index based on a jump value associated with record identifiers in the column.

Type: Grant

Filed: July 31, 2013

Date of Patent: June 24, 2014

Assignee: LinkedIn Corporation

Inventors: Dhaval Patel, Sanjay Dubey, Praveen N. Naga, Volodymyr Zhabiuk, Jintae Jung
QUERY AND INDEX OVER DOCUMENTS

Publication number: 20140164388

Abstract: A document index is generated from a set of documents and is used to identify documents that match one or more queries. A tree is generated for each document with a node corresponding to each object of the document. The nodes of the generated trees are merged or combined to generate the document index, which is itself a tree. In addition, an inverted index is generated for each node of the index that identifies the tree(s) that the node originated from. When a query is received, the query is first executed against the document index tree: during the execution, proper set operations are applied to the inverted indices associated with the nodes matched by the query. The resulting set identifies the documents that may match the query. The query is then executed on the identified documents.

Type: Application

Filed: December 10, 2012

Publication date: June 12, 2014

Applicant: Microsoft Corporation

Inventors: Li Zhang, Mihai Budiu, Yuan Yu, Gordon D. Plotkin
Indexing and searching entity-relationship data

Patent number: 8751505

Abstract: Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.

Type: Grant

Filed: March 11, 2012

Date of Patent: June 10, 2014

Assignee: International Business Machines Corporation

Inventors: David Carmel, Haggai Roitman, Sivan Yogev
Recognition of target words using designated characteristic values

Patent number: 8744839

Abstract: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of text segments to form a text data combination set; determining an intersection of the candidate word set and the text data combination set, the intersection comprising a plurality of text data combinations; determining a plurality of designated characteristic values for the plurality of text data combinations; based at least in part on the plurality of designated characteristic values and according to at least a criterion, recognizing among the plurality of text data combinations target words whose characteristic values fulfill the criterion.

Type: Grant

Filed: September 22, 2011

Date of Patent: June 3, 2014

Assignee: Alibaba Group Holding Limited

Inventors: Haibo Sun, Yang Yang, Yining Chen
Inverted index and inverted list process for storing and retrieving information

Patent number: 8738631

Abstract: A process is disclosed for the computer management of inverted lists and inverted indices, in which the standard representation and processing of inverted lists is changed in order to achieve a simpler, more compact and more efficient architecture.

Type: Grant

Filed: September 24, 2013

Date of Patent: May 27, 2014

Inventor: Giovanni M. Sacco
SYSTEM FOR GENERATING INDEX RESISTANT AGAINST DIVULGING OF INFORMATION, INDEX GENERATION DEVICE, AND METHOD THEREFOR

Publication number: 20140129567

Abstract: In the present invention, scope search can be effectively performed in a database having encrypted registration information. A plurality of values, first identification information to identify the plurality of values, and a key are accepted as input. A value group is generated from the plurality of values. The value group is treated as a word group, and a secure index is generated from the word group, the first identification information, and the key. On the basis of a value to be retrieved and a key, trapdoor information for the value to be retrieved is generated. With respect to the generated secure index, a secure index assessment process is performed using the trapdoor information. When the value to be retrieved is assessed to be contained in the secure index as a result of the assessment process, second identification information to identify the secure index is output.

Type: Application

Filed: July 27, 2012

Publication date: May 8, 2014

Applicant: c/o NEC Corporation

Inventors: Toshinori Araki, Isamu Teranishi
Method and Apparatus for Geographic Document Retrieval

Publication number: 20140129566

Abstract: A geographic document retrieval method (GDR) can be executed by a computer system to index, retrieve and rank geographical documents. Textual and spatial attributes of geographical documents are indexed separately using inverted index and spatial index, respectively. Spatial attributes of a document are represented as one or more contiguously closed regions of arbitrary shapes. Upon receiving an input query carrying a geographic representation of a location using arbitrary regions, the GDR method retrieves one or more documents by executing an overlap test between arbitrary regions from the query and the arbitrary regions associated with the documents.

Type: Application

Filed: April 19, 2013

Publication date: May 8, 2014

Applicant: xAd, Inc.

Inventor: xAd, Inc.
Creation of Inverted Index System, and Data Processing Method and Apparatus

Publication number: 20140101167

Abstract: The present disclosure relates to techniques for establishing an inverted indexing system and related data processing. The techniques may include writing, by a computing device, inverted indexes of a massive amount of data records into at least one inverted file. The computing device may then write description information of the written inverted file into a description file associated with the inverted file, and establish the inverted indexing system based on the inverted file and the description file of the inverted file. The techniques enhance efficiency in establishing the inverted indexing system and in processing data using the system.

Type: Application

Filed: October 3, 2013

Publication date: April 10, 2014

Applicant: Alibaba Group Holding Limited

Inventor: Jian Qin
Management of data segments for analytics queries

Patent number: 8688718

Abstract: The disclosed embodiments provide a method and system for processing data. During operation, the system obtains a set of records, wherein each of the records comprises one or more metrics and at least one dimension associated with the one or more metrics. Next, the system creates a data segment comprising at least one of a forward index and an inverted index for a column in the records. The system then stores the data segment in network-accessible storage and assigns the data segment to a partition. Finally, the system enables querying of the data segment through a query node associated with the partition.

Type: Grant

Filed: July 31, 2013

Date of Patent: April 1, 2014

Assignee: LinkedIn Corporation

Inventors: Sanjay Dubey, Dhaval Patel, Praveen N. Naga, Volodymyr Zhabiuk
Storage device having full-text search function

Patent number: 8682902

Abstract: According to one embodiment, a storage device includes an interface, first and second memory blocks, and a controller. The interface receives a content search request. The first memory block stores files and inverted files corresponding to contents included in the files. The second memory block stores a file search table. The controller creates the inverted file for each content included in the files and stores IDs of the files including the content in the inverted file. The controller obtains, by search of the content, a corresponding inverted file from the inverted files stored in the first memory block and stores, in the file search table, the IDs of the files included in the obtained inverted file. The controller outputs the IDs of the files stored in the file search table from the interface as a search result for the content search request.

Type: Grant

Filed: November 20, 2012

Date of Patent: March 25, 2014

Assignee: Kabushiki Kaisha Toshiba

Inventors: Kosuke Tatsumura, Atsuhiro Kinoshita
PARALLEL GENERATION OF TOPICS FROM DOCUMENTS

Publication number: 20140059054

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhanced parallel latent Dirichlet allocation (PLDA+). A PLDA+ system is a system of multiple processors that are configured to generate topics from multiple documents. The multiple processors are designated as two types: document processors and matrix processors. The documents are distributed among the document processors. Generated topics are distributed among the matrix processors. Tasks performed on the document processors and matrix processors are segregated into two types of tasks: computation-bound tasks and communication-bound tasks. Computation-bound tasks are CPU intensive tasks; communication-bound tasks are network intensive tasks. Data placement and pipeline strategies are employed such that the computation-bound tasks and the communication-bound tasks are distributed to the processors in a balanced manner, and performed in parallel.

Type: Application

Filed: May 11, 2011

Publication date: February 27, 2014

Inventors: Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang
Searching documents for ranges of numeric values

Patent number: 8655888

Abstract: Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.

Type: Grant

Filed: December 22, 2011

Date of Patent: February 18, 2014

Assignee: International Business Machines Corporation

Inventors: Marcus F. Fontoura, Ronny Lempel, Runping Qi, Jason Y. Zien
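The idea in the abstract above, posting lists that each cover a range of consecutive values and carry the matching value alongside each document identifier, can be sketched as follows. The fixed bucket width, the function names, and the dictionary-based storage are assumptions for illustration and are not taken from the patent.

```python
from collections import defaultdict

BUCKET_WIDTH = 10  # hypothetical width of each range of consecutive values


def build_range_postings(docs, width=BUCKET_WIDTH):
    """Map each value range (bucket) to a sorted list of (doc_id, value).

    `docs` maps a document id to the integer values found in that document.
    """
    postings = defaultdict(list)
    for doc_id, values in docs.items():
        for value in values:
            postings[value // width].append((doc_id, value))
    return {bucket: sorted(entries) for bucket, entries in postings.items()}


def range_query(postings, low, high, width=BUCKET_WIDTH):
    """Return sorted ids of documents holding a value in [low, high]."""
    hits = set()
    # Only buckets that overlap the queried range are visited.
    for bucket in range(low // width, high // width + 1):
        for doc_id, value in postings.get(bucket, []):
            # The stored value filters entries in partially covered buckets.
            if low <= value <= high:
                hits.add(doc_id)
    return sorted(hits)
```

Storing the value with each document identifier is what lets the boundary buckets be filtered exactly, while fully covered buckets could in principle be accepted wholesale.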
Method and system for processing a query

Patent number: 8645388

Abstract: A method for processing a query includes providing an inverted multi-path index for storing path-value pairs. Each path-value pair references at least one structured document stored in a database system, and comprises an index path expression of an indexed element and an indexed value associated with the indexed element. The method includes receiving a clause including a path expression-value pair comprising a path expression associated with an element, determining that the clause can be processed by the inverted multi-path index, processing the clause to identify a path-value pair in the inverted multi-path index matching the path expression-value pair of the clause, and identifying the structured document referenced by the matching path-value pair.

Type: Grant

Filed: June 16, 2011

Date of Patent: February 4, 2014

Assignee: EMC Corporation

Inventors: Edward C. Bueche, Francisco Borges, Petr Pleshachkov, Shanshan Quan, Marc Brette, Venkatesan Chandrasekaran
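A minimal sketch of an inverted index keyed on path-value pairs, in the spirit of the abstract above. It assumes structured documents are nested dictionaries with hashable scalar leaves; the names flatten, build_path_value_index, and match_clause are hypothetical, and the step of deciding whether a clause can be processed by the index is omitted.

```python
from collections import defaultdict


def flatten(document, prefix=""):
    """Yield (path, value) pairs for every leaf of a nested dict."""
    for key, value in document.items():
        path = f"{prefix}/{key}"
        if isinstance(value, dict):
            yield from flatten(value, path)
        else:
            yield path, value


def build_path_value_index(documents):
    """Map each (path, value) pair to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, document in documents.items():
        for pair in flatten(document):
            index[pair].add(doc_id)
    return index


def match_clause(index, path, value):
    """Return sorted ids of documents whose element at `path` equals `value`."""
    return sorted(index.get((path, value), ()))
```

A clause such as "/order/status equals shipped" then resolves with a single dictionary lookup instead of a scan over every document.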
RESOURCE EFFICIENT DOCUMENT SEARCH

Publication number: 20140032567

Abstract: The present document relates to a system and method for searching a document using one or more search terms. In particular, the present document relates to a resource efficient method for searching a document within a database of documents. A method for determining an inverse index on an electronic device including a database is described. The inverse index is configured to map a plurality of text data entities from the database to a search term. The method includes determining a plurality of relevance vectors for a plurality of text data entities from the database. Determining a relevance vector for a text data entity from the database includes: selecting N terms which are descriptive of the text data entity; and determining the relevance vector from the selected N terms. Furthermore, the method includes determining the inverse index comprising a plurality of records.

Type: Application

Filed: July 29, 2013

Publication date: January 30, 2014

Applicant: ExB Asset Management GmbH

Inventors: Ramin ASSADOLLAHI, Stefan BORDAG
Method and system for inverted indexing of a dataset

Patent number: 8615519

Abstract: Methods and systems for providing an inverted index for a dataset are disclosed. The inverted index includes a position vector, with fields that correspond to values in the indexed dataset. The fields include data to be used in determining where each value appears in the dataset. The position vector is populated differently for different value types. A 1:1 value appears once in the dataset; a 1:n value appears multiple times. For a 1:1 value, the position vector stores information for where that value appears. For a 1:n value, the position vector stores a pointer, e.g. a memory reference, that identifies a list of locations where the value appears. The list can be encoded or otherwise compressed. A set of indicators can be stored for the fields indicating whether the field has 1:n or 1:1 value information. The indicator is used to control interpretation of the information in a field.

Type: Grant

Filed: March 29, 2012

Date of Patent: December 24, 2013

Assignee: SAP AG

Inventor: Alexander Froemmgen
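The position-vector layout described in the abstract above can be sketched as follows: a value that appears once stores its row directly, while a value that appears several times stores a pointer into a separate list of locations, with an indicator telling the reader how to interpret each field. The dictionary-encoded integer column, the list-based overflow area, and the function names are assumptions for illustration; the patent's encoding and compression of the location lists are not shown.

```python
def build_position_index(column):
    """Build (position_vector, is_multi, overflow) for a column of value ids.

    `column` is a list of non-negative integer value ids, one per row.
    """
    occurrences = {}
    for row, value in enumerate(column):
        occurrences.setdefault(value, []).append(row)

    size = max(column) + 1 if column else 0
    position_vector = [None] * size   # one field per possible value id
    is_multi = [False] * size         # indicator: does the field hold a pointer?
    overflow = []                     # location lists for 1:n values

    for value, rows in occurrences.items():
        if len(rows) == 1:
            position_vector[value] = rows[0]        # 1:1, store the row itself
        else:
            position_vector[value] = len(overflow)  # 1:n, store a pointer
            is_multi[value] = True
            overflow.append(rows)
    return position_vector, is_multi, overflow


def lookup(value, position_vector, is_multi, overflow):
    """Return the rows where `value` appears."""
    if value >= len(position_vector) or position_vector[value] is None:
        return []
    field = position_vector[value]
    return overflow[field] if is_multi[value] else [field]
```

The indicator array is what keeps a row number from being mistaken for a pointer, since both are plain integers in the same field.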
Search Method and Apparatus

Publication number: 20130339369

Abstract: The present disclosure provides techniques to solve problems (e.g., the low efficiency and a waste of resources) derived from conventional methods. These techniques may include extracting, by a computing device, the first N keywords appearing the most in target information published by target users as target words, and creating an inverted index based on information on a page of the target users and the target words, wherein the inverted index includes a target field and a page information field, and N is an integer. The computing device may receive an inquiry phrase and determine target users matching the inquiry phrase in the inverted index based on the inquiry phrase. The computing device may calculate a relevance between the matched target users and the inquiry phrase through the target field and the page information field, and return a certain result based on the relevance.

Type: Application

Filed: June 17, 2013

Publication date: December 19, 2013

Inventors: Yaobing Li, Wei Zheng, Huaxing Jin, Feng Lin
Method and system for data compression

Patent number: 8610605

Abstract: In one aspect, methods and systems for variable-block length encoding of data, such as an inverted index for a file are disclosed. These methods and systems provide for relatively fast encoding and decoding, while also providing for compact storage. Other aspects include a nearly 1:1 inverted index comprising a position vector and a data store, wherein values that have a unique location mapping are represented directly in the position vector, while for 1:n values (n>1), the position vector can include a pointer, and potentially some portion of information that would typically be stored in the data area, in order to fully use fixed width portions of the position vector (where a maximum pointer size is smaller than a maximum location identifier size).

Type: Grant

Filed: March 29, 2012

Date of Patent: December 17, 2013

Assignee: SAP AG

Inventor: Alexander Froemmgen
Systems and methods for monitoring and detecting fraudulent uses of business applications

Patent number: 8612479

Abstract: Systems and methods are described that detect fraud in existing logs of raw data. There can be several disparate logs, each including data of disparate data types and generated by different and possibly unrelated software enterprise applications. The fraud management system aggregates and organizes the raw log data, extends the raw data with reference data, archives the data in a manner that facilitates efficient access and processing of the data, allows for investigation of potentially fraudulent usage scenarios, and uses the results of the investigation to identify patterns of data that correspond to high risk usage scenarios and/or process steps. In subsequent processing, archived data can be compared against the identified patterns corresponding to high risk usage scenarios to detect matches, and the invention thereby automatically detects high risk usage scenarios and issues appropriate alerts and reports.

Type: Grant

Filed: May 15, 2007

Date of Patent: December 17, 2013

Assignee: FIS Financial Compliance Solutions, LLC

Inventors: Jwahar R. Bammi, Bagepalli C. Krishna, Robert Posniak, Joseph Walsh
Methods for indexing and searching based on language locale

Patent number: 8577891

Abstract: In response to a search query having a search term received from a client, a current language locale is determined. A state machine is built based on the current language locale, where the state machine includes one or more nodes to represent variance of the search term having identical meaning of the search term. Each node of the state machine is traversed to identify one or more postings lists of an inverted index corresponding to each node of the state machine. One or more item identifiers obtained from the one or more postings list are returned to the client, where the item identifiers identify one or more files that contain the variance of the search term represented by the state machine.

Type: Grant

Filed: October 27, 2010

Date of Patent: November 5, 2013

Assignee: Apple Inc.

Inventors: John M. Hörnkvist, Eric R. Koebler
String and Sub-String Searching Using Inverted Indexes

Publication number: 20130290345

Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

Type: Application

Filed: June 25, 2013

Publication date: October 31, 2013

Applicant: SAP AG

Inventors: Frederik Transier, Franz Faerber
Inverted index and inverted list process for storing and retrieving information

Patent number: 8566324

Abstract: A process is disclosed for the computer management of inverted lists and inverted indices, in which the standard representation and processing of inverted lists is changed in order to achieve a simpler, more compact and more efficient architecture.

Type: Grant

Filed: September 12, 2010

Date of Patent: October 22, 2013

Inventor: Giovanni M Sacco
PSEUDO-DOCUMENTS TO FACILITATE DATA DISCOVERY

Publication number: 20130275436

Abstract: Various embodiments promote the discoverability of data that can be contained within a database. In one or more embodiments, data within a database is organized in a structure having a schema. The structure and data can be processed in a manner that renders one or more pseudo-documents each of which constitutes a sub-structure that can be indexed. Once produced and indexed, the pseudo-documents constitute a set of searchable objects each of which relationally points back to its associated structure within the database. Searches can now be performed against the pseudo-documents which, in turn, return a set of search results. The set of search results can include multiple sub-sets of pseudo-documents, each sub-set of which is associated with a different structure.

Type: Application

Filed: April 11, 2012

Publication date: October 17, 2013

Applicant: Microsoft Corporation

Inventors: Surajit Chaudhuri, Lev Novik, John C. Platt
DATA STRUCTURE, INDEX CREATION DEVICE, DATA SEARCH DEVICE, INDEX CREATION METHOD, DATA SEARCH METHOD, AND COMPUTER-READABLE RECORDING MEDIUM

Publication number: 20130262470

Abstract: In an inverted list of each node in a taxonomy, among each node, an inverted list of the highest node is a list of integer values indicating an identifier of search subject data, and an inverted list of a node other than the highest node, in place of the identifier, is a list of integer values indicating a position in an inverted list corresponding to a node that is higher by one than the node. Furthermore, a list of integer values in an inverted list of each node is divided into two or more blocks, and a differential value between an integer value and an integer value directly before the integer value in the block is converted into a bit string of a variable length integer code.

Type: Application

Filed: June 16, 2011

Publication date: October 3, 2013

Applicant: NEC CORPORATION

Inventors: Yukitaka Kusumura, Hironori Mizuguchi, Dai Kusui, Yusuke Muraoka
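The block-wise differential coding mentioned at the end of the abstract above can be sketched as follows. The sketch covers only that part (not the taxonomy-relative positions) and assumes a byte-oriented variable-length code with 7 payload bits per byte, ascending non-negative integers, and a delta chain that restarts at zero in every block; none of these specifics come from the application.

```python
def encode_varint(n):
    """Encode a non-negative integer, 7 bits per byte, low-order bits first."""
    out = bytearray()
    while True:
        byte = n & 0x7F
        n >>= 7
        if n:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def encode_blocks(sorted_ids, block_size=4):
    """Split an ascending list into blocks and delta-encode each block."""
    blocks = []
    for start in range(0, len(sorted_ids), block_size):
        encoded = bytearray()
        previous = 0  # restart per block so blocks decode independently
        for value in sorted_ids[start:start + block_size]:
            encoded += encode_varint(value - previous)
            previous = value
        blocks.append(bytes(encoded))
    return blocks


def decode_block(data):
    """Invert encode_blocks for a single block."""
    values, current, shift, previous = [], 0, 0, 0
    for byte in data:
        current |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            previous += current  # undo the delta
            values.append(previous)
            current, shift = 0, 0
    return values
```

Restarting the delta chain per block trades a few extra bytes for the ability to decode any block without reading the ones before it.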
REAL TIME MAPPING OF USER MODELS TO AN INVERTED DATA INDEX FOR RETRIEVAL, FILTERING AND RECOMMENDATION

Publication number: 20130262471

Abstract: A catalog record is bridged to information stored in at least one inverted index by receiving an application user interface call associated with a predetermined filter request including a record identifier identifying a record in a relational database. A bitset is generated based on item identifiers in the record. The bitset is applied to at least one inverted index to obtain metadata associated with the item identifiers.

Type: Application

Filed: March 27, 2013

Publication date: October 3, 2013

Applicant: The Echo Nest Corporation

Inventors: Brian Whitman, Tyler Williams, Hui Ted Cao
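As a rough illustration of the bitset step in the abstract above, the sketch below turns the item identifiers from a catalog record into a bitset and applies it as a filter over an inverted index's postings. Using a Python integer as the bitset, small non-negative integer item ids, and the names to_bitset and filter_postings are all assumptions for illustration.

```python
def to_bitset(item_ids):
    """Pack non-negative integer item ids into a single integer bitset."""
    bits = 0
    for item_id in item_ids:
        bits |= 1 << item_id
    return bits


def filter_postings(inverted_index, term, allowed_bits):
    """Return postings for `term` whose item id is set in `allowed_bits`."""
    return [
        item_id
        for item_id in inverted_index.get(term, [])
        if (allowed_bits >> item_id) & 1
    ]
```

For example, a record listing items 1, 4, and 6 restricts a term's postings to those three items with one bit test per posting.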
Methods and systems for compressing indices

Patent number: 8549000

Abstract: Systems and methods for compressing indices are described. In one aspect, a plurality of items are selected where each item has an entry in an inverted index and each item entry comprises a listing of articles that the item appears in. At least a first item entry and a second item entry are determined for compression and the second item entry is compressed into the first item entry resulting in a compressed first item entry.

Type: Grant

Filed: November 14, 2011

Date of Patent: October 1, 2013

Assignee: Google Inc.

Inventor: Adam J. Weissman
MATCHING DOCUMENTS AGAINST MONITORS

Publication number: 20130254211

Abstract: Techniques and tools are described for matching documents against monitors. An index can be generated from a plurality of monitors, where the index represents the query logic of the plurality of monitors. The index can be searched using the documents as search queries. The searching can comprise matching the documents against the monitors using the query logic represented in the index. An index can be distributed to a plurality of computing devices to be searched at the plurality of computing devices, where each computing device searches a subset of a plurality of documents against the full index. Searching at the plurality of computing devices can be performed in parallel, and results can be aggregated at a central location.

Type: Application

Filed: March 23, 2012

Publication date: September 26, 2013

Applicant: Jive Software, Inc.

Inventor: Lance Riedel
Data format for website traffic statistics

Patent number: 8538969

Abstract: A data format is optimized for storing data such as website traffic data. The data format enables easy access to and filtering of data, for example in generating website traffic reports. The data format also provides significant data compression. A method for generating a data file according to the data format employs linear compression and indexing to efficiently store the data. Data stored according to the format can be easily retrieved, particularly when a known value is specified and particular entries matching the known value are sought.

Type: Grant

Filed: November 14, 2005

Date of Patent: September 17, 2013

Assignee: Adobe Systems Incorporated

Inventor: Michael Paul Bailey
INDEXING AND SEARCHING ENTITY-RELATIONSHIP DATA

Publication number: 20130238631

Abstract: Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.

Type: Application

Filed: March 11, 2012

Publication date: September 12, 2013

Applicant: International Business Machines Corporation

Inventors: David Carmel, Haggai Roitman, Sivan Yogev
Searchable symmetric encryption with dynamic updating

Patent number: 8533489

Abstract: A Searchable Symmetric Encryption (SSE) mechanism is described which allows efficient dynamic updating of encrypted index information. The encrypted index information includes pointer information that is encrypted using a malleable encryption scheme. The SSE mechanism updates the encrypted index information by modifying at least one instance of the pointer information without decrypting the pointer information, and thereby without revealing the nature of the changes being made. In one implementation, the SSE mechanism includes a main indexing structure and a deletion indexing structure. An updating operation involves patching applied to both the main indexing structure and deletion indexing structure.

Type: Grant

Filed: September 29, 2010

Date of Patent: September 10, 2013

Assignee: Microsoft Corporation

Inventors: Thomas M. Roeder, Seny F. Kamara
ROBUST DISCOVERY OF ENTITY SYNONYMS USING QUERY LOGS

Publication number: 20130232129

Abstract: A similarity analysis framework is described herein which leverages two or more similarity analysis functions to generate synonyms for an entity reference string r_e. The functions are selected such that the synonyms that are generated by the framework satisfy a core set of synonym-related properties. The functions operate by leveraging query log data. One similarity analysis function takes into consideration the strength of similarity between a particular candidate string s_e and an entity reference string r_e even in the presence of sparse query log data, while another function takes into account the classes of s_e and r_e. The framework also provides indexing mechanisms that expedite its computations. The framework also provides a reduction module for converting long entity reference strings into shorter strings, where each shorter string (if found) contains a subset of the terms in its longer counterpart.

Type: Application

Filed: June 4, 2012

Publication date: September 5, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Tao Cheng, Kaushik Chakrabarti, Surajit Chaudhuri, Dong Xin
Method for performing a database query in a relational database

Patent number: 8527512

Abstract: A method performs a database query in a relational database, the query being carried out by a database engine and being based on user-defined search criteria. The method includes retrieving a number N of properties of a record within a main database table, the number N being higher than zero, creating a search criteria option for each of the N properties, creating a search criteria table for every search criteria option, creating an index for every search criteria table, and performing the database query based on a user-defined combination of a plurality of the search criteria options.

Type: Grant

Filed: September 17, 2009

Date of Patent: September 3, 2013

Assignee: Siemens Aktiengesellschaft

Inventor: Frédéric Depreter
Transactionally consistent indexing for data blobs

Patent number: 8510304

Abstract: A transactionally consistent indexer is a tiered middleware framework component that updates a transactional index for a data blob according to a data transaction requested by an application. The transactionally consistent indexer determines index entries to be added or removed from a transaction index based on the application request. The transactionally consistent indexer further inserts each index entry to be added into the transaction index. With respect to each index entry to be removed, the transactionally consistent indexer uses a time stamp or version number of the index entry for ensuring optimistic concurrency during deletion. The transactionally consistent indexer then updates a data blob that is associated with each index entry to be added or each index entry to be removed based on the application request.

Type: Grant

Filed: August 27, 2010

Date of Patent: August 13, 2013

Assignee: Amazon Technologies, Inc.

Inventors: Gregory J. Briggs, Vincent M. Rohr
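
Illustrative sketch (not taken from the patent): the abstract above says a version number or time stamp on each index entry is used for optimistic concurrency during deletion. The Python below shows only that version check, assuming an in-memory, single-threaded index; the class VersionedIndex and its methods are hypothetical, and a real store would need the compare-and-delete step to be atomic.

```python
class VersionedIndex:
    """Toy index whose entries carry a version number."""

    def __init__(self):
        self._entries = {}  # key -> (blob_id, version)

    def insert(self, key, blob_id):
        """Insert or overwrite an entry and return its new version."""
        _, version = self._entries.get(key, (None, 0))
        self._entries[key] = (blob_id, version + 1)
        return version + 1

    def get(self, key):
        return self._entries.get(key)

    def delete_if_version(self, key, expected_version):
        """Delete only if the entry is still at the version the caller read."""
        entry = self._entries.get(key)
        if entry is None or entry[1] != expected_version:
            return False  # someone else changed the entry; caller should retry
        del self._entries[key]
        return True


idx = VersionedIndex()
v1 = idx.insert("color:red", "blob-1")
v2 = idx.insert("color:red", "blob-2")      # a concurrent writer updates the entry
stale = idx.delete_if_version("color:red", v1)
fresh = idx.delete_if_version("color:red", v2)
print(stale, fresh)
```
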
Faceted search with relationships between categories

Patent number: 8510306

Abstract: Method, system, and computer program product for faceted search with relationships between categories are provided. The method includes: having a document set of multiple documents, each document having associated categories to which it belongs; grouping multiple categories associated with a document into a category set based on a relationship between the multiple categories; associating the category set with the document; and indexing the category set for retrieval of documents from categories sharing a category set. Indexing the category set includes: having an index entry of a textual representation of a category, wherein the index entry includes a single occurrence for each document to which the category is attached; and adding to a document occurrence a payload consisting of a serialization of an identifier of the category sets, associated with the document, to which the category belongs.

Type: Grant

Filed: May 30, 2011

Date of Patent: August 13, 2013

Assignee: International Business Machines Corporation

Inventors: David Carmel, Haggai Roitman, Sivan Yogev
Full text search capabilities integrated into distributed file systems— incrementally indexing files

Patent number: 8504565

Abstract: A hierarchical distributed search mechanism is integrated into a distributed file system. Traditional file system APIs (create, open, close, read, write, link, rename, delete, . . . ) and the over-the-wire protocols employed to project these APIs into remote client sites (CIFS, NFS, DDS, Appletalk) are extended to enable the dynamic creation of temporary directories containing links to objects identified by search engines (executing at sites “close” to “their” data) as meeting the search criteria specified by the first parameter of a search function call. The search function, derived from the standard file system API function create, is added to the file system API.

Type: Grant

Filed: September 9, 2005

Date of Patent: August 6, 2013

Inventor: William M. Pitts
Search techniques for rich internet applications

Patent number: 8504555

Abstract: A computing device includes one or more rich internet application (RIA) client engines. Each RIA client engine includes a corresponding private RIA storage area. The computing device also includes a per-RIA public storage area for each RIA. The per-RIA public storage area includes a subset of data items in the private RIA storage area of the corresponding RIA client engine. A search engine of the computing device may search the data items in the one or more per-RIA public storage areas and link to content in the private RIA storage area of the corresponding RIA client engine at a given data item matching a search request.

Type: Grant

Filed: June 25, 2008

Date of Patent: August 6, 2013

Assignee: Microsoft Corporation

Inventor: Jonathan C. Hawkins
String and sub-string searching using inverted indexes

Patent number: 8498972

Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.

Type: Grant

Filed: December 16, 2010

Date of Patent: July 30, 2013

Assignee: SAP AG

Inventors: Frederik Transier, Franz Faerber
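
Illustrative sketch (not taken from the patent): the abstract above describes keeping one inverted index for terms and a separate one for term separators, then parsing a query into the same two kinds of token. The Python below is a minimal picture of that, assuming terms are ASCII alphanumeric runs and separators are everything in between; it matches whole-token sequences only and does not attempt partial-term matching. All function names are hypothetical.

```python
import re
from collections import defaultdict

# Group 1 captures a term; group 2 captures a separator run.
TOKEN = re.compile(r"([A-Za-z0-9]+)|([^A-Za-z0-9]+)")


def tokenize(text):
    """Return (token, is_term) pairs in document order."""
    return [(m.group(0), m.group(1) is not None) for m in TOKEN.finditer(text)]


def build_indexes(docs):
    """Build separate positional indexes for terms and for separators."""
    term_index = defaultdict(lambda: defaultdict(list))
    sep_index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, (tok, is_term) in enumerate(tokenize(text)):
            key = tok.lower() if is_term else tok
            (term_index if is_term else sep_index)[key][doc_id].append(pos)
    return term_index, sep_index


def search(query, term_index, sep_index):
    """Return ids of documents that contain the query's tokens contiguously."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    candidates = None  # doc_id -> set of possible start positions
    for offset, (tok, is_term) in enumerate(tokens):
        index = term_index if is_term else sep_index
        postings = index.get(tok.lower() if is_term else tok, {})
        starts = {d: {p - offset for p in ps} for d, ps in postings.items()}
        if candidates is None:
            candidates = starts
        else:
            shared = candidates.keys() & starts.keys()
            candidates = {d: candidates[d] & starts[d] for d in shared}
    return {d for d, s in candidates.items() if s}


docs = {1: "state-of-the-art index", 2: "state of the art", 3: "art-index"}
term_index, sep_index = build_indexes(docs)
print(search("of-the", term_index, sep_index))
print(search("of the", term_index, sep_index))
```

Storing separators in their own index means a query such as "of-the" can be distinguished from "of the" without repeating separator characters inside every term posting.
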
Method, system and program for information re-organization

Patent number: 8489610

Abstract: An information re-organization system includes a plurality of counters coordinated to meaning attributes, and a re-organization incentive notification unit that, when the information stored in a preset storage unit has been updated, updates the value of the counter, among the multiple counters, whose meaning attribute is associated with the updated contents. The information re-organization system also includes an information re-organization processor that, when the value of the updated counter has met one of a number of predetermined conditions for information re-organization, executes the information re-organization processing corresponding to that condition on the information stored in the preset storage unit.

Type: Grant

Filed: March 27, 2009

Date of Patent: July 16, 2013

Assignee: NEC Corporation

Inventor: Masaki Kan
Encoding semi-structured data for efficient search and browsing

Patent number: 8489597

Abstract: A method for encoding XML tree data that includes the step of encoding the semi-structured data into strings of arbitrary length in a way that maintains non-structural and structural information about the XML data, and enables indexing the encoded XML data in a way that facilitates efficient search and browsing.

Type: Grant

Filed: September 1, 2004

Date of Patent: July 16, 2013

Assignee: Ori Software Development Ltd.

Inventors: Moshe Shadmon, Neal Sample, Brian Cooper, Michael J. Franklin
MULTIMEDIA METADATA ANALYSIS USING INVERTED INDEX WITH TEMPORAL AND SEGMENT IDENTIFYING PAYLOADS

Publication number: 20130151534

Abstract: The addition of relative term positions, temporal positions, and segment identifiers to an inverted index allows for temporal and phrase queries of multimedia assets. Segment identifiers enable any search results to be examined in context. The system makes advantageous use of Lucene's binary payload functionality to store temporal data and segment identifiers as additional binary data for each term instance in the inverted index. The payloads are made up of three variable-length integers, which account for twelve extra bytes of metadata, which are stored for each term instance. A content database on a Master/Administrator server node provides the indexes for search into content in response to user events, returning results in JSON format. The search results may then be used to locate and present content segments to a user containing both requested search term results and the time location within the multimedia asset in which the search term(s) is found.

Type: Application

Filed: December 10, 2012

Publication date: June 13, 2013

Applicant: Digitalsmiths, Inc.

Inventor: Digitalsmiths, Inc.
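
Illustrative sketch (not taken from the application): the abstract above describes a per-term payload built from three variable-length integers. The Python below packs three integers with a common 7-bits-per-byte varint scheme, assuming the three values are a start time, an end time, and a segment identifier; that field layout and every function name here are hypothetical, and no Lucene API is used.

```python
def encode_varint(n):
    """Encode a non-negative int, 7 bits per byte, high bit set on all but the last byte."""
    if n < 0:
        raise ValueError("varint encoding here supports non-negative integers only")
    out = bytearray()
    while True:
        byte = n & 0x7F
        n >>= 7
        if n:
            out.append(byte | 0x80)  # more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def decode_varint(buf, pos=0):
    """Decode one varint starting at `pos`; return (value, next_pos)."""
    result = shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def encode_payload(start_ms, end_ms, segment_id):
    """Concatenate the three varints into one binary payload."""
    return b"".join(encode_varint(v) for v in (start_ms, end_ms, segment_id))


def decode_payload(buf):
    values, pos = [], 0
    for _ in range(3):
        value, pos = decode_varint(buf, pos)
        values.append(value)
    return tuple(values)


payload = encode_payload(83_500, 84_250, 17)
print(len(payload), decode_payload(payload))
```
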
PROVISION OF QUERY SUGGESTIONS INDEPENDENT OF QUERY LOGS

Publication number: 20130151533

Abstract: Described herein are various technologies pertaining to provision of query suggestions to a user independent of a query log. Key phrases are automatically identified in documents of a document corpus, and a forward index and inverted index are generated. The forward index indexes key phrases by documents, and the inverted index indexes documents by key phrases. A query is received from a user, and documents relevant to the query are retrieved. Key phrases in the retrieved documents are identified via the forward index, and a subset of the key phrases are selected as query suggestions by determining coverage of the key phrases as identified in the inverted index.

Type: Application

Filed: December 7, 2011

Publication date: June 13, 2013

Applicant: Microsoft Corporation

Inventors: Uppinakuduru Raghavendra Udupa, Bhole Abhijit Narendra, Anuj Kumar Goyal, Bjørn Olstad
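
Illustrative sketch (not taken from the application): the abstract above pairs a forward index (document to key phrases) with an inverted index (key phrase to documents) and picks suggestions by coverage. The Python below assumes that "coverage" means the number of retrieved documents containing a phrase and that document retrieval has already happened; both assumptions, and the names build_indexes and suggest, are hypothetical.

```python
from collections import defaultdict


def build_indexes(doc_phrases):
    """Build a forward index (doc -> phrases) and an inverted index (phrase -> docs)."""
    forward = {doc: set(phrases) for doc, phrases in doc_phrases.items()}
    inverted = defaultdict(set)
    for doc, phrases in forward.items():
        for phrase in phrases:
            inverted[phrase].add(doc)
    return forward, inverted


def suggest(retrieved, forward, inverted, k=2):
    """Rank key phrases of the retrieved docs by how many retrieved docs they cover."""
    retrieved = set(retrieved)
    # Forward index: collect candidate phrases from the retrieved documents.
    candidates = set().union(*(forward.get(d, set()) for d in retrieved))
    # Inverted index: measure each candidate's coverage of the retrieved set.
    ranked = sorted(candidates, key=lambda p: (-len(inverted[p] & retrieved), p))
    return ranked[:k]


doc_phrases = {
    "d1": ["inverted index", "posting list"],
    "d2": ["inverted index", "bloom filter"],
    "d3": ["bloom filter", "hash function"],
}
forward, inverted = build_indexes(doc_phrases)
suggestions = suggest(["d1", "d2"], forward, inverted)
print(suggestions)
```
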
Managing deduplication of stored data

Patent number: 8463742

Abstract: Managing data in a data storage system includes: receiving data to be stored in the data storage system; computing values corresponding to different respective portions of the received data; generating identifiers corresponding to different respective portions of the received data, with an identifier corresponding to a particular portion of data including the computed value corresponding to the particular portion of data and metadata indicating a location where the particular portion of data is being stored in the data storage system; storing at least some of the identifiers in an index until the index reaches a predetermined size; and in response to determining that a first identifier corresponding to a first portion of data, received after the index reached the predetermined size, was not already stored in the index before the first portion of data was received, storing the first identifier in the index and designating for removal at least a second identifier corresponding to a second portion of data to be

Type: Grant

Filed: May 27, 2011

Date of Patent: June 11, 2013

Assignee: Permabit Technology Corp.

Inventors: Jered J. Floyd, Michael Fortson, Assar Westerlund, Jonathan Coburn
Image Searching

Publication number: 20130138636

Abstract: The present disclosure introduces a method and an apparatus for searching images. With respect to each image in an image searching database, respective labels of respective images are generated based on description information corresponding to the respective images. A corresponding relationship between the generated respective labels and the respective images is stored. Based on a received image searching request, description information corresponding to an image for search in the image searching request is obtained. Based on the description information of the image for search, the label of the image for search is generated. Based on the stored corresponding relationship between the respective labels and the respective images, one or more images corresponding to the label of the image for search are determined. The determined one or more images are sent to the client terminal that sends the image searching request.

Type: Application

Filed: November 21, 2012

Publication date: May 30, 2013

Applicant: Alibaba Group Holding Limited

Inventor: Alibaba Group Holding Limited
SYSTEM AND METHOD FOR DISTRIBUTED INDEX SEARCHING OF ELECTRONIC CONTENT

Publication number: 20130138660

Abstract: There are provided methods and systems for efficient search in a peer-to-peer network topology. In various embodiments, search methods and systems provide for response times and network traffic that are independent from the number of query terms, thereby producing constant run-time searches and bandwidth hits in a P2P network search implementation. By distributing inverse indexes between peers, and storing with each inverse index a Bloom filter populated with selected keywords, multi-term search and analysis can be conducted on one network node without requiring exchange of posting lists between various network nodes.

Type: Application

Filed: January 18, 2013

Publication date: May 30, 2013

Inventor: Wolf Garbe
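
Illustrative sketch (not taken from the application): the abstract above describes storing a Bloom filter of selected keywords alongside the index so a multi-term query can be answered on one node. The Python below is one possible reading, assuming each posting carries a small Bloom filter of its document's keywords; the filter size, hash construction, and all names are hypothetical, and a single dictionary stands in for the peers.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, size_bits=256, num_hashes=3):
        self.size = size_bits
        self.k = num_hashes
        self.bits = 0

    def _positions(self, item):
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        return all(self.bits >> pos & 1 for pos in self._positions(item))


def build_node_index(docs):
    """Map each term to postings of (doc_id, Bloom filter of that doc's keywords)."""
    index = {}
    for doc_id, keywords in docs.items():
        bloom = BloomFilter()
        for kw in keywords:
            bloom.add(kw)
        for kw in keywords:
            index.setdefault(kw, []).append((doc_id, bloom))
    return index


def search(index, terms):
    """Fetch postings for the first term only; test the other terms against each filter."""
    first, *rest = terms
    return [d for d, bloom in index.get(first, []) if all(t in bloom for t in rest)]


docs = {"a": {"peer", "index", "bloom"}, "b": {"peer", "search"}}
index = build_node_index(docs)
results = search(index, ["peer", "bloom"])
print(results)
```

Only the posting list for the first term is read, so no second posting list has to travel between nodes. Bloom filters can return false positives but never false negatives, so a result list may occasionally contain a document that lacks one of the later terms.
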
Calculating Valence of Expressions within Documents for Searching a Document Index

Publication number: 20130097174

Abstract: Tools and techniques related to calculating valence of expressions within documents. These tools may provide methods that include receiving input documents for processing, and extracting expressions from the documents for valence analysis, with scope relationships occurring between terms contained in the expressions. The methods may calculate valences of the expressions, based on the scope relationships between terms in the expressions.

Type: Application

Filed: December 3, 2012

Publication date: April 18, 2013

Applicant: MICROSOFT CORPORATION

Inventor: Microsoft Corporation
