Indexing (epo) Patents (Class 707/E17.083)

E Subclasses

Selection or weighting of terms for indexing (epo) (Class 707/E17.084)

Physical indexing structures (epo) (Class 707/E17.085)

SYSTEMS AND METHODS FOR PROVIDING A MICRODOCUMENT FRAMEWORK FOR STORAGE, RETRIEVAL, AND AGGREGATION

Publication number: 20110258177

Abstract: The present invention is directed towards systems and methods for providing a microdocument framework. The method and system include receiving a plurality of microdocuments and detecting content data for each of the plurality of microdocuments. The method and system further include indexing at least a portion of the plurality of microdocuments based on the detected content and performing a searching operation using the content data associated with the microdocument data to determine a microdocument set. Thereupon, the method and system perform at least one processing operation on the microdocument set.

Type: Application

Filed: April 19, 2010

Publication date: October 20, 2011

Applicant: YAHOO! INC.

Inventors: Su-Lin Wu, Wei-Cheng Lai, Timothy P. Daly, JR., William Robert Pentney

GENERATING SERVICE SPECIFICATION OF A SERVICE ORIENTED ARCHITECTURE (SOA) SOLUTION

Publication number: 20110252037

Abstract: A system and associated method for automatically generating a service specification of a Service Oriented Architecture (SOA) solution. A process model framework and a data model framework are received as inputs. Processes in the process model framework perform services of various complexity levels. Processes are decomposed into a respective set of atomic service processes in the lowest complexity level and data objects are extracted from the decomposed atomic service processes. The data objects are associated with data elements of the data model framework. The data model framework is extended and flexibility patterns are added for reusability of the service specification. The service specification of the SOA solution is generated as process interfaces represented with the data objects according to inputs from a user customizing aspects of the service specification, for either a desired service of the SOA solution or a desired process in the process model framework.

Type: Application

Filed: April 13, 2010

Publication date: October 13, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Faried Abrahams, Kerard R. Hogg, Kent R. Ramchand, Gandhi Sivakumar

SYSTEM AND METHOD FOR CREATING SEARCH INDEX ON CLOUD DATABASE

Publication number: 20110252018

Abstract: A method for creating a search index on cloud database is provided. The method enables providing inputs for creating multiple indexes on documents stored in the cloud database. One of the inputs may include a first value representing number of documents to be assigned a single index. The method further enables determining total number of documents stored in the cloud database which is represented by a second value. Further, the method enables estimating total number of indexes to be created based on first value and second value. The method further comprises executing a loop to create multiple indexes for a predetermined number of iterations which corresponds to the estimated value. Furthermore, the method comprises indexing documents for creating the multiple indexes. Finally, the method comprises merging the multiple indexes to create a single index which facilitates a user to search documents stored in the cloud database.
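The steps in this abstract (estimate the number of indexes from the two values, loop that many times, then merge) can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the in-memory inverted index, the whitespace tokenizer, and the names `build_partial_indexes` and `merge_indexes` are all assumptions made for the example.

```python
import math

def build_partial_indexes(documents, docs_per_index):
    """Build one small inverted index per batch of documents.

    documents: list of (doc_id, text) pairs (hypothetical input shape).
    docs_per_index: the "first value" from the abstract.
    """
    total = len(documents)                          # the "second value"
    n_indexes = math.ceil(total / docs_per_index)   # estimated number of indexes
    partials = []
    for i in range(n_indexes):                      # loop runs for the estimated count
        batch = documents[i * docs_per_index:(i + 1) * docs_per_index]
        index = {}
        for doc_id, text in batch:
            for term in set(text.lower().split()):
                index.setdefault(term, set()).add(doc_id)
        partials.append(index)
    return partials

def merge_indexes(partials):
    """Merge the partial indexes into a single searchable index."""
    merged = {}
    for index in partials:
        for term, doc_ids in index.items():
            merged.setdefault(term, set()).update(doc_ids)
    return merged

docs = [
    (1, "cloud database index"),
    (2, "search index"),
    (3, "cloud search"),
    (4, "merge step"),
    (5, "cloud merge"),
]
partials = build_partial_indexes(docs, docs_per_index=2)
merged = merge_indexes(partials)
```

With five documents and two documents per index, the loop creates three partial indexes before the merge.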

Type: Application

Filed: June 9, 2010

Publication date: October 13, 2011

Applicant: INFOSYS TECHNOLOGIES LIMITED

Inventors: Rajarshi Bhose, Kashyap Chimanlal Santoki, Subhadip Sarkar

SEARCH EXTENSIBILITY TO THIRD PARTY APPLICATIONS

Publication number: 20110252038

Abstract: At least certain embodiments of the present disclosure include a method to extend search capabilities to third party applications installed on a device. In one embodiment, records associated with a third party application are indexed in a process isolated from other third party applications installed on the device using a search plugin specific to the third party application. Furthermore, the indexed records can be searched in response to a user search query without invoking the third party application.

Type: Application

Filed: July 1, 2010

Publication date: October 13, 2011

Inventors: Edward T. Schmidt, Gordon J. Freedman, Benjamin S. Phipps, David Rahardia

STORAGE SYSTEM

Publication number: 20110246431

Abstract: The present invention relates to a storage system including a de-duplicate function and a full-text search function or the like, and reduces an amount of index information about full-text search to save storage resource. In this system, a storage apparatus includes a processing unit for de-duplicating a plurality of files having the same content regarding a file group of data inputted/outputted through a host apparatus. A full-text search processing server performs a full-text search processing to the file group and includes a processing unit for causing the full-text search processing to correspond to de-duplicate. An index information creation processing performed to a plurality of target files having the same content by the full-text search processing unit is inhibited according to a status of de-duplicate to the file group by the processing unit. Thereby, the amount of index information can be reduced.

Type: Application

Filed: June 13, 2011

Publication date: October 6, 2011

Inventor: Takayoshi IITSUKA

NAVIGATION SYSTEM WITH INDEXED TERM SEARCHING AND METHOD OF OPERATION THEREOF

Publication number: 20110246478

Abstract: A method of operation of a navigation system includes: preconstructing an inverted term index having a nested spatial index of at least one location; providing a search term and a search range for searching the inverted term index; locating the search term in the inverted term index and having the nested spatial index bounded by the search range; and retrieving a location record linked to the nested spatial index and associated with the search term and the search range for displaying on a device.
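One way to picture an inverted term index with a nested spatial index is a map from each term to a small grid of cells, each cell holding location records. The sketch below assumes a uniform latitude/longitude grid and a rectangular search range; the cell size, the record fields, and the function names are hypothetical and are not taken from the application.

```python
from collections import defaultdict

CELL = 1.0  # assumed grid cell size in degrees

def cell_of(lat, lon):
    """Map a coordinate to its grid cell (row, column)."""
    return (int(lat // CELL), int(lon // CELL))

def build_index(locations):
    """Preconstruct term -> {grid cell -> [location records]}."""
    index = defaultdict(lambda: defaultdict(list))
    for rec in locations:
        for term in rec["name"].lower().split():
            index[term][cell_of(rec["lat"], rec["lon"])].append(rec)
    return index

def search(index, term, min_lat, max_lat, min_lon, max_lon):
    """Locate the term, then keep only records inside the search range."""
    spatial = index.get(term.lower())
    if spatial is None:
        return []
    hits = []
    for (row, col), records in spatial.items():
        # Skip grid cells that lie entirely outside the search range.
        if row < min_lat // CELL or row > max_lat // CELL:
            continue
        if col < min_lon // CELL or col > max_lon // CELL:
            continue
        for rec in records:
            if min_lat <= rec["lat"] <= max_lat and min_lon <= rec["lon"] <= max_lon:
                hits.append(rec)
    return hits

locations = [
    {"name": "Blue Bottle Coffee", "lat": 37.78, "lon": -122.41},
    {"name": "Coffee Corner", "lat": 40.71, "lon": -74.0},
    {"name": "Blue Lagoon", "lat": 37.80, "lon": -122.27},
]
index = build_index(locations)
nearby = search(index, "coffee", 37.0, 38.0, -123.0, -122.0)
```

Because the spatial structure is nested under the term, the range check only touches cells that already contain the search term.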

Type: Application

Filed: March 31, 2010

Publication date: October 6, 2011

Applicant: TELENAV, INC.

Inventors: Kan Deng, Yueyu Lin, Yanyan Qin

System and Method for Matching Entities

Publication number: 20110238694

Abstract: Matching systems are provided that are configured to determine if a first entity received from a client device of a first user matches with at least one other entity of a plurality of entities indexed in an index in which each entity is associated with one or more index points. The system includes an application server adapted for communication with a matching engine and the client device. The matching engine is configured to index the first entity by associating the first entity with one or more index points in the index; and search for other entities matching the first entity among the plurality of entities indexed in the index by searching for other entities associated with at least one of the index points with which the first entity is associated.

Type: Application

Filed: December 2, 2008

Publication date: September 29, 2011

Inventors: Richard Carlsson, Olof Lundström, Gerardo Montero Arizmendi, Hjalmar Olsson

DOCUMENT MANAGEMENT SYSTEM

Publication number: 20110238668

Abstract: In a document management system that manages index item definition and document data by cabinet, an index can be easily provided. A user that can log into a first database can use an index item defined by the first database to provide an index value to document data stored in a second database.

Type: Application

Filed: March 16, 2011

Publication date: September 29, 2011

Applicant: CANON KABUSHIKI KAISHA

Inventor: Yoshitaka Matsumoto

Collective Acceleration Unit Tree Structure

Publication number: 20110238956

Abstract: A mechanism is provided in a collective acceleration unit for performing a collective operation to distribute or collect data among a plurality of participant nodes. The mechanism receives an input collective packet for a collective operation from a neighbor node within a collective tree. The input collective packet comprises a tree identifier and an input data field and wherein the collective tree comprises a plurality of sub trees. The mechanism maps the tree identifier to an index within the collective acceleration unit. The index identifies a portion of resources within the collective acceleration unit and is associated with a set of neighbor nodes in a given sub tree within the collective tree. For each neighbor node the collective acceleration unit stores destination information. The collective acceleration unit performs an operation on the input data field using the portion of resources to effect the collective operation.

Type: Application

Filed: March 29, 2010

Publication date: September 29, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lakshminarayana B. Arimilli, Bernard C. Drerup, Paul F. Lecocq, Hanhong Xue

Self-Similar Medical Communications System

Publication number: 20110225008

Abstract: A health care monitoring system and network for monitoring a patient's use of medications, physiological conditions, and/or the environment around the patient. The system takes advantage of a hierarchical nodal network to allow for the transfer of information from sensors to authorized users of the network while providing personal control of sensitive information and providing a distributed database structure for increased data security. In one embodiment, sensors transmit information to wearable personal data recorders that use a combination of random access and overwriting to store information in a time-sequenced organization.

Type: Application

Filed: March 9, 2010

Publication date: September 15, 2011

Applicant: RESPIRA DV, LLC

Inventors: Nabil A. Elkouh, Gregory S. Fallon, Robert Harwood, Matthew J. Miller

SYSTEM AND METHOD FOR GUIDING ENTITY-BASED SEARCHING

Publication number: 20110225155

Abstract: A system and method are provided for refining a user's query. An entity index, generated from a corpus of text documents, is provided. The entity index includes a set of entity structures, each including a plurality of terms. Each of the terms of an entity structure is a feature of the same entity. Entity structures can be retrieved from the entity index which match at least a portion of the user's query. Clusters of the retrieved entity structures are identified which have at least one of their terms in common. A cluster hierarchy is generated from the identified clusters in which nodes of the hierarchy are defined by one or more of the terms of the retrieved entity structures. At least a portion of the cluster hierarchy is presented to the user for facilitating refinement of the user's query through user selection of a node which, when formulated as a search, retrieves one or more responsive documents from the corpus of documents.

Type: Application

Filed: March 10, 2010

Publication date: September 15, 2011

Applicant: Xerox Corporation

Inventors: Frederic Roulland, Stefania Castellani, Antonietta Grasso, Caroline Brun

Information Search Method and System

Publication number: 20110218989

Abstract: The present disclosure provides an information search method and system applicable in an information search system wherein each document has corresponding forward index data to address the issue of low search efficiency suffered by existing information search techniques. In one aspect, the method may include: receiving an inquiry word and obtaining one or more keywords contained in the inquiry word by segmentation; searching one or more documents matching the one or more keywords and forward index data corresponding to the one or more documents through the information search system's inverted index data; and determining an abstract of each of the one or more documents according to a corresponding document's forward index data, and outputting the abstract and information of the one or more documents as a search result. The proposed techniques can increase efficiency of information search and, at the same time, guarantee accuracy of the search to a certain extent.

Type: Application

Filed: August 27, 2010

Publication date: September 8, 2011

Applicant: ALIBABA GROUP HOLDING LIMITED

Inventor: Yi Luo

METHOD AND APPARATUS FOR EFFICIENT INDEXED STORAGE FOR UNSTRUCTURED CONTENT

Publication number: 20110202540

Abstract: A method and apparatus for efficient indexed storage for unstructured content have been disclosed.

Type: Application

Filed: April 24, 2011

Publication date: August 18, 2011

Applicant: Nahava Inc.

Inventor: Russell T. Nakano

System and Method for Managing Multiple Domain Names for a Website in a Website Indexing System

Publication number: 20110179178

Abstract: When a website has a number of equivalent domain names including a preferred domain name, the locator for a document in the website can be rewritten using the preferred domain name before indexing the document, according to certain embodiments. According to certain embodiments, a user interface is provided to allow a user to specify the preferred domain name for a website for which the user is a verified owner.
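Rewriting a document locator to the preferred domain name before indexing might look like the sketch below. It uses Python's standard `urllib.parse`; the function name `canonicalize`, the set of equivalent domains, and the choice to preserve an explicit port are assumptions for illustration only.

```python
from urllib.parse import urlsplit, urlunsplit

def canonicalize(url, equivalent_domains, preferred):
    """Rewrite url to use the preferred domain if its host is an equivalent one."""
    parts = urlsplit(url)
    if parts.hostname in equivalent_domains:
        # Keep an explicit port if the original locator had one.
        netloc = preferred if parts.port is None else f"{preferred}:{parts.port}"
        parts = parts._replace(netloc=netloc)
    return urlunsplit(parts)

equivalents = {"example.com", "www.example.com"}
rewritten = canonicalize("http://example.com/a?b=1", equivalents, "www.example.com")
untouched = canonicalize("http://other.org/x", equivalents, "www.example.com")
```

Indexing the rewritten locator means all equivalent domain names collapse onto a single entry per document.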

Type: Application

Filed: March 28, 2011

Publication date: July 21, 2011

Inventors: Vanessa Fox, Matthew D. Cutts, Maxmilian Ibel, Michael E. Noth, David Michael Proudfoot, Andrey Yuryevich Stroilov

Method and apparatus to import unstructured content into a content management system

Publication number: 20110173153

Abstract: A content management system having a repository of information organized according to an index file, a method of importing unstructured content comprising an XML or other template of configurable import rules to enable retrieval of information components of the unstructured content; ascertaining at least one structural attribute of the unstructured content; enabling a user to configure import rules according to the ascertained structural attribute(s); accessing and examining information components of the unstructured content according to the attribute(s); optionally tagging information components of the unstructured content according to a value of the accessed and examined information components; importing information components of the unstructured content into a repository of the content management system according to indices of the index file; identifying a workflow task with respect to the information components of the imported content; and processing a workflow task of the content management system relati

Type: Application

Filed: January 8, 2010

Publication date: July 14, 2011

Inventors: Michael Domashchenko, Edward B. Heinz

INDEXING AND FILTERING USING COMPOSITE DATA STORES

Publication number: 20110167072

Abstract: Data stores may be combined into a composite data store. A method includes referencing a first index entry for a user specified first parameter pattern. The first index entry includes references to record addresses for records in the composite data store which include the first parameter pattern. A first beginning composite data store address of a first selected data store is referenced. A determination is made that the first beginning composite data store address is at or above a predetermined threshold above the first record address. Based on determining that the first beginning composite data store address is at or above a predetermined threshold above the first record address, a speed-up data structure is used to eliminate one or more comparisons of record entries in the first index entry between the first record address and the first beginning composite data store address.

Type: Application

Filed: March 21, 2011

Publication date: July 7, 2011

Applicant: Perfect Search Corporation

Inventor: Ronald P. Millett

LAZY EVALUATION OF SEMANTIC INDEXING

Publication number: 20110137910

Abstract: A method for searching a database of digital media assets, comprising: designating a database of digital media assets, wherein the database of digital media assets has been indexed according to a set of general indexers; receiving a search query; defining specialized search conditions by identifying one or more elements of the search query corresponding to one or more specialized indexers; defining general search conditions by identifying elements of the search query corresponding to the general indexers; identifying a subset of the digital media assets by applying the general search conditions; indexing the subset of the digital media assets using the identified specialized indexers; and ranking the subset of the digital media assets by applying the specialized search conditions.

Type: Application

Filed: December 8, 2009

Publication date: June 9, 2011

Inventors: Stacie L. Hibino, Mark D. Wood

SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR DOCUMENTS RETRIEVAL

Publication number: 20110137912

Abstract: The invention provides a system and method for retrieving documents from a collection of documents that match a word search query. A word index is generated for each document in which each entry is an enriched-term string built from the stemmed form of the word to be searched and a separator character followed by the original form of the word to be searched. During a retrieving operation, a search query is processed depending on the original form or the stemmed form of a word to be searched. Cross-documents tables are addressed to find documents that match the enriched-term string of the word to be searched.
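The enriched-term string described here (stemmed form, separator, original form) lets one table serve both exact-form and stem-based lookups. The sketch below is an illustration under assumptions: the separator character, the toy suffix-stripping stemmer, and the single dictionary standing in for the cross-documents tables are all hypothetical.

```python
SEP = "\x1f"  # assumed separator character, unlikely to occur in words

def toy_stem(word):
    """Very rough stand-in for a real stemmer: strip a few common suffixes."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[:-len(suffix)]
    return word

def enrich(word):
    """Build the enriched-term string: stem + separator + original form."""
    w = word.lower()
    return toy_stem(w) + SEP + w

def build_table(docs):
    """Map each enriched term to the set of documents containing it."""
    table = {}
    for doc_id, text in docs.items():
        for word in text.split():
            table.setdefault(enrich(word), set()).add(doc_id)
    return table

def search(table, word, exact):
    """exact=True matches the original form; exact=False matches any form of the stem."""
    if exact:
        return set(table.get(enrich(word), set()))
    prefix = toy_stem(word.lower()) + SEP
    hits = set()
    for key, ids in table.items():
        if key.startswith(prefix):
            hits |= ids
    return hits

docs = {1: "indexing documents", 2: "indexed document", 3: "index tables"}
table = build_table(docs)
exact_hits = search(table, "indexing", exact=True)
stem_hits = search(table, "indexing", exact=False)
```

Because the stem comes first in the key, a stem-based query reduces to a prefix match, while an exact-form query is a plain key lookup.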

Type: Application

Filed: October 5, 2010

Publication date: June 9, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Roberto Ragusa, Ciro Ragusa, Roberto Guarda

INDEXING DOCUMENTS

Publication number: 20110131212

Abstract: A document to be indexed is initially indexed in dependence upon language-specific rules of a single language. A success metric is used to assess the effectiveness of the single language indexing. If a threshold level of success is not attained, the document is identified as multi-lingual. In response to identifying the document as multi-lingual, the document is queued for multi-lingual indexing. A document may be fragmented into a number of smaller documents, each of which is indexed separately.
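The flow in this abstract (index with one language's rules, score the result, queue the document for multi-lingual indexing if the score is too low) can be sketched as below. The success metric used here, the fraction of tokens found in a small vocabulary, is only an assumed stand-in; the vocabulary, the threshold value, and the function names are hypothetical.

```python
from collections import deque

ENGLISH_VOCAB = {"the", "index", "document", "is", "a", "search"}  # hypothetical vocabulary
THRESHOLD = 0.6  # assumed minimum success level

multilingual_queue = deque()  # documents waiting for multi-lingual indexing

def index_single_language(text, vocab):
    """Index with one language's rules and return (terms, success metric)."""
    tokens = text.lower().split()
    recognized = [t for t in tokens if t in vocab]
    success = len(recognized) / len(tokens) if tokens else 1.0
    return recognized, success

def index_document(doc_id, text):
    """Return indexed terms, or None if the document was queued as multi-lingual."""
    terms, success = index_single_language(text, ENGLISH_VOCAB)
    if success < THRESHOLD:
        multilingual_queue.append(doc_id)
        return None
    return terms

english_terms = index_document("d1", "the index is a document")
mixed_terms = index_document("d2", "le document est un index")
```

The second document scores 2 of 5 tokens, falls below the threshold, and is queued rather than indexed.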

Type: Application

Filed: December 2, 2009

Publication date: June 2, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Deep Shikha

INFORMATION RETRIEVAL METHOD, COMPUTER READABLE MEDIUM AND INFORMATION RETRIEVAL APPARATUS

Publication number: 20110131214

Abstract: According to an aspect of the invention, a computer readable medium stores a program causing a computer to execute a process for retrieving information. The process includes an extracting process, an executing process, a first creating process, a second creating process, and a determining process. The extracting process extracts, from a first composition that is an object to be searched for and that includes first sentence elements and a second composition that indicates a retrieval condition and that includes second sentence elements, the first sentence elements, the second sentence elements, and sentence element relations indicating relations between the first sentence elements and relations between the second sentence elements.

Type: Application

Filed: April 26, 2010

Publication date: June 2, 2011

Applicant: FUJI XEROX CO., LTD.

Inventor: Hiroshi UMEMOTO

Database System and Method for Data Acquisition and Perusal

Publication number: 20110106813

Abstract: A data acquisition and perusal system and method including a database selection module, a database index generator module and a search module. The database selection module enables selection of a plurality of files for inclusion into at least one selectable database. The database index generator module enables generation of a searchable index of the data contained in the selectable database. The search module enables a search to be performed of the searchable index according to search criteria. The system allows for the capture of HTML data which is automatically indexed without human intervention and has the ability to automatically and accurately locate or “pinpoint,” and highlight specific text or groups of text designated by the user within the resulting database.

Type: Application

Filed: November 12, 2010

Publication date: May 5, 2011

Inventors: Robert Leland Jensen, Daniel Victor Smith

MEASURING WEB SITE SATISFACTION OF INFORMATION NEEDS

Publication number: 20110106799

Abstract: A method, system, and computer program product for measuring web site satisfaction of information needs are provided. The method includes: selecting a page for analysis; generating a page profile in the form of a list of keywords representing the page; generating a page traffic profile in the form of lists of keywords representing information needs of users, wherein the page traffic profile is generated from keywords used by users to visit the page; determining the success of users' visits to the page; and analyzing whether a page satisfies users' information needs by applying a distance measure between the keywords of the page profile and the keywords of the page traffic profile and combining the distance measure result with a success rate of the keywords.

Type: Application

Filed: January 10, 2011

Publication date: May 5, 2011

Applicant: International Business Machines Corporation

Inventors: Gilad Barkai, David Carmel, David Konopnicki, Haggai Roitman

LEGAL COMPLIANCE, ELECTRONIC DISCOVERY AND ELECTRONIC DOCUMENT HANDLING OF ONLINE AND OFFLINE COPIES OF DATA

Publication number: 20110093471

Abstract: Systems and methods of electronic document handling permit organizations to comply with legal or regulatory requirements, electronic discovery and legal hold requirements, and/or other business requirements. The systems described provide a unified approach to data management that enables compliance, legal and IT personnel to focus efforts on, e.g., a single data repository. The systems permit users to define and utilize information governance policies that help automate and systematize different compliance tasks. In some examples, organizations may push data in any third-party data format to the systems described herein. The systems may permit compliance or IT personnel to detect when a legally sensitive production file has been changed or deleted. The systems may also provide a unified dashboard user interface. From a dashboard interface, users may perform searches, participate in collaborative data management workflows, obtain data management reports, and adjust policies.

Type: Application

Filed: September 7, 2010

Publication date: April 21, 2011

Inventors: Brian Brockway, Alan Bunte, Christie J. Van Wagoner, Simon Taylor, Marcus S. Muller, Anand Prahlad, Randy DeMeno, Rammohan G. Reddy

SELF-INDEXING DATA STRUCTURE

Publication number: 20110093467

Abstract: A machine based tool and associated logic and methodology are used in converting data from an input form to a target form using context dependent conversion rules, and in efficiently generating an index that may be utilized to access the converted data in a database. Once the data has been converted, an index data structure for each data object may be automatically generated that encodes one or more characteristics or attributes of the converted data so that an entity may access the data using the index structure. As an example, the one or more characteristics may include categories, subcategories, or other attributes of the data.

Type: Application

Filed: October 16, 2009

Publication date: April 21, 2011

Applicant: SILVER CREEK SYSTEMS, INC.

Inventors: Alec Sharp, Luis Rivas, Mark Kreider

SYSTEM AND METHOD FOR PROVIDING WEB SEARCH RESULTS TO A PARTICULAR COMPUTER USER BASED ON THE POPULARITY OF THE SEARCH RESULTS WITH OTHER COMPUTER USERS

Publication number: 20110087647

Abstract: A system and method for providing Web search results to a particular computer user based on the popularity of the search results with other computer users is described. One embodiment monitors, using one or more servers, at least one Web service for new actions of sharing of Web content by computer users; identifies, from the new actions of sharing of Web content by computer users, a data item that satisfies predetermined interestingness criteria; parses the data item to obtain at least one Uniform Resource Locator (URL); crawls at least one Web page corresponding to the at least one URL to obtain the content of the at least one Web page; analyzes the content of the at least one Web page; and updates an index based on the content of the at least one Web page, the index being usable in processing a Web search query from a particular user.

Type: Application

Filed: October 13, 2009

Publication date: April 14, 2011

Inventors: Alessio Signorini, Ioannls Pavlids, Nathaniel Fisher, Scott Engstrom, Peter J. Newcomb, David L. Young, Ron Benson

Methods and Systems for Compressing Indices

Publication number: 20110066623

Abstract: Systems and methods for compressing indices are described. In one aspect, a plurality of items are selected where each item has an entry in an inverted index and each item entry comprises a listing of articles that the item appears in. At least a first item entry and a second item entry are determined for compression and the second item entry is compressed into the first item entry resulting in a compressed first item entry.

Type: Application

Filed: September 20, 2010

Publication date: March 17, 2011

Inventor: Adam J. Weissman

PRODUCT LINE EXTRACTION

Publication number: 20110066622

Abstract: Methods, systems and computer readable media for extracting product lines from a plurality of product titles are provided. In one embodiment, the plurality of product titles are broken into tokens. Association rules are calculated for individual tokens and pairs of tokens. Brand specific terms and product class specific terms within the product titles are identified. In one embodiment, a token tree is used to identify product lines within the list of product titles using the association rules, the brand specific terms, and the product class specific terms.

Type: Application

Filed: November 22, 2010

Publication date: March 17, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Nimish G. Dharawat, Meera Mahabala, Gitika Gupta

Locating and Retrieving Data Content Stored in a Compressed Digital Format

Publication number: 20110060743

Abstract: A method and apparatus is provided for locating and retrieving specified data content in a database. The data comprises compressed digital audio or video data files associated with the recorded speech. Retrieval of the specified content requires decompression of only a portion of the compressed data. A method for locating specified content of the above type is provided. A compressed audio file comprising recorded speech is converted into a corresponding text file. A searchable index is constructed from the text file. One or more specified search arguments are used to search respective elements of the searchable index in order to detect one or more text segments. The identifiers of respective detected segments are then used to locate the specified content in the audio file. Only portions of the audio file that contain specified content require decompression, in order to retrieve the content.

Type: Application

Filed: November 12, 2010

Publication date: March 10, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Oliver Keren Ban, Timothy Alan Dietz, Anthony Cappa Spielberg

Network cache, a user device, a computer program product and a method for managing files

Publication number: 20110047165

Abstract: A network cache (30) that includes multiple storage units (20) and multiple control units (10) that are coupled to multiple user devices (50) via a network (40), wherein the network cache is adapted to receive a file related request provided from a user device, and wherein the network cache is adapted to respond to the file related request by a selected control unit and by a selected storage unit, wherein the selected storage unit is selected in response to a file related request based on a file tag that is responsive to a content of the file, and wherein the selected control unit is selected in response to an identity of the user device.

Type: Application

Filed: July 16, 2007

Publication date: February 24, 2011

Inventors: Ofer Wald, Ayelet Wald

STREAMING QUERY SYSTEM AND METHOD FOR EXTENSIBLE MARKUP LANGUAGE

Publication number: 20110035398

Abstract: A streaming query system for extensible markup language is provided. An XPath query translator receives and analyzes a user-input XPath document. An abstract syntax tree analyzer establishes an abstract syntax tree. A XML parser receives and parses an XML document. An index generator generates an index for the XML document. A computation module performs a format calculation based on the abstract syntax tree and the index, and generates a query result accordingly.

Type: Application

Filed: July 23, 2010

Publication date: February 10, 2011

Applicant: NATIONAL TAIWAN UNIVERSITY OF SCIENCE & TECHNOLOGY

Inventors: Hahn-Ming Lee, Li-Zhen Liu, Chieh-Hung Lin, Jerome Yeh, Chia-Hsin Huang

SEARCHING AND ACCESSING DOCUMENTS ON PRIVATE NETWORKS FOR USE WITH CAPTURES FROM RENDERED DOCUMENTS

Publication number: 20110029504

Abstract: A facility for exposing an index of private documents is described. In a private network, the facility (1) identifies electronic versions of documents that are available inside the private network, including a distinguished document; (2) constructs an index covering the identified electronic versions of documents; and (3) exports the constructed index from the private network to an index publication server. At the index publication server, the facility (1) receives the exported index; (2) receives a query via a public network; and (3) uses an index, based upon the received index, to generate a query result for the received query that contains the distinguished document.

Type: Application

Filed: October 5, 2010

Publication date: February 3, 2011

Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushier, James Q. Stafford-Fraser

FILE SYSTEM

Publication number: 20110022566

Abstract: A digitally signed file system in which data, metadata and files are objects, each object having a globally unique and content-derived fingerprint and wherein object references are mapped by the fingerprints; the file system has a root object comprising a mapping of all object fingerprints in the file system, such that a change to the file system results in a change in the root object, and tracking changes in the root object provides a history of file system activity.
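A content-derived fingerprint scheme of this kind can be illustrated with a small in-memory object store whose root object changes whenever any file changes. This is a sketch under assumptions, not the applicant's design: SHA-256 as the fingerprint function, a JSON-encoded name-to-fingerprint map as the root object, and the class name `ObjectStore` are all choices made for the example.

```python
import hashlib
import json

class ObjectStore:
    """Toy store in which every object is addressed by a hash of its content."""

    def __init__(self):
        self.objects = {}        # fingerprint -> raw bytes
        self.names = {}          # file name -> fingerprint of its content
        self.root_history = []   # successive root fingerprints

    def put(self, data: bytes) -> str:
        """Store an object and return its content-derived fingerprint."""
        fingerprint = hashlib.sha256(data).hexdigest()
        self.objects[fingerprint] = data
        return fingerprint

    def write_file(self, name: str, data: bytes) -> str:
        """Write a file, then rebuild and record the root object."""
        self.names[name] = self.put(data)
        root = json.dumps(self.names, sort_keys=True).encode()
        root_fingerprint = self.put(root)
        self.root_history.append(root_fingerprint)
        return root_fingerprint

store = ObjectStore()
root1 = store.write_file("a.txt", b"hello")
root2 = store.write_file("a.txt", b"hello world")
root3 = store.write_file("a.txt", b"hello world")
```

Changing a file's content changes its fingerprint and therefore the root, so the list of root fingerprints acts as a history of file system activity; rewriting identical content reproduces the same root.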

Type: Application

Filed: June 25, 2010

Publication date: January 27, 2011

Applicant: SimpliVT Corporation

Inventors: Arthur J. Beaverson, Paul Bowden

PERFORMING DATA STORAGE OPERATIONS IN A CLOUD STORAGE ENVIRONMENT, INCLUDING SEARCHING, ENCRYPTION AND INDEXING

Publication number: 20100332479

Abstract: Systems and methods are disclosed for performing data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods are disclosed for content indexing data stored within a cloud environment to facilitate later searching, including collaborative searching. Methods are also disclosed for performing containerized deduplication to reduce the strain on a system namespace, effectuate cost savings, etc. Methods are disclosed for identifying suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy.

Type: Application

Filed: March 31, 2010

Publication date: December 30, 2010

Inventors: Anand Prahlad, Rajiv Kottomtharayil, Srinivas Kavuri, Parag Gokhale, Manoj Vijayan
System and method for computer-assisted manual and automatic logging of time-based media

Publication number: 20100312770

Abstract: A customizable logging and content management system for indexing multimedia, including a synchronized timer object that provides a time reference upon request in connection with the media, and a logger object that logs predefined events that occur in the media by associating the events with respective time references from the timer object. A video server is provided that captures and digitally stores events logged by the logging application as media segments, and a search and retrieval engine is provided that enables the media segments to be located, retrieved and viewed based on the indexes. The system includes a graphical user interface generator that enables customized user interfaces and logging databases to be created from database tables for use in the logging application.

Type: Application

Filed: July 6, 2010

Publication date: December 9, 2010

Applicant: Charles Smith Enterprises, LLC

Inventors: Charles Smith-Semedo, Rolando Blackman, Stephen Jacobs, Guerrino Lupetin, Rafael Cortina
Method and system for search engine indexing and searching using the index

Publication number: 20100287166

Abstract: Data indexing includes receiving data from a data source; classifying the data into one of a plurality of categories according to predetermined data classification criteria; establishing a corresponding relationship between the data and an index associated with the data, the index having a preset maximum capacity; and recording the relationship between the data and the index. The index is one of a plurality of indices, and each of the plurality of indices is exclusively written by an index writing device.

Type: Application

Filed: May 5, 2010

Publication date: November 11, 2010

Inventor: Hanfei Yang
INDEXING DOCUMENTS ACCORDING TO GEOGRAPHICAL RELEVANCE

Publication number: 20100250552

Abstract: A local search engine efficiently indexes documents relevant to a geographical area by indexing, for each document, multiple location identifiers that collectively define an aggregate geographic region. When creating the index, the search engine may determine a set of geographical areas surrounding a geographical area relevant to a document and associate references to the set of geographical areas with the document index.

Type: Application

Filed: June 15, 2010

Publication date: September 30, 2010

Applicant: GOOGLE INC.

Inventor: Daniel Egnor
SEMANTIC DOCUMENT ANALYSIS

Publication number: 20100228794

Abstract: A technique for dynamic integration and semantic analysis of structured data and unstructured textual data including: defining and selecting static attributes and dynamic attributes from structured data, embedding static and dynamic views of the selected corresponding attributes in an annotated document, linking the unstructured textual data with the structured data using the defined static and dynamic attributes, populating an annotated document structure of multiple annotated documents, performing semantic analysis of a query across the unstructured textual data and structured data, querying the annotated document structure to provide query results satisfying the static part of the query, processing static and dynamic parts of the query by querying structured data and the annotated document structure, as appropriate, and providing a combined query processing result satisfying the dynamic and static parts of the query. Other embodiments are also disclosed.

Type: Application

Filed: February 25, 2009

Publication date: September 9, 2010

Applicant: International Business Machines Corporation

Inventors: Sourashis Roy, Himanshu Gupta, Hiroki Oya, Mukesh Kumar Mohania, Inagaki Iwao
System and method for enhanced text matching

Patent number: 7783660

Abstract: The disclosure describes search systems and methods in which exact token searches, spelling suggestions, and split-token searches are used in conjunction to return search results to the user. Depending on the number and relevancy of the results returned for the search query by each of the steps, the results are either merged into the final result set or discarded. The split-token search is adapted to generate two split-tokens from the token(s) of the search query in anticipation that the search token(s) is misspelled. As the location of the misspelling is unknown, the split-token search widens the scope of the results provided in response to the search. In an embodiment, the split-token search includes performing a prefix search for tokens matching a prefix split-token and a postfix search for tokens matching a postfix split-token. In an embodiment, the index is specially adapted to allow the postfix search to be performed more efficiently.

Type: Grant

Filed: October 5, 2006

Date of Patent: August 24, 2010

Assignee: Yahoo! Inc.

Inventors: Jagadeshwar R. Nomula, Christa Stelzmuller
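
Illustration for the enhanced text matching entry above (patent 7783660). The abstract describes falling back from an exact token search to a split-token search made of a prefix search and a postfix search. The sketch below shows one way this could look, under stated assumptions: the token is split at its midpoint, the postfix search is served by a second sorted list of reversed tokens, and the fallback happens only when there is no exact match. The class TokenIndex and all method names are hypothetical, and the spelling-suggestion and relevancy steps from the abstract are omitted.

```python
import bisect


class TokenIndex:
    """Hypothetical index supporting exact, prefix, and postfix lookups."""

    def __init__(self, tokens):
        unique = set(tokens)
        self.tokens = unique
        self.forward = sorted(unique)
        # Assumed adaptation: reversed tokens turn a postfix search
        # into a prefix search over a sorted list.
        self.reverse = sorted(t[::-1] for t in unique)

    @staticmethod
    def _prefix_range(sorted_list, prefix):
        # Binary-search to the first candidate, then scan while it matches.
        start = bisect.bisect_left(sorted_list, prefix)
        hits = []
        for item in sorted_list[start:]:
            if not item.startswith(prefix):
                break
            hits.append(item)
        return hits

    def prefix_search(self, prefix):
        return self._prefix_range(self.forward, prefix)

    def postfix_search(self, postfix):
        reversed_hits = self._prefix_range(self.reverse, postfix[::-1])
        return [t[::-1] for t in reversed_hits]

    def search(self, token):
        if token in self.tokens:  # exact token search
            return [token]
        mid = len(token) // 2     # midpoint split is an assumption
        prefix, postfix = token[:mid], token[mid:]
        merged = set(self.prefix_search(prefix)) | set(self.postfix_search(postfix))
        return sorted(merged)


index = TokenIndex(["search", "research", "seattle", "march", "seashell"])
exact_hits = index.search("search")
fuzzy_hits = index.search("seerch")  # misspelled query
```

For the misspelled query, the prefix split-token "see" matches nothing, while the postfix split-token "rch" recovers "search" along with other tokens that share the ending, which is the widening of scope the abstract mentions.
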
Multiple-column bidirectional index

Publication number: 20100191750

Abstract: The present invention is a multiple-column index comprised of at least three columns. The columns are repetitious loops of identical content displayed over several pages, wherein the relationship between the columns is consistent. At least two columns have listings in the order of “A” to “Z” beginning at different points of the alphabet. At least one column has listings in “Z” to “A” order. If a search result is organized on numerous pages, the invention maximizes the potential for discoverability of businesses listed beyond the first page.

Type: Application

Filed: January 26, 2010

Publication date: July 29, 2010

Inventor: A. Leon White
METHOD FOR ESTABLISHING A CONTROLLED DATA TRANSFER CONNECTION BETWEEN TWO SYSTEMS

Publication number: 20100191590

Abstract: Method for creating a controlled data transfer connection between a remote device and a subscriber terminal by a transmission system. The first party of the interconnection (the remote device) creates a connection to the transmission system, which verifies the authentication information supplied by the remote device and allocates a unique identifier (ID) for the remote device, by which the remote device can be addressed in the transmission system. The other party of the interconnection (the subscriber terminal) requests the transmission system to transmit the request to the remote device, identified by the identifier. The transmission system transmits this request to the remote device, which processes the request and sends the response via the transmission system to the subscriber terminal. This response can be converted in the transmission system to a form suitable for the subscriber terminal, and subscriber-targeted advertisements, or other data, may be added in the response.

Type: Application

Filed: May 26, 2008

Publication date: July 29, 2010

Applicant: HUUKED LABS OY

Inventors: Harri Hakkarainen, Juha Utriainen
Complex Queries for Corpus Indexing and Search

Publication number: 20100161639

Abstract: Computer methods, apparatus and articles of manufacture therefor, are disclosed for developing a complex-query pattern that is transformed into a region-matching transducer. A corpus-level transducer and the region matching transducer are combined. The combined transducer is applied to a corpus to identify strings therein that satisfy patterns defined in the corpus-level transducer, including the complex-query pattern, with each identified pattern being recorded in a corpus index. The corpus and the corpus index are made available for receiving a query with the query tag for querying the corpus and applying the query using the corpus index to identify locations in the corpus that satisfy the query.

Type: Application

Filed: December 18, 2008

Publication date: June 24, 2010

Applicant: Palo Alto Research Center Incorporated

Inventors: Daniel G. Bobrow, Robert D. Cheslow
METHOD AND DEVICE FOR SEARCHING CONTENTS

Publication number: 20100145948

Abstract: Disclosed are a method and a device for searching contents by using time information or spatial information. The device for contents search includes a memory unit configured to store contents having spatial information and time information as search information and to further store groups into which the contents are classified by the spatial information or the time information. The device further includes a display unit configured to display a time information search tool and a spatial information search tool in response to receipt of a request for a contents search, and to further display the contents belonging to a searched group. The device also includes an input unit configured to receive an input of search information and a control unit configured to search a group having the selected search information.

Type: Application

Filed: December 9, 2009

Publication date: June 10, 2010

Applicant: Samsung Electronics Co., Ltd.

Inventors: Gyung Hye Yang, Jin Young Jeon, Sang Woong Hwang, Ji Young Kwahk, Jee Young Her, Ji Sun Yang
IDENTIFYING INADEQUATE SEARCH CONTENT

Publication number: 20100138421

Abstract: Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.

Type: Application

Filed: February 3, 2010

Publication date: June 3, 2010

Applicant: GOOGLE INC.

Inventors: Jeffrey David Oldham, Hal R. Varian, Matthew D. Cutts, Matt Rosencrantz
Search and indexing on a user device

Patent number: 7716224

Abstract: Search may be performed on a user device, such as a handheld electronic book reader device. A search query term may be received. Text of a collection of electronic items stored in memory of the user device may be searched for the queried term. Search results may be returned identifying locations in the electronic items at which the queried term appears.

Type: Grant

Filed: June 14, 2007

Date of Patent: May 11, 2010

Assignee: Amazon Technologies, Inc.

Inventors: James R. Reztlaff, II, John Lattyak
SYSTEM AND METHOD FOR ENHANCING KEYWORD RELEVANCE BY USER'S INTEREST ON THE SEARCH RESULT DOCUMENTS

Publication number: 20100106727

Abstract: A system and method are provided for enabling a user to search for documents that the user has previously viewed on its local machine. The system includes three main components: the desktop integration module, the index module, and the graphical user interface module. The desktop integration module is an application which monitors documents with which the user interacts for predetermined events, and obtains content data and metadata from the monitored documents. The index module indexes the content data and metadata received from the desktop integration module. The graphical user interface module then permits a user to utilize the desktop integration module and index module by allowing a user to search for a document.

Type: Application

Filed: September 25, 2009

Publication date: April 29, 2010

Applicant: IBM Corporation

Inventors: Tolga Oral, David L. Newbold, Michael Bolin, Raudel S. Rodriguez
Method and system for classifying text

Publication number: 20100094875

Abstract: A content classification system, method and computer product is presented. In connection with the invention, a data structure is created by identifying a plurality of words and mapping each word to one or more categories. The data structure is indexed. An item of content is identified and classified based on the data structure. The classification includes identifying all one-or-more-word combinations in the item of content; for each word of at least a pre-determined number of characters in length in each of the word combinations, identifying each of the categories to which it is mapped; and determining a weight for each of the words based on an inverse proportion to the number of categories to which it is mapped.

Type: Application

Filed: August 11, 2009

Publication date: April 15, 2010

Applicant: Collective Media, Inc.

Inventors: Paul Harrison, James Oliphant, Hal Fulton, Armin Roehrl, Brenden Grace
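
Illustration for the text classification entry above (publication 20100094875). The abstract weights each word in inverse proportion to the number of categories it maps to. The sketch below shows that weighting under several assumptions: the weight is exactly 1 divided by the category count, category scores are formed by summing word weights, only single words are considered rather than multi-word combinations, and the minimum word length defaults to 3. The function name classify and the sample mapping are hypothetical.

```python
from collections import defaultdict


def classify(text, word_to_categories, min_len=3):
    """Score categories for a text; an ambiguous word contributes less."""
    scores = defaultdict(float)
    for word in text.lower().split():
        if len(word) < min_len:
            continue  # skip words shorter than the assumed minimum length
        categories = word_to_categories.get(word)
        if not categories:
            continue  # word is not in the data structure
        weight = 1.0 / len(categories)  # inverse proportion (assumed form)
        for category in categories:
            scores[category] += weight
    return dict(scores)


# Hypothetical word-to-category mapping for illustration.
mapping = {
    "jaguar": {"autos", "animals"},
    "engine": {"autos"},
    "habitat": {"animals"},
}
scores = classify("Jaguar engine repair", mapping)
```

Here "jaguar" maps to two categories and so adds 0.5 to each, while "engine" maps to one and adds a full 1.0 to "autos", which makes "autos" the stronger classification.
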
INFORMATION SEARCH SYSTEM, METHOD, AND PROGRAM

Publication number: 20100088318

Abstract: Disclosed is a system in which an index registration unit registers an index, which will be used for search processing, as a partitioned index which is partitioned on a time series basis, and a search means reads indexes older than a specified point in time, which is used as a search base point, to perform search processing, thereby searching for information based on a point in time in the past.

Type: Application

Filed: October 2, 2007

Publication date: April 8, 2010

Inventors: Masaki Kan, Yoshihiro Kajiki, Satoshi Yamakawa, Takashi Torii, Yuji Kaneko
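
Illustration for the information search entry above (publication 20100088318). The abstract describes an index partitioned on a time-series basis, with searches reading only the partitions older than a chosen base point. The sketch below is one possible reading, not the disclosed system: partitions are keyed by ISO-style period strings such as "2007-09" (which compare correctly as text), the base point is treated as inclusive, and the class TimePartitionedIndex and its methods are hypothetical names.

```python
from collections import defaultdict


class TimePartitionedIndex:
    """Hypothetical inverted index split into one partition per period."""

    def __init__(self):
        # period -> term -> set of document ids
        self.partitions = defaultdict(lambda: defaultdict(set))

    def register(self, period, doc_id, text):
        # Each document is indexed only into the partition for its period.
        for term in text.lower().split():
            self.partitions[period][term].add(doc_id)

    def search_as_of(self, term, base_point):
        # Read only partitions at or before the base point (inclusive
        # comparison is an assumption).
        hits = set()
        for period, postings in self.partitions.items():
            if period <= base_point:
                hits |= postings.get(term, set())
        return sorted(hits)


index = TimePartitionedIndex()
index.register("2007-08", "doc1", "index partition design")
index.register("2007-09", "doc2", "partition pruning")
index.register("2007-10", "doc3", "partition merge")
past_hits = index.search_as_of("partition", "2007-09")
```

Searching as of "2007-09" ignores the later partition entirely, so the result reflects what the index would have returned at that point in the past.
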
Method and Apparatus for Efficient Indexed Storage for Unstructured Content

Publication number: 20100076981

Abstract: A method and apparatus for efficient indexed storage for unstructured content have been disclosed.

Type: Application

Filed: November 30, 2009

Publication date: March 25, 2010

Applicant: Nahava Inc.

Inventor: Russell T. Nakano
Customized, personalized, integrated client-side search indexing of the web

Patent number: 7660787

Abstract: A client-side search indexing program works transparently and in conjunction with a server based search index. The combined search indexes provide a more accurate and up-to-date image of the Web, customized to the interests of each individual user. The client-side indexer customizes indexing of particular Web pages to the preferences and usage patterns of the user. The user initially installs and configures the client-side indexer on the client. The requested indexes are automatically refreshed and integrated with the main server-side indexes during a search. When the user performs a search, the client-side indexes may be combined with the main server-side index. The combined indexes provide accurate search results for the particular user.

Type: Grant

Filed: July 19, 2006

Date of Patent: February 9, 2010

Assignee: International Business Machines Corporation

Inventors: David Joseph Borrillo, Ryan Kirk Cradick, Zachary Adam Garbow
SYSTEM AND METHOD FOR GENERATING SUBPHRASE QUERIES

Publication number: 20090198671

Abstract: A system for generating subphrase queries. The system includes a sequence label modeling engine and a regression modeling engine. The sequence label modeling engine generates a plurality of subphrase queries by indexing through each token in a search phrase and labeling each token based on an association to other tokens in the search phrase. The regression modeling engine scores each subphrase query at least partially on the association according to a scoring model. The regression modeling engine identifies the subphrase query with the highest score which may then be used for identifying a sponsored search list or a web search item.

Type: Application

Filed: February 5, 2008

Publication date: August 6, 2009

Applicant: Yahoo! Inc.

Inventors: Ruofei Zhang, Haibin Cheng, Yefei Peng, Benjamin Rey, Jianchang Mao