Indexing (epo) Patents (Class 707/E17.083)
  • Publication number: 20110258177
    Abstract: The present invention is directed towards systems and methods for providing a microdocument framework. The method and system includes receiving a plurality microdocuments and detecting content data for each of the plurality of microdocuments. The method and system further includes indexing at least a portion of the plurality of microdocuments based on the detected content and performing a searching operation using the content data associated with the microdocument data to determine a microdocument set. Thereupon, the method and system performs at least processing one operation on the microdocument set.
    Type: Application
    Filed: April 19, 2010
    Publication date: October 20, 2011
    Applicant: YAHOO! INC.
    Inventors: Su-Lin Wu, Wei-Cheng Lai, Timothy P. Daly, JR., William Robert Pentney
  • Publication number: 20110252037
    Abstract: A system and associated method for automatically generating a service specification of a Service Oriented Architecture (SOA) solution. A process model framework and a data model framework are received as inputs. Processes in the process model framework perform services of various complexity levels. Processes are decomposed into a respective set of atomic service processes in the lowest complexity level and data objects are extracted from the decomposed atomic service processes. The data objects are associated with data elements of the data model framework. The data model framework is extended and flexibility patterns are added for reusability of the service specification. The service specification of the SOA solution is generated as process interfaces represented with the data objects according to inputs from a user customizing aspects of the service specification, for either a desired service of the SOA solution or a desired process in the process model framework.
    Type: Application
    Filed: April 13, 2010
    Publication date: October 13, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Faried Abrahams, Kerard R. Hogg, Kent R. Ramchand, Gandhi Sivakumar
  • Publication number: 20110252018
    Abstract: A method for creating a search index on cloud database is provided. The method enables providing inputs for creating multiple indexes on documents stored in the cloud database. One of the inputs may include a first value representing number of documents to be assigned a single index. The method further enables determining total number of documents stored in the cloud database which is represented by a second value. Further, the method enables estimating total number of indexes to be created based on first value and second value. The method further comprises executing a loop to create multiple indexes for a predetermined number of iterations which corresponds to the estimated value. Furthermore, the method comprises indexing documents for creating the multiple indexes. Finally, the method comprises merging the multiple indexes to create a single index which facilitates a user to search documents stored in the cloud database.
    Type: Application
    Filed: June 9, 2010
    Publication date: October 13, 2011
    Applicant: INFOSYS TECHNOLOGIES LIMITED
    Inventors: Rajarshi Bhose, Kashyap Chimanlal Santoki, Subhadip Sarkar
  • Publication number: 20110252038
    Abstract: At least certain embodiments of the present disclosure include a method to extend search capabilities to third party applications installed on a device. In one embodiment, records associated with a third party application are indexed in a process isolated from other third party applications installed on the device using a search plugin specific to the third party application. Furthermore, the indexed records can be searched in response to a user search query without invoking the third party application.
    Type: Application
    Filed: July 1, 2010
    Publication date: October 13, 2011
    Inventors: Edward T. Schmidt, Gordon J. Freedman, Benjamin S. Phipps, David Rahardia
  • Publication number: 20110246431
    Abstract: The present invention relates to a storage system including a de-duplicate function and a full-text search function or the like, and reduces an amount of index information about full-test search to save storage resource. In this system, a storage apparatus includes a processing unit for de-duplicating a plurality of files having the same content regarding a file group of data inputted/outputted through a host apparatus. A full-text search processing server performs a full-text search processing to the file group and includes a processing unit for causing the full-text search processing to correspond to de-duplicate. An index information creation processing performed to a plurality of target files having the same content by the full-text search processing unit is inhibited according to a status of de-duplicate to the file group by the processing unit. Thereby, the amount of index information can be reduced.
    Type: Application
    Filed: June 13, 2011
    Publication date: October 6, 2011
    Inventor: Takayoshi IITSUKA
  • Publication number: 20110246478
    Abstract: A method of operation of a navigation system includes: preconstructing an inverted term index having a nested spatial index of at least one location; providing a search term and a search range for searching the inverted term index; locating the search term in the inverted term index and having the nested spatial index bounded by the search range; and retrieving a location record linked to the nested spatial index and associated with the search term and the search range for displaying on a device.
    Type: Application
    Filed: March 31, 2010
    Publication date: October 6, 2011
    Applicant: TELENAV, INC.
    Inventors: Kan Deng, Yueyu Lin, Yanyan Qin
  • Publication number: 20110238694
    Abstract: Matching systems are provided that are configured to determine if a first entity received from a client device of a first user matches with at least one other entity of a plurality of entities indexed in an index in which each entity is associated with one or more index points. The system includes an application server adapted for communication with a matching engine and the client device. The matching engine is configured to index the first entity by associating the first entity with one or more index points in the index; and search for other entities matching the first entity among the plurality of entities indexed in the index by searching for other entities associated with at least one of the index points with which the first entity is associated.
    Type: Application
    Filed: December 2, 2008
    Publication date: September 29, 2011
    Inventors: Richard Carlsson, Olof Lundström, Gerardo Montero Arizmendi, Hjalmar Olsson
  • Publication number: 20110238668
    Abstract: In a document management system that manages index item definition and document data by cabinet, an index can be easily provided. A user that can log into a first database can use an index item defined by the first database to provide an index value to document data stored in a second database.
    Type: Application
    Filed: March 16, 2011
    Publication date: September 29, 2011
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Yoshitaka Matsumoto
  • Publication number: 20110238956
    Abstract: A mechanism is provided in a collective acceleration unit for performing a collective operation to distribute or collect data among a plurality of participant nodes. The mechanism receives an input collective packet for a collective operation from a neighbor node within a collective tree. The input collective packet comprises a tree identifier and an input data field and wherein the collective tree comprises a plurality of sub trees. The mechanism maps the tree identifier to an index within the collective acceleration unit. The index identifies a portion of resources within the collective acceleration unit and is associated with a set of neighbor nodes in a given sub tree within the collective tree. For each neighbor node the collective acceleration unit stores destination information. The collective acceleration unit performs an operation on the input data field using the portion of resources to effect the collective operation.
    Type: Application
    Filed: March 29, 2010
    Publication date: September 29, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lakshminarayana B. Arimilli, Bernard C. Drerup, Paul F. Lecocq, Hanhong Xue
  • Publication number: 20110225008
    Abstract: A health care monitoring system and network for monitoring a patient's use of medications, physiological conditions, and/or the environment around the patient. The system takes advantage of a hierarchical nodal network to allow for the transfer of information from sensors to authorized users of the network while providing personal control of sensitive information and providing a distributed database structure for increased data security. In one embodiment, sensors transmit information to wearable personal data recorders that use a combination of random access and overwriting to store information in a time-sequenced organization.
    Type: Application
    Filed: March 9, 2010
    Publication date: September 15, 2011
    Applicant: RESPIRA DV, LLC
    Inventors: Nabil A. Elkouh, Gregory S. Fallon, Robert Harwood, Matthew J. Miller
  • Publication number: 20110225155
    Abstract: A system and method are provided for refining a user's query. An entity index, generated from a corpus of text documents, is provided. The entity index includes a set of entity structures, each including a plurality of terms. Each of the terms of an entity structure is a feature of the same entity. Entity structures can be retrieved from the entity index which match at least a portion of the user's query. Clusters of the retrieved entity structures are identified which have at least one of their terms in common. A cluster hierarchy is generated from the identified clusters in which nodes of the hierarchy are defined by one or more of the terms of the retrieved entity structures. At least a portion of the cluster hierarchy is presented to the user for facilitating refinement of the user's query through user selection of a node which, when formulated as a search, retrieves one or more responsive documents from the corpus of documents.
    Type: Application
    Filed: March 10, 2010
    Publication date: September 15, 2011
    Applicant: Xerox Corporation
    Inventors: Frederic Roulland, Stefania Castellani, Antonietta Grasso, Caroline Brun
  • Publication number: 20110218989
    Abstract: The present disclosure provides an information search method and system applicable in an information search system wherein each document has corresponding forward index data to address the issue of low search efficiency suffered by existing information search techniques. In one aspect, the method may include: receiving an inquiry word and obtaining one or more keywords contained in the inquiry word by segmentation; searching one or more documents matching the one or more keywords and forward index data corresponding to the one or more documents through the information search system's inverted index data; and determining an abstract of each of the one or more documents according to a corresponding document's forward index data, and outputting the abstract and information of the one or more documents as a search result. The proposed techniques can increase efficiency of information search and, at the meantime, guarantee accuracy of the search to a certain extent.
    Type: Application
    Filed: August 27, 2010
    Publication date: September 8, 2011
    Applicant: ALIBABA GROUP HOLDING LIMITED
    Inventor: Yi Luo
  • Publication number: 20110202540
    Abstract: A method and apparatus for efficient indexed storage for unstructured content have been disclosed.
    Type: Application
    Filed: April 24, 2011
    Publication date: August 18, 2011
    Applicant: Nahava Inc.
    Inventor: Russell T. Nakano
  • Publication number: 20110179178
    Abstract: When a website has a number of equivalent domain names including a preferred domain name, the locator for a document in the website can be rewritten using the preferred domain name before indexing the document, according to certain embodiments. According to certain embodiments, a user interface is provided to allow a user to specify the preferred domain name for a website for which the user is a verified owner.
    Type: Application
    Filed: March 28, 2011
    Publication date: July 21, 2011
    Inventors: Vanessa Fox, Matthew D. Cutts, Maxmilian Ibel, Michael E. Noth, David Michael Proudfoot, Andrey Yuryevich Stroilov
  • Publication number: 20110173153
    Abstract: A content management system having a repository of information organized according to an index file, a method of importing unstructured content comprising an XML or other template of configurable import rules to enable retrieval of information components of the unstructured content; ascertaining at least one structural attribute of the unstructured content; enabling a user to configure import rules according to the ascertained structural attribute(s); accessing and examining information components of the unstructured content according to the attribute(s); optionally tagging information components of the unstructured content according to a value of the accessed and examined information components; importing information components of the unstructured content into a repository of the content management system according to indices of the index file; identifying a workflow task with respect to the information components of the imported content; and processing a workflow task of the content management system relati
    Type: Application
    Filed: January 8, 2010
    Publication date: July 14, 2011
    Inventors: Michael Domashchenko, Edward B. Heinz
  • Publication number: 20110167072
    Abstract: Data stores may be combined into a composite data store. A method includes referencing a first index entry for a user specified first parameter pattern. The first index entry includes references to record addresses for records in the composite data store which include the first parameter pattern. A first beginning composite data store address of a first selected data store is referenced. A determination is made that the first beginning composite data store address is at or above an address at or above a predetermined threshold above the first record address. Based on determining that the first beginning composite data store address is at or above a predetermined threshold above the first record address, a speed-up data structure is used to eliminate one or more comparisons of record entries in the first index entry between the first record address and the first beginning composite data store address.
    Type: Application
    Filed: March 21, 2011
    Publication date: July 7, 2011
    Applicant: Perfect Search Corporation
    Inventor: Ronald P. Millett
  • Publication number: 20110137910
    Abstract: A method for searching a database of digital media assets, comprising: designating a database of digital media assets, wherein the database of digital media assets has been indexed according to a set of general indexers; receiving a search query; defining specialized search conditions by identifying one or more elements of the search query corresponding to one or more specialized indexers; defining general search conditions by identifying elements of the search query corresponding to the general indexers; identifying a subset of the digital media assets by applying the general search conditions; indexing the subset of the digital media assets using the identified specialized indexers; and ranking the subset of the digital media assets by applying the specialized search conditions.
    Type: Application
    Filed: December 8, 2009
    Publication date: June 9, 2011
    Inventors: Stacie L. Hibino, Mark D. Wood
  • Publication number: 20110137912
    Abstract: The invention provides a system and method for retrieving documents from a collection of documents that match a word search query. A word index is generated for each document in which each entry is an enriched-term string built from the stemmed form of the word to be searched and a separator character followed by the original form of the word to be searched. During a retrieving operation, a search query is processed depending the original form or the stemmed form of a word to be searched. Cross-documents tables are addressed to find documents that match the enriched-term string of the word to be searched.
    Type: Application
    Filed: October 5, 2010
    Publication date: June 9, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Roberto Ragusa, Ciro Ragusa, Roberto Guarda
  • Publication number: 20110131212
    Abstract: A document to be indexed is initially indexed in dependence upon language-specific rules of a single language. A success metric is used to assess the effectiveness of the single language indexing. If a threshold level of success is not attained, the document is identified as multi-lingual. In response to identifying the document as multi-lingual, the document is queued for multi-lingual indexing. A document may be fragmented into a number of smaller documents, each of which is indexed separately.
    Type: Application
    Filed: December 2, 2009
    Publication date: June 2, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Deep Shikha
  • Publication number: 20110131214
    Abstract: According to an aspect of the invention, a computer readable medium stores a program causing a computer to execute a process for retrieving information. The process includes an extracting process, an executing process, a first creating process, a second creating process, a determining process. The extracting process extracts, from a first composition that is an object to be searched for and that includes first sentence elements and a second composition that indicates a retrieval condition and that includes second sentence elements, the first sentence elements, the second sentence elements, and sentence element relations indicating relations between the first sentence elements and relations between the second sentence elements.
    Type: Application
    Filed: April 26, 2010
    Publication date: June 2, 2011
    Applicant: FUJI XEROX CO., LTD.
    Inventor: Hiroshi UMEMOTO
  • Publication number: 20110106813
    Abstract: A data acquisition and perusal system and method including a database selection module, a database index generator module and a search module. The database selection module enables selection of a plurality of files for inclusion into at least one selectable database. The database index generator module enables generation of a searchable index of the data contained in the selectable database. The search module enables a search to be performed of the searchable index according to search criteria. The system allows for the capture of HTML data which is automatically indexed without human intervention and has the ability to automatically and accurately locate or “pinpoint,” and highlight specific text or groups of text designated by the user within the resulting database.
    Type: Application
    Filed: November 12, 2010
    Publication date: May 5, 2011
    Inventors: Robert Leland Jensen, Daniel Victor Smith
  • Publication number: 20110106799
    Abstract: A method, system, and computer program product for measuring web site satisfaction of information needs are provided. The method includes: selecting a page for analysis; generating a page profile in the form of a list of keywords representing the page; generating a page traffic profile in the form of lists of keywords representing information needs of users, wherein the page traffic profile is generated from keywords used by users to visit the page; determining the success of users' visits to the page; and analyzing whether a page satisfies users' information needs by applying a distance measure between the keywords of the page profile and the keywords of the page traffic profile and combining the distance measure result with a success rate of the keywords.
    Type: Application
    Filed: January 10, 2011
    Publication date: May 5, 2011
    Applicant: International Business Machines Corporation
    Inventors: Gilad Barkai, David Carmel, David Konopnicki, Haggai Roitman
  • Publication number: 20110093471
    Abstract: Systems and methods of electronic document handling permit organizations to comply with legal or regulatory requirements, electronic discovery and legal hold requirements, and/or other business requirements. The systems described provide a unified approach to data management that enables compliance, legal and IT personnel to focus efforts on, e.g., a single data repository. The systems permit users to define and utilize information governance policies that help automate and systematize different compliance tasks. In some examples, organizations may push data in any third-party data format to the systems described herein. The systems may permit compliance or IT personnel to detect when a legally sensitive production file has been changed or deleted. The systems may also provide a unified dashboard user interface. From a dashboard interface, users may perform searches, participate in collaborative data management workflows, obtain data management reports, and adjust policies.
    Type: Application
    Filed: September 7, 2010
    Publication date: April 21, 2011
    Inventors: Brian Brockway, Alan Bunte, Christie J. Van Wagoner, Simon Taylor, Marcus S. Muller, Anand Prahlad, Randy DeMeno, Rammohan G. Reddy
  • Publication number: 20110093467
    Abstract: A machine based tool and associated logic and methodology are used in converting data from an input form to a target form using context dependent conversion rules, and in efficiency generating an index that may be utilized to access the converted data in a database. Once the data has been converted, an index data structure for each data object may be automatically generated that encodes one or more characteristics or attributes of the converted data so that an entity may access the data using the index structure. As an example, the one or more characteristics may include categories, subcategories, or other attributes of the data.
    Type: Application
    Filed: October 16, 2009
    Publication date: April 21, 2011
    Applicant: SILVER CREEK SYSTEMS, INC.
    Inventors: Alec Sharp, Luis Rivas, Mark Kreider
  • Publication number: 20110087647
    Abstract: A system and method for providing Web search results to a particular computer user based on the popularity of the search results with other computer users is described. One embodiment monitors, using one or more servers, at least one Web service for new actions of sharing of Web content by computer users; identifies, from the new actions of sharing of Web content by computer users, a data item that satisfies predetermined interestingness criteria; parses the data item to obtain at least one Uniform Resource Locator (URL); crawls at least one Web page corresponding to the at least one URL to obtain the content of the at least one Web page; analyzes the content of the at least one Web page; and updates an index based on the content of the at least one Web page, the index being usable in processing a Web search query from a particular user.
    Type: Application
    Filed: October 13, 2009
    Publication date: April 14, 2011
    Inventors: Alessio Signorini, Ioannls Pavlids, Nathaniel Fisher, Scott Engstrom, Peter J. Newcomb, David L. Young, Ron Benson
  • Publication number: 20110066623
    Abstract: Systems and methods for compressing indices are described. In one aspect, a plurality of items are selected where each item has an entry in an inverted index and each item entry comprises a listing of articles that the item appears in. At least a first item entry and a second item entry are determined for compression and the second item entry is compressed into the first item entry resulting in a compressed first item entry.
    Type: Application
    Filed: September 20, 2010
    Publication date: March 17, 2011
    Inventor: Adam J. Weissman
  • Publication number: 20110066622
    Abstract: Methods, systems and computer readable media for extracting product lines from a plurality of product titles are provided. In one embodiment, the plurality of product titles are broken into tokens. Association rules are calculated for individual tokens and pairs of tokens. Brand specific terms and product class specific terms within the product titles are identified. In one embodiment, a token tree is used to identify product lines within the list of product titles using the association rules, the brand specific terms, and the product class specific terms.
    Type: Application
    Filed: November 22, 2010
    Publication date: March 17, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Nimish G. Dharawat, Meera Mahabala, Gitika Gupta
  • Publication number: 20110060743
    Abstract: A method and apparatus is provided for locating and retrieving specified data content in a database. The data comprises compressed digital audio or video data files associated with the recorded speech. Retrieval of the specified content requires decompression of only a portion of the compressed data. A method for locating specified content of the above type is provided. A compressed audio file comprising recorded speech is converted into a corresponding text file. A searchable index is constructed from the text file. One or more specified search arguments are used to search respective elements of the searchable index in order to detect one or more text segments. The identifiers of respective detected segments are then used to locate the specified content in the audio file. Only portions of the audio file that contain specified content require decompression, in order to retrieve the content.
    Type: Application
    Filed: November 12, 2010
    Publication date: March 10, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oliver Keren Ban, Timothy Alan Dietz, Anthony Cappa Spielberg
  • Publication number: 20110047165
    Abstract: A network cache (30) that includes multiple storage units (20) and multiple control units (10) that are coupled to multiple user devices (50) via a network (40), wherein the network cache is adpated to receive a file related request provided from a user device, and wherein the network cache is adapted to respond to the file related request by a selected control unit and by a selected storage unit, wherein the selected storage unit is selected in response to a file related request based on a file tag that is responsive to a content of the file, and wherein the selected control unit is selected in response to an identity of the user device.
    Type: Application
    Filed: July 16, 2007
    Publication date: February 24, 2011
    Inventors: Ofer Wald, Ayelet Wald
  • Publication number: 20110035398
    Abstract: A streaming query system for extensible markup language is provided. An XPath query translator receives and analyzes a user-input XPath document. An abstract syntax tree analyzer establishes an abstract syntax tree. A XML parser receives and parses an XML document. An index generator generates an index for the XML document. A computation module performs a format calculation based on the abstract syntax tree and the index, and generates a query result accordingly.
    Type: Application
    Filed: July 23, 2010
    Publication date: February 10, 2011
    Applicant: NATIONAL TAIWAN UNIVERSITY OF SCIENCE & TECHNOLOGY
    Inventors: Hahn-Ming Lee, Li-Zhen Liu, Chieh-Hung Lin, Jerome Yeh, Chia-Hsin Huang
  • Publication number: 20110029504
    Abstract: A facility for exposing an index of private documents is described. In a private network, the facility (1) identifies electronic versions of documents that are available inside the private network, including a distinguished document; (2) constructs an index covering the identified electronic versions of documents; and (3) exports the constructed index from the private network to an index publication server. At the index publication server, the facility (1) receives the exported index; (2) receives a query via a public network; and (3) uses an index, based upon the received index, to generate a query result for the received query that contains the distinguished document.
    Type: Application
    Filed: October 5, 2010
    Publication date: February 3, 2011
    Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushier, James Q. Stafford-Fraser
  • Publication number: 20110022566
    Abstract: A digitally signed file system in which data, metadata and files are objects, each object having a globally unique and content-derived fingerprint and wherein object references are mapped by the fingerprints; the file system has a root object comprising a mapping of all object fingerprints in the file system, such that a change to the file system results in a change in the root object, and tracking changes in the root object provides a history of file system activity.
    Type: Application
    Filed: June 25, 2010
    Publication date: January 27, 2011
    Applicant: SimpliVT Corporation
    Inventors: Arthur J. Beaverson, Paul Bowden
  • Publication number: 20100332479
    Abstract: Systems and methods are disclosed for performing data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods are disclosed for content indexing data stored within a cloud environment to facilitate later searching, including collaborative searching. Methods are also disclosed for performing containerized deduplication to reduce the strain on a system namespace, effectuate cost savings, etc. Methods are disclosed for identifying suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy.
    Type: Application
    Filed: March 31, 2010
    Publication date: December 30, 2010
    Inventors: Anand Prahlad, Rajiv Kottomtharayil, Srinivas Kavuri, Parag Gokhale, Manoj Vijayan
  • Publication number: 20100312770
    Abstract: A customizable logging and content management system for indexing multimedia, including a synchronized timer object that provides a time reference upon request in connection with the media, and a logger object that logs predefined events that occur in the media by associating the events with respective time references from the timer object. A video server is provided that captures and digitally stores events logged by the logging application as media segments, and a search and retrieval engine is provided that enables the media segments to be located, retrieved and viewed based on the indexes. The system includes a graphical user interface generator that enables customized user interfaces and logging databases to be created from database tables for use in the logging application.
    Type: Application
    Filed: July 6, 2010
    Publication date: December 9, 2010
    Applicant: Charles Smith Enterprises, LLC
    Inventors: Charles Smith-Semedo, Rolando Blackman, Stephen Jacobs, Guerrino Lupetin, Rafael Cortina
  • Publication number: 20100287166
    Abstract: Data indexing includes receiving data from a data source; classifying the data into one of a plurality of categories according to a predetermined data classification criteria; establishing a corresponding relationship between the data and an index associated with the data, the index having a preset maximum capacity; and recording the relationship between the data and the index. The index is one of a plurality of indices, and each of the plurality of indices is exclusively written by an index writing device.
    Type: Application
    Filed: May 5, 2010
    Publication date: November 11, 2010
    Inventor: Hanfei Yang
  • Publication number: 20100250552
    Abstract: A local search engine efficiently indexes documents relevant to a geographical area by indexing, for each document, multiple location identifiers that collectively define an aggregate geographic region. When creating the index, the search engine may determine a set of geographical areas surrounding a geographical area relevant to a document and associate references to the set of geographical areas with the document index.
    Type: Application
    Filed: June 15, 2010
    Publication date: September 30, 2010
    Applicant: GOOGLE INC.
    Inventor: Daniel EGNOR
  • Publication number: 20100228794
    Abstract: A technique for dynamic integration and semantic analysis of structured data and unstructured textual data including: defining and selecting static attributes and dynamic attribute from structured data, embedding static and dynamic views of the selected corresponding attributes in an annotated document, linking the unstructured textual data with the structured data using the defined static and dynamic attributes, populating an annotated document structure of multiple annotated documents, performing semantic analysis of a query across the unstructured textual data and structured data, querying the annotated document structure to provide query results satisfying static part of the query, processing static and dynamic parts of the query by querying structured data and the annotated document structure, as appropriate, and providing a combined query processing result satisfying the dynamic and static part the query. Other embodiments are also disclosed.
    Type: Application
    Filed: February 25, 2009
    Publication date: September 9, 2010
    Applicant: International Business Machines Corporation
    Inventors: Sourashis Roy, Himanshu Gupta, Hiroki Oya, Mukesh Kumar Mohania, Inagaki Iwao
  • Patent number: 7783660
    Abstract: The disclosure describes search systems and methods in which exact token searches, spelling suggestions, and split-token searches are used in conjunction to return search results to the user. Depending on the number and relevancy of results for the search query results from each of the steps the results are either merged or discarded into the final result set. The split-token search is adapted to generate two split-tokens from the token(s) of the search query in anticipation that the search token(s) is misspelled. As the location of the misspelling is unknown, the split-token search widens the scope of the results provided in response to the search. In an embodiment, the split-token search includes performing a prefix search for tokens matching a prefix split-token and a postfix search for tokens matching a postfix split-token. In an embodiment, the index is specially adapted to allow the postfix search to be performed more efficiently.
    Type: Grant
    Filed: October 5, 2006
    Date of Patent: August 24, 2010
    Assignee: Yahoo! Inc.
    Inventors: Jagadeshwar R. Nomula, Christa Stelzmuller
  • Publication number: 20100191750
    Abstract: The present invention is a multiple-column index comprised of at least three columns. The columns are repetitious loops of identical content displayed over several pages wherein the relationship between the columns would be consistent. At least two columns have listings in the order of “A” to “Z” beginning at different points of the alphabet. At least one column has listings in “Z” to “A” order. If a search-result is organized on numerous pages, the invention would maximize the potential for discoverability for businesses listed beyond the first page.
    Type: Application
    Filed: January 26, 2010
    Publication date: July 29, 2010
    Inventor: A. Leon White
  • Publication number: 20100191590
    Abstract: Method for creating a controlled data transfer connection between a remote device and a subscriber terminal by a transmission system. The first party of the interconnection, (remote device), creates a connection to the transmission system, which verifies information used for the authentication informed by the remote device and allocates an unique identifier ID for the remote device, by which the remote device can be addressed in the transmission system. The other part of the interconnection, (subscriber terminal), requests the transmission system to transmit the request to the remote device, identified by the identifier. The transmission system transmits this request to the remote device, which processes the request and sends the response via the transmission system to the subscriber terminal. This response can be converted in the transmission system to a form suitable for the subscriber terminal, and subscriber-targeted advertisements, or other data, may be added in the response.
    Type: Application
    Filed: May 26, 2008
    Publication date: July 29, 2010
    Applicant: HUUKED LABS OY
    Inventors: Harri Hakkarainen, Juha Utriainen
  • Publication number: 20100161639
    Abstract: Computer methods, apparatus and articles of manufacture therefor, are disclosed for developing a complex-query pattern that is transformed into a region-matching transducer. A corpus-level transducer and the region matching transducer are combined. The combined transducer is applied to a corpus to identify strings therein that satisfy patterns defined in the corpus-level transducer, including the complex-query pattern, with each identified pattern being recorded in a corpus index. The corpus and the corpus index are made available for receiving a query with the query tag for querying the corpus and applying the query using the corpus index to identify locations in the corpus that satisfy the query.
    Type: Application
    Filed: December 18, 2008
    Publication date: June 24, 2010
    Applicant: Palo Alto Research Center Incorporated
    Inventors: Daniel G. Bobrow, Robert D. Cheslow
  • Publication number: 20100145948
    Abstract: Disclosed are a method and a device for searching contents by using time information or spatial information. The device for contents search includes a memory unit configured to store contents having spatial information and time information as search information and to further store groups into which the contents are classified by the spatial information or the time information. The device further includes a display unit configured to display a time information search tool and a spatial information search tool in response to receipt of a request for a contents search is received, and to further display the contents belonging to a searched group. Also the device includes an input unit configured to receive an input of search information and a control unit configured to search a group having the selected search information.
    Type: Application
    Filed: December 9, 2009
    Publication date: June 10, 2010
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: Gyung Hye Yang, Jin Young Jeon, Sang Woong Hwang, Ji Young Kwahk, Jee Young Her, Ji Sun Yang
  • Publication number: 20100138421
    Abstract: Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
    Type: Application
    Filed: February 3, 2010
    Publication date: June 3, 2010
    Applicant: GOOGLE INC.
    Inventors: Jeffrey David Oldham, Hal R. Varian, Matthew D. Cutts, Matt Rosencrantz
  • Patent number: 7716224
    Abstract: Search may be performed on a user device, such as a handheld electronic book reader device. A search query term may be received. Text of a collection of electronic items stored in memory of the user device may be searched for the queried term. Search results may be returned identifying locations in the electronic items at which the queried term appears.
    Type: Grant
    Filed: June 14, 2007
    Date of Patent: May 11, 2010
    Assignee: Amazon Technologies, Inc.
    Inventors: James R. Reztlaff, II, John Lattyak
  • Publication number: 20100106727
    Abstract: A system and method are provided for enabling a user to search for documents that the user has previously viewed on its local machine. The system includes three main components: the desktop integration module, the index module, and the graphical user interface module. The desktop integration module is an application which monitors documents with which the user interacts for predetermined events, and obtains content data and metadata from the monitored documents. The index module indexes the content data and metadata received from the desktop integration module. The graphical user interface module then permits a user to utilize the desktop integration module and index module by allowing a user to search for a document.
    Type: Application
    Filed: September 25, 2009
    Publication date: April 29, 2010
    Applicant: IBM Corporation
    Inventors: Tolga Oral, David L. Newbold, Michael Bolin, Raudel S. Rodriguez
  • Publication number: 20100094875
    Abstract: A content classification system, method and computer product is presented. In connection with the invention, a data structure is created by identifying a plurality of words and mapping each word to one or more categories. The data structure is indexed. An item of content is identified and classified based on the data structure. The classification includes identifying all one—or more—word combinations in the item of content; for each word of at least a pre-determined number of characters in length in each of the word combinations, identifying each of the categories to which it is mapped; and determining a weight for each of the words based on an inverse proportion to the number of categories to which it is mapped.
    Type: Application
    Filed: August 11, 2009
    Publication date: April 15, 2010
    Applicant: Collective Media, Inc.
    Inventors: Paul Harrison, James Oliphant, Hal Fulton, Armin Roehrl, Brenden Grace
  • Publication number: 20100088318
    Abstract: Disclosed is a system in which an index registration unit registers an index, which will be used for search processing, as a partitioned index which is partitioned on a time series basis, and a search means reads indexes older than a specified point in time, which is used as a search base point, to perform search processing, thereby searching for information based on a point in time in the past.
    Type: Application
    Filed: October 2, 2007
    Publication date: April 8, 2010
    Inventors: Masaki Kan, Yoshihiro Kajiki, Satoshi Yamakawa, Takashi Torii, Yuji Kaneko
  • Publication number: 20100076981
    Abstract: A method and apparatus for efficient indexed storage for unstructured content have been disclosed.
    Type: Application
    Filed: November 30, 2009
    Publication date: March 25, 2010
    Applicant: Nahava Inc.
    Inventor: Russell T. Nakano
  • Patent number: 7660787
    Abstract: A client-side search indexing program works transparently and in conjunction with a server based search index. The combined search indexes provide a more accurate and up-to-date image of the Web, customized to the interests of each individual user. The client-side indexer customizes indexing of particular Web pages to the preferences and usage patterns of the user. The user initially installs and configures the client-side indexer on the client. The requested indexes are automatically refreshed and integrated with the main server-side indexes during a search. When the user performs a search, the client-side indexes may be combined with the main server-side index. The combined indexes provide accurate search results for the particular user.
    Type: Grant
    Filed: July 19, 2006
    Date of Patent: February 9, 2010
    Assignee: International Business Machines Corporation
    Inventors: David Joseph Borrillo, Ryan Kirk Cradick, Zachary Adam Garbow
  • Publication number: 20090198671
    Abstract: A system for generating subphrase queries. The system includes a sequence label modeling engine and a regression modeling engine. The sequence label modeling engine generates a plurality of subphrase queries by indexing through each token in a search phrase and labeling each token based on an association to other tokens in the search phrase. The regression modeling engine scores each subphrase query at least partially on the association according to a scoring model. The regression modeling engine identifies the subphrase query with the highest score which may then be used for identifying a sponsored search list or a web search item.
    Type: Application
    Filed: February 5, 2008
    Publication date: August 6, 2009
    Applicant: Yahoo! Inc.
    Inventors: Ruofei Zhang, Haibin Cheng, Yefei Peng, Benjamin Rey, Jianchang Mao