Index Generation Patents (Class 707/711)
  • Patent number: 8352455
    Abstract: The invention described herein can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program; a product embodied on a computer readable storage medium, e.g., RAM, ROM, and other hardware storage devices; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task.
    Type: Grant
    Filed: June 27, 2008
    Date of Patent: January 8, 2013
    Assignee: Allvoices, Inc.
    Inventors: Amra Q. Tareen, Sanjay C. Sood, Lawrence A. Birnbaum, Kristian J. Hammond, Erik Sundelof
  • Publication number: 20130006958
    Abstract: Keyword expansion is used to expand a particular keyword into multiple related keyword candidates using a collection of data that relates to search queries performed using one or more search engines. The data identifies search terms from associated search queries. A particular keyword is identified and one or more candidate keywords related to the particular keyword are identified based on the data.
    Type: Application
    Filed: September 10, 2012
    Publication date: January 3, 2013
    Inventor: William Cava
  • Patent number: 8346774
    Abstract: In one exemplary embodiment, a storage medium embodying a program of instructions for performing operations including: storing raw information for primary properties of primary nodes within a primary network; in response to receiving a query, transforming the raw information into transformed information using key(s) and based on the query, the query is in relation to a queried property, the transformed information is transformed data for the queried property, the transformed information represents a surrogate network having surrogate nodes corresponding to a portion of the primary nodes, the transformed information enables performance of operation(s) without specific knowledge of and without revealing the entirety, the transformed information further enables one having the key(s) to relate an output of the operation to the raw information; and generating a solution that relates a transformed query to the transformed information, the transformed query is a transformed representation of the query obtained by us
    Type: Grant
    Filed: August 8, 2011
    Date of Patent: January 1, 2013
    Assignee: International Business Machines Corporation
    Inventors: Dimitri Kanevsky, James R. Kozloski, Clifford A. Pickover, Tara N. Sainath
  • Patent number: 8346753
    Abstract: A system and method for searching and voting on Internet-accessible content is presented. HTML-linked content on one or more Internet host servers is indexed and stored on associated meta servers. Non-HTML content and dynamic HTML content from the deep web, is processed by the present invention into HTML content, indexed, and stored on the meta servers. Each meta server includes a local search engine. The present invention also includes a central repository which tracks user votes, query language progressions, and information about content on each of the meta sites. The present invention also provides a plurality of enhancement plug-ins for standard web robots, including a toolbar for conventional web browsers. The toolbar can be used to securely solicit and authenticated user votes from standard web sites, while preserving the privacy of the voter. The plug-ins facilitate interoperability between existing Internet software (and technology) and the present invention.
    Type: Grant
    Filed: November 14, 2007
    Date of Patent: January 1, 2013
    Inventor: Paul V Hayes
  • Patent number: 8346534
    Abstract: The present invention provides a method and a system for automatic keyword extraction based on supervised or unsupervised machine learning techniques. Novel linguistically-motivated machine learning features are introduced, including discourse comprehension features based on construction integration theory, numeric features making use of syntactic part-of-speech patterns, and probabilistic features based on analysis of online encyclopedia annotations. The improved keyword extraction methods are combined with word sense disambiguation into a system for automatically generating annotations to enrich text with links to encyclopedic knowledge.
    Type: Grant
    Filed: November 6, 2009
    Date of Patent: January 1, 2013
    Assignee: University of North Texas System
    Inventors: Andras Csomai, Rada Mihalcea
  • Patent number: 8346812
    Abstract: Indexing in a resource description framework data system including a search key set of ranking relationships for the subject, predicate, and object of RDF statements, generation of a single nodal index including a node for each of the RDF statements, each node having a plurality of rank tracking indicators that include a position indicator representing the relative position of the RDF statement in each ranking relationship. A computer readable storage medium and a system for indexing a resource description framework data system are also provided.
    Type: Grant
    Filed: October 22, 2010
    Date of Patent: January 1, 2013
    Assignee: Intellidimension, Inc.
    Inventors: Geoffrey Chappell, Derrish Repchick
  • Patent number: 8346778
    Abstract: Techniques for compressing branch nodes in an index are provided. The branch nodes may be part of a main index of a multi-level index that also includes one or more journal indexes. A Bloom filter may be generated and associated with, e.g., a branch node in the main index. The Bloom filter is used to determine whether, without accessing any leaf blocks, a particular key value exists, e.g., in leaf blocks associated with the branch node.
    Type: Grant
    Filed: May 20, 2009
    Date of Patent: January 1, 2013
    Assignee: Oracle International Corporation
    Inventors: Amit Ganesh, Juan R. Loaiza, Krishna Kunchithapadam
  • Patent number: 8346756
    Abstract: Tools and techniques related to calculating valence of expressions within documents. These tools may provide methods that include receiving input documents for processing, and extracting expressions from the documents for valence analysis, with scope relationships occurring between terms contained in the expressions. The methods may calculate calculating valences of the expressions, based on the scope relationships between terms in the expressions.
    Type: Grant
    Filed: August 29, 2008
    Date of Patent: January 1, 2013
    Assignee: Microsoft Corporation
    Inventors: Livia Polanyi, Martin Henk Van den Berg, Barney Pell
  • Publication number: 20120330889
    Abstract: A remote content crawler continually crawls a digital communication network looking for content to provide to a content aggregator, The content provided to the aggregator may be stored in a form of an entire content file. The content may include an entire movie, television program or electronic book. Alternatively, the content provided to the aggregator may be a reference to a content file that is stored at, or that will he available at one of the remote locations. The content may be a reference to a future, scheduled live sports event that will be made available to system users. The sports event may be provided for a one time fee, as part of a sports package, for which a fee is collected, or as a free event.
    Type: Application
    Filed: September 7, 2012
    Publication date: December 27, 2012
    Applicant: Comcast IP Holdings I, LLC
    Inventors: William D. Swart, Michael L. Asmussen, John S. McCoskey
  • Patent number: 8341019
    Abstract: In embodiments disclosed, information on a click stream associated with a user and meta tags commonly used by search engines can be used to profile a user. The click stream may comprise network addresses and timestamps and may be the only source of user information used for the profile. The meta tags may contain metadata that correspond to the network addresses in the click stream, including network address categorization for these network addresses. The profile obtained for the user may be an existing profile that represents a person or a theoretical individual having attributes similar to or most closely match the user. Based at least in part on the profile, information regarding an item may be provided to the user or the item may be offered to the user. In some embodiments, the information may be filtered before being displayed to the user.
    Type: Grant
    Filed: October 15, 2010
    Date of Patent: December 25, 2012
    Assignee: Open Text, S.A.
    Inventor: Sean M. McCullough
  • Patent number: 8335779
    Abstract: An apparatus for and a method of gathering new data form a specific data source, categorizing this data into a hierarchical directory structure, and building parameters to further describe this data. A harvesting system is an application that can organize and locate external data; the harvesting system can be used to navigate, organize and locate this data. The harvesting system utilizes a defined data model, referred to as a core data structure, that describes the logical structure of the data. Within the core data structure are pointers, which refer back to the source data. The loading process can be manually done, or can be automatically accomplished using this harvesting process.
    Type: Grant
    Filed: May 9, 2011
    Date of Patent: December 18, 2012
    Assignee: Gamroe Applications, LLC
    Inventor: Robert Olan Keith, Jr.
  • Publication number: 20120317089
    Abstract: A scheduler for a search engine crawler includes a history log containing document identifiers (e.g., URLs) corresponding to documents (e.g., web pages) on a network (e.g., Internet). The scheduler is configured to process each document identifier in a set of the document identifiers by determining a content change frequency of the document corresponding to the document identifier, determining a first score for the document identifier that is a function of the determined content change frequency of the corresponding document, comparing the first score against a threshold value, and scheduling the corresponding document for indexing based on the results of the comparison.
    Type: Application
    Filed: April 17, 2012
    Publication date: December 13, 2012
    Inventor: Keith H. Randall
  • Patent number: 8332430
    Abstract: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety of sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.
    Type: Grant
    Filed: February 28, 2007
    Date of Patent: December 11, 2012
    Assignee: Oracle International Corporation
    Inventors: Hiroshi Koide, Mark Ture, Muralidhar Krishnaprasad, Mark Davis, Cindy Hsin, Meeten Bhavsar, Steve Chi-Ming Yang, Visar Nimani, Hui Ouyang, Sachin Bhatkar, Thomas Chang, Thomas Baby, Ciya Liao
  • Patent number: 8331736
    Abstract: An image processing device is provided which generates an easily reusable electronic document from an input image in which different page sizes are mixed. The image processing device generates a plurality of pieces of display information from a plurality of document images, and, depending on the size and the direction of each of the images, converts the pieces of display information into electronic documents. That is, the plurality of pieces of display information are divided into a plurality of groups, depending on the size and the direction of each of the images, and the display information included in each of the groups is converted into a separate electronic document. Further, sequence information based on the input order of the plurality of document images is stored on an electronic document.
    Type: Grant
    Filed: May 20, 2009
    Date of Patent: December 11, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventors: Keiko Nakanishi, Makoto Enomoto, Taeko Yamazaki
  • Publication number: 20120310915
    Abstract: Finding indexed documents is described herein, including: extracting a character string from the search query and segmenting the character string into a plurality of search terms, wherein each search term comprises one or more search words; retrieving indexed document sets corresponding to search words included in the plurality of search terms; processing the retrieved indexed document sets associated with a first search term of the plurality of search terms based at least in part on a selection operation, wherein processing results of the retrieved indexed document sets associated with the first search term comprise search results for the first search term; performing set intersection operations on search results corresponding to the plurality of search terms to generate a results document set; and determining and returning a first subset of indexed documents of the results document set that each includes each of the plurality of search terms.
    Type: Application
    Filed: May 30, 2012
    Publication date: December 6, 2012
    Applicant: Alibaba Group Holding Limited
    Inventor: Dong Yang
  • Patent number: 8326820
    Abstract: Described herein is a technology that facilitates efficient large-scale similarity-based retrieval. In several embodiments documents, images, and/or other multimedia files are compactly represented and efficiently indexed to enable robust search using a long-query in a large-scale corpus. As described herein, these techniques include performing decomposition of a file, e.g., a document or document-like representation. The techniques use dimension reduction to obtain three parts, topic-related words (major semantics), document specific words (minor semantics), and background words, representing the major semantics in a feature vector and the minor semantics as keywords. Using the techniques described, file vectors are matched in a topic model and the results ranked based on the keywords.
    Type: Grant
    Filed: September 30, 2009
    Date of Patent: December 4, 2012
    Assignee: Microsoft Corporation
    Inventors: Zhiwei Li, Lei Zhang, Rui Cai, Wei-Ying Ma, Heung-Yeung Shum
  • Patent number: 8326819
    Abstract: Disclosed herein is a method and system for hardware-accelerating the generation of metadata for a data stream using a coprocessor. Using these techniques, data can be richly indexed, classified, and clustered at high speeds. Reconfigurable logic such a field programmable gate arrays (FPGAs) can be used by the coprocessor for this hardware acceleration. Techniques such as exact matching, approximate matching, and regular expression pattern matching can be employed by the coprocessor to generate desired metadata for the data stream.
    Type: Grant
    Filed: November 12, 2007
    Date of Patent: December 4, 2012
    Assignee: Exegy Incorporated
    Inventors: Ronald S. Indeck, Naveen Singla, David E. Taylor
  • Publication number: 20120303607
    Abstract: Methods, apparatuses, and articles for receiving a search request associated with a transportation service from a client device, the search request including a plurality of search criteria, are described herein. Additionally, the methods, apparatuses, and articles further return to the client device an answer page having a plurality of answers potentially associated with the transportation service, the plurality of answers identifying a plurality of information locations having information potentially associated with at least a one of the plurality of search criteria, where at least one of the answers includes at least one input field of a query answer page for entry of at least one feature of the transportation service, the query answer page to be dynamically generated by one of the information locations in response to a query.
    Type: Application
    Filed: June 7, 2012
    Publication date: November 29, 2012
    Applicant: DEEP WEB, LLC
    Inventors: Michael Z. Moricz, John R. Kasha, JR.
  • Patent number: 8321434
    Abstract: A system (and a method) is disclosed to extract entity values from texts. The system receives, at a first tier entity recognition engine, an input data string having a plurality of entities. The first tier entity recognition engine marks entities of the plurality of entities that are regular expression and transmits the input data stream with the marked entities to a second tier entity recognition engine. The second tier entity recognition engine receives the input data stream and identifies unmarked entities in the input data stream received at the second tier entity recognition engine. The second tier entity recognition engine determines whether the unmarked entities comprise a predetermined data format, and if so, outputs those unmarked entities of the plurality of entities that comprise the predetermined data format.
    Type: Grant
    Filed: August 3, 2007
    Date of Patent: November 27, 2012
    Assignee: Trend Micro Incorporated
    Inventors: Liwei Ren, Shu Huang
  • Patent number: 8321198
    Abstract: This invention provides a terminal searching for web pages on the web and extracting the prescribed data from the web pages and a server verifying and accumulating the extracted data. The prescribed data can be extracted from the web pages on the web in a manner that the process relating to the data extraction is distributed between the terminal and the server. Therefore, necessary processes up to the data extraction are distributed, and the burden placed on each apparatus can be lessened. Further, new data not formerly found in the web pages can be found out and extracted from the web pages that has been updated or newly made.
    Type: Grant
    Filed: October 27, 2005
    Date of Patent: November 27, 2012
    Assignee: Kabushiki Kaisha Square Enix
    Inventor: Kengo Nakajima
  • Patent number: 8321401
    Abstract: Automatically and repeatedly crawling multiple multimedia websites to identify and collect information about the multimedia content that is available for delivery over the Internet to a client device for playback on a media player operating on the client device. In one embodiment, the method normalizes the collected information by converting the different formats of the collected information into a common format and converting the different nomenclatures of the collected information into a common nomenclature. The method updates an index with the normalized information, and sends a data feed to the client device to populate the user interface on the client device with the normalized information of the index. The user interface allows the user to navigate and select the multimedia content that is available for delivery over the Internet for playback on the media player.
    Type: Grant
    Filed: October 17, 2008
    Date of Patent: November 27, 2012
    Assignee: EchoStar Advanced Technologies L.L.C.
    Inventors: John Edwards, Nathan Edwards, David Brueck
  • Patent number: 8321392
    Abstract: A system, method, and computer program product are provided for invoking an operation to change a current data row of a table in a local database. As a result, a pending change data row is inserted in the table, distinguished by a primary key comprising a common key value and a pending change indicator. The pending change is then submitted for synchronization.
    Type: Grant
    Filed: June 10, 2010
    Date of Patent: November 27, 2012
    Assignee: Sybase, Inc.
    Inventors: Evan Ireland, David Clegg, Michael Ho, Hemal Pandya, Guo-ping Zhang
  • Patent number: 8316008
    Abstract: A method of performing a file search with specified attribute criteria includes generating or having access to a file system image of the data storage system. A substantially sequential scan of the file system image can then be performed. Based on the scan, a list of inodes (called a specified criteria inode map) on the file system image that have the specified attribute criteria can be built. This sequential scan eliminates the random scan of the file system image, thereby dramatically reducing the total time associated with reading of the inodes. A file tree of the data storage system can then be walked to find inodes (in the file tree) that match inodes in the specified criteria inode map. A list of files that correspond to matching inodes can thus be quickly and easily generated.
    Type: Grant
    Filed: April 14, 2006
    Date of Patent: November 20, 2012
    Assignee: Mirapoint Software, Inc.
    Inventor: Jaspal Kohli
  • Patent number: 8316046
    Abstract: A journaling subsystem on a mobile device stores event data related to applications or other subsystems running on the mobile device. The event data can be stored and indexed in a journal database so that a timeline of past events can be reconstructed in response to search queries. In some implementations, a timeline can be reconstructed with markers on a map display based on search results. When the user interacts with a marker on the map display, the event data collected by the mobile device is made available to the user.
    Type: Grant
    Filed: September 28, 2010
    Date of Patent: November 20, 2012
    Assignee: Apple Inc.
    Inventors: Ronald K. Huang, Patrick Piemonte
  • Publication number: 20120290557
    Abstract: A method and system associating labels and attribute values with items in a collection of data. Providers can associate attributes and labels with their data or attributes and labels can be added to existing data. A preferred embodiment allows a content provider to upload data and to attach their own custom labels and attributes to items or to use predefined labels and attributes. Providers can upload data using a user interface or a bulk upload mechanism.
    Type: Application
    Filed: January 11, 2012
    Publication date: November 15, 2012
    Applicant: GOOGLE INC.
    Inventors: Bindu Reddy, Marshall Spight, Ning Mosberger
  • Publication number: 20120290556
    Abstract: An anonymous referendum system provides a means for an end user to enter an answer to a referendum without being forced to navigate to a secondary interface. The anonymous referendum system has a server that includes a database for storing available referendums and their corresponding status, operating parameters and answers, an answer checker for tallying or discarding an answer and at least one client that includes an interface. In the case a cookie is stored at the interface, the message is transmitted back to the server with a hashed cookie as a field in the header portion of the message. In the case a cookie is not used, a hashed MAC address is inserted as a field in the header portion. A mechanism for causing sharing of information between referendum pools is further disclosed.
    Type: Application
    Filed: August 17, 2011
    Publication date: November 15, 2012
    Inventor: Jesus Acosta-Cazaubon
  • Publication number: 20120290558
    Abstract: Embodiments of the invention provide a system and method for searching and reporting on semistructured data that can include dynamic metadata. One embodiment can comprise providing a user interface to a user based on an object type definition for an object type that allows the user to specify search criteria associated with a set of metadata, mapping the user search criteria to a query that comprises at least one structured query constraint and at least one unstructured query constraint, processing the query to search a set of data objects containing semistructured data associated with the object type according to the query and returning a set of results to the user. The search results can be returned to a user based on user-specified reporting parameters. Additionally, the reporting definition can be saved as an object for future execution.
    Type: Application
    Filed: July 24, 2012
    Publication date: November 15, 2012
    Inventor: Thomas Robert Gruber
  • Patent number: 8311795
    Abstract: A conceptualization method uses maximum or other substrings of a string pattern to find specific N-tuples of substring triples with N?2 and m=1 . . . N inside a reference set (SET_r_i) of strings (STR_n_i). Each N-tuple is considered as a candidate for representing related concepts. Each concatenation of the substrings triples is an explicit member of the reference set (SET_r_i). Each middle substring out of middle substrings is unequal to another middle substring out of middle substrings within the substring triples found inside the reference set (SET_r_i). Each prefix substring (X_i) is equal to all other prefix substrings (X_i) within the substring triples found inside the reference set (SET_r_i). Each suffix substring (Z_i) is equal to all other suffix substrings (Z_i) within the substring triples found inside the reference set (SET_r_i). Either the prefix substring (X_i) or the suffix substring (Z_i) is not empty.
    Type: Grant
    Filed: December 31, 2008
    Date of Patent: November 13, 2012
    Assignee: International Business Machines Corporation
    Inventors: Andreas Arning, Roland Seiffert
  • Publication number: 20120284253
    Abstract: A new approach is proposed that contemplates systems and methods to provide query suggestions including real-time suggestion of complete query terms, which can be phrases, to a user by analyzing and indexing the real-time history/stream of content or documents in addition to the stream of queries entered. Since the real-time indexing generates a count of potential results for each term found and/or indexed in the stream, the terms found in that stream can then be used as potential query suggestions, knowing that it will be possible to provide results for those queries.
    Type: Application
    Filed: June 14, 2011
    Publication date: November 8, 2012
    Inventors: Rishab Aiyer Ghosh, Lun Ted Cui
  • Patent number: 8306988
    Abstract: A system, method, and computer program product are provided for segmenting a database based, at least in part, on a prevalence associated with known objects included in the database. In operation, a database including a plurality of known objects is identified. Additionally, the database is segmented into a plurality of segments. Furthermore, each of the plurality of known objects are assigned to one of the plurality of segments, based at least in part on a prevalence associated with each of the plurality of known objects.
    Type: Grant
    Filed: October 26, 2009
    Date of Patent: November 6, 2012
    Assignee: McAfee, Inc.
    Inventors: Nitin Jyoti, Prabhat Kumar Singh, Zhenyu Zhong, Guy William Welch Roberts, Jeffrey Martin Green, Sven Krasser
  • Patent number: 8306964
    Abstract: A search engine system is described herein that provides an alternative data path for collecting results provided by index servers. The alternative data path collects the results in a direct and asynchronous manner; this is in contrast to a synchronous path used to deliver search results to end users via one or more aggregator modules. An analysis system can use the alternative data path to collect a large amount of richly descriptive information regarding the performance of the search engine system, circumventing bottlenecks and other constraints that would otherwise be imposed by the synchronous data path. The analysis system can analyze the information collected from the index servers to improve the performance of the search engine system.
    Type: Grant
    Filed: July 20, 2010
    Date of Patent: November 6, 2012
    Assignee: Microsoft Corporation
    Inventors: Yinzhe Yu, Qinghua Zou, Pei Pei, Gaurav Sareen, Wee Wah Alfian Tan
  • Patent number: 8307276
    Abstract: A method of verifying content, for example webpage content, including receiving, at an indexing system (100), a first copy (240) of a document (220) from a hosting server (210), then processing the first copy (240) to obtain first indicating data including one or more properties of the first copy (240). Also obtaining second indicating data, the second indicating data obtained by processing a second copy (255) of the document (220) from the hosting server (210), the second copy (255) received by a client terminal (250), the second indicating data including one or more properties of the second copy (255). Then the first indicating data and the second indicating data can be compared, and the first copy (240) of the document (220) received by the indexing system (100) can be determined to be verified or not verified based on a comparison between the first indicating data and the second indicating data.
    Type: Grant
    Filed: May 16, 2007
    Date of Patent: November 6, 2012
    Assignee: Symantec Corporation
    Inventors: Rolf Repasi, Simon Clausen
  • Publication number: 20120278303
    Abstract: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety or sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.
    Type: Application
    Filed: May 30, 2012
    Publication date: November 1, 2012
    Applicant: Oracle International Corporation
    Inventors: Muralidhar Krishnaprasad, Sachin Bhatkar, Visar Nimani, Thomas Chang, Meeten Bhavsar
  • Publication number: 20120278304
    Abstract: Embodiments are configured to provide information relevant to individuals of interest to a searching user. In an embodiment, a method includes identifying relevant individuals of a network using a relevance model that includes the use of a number of managed properties and ranking features to identify relevant individuals of a defined network. The relevance model of one embodiment is defined by a schema that includes a textual matching ranking feature, social distance ranking feature, a levels to top ranking feature, and a proximity ranking feature.
    Type: Application
    Filed: July 9, 2012
    Publication date: November 1, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: BOXIN LI, DMITRIY MEYERZON, JESSICA ALSPAUGH, VICTOR POZNANSKI
  • Patent number: 8301603
    Abstract: Disclosed is a system in which an index registration unit registers an index, which will be used for search processing, as a partitioned index which is partitioned on a time series basis, and a search means reads indexes older than a specified point in time, which is used as a search base point, to perform search processing, thereby searching for information based on a point in time in the past.
    Type: Grant
    Filed: October 2, 2007
    Date of Patent: October 30, 2012
    Assignee: NEC Corporation
    Inventors: Masaki Kan, Yoshihiro Kajiki, Satoshi Yamakawa, Takashi Torii, Yuji Kaneko
  • Patent number: 8301514
    Abstract: Purchase phrases can be analyzed to infer user interests. A purchase phrase can include one or more user-defined textual words that a user employs to purchase items in an expedited manner. Many users' purchase phrases may include words that reflect the users' interests, such as “golf” or “rose.” Users with similar purchase phrases may therefore be interested in purchasing similar items. A recommendation system can therefore find items purchased with similar phrases to a target user's purchase phrase and then recommend these items to the target user.
    Type: Grant
    Filed: September 14, 2010
    Date of Patent: October 30, 2012
    Assignee: Amazon Technologies, Inc.
    Inventor: Rajendra K. Vippagunta
  • Publication number: 20120271813
    Abstract: Subject matter described herein includes a multi-layer search-engine index. Accordingly, the search-engine index is divided into multiple indexes, each of which includes a respective set of information used to serve (i.e., respond to) a query. One index includes a term index, which organizes a set of terms that are found among a collection of documents. Another index includes a document index, which organizes a set of documents that are searchable. A computing device is used to serve the search-engine index (i.e., to analyze the index when identifying documents relevant to a search query). For example, a solid-state device might be used to serve the multi-layer search-engine index.
    Type: Application
    Filed: March 23, 2012
    Publication date: October 25, 2012
    Inventors: HUI SHEN, Mao Yang, Lintao Zhang, Zhenyu Zhao, Xiao Wu, Ying Yan, Xiaosong Yang, Chad Walters, Choong Soon Chang
  • Patent number: 8296276
    Abstract: Methods and systems provide a tool for prioritizing the ordering of outstanding indexing work in order to bring a particular portion of an indexing source up to date quickly and to reduce the likelihood of inconsistencies between an index-backed view and a direct view of a source. In accordance with the described embodiments, indexing of items can be prioritized based upon a user's view or metadata contained within a query. Further, in at least some embodiments, the tool can decide the order to index items based upon multiple prioritization requests.
    Type: Grant
    Filed: December 22, 2008
    Date of Patent: October 23, 2012
    Assignee: Microsoft Corporation
    Inventors: Michael J. Novak, Christopher C. McConnell
  • Patent number: 8296279
    Abstract: Systems, methods, and computer program products identifying a search query, the search query including at least one search term. An index is used to identify a word as a search result for the search query, where the index includes a substring of the word, one or more inclusive strings corresponding to the substring, the one or more inclusive strings including the substring and at least one more character, and one or more word objects, the one or more word objects identifying content including the substring of the word.
    Type: Grant
    Filed: June 3, 2008
    Date of Patent: October 23, 2012
    Assignee: Google Inc.
    Inventor: Mukesh Kumar Singh
  • Patent number: 8296284
    Abstract: A navigation system is used in combination with a data retrieval system to more effectively locate the correct answers or information in search content. The navigation system provides taxonomies that identify different information categories and sub-categories (facets). Classifiers automatically classify the search content by identifying the content corresponding with associated facets. The facets, in conjunction with the associated classifiers, provide robust classification and correlation between both structured and unstructured content. Rules can be used in conjunction with the facets and classifiers to automatically control different navigation operations. The navigation system can also leverage ontologies to dynamically create taxonomies and provides unique facet expansion and intra-document classification operations.
    Type: Grant
    Filed: January 12, 2011
    Date of Patent: October 23, 2012
    Assignee: Oracle International Corp.
    Inventor: Gann Alexander Bierner
  • Patent number: 8296286
    Abstract: Provided is a database processing system including: a computer for outputting data in response to a received query request; and a storage system including a storage device for storing the data, in which: the storage device stores a plurality of partial indices indicating a storage location of the data; the data stored in the storage device is grouped; and the computer is configured to: receive the query request for the data; acquire one of the plurality of partial indices; specify, based on the query request for the data and the acquired one of the plurality of partial indices, a location at which the requested data is stored; and send a request to acquire the data stored at the specified location to the storage system. Accordingly, in the database processing system, a time period necessary to input and output the data is shortened.
    Type: Grant
    Filed: February 25, 2010
    Date of Patent: October 23, 2012
    Assignee: Hitachi, Ltd.
    Inventors: Michiko Tanaka, Kazutomo Ushijima, Akira Shimizu, Seisuke Tokuda, Shinji Fujiwara, Nobuo Kawamura
  • Patent number: 8291048
    Abstract: A method, system, and computer-readable medium are described for registering and using multilingual domain names that include characters outside the ASCII character subset supported by the DNS system. Such multilingual domain names can in some situations be registered by first being converted into appropriate ASCII-Compatible Encodings (ACEs) that represent the corresponding multilingual domain names and that use only characters within the ASCII character subset. In addition, a variety of binary variants may be generated at registration for each multilingual domain name and then used as equivalents for the multilingual domain name, such as by storing the variants in the registry as alternative domain names or by otherwise reserving the binary variants. When requests to resolve such a registered multilingual domain name into a corresponding IP address or URL are received, the stored binary variants and/or ACE information can then be used to respond in an appropriate manner.
    Type: Grant
    Filed: August 3, 2010
    Date of Patent: October 16, 2012
    Assignee: Verisign, Inc.
    Inventors: Brian Cartmell, Jothan Frakes
  • Patent number: 8290924
    Abstract: A type of search engine (referred to as the “Get Engine”) receives one or more keywords, semantically formulates a question being asked from the keywords, generates specifications for the query, and searches a website index to determine websites that are likely owners of the answer to the question based on the query specifications and website classifications. The Get Engine determines a website that is most likely the owner of the answer based on credibility, searches the pages of the website using the keywords and additional keywords related to the query, retrieves the answer from the pages of the website, and receives feedback used in part to determine the credibility of the website.
    Type: Grant
    Filed: July 17, 2009
    Date of Patent: October 16, 2012
    Assignee: Empire Technology Development LLC
    Inventor: Lalgudi Natarajan Rajaram
  • Publication number: 20120259835
    Abstract: An embodiment of a method includes receiving a selection of an original content item, deriving a supplemental content item from the original content item, designating a unique identifier to the supplemental content item, and inserting a reference including the unique identifier into the primary document. A system includes a candidate content item repository having one or more content items that can be included in a structured electronic document, a copy module configured to derive a base supplemental content item from a selected one of the candidate content items and further configured to designate a unique identifier for the base supplemental content item, and a document editor configured to receive the unique identifier and insert the unique identifier into a reference to the base supplemental content in the primary document.
    Type: Application
    Filed: June 14, 2012
    Publication date: October 11, 2012
    Applicant: BRIEF-LYNX, INC.
    Inventors: David Michael Vanderport, Kristen Marie Vanderport, Tadd Loebbaka
  • Publication number: 20120259834
    Abstract: A method is carried out by storing information describing configurations of discussion threads formed of respective series of EMTs that are exchanged among at least two individuals. The discussion threads have a root EMT, zero or more reply EMTs, and a last offspring EMT. The method is further carried out by compacting the EMT discussion threads, and indexing the compacted EMT discussion threads.
    Type: Application
    Filed: June 12, 2012
    Publication date: October 11, 2012
    Applicant: International Business Machines Corporation
    Inventors: Andrei Z. Broder, Nadav Eiron, Marcus Fontoura, Michael Herscovici, Ronny Lempel, John McPherson, JR., Eugene Shekita
  • Patent number: 8285706
    Abstract: Human computation games are provided wherein a player is shown a page, such as a web page. The player is then asked to provide one or more terms that are intended to cause a search engine to return the page in response to performing a query using the terms. The terms provided by the player during game play are then collected, stored, and utilized to improve the performance of the search engine.
    Type: Grant
    Filed: June 10, 2009
    Date of Patent: October 9, 2012
    Assignee: Microsoft Corporation
    Inventors: Raman Chandrasekar, Christopher Brian Quirk, Sarthak Deepak Shah, Matthew Richardson, Christopher John Champness Burges, Abhishek Gupta, Hao Ma
  • Patent number: 8285709
    Abstract: In one embodiment, a method includes concurrently executing a set of multiple queries, through a processor, to improve a resource usage within a data warehouse system. The method also includes permitting a group of users of the data warehouse system to simultaneously run a set of queries. In addition, the method includes applying a high-concurrency query operator to continuously optimize a large number of concurrent queries for a set of highly concurrent dynamic workloads.
    Type: Grant
    Filed: May 12, 2010
    Date of Patent: October 9, 2012
    Assignee: Teradata US, Inc.
    Inventors: George Candea, Neoklis Polyzotis
  • Patent number: 8285594
    Abstract: Some embodiments of the invention include tools for extracting data of interest from the world wide web (WWW). The extraction is accomplished using descriptions of data of interest. The descriptions of data of interest can include computer programs comprising a sequence of instructions and extractor patterns. The extractor patterns can be developed interactively using a web browser integrated into the graphical development environment for creating the descriptions of data of interest. The instructions can be selected from a predetermined list of instructions designed for extracting information from the WWW. The descriptions of data of interest can be grouped into categories sharing common query elements. Multiple descriptions of data of interest in the same category can executed simultaneously using the same query. The descriptions of data of interest can be accessed by a client computer using a web browser to initiate a query.
    Type: Grant
    Filed: August 31, 2009
    Date of Patent: October 9, 2012
    Assignee: CBS Interactive Inc.
    Inventors: Yeogirl Yun, Brad Park
  • Publication number: 20120246143
    Abstract: A method of responding to a request for web page includes the steps of receiving a request, extracting search query parameters from a request and redirecting to a mapped web page or alternatively responding with the requested web page. The search query parameters and requested web page are associated with each and stored for later processing and assignment.
    Type: Application
    Filed: March 26, 2012
    Publication date: September 27, 2012
    Inventor: Lee Roberts
  • Publication number: 20120246144
    Abstract: Methods, apparatuses, and articles for receiving a search request associated with a transportation service from a client device, the search request including a plurality of search criteria, are described herein. Additionally, the methods, apparatuses, and articles further return to the client device an answer page having a plurality of answers potentially associated with the transportation service, the plurality of answers identifying a plurality of information locations having information potentially associated with at least a one of the plurality of search criteria, where at least one of the answers includes at least one input field of a query answer page for entry of at least one feature of the transportation service, the query answer page to be dynamically generated by one of the information locations in response to a query.
    Type: Application
    Filed: June 7, 2012
    Publication date: September 27, 2012
    Applicant: DEEP WEB, LLC
    Inventor: Michael Z. Moricz