Index Generation Patents (Class 707/711)
  • Publication number: 20120239639
    Abstract: A search engine to index web content with user content. A server computer receives, from a first client computer operated by a first user, an identification of first web content displayed by a web browser of the first client computer in a main browser window. The identification of the first web content is transmitted by the first user to the server computer via a user interface separate from the main browser window. The server computer then indexes the first web content. In response to receiving a search query from a web browser of a second client computer operated by a second user, the server computer transmits search results to the web browser of the second client computer. The search results include the first web content identified by the first user in a position relative to identifications of other web content received from other users.
    Type: Application
    Filed: March 14, 2012
    Publication date: September 20, 2012
    Applicant: SLANGWHO, INC.
    Inventors: John S. Shriber, Roman Zaks
  • Publication number: 20120239640
    Abstract: A mobile station is arranged to determine its location, which is searched against street addresses from a database, and at least one matching street address is retrieved. The street address is searched on the mobile station and/or over the network. The search engine conducts a search in the mobile station file system and/or the Internet and/or a file system over the network with the at least one query term, and at least one search result is arranged to be displayed to the user on the screen of the mobile station. This facilitates on-demand, effortless Mobile Internet Search that allows users to access opportunities that they did not know about, or would not have had time to find out about, with minimum effort, as the software of the mobile phone scans the Internet and information pages for these opportunities and displays the results dynamically on the mobile phone screen.
    Type: Application
    Filed: April 17, 2012
    Publication date: September 20, 2012
    Inventor: Mikko VÄÄNÄNEN
  • Patent number: 8271468
    Abstract: A method and system is provided for maintaining customer data in a data store system utilizing a scalable partitioning framework. More specifically, the data store of a customer service system is partitioned (divided) into multiple partitions by a partitionable key of customer data so that each partition owns a subset of the customer data. By maintaining several partitions that store a subset of the customer data exclusive to each other, blackout or brownout problems can be local to one partition and, thus, the availability for the entire system will be increased. Moreover, a set of partitionable keys, a minimal unit to be moved between partitions, is grouped and associated with a partition. By eliminating direct dependencies between the partitions and the partitionable keys, the system can have great flexibility with regard to migrating customer data between partitions and adding a new partition.
    Type: Grant
    Filed: July 1, 2010
    Date of Patent: September 18, 2012
    Assignee: Amazon Technologies, Inc.
    Inventors: Nicholas K. Peddy, Chelsea C. Krueger, Michael J. McCann, Suguna Subramaniam, W. Cary Hall
  • Patent number: 8266152
    Abstract: Indexing data items into an index. A method includes identifying a parameter pattern for a data item stored in a data store record of a data store. The data store record has a data store location identifier. The method further includes identifying a hash of the parameter pattern. The data store location identifier is correlated to at least a portion of the hash in the index. The index includes index entries where each index entry includes at least a portion of a hash and one or more references to data store records by reference to data store location identifiers.
    Type: Grant
    Filed: August 30, 2007
    Date of Patent: September 11, 2012
    Assignee: Perfect Search Corporation
    Inventor: Ronald P. Millett
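    The abstract of patent 8266152 above describes index entries that pair a portion of a hash with references to data store location identifiers. The following is a minimal Python sketch of that general idea, not the patented method: the choice of SHA-256, the 16-bit portion size, and the names `pattern_hash` and `HashPortionIndex` are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict


def pattern_hash(parameter_pattern: str) -> int:
    """Return a 64-bit hash of a parameter pattern (SHA-256 is an assumed choice)."""
    digest = hashlib.sha256(parameter_pattern.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class HashPortionIndex:
    """Hypothetical index keyed by the leading bits of a pattern hash."""

    def __init__(self, portion_bits: int = 16):
        self.portion_bits = portion_bits
        # Each entry: portion of a hash -> set of data store location identifiers.
        self.entries = defaultdict(set)

    def _portion(self, parameter_pattern: str) -> int:
        # Keep only the top `portion_bits` bits of the 64-bit hash.
        return pattern_hash(parameter_pattern) >> (64 - self.portion_bits)

    def add(self, parameter_pattern: str, location_id: int) -> None:
        self.entries[self._portion(parameter_pattern)].add(location_id)

    def candidates(self, parameter_pattern: str) -> set:
        # Returns a copy so callers cannot mutate the index by accident.
        return set(self.entries.get(self._portion(parameter_pattern), set()))


index = HashPortionIndex()
index.add("john smith", 17)
index.add("john smith", 42)
matches = index.candidates("john smith")
```

    Because only part of the hash is stored, two different patterns can share an entry, so a caller would treat the returned identifiers as candidates and confirm them against the data store records.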
  • Patent number: 8266136
    Abstract: Embodiments of the present invention provide a novel technique for maintaining directory-based quota, for performing directory notification to a client system, and performing other such operations by efficiently finding ancestral directory associations on a server system without performing multiple accesses to data structures associated with the directory associations. Since data structures do not need to be accessed multiple times, performance of the server system can be improved.
    Type: Grant
    Filed: April 13, 2009
    Date of Patent: September 11, 2012
    Assignee: NetApp, Inc.
    Inventors: Prashant Pogde, Manish M. Agarwal
  • Patent number: 8266135
    Abstract: A method, system, and article are provided for evaluating regular expressions over large data collections. A general purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify a set of documents in the collection of documents over which the regular expression can be evaluated. Based upon the query execution, the identified set of documents is returned for evaluation by the regular expression responsive to execution of the query over the index.
    Type: Grant
    Filed: January 5, 2009
    Date of Patent: September 11, 2012
    Assignee: International Business Machines Corporation
    Inventors: Ting Chen, Rajasekar Krishnamurthy, Shivakumar Vaithyanathan
  • Patent number: 8260766
    Abstract: A method of processing documents is described. The method includes the operation of receiving a document in a search engine crawler. The document includes an embedded first link tag. The first link tag includes one or more information pairs. A respective information pair includes a respective parameter and a corresponding value. The parameters in the one or more information pairs may correspond to content at one or more content locations or one or more document locations. The method also includes selecting a method of processing content associated with the first link tag in accordance with one or more of the information pairs.
    Type: Grant
    Filed: July 12, 2011
    Date of Patent: September 4, 2012
    Assignee: Google Inc.
    Inventors: Krishna Bharat, Matthew Daniel Cutts, Paul G. Haahr, Radhika A. Malpani, Vibhu Mittal, Marcin Kaszkiel
  • Patent number: 8255386
    Abstract: A system may identify a collection of documents, predict a utility score for each document in the collection of documents, and create a ranked list of the documents based on the predicted utility scores for the documents. The system may also select a number of the documents in the ranked list of documents, index the selected documents, and store the index.
    Type: Grant
    Filed: January 30, 2008
    Date of Patent: August 28, 2012
    Assignee: Google Inc.
    Inventors: Thomas M. Annau, Charles Garrett, Sharmila Subramaniam, Sanjay Joshi, Rosemary Emery-Montemerlo, Aaron A. D'Souza, Jonathan D. Wilson, Christopher W. Fraser
  • Patent number: 8250060
    Abstract: A file uploading method with a function of abstracting index information in real time and a web storage system using the same are provided. The file uploading method includes: performing a data filtering on an upload target file transmitted from a user through an online network; abstracting text information as index information of the filtered upload target file; if the upload target file is not a text file, abstracting index information by abstracting text information from the filtered upload target file or converting the file into a text format by using specific information convertible into a text format; creating an inverted file by using the abstracted text information; establishing an index database by using the abstracted text information; and storing and managing the upload target file, when the transmission of the upload target file in which the index information is stored through the index database establishing operation is completed.
    Type: Grant
    Filed: August 7, 2009
    Date of Patent: August 21, 2012
    Assignee: ESTsoft Corp.
    Inventor: Yoon Sik Oh
  • Patent number: 8244701
    Abstract: Systems and methods for applying user behavior data to improve search query result ranking are provided. Upon receiving an update file indicating that recent, significant user behavior data is available for a document associated with an inverted index, the update file is published periodically and frequently to an index server. After filtering out the relevant update information from the update file, the index server extracts identifiers of the documents having the associated user behavior data. The update file and the identifier of the documents are utilized to update an in-memory index containing representations of metadata indicative of the user behavior. The in-memory index is continuously updated and utilized to serve search query results in response to user search queries. Search query results from the in-memory index are ranked using the user behavior data prior to serving. Thus, results associated with recent, significant user-behavior metadata receive prominent placement on the search results page.
    Type: Grant
    Filed: June 27, 2011
    Date of Patent: August 14, 2012
    Assignee: Microsoft Corporation
    Inventors: Walter Sun, Jay Kumar Goyal, Pratibha Permandla, Yinzhe Yu, Jingfeng Li
  • Patent number: 8244700
    Abstract: Systems and methods for performing an updating process to an in-memory index are provided. Upon receiving notice of document modifications covered by an inverted index associated with a search engine, in the form of an update file, a representation of the modification is published onto various index serving machines. Each index serving machine receiving the update file determines if the modifications are applicable to the index serving machine. If an index serving machine determines that it contains mapping information corresponding to the modified documents, the index serving machine utilizes the update file and associated mapping information to update an in-memory index. In embodiments, the in-memory index is used to provide results to user queries in tandem with the inverted index. In some embodiments, an extra in-memory index is maintained that is revised with constantly incoming metadata updates and the existing in-memory index is periodically swapped with the revised in-memory index.
    Type: Grant
    Filed: February 12, 2010
    Date of Patent: August 14, 2012
    Assignee: Microsoft Corporation
    Inventors: Pratibha Permandla, Yinzhe Yu, Guarav Sareen, Abhas Kumar
  • Patent number: 8244711
    Abstract: A system, method, and apparatus for information retrieval are provided. Embodiments of the present invention may generate data structures that may be used to process user queries. According to embodiments of the present invention, a processor component is configured to perform the operations of an indexing module and a storage module. The indexing module is configured to generate a term list and a term-file matrix from information stored on the storage module, and to generate an adjacency matrix from the one or more files, wherein the adjacency matrix represents a relationship of the one or more terms in each of the one or more files. The indexing module is further configured to generate a probability matrix using the adjacency matrix and a one-step or two-step random walk.
    Type: Grant
    Filed: September 28, 2009
    Date of Patent: August 14, 2012
    Inventor: Chin Lung Fong
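    The abstract of patent 8244711 above mentions a term-file matrix, an adjacency matrix, and a probability matrix obtained from a one-step or two-step random walk. The sketch below shows one plausible reading in plain Python. It assumes that two terms are adjacent when they co-occur in a file and that the one-step probabilities are the row-normalized adjacency counts; the abstract does not specify either, and all function names are hypothetical.

```python
def build_term_file_matrix(files):
    """Return (terms, matrix) where matrix[i][j] is 1 if term i occurs in file j."""
    tokenized = [set(text.lower().split()) for text in files]
    terms = sorted(set().union(*tokenized)) if tokenized else []
    matrix = [[1 if term in tokens else 0 for tokens in tokenized] for term in terms]
    return terms, matrix


def adjacency(term_file):
    """Count, for each pair of distinct terms, the files in which both occur (assumed relation)."""
    n = len(term_file)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                adj[i][j] = sum(a * b for a, b in zip(term_file[i], term_file[j]))
    return adj


def one_step(adj):
    """Row-normalize the adjacency counts into one-step transition probabilities."""
    probs = []
    for row in adj:
        total = sum(row)
        probs.append([value / total if total else 0.0 for value in row])
    return probs


def two_step(probs):
    """Multiply the one-step matrix by itself to get two-step probabilities."""
    n = len(probs)
    return [
        [sum(probs[i][k] * probs[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


terms, term_file = build_term_file_matrix(["apple banana", "banana cherry"])
p1 = one_step(adjacency(term_file))
p2 = two_step(p1)
```

    In this toy corpus "apple" and "cherry" never share a file, so they are unrelated after one step but become related after two steps through "banana".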
  • Patent number: 8244674
    Abstract: A system is provided for locating peers having a desired expertise. User profile information is stored in a profiles database. A search engine indexes the profiles database and appends appropriate profile tags to this information. A peer relevancy algorithm searches for candidate peers among the indexed user profile information. Weights are assigned to candidate peers based on different categories of the indexed user profile information, and peer matches are selected based on the assigned weights. In order to provide matches that are most likely to accept a connection request, data is maintained as to which potential peers have a history of accepting requests to connect and which have a history of refusing to connect. Potential matches are biased to favor those that have a tendency to accept connection requests. Contact information of requester and recipients are not disclosed until the recipient accepts the requester's connection request.
    Type: Grant
    Filed: December 2, 2009
    Date of Patent: August 14, 2012
    Assignee: Gartner, Inc.
    Inventors: Ken Davis, Nir Polonsky, Girish Malangi, Malini Vittal, Tirath Mehta, Maria Patterson
  • Patent number: 8238669
    Abstract: A system and method detects matches between portions of video content. A matching module receives an input video fingerprint representing an input video and a set of reference fingerprints representing reference videos in a reference database. The matching module compares the reference fingerprints and input fingerprints to generate a list of candidate segments from the reference video set. Each candidate segment comprises a time-localized portion of a reference video that potentially matches the input video. A classifier is applied to each of the candidate segments to classify the segment as a matching segment or a non-matching segment. A result is then outputted identifying a matching portion of a reference video from the reference video set based on the segments classified as matches.
    Type: Grant
    Filed: July 16, 2008
    Date of Patent: August 7, 2012
    Assignee: Google Inc.
    Inventors: Michele Covell, Jay Yagnik, Jeff Faust, Shumeet Baluja
  • Publication number: 20120197864
    Abstract: A method and system provides a search interface that permits a user to interrogate a structured database, and includes retrieving database entries from one or more databases, flattening a plurality of database entries, indexing the plurality of flattened database entries to form a search engine index, and prompting the user to enter an input. The system continuously monitors the user input and each time an input is entered by the user, the system computes a set of non-null partial queries in response to the input entered by the user, associates a structured item to each non-null partial query, and allows the user to select one of the structured items. If the user selects one of the structured items, the system replaces the user input by the non-null partial query associated to the selected structured item. When the user validates the input, the system executes the input as a query. Finally, the system provides documents to the user corresponding to the executed query.
    Type: Application
    Filed: August 26, 2010
    Publication date: August 2, 2012
    Inventors: François Bourdoncle, Florian Douetteau, Jérémie Bordier
  • Patent number: 8233768
    Abstract: Playback and distribution systems and methods for multimedia files are provided. The multimedia files are encoded with indexes associated with the content data of the multimedia files. Through the use of the indexes, playback of the content is enhanced without significantly increasing the file size of the multimedia file.
    Type: Grant
    Filed: November 17, 2008
    Date of Patent: July 31, 2012
    Assignee: DIVX, LLC
    Inventors: Kourosh Soroushian, Roland Osborne, Jason Braness, Shaiwal Priyadarshi
  • Patent number: 8234263
    Abstract: A dynamic classification dictionary is built for use in profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the document are assigned a second taxonomic noun. Search terms that resulted in the user accessing the document are identified and assigned a third taxonomic noun. Attributes related to the manner in which the document was accessed are evaluated and assigned a fourth taxonomic noun. The document is processed using pattern rules to extract a fifth taxonomic noun. The taxonomic nouns are aggregated into a composite set of taxonomic nouns, and the dynamic classification dictionary is built by storing the composite set of taxonomic nouns.
    Type: Grant
    Filed: January 29, 2010
    Date of Patent: July 31, 2012
    Assignee: CBS Interactive Inc.
    Inventors: Tushar Pradhan, Thomas Osborne, John Potter
  • Publication number: 20120191695
    Abstract: A local search engine geographically indexes information for searching by identifying a geocoded web page of a web site and identifying at least one geocodable web page of the web site. The system identifies a geocode contained within content of the geocoded web page of the web site. The geocode indicates a physical location of an entity associated with the web site. The system indexes content of the geocoded web page and content of the geocodable web page. The indexing including associating the geocode contained within content of the geocoded web page to the indexed content of the geocoded web page and the geocodable web page to allow geographical searching of the content of the web pages.
    Type: Application
    Filed: April 3, 2012
    Publication date: July 26, 2012
    Applicant: Local.com Corporation
    Inventor: Xiongwu Xia
  • Patent number: 8229971
    Abstract: A computer system configures data elements based on textual sources by identifying subunits of a textual source, indexing the subunits into a sequence comprised of terms, identifying, based on a target, a base subsequence of the sequence, and storing the terms in such a way that they can be expanded or contracted, so that a user can rapidly and efficiently derive relevant information and context even from a vast amount of information, including by navigable display to the user. Other methods and systems of configuring and displaying data elements from textual sources are provided.
    Type: Grant
    Filed: September 29, 2008
    Date of Patent: July 24, 2012
    Assignee: Efrem Meretab
    Inventor: Efrem Meretab
  • Patent number: 8229906
    Abstract: A system and method for maintaining version information. An identifier (“ID”) that identifies a collection of associated files is obtained. An index is generated that specifies the contents of the collection of associated files. The ID may be saved along with the index in a target version file to convey version information about the collection of associated files. Subsequently, the index may be extracted from the target version file to compare with a corresponding index extracted from a reference version file. The result of the comparison may be used to determine whether the contents of the collection of associated files match a reference.
    Type: Grant
    Filed: May 14, 2004
    Date of Patent: July 24, 2012
    Assignee: SAP AG
    Inventor: Krasimir P. Semerdzhiev
  • Patent number: 8229916
    Abstract: There is provided, in a parallel pipelined structure on a multi-core device, a method for parallel pipelined multi-core indexing. The method includes generating one or more single document indexes respectively corresponding to one or more single documents of a given data stream. The method further includes generating one or more multi-document interval-based hash tables from the one or more single document indexes. The method also includes generating a global hash table formed from merging one or more of the multi-document interval-based hash tables, the global hash table representing a collective index for all of the single documents for which the one or more single document indexes were generated.
    Type: Grant
    Filed: October 9, 2008
    Date of Patent: July 24, 2012
    Assignee: International Business Machines Corporation
    Inventors: Ankur Narang, Vikas Agarwal, Vijay Kumar Garg, Douglas James Joseph, Monu Kedia, Maged M. Michael
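    The abstract of patent 8229916 above outlines three stages: per-document indexes, interval-based hash tables covering several documents, and a merged global hash table. The Python sketch below illustrates that staging only. It uses a thread pool as a stand-in for the multi-core pipeline, whitespace tokenization, and an interval size of two documents; these are assumptions, as are all the function names.

```python
from collections import Counter, defaultdict
from concurrent.futures import ThreadPoolExecutor


def index_document(doc):
    """Stage 1: build a single-document index (term -> frequency)."""
    doc_id, text = doc
    return doc_id, Counter(text.lower().split())


def build_interval_table(single_indexes):
    """Stage 2: combine the single-document indexes of one interval into a hash table."""
    table = defaultdict(dict)
    for doc_id, counts in single_indexes:
        for term, freq in counts.items():
            table[term][doc_id] = freq
    return table


def merge_tables(tables):
    """Stage 3: merge interval tables into one global table (term -> {doc_id: freq})."""
    global_table = defaultdict(dict)
    for table in tables:
        for term, postings in table.items():
            global_table[term].update(postings)
    return dict(global_table)


def build_global_index(docs, interval_size=2, workers=4):
    # Single-document indexing is independent per document, so it can run in parallel.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        single = list(pool.map(index_document, docs))
    intervals = [single[i:i + interval_size] for i in range(0, len(single), interval_size)]
    tables = [build_interval_table(chunk) for chunk in intervals]
    return merge_tables(tables)


docs = [(1, "a b a"), (2, "b c"), (3, "c c a")]
global_index = build_global_index(docs)
```

    Grouping documents into intervals before the final merge keeps each intermediate table small; in a real pipeline the interval and merge stages could also run concurrently, which this sketch does not attempt.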
  • Patent number: 8229905
    Abstract: Systems and methods for creating a Token and using it to operate and modify a case file in a document management system are described. An exemplary embodiment of a Token includes a representation of a case file. A Token may include a representation of a document associated with the case file, where the representation may include a brief description or visual summary of the document with an associated link to an electronic version of the document. In another exemplary embodiment, the Token may be used to facilitate retrieval and viewing of an electronic version of a document using its associated link. In another exemplary embodiment, the Token may be used to update the case file based on user input on the Token.
    Type: Grant
    Filed: June 22, 2005
    Date of Patent: July 24, 2012
    Assignee: Ricoh Co., Ltd.
    Inventors: Max E. McFarland, Robert Alexander, Michael G. E. Griffin, Jonathan J. Hull
  • Patent number: 8224805
    Abstract: The present disclosure relates to a method for generating a context hierarchy and a system for generating a context hierarchy, and more particularly, to a method for generating a context hierarchy from data streams configured of an infinite set of continuously generated transactions and a system for generating a context hierarchy from the data streams.
    Type: Grant
    Filed: March 18, 2011
    Date of Patent: July 17, 2012
    Assignee: Industry-Academic Cooperation Foundation, Yonsei University
    Inventor: Won Suk Lee
  • Patent number: 8224804
    Abstract: A computing system hosts a plurality of websites, a plurality of external data sources, and a search service. External resources in each of the external data sources are partitioned among a plurality of external partitions. The search service automatically generates an index comprising index entries. The index entries are partitioned among a plurality of index partitions, each associated with one or more of the websites. For each given index partition, the index entries in the given index partition map terms to resources in the given set of resources that contain the terms. The given set of resources includes web pages in a given set of websites associated with the given index partition and external resources in external partitions used by the given set of websites. The search service uses the index entries in the given index partition to identify resources in the given set of resources containing query terms.
    Type: Grant
    Filed: July 22, 2009
    Date of Patent: July 17, 2012
    Assignee: Microsoft Corporation
    Inventors: Umesh Unnikrishnan, Jason M. Cahill, Titus C. Miron, Zachary Rosenfield, William D. Taylor, Brian P. Robbins
  • Patent number: 8219545
    Abstract: An apparatus and system for authenticating and searching data is provided. The apparatus includes a tightly coupled housing, an authenticator in the tightly coupled housing, a first archival storage in the tightly coupled housing, an indexer in the tightly coupled housing, and an index storage in the tightly coupled housing. The authenticator receives a file from an external email system, the authenticator creates first and second self-authenticating copies of the file, the first archival storage stores the first self-authenticating copy, the indexer disassembles the second self-authenticating copy into components of the second self-authenticating copy, and the index storage stores the components of the second self-authenticating copy in a searchable format.
    Type: Grant
    Filed: May 7, 2009
    Date of Patent: July 10, 2012
    Assignee: Goldhar/Jaffe Technology Development Corporation
    Inventors: Jonathan E. Jaffe, Joel D. Goldhar, Michael A. Warot
  • Patent number: 8219563
    Abstract: Techniques are provided for searching within a collection of XML documents. A relational table in an XML index stores an entry for each node of a set of nodes in the collection. Each entry of the relational table stores an order key and a path identifier along with the atomized value of the node. An index on the atomized value provides a mechanism to perform a node-aware full-text search. Instead of storing the atomized value in the table, a virtual column may be created to represent, for each node, the atomized value of the node. Alternately, each entry of the relational table stores an order key and a path identifier along with, for simple nodes, the atomized value, and for complex nodes, a null value. For a complex node with a descendant text node, a separate entry is stored for the descendant text node in the relational table.
    Type: Grant
    Filed: December 30, 2008
    Date of Patent: July 10, 2012
    Assignee: Oracle International Corporation
    Inventors: Thomas Baby, Zhen Hua Liu, Wesley Lin
  • Patent number: 8219543
    Abstract: An embodiment of a method includes receiving a selection of an original content item, deriving a supplemental content item from the original content item, designating a unique identifier to the supplemental content item, and inserting a reference including the unique identifier into the primary document. A system includes a candidate content item repository having one or more content items that can be included in a structured electronic document, a copy module configured to derive a base supplemental content item from a selected one of the candidate content items and further configured to designate a unique identifier for the base supplemental content item, and a document editor configured to receive the unique identifier and insert the unique identifier into a reference to the base supplemental content in the primary document. A computer-readable medium has instructions to cause a computer to carry out a structured document generation process.
    Type: Grant
    Filed: June 12, 2007
    Date of Patent: July 10, 2012
    Assignee: Etrial Communications, Inc.
    Inventors: David Michael Vanderport, Kristen Marie Vanderport, Tadd Loebbaka
  • Patent number: 8219544
    Abstract: A method and a computer program product for indexing files, the method includes: generating, by a remote indexing entity, user consolidated files indexing information representative of content of user files that are stored in either one of a user machine and in at least one server.
    Type: Grant
    Filed: March 17, 2008
    Date of Patent: July 10, 2012
    Assignee: International Business Machines Corporation
    Inventors: Michael Bluger, Nadav Golbandi, Nadav Y. Har'el
  • Patent number: 8219583
    Abstract: Mining of websites that in one embodiment includes obtaining web usage data of user sessions of a website, wherein the website has a hierarchical structure with granular levels and has mapping from each webpage of the website into the hierarchical structure, mapping the user sessions to the hierarchical structure of the website resulting in hierarchical user sessions, initiating an edit distance metrics to determine similarity in the hierarchical user sessions, and clustering similar hierarchical user sessions into groups.
    Type: Grant
    Filed: November 10, 2008
    Date of Patent: July 10, 2012
    Assignee: NBCUniversal Media, LLC
    Inventors: Abha Moitra, Steven Matt Gustafson, Feng Xue
  • Publication number: 20120173510
    Abstract: A priority hash index provides efficient lookup of posting lists for search query terms. The priority hash index is a data structure in which hash values for terms are distributed across multiple storage devices based on importance of the terms and access speeds of the storage devices. Terms are grouped into search lists with each search list including a storage location on each storage device. When a search query is received, a term is identified and hashed to a location on the first storage device and to generate a unique hash value for the term. The locations on the storage device for the term's search list are sequentially read until the hash value for the term is located to access the posting list for the term.
    Type: Application
    Filed: December 29, 2010
    Publication date: July 5, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: KNUT MAGNE RISVIK, MICHAEL HOPCROFT, JOHN G. BENNETT, KARTHIK KALYANARAMAN, TRISHUL CHILIMBI, HUI SHEN
  • Publication number: 20120173511
    Abstract: There are provided a file search system and program that perform efficient searches by creating, with respect to a file search environment where files for which a full-text search should be allowed and files for which a full-text search is unnecessary coexist, different index files. With respect to a file search system in which a file search server, a file server and a client are interconnected via a communications line 9, the file search server 1 comprises: metadata search means adapted to select, upon receiving a search request from the client 3, metadata matching records from an index 2 file based on a conditional search expression; full-text search means adapted to perform a search with respect to an index 1 file by referencing keywords based on the conditional search expression and the metadata matching records; and means adapted to transmit the search result to the client 3.
    Type: Application
    Filed: September 3, 2010
    Publication date: July 5, 2012
    Applicant: HITACHI SOLUTIONS, LTD.
    Inventors: Tatsuya Eto, Masakazu Matsuura, Seiya Tsuruta
  • Patent number: 8214346
    Abstract: Unstructured electronic documents are classified for profiling and targeting users for additional relevant content. Behavioral data is gathered from user activity, and user documents and actions are categorized. Profile information is combined with collaborative and editorial data to provide users with credible information regarding products. Author-generated document classification information is analyzed and assigned a first taxonomic noun to characterize the document. User-generated tags characterizing a portion of the document are assigned a second taxonomic noun. Search terms that resulted in the user accessing the document are identified and assigned a third taxonomic noun. Attributes related to how the document was accessed are evaluated and assigned a fourth taxonomic noun. The document is processed using pattern rules to extract a fifth taxonomic noun.
    Type: Grant
    Filed: January 30, 2009
    Date of Patent: July 3, 2012
    Assignee: CBS Interactive Inc.
    Inventors: Tushar Pradhan, Thomas Osborne, John Potter
  • Publication number: 20120166415
    Abstract: Embodiments are provided for creating a personalized search index and supplementing searches with an additional set of keywords. A first set of keywords is received from a user. A first index associated with the user is searched using the first set of keywords to generate a first set of search results. The first index includes content identified by the user. A second set of keywords is defined based on the first set of search results. A second index is searched using the second set of keywords to generate a second set of search results. At least a portion of the first and second sets of search results is provided to the user. Some embodiments are implemented as a cloud computing service.
    Type: Application
    Filed: December 23, 2010
    Publication date: June 28, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: James Henry Lewallen, Cezary Marcjan, Matthew Bret MacLaurin, Todd Daniel Newman, David Shigeru Taniguchi
  • Publication number: 20120166416
    Abstract: Techniques are provided for predicting locations of users that submit search queries. A query is received at a search engine. An inverted index is searched to identify one or more geographical locations associated with one or more terms of the received query. The inverted index lists a plurality of query terms and one or more geographical locations associated with each query term. Each geographic location that is associated with a listed query term in the inverted index is a determined location for at least one user previously having submitted the listed term in a search query. A geographical location is predicted for a user that submitted the received query based on the identified one or more geographical locations. In this manner, a location is predicted for the user based on similar queries previously submitted by users.
    Type: Application
    Filed: December 23, 2010
    Publication date: June 28, 2012
    Applicant: YAHOO! INC.
    Inventors: Vanessa Murdock, Hugues Bouchard
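The lookup described in the abstract for publication 20120166416 can be illustrated with a minimal sketch. It assumes a query log of (query, known location) pairs and predicts by simple vote counting across the query's terms; the function names and the voting rule are hypothetical and are not taken from the application.

```python
from collections import Counter, defaultdict

def build_inverted_index(query_log):
    """Map each query term to a Counter of locations of users who submitted it."""
    index = defaultdict(Counter)
    for query, location in query_log:
        for term in query.lower().split():
            index[term][location] += 1
    return index

def predict_location(index, query):
    """Return the location with the most votes across the query's terms, or None."""
    votes = Counter()
    for term in query.lower().split():
        # .get avoids creating empty entries for unseen terms
        votes.update(index.get(term, {}))
    return votes.most_common(1)[0][0] if votes else None

# Hypothetical log: each query is paired with the submitting user's known location.
log = [
    ("ferry building parking", "San Francisco"),
    ("ferry schedule", "Seattle"),
    ("ferry building hours", "San Francisco"),
]
index = build_inverted_index(log)
prediction = predict_location(index, "ferry building")
```

A real system would likely weight terms and normalize by how common each location is; plain counts are used here only to keep the idea visible.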
  • Patent number: 8204871
    Abstract: A file server has an extent cache of pointers to leaf-level indirect blocks containing file mapping metadata. The extent cache improves file access read and write performance by returning a mapping for the data blocks to be read or written without having to iterate through intermediate level indirect blocks of the file. In addition, the extent cache contains pointers to the leaf-level indirect blocks in the file system cache. Therefore, in most cases, the time spent looking up pointers in the extent cache is offset by a reduction in the time that would otherwise be spent in locating the leaf-level indirect blocks in the file system cache. In a preferred implementation, the extent cache has a first least recently used (LRU) list and cache entry allocation for production files, and a second LRU list and cache entry allocation for snapshot copies of the production files.
    Type: Grant
    Filed: September 28, 2010
    Date of Patent: June 19, 2012
    Assignee: EMC Corporation
    Inventors: Sitaram Pawar, Philippe Armangau, Yubing Wang, Christopher Seibel, Soumyadeep Sen
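The preferred implementation in patent 8204871 keeps separate LRU lists and entry allocations for production files and snapshot copies. A minimal sketch of that separation follows, assuming string keys and integer block pointers; the class name, method names, and key format are hypothetical and do not come from the patent.

```python
from collections import OrderedDict

class ExtentCache:
    """Two independent LRU lists, each with its own entry allocation."""

    def __init__(self, production_capacity, snapshot_capacity):
        self._lists = {
            "production": (OrderedDict(), production_capacity),
            "snapshot": (OrderedDict(), snapshot_capacity),
        }

    def put(self, kind, key, block_pointer):
        entries, capacity = self._lists[kind]
        entries[key] = block_pointer
        entries.move_to_end(key)          # mark as most recently used
        if len(entries) > capacity:
            entries.popitem(last=False)   # evict the least recently used entry

    def get(self, kind, key):
        entries, _ = self._lists[kind]
        if key not in entries:
            return None
        entries.move_to_end(key)          # a hit refreshes recency
        return entries[key]

cache = ExtentCache(production_capacity=2, snapshot_capacity=1)
cache.put("production", "fileA:0", 101)
cache.put("production", "fileB:0", 102)
cache.get("production", "fileA:0")        # fileA becomes most recently used
cache.put("production", "fileC:0", 103)   # evicts fileB from the production list
cache.put("snapshot", "snap1:0", 201)     # does not affect the production list
```

Keeping the two lists apart means heavy snapshot traffic cannot evict mappings for production files, which appears to be the motivation for the split allocation.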
  • Patent number: 8204313
    Abstract: A high-density, distance-measuring laser system and an associated computer that processes the data collected by the laser system. The computer determines a data partition structure and stores that structure as a header file for the scan before data is collected. As the scan progresses, the computer collects data points until a predetermined threshold is met, at which point a block of data consisting of the data points up to the threshold is written to disk. The computer indexes each data block using all three coordinates of its constituent data points using, preferably, a flexible index, such as an R-tree. When a data block is completely filled, it is written to disk preferably with its index and, as a result, each data block is ready for access and manipulation virtually immediately after having been collected. Also, each data block can be independently manipulated and read from disk.
    Type: Grant
    Filed: September 2, 2008
    Date of Patent: June 19, 2012
    Assignee: Leica Geosystems AG
    Inventors: Mark Damon Wheeler, Barry Joel Schwarz, Richard William Bukowski, Minghua Wu
  • Publication number: 20120150839
    Abstract: A message is delivered to an external indexing service when a link is located within a given communication. The message includes a request to index content contained within the link to ensure that the link is indexed. When a keyword search query is performed within the communication system, both a local index and an external index associated with the external indexing service are queried. Results from the query are merged and surfaced.
    Type: Application
    Filed: December 8, 2010
    Publication date: June 14, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Tore Sundelin, Alexander Wetmore, James Edelen, Jorge Pereira
  • Publication number: 20120150840
    Abstract: A search server interconnection method, a search server, and a system for mobile search are disclosed. The method includes: sending an information list of a first group of member engines to a search server; receiving an information list of a second group of member engines, where the information list of the second group of member engines is returned by the search server; and sending meta-index information of member engines in the information list of the second group of member engines to the search server. In the present invention, an effective mobile search service across search servers is realized.
    Type: Application
    Filed: February 17, 2012
    Publication date: June 14, 2012
    Applicant: Huawei Technologies Co., Ltd.
    Inventors: Hanqiang Hu, Chong Gu, Jiangtao Jia
  • Publication number: 20120150838
    Abstract: Technologies pertaining to fact lookup queries are described herein. A relational database is automatically built by extracting attribute identities and attribute values from a one dimensional table, wherein the one dimensional table does not include an entity identity that corresponds to the attribute identity and the attribute value. The entity identity is inferred, and the attribute value is indexed in a relational database by the entity identity and the attribute identity. When a query is issued by a user that includes the entity identity and the attribute identity, the corresponding attribute value in the relational database is returned to the user.
    Type: Application
    Filed: December 8, 2010
    Publication date: June 14, 2012
    Applicant: Microsoft Corporation
    Inventors: Xiaoxin Yin, Wenzhao Tan, Chao Liu
  • Patent number: 8200663
    Abstract: A system and method for improving the relevance of search results is disclosed. Voters who may be human searchers or guides may review search results or other review items associated with a search request or other reference item. A review may be activated based on a usage indicator(s) which may improve utilization of guides. A vote by a voter may be weighted based on a voting history associated with the voter and one or more reference voters who may be designated by the system. A voter may be presented with a group of items for review, including simultaneously. A number of comparison voting sessions or elections may be used to determine a rating or ranking of a review item associated with a reference item.
    Type: Grant
    Filed: April 25, 2008
    Date of Patent: June 12, 2012
    Assignee: ChaCha Search, Inc.
    Inventors: Scott A. Jones, Thomas E. Cooper
  • Patent number: 8195639
    Abstract: There is provided an information processing apparatus including a music storage portion to store music data, a music playback portion to play back music data, a metadata acquisition portion to acquire metadata describing information about the music data being played back in the music playback portion, a metadata transmission portion to transmit the metadata acquired in the metadata acquisition portion to the distribution server, a music reception portion to receive music data selected in the distribution server based on the information contained in the metadata and distributed from the distribution server, a music acquisition portion to acquire the music data from the music storage portion based on the information contained in the metadata, and a control portion to control the music playback portion to play back the music data distributed from the distribution server and the music data acquired from the music storage portion by the music acquisition portion.
    Type: Grant
    Filed: November 4, 2008
    Date of Patent: June 5, 2012
    Assignee: Sony Corporation
    Inventors: Yuki Murata, Soichiro Atsumi
  • Patent number: 8195634
    Abstract: Techniques are disclosed for providing a domain-aware snippet for a search result. With such techniques, a domain classification component is provided for identifying a template used to generate a plurality of web pages of a domain, associating the template and content of the web pages related to the template with a Uniform Resource Locator pattern of the plurality of web pages, and storing the associated template, the related content, and the Uniform Resource Locator pattern in a database. A snippet extraction component is also provided for extracting text from a section of a web page of the plurality of web pages for a snippet of a search result corresponding to a search query, wherein the extracted text is based on a ranking value of the section and the relevance of the extracted text to the search query.
    Type: Grant
    Filed: September 28, 2007
    Date of Patent: June 5, 2012
    Assignee: Microsoft Corporation
    Inventors: Girish Kumar, Fang Liu
  • Patent number: 8190594
    Abstract: One example embodiment includes a method for indexing online references of an entity. The method includes identifying one or more channels of the Internet to be searched for references to an entity and identifying one or more signals to be evaluated within each of the one or more channels. The method also includes crawling the Internet for online references to the entity, wherein crawling the Internet comprises searching the one or more channels of the Internet for references to the entity and evaluating the one or more signals. The method further includes constructing a reverse index of the references, wherein the reverse index is based on each channel in which a reference is found and the one or more signals evaluated for the reference.
    Type: Grant
    Filed: May 6, 2009
    Date of Patent: May 29, 2012
    Assignee: Brightedge Technologies, Inc.
    Inventors: Lemuel S. Park, Jimmy Yu
  • Publication number: 20120130981
    Abstract: Methods are provided for populating search indexes with atoms identified in documents. Documents that are to be indexed are identified, and for each document, atoms are identified and are categorized as unigrams, n-grams, and n-tuples. A list of atom/document pairs is generated such that an information metric can be computed for each pair. An information metric represents a ranking of the atom in relation to the particular document. Based on the information metric, some atom/document pairs are discarded and others are indexed.
    Type: Application
    Filed: March 10, 2011
    Publication date: May 24, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: KNUT MAGNE RISVIK, MIKE HOPCROFT, JOHN G. BENNETT, KARTHIK KALYANARAMAN, TRISHUL CHILIMBI
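The selection step in the abstract for publication 20120130981 (score each atom/document pair, then index or discard it) can be sketched as follows. The abstract does not define the information metric, so a TF-IDF-style score stands in for it here, and only unigrams and bigrams are extracted; both choices, along with all names and the threshold, are assumptions made for illustration.

```python
import math
from collections import Counter

def atoms(text):
    """Yield unigrams and bigrams (a stand-in for the atom types in the abstract)."""
    words = text.lower().split()
    yield from words
    yield from (" ".join(pair) for pair in zip(words, words[1:]))

def select_pairs(docs, threshold):
    """Score every atom/document pair and keep those at or above the threshold."""
    counts = {doc_id: Counter(atoms(text)) for doc_id, text in docs.items()}
    # Number of documents in which each atom appears.
    doc_freq = Counter(atom for c in counts.values() for atom in c)
    kept = {}
    for doc_id, c in counts.items():
        for atom, tf in c.items():
            # Assumed metric: term frequency times inverse document frequency.
            score = tf * math.log(len(docs) / doc_freq[atom])
            if score >= threshold:
                kept[(atom, doc_id)] = score
    return kept

docs = {"d1": "inverted index search", "d2": "search engine"}
kept = select_pairs(docs, threshold=0.5)
```

In this toy corpus the atom "search" appears in every document, scores zero, and is discarded, while atoms unique to one document are kept for indexing.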
  • Patent number: 8185513
    Abstract: A method, apparatus, and article of manufacture for generating a media program database having a plurality of media programs is disclosed. In one embodiment, the method comprises the steps of receiving first media program metadata from a first source, searching the Internet to find second media program metadata from a second source distinct from the first source, determining if the first media program metadata and the second media program metadata are associated with the same media program, merging the first media program metadata and the second media program metadata if the first media program metadata and the second media program metadata are associated with the same media program, and storing the merged first media program metadata and second media program metadata in the media program database.
    Type: Grant
    Filed: December 31, 2008
    Date of Patent: May 22, 2012
    Assignee: Hulu LLC
    Inventors: Zhibing Wang, Yizhe Tang, Qian Chang, Ting-hao Yang
  • Patent number: 8185516
    Abstract: A method for filtering file clusters is presented. In the method, a plurality of advanced filter actions with different filter conditions and independent from each other is performed on an obtained main result file. Thereby, a history record of each advanced filter is kept, and the history record of each advanced filter and respective search results are presented on a target interface in a presentation mode of opening a new page or updating an index list.
    Type: Grant
    Filed: May 4, 2010
    Date of Patent: May 22, 2012
    Assignee: Esobi Inc.
    Inventors: Hong-Yang Tsai, Hung-Hsiang Ku, Hsun-Hsueh Cho
  • Patent number: 8184326
    Abstract: An information dissemination system that obtains information from a multifunction document production device that is programmed to store documents along with user provided dissemination level authorizations and with one or more keywords extracted from the document, and to disseminate such documents according to persons authorized to receive the documents based on prior and real time inquiries of a requestor, with a knowledge agent matching the extracted keywords against the queries.
    Type: Grant
    Filed: June 22, 2005
    Date of Patent: May 22, 2012
    Assignee: Xerox Corporation
    Inventor: Ramesh Nagarajan
  • Publication number: 20120124028
    Abstract: Techniques for discovery of applications across a plurality of application stores are described herein. The discovery may be in response to a query and intent process, which may be initiated by a search query. The search query may reveal user intent to discover an application by operation of a search engine. The user intent may be matched with indexed application content. The application content may be obtained by operation of a discovery and indexing process. A crawler may discover and crawl a plurality of application stores. Additionally or alternatively, a feed store may receive direct feeds of application information and links to download locations from a plurality of application stores. Both the discovered data and direct feeds may be sent to an indexer, to create and maintain an index. Search results pages may be returned to provide users with information, links to developer websites and links to download pages from a plurality of application stores.
    Type: Application
    Filed: November 12, 2010
    Publication date: May 17, 2012
    Applicant: Microsoft Corporation
    Inventors: Steven Charles Tullis, Steven William Macbeth, Ram Nick Talwar, Viswanath Vadlamani, Kamran Rajabi Zargahi
  • Publication number: 20120117007
    Abstract: Systems and methods use machine learning techniques to resolve location ambiguity in search queries. In one aspect, a dataset generator generates a training dataset using query logs of a search engine. A training engine applies a machine learning technique to the training dataset to generate a location disambiguation model. A location disambiguation engine uses the location disambiguation model to resolve location ambiguity in subsequent search queries.
    Type: Application
    Filed: November 4, 2010
    Publication date: May 10, 2012
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Ritesh Jitendra Agrawal, James G. Shanahan
  • Patent number: 8176028
    Abstract: A multimedia administration system with a central media data repository. The system repository contains metadata utilized for various queries including search, channels of distribution and account information. The system organizes and distributes media over an area wide broadcast network. The system uses custom meta tags to selectively update media files and metadata on a scheduled basis automatically without having to manually update this data in the system. The system spiders these meta tags on a scheduled basis to automate the programming of the media content over the broadcast network Platform System. The system also groups, associates, and affiliates various networks into nodes for broadcast over networks. Clients can communicate with users who are watching their videos. The clients can, on demand, present viewers with more content related to the original video content in the form of another video, web link URLs, PDF literature, and/or voice files.
    Type: Grant
    Filed: September 19, 2007
    Date of Patent: May 8, 2012
    Inventor: Alex O. Rodriguez