Index Generation Patents (Class 707/711)
  • Patent number: 8694507
    Abstract: This disclosure describes methods and systems for searching documents in a multi-tenant hosting environment. According to embodiments, to conserve hardware resources, a plurality of documents associated with a plurality of tenants may be mapped to the same search index in the multi-tenant hosting environment. In order to search documents associated only with a single tenant in the multi-tenant hosting environment, a tenant identifier is prepended to every key stored in the search index that is associated with the plurality of documents of the single tenant. Moreover, where one document links to another document within the multi-tenant hosting environment, the link is stored in a web graph when a source tenant identifier matches a target tenant identifier for the link. According to embodiments, when conducting a search, the link is resolved only if the link is stored in the web graph.
    Type: Grant
    Filed: November 2, 2011
    Date of Patent: April 8, 2014
    Assignee: Microsoft Corporation
    Inventors: Dmitriy Meyerzon, Mihai Petriuc, Nicolai Bodd
  • Publication number: 20140095467
    Abstract: Methods and systems are directed to an update-optimized and memory-efficient method for transactional full-text index view maintenance. Embodiments include a method and system of reducing the overhead requirements of maintaining a transactional index view in the case of a large number of updates at any particular time. The method involves maintaining a single blacklist bitmap across updated transactional query views, applying the latest set of updates to the bitmap when a new view is required, and post-adjusting the bitmap when the transactional view is created to ensure that entries that should be visible have not been removed.
    Type: Application
    Filed: September 28, 2012
    Publication date: April 3, 2014
    Applicant: EMC Corporation
    Inventor: EMC Corporation
  • Patent number: 8688605
    Abstract: Provided are techniques for incrementally integrating and persisting context over an available observational space. At least one feature associated with a new observation is used to create at least one index key. The at least one index key is used to query one or more reverse lookup tables to locate at least one previously persisted candidate observation. The new observation is evaluated against the at least one previously persisted candidate observation to determine at least one relationship. In response to determining the at least one relationship, a threshold is used to make a new assertion about the at least one relationship. The new observation is used to review previous assertions to determine whether a previous assertion is to be reversed. In response to reversing the previous assertion, the new observation, the new assertion, and the reversed assertion are incrementally integrated into persistent context.
    Type: Grant
    Filed: November 22, 2011
    Date of Patent: April 1, 2014
    Assignee: International Business Machines Corporation
    Inventors: Gregery G. Adair, Robert J. Butcher, Jeffrey J. Jonas
  • Publication number: 20140089290
    Abstract: Interaction of a user with external data sources related to the research and analysis of terms and content for improved indexing by search engines may be provided. One or more elements of the content may be used to return an analysis of those elements to the user within the experience of a word processing application environment. Systems and methods also may be provided for determining the relative competition of a term or terms to quantitatively derive the number of pages within a corpus that may be actively engaged to improve the indexing of the pages by information retrieval systems, such as search engines. Systems and methods may be further provided for determining the relative ranking of a term or terms based on the frequency and placement of the term or terms within a structured page.
    Type: Application
    Filed: September 20, 2013
    Publication date: March 27, 2014
    Inventors: Sean Jackson, Brian Clark, Chris Thompson
  • Patent number: 8682968
    Abstract: The present invention discloses a method, a system, and a node for node interconnection on a content delivery network. This content delivery network includes a first-level CDN node that is formed of a first CDN media server and a first CDN content storage and is configured to provide content for a user, and a second-level CDN node that is formed of a second CDN media server and a second CDN content storage and interconnects with the first-level CDN node, where the first CDN media server accesses the content in the second CDN content storage by using an IP-layer storage access protocol. The networking is simplified and the content access efficiency and the resource utilization are improved by using the technical solutions of node interconnection on the content delivery network provided in the present invention.
    Type: Grant
    Filed: October 17, 2012
    Date of Patent: March 25, 2014
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Mo Gan, Jin Xu, Yu Chen
  • Patent number: 8682883
    Abstract: Embodiments of the present invention relate to systems and methods for determining sets of products which are similar to each other in terms of consumers' wants and needs. Queries are performed on a particular product. Documents relating to the query are received and stored. A dictionary is created from the received documents, whereby the documents, which are text files, are scrubbed of certain data to create a scrubbed text file. Topic modeling is then performed on the cleansed text file. Various methods can be used to perform topic modeling, including, but not limited to, latent semantic analysis, nonnegative matrix factorization, and singular value decomposition.
    Type: Grant
    Filed: April 16, 2012
    Date of Patent: March 25, 2014
    Assignee: Predictix LLC
    Inventors: Loren Williams, Emir Pasalic, Nikolaos Vasiloglou
  • Publication number: 20140081948
    Abstract: Aspects of the subject matter described herein relate to searching files. In aspects, a search engine is able to search not only the current files but also is able to search for deleted and previous versions of files that satisfy queries. The search engine may maintain an index that facilitates searches. In addition, the search engine may also determine not only that a file satisfies a query but what version(s) of the file satisfies the query.
    Type: Application
    Filed: November 25, 2013
    Publication date: March 20, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Apurva Ashwin Doshi, JUAN-LEE PANG, BOHDAN W. RACIBORSKI
  • Patent number: 8676784
    Abstract: Embodiments are configured to provide information relevant to individuals of interest to a searching user. In an embodiment, a method includes identifying relevant individuals of a network using a relevance model that includes the use of a number of managed properties and ranking features to identify relevant individuals of a defined network. The relevance model of one embodiment is defined by a schema that includes a textual matching ranking feature, social distance ranking feature, a levels to top ranking feature, and a proximity ranking feature.
    Type: Grant
    Filed: July 9, 2012
    Date of Patent: March 18, 2014
    Assignee: Microsoft Corporation
    Inventors: Boxin Li, Dmitriy Meyerzon, Jessica Alspaugh, Victor Poznanski
  • Patent number: 8676567
    Abstract: Automatic text skimming using lexical chains may be provided. First, at least one lexical chain may be created from an electronic document. Next, a list of positions within the electronic document may be created. The positions may include where at least one concept represented by one of the at least one lexical chain is mentioned. In addition, a list of the position where the at least one concept is mentioned may be assembled. A selection of at least one concept may be received from the list.
    Type: Grant
    Filed: December 16, 2011
    Date of Patent: March 18, 2014
    Inventor: William A. Hollingsworth
  • Publication number: 20140074817
    Abstract: Embodiments are directed towards generating data models that may give semantic meaning for unstructured data or structured data that may include data generated and/or received by search engines, including a time series engine. Data models also may be generated to provide semantic meaning to structured data. A data model may be composed of a hierarchical data model objects analogous to an object-oriented programming class hierarchy. Users may employ a data modeling application to produce reports using search objects that may be part of, or associated with the data model. The data modeling application may employ the search object and the data model to generate a query string for searching a data repository to produce a result set. A data modeling application may map the result set data to data model objects that may be used to generate reports.
    Type: Application
    Filed: October 26, 2012
    Publication date: March 13, 2014
    Applicant: Splunk Inc.
    Inventors: Alice Emily Neels, Archara Sulochana Ganapathi, Marc Vincent Robichaud, Stephen Phillip Sorkin, Steve Yu Zhang
  • Publication number: 20140067788
    Abstract: A system for contact information integration with search results, comprising a queue manager; an enterprise communications system; and a search engine. Upon receipt of search query from a user, the search engine submits a request to the queue manager. The queue manager: obtains from the enterprise communications system a communications status comprising an estimated wait time; sends a notification to the search engine advising the search engine of the estimated wait time for service; and upon receiving from the search engine a request to generate a callback to the user from the enterprise of a specific communications type, directing an enterprise communications system of the enterprise that is associated with the specific communications type to initiate a callback to the user from whom the search engine query was received in response to the callback request received from the search engine.
    Type: Application
    Filed: November 6, 2013
    Publication date: March 6, 2014
    Applicant: Virtual Hold Technology, LLC
    Inventor: Mark J. Williams
  • Patent number: 8666967
    Abstract: An exemplary system for managing an applications and data space includes a strategy layer configured to receive a query statement and to formulate one or more custom queries based on the query statement and a query scheduler layer configured to schedule issuance of the one or more custom queries to one or more query response modules associated with the applications and data space. Other methods, devices and systems are also disclosed.
    Type: Grant
    Filed: September 23, 2011
    Date of Patent: March 4, 2014
    Assignee: Microsoft Corporation
    Inventors: John D. Dunagan, Albert Greenberg, Emre M. Kiciman, Heather E. Warncke, Alastair Wolman
  • Publication number: 20140059036
    Abstract: Embodiments are directed towards a system and method for a cloud-based front end that may abstract and enable access to the underlying cloud-hosted elements and objects that may be part of a multi-tenant application, such as a search application. Search objects may be employed to access indexed objects. An amount of indexed data accessible to a user may be based on an index storage limit selected by the user, such that data that exceeds the index storage limit may continue to be indexed. Also, one or more projects can be elastically scaled for a user to provide resources that may meet the specific needs of each project.
    Type: Application
    Filed: October 31, 2013
    Publication date: February 27, 2014
    Applicant: Splunk Inc.
    Inventors: Robin Kumar Das, Ledio Ago, Declan Gerard Shanaghy, Gaurav Gupta
  • Publication number: 20140059035
    Abstract: A computer-based process for generating a composite search document for use in the electronic search and retrieval of corresponding and relevant documents and/or information from an existing database or collection of electronic documents. A composite search document is created by aggregating blocks of text in an interface into a single document, which is submitted to the mathematical space of a conceptual search index or similar search engine for the purpose of performing a query and returning results.
    Type: Application
    Filed: August 26, 2013
    Publication date: February 27, 2014
    Applicant: iCONECT Development, LLC
    Inventors: Cynthia J. Williams, Ian Campbell
  • Patent number: 8661006
    Abstract: A data file management system and method for a web browser is disclosed. The data file management method may include the step of deleting certain data files upon termination of a browsing session of the web browser. The data file management system may relate to the management of data files stored in a storage area of a computer. The data files may include at least one of a graphic file, a text file, and a cookie file.
    Type: Grant
    Filed: March 28, 2006
    Date of Patent: February 25, 2014
    Assignee: Charles E. Hill & Associates, Inc.
    Inventor: Charles E. Hill
  • Patent number: 8655865
    Abstract: A method and apparatus for facilitating indexing of rich Internet content by accessing a portion of rich Internet content containing a plurality of states and a plurality of transitions that connect various ones of the plurality of states through activation of at least one entity. The method and apparatus accesses contextual information coupled to at least one of a transition, a state or an entity, and examines at least one of the plurality of states using the at least one of the plurality of transitions and the contextual information.
    Type: Grant
    Filed: May 27, 2011
    Date of Patent: February 18, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Srikumar Sankaran, Naga Sateesh Kumar Busam, Jeetu Narayan, Sundaresan Krishnan Meenakshi, Thangaraj Umapathy, Kapil Raja Durga
  • Patent number: 8650182
    Abstract: The techniques presented herein are directed towards providing a user-directed keyword-based search on a large collection of XML documents, and displaying a summary of results to the user. Prior to receiving search requests from a user, an offline analysis of a large collection of XML documents is performed to construct an inverted index of keywords. For each keyword, the index stores a set of location indicators that identify all the instances of the keyword found in the collection of documents. Once the index is constructed, keyword searching can be done efficiently by a keyword lookup in the index. Various display strategies enable the user to see the specific portion of a large XML document containing the keyword and/or path frequency information allowing the user to easily refine the search to specific paths within the collection of documents.
    Type: Grant
    Filed: February 24, 2009
    Date of Patent: February 11, 2014
    Assignee: Oracle International Corporation
    Inventor: Ravi Murthy
  • Patent number: 8650178
    Abstract: The invention relates to an online information retrieval system having a queue for storing load requests and a set of two or more load managers for retrieving data from the queue and indexing documents based on the request retrieved from the queue. Each load manager resides in a different geographical location. A set of candidate documents comprise a unique identifier and a version indicator, wherein the unique identifier for each candidate document is identical for a given document and the version indicator is associated with a determination of which document within the set of candidate documents shall ultimately be communicated to a user.
    Type: Grant
    Filed: June 21, 2010
    Date of Patent: February 11, 2014
    Assignee: Thomson Reuters Global Resources
    Inventor: Jon Michael Verreaux
  • Patent number: 8645327
    Abstract: Methods, systems and apparatus for a distributed data environment can allow data to be seamlessly accessed from remote computing devices. For a given user having multiple computing devices, a personal distributed data bank can be formed from data storage capacities resident at the multiple computing device. The given user can access data on any of the multiple computing devices without knowing on which of the multiple computing devices the data resides. In one embodiment, an index for the user's data within the personal distributed data bank can be provided to facilitate searching, browsing and accessing of user data by the given user.
    Type: Grant
    Filed: September 30, 2009
    Date of Patent: February 4, 2014
    Assignee: Apple Inc.
    Inventors: David Robbins Falkenburg, Michael J. Nugent, Duncan Robert Kerr, Aaron Leiba
  • Publication number: 20140032524
    Abstract: Providing an interface to request actions of a website includes various elements. For example, website instructions of the website are provided to a search engine, the website instructions describing the website tool. The website instructions may be indexed by the search engine to be referenced when servicing a search query. When the website is deemed relevant to the search query, a search-result snippet may be built to include the interface to the website tool. As such, the interface may be provided in the search-results webpage. The interface may be used in the search-results webpage or may be stored locally. The interface may be used to access the website tool without a browser navigating to a landing page of the website.
    Type: Application
    Filed: July 24, 2012
    Publication date: January 30, 2014
    Applicant: MICROSOFT CORPORATION
    Inventor: ANTOINE EL DAHER
  • Patent number: 8639683
    Abstract: A system and method for data analysis and digital forensics is provided. The system for data analysis and digital forensics may include: an online data forensic server for collecting and analyzing usage history information from an object device, which is subject to data collection, downloading and collecting data on the Internet based on the usage history information, requesting issuance of a timestamp token for the collected data, and receiving the issued timestamp token; a timestamp token issuing server for issuing the timestamp token for the collected data in response to the request for issuance of the timestamp token and proving the issued timestamp token to the online data forensic server; and a storage device for storing the collected data.
    Type: Grant
    Filed: October 26, 2011
    Date of Patent: January 28, 2014
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Youn Hee Gil, Joo Young Lee, Su Hyung Jo, Young Soo Kim, Keon Woo Kim, Sang Su Lee, Woo Yong Choi, Sung Kyong Un, Do Won Hong, Hyun Sook Cho
  • Publication number: 20140025657
    Abstract: A system for automatic landing page creation comprises a processor configured to determine a set of popular search queries. The processor is configured to create, automatically, a landing page for each of the set of popular search queries.
    Type: Application
    Filed: July 16, 2013
    Publication date: January 23, 2014
    Inventor: Ilya Dorfman
  • Patent number: 8635251
    Abstract: A system and method to program computers directly from a Web Browser, by cascading sets of retrieval, computation and update requirement specifications to facilitate repetition, decision and execution control logics, without using query, procedural and programming languages.
    Type: Grant
    Filed: May 28, 2008
    Date of Patent: January 21, 2014
    Inventor: Paul Sui-Yuen Chan
  • Patent number: 8635192
    Abstract: A handheld electronic device, such as a GPS-enabled wireless communications device with an embedded camera, automatically geotags a set of data, such as a digital photo, video, notes, or a blog, with a textual plain-language description of the current location. When the data is generated, the current location of the device is determined, e.g. using a GPS receiver. A textual plain-language description of the current location is then generated, e.g. by reverse geocoding the GPS position coordinates or by correlating the current time with a calendar event from which language descriptive of the event can be extracted. This textual plain-language description is automatically generated and written into a tag or metadata file associated with the photo or other set of data. By automatically geotagging data with textual plain-language descriptions that go beyond mere coordinates of latitude and longitude, data can be searched and managed more efficiently.
    Type: Grant
    Filed: February 28, 2008
    Date of Patent: January 21, 2014
    Assignee: BlackBerry Limited
    Inventors: Gerhard Dietrich Klassen, Matthew Bells
  • Patent number: 8635205
    Abstract: The present disclosure involves systems, software, and computer-implemented methods for presenting local site name information associated with search results returned from a web search query. One process includes operations for receiving a search query request from a user and obtaining at least one search result responsive to the query, where the search result is associated with a uniform resource locator (URL). A local site name associated with at least a portion of the URL is retrieved for at least one of the obtained search results. Further, at least one search result is returned to the user via a search results page, where the at least one search result comprises the URL associated with the search result and the local site name associated therewith.
    Type: Grant
    Filed: June 18, 2010
    Date of Patent: January 21, 2014
    Assignee: Google Inc.
    Inventors: Dongwon Kang, Geunhwi Lim
  • Patent number: 8635107
    Abstract: An extensible offer inventory database of offers in a domain is established. Further, an offer ontology is generated based on the extensible offer inventory database. The offer ontology provides an extensible vocabulary that correlates to categories in the offer inventory database. In addition, offers are automatically located. The offers are also semantically analyzed to generate semantic analysis data. Further, user data is obtained. In addition, an optimal offer match is automatically determined based upon the semantic analysis data and the user data.
    Type: Grant
    Filed: June 3, 2011
    Date of Patent: January 21, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Walter Chang, Geoff Baum
  • Publication number: 20140019434
    Abstract: A method and associated systems for the efficient selection of runtime rules for programmable search engines, in which a processor of a computer system translates a set of programmable search-engine runtime rules into a set of fielded documents and then partitions the fielded documents into optimized clusters that are each in turn converted into an inverted index. When a user submits a query, the search engine converts the query into a set of fielded queries and uses the set of inverted indices and the set of fielded queries to identify a candidate subset of the runtime rules. The search engine matches this candidate subset against the input query to select a final set of applicable runtime rules.
    Type: Application
    Filed: August 12, 2013
    Publication date: January 16, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Arun Dobriyal, Monu Kedia, Sriram Raghavan
  • Patent number: 8631014
    Abstract: The method and system of the present invention provides an improved technique for integrated asset management. Information is aggregated from a variety of sources into a centralized computerized database. Thereafter, asset transition events are scheduled. Information from the centralized computerized database is used in the performance of the asset transition events and information relating to the asset transition events is added to the centralized computerized database. Subsequent changes to the asset are also recorded into the centralized computerized database. As a result, a plethora of information is available within said database for the purpose of managing future asset transition events.
    Type: Grant
    Filed: September 10, 2012
    Date of Patent: January 14, 2014
    Assignee: Caldvor Acquisitions Ltd., LLC
    Inventors: Shawn Thomas, Gregory Gray, Michael Woodfin, Warner Mizell, Brian Thomas
  • Patent number: 8631008
    Abstract: One exemplary embodiment involves receiving electronic content from a primary indexing application at a supplemental indexing application. The primary indexing application and the supplemental indexing application are separate applications executing on one or more devices. The supplemental indexing application identifies an external interaction that occurs during execution of the electronic content. Since the external interaction is not available from the supplemental indexing application, the supplemental indexing application communicates with the primary indexing application to receive information associated with the external interaction. The supplemental indexing application identifies one or more content-descriptive terms for the electronic content using the information associated with the external interaction.
    Type: Grant
    Filed: October 1, 2010
    Date of Patent: January 14, 2014
    Assignee: Adobe Systems Incorporated
    Inventors: Akshaya G., Thangaraj Umapathy
  • Publication number: 20140012832
    Abstract: A computer-implemented method, system and computer program product for collecting information from data sources by receiving a collection request at a collection tool to collect information, where the collection request includes data source information indicating a data source from which to retrieve the information. The data source information in the collection request is associated with one or more electronic data repositories in response to the data source indicated by the data source information being previously unidentified to the collection tool. The information is collected from the one or more associated electronic data repositories.
    Type: Application
    Filed: July 6, 2012
    Publication date: January 9, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Roman Kisin, Andrey Pogodin, Pierre Raynaud-Richard
  • Publication number: 20140012833
    Abstract: Various embodiments of systems and methods for protection of data privacy in an enterprise system are described herein. A data request is generated using an application. The data request is sent to a query engine and a database query is generated. A database is queried using the database query and a database response is generated. The database response is sent to the query engine. A blocking table is searched for an identifier in the database response. The blocking table comprises a listing of identifiers identifying tuples with one or more blocked attributes and a data overlay for redacting the one or more blocked attributes. The data overlay is substituted for the one or more blocked attributes in the database response if the identifier is found in the blocking table. After substituting, the database response is sent to the application.
    Type: Application
    Filed: September 11, 2013
    Publication date: January 9, 2014
    Inventor: HANS-CHRISTIAN HUMPRECHT
  • Publication number: 20140012834
    Abstract: A computer-implemented method, system and computer program product for collecting information from data sources by receiving a collection request at a collection tool to collect information, where the collection request includes data source information indicating a data source from which to retrieve the information. The data source information in the collection request is associated with one or more electronic data repositories in response to the data source indicated by the data source information being previously unidentified to the collection tool. The information is collected from the one or more associated electronic data repositories.
    Type: Application
    Filed: September 12, 2013
    Publication date: January 9, 2014
    Applicant: International Business Machines Corporation
    Inventors: Roman Kisin, Andrey Pogodin, Pierre Raynaud-Richard
  • Patent number: 8626508
    Abstract: Provided are a speech search device, the search speed of which is very fast, the search performance of which is also excellent, and which performs fuzzy search, and a speech search method. Not only the fuzzy search is performed, but also the distance between phoneme discrimination features included in speech data is calculated to determine the similarity with respect to the speech using both a suffix array and dynamic programming, and an object to be searched for is narrowed by means of search keyword division based on a phoneme and search thresholds relative to a plurality of the divided search keywords, the object to be searched for is repeatedly searched for while increasing the search thresholds in order, and whether or not there is the keyword division is determined according to the length of the search keywords, thereby implementing speech search, the search speed of which is very fast and the search performance of which is also excellent.
    Type: Grant
    Filed: February 10, 2010
    Date of Patent: January 7, 2014
    Assignee: National University Corporation TOYOHASHI UNIVERSITY OF TECHNOLOGY
    Inventors: Koichi Katsurada, Tsuneo Nitta, Shigeki Teshima
  • Patent number: 8620898
    Abstract: The present document describes a method for presenting modified information obtained from the internet on an application on a mobile device. The mobile device is set to communicate with the internet through a proxy server. A user of the mobile device is registered on a social network from which indexed search content specific to the user's connections on the social network are obtained. The method comprises: modifying, based on the indexed search content specific to the user's connections, an output for the application from an internet server to produce a modified output for the application; and sending the modified output for the application to the mobile device for presentation in the application on the mobile device.
    Type: Grant
    Filed: May 15, 2012
    Date of Patent: December 31, 2013
    Assignee: Wajam Internet Technologies Inc.
    Inventors: Martin-Luc Archambault, André-Philippe Paquet, Heikki Koivikko
  • Patent number: 8620966
    Abstract: Methods, systems, and computer-readable media provide for the modification of network source content. According to various embodiments described herein, source content that includes primary source content and secondary source content is received via a network. Secondary source content is identified for modification. Insertion content is identified for use in modifying the secondary source content. The secondary source content is then modified prior to providing the modified source content to a user device for presentation to a user. Content management criteria that may include provider agreements, user preferences, and a user profile may be used in the content modification procedure.
    Type: Grant
    Filed: September 21, 2007
    Date of Patent: December 31, 2013
    Assignee: AT&T Intellectual Property I, L.P.
    Inventor: Barbara Roden
  • Patent number: 8620892
    Abstract: One example embodiment includes a method for indexing online references of an entity. The method includes identifying one or more channels of the Internet to be searched for references to an entity and identifying one or more signals to be evaluated within each of the one or more channels. The method also includes crawling the Internet for online references to the entity, wherein crawling the Internet comprises searching the one or more channels of the Internet for references to the entity and evaluating the one or more signals. The method further includes constructing a reverse index of the references, wherein the reverse index is based on each channel in which a reference is found and the one or more signals evaluated for the reference.
    Type: Grant
    Filed: April 27, 2012
    Date of Patent: December 31, 2013
    Assignee: Brightedge Technologies, Inc.
    Inventors: Lemuel S. Park, Jimmy Yu
  • Patent number: 8612416
    Abstract: Techniques are disclosed for providing a domain-aware snippet for a search result. A uniform resource locator (URL) is identified for a search result obtained in response to a search query, and it is determined that the URL corresponds to a single domain that has a plurality of web pages that are generated using a template that is common to each of the web pages in the domain. The template comprises a hypertext markup language (HTML) layout pattern that includes multiple sections shared by the web pages. A ranking value is assigned to the multiple sections and is used to identify a first section of the template that is relevant to the search query. A snippet is provided to a user for the search result; the snippet includes at least a portion of text from the first section.
    Type: Grant
    Filed: May 1, 2012
    Date of Patent: December 17, 2013
    Assignee: Microsoft Corporation
    Inventors: Girish Kumar, Fang Liu
  • Patent number: 8612452
    Abstract: An information processing apparatus capable of collecting electronic data efficiently and avoiding sharing highly confidential and secure electronic data with an indefinite number of users. The information processing apparatus is connected to an external apparatus through a network. The information processing apparatus receives electronic data from the external apparatus, registers attribute information related to the received electronic data in association with the received electronic data so that the received electronic data can be retrieved by using the attribute information, and controls the registering unit so as to register the attribute information related to the received electronic data when the received electronic data is not confidential data, and so as not to register the attribute information related to the received electronic data when the received electronic data is confidential data.
    Type: Grant
    Filed: January 24, 2012
    Date of Patent: December 17, 2013
    Assignee: Canon Kabushiki Kaisha
    Inventor: Ritsuto Sako
  • Publication number: 20130332445
    Abstract: A method for providing metadata to a search engine for a document that is not in a mark-up language receives a request for contents of the document and locates metadata associated with the document. The method further creates name-value pairs for the metadata and provides to the search engine server a response comprising the name-value pair in an HTTP (or HTTPS) header and the contents of the document. In other implementations, a method includes sending a request for contents of the document and receiving a response to the request comprising an HTTP header with metadata about the document in a name-value pair and the document's content. The method also includes extracting the name-value pair from the HTTP header, creating a mark-up language tag for the name-value pair, and providing the make-up language tag and the contents of the document in a mark-up language format to a search index creation component.
    Type: Application
    Filed: December 20, 2012
    Publication date: December 12, 2013
    Applicant: GOOGLE INC.
    Inventors: Pawel Opalinski, Eric Jon Anderson, John Felton, Brandon Player Iles
  • Patent number: 8600974
    Abstract: A system and method for automatically determining a plurality of pieces of different information about a textual document from the text of the textual document are provided in which one or more particular characteristics of the text of the textual document are identified to generate a file containing the text of the textual document with the particular characteristics marked up within the text.
    Type: Grant
    Filed: April 29, 2009
    Date of Patent: December 3, 2013
    Assignee: West Services Inc.
    Inventors: Forrest Rhoads, Daniel Dabney, Robert J. Haschart, Jack G. Conrad, Howard Turtle, Kinley Larntz, Bokyung Yang-Stephens, Daniel Gannon
  • Patent number: 8600970
    Abstract: An email server receives an email sent to an email account and including one or more attachments. The one or more attachments are extracted from the email into one or more temporary storage elements. Content from the temporary storage elements is indexed. The indexed content is stored along with an identifier of the email in an index corresponding to the email account. The one or more temporary storage elements is disposed of after the indexing.
    Type: Grant
    Filed: November 11, 2011
    Date of Patent: December 3, 2013
    Assignee: Apple Inc.
    Inventors: Michael J. Abbott, Christopher P. Jalbert, Eric Richard Koebler
  • Patent number: 8595248
    Abstract: Techniques for maintaining a cascading index are provided. In one approach, one or more branch node compression techniques are applied to the main index of a cascading index. In an approach, a Bloom filter is generated and associated with, e.g., a branch node in the main index. The Bloom filter is used to determine whether, without accessing any leaf blocks, a particular key value exists, e.g., in leaf blocks associated with the branch node. In an approach, a new redo record is generated in response to a merge operation between two levels of the cascading index. The new redo record comprises (a) one or more addresses of blocks that are affected by the merge operation, (b) data is that being “pushed down” to a lower level of the cascading index, and (c) one or more addresses of blocks that are written to disk storage as a result of the merge operation.
    Type: Grant
    Filed: May 20, 2009
    Date of Patent: November 26, 2013
    Assignee: Oracle International Corporation
    Inventors: Amit Ganesh, Juan R. Loaiza, Krishna Kunchithapadam
  • Patent number: 8593686
    Abstract: There is provided an image scanning apparatus including a setting section which sets an image generation condition based on the attribute value of the existing image data stored in the image file; a generator section which scans the manuscript to generate the scanned image data based on the image generation condition set by the setting section; and a storing section which adds the scanned image data generated by the generator section into the image file to store the added image file.
    Type: Grant
    Filed: March 10, 2011
    Date of Patent: November 26, 2013
    Assignee: Brother Kogyo Kabushiki Kaisha
    Inventor: Ayako Kanemoto
  • Patent number: 8595212
    Abstract: An index advice record engine generates and stores index advice records. An index advice policy mechanism allows a user to define an index advice policy that specifies criteria for autonomic index creation, modification and deletion. An autonomic index mechanism reads the index advice records, compares this information with the criteria in the user-defined index advice policies, and determines whether an index should be created, modified or deleted based on the information in the index advice records and the index advice policies. By automating the process of creating, modifying and deleting indexes according to user-defined policies, the preferred embodiments alleviate human users from most of the work of manually creating, modifying and deleting indexes.
    Type: Grant
    Filed: August 15, 2008
    Date of Patent: November 26, 2013
    Assignee: International Business Machines Corporation
    Inventors: Mark John Anderson, Robert Joseph Bestgen, James Michael Flanagan, Scott Forstie, Thomas James Schreiber
  • Patent number: 8583622
    Abstract: Methods, computer systems, and computer-readable storage media for utilizing breadcrumb information associated with a Web page to improve a user's search experience on a search engine page and to improve ranking of search results are provided. Breadcrumb information is identified for a plurality of Web pages and is used to create an entity data structure. The entity data structure is used to determine related search items, deep links, and query suggestions on a search engine page. Further, contextual relationships between different breadcrumb elements that are part of a breadcrumb trail associated with a Web page are identified and used to adjust a ranking order of the Web page on a search engine results page.
    Type: Grant
    Filed: March 5, 2012
    Date of Patent: November 12, 2013
    Assignee: Microsoft Corporation
    Inventors: Vijay Krishnan, Puneet Agrawal, Ankur Khator, Sree Hari Nagaralu
  • Patent number: 8583747
    Abstract: A method system for labeling messages of conversations, where each conversation has a respective conversation identifier and comprises one or more messages sharing a common set of characteristics. A selected conversation has a plurality of messages associated with a first label. In response to an action specifying a second label to be associated with the selected conversation, the second label is associated with the messages of the selected conversation, such that the plurality of messages are each associated with both the first and second labels. An index entry for the second label is modified to account for the association of the second label with the messages. In some embodiments, a set of predetermined conditions is associated with the conversation; the first label is removed from the plurality of messages and is prevented from being automatically associated with newly received messages of the conversation until the predetermined conditions are satisfied.
    Type: Grant
    Filed: November 13, 2009
    Date of Patent: November 12, 2013
    Assignee: Google Inc.
    Inventors: Paul T. Buchheit, Bay-Wei W. Chang, Seth J. Golub, Jing Yee Lim, Sanjeev Singh, Russell R. Tuck, III
  • Patent number: 8577869
    Abstract: To prevent memory cells from being damaged, the present invention provides a data storage method for an electronic device, which comprises dividing a first section of a memory of the electronic device into N blocks, wherein each of the N blocks is corresponding to a sequence number and assigned an index value; rewriting an assigned index value corresponding to an nth block of the N blocks according to index values corresponding to the N blocks; storing an input data in the nth block; and rewriting the nth block by stored data in the (n?1)th block before storing the input data in the nth block when the index values corresponding to the N blocks are not identical such that a number of times of accessing sections of the memory is averaged, to prevent the memory from being damaged; wherein N is an integer greater than or equal to 1, and 1?n?N.
    Type: Grant
    Filed: August 22, 2011
    Date of Patent: November 5, 2013
    Assignee: Wistron Corporation
    Inventor: Chia-Ching Tsai
  • Patent number: 8572062
    Abstract: Provided are techniques for retrieving a document that includes for each page an area that is ignored by applications that process the document and that includes a different internal index set associated with each subset of pages of the document, wherein each different internal index set is associated with an area and stores indexes, and wherein each of the indexes consists of a name-value pair. Then, for each page in the document, it is determined whether the page is associated with an internal index set; and, in response to determining that the page is associated with an internal index set, one or more name-value pairs from the internal index set are extracted, wherein each of the one or more name-value pairs provides specific information about the document for use in identifying the document.
    Type: Grant
    Filed: December 21, 2009
    Date of Patent: October 29, 2013
    Assignee: International Business Machines Corporation
    Inventors: Gregory Scott Felderman, Brian Keith Hoyt, Paula Jean Muir
  • Patent number: 8572066
    Abstract: A method for extracting content from a data store includes creating a file identifier mapped to an item. The method includes receiving a request that seeks cached access to the item and that is not directed to the item. In response to the request, contents of the item are provided using the file identifier without caching the contents. A system for extracting content from a data store includes an indexer. The system includes a data store containing at least an item. The system includes a redirector containing a file identifier mapped to the item. When the redirector receives a request originated by the indexer that seeks cached access to the item and that is not directed to the item in the data store the redirector provides contents of the item using the file identifier without caching the contents in the system.
    Type: Grant
    Filed: May 18, 2012
    Date of Patent: October 29, 2013
    Assignee: Google Inc.
    Inventors: Sigurdur Asgeirsson, Tomas Gunnarsson
  • Patent number: 8564796
    Abstract: A first apparatus, which does not mount any metadata generation unit generating metadata used to specify image data from the image data, and a second apparatus, which mounts the metadata generation unit, are connected via a network in the image processing system. The first apparatus stores image data. The first apparatus requests the second apparatus to generate metadata associated with the stored image data, and the metadata generation unit mounted in the second apparatus generates metadata. The first apparatus requests the second apparatus to retrieve image data using the generated metadata, and a retrieval unit mounted in the second apparatus executes a high-level retrieval.
    Type: Grant
    Filed: May 4, 2009
    Date of Patent: October 22, 2013
    Assignee: Canon Kabushiki Kaisha
    Inventor: Naohiro Isshiki