Web Crawlers Patents (Class 707/709)
  • Publication number: 20130311306
    Abstract: Techniques are described herein for providing focus to a search module presented on a dynamic web page. The search module is configured to include an interface element for performance of a web search. Focus is provided to the search module by removing other modules from the presentation of the dynamic web page upon detecting an input from a user that activates the search module. In one example, component(s) of the search module (e.g., the interface element) may also be re-located and/or re-sized to further facilitate providing focus to the search module. In another example, other modules associated with the search module may be added to the presentation of the dynamic web page in response to detecting the input from the user that activates the search module.
    Type: Application
    Filed: May 21, 2012
    Publication date: November 21, 2013
    Applicant: YAHOO! INC.
    Inventors: Li-Jen (Girvan) Liu, Ting-Liang (Liang) Chou, Geng-Dian (GD) Huang
  • Publication number: 20130311440
    Abstract: A computer implemented method, system and computer program product for providing search results in response to a search query includes receiving, by a processor, a search query from a user. A processor detects that the search query includes a request for a comparison-mode query and the processor automatically detects terms in the search query indicating that the query includes components. The comparison-mode query is decomposed into respective, individual component queries for the respective components and the query is performed as respective component queries for the respective, individual components. This includes finding an individual result for each respective, individual component from a single, remote website. The user is presented the individual results of the component queries, which includes aligning the results side-by-side and vertically, so that although the results are for respective, individual components, the alignment tends to help the user compare the individual results.
    Type: Application
    Filed: May 15, 2012
    Publication date: November 21, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Charles A. Cole, David B. Roberts, Eric Woods
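    Illustrative sketch (not taken from the application): the decomposition step in the abstract above could look roughly like the following. The "compare" trigger word and the separator list (`vs`, `versus`, `and`, commas) are assumptions made for this example; the abstract does not say how comparison-mode queries are detected.

```python
import re

# Assumed trigger: the query starts with the word "compare".
COMPARE_RE = re.compile(r"^\s*compare\s+(.+)$", re.IGNORECASE)
# Assumed separators between the components being compared.
SPLIT_RE = re.compile(r"\s+(?:vs\.?|versus|and)\s+|\s*,\s*", re.IGNORECASE)


def decompose(query: str) -> list[str]:
    """Split a comparison-mode query into individual component queries.

    A query that is not in comparison mode is returned as a single component.
    """
    match = COMPARE_RE.match(query)
    if not match:
        return [query.strip()]
    parts = SPLIT_RE.split(match.group(1))
    return [part.strip() for part in parts if part.strip()]
```

    Each returned component would then be run as its own query, and the individual results laid out side by side for comparison.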
  • Patent number: 8584045
    Abstract: Various systems and related methods involve navigating, finding, and visually presenting data objects or sets of data objects. In one implementation, a set of data objects is visually presented as a graphical element and one or more semantic relationships between the graphical element and other sets of data objects are visually presented. Furthermore, this includes a method for finding related sets of data objects by presenting different search paths and enabling the user to select a destination set of data objects based on the presented paths. One possible operation associated with this system includes presenting relationships between data objects outside of predefined work flows.
    Type: Grant
    Filed: January 19, 2007
    Date of Patent: November 12, 2013
    Assignee: SAP AG
    Inventors: Jochen Rode, Patrick Boettcher
  • Patent number: 8583621
    Abstract: A search operation can provide geographically restricted and verified information to a user. A two-step approach is used to perform these searches. The first step is to obtain high relevance search results by searching only in a specific region defined for a search operation. The second step further improves the quality of the search results by performing contact address correlation. If the search server finds a reliable reference address in the search results, then these search results can be presented to the user, whereby search results that are not correlating well with legitimate and registered addresses for the site are removed from the search result lists. Therefore, the region-restricted search does searching in a selected geographical region and only presents legitimate web pages or search results to a user.
    Type: Grant
    Filed: December 23, 2011
    Date of Patent: November 12, 2013
    Assignee: Enpulz, L.L.C.
    Inventor: James D. Bennett
  • Patent number: 8579187
    Abstract: A method and a system to identify machine-readable codes using a web crawler are provided. Machine-readable codes include, but are not limited to, Universal Product Codes (UPC), quick response (QR) codes, stock-keeping units (SKUs) and international standard book number (ISBN) codes. A web crawler downloads pages from the World Wide Web. A determination module accesses the downloaded pages and identifies a machine-readable code corresponding to a product description included in the downloaded pages. The machine-readable code is included in a downloaded page of the downloaded pages. The determination module further extracts the product description from the downloaded page. A code database stores a record of the machine-readable code and the product description.
    Type: Grant
    Filed: February 28, 2012
    Date of Patent: November 12, 2013
    Assignee: eBay Inc.
    Inventor: Tom Normile
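    Illustrative sketch (not taken from the patent): one plausible way a determination module could recognize a UPC in a downloaded page is to look for 12-digit runs and keep only those whose UPC-A check digit is correct. Pairing the code with the rest of its line as the "product description" is an assumption made for this example.

```python
import re


def upc_a_is_valid(code: str) -> bool:
    """Return True if `code` is 12 digits with a correct UPC-A check digit."""
    if not re.fullmatch(r"\d{12}", code):
        return False
    digits = [int(c) for c in code]
    # Digits in odd positions (1st, 3rd, ...) are weighted 3, the others 1.
    total = 3 * sum(digits[0:11:2]) + sum(digits[1:11:2])
    return (10 - total % 10) % 10 == digits[11]


def extract_codes(page_text: str) -> list[tuple[str, str]]:
    """Return (code, description) records for each valid UPC found on a line."""
    records = []
    for line in page_text.splitlines():
        for match in re.finditer(r"\b\d{12}\b", line):
            code = match.group()
            if upc_a_is_valid(code):
                # Assumption: the remaining text on the line describes the product.
                description = line.replace(code, "").strip(" :-")
                records.append((code, description))
    return records
```

    The returned records correspond to what the abstract calls entries in a code database; a real system would also need handling for ISBN, SKU, and QR formats.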
  • Publication number: 20130297585
    Abstract: A computing device includes one or more rich internet application (RIA) client engines. Each RIA client engine includes a corresponding private RIA storage area. The computing device also includes a per-RIA public storage area for each RIA. The per-RIA public storage area includes a subset of data items in the private RIA storage area of the corresponding RIA client engine.
    Type: Application
    Filed: July 9, 2013
    Publication date: November 7, 2013
    Inventor: Jonathan C. Hawkins
  • Patent number: 8577938
    Abstract: A data mapping acceleration system may include a syntactic profile module to generate a syntactic profile of a data source to be mapped. A semantic classification module may generate a semantic classification of the data source. A similarity module may evaluate the syntactic profile and semantic classification to determine an overall similarity between attributes of data from the data source for mapping the data source.
    Type: Grant
    Filed: August 23, 2011
    Date of Patent: November 5, 2013
    Assignee: Accenture Global Services Limited
    Inventors: Peter Z. Yeh, Kunal Verma, Reymonrod Vasquez, Harsha Kumar Srivatsa, Mark Gilbert Kobe
  • Patent number: 8572065
    Abstract: A computer-implemented method, a computer system, and computer media for discovering links in scripts are provided. The computer system includes a crawler, a rules engine, and an index that are utilized to store links generated by scripts located in webpages in the index. The crawler traverses a network to locate webpages having scripts. The rules engine parses the located webpages and extracts the scripts based on rules that are satisfied by segments of the extracted scripts. The rules engine evaluates the segments of the extracted scripts to generate links. After the rules engine validates the links, the rules engine transmits the links to the index for storage.
    Type: Grant
    Filed: November 9, 2007
    Date of Patent: October 29, 2013
    Assignee: Microsoft Corporation
    Inventors: Kieran Richard McDonald, Srinath Reddy Aaleti, Richard J. Qian
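    Illustrative sketch (not taken from the patent): the rule-based extraction described above can be approximated with regular expressions that match script segments and capture a URL. The two rules below (`location.href` assignments and `window.open` calls) are hypothetical examples, and validation is reduced to a scheme check.

```python
import re
from urllib.parse import urljoin, urlparse

SCRIPT_RE = re.compile(r"<script[^>]*>(.*?)</script>", re.IGNORECASE | re.DOTALL)

# Hypothetical rules: each pattern captures a URL-like string from a script segment.
RULES = [
    re.compile(r"""location\.href\s*=\s*["']([^"']+)["']"""),
    re.compile(r"""window\.open\(\s*["']([^"']+)["']"""),
]


def links_from_scripts(html: str, base_url: str) -> list[str]:
    """Extract scripts, apply the rules, and return validated absolute links."""
    links = []
    for script in SCRIPT_RE.findall(html):
        for rule in RULES:
            for target in rule.findall(script):
                url = urljoin(base_url, target)
                # Validation step: keep only http(s) links, without duplicates.
                if urlparse(url).scheme in ("http", "https") and url not in links:
                    links.append(url)
    return links
```

    The resulting list stands in for the links that the rules engine would hand to the index for storage.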
  • Publication number: 20130282690
    Abstract: An optimization engine allows website publishers and other network document publishers to view and navigate statistics and scoring methodologies of a search engine. Publishers may thus gain a better understanding of how their website or network document is scored and how to optimize those documents to increase a search engine score. The user is thus able to navigate the network from the perspective of a search engine, viewing webpages, websites, and links in the same way a search engine would analyze them. Upon making changes to a website or network document, publishers may further request on-demand re-crawling of their website or network document to view changes in the score. Alerts may also be activated by a user to notify the user when certain conditions are met.
    Type: Application
    Filed: March 18, 2013
    Publication date: October 24, 2013
    Applicant: Efficient Systems, LLC
    Inventors: Scott A. Stouffer, Maura D. Stouffer
  • Publication number: 20130282689
    Abstract: An optimization engine allows website publishers and other network document publishers to view and navigate statistics and scoring methodologies of a search engine. Publishers may thus gain a better understanding of how their website or network document is scored and how to optimize those documents to increase a search engine score. The user is thus able to navigate the network from the perspective of a search engine, viewing webpages, websites, and links in the same way a search engine would analyze them. Upon making changes to a website or network document, publishers may further request on-demand re-crawling of their website or network document to view changes in the score. Alerts may also be activated by a user to notify the user when certain conditions are met.
    Type: Application
    Filed: March 18, 2013
    Publication date: October 24, 2013
    Applicant: Efficient Systems, LLC
    Inventors: Scott A. Stouffer, Maura D. Stouffer
  • Patent number: 8566956
    Abstract: A computer-implemented system and method of monitoring data access activity of a user of a system is presented here. The method maintains a respective score for each of a plurality of monitored data access events, resulting in a set of scores for the user. The method continues by monitoring behavior of the user to detect occurrences of the monitored data access events, and updating the set of scores in response to detected occurrences of the monitored data access events. The method initiates an appropriate course of action when the updated set of scores is indicative of unauthorized, suspicious, or illegitimate data access activity.
    Type: Grant
    Filed: December 6, 2010
    Date of Patent: October 22, 2013
    Assignee: salesforce.com, inc.
    Inventor: Steve Slater
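    Illustrative sketch (not taken from the patent): the per-event scoring described above might be modeled as a table of running scores per user, with an action triggered once the combined score crosses a threshold. The event names, weights, and threshold here are invented for the example.

```python
from collections import defaultdict

# Hypothetical monitored events, their weights, and the alert threshold.
EVENT_WEIGHTS = {"bulk_export": 5, "failed_login": 2, "record_view": 1}
THRESHOLD = 10


class AccessMonitor:
    """Maintain a score per monitored event for each user."""

    def __init__(self) -> None:
        self.scores = defaultdict(lambda: defaultdict(int))

    def record(self, user: str, event: str) -> bool:
        """Update the user's scores; return True if action should be taken.

        Events that are not monitored are ignored and never trigger action.
        """
        if event not in EVENT_WEIGHTS:
            return False
        self.scores[user][event] += EVENT_WEIGHTS[event]
        return sum(self.scores[user].values()) >= THRESHOLD
```

    A simple sum is only one way to decide that a set of scores looks suspicious; the abstract leaves the decision rule open.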
  • Publication number: 20130275406
    Abstract: Computer-implemented method for building a web corpus (WCD) comprising the steps of: sending by a web crawler (WC) a query to a reference web crawl agent (RWCA), this query containing at least one identifier of a resource; receiving by the web crawler (WC) a response from the reference web crawl agent (RWCA); if this response does not contain the resource identified by the identifier, downloading by the web crawler (WC) the resource from the website (WS) corresponding to the identifier and adding the resource to the web corpus (WCD); and if this response contains the resource identified by the identifier, adding the resource to the web corpus (WCD).
    Type: Application
    Filed: April 11, 2013
    Publication date: October 17, 2013
    Inventors: Sebastien Richard, Xavier Grehant, Jim Ferenczi
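    Illustrative sketch (not taken from the application): the steps in the abstract above amount to a lookup with a download fallback. The two callables, `query_reference_agent` and `download`, are hypothetical stand-ins for the reference web crawl agent and the website fetch.

```python
from typing import Callable, Optional


def build_corpus(
    identifiers: list[str],
    query_reference_agent: Callable[[str], Optional[bytes]],
    download: Callable[[str], bytes],
) -> dict[str, bytes]:
    """Build a web corpus, preferring resources the reference agent already has."""
    corpus = {}
    for identifier in identifiers:
        # Ask the reference web crawl agent for the resource first.
        resource = query_reference_agent(identifier)
        if resource is None:
            # Not available from the agent: fetch it from the website itself.
            resource = download(identifier)
        corpus[identifier] = resource
    return corpus
```

    In this arrangement the crawler only contacts the origin website for resources the reference agent cannot supply.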
  • Patent number: 8560521
    Abstract: A system, method, and computer program product are provided for processing a prefix tree file utilizing a selected agent. In use, a file including a prefix tree is identified. Additionally, an agent is selected from a plurality of agents to process the file. Further, the file is processed utilizing the agent.
    Type: Grant
    Filed: June 16, 2011
    Date of Patent: October 15, 2013
    Assignee: McAfee, Inc.
    Inventors: Andreas Schlemmer, Thomas C. H. Steiner, Peter Blaimschein
  • Patent number: 8560520
    Abstract: Information retrieval using time is described. In an embodiment a web-crawler creates time objects which are composites of content of different media types obtained from potentially different sources and, for example, are about the same date or date range. For example, a time object may comprise an image and a piece of text about an event in 1977. In this case the time object may have the date 1977 stored with it as well as URLs of originating web pages. In an embodiment a user is able to query a search engine to retrieve time objects which are displayed in a manner related to the dates of the time objects. For example, the time objects may be displayed in chronological order against a time line. In examples time objects may be dynamically created as a web page is requested and displayed with that web page.
    Type: Grant
    Filed: August 26, 2010
    Date of Patent: October 15, 2013
    Assignee: Microsoft Corporation
    Inventors: Philip Charles Gosset, Richard Harper
  • Patent number: 8560407
    Abstract: A method for inventory management performed by a computerized inventory management system includes, with an inventory management system, receiving an initial inventory line comprising an item, a quantity, and a bin. The method further includes, with the inventory management system providing a split quantity user interface window in response to determining that the quantity of the item exceeds an actual quantity stored at the bin, creating and displaying a plurality of new inventory lines to replace the first inventory line in response to an autosplit control of the user interface being engaged, the new inventory lines including different bins, and providing a control through the user interface window to allow manual adjustment of the new inventory lines.
    Type: Grant
    Filed: March 8, 2011
    Date of Patent: October 15, 2013
    Assignee: International Business Machines Corporation
    Inventors: Kevin Buonagurio, Sarika Budhiraja
  • Patent number: 8560519
    Abstract: Relationships between linked and/or embedded documents as well as documents sharing data source(s) are captured and rendered through virtual documents. Virtual documents are created representing linked/embedded documents and data sources associated with a relevant document. Relationships between real and virtual documents are preserved and rendered along with search results providing a user a comprehensive picture of search results.
    Type: Grant
    Filed: March 19, 2010
    Date of Patent: October 15, 2013
    Assignee: Microsoft Corporation
    Inventors: Luming Wang, Xiaohong Yang, Anton Amirov, Pablo Barvo
  • Patent number: 8560637
    Abstract: A web server is connected to a terminal computer capable of performing hypertext transfer protocol communications with the web server. The terminal computer includes a browser for displaying information. The web server executes a plurality of web applications upon receiving a request from the terminal computer. The web server transmits messages output by the applications being executed to the terminal computer. The terminal computer displays messages received from the web server collectively in one window of the browser.
    Type: Grant
    Filed: May 31, 2006
    Date of Patent: October 15, 2013
    Assignee: Fujitsu Limited
    Inventors: Naoki Tsukada, Haruo Higashiwaki, Kyoko Sawada
  • Publication number: 20130268508
    Abstract: A system and method provides a dynamic table framework for managing data in a high performance web service. An example embodiment includes: receiving a request at a web service; creating a dynamic record from the request; obtaining a runtime corresponding to the dynamic record, the runtime including an associated plurality of symbol values corresponding to the request; choosing a model corresponding to the runtime, the model including a plurality of symbol managers, each of the plurality of symbol managers being associated with the plurality of symbol values, each of the plurality of symbol managers for processing a specific task of the model; executing the model, by use of a data processor, to process the request, the model using at least one of the plurality of symbol managers; and returning results generated by execution of the model.
    Type: Application
    Filed: November 29, 2012
    Publication date: October 10, 2013
    Applicant: EBAY INC.
    Inventors: Charles Bracher, Rodolfo G. Caguiat, Hao Lian, Ramon Cruz
  • Publication number: 20130268507
    Abstract: Among other things, one or more techniques and/or systems are provided for facilitating the completion of a user task. That is, user intent (e.g., intentions of a user to perform a user task) may be identified. The user intent may comprise an entity (e.g., a movie entity) and/or an action (e.g., an order movie tickets action) that the user wants to perform on the entity. A provider list may be created based upon one or more providers capable of performing the action on the entity (e.g., a movie application may be capable of performing the order movie tickets action on the movie entity). Providers may be dynamically selected for inclusion within the provider list at run-time. For example, an open market of providers may be maintained (e.g., providers may be added, removed, and/or updated over time), such that providers may be selected from the open market to complete user tasks.
    Type: Application
    Filed: April 8, 2012
    Publication date: October 10, 2013
    Applicant: Microsoft Corporation
    Inventors: Steven William Macbeth, Michael William O'Connor, Zhaowei Charlie Jiang, Michael Van Snellenberg, Li Jiang, Shajib Sadhukha, Amit Kumar
  • Publication number: 20130268482
    Abstract: Systems, methods, and computer-readable media for determining the Internet search popularity of an entity are provided. Embodiments of the present invention include receiving a group of Internet search records and assigning a popularity ranking based on the number of times an entity descriptor associated with an entity occurs within the group of Internet search records created over a designated time period. An entity descriptor is one or more terms commonly used to identify an entity. The trend in an entity's popularity rank may also be calculated. An entity's popularity rank and trend in popularity rank may be presented in a graph or in a list.
    Type: Application
    Filed: March 14, 2013
    Publication date: October 10, 2013
    Inventors: Tabreez Govani, Hugh Williams, Jamie Buckley, Nitin Agrawal, Andy Lam, Kenneth A. Moss
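    Illustrative sketch (not taken from the application): a popularity ranking of the kind described above can be approximated by counting, per entity, the search records that mention any of its descriptors. Case-insensitive substring matching and alphabetical tie-breaking are simplifying assumptions for this example.

```python
from collections import Counter


def popularity_ranks(
    search_records: list[str], descriptors: dict[str, list[str]]
) -> dict[str, int]:
    """Rank entities (1 = most popular) by how many records mention them."""
    counts = Counter()
    for query in search_records:
        text = query.lower()
        for entity, terms in descriptors.items():
            # A record counts once per entity, however many descriptors match.
            if any(term.lower() in text for term in terms):
                counts[entity] += 1
    # Highest count first; ties are broken alphabetically by entity name.
    ordered = sorted(descriptors, key=lambda entity: (-counts[entity], entity))
    return {entity: rank for rank, entity in enumerate(ordered, start=1)}
```

    Running the same function over records from successive time periods would give the rank series from which a trend could be computed.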
  • Patent number: 8554758
    Abstract: Method and apparatus for monitoring and maintaining group health and membership in a searchable data service. The searchable data service may provide a searchable index to a backend data store, and an interface to build and query the searchable index, that enables client applications to search for and retrieve locators for stored entities in the backend data store. Embodiments of the searchable data service may implement a group health mechanism that may allow one or more components of the searchable data service to identify sets of nodes that the components may monitor. A group health component may be queried by the other components for automatically-refreshed health information about those nodes. The group health component may perform as a node failure detection mechanism. Node health information may be communicated among nodes at least in part using a gossip protocol.
    Type: Grant
    Filed: March 29, 2006
    Date of Patent: October 8, 2013
    Assignee: Amazon Technologies, Inc.
    Inventors: James S. Larson, Patrick W. Ransil, Aleksey V. Martynov, James R. Collette, Robert Wai-Chi Chu, Partha Saha
  • Patent number: 8554869
    Abstract: Disclosed herein is a technique for providing an interface that allows a user to navigate backwards through linked webpages. Initially, a request to display inlinks of linking webpages that contain a link to a particular webpage is received. In response to the request, a new page that contains a set of inlinks that correspond to a set of linking webpages that each contain a link to the particular webpage is provided. Each of the inlinks may be associated with a particular clickable item. An indication of a selection of a clickable item associated with a particular inlink is received. In response, a second new page which contains a second set of inlinks that correspond to a second set of linking webpages that each contain a link to the webpage that corresponds to the particular inlink is provided. Some of the displayed inlinks may correspond to webpages that redirect to the particular webpage.
    Type: Grant
    Filed: August 2, 2006
    Date of Patent: October 8, 2013
    Assignee: Yahoo! Inc.
    Inventors: Priyank S. Garg, Conrad Newman, Amit Kumar, Vladimir Ofitserov
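    Illustrative sketch (not taken from the patent): backward navigation relies on knowing, for any page, which pages link to it. Assuming the crawler has already recorded each page's outgoing links in a dictionary, the inlink sets can be derived by inverting that mapping.

```python
from collections import defaultdict


def build_inlinks(outlinks: dict[str, list[str]]) -> dict[str, list[str]]:
    """Invert a page -> outgoing links map into a page -> linking pages map."""
    inlinks = defaultdict(set)
    for source, targets in outlinks.items():
        for target in targets:
            inlinks[target].add(source)
    # Sorted lists give a stable order for display.
    return {page: sorted(sources) for page, sources in inlinks.items()}
```

    Selecting one of the returned inlinks and looking it up in the same map yields the second set of inlinks that the abstract describes.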
  • Publication number: 20130262416
    Abstract: One embodiment of a method of the present invention for discovering sensitive information on a computer network provides for discovering databases on a computer network, defining a pattern for a data discovery, discovering qualifying records by matching the pattern with field names and/or record values in the databases, sending electronic notification to a database administrator managing the qualifying database, and receiving a selection choice from the database administrator managing the qualifying database identifying the status for the qualifying records.
    Type: Application
    Filed: March 30, 2012
    Publication date: October 3, 2013
    Applicant: GO DADDY OPERATING COMPANY, LLC
    Inventors: Ganesh Devarajan, Todd Redfoot
  • Publication number: 20130262428
    Abstract: One embodiment of a system of the present invention for discovering sensitive information on a computer network includes means for discovering databases on a computer network, means for defining a pattern for a data discovery, means for discovering qualifying records by matching the pattern with field names and/or record values in the databases, means for sending electronic notification to a database administrator managing the qualifying database, and means for receiving a selection choice from the database administrator managing the qualifying database identifying the status for the qualifying records.
    Type: Application
    Filed: March 30, 2012
    Publication date: October 3, 2013
    Applicant: GO DADDY OPERATING COMPANY, LLC
    Inventors: Ganesh Devarajan, Todd Redfoot
  • Patent number: 8548977
    Abstract: A computer-related and/or business type method is presented for embedding one or more media hotspots within a digital media file and, in response to interaction from a separate target entity, such as via an associating request, associating one or more resultant actions with the media hotspot(s). In exchange for associating the one or more resultant actions with the media hotspot(s), an interactive media service entity being affiliated with a web site displaying the digital media file and/or a user being affiliated with the digital media file itself is compensated based upon at least one compensation plan.
    Type: Grant
    Filed: March 1, 2012
    Date of Patent: October 1, 2013
    Assignee: Clayco Research Limited Liability Company
    Inventor: Leigh Rothschild
  • Patent number: 8549029
    Abstract: A method for generating data enabling searching for content complements intended for terminals of a telecommunication network is provided. In one embodiment, the method includes selecting content from a terminal, generating identification information of the selected content, sending a marker generation request to a content complement server, and, in response to receiving the request, generating and saving a marker at the content complement server. The request includes the user's identification data and the identification information of the selected content. The marker includes the user's identification data, the identification information of the selected content, and search characteristics data enabling an additional search for content related to the selected content. A system, a terminal, and a server associated with the method are also provided.
    Type: Grant
    Filed: January 29, 2009
    Date of Patent: October 1, 2013
    Assignee: Alcatel Lucent
    Inventors: Gérard Delegue, Erwan Baynaud
  • Patent number: 8548975
    Abstract: A system graphically plots articles. The system executes one or more search queries to search stored articles and receives results of the executed one or more search queries, where the results identify selected articles of the stored articles. The system further designates a graphical marker for each of the selected articles and plots each of the graphical markers on a graphical display.
    Type: Grant
    Filed: March 2, 2012
    Date of Patent: October 1, 2013
    Assignee: Google Inc.
    Inventors: Krishna Bharat, Michael Curtiss, Duane Scott Hess, Kerah Pelczarski, Marissa Mayer
  • Patent number: 8548976
    Abstract: A method for connecting to a web service includes selecting a web service, selecting a server among one or more servers capable of running the selected web service, determining a real address for the selected web service running on the selected server and connecting to the selected web service running on the selected server using the determined real address.
    Type: Grant
    Filed: May 19, 2005
    Date of Patent: October 1, 2013
    Assignee: CA, Inc.
    Inventors: Christopher Betts, Tony Rogers
  • Publication number: 20130254170
    Abstract: A server executing a social media identity and discovery application and method are provided that scan social networking sites for communications. The target content is found with content indicators when communications are put on a social networking site. The content is recorded and evaluated. If the identified content is contextually significant, the alias and the user account data and/or user data from public records are correlated based on keywords and/or events, and a notification of the correlation is sent to an agency, agent, or a contact center system. The agent or agency may verify that the identity of a poster has been accurately correlated with a customer record in the database or with user data from public records. The agent, the agency, or the system has the opportunity to respond to the communication, despite the anonymity of the poster on the social networking site.
    Type: Application
    Filed: May 21, 2013
    Publication date: September 26, 2013
    Applicant: Avaya Inc
    Inventors: George Erhart, Valentine C. Matula, David Skiba
  • Publication number: 20130254181
    Abstract: Disclosed is a computer-implemented method to aggregate products from online stores, the method comprising crawling one or more websites associated with one or more online stores; collecting information pertaining to products of the stores; extracting key data about each product; and classifying the products into one or more categories based on the key data.
    Type: Application
    Filed: January 3, 2013
    Publication date: September 26, 2013
    Inventors: Edward Balassanian, Scott W. Bradley, Guy Carpenter
  • Patent number: 8538949
    Abstract: The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method includes loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, and the controls may be analyzed using a driver component. Form control values may be sent from the driver component to the browser agent, and an event may be submitted to the web page by the browser agent or scripted content may be run to trigger operations on the web page corresponding to the form control values. A URL may be generated for various form control values using a generalizer.
    Type: Grant
    Filed: June 17, 2011
    Date of Patent: September 17, 2013
    Assignee: Microsoft Corporation
    Inventors: Chao Liu, Chao Zhaou, Yi-Min Wang
  • Publication number: 20130238590
    Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A computing environment can be used to display an administration interface for use in administering the framework.
    Type: Application
    Filed: January 2, 2013
    Publication date: September 12, 2013
    Applicant: Oracle International Corporation
    Inventors: Tulasi Kodali, DJ Vasant Ursal
  • Publication number: 20130238592
    Abstract: An application store tastemaker recommendation service determines experts within a user's social network(s), receives recommendations from the experts, filters and/or ranks mobile application query results based at least in part on the recommendations. Additionally, the service may further determine the experts based on data compiled about previous actions, reviews, comments, etc., of the experts. Further, the service may provide recommendations to the user to aid in selecting mobile applications for purchase, and may provide an avenue for completing such purchases.
    Type: Application
    Filed: April 23, 2013
    Publication date: September 12, 2013
    Applicant: Microsoft Corporation
    Inventors: Joseph Futty, Matthew Graham Dyor, Danny Lange, Miller T. Abel
  • Publication number: 20130238589
    Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application.
    Type: Application
    Filed: January 2, 2013
    Publication date: September 12, 2013
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Sandeep Yarramreddy, DJ Vasant Ursal
  • Publication number: 20130238591
    Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A user interface is provided for use in validating a search box against a target environment as part of implementing search within that environment.
    Type: Application
    Filed: January 2, 2013
    Publication date: September 12, 2013
    Applicant: Oracle International Corporation
    Inventors: DJ Vasant Ursal, Sherry Mead
  • Publication number: 20130238593
    Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A plurality of searchable objects which are sets of data derived from enterprise applications are used to make view objects available for full text search.
    Type: Application
    Filed: January 2, 2013
    Publication date: September 12, 2013
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: DJ Vasant Ursal, Tulasi Kodali
  • Publication number: 20130232132
    Abstract: A method for managing the Search Engine Optimization (SEO) content of web pages is disclosed. In one embodiment, such a method includes providing a set of web pages organized in a hierarchical structure. Each web page has an SEO content pattern associated therewith. The method establishes an inheritance scheme for the hierarchical structure such that the SEO content patterns of parent web pages are inherited by children web pages. The method further enables a user to override the inheritance scheme for selected web pages such that the SEO content patterns of the selected web pages override the SEO content patterns of their respective parent web pages. A corresponding apparatus and computer program product are also disclosed.
    Type: Application
    Filed: March 4, 2012
    Publication date: September 5, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Pavan Kini, Rakesh Nagaraja Kote, Karson Ng, Walfrey Ng, Siddharth C. Ravi Kanth Rao, Guru Prasad Shamanna
  • Patent number: 8527495
    Abstract: A plug-in interface is provided in a crawling search engine. Plug-in parsers are also provided for use with the search engine. The plug-in interface allows the search engine to be configured with different plug-in parsers. Thus, a customer may configure a search engine with a parser that best suits the needs of the customer and may try new parsing algorithms to find the best results.
    Type: Grant
    Filed: February 19, 2002
    Date of Patent: September 3, 2013
    Assignee: International Business Machines Corporation
    Inventor: Richard J. Redpath
  • Publication number: 20130226896
    Abstract: Many websites publish variants of their web pages based on language and region. However, when a user is directed toward the incorrect web page for the user's language preference, there is not a simple way for the user to select the appropriate localized or region-specific version of the web page. According to an embodiment, a language preference from a user may be received. A first language for a first web page may be identified and the first web page may be received by a computing device of the user. A second language for a second web page may be identified. The second web page may comprise an alternate version of the first web page. The first web page or the second web page may be selected according to the language preference of the user and the selected web page may be presented to the user.
    Type: Application
    Filed: February 26, 2013
    Publication date: August 29, 2013
    Applicant: GOOGLE INC.
    Inventors: Jeffrey Chin, Jun Yin, Christopher Semturs, Joshua James Estelle
  • Publication number: 20130226898
    Abstract: Systems and methods for scheduling documents for crawling are disclosed. In some implementations, a method includes obtaining sitemap information for a plurality of websites; and analyzing the sitemap information to identify a website, in the plurality of websites. The website has sitemap information that is at least potentially out of date. The method also includes updating the sitemap information for the identified website by downloading updated sitemap information for the identified website; and scheduling documents for crawling in accordance with the updated sitemap information for the identified website.
    Type: Application
    Filed: April 8, 2013
    Publication date: August 29, 2013
    Applicant: Google Inc.
    Inventor: Google Inc.
  • Publication number: 20130226895
    Abstract: A system can search for data streams. A processor searches for a data stream device or provider on a network. It is determined if the data stream device or provider includes a stored stream index. The stream index is accessed as a reference stream index if the stream index is discovered. Data streams are searched for using the reference stream index.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 29, 2013
    Applicant: Broadcom Corporation
    Inventor: Bruno Gauthier
  • Publication number: 20130226897
    Abstract: A method includes comparing a first instance with a second instance of a document in a plurality of documents. The first instance is obtained from a remote location at a specified time before the second instance is obtained from the remote location, and (i) the specified time is determined in accordance with a first crawl interval associated with the document, (ii) each document in the plurality of documents is assigned to a tier in a plurality of tiers, each tier having a distinct associated range of web crawl intervals, and (iii) the first crawl interval is assigned a first tier. The method also includes computing a second crawl interval for the document, which is a function of the document comparison; and determining whether the second crawl interval is in the first tier. When the second crawl interval is not in the first tier, the document is reassigned to another tier.
    Type: Application
    Filed: March 22, 2013
    Publication date: August 29, 2013
    Inventor: Anton P.T. Carver
  • Publication number: 20130226899
    Abstract: A method of triggering crawling of a domain includes receiving information related to a domain from a registrar and processing the information related to the domain. The method also includes storing the processed information in a registry zone file and forming a list of registry data based on the processed information. The list of registry data comprises a subset of the registry zone file. The method further includes crawling one or more of the domains in the list of registry data.
    Type: Application
    Filed: April 9, 2013
    Publication date: August 29, 2013
    Applicant: VERISIGN, INC.
    Inventors: Karthik Shyamsunder, Srinivas Sunkara, Alexander Durych Holmes, Andrew Simpson, Phani Pattappu
  • Patent number: 8521718
    Abstract: Systems, methods, and computer program products identify one or more web page impressions satisfying one or more simple queries, each of the one or more web page impressions associated with a respective impression ID. Respective impression IDs of the one or more web pages satisfying the one or more simple queries are stored in an impression log. Subsequent to storing the respective impression IDs, a query is received from a client device, and a number of impression IDs for the one or more web pages satisfying the query are identified based on the identified one or more web page impressions satisfying the one or more simple queries.
    Type: Grant
    Filed: May 17, 2012
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Dawn Chamberlain, Gurmeet Singh Manku, Barbara Stanley
  • Patent number: 8521715
    Abstract: One or more computer systems for using favicons to send queries to searchable webpages and obtaining a search results page for each searchable webpage over a network are provided.
    Type: Grant
    Filed: May 20, 2011
    Date of Patent: August 27, 2013
    Assignee: Accrue Search Concepts, Inc.
    Inventor: Troy Fearnow
  • Patent number: 8521717
    Abstract: Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related Web pages that such information is likely to pertain to are also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s).
    Type: Grant
    Filed: April 21, 2011
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Daniel Egnor, Paul Haahr, Kevin Lackner, John Lamping, Amitabh K. Singhal, Ke Yang
  • Publication number: 20130218864
    Abstract: Some embodiments provide a method that receives several attributes of a property and a price of the property. For each attribute in the several attributes of the property, the method performs a hedonic analysis to compute a value that correlates a portion of the price of the property to the attribute of the property. The method stores the computed values for later use in a search for the property.
    Type: Application
    Filed: February 18, 2012
    Publication date: August 22, 2013
    Inventor: Harrison Gregory Hong
  • Publication number: 20130218865
    Abstract: This disclosure describes systems, methods, and apparatus for generating reports enhancing an understanding of Internet users based on their generated content and actions taken by others in response to the generated content.
    Type: Application
    Filed: February 21, 2013
    Publication date: August 22, 2013
    Applicant: SPOTRIGHT, INC.
    Inventor: SPOTRIGHT, INC.
  • Patent number: 8515938
    Abstract: An information processing system including a client capable of receiving and reproducing content from a media server, and a collecting server for receiving content management information on the content from the media server and managing the content management information.
    Type: Grant
    Filed: May 6, 2008
    Date of Patent: August 20, 2013
    Assignee: Sony Corporation
    Inventors: Toshiaki Kusakabe, Satoshi Hiroi, Masahiro Hara
  • Patent number: 8515994
    Abstract: A method includes determining a source URI and extracting meta data from a resource downloaded from a network; storing the source URI and extracted meta data; and utilizing the stored URI and extracted meta data to formulate a search for another resource located on the network. A computer and computer program code are also provided.
    Type: Grant
    Filed: June 12, 2008
    Date of Patent: August 20, 2013
    Assignee: International Business Machines Corporation
    Inventors: Venkat A. Reddy, Krishna C. Kumar
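
Illustrative sketch (not taken from any filing): the tiered crawl-interval method summarized in Publication 20130226897 can be pictured roughly as follows. The tier names and ranges, the `Document` class, and the halve-or-grow update rule are all hypothetical choices made for this example; the abstract only states that the new interval is a function of the document comparison and that the document is reassigned when the new interval falls outside its current tier.

```python
from dataclasses import dataclass

# Hypothetical tiers: (name, lower bound in hours inclusive, upper bound exclusive).
TIERS = [
    ("hourly", 0.0, 24.0),
    ("daily", 24.0, 168.0),
    ("weekly", 168.0, float("inf")),
]


@dataclass
class Document:
    url: str
    content: str
    interval_hours: float
    tier: str


def tier_for(interval_hours: float) -> str:
    """Return the name of the tier whose range contains the interval."""
    for name, low, high in TIERS:
        if low <= interval_hours < high:
            return name
    raise ValueError(f"no tier covers interval {interval_hours}")


def next_interval(current_hours: float, changed: bool) -> float:
    """Assumed update rule: crawl sooner after a change, later otherwise."""
    return current_hours / 2 if changed else current_hours * 1.5


def recrawl(doc: Document, new_content: str) -> Document:
    """Compare two instances, recompute the interval, reassign the tier if needed."""
    changed = new_content != doc.content
    doc.interval_hours = next_interval(doc.interval_hours, changed)
    new_tier = tier_for(doc.interval_hours)
    if new_tier != doc.tier:
        # The new interval left the current tier's range, so move the document.
        doc.tier = new_tier
    doc.content = new_content
    return doc
```

Under these assumed rules, a document in the "daily" tier with a 30-hour interval whose content changed between crawls would get a 15-hour interval and move to the "hourly" tier.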