Web Crawlers Patents (Class 707/709)
-
Publication number: 20130311306Abstract: Techniques are described herein for providing focus to a search module presented on a dynamic web page. The search module is configured to include an interface element for performance of a web search. Focus is provided to the search module by removing other modules from the presentation of the dynamic web page upon detecting an input from a user that activates the search module. In one example, component(s) of the search module (e.g., the interface element) may also be re-located and/or re-sized to further facilitate in providing focus to the search module. In another example, other modules associated with the search module may be added to the presentation of the dynamic web page in response to detecting the input from the user that activates the search module.Type: ApplicationFiled: May 21, 2012Publication date: November 21, 2013Applicant: YAHOO! INC.Inventors: Li-Jen (Girvan) Liu, Ting-Liang (Liang) Chou, Geng-Dian (GD) Huang
-
Publication number: 20130311440Abstract: A computer implemented method, system and computer program product for providing search results in response to a search query includes receiving, by a processor, a search query from a user. A processor detects that the search query includes a request for a comparison-mode query and the processor automatically detects terms in the search query indicating that the query includes components. The comparison-mode query is decomposed into respective, individual component queries for the respective components and the query is performed as respective component queries for the respective, individual components. This includes finding an individual result for each respective, individual component from a single, remote website. The user is presented the individual results of the component queries, which includes aligning the results side-by-side and vertically, so that although the results are for respective, individual components, the alignment tends to help the user compare the individual results.Type: ApplicationFiled: May 15, 2012Publication date: November 21, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Charles A. Cole, David B. Roberts, Eric Woods
-
Patent number: 8584045Abstract: Various systems and related methods involve navigating, finding, and visually presenting data objects or sets of data objects. In one implementation, a set of data objects is visually presented as a graphical element and one or more semantic relationships between the graphical element and other sets of data objects are visually presented. Furthermore, this includes a method for finding related sets of data objects by presenting different search paths and enabling the user to select a destination set of data objects based on the presented paths. One possible operation associated with this system includes presenting relationships between data objects outside of predefined work flows.Type: GrantFiled: January 19, 2007Date of Patent: November 12, 2013Assignee: SAP AGInventors: Jochen Rode, Patrick Boettcher
-
Patent number: 8583621Abstract: A search operation can provide geographically restricted and verified information to a user. A two-step approach is used to perform these searches. The first step is to obtain high relevance search results by searching only in a specific region defined for a search operation. The second step further improves the quality of the search results by performing contact address correlation. If the search server finds a reliable reference address in the search results, then these search results can be presented to the user, whereby search results that are not correlating well with legitimate and registered addresses for the site are removed from the search result lists. Therefore, the region-restricted search does searching in a selected geographical region and only presents legitimate web pages or search results to a user.Type: GrantFiled: December 23, 2011Date of Patent: November 12, 2013Assignee: Enpulz, L.L.C.Inventor: James D. Bennett
-
Patent number: 8579187Abstract: A method and a system to identify machine-readable codes using a web crawler are provided. Machine-readable codes include, but are not limited to, Universal Product Codes (UPC), quick response (QR) codes, stock-keeping units (SKUs) and international standard book number (ISBN) codes. A web crawler downloads pages from the World Wide Web. A determination module accesses the downloaded pages and identifies a machine-readable code corresponding to a product description included in the downloaded pages. The machine-readable code is included in a downloaded page of the downloaded pages. The determination module further extracts the product description from the downloaded page. A code database stores a record of the machine-readable code and the product description.Type: GrantFiled: February 28, 2012Date of Patent: November 12, 2013Assignee: eBay Inc.Inventor: Tom Normile
-
Publication number: 20130297585Abstract: A computing device includes one or more rich internet application (RIM client engines. Each RIA client engine includes a corresponding private RIA storage area. The computing device also includes a per-RIA public storage area for each RIA. The per-RIA public storage area including a subset of data items in the private RIA storage area of the corresponding RIA client engine.Type: ApplicationFiled: July 9, 2013Publication date: November 7, 2013Inventor: Jonathan C. Hawkins
-
Patent number: 8577938Abstract: A data mapping acceleration system may include a syntactic profile module to generate a syntactic profile of a data source to be mapped. A semantic classification module may generate a semantic classification of the data source. A similarity module may evaluate the syntactic profile and semantic classification to determine an overall similarity between attributes of data from the data source for mapping the data source.Type: GrantFiled: August 23, 2011Date of Patent: November 5, 2013Assignee: Accenture Global Services LimitedInventors: Peter Z. Yeh, Kunal Verma, Reymonrod Vasquez, Harsha Kumar Srivatsa, Mark Gilbert Kobe
-
Patent number: 8572065Abstract: A computer-implemented method, a computer system, and computer media for discovering links in scripts are provided. The computer system includes a crawler, a rules engine, and an index that are utilized to store links generated by scripts located in webpages in the index. The crawler traverses a network to locate webpages having scripts. The rules engine parses the located webpages and extracts the scripts based on rules that are satisfied by segments of the extracted scripts. The rules engine evaluates the segments of the extracted scripts to generate links. After the rules engine validates the links, the rules engine transmits the links to the index for storage.Type: GrantFiled: November 9, 2007Date of Patent: October 29, 2013Assignee: Microsoft CorporationInventors: Kieran Richard McDonald, Srinath Reddy Aaleti, Richard J. Qian
-
Publication number: 20130282690Abstract: An optimization engine allows website publishers and other network document publishers to view and navigate statistics and scoring methodologies of a search engine. Publishers may thus gain a better understanding of how their website or network document is scored and how to optimize those documents to increase a search engine score. The user is thus able to navigate the network from the perspective of a search engine, viewing webpages, websites, and links in the same way a search engine would analyze them. Upon making changes to a website or network document, publishers may further request on-demand re-crawling of their website or network document to view changes in the score. Alerts may also be activated by a user to notify the user when certain conditions are met.Type: ApplicationFiled: March 18, 2013Publication date: October 24, 2013Applicant: Efficient Systems, LLCInventors: Scott A. Stouffer, Maura D. Stouffer
-
Publication number: 20130282689Abstract: An optimization engine allows website publishers and other network document publishers to view and navigate statistics and scoring methodologies of a search engine. Publishers may thus gain a better understanding of how their website or network document is scored and how to optimize those documents to increase a search engine score. The user is thus able to navigate the network from the perspective of a search engine, viewing webpages, websites, and links in the same way a search engine would analyze them. Upon making changes to a website or network document, publishers may further request on-demand re-crawling of their website or network document to view changes in the score. Alerts may also be activated by a user to notify the user when certain conditions are met.Type: ApplicationFiled: March 18, 2013Publication date: October 24, 2013Applicant: Efficient Systems, LLC.Inventors: Scott A. Stouffer, Maura D. Stouffer
-
Patent number: 8566956Abstract: A computer-implemented system and method of monitoring data access activity of a user of a system is presented here. The method maintains a respective score for each of a plurality of monitored data access events, resulting in a set of scores for the user. The method continues by monitoring behavior of the user to detect occurrences of the monitored data access events, and updating the set of scores in response to detected occurrences of the monitored data access events. The method initiates an appropriate course of action when the updated set of scores is indicative of unauthorized, suspicious, or illegitimate data access activity.Type: GrantFiled: December 6, 2010Date of Patent: October 22, 2013Assignee: salesforce.com, inc.Inventor: Steve Slater
-
Publication number: 20130275406Abstract: Computer-implemented method for building a web corpus (WCD) comprising the steps of: sending by a web crawler (WC) a query to a reference web crawl agent (RWCA), this query containing a least one identifier of a resource, receiving by the web crawler (WC) a response from the reference web crawl agent (RWCA); if this response does not contain the resource identified by the identifier, downloading by the web crawler (WC) the resource from the website (WS) corresponding to the identifier and adding the resource to the web corpus (WCD; and if this response contains the resource identified by the identifier, adding the resource to the web corpus (WCD).Type: ApplicationFiled: April 11, 2013Publication date: October 17, 2013Inventors: Sebastien Richard, Xavier Grehant, Jim Ferenczi
-
Patent number: 8560521Abstract: A system, method, and computer program product are provided for processing a prefix tree file utilizing a selected agent. In use, a file including a prefix tree is identified. Additionally, an agent is selected from a plurality of agents to process the file. Further, the file is processed utilizing the agent.Type: GrantFiled: June 16, 2011Date of Patent: October 15, 2013Assignee: McAfee, Inc.Inventors: Andreas Schlemmer, Thomas C. H. Steiner, Peter Blaimschein
-
Patent number: 8560520Abstract: Information retrieval using time is described. In an embodiment a web-crawler creates time objects which are composites of content of different media types obtained from potentially different sources and, for example, are about the same date or date range. For example, a time object may comprise an image and a piece of text about an event in 1977. In this case the time object may have the date 1977 stored with it as well as URLs of originating web pages. In an embodiment a user is able to query a search engine to retrieve time objects which are displayed in a manner related to the dates of the time objects. For example, the time objects may be displayed in chronological order against a time line. In examples time objects may be dynamically created as a web page is requested and displayed with that web page.Type: GrantFiled: August 26, 2010Date of Patent: October 15, 2013Assignee: Microsoft CorporationInventors: Philip Charles Gosset, Richard Harper
-
Patent number: 8560407Abstract: A method for inventory management performed by a computerized inventory management system includes, with an inventory management system, receiving an initial inventory line comprising an item, a quantity, and a bin. The method further includes, with the inventory management system providing a split quantity user interface window in response to determining that the quantity of the item exceeds an actual quantity stored at the bin, creating and displaying a plurality of new inventory lines to replace the first inventory line in response to an autosplit control of the user interface being engaged, the new inventory lines including different bins, and providing a control through the user interface window to allow manual adjustment of the new inventory lines.Type: GrantFiled: March 8, 2011Date of Patent: October 15, 2013Assignee: International Business Machines CorporationInventors: Kevin Buonagurio, Sarika Budhiraja
-
Patent number: 8560519Abstract: Relationships between linked and/or embedded documents as well as documents sharing data source(s) are captured and rendered through virtual documents. Virtual documents are created representing linked/embedded documents and data sources associated with a relevant document. Relationships between real and virtual documents are preserved and rendered along with search results providing a user a comprehensive picture of search results.Type: GrantFiled: March 19, 2010Date of Patent: October 15, 2013Assignee: Microsoft CorporationInventors: Luming Wang, Xiaohong Yang, Anton Amirov, Pablo Barvo
-
Patent number: 8560637Abstract: A web server is connected to a terminal computer capable of performing hypertext transfer protocol communications with the web server. The terminal computer includes a browser for displaying information. The web server executes a plurality of web applications upon receiving a request from the terminal computer. The web server transmits messages output by the applications being executed to the terminal computer. The terminal computer displays messages received from the web server collectively in one window of the browser.Type: GrantFiled: May 31, 2006Date of Patent: October 15, 2013Assignee: Fujitsu LimitedInventors: Naoki Tsukada, Haruo Higashiwaki, Kyoko Sawada
-
Publication number: 20130268508Abstract: A system and method provides a dynamic table framework for managing data in a high performance web service. An example embodiment includes: receiving a request at a web service; creating a dynamic record from the request; obtaining a runtime corresponding to the dynamic record, the runtime including an associated plurality of symbol values corresponding to the request; choosing a model corresponding to the runtime, the model including a plurality of symbol managers, each of the plurality of symbol managers being associated with the plurality of symbol values, each of the plurality of symbol managers for processing a specific task of the model; executing the model, by use of a data processor, to process the request, the model using at least one of the plurality of symbol managers; and returning results generated by execution of the model.Type: ApplicationFiled: November 29, 2012Publication date: October 10, 2013Applicant: EBAY INC.Inventors: Charles Bracher, Rodolfo G. Caguiat, Hao Lian, Ramon Cruz
-
Publication number: 20130268507Abstract: Among other things, one or more techniques and/or systems are provided for facilitating the completion of a user task. That is, user intent (e.g., intentions of a user to perform a user task) may be identified. The user intent may comprise an entity (e.g., a movie entity) and/or an action (e.g., an order movie tickets action) that the user wants to perform on the entity. A provider list may be created based upon one or more providers capable of performing the action on the entity (e.g., a movie application may be capable of performing the order movie tickets action on the movie entity). Providers may be dynamically selected for inclusion within the provider list at run-time. For example, an open market of providers may be maintained (e.g., providers may be added, removed, and/or updated over time), such that providers may be selected from the open market to complete user tasks.Type: ApplicationFiled: April 8, 2012Publication date: October 10, 2013Applicant: Microsoft CorporationInventors: Steven William Macbeth, Michael William O'Connor, Zhaowei Charlie Jiang, Michael Van Snellenberg, Li Jiang, Shajib Sadhukha, Amit Kumar
-
Publication number: 20130268482Abstract: Systems, methods, and computer-readable media for determining the Internet search popularity of an entity are provided. Embodiments of the present invention include receiving a group of Internet search records and assigning a popularity ranking based on the number of times an entity descriptor associated with an entity occurs within the group of Internet search records created over a designated time period. An entity descriptor is one or more terms commonly used to identify an entity. The trend in an entity's popularity rank may also be calculated. An entity's popularity rank and trend in popularity rank may be presented in a graph or in a list.Type: ApplicationFiled: March 14, 2013Publication date: October 10, 2013Inventors: Tabreez Govani, Hugh Williams, Jamie Buckley, Nitin Agrawal, Andy Lam, Kenneth A. Moss
-
Patent number: 8554758Abstract: Method and apparatus for monitoring and maintaining group health and membership in a searchable data service. The searchable data service may provide a searchable index to a backend data store, and an interface to build and query the searchable index, that enables client applications to search for and retrieve locators for stored entities in the backend data store. Embodiments of the searchable data service may implement a group health mechanism that may allow one or more components of the searchable data service to identify sets of nodes that the components may monitor. A group health component may be queried by the other components for automatically-refreshed health information about those nodes. The group health component may perform as a node failure detection mechanism. Node health information may be communicated among nodes at least in part using a gossip protocol.Type: GrantFiled: March 29, 2006Date of Patent: October 8, 2013Assignee: Amazon Technologies, Inc.Inventors: James S. Larson, Patrick W. Ransil, Aleksey V. Martynov, James R. Collette, Robert Wai-Chi Chu, Partha Saha
-
Patent number: 8554869Abstract: Disclosed herein is a technique for providing an interface that allows a user to navigate backwards through linked webpages. Initially, a request to display inlinks of linking webpages that contain a link to a particular webpage is received. In response to the request, a new page that contains a set of inlinks that correspond to a set of linking webpages that each contain a link to the particular webpage is provided. Each of the inlinks may be associated with a particular clickable item. An indication of a selection of a clickable item, associated with a particular inlink is received. In response, a second new page which contains a second set of inlinks that correspond to a second set of linking webpages that each contain a link to the webpage that corresponds to the particular inlink is provided. Some of the displayed inlinks may correspond to webpages that redirect to the particular webpage.Type: GrantFiled: August 2, 2006Date of Patent: October 8, 2013Assignee: Yahoo! Inc.Inventors: Priyank S. Garg, Conrad Newman, Amit Kumar, Vladimir Ofitserov
-
Publication number: 20130262416Abstract: One embodiment of a method of the present invention for discovering sensitive information on computer network provides for discovering databases on a computer network, defining a pattern for a data discovery, discovering qualifying records by matching the pattern with field names and/or record values in the databases, sending electronic notification to a database administrator managing the qualifying database, receiving a selection choice from the database administrator managing the qualifying database identifying the status for the qualifying records.Type: ApplicationFiled: March 30, 2012Publication date: October 3, 2013Applicant: GO DADDY OPERATING COMPANY, LLCInventors: Ganesh Devarajan, Todd Redfoot
-
Publication number: 20130262428Abstract: One embodiment of a system of the present invention for discovering sensitive information on computer network includes means for discovering databases on a computer network, means for defining a pattern for a data discovery, means for discovering qualifying records by matching the pattern with field names and/or record values in the databases, means for sending electronic notification to a database administrator managing the qualifying database, means for receiving a selection choice from the database administrator managing the qualifying database identifying the status for the qualifying records.Type: ApplicationFiled: March 30, 2012Publication date: October 3, 2013Applicant: GO DADDY OPERATING COMPANY, LLCInventors: Ganesh Devarajan, Todd Redfoot
-
Patent number: 8548977Abstract: A computer-related and/or business type method is presented for embedding one or more media hotspots within a digital media file and, in response to interaction from a separate target entity, such as via an associating request, associating one or more resultant actions with the media hotspot(s). In exchange for associating the one or more resultant actions with the media hotspot(s), an interactive media service entity being affiliated with a web site displaying the digital media file and/or a user being affiliated with the digital media file itself is compensated based upon at least one compensation plan.Type: GrantFiled: March 1, 2012Date of Patent: October 1, 2013Assignee: Clayco Research Limited Liability CompanyInventor: Leigh Rothschild
-
Patent number: 8549029Abstract: A method for generating data enabling searching for content complements intended for terminals of a telecommunication network is provided. In one embodiment, the method includes selecting content from a terminal, generating identification information of the selected content, sending a marker generation request to a content complement server, and, in response to receiving the request, generating and saving a marker at the content complement server. The request includes the user's identification data and the identification information of the selected content. The marker includes the user's identification data, the identification information of the selected content, and search characteristics data enabling an additional search for content related to the selected content. A system, a terminal, and a server associated with the method are also provided.Type: GrantFiled: January 29, 2009Date of Patent: October 1, 2013Assignee: Alcatel LucentInventors: Gérard Delegue, Erwan Baynaud
-
Patent number: 8548975Abstract: A system graphically plots articles. The system executes one or more search queries to search stored articles and receives results of the executed one or more search queries, where the results identify selected articles of the stored articles. The system further designates a graphical marker for each of the selected articles and plots each of the graphical markers on a graphical display.Type: GrantFiled: March 2, 2012Date of Patent: October 1, 2013Assignee: Google Inc.Inventors: Krishna Bharat, Michael Curtiss, Duane Scott Hess, Kerah Pelczarski, Marissa Mayer
-
Patent number: 8548976Abstract: A method for connecting to a web service includes selecting a web service, selecting a server among one or more servers capable of running the selected web service, determining a real address for the selected web service running on the selected server and connecting to the selected web service running on the selected server using the determined real address.Type: GrantFiled: May 19, 2005Date of Patent: October 1, 2013Assignee: CA, Inc.Inventors: Christopher Betts, Tony Rogers
-
Publication number: 20130254170Abstract: A server executing a social media identity and discovery application and method are provided that scan social networking sites for communications. The target content is found with content indicators when communications are put on a social networking site. The content is recorded and evaluated. If the identified content is contextually significant, the alias and the user account data and/or user data from public records are correlated based on keywords and/or events, and a notification of the correlation is sent to an agency, agent, or a contact center system. The agent or agency may verify that the identity of a poster has been accurately correlated with a customer record in the database or with user data from public records. The agent, the agency, or the system has the opportunity to respond to the communication, despite the anonymity of the poster on the social networking site.Type: ApplicationFiled: May 21, 2013Publication date: September 26, 2013Applicant: Avaya IncInventors: George Erhart, Valentine C. Matula, David Skiba
-
Publication number: 20130254181Abstract: Disclosed is a computer-implemented method to aggregate products from online stores, the method comprising crawling one or more websites associated with one or more online stores; collecting information pertaining to products of the stores; extracting key data about each product; and classifying the products into one or more categories based on the key data.Type: ApplicationFiled: January 3, 2013Publication date: September 26, 2013Inventors: Edward Balassanian, Scott W. Bradley, Guy Carpenter
-
Patent number: 8538949Abstract: The claimed subject matter provides a system or method for web crawling hidden files. An exemplary method includes loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, and the controls may be analyzed using a driver component. Form control values may be sent from the driver component to the browser agent, and an event may be submitted to the web page by the browser agent or scripted content may be run to trigger operations on the web page corresponding to the form control values. A URL may be generated for various form control values using a generalizer.Type: GrantFiled: June 17, 2011Date of Patent: September 17, 2013Assignee: Microsoft CorporationInventors: Chao Liu, Chao Zhaou, Yi-Min Wang
-
Publication number: 20130238590Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A computing environment can be used to display an administration interface for use in administering the framework.Type: ApplicationFiled: January 2, 2013Publication date: September 12, 2013Applicant: Oracle International CorporationInventors: Tulasi Kodali, DJ Vasant Ursal
-
Publication number: 20130238592Abstract: An application store tastemaker recommendation service determines experts within a user's social network(s), receives recommendations from the experts, filters and/or ranks mobile application query results based at least in part on the recommendations. Additionally, the service may further determine the experts based on data compiled about previous actions, reviews, comments, etc., of the experts. Further, the service may provide recommendations to the user to aid in selecting mobile applications for purchase, and may provide an avenue for completing such purchases.Type: ApplicationFiled: April 23, 2013Publication date: September 12, 2013Applicant: Microsoft CorporationInventors: Joseph Futty, Matthew Graham Dyor, Danny Lange, Miller T. Abel
-
Publication number: 20130238589Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application.Type: ApplicationFiled: January 2, 2013Publication date: September 12, 2013Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Sandeep Yarramreddy, DJ Vasant Ursal
-
Publication number: 20130238591Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A user interface is provided for use in validating a search box against at target environment as part of implementing search within that environment.Type: ApplicationFiled: January 2, 2013Publication date: September 12, 2013Applicant: Oracle International CorporationInventors: DJ Vasant Ursal, Sherry Mead
-
Publication number: 20130238593Abstract: Systems and methods for providing an enterprise crawl and search framework, including features such as use with middleware and enterprise application environments, pluggable security, search development tools, user interfaces, and governance. In accordance with an embodiment, the system includes an enterprise crawl and search framework which abstracts an underlying search engine, provides a common set of application programming interfaces for developing search functionalities, and allows the framework to serve as an integration layer between one or more enterprise search engine and one or more enterprise application. A plurality of searchable objects which are sets of data derived from enterprise applications are used to make view objects available for full text search.Type: ApplicationFiled: January 2, 2013Publication date: September 12, 2013Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: DJ Vasant Ursal, Tulasi Kodali
-
Publication number: 20130232132Abstract: A method for managing the Search Engine Optimization (SEO) content of web pages is disclosed. In one embodiment, such a method includes providing a set of web pages organized in a hierarchical structure. Each web page has an SEO content pattern associated therewith. The method establishes an inheritance scheme for the hierarchical structure such that the SEO content patterns of parent web pages are inherited by children web pages. The method further enables a user to override the inheritance scheme for selected web pages such that the SEO content patterns of the selected web pages override the SEO content patterns of their respective parent web pages. A corresponding apparatus and computer program product are also disclosed.Type: ApplicationFiled: March 4, 2012Publication date: September 5, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Pavan Kini, Rakesh Nagaraja Kote, Karson Ng, Walfrey Ng, Siddharth C. Ravi Kanth Rao, Guru Prasad Shamanna
-
Patent number: 8527495Abstract: A plug-in interface is provided in a crawling search engine. Plug-in parsers are also provided for use with the search engine. The plug-in interface allows the search engine to be configured with different plug-in parsers. Thus, a customer may configure a search engine with a parser that best suits the needs of the customer and to try new parsing algorithms to find the best results.Type: GrantFiled: February 19, 2002Date of Patent: September 3, 2013Assignee: International Business Machines CorporationInventor: Richard J. Redpath
-
Publication number: 20130226896Abstract: Many websites publish variants of their web pages based on language and region. However, when a user is directed toward the incorrect web page for the user's language preference, it there is not a simple way for the user to select the appropriate localized or region specific version of the web page. According to an embodiment, a language preference from a user may be received. A first language for a first web page may be identified and the first web page may be received by a computing device of the user. A second language for a second web page may be identified. The second web page may comprise an alternate version of the first web page. The first web page or the second web page may be selected according to the language preference of the user and the selected web page may be presented to the user.Type: ApplicationFiled: February 26, 2013Publication date: August 29, 2013Applicant: GOOGLE INCInventors: Jeffrey Chin, Jun Yin, Christopher Semturs, Joshua James Estelle
-
Publication number: 20130226898Abstract: Systems and methods for scheduling documents for crawling are disclosed. In some implementations, a method includes obtaining sitemap information for a plurality of websites; and analyzing the sitemap information to identify a website, in the plurality of websites. The website has sitemap information that is at least potentially out of date. The method also includes updating the sitemap information for the identified website by downloading updated sitemap information for the identified website; and scheduling documents for crawling in accordance with the updated sitemap information for the identified website.Type: ApplicationFiled: April 8, 2013Publication date: August 29, 2013Applicant: Google Inc.Inventor: Google Inc.
-
Publication number: 20130226895Abstract: A system can search for data streams. A processor searches for a data stream device or provider on a network. It is determined if the data stream device or provider includes a stored stream index. The stream index is accessed as a reference stream index if the stream index is discovered. Data streams are searched for using the reference stream index.Type: ApplicationFiled: February 29, 2012Publication date: August 29, 2013Applicant: Broadcom CorporationInventor: Bruno Gauthier
-
Publication number: 20130226897Abstract: A method includes comparing a first instance with a second instance of a document in a plurality of documents. The first instance is obtained from a remote location at a specified time before the second instance is obtained from the remote location, and (i) the specified time is determined in accordance with a first crawl interval associated with the document, (ii) each document in the plurality of documents is assigned to a tier in a plurality of tiers, each tier having a distinct associated range of web crawl intervals, and (iii) the first crawl interval is assigned a first tier. The method also includes computing a second crawl interval for the document, which is a function of the document comparison; and determining whether the second crawl interval is in the first tier. When the second crawl interval is not, the first document is reassigned to another tier.Type: ApplicationFiled: March 22, 2013Publication date: August 29, 2013Inventor: Anton P.T. Carver
-
Publication number: 20130226899Abstract: A method of triggering crawling of a domain includes receiving information related to a domain from a registrar and processing the information related to the domain. The method also includes storing the processed information in a registry zone file and forming a list of registry data based on the processed information. The list of registry data comprises a subset of the registry zone file. The method farther includes crawling one or more of the domains in the list of registry data.Type: ApplicationFiled: April 9, 2013Publication date: August 29, 2013Applicant: VERISIGN, INC.Inventors: Karthik Shyamsunder, Srinivas Sunkara, Alexander Durych Holmes, Andrew Simpson, Phani Pattappu
-
Patent number: 8521718Abstract: Systems, methods, and computer program products identify one or more web page impressions satisfying one or more simply queries, each of the one or more web page impressions associated with a respective impression ID. Respective impression IDs of the one or more web pages satisfying the one or more simple queries are stored in an impression log. Subsequent to storing the respective impression IDs, a query is received from a client device, and a number of impression IDs for the one or more web pages satisfying the query are identified based on the identified one or more web page impressions satisfying the one or more simple queries.Type: GrantFiled: May 17, 2012Date of Patent: August 27, 2013Assignee: Google Inc.Inventors: Dawn Chamberlain, Gurmeet Singh Manku, Barbara Stanley
-
Patent number: 8521715Abstract: One or more computer systems for using favicons to send queries to searchable webpages and obtaining a search results page for each searchable webpage over a network are provided.Type: GrantFiled: May 20, 2011Date of Patent: August 27, 2013Assignee: Accrue Search Concepts, Inc.Inventor: Troy Fearnow
-
Patent number: 8521717Abstract: Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related, Web pages that such information is likely to pertain to is also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s).Type: GrantFiled: April 21, 2011Date of Patent: August 27, 2013Assignee: Google Inc.Inventors: Daniel Egnor, Paul Haahr, Kevin Lackner, John Lamping, Amitabh K. Singhal, Ke Yang
-
Publication number: 20130218864Abstract: Some embodiments provide a method that receives several attributes of a property and a price of the property. For each attribute in the several attributes of the property, the method performs a hedonic analysis to compute a value that correlates a portion of the price of the property to the attribute of the property. The method stores the computed values for later use in a search for the property.Type: ApplicationFiled: February 18, 2012Publication date: August 22, 2013Inventor: Harrison Gregory Hong
-
Publication number: 20130218865Abstract: This disclosure describes systems, methods, and apparatus for generating reports enhancing an understanding of Internet users based on their generated content and actions taken by others in response to the generated content.Type: ApplicationFiled: February 21, 2013Publication date: August 22, 2013Applicant: SPOTRIGHT, INC.Inventor: SPOTRIGHT, INC.
-
Patent number: 8515938Abstract: An information processing system including, a client capable of receiving and reproducing content from a media server, and a collecting server for receiving content management information on the content from the media server and managing the content management information.Type: GrantFiled: May 6, 2008Date of Patent: August 20, 2013Assignee: Sony CorporationInventors: Toshiaki Kusakabe, Satoshi Hiroi, Masahiro Hara
-
Patent number: 8515994Abstract: A method includes determining a source URI and extracting meta data from a resource downloaded from a network; storing the source URI and extracted meta data; and utilizing the stored URI and extracted meta data to formulate a search for another resource located on the network. A computer and computer program code are also provided.Type: GrantFiled: June 12, 2008Date of Patent: August 20, 2013Assignee: International Business Machines CorporationInventors: Venkat A. Reddy, Krishna C. Kumar