Category Specific Web Crawling Patents (Class 707/710)
  • Patent number: 8799263
    Abstract: Embodiments of the disclosure may include systems, methods, and devices for providing multidimensional search results on a plurality of search planes. Such systems, methods, and devices may: (i) receive one or more search terms from one or more user interfaces of the system; (ii) perform a search of one or more informational repositories to obtain a list of search results wherein the informational repositories may include the Internet and one or more databases; (iii) process the list of search results to classify each search result in one of a plurality of categories; (iv) cause a presentation of the search results in a plurality of search planes on the display of the system such that each search plane corresponds to one of the plurality of categories. In addition, the software applications may include a sorting software application that groups the list of search results into one of a plurality of categories.
    Type: Grant
    Filed: February 7, 2012
    Date of Patent: August 5, 2014
    Inventor: Leigh M Rothschild
  • Patent number: 8799772
    Abstract: A system, method and search engine for searching images for data contained therein. Training images are provided and image attributes are extracted from the training images. Attributes extracted from training images include image features characteristic of a particular numerically generated image type, such as horizontal lines, vertical lines, percentage white area, circular arcs and text. Then, the training images are classified according to extracted attributes and a particular classifier is selected for each group of training images. Classifiers can include classification trees, discriminant functions, regression trees, support vector machines, neural nets and hidden Markov models. Available images are collected from remotely connected computers, e.g., over the Internet. Collected images are indexed and provided for interrogation by users. As a user enters queries, indexed images are identified and returned to the user. The user may provide additional data as supplemental data to the extracted image data.
    Type: Grant
    Filed: July 18, 2005
    Date of Patent: August 5, 2014
    Assignee: International Business Machines Corporation
    Inventors: Nimrod Megiddo, Shivakumar Vaithyanathan
  • Patent number: 8799261
    Abstract: A method for incremental crawling of content stored on a plurality of content providers using aggregation is provided. The method comprises receiving a request to crawl content on one or more associated content providers; retrieving one or more first references to content on a first content provider; retrieving one or more second references to content on one or more second content providers during the same request; aggregating the first and second references; and returning the aggregated first and second references. This is done while taking into consideration an opaque timestamp object, which is managed in a distributed manner. The opaque timestamp is filled in by the content providers but stored on the crawler side between crawling sessions.
    Type: Grant
    Filed: December 23, 2008
    Date of Patent: August 5, 2014
    Assignee: International Business Machines Corporation
    Inventors: Batya Kenig, Constantin Radchenko, Eitan Shapiro
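The incremental crawl in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `Provider` class, the `changes_since` method, and the use of a sequence number as the token are all hypothetical choices made for the example. The point it shows is that each provider fills in its own token, while the crawler only stores the tokens between sessions and hands them back.

```python
from dataclasses import dataclass, field


@dataclass
class Provider:
    """Hypothetical content provider holding (sequence number, reference) pairs."""
    name: str
    items: list = field(default_factory=list)

    def changes_since(self, token):
        # The token format is private to the provider; in this sketch it is
        # the last sequence number already returned (None = never crawled).
        last_seen = -1 if token is None else token
        refs = [ref for seq, ref in self.items if seq > last_seen]
        new_token = max((seq for seq, _ in self.items), default=last_seen)
        return refs, new_token


def crawl(providers, stored_tokens):
    """Query every provider in one request and aggregate the references.

    stored_tokens maps provider name -> token kept on the crawler side.
    The crawler never inspects a token; it only stores it and passes it back.
    """
    aggregated = []
    next_tokens = dict(stored_tokens)
    for provider in providers:
        refs, token = provider.changes_since(stored_tokens.get(provider.name))
        aggregated.extend(refs)
        next_tokens[provider.name] = token
    return aggregated, next_tokens
```

Calling `crawl` a second time with the tokens returned by the first call yields only the references added in between.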
  • Publication number: 20140214791
    Abstract: Architecture that utilizes geotiles to return locally relevant results across a geographically distributed set of locations. As applied to retail operations, the business entity may have a retail presence in many different geographical areas (e.g., regions) of a country. Each retail presence is processed to obtain the associated geographical coordinates, which are then utilized to select one or more geotiles of a mapping system. The geotile(s) for those geographical coordinates are then identified and related to the location. The business entity may be associated with a deal (offer) that is queried using a search engine. The relationships of the deal, retail stores, geographical coordinates of the retail stores, and related geotiles are memorialized in a feed document. Thus, when a query is made for the deal, the search engine accesses the feed document and returns the geotiles for visual presentation of the associated retailer as part of the search result.
    Type: Application
    Filed: January 31, 2013
    Publication date: July 31, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Tejeshwar Singh, Hiren Shah, Haibo Lu, Gilbert Wong
  • Patent number: 8788514
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for triggering music answer boxes. In one aspect, a method includes receiving a query, obtaining a plurality of search results responsive to the query, the search results being results from a search of web resources on the Internet, and determining from the plurality of search results that the query is a music query. Music data for a song responsive to the query is obtained, where the music data comprises a Uniform Resource Locator (URL) of song content of the song on the Internet. A music answer box is generated for the query, where the music answer box comprises the music data and a link to the URL of the song content, and the music answer box is provided in addition to search results.
    Type: Grant
    Filed: October 28, 2010
    Date of Patent: July 22, 2014
    Assignee: Google Inc.
    Inventors: Ganesh Ramanarayanan, Jun Gong, Murali Krishna Viswanathan, Daphne Dembo, Pravir K. Gupta, Tal Cohen, Lev Finkelstein, Adi Mano, Evan B. Roseman
  • Patent number: 8788479
    Abstract: Disclosed is a system and method to search the World Wide Web for latest user activities and information and update the user activities and information to user subscribed social media websites with user's approval. User subscribes their personal information and interests to the present invention. Present invention crawls and formats the information available in the World Wide Web for the provided user information and interests. The formatted information is notified to the user for approval. The user reviews the information, accepts or rejects the information. The user can edit the information to change the content. The approved information is updated to user subscribed social media websites.
    Type: Grant
    Filed: December 26, 2012
    Date of Patent: July 22, 2014
    Inventors: Johnson Manuel-Devadoss, Christy Aarthi Jones
  • Publication number: 20140201186
    Abstract: The present invention generally relates to computer and web-based contact searches. Specifically, this invention relates to systems and methods for extending contact searches to include contacts beyond those of the user initiating the search. Embodiments of the present invention allow users to search for indirect contacts beyond their direct contacts by providing the user results that include the contacts of their contacts and so on to a specified depth level and restricted by security implementations selectable by the indirect contacts.
    Type: Application
    Filed: January 11, 2013
    Publication date: July 17, 2014
    Inventor: Ge Zhao
  • Patent number: 8782031
    Abstract: A politeness manager estimates traffic to the sites based on historical log data generated and sent by plug-ins or toolbars on client web browsers. The historical log data details dates and times the web browsers visit different web sites that is used to understand what timeframes specific web sites are busy and what timeframes the web sites are not busy. Crawl rates for different timeframes for a web site are determined based on the historical log data, and web crawlers are scheduled to crawl the web site according to the crawl rates to minimize the chances that web crawler requests are responsible for the site crashing.
    Type: Grant
    Filed: August 9, 2011
    Date of Patent: July 15, 2014
    Assignee: Microsoft Corporation
    Inventors: Dean M. Wierman, Fabrice Canel, Balaji Shyamkumar, Charles (Xi) Zhang
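One way to picture the politeness manager described above is a function that turns logged visit times into a per-hour crawl rate. The sketch below is an assumption-laden illustration: the function name, the hourly granularity, the linear scaling, and the rate bounds are not taken from the patent. It only shows the general idea that busier timeframes receive a lower crawl rate.

```python
from collections import Counter
from datetime import datetime


def hourly_crawl_rates(visit_times, max_rate=10.0, min_rate=1.0):
    """Map each hour of day (0-23) to a crawl rate in requests per minute.

    Hours with more logged visitor traffic get a lower crawl rate. The linear
    scaling and the rate bounds are assumptions made for this sketch.
    """
    visits_per_hour = Counter(t.hour for t in visit_times)
    busiest = max(visits_per_hour.values(), default=0)
    rates = {}
    for hour in range(24):
        # Load is the hour's share of the busiest hour's traffic (0.0 to 1.0).
        load = visits_per_hour[hour] / busiest if busiest else 0.0
        rates[hour] = max_rate - load * (max_rate - min_rate)
    return rates
```

A crawler scheduler could then look up `rates[datetime.now().hour]` before issuing requests to the site.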
  • Patent number: 8782033
    Abstract: The present invention outlines a genuine entity following system that also addresses data source limitation. When reviewing entity-related objects in web content, a web user designates one or more entities to follow in real time. More particularly, the present invention is directed through strategic deployment of a dynamic crawler upon selection of a “follow” pointer over an object in a web browser such that a web user can automatically designate entities to be followed and receive alerts at predetermined temporal intervals when new information regarding such designated entities becomes available. A web entity engine of the present invention is designed to discover trending entities at any given time while generating output activity (i.e., signal) streams for this entity.
    Type: Grant
    Filed: December 1, 2010
    Date of Patent: July 15, 2014
    Assignee: Microsoft Corporation
    Inventors: Zhaowei Jiang, Xavier Legros, Ronald H. Jones, Jr., Ryan Panchadsaram
  • Publication number: 20140195510
    Abstract: An illustrative embodiment of a computer-implemented process for partitioning a crawling space computes an event identifier for each event in the set of events to form an identified set of events, segments the identified set of events into a number of partitions, assigns a partition to each node in a set of nodes and executes each event in each assigned partition by a respective node. In response to a determination that a new state is discovered, other nodes are notified of the new state, in which information associated with the new state is added to a respective assigned set of event IDs at each node. In response to a determination that no more notifications exist, the computer-implemented process determines whether more events to process exist and terminates in response to a determination that no more events to process exist.
    Type: Application
    Filed: September 24, 2013
    Publication date: July 10, 2014
    Applicant: International Business Machines Corporation
    Inventors: Guy-Vincent Jourdan, Iosif Viorel Onut, Seyed M. Taheri, Gregor von Bochmann
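The partitioning step in the abstract above (compute an identifier per event, segment the events, assign one partition per node) might look something like this. The abstract does not say how identifiers are computed or how partitions are formed, so hashing the event description and taking the result modulo the node count are assumptions made for illustration; `event_id` and `partition_events` are hypothetical names.

```python
import hashlib


def event_id(event):
    """Derive a stable integer identifier from an event description string."""
    digest = hashlib.sha256(event.encode("utf-8")).hexdigest()
    return int(digest, 16)


def partition_events(events, node_count):
    """Assign each event to exactly one node by event ID modulo node_count."""
    partitions = {node: [] for node in range(node_count)}
    for event in events:
        partitions[event_id(event) % node_count].append(event)
    return partitions
```

Because the identifier depends only on the event itself, every node can work out independently which node owns a newly discovered event.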
  • Publication number: 20140195511
    Abstract: A search query is received. Personal information for a user is then determined. A search is performed in a general subdomain of general content using the search query. For example, the general subdomain of general content may be a WWW search. Then, a vertical subdomain is determined based on the personal information. A search is then performed in the vertical subdomain of specialized content using the search query. The search performed in the general subdomain and the search performed in the vertical subdomain generate general search results and vertical search results. The results may be combined and outputted to a client.
    Type: Application
    Filed: March 12, 2014
    Publication date: July 10, 2014
    Applicant: Yahoo! Inc.
    Inventors: Qi Lu, John Thrall, David Ku
  • Patent number: 8775434
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a catchment area for a search result. In one aspect, a method includes receiving search log data specifying a resource that was referenced by search results that were presented to users, query locations for the resource, and whether the users interacted with the search results. A catchment area is determined for the resource based on the search log data, where the catchment area specifies a geographic region for which the resource is a candidate resource. In turn, the resource is indexed according to its catchment area. The catchment area is used in response to receiving a search query that is associated with a particular query location to select candidate search results that reference resources having catchment areas that include the particular query location. Final search results are selected from the candidate search results.
    Type: Grant
    Filed: October 19, 2010
    Date of Patent: July 8, 2014
    Assignee: Google Inc.
    Inventor: James Robert Macgil
  • Patent number: 8775403
    Abstract: A scheduler for a search engine crawler includes a history log containing document identifiers (e.g., URLs) corresponding to documents (e.g., web pages) on a network (e.g., Internet). The scheduler is configured to process each document identifier in a set of the document identifiers by determining a content change frequency of the document corresponding to the document identifier, determining a first score for the document identifier that is a function of the determined content change frequency of the corresponding document, comparing the first score against a threshold value, and scheduling the corresponding document for indexing based on the results of the comparison.
    Type: Grant
    Filed: April 17, 2012
    Date of Patent: July 8, 2014
    Assignee: Google Inc.
    Inventor: Keith H. Randall
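The scheduling loop in the abstract above (determine change frequency, derive a score, compare against a threshold, schedule) can be sketched as below. This is a simplified illustration under stated assumptions: the history log is modeled as a list of changed/unchanged flags per URL, the score is taken to be the change frequency itself, and the names and default threshold are hypothetical.

```python
def change_frequency(changed_flags):
    """Fraction of past crawls on which the document's content had changed."""
    if not changed_flags:
        return 0.0
    return sum(changed_flags) / len(changed_flags)


def schedule_for_indexing(history, threshold=0.5):
    """Return the URLs whose score meets the threshold.

    history maps URL -> list of booleans, one per past crawl (True = content
    changed). The score here is simply the change frequency; the abstract
    only says the score is a function of it.
    """
    scheduled = []
    for url, flags in history.items():
        score = change_frequency(flags)
        if score >= threshold:
            scheduled.append(url)
    return scheduled
```

Lowering the threshold schedules more documents per pass at the cost of more crawl traffic.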
  • Publication number: 20140188838
    Abstract: Generally, the invention is relevant to computers and data processing and, in particular, covers the method for searching and further processing and collaborative rating the content of interest in a network environment. The invention is a search engine, processing and information rating system comprising at least one user terminal connectable to a server component comprising at least one database arranged as a set of alternatives associated with at least one criterion, an alternative search block and an alternative ranking block along with a block enabling a user to specify a score for any alternative by any criterion.
    Type: Application
    Filed: December 28, 2012
    Publication date: July 3, 2014
    Inventor: EDUARD MIKHAILOVICH STRUGOV
  • Patent number: 8768911
    Abstract: A computerized system and method is presented for analyzing quotations made in a quoting document of text originally found in a source document. The quoting document and source document can be web pages publicly available on the World Wide Web. The present invention analyzes the quoting document for quoted text, searches the source document for that text, and stores the existence of the quotation in association with the source document. When displaying the source document, quoted text is highlighted. A link is provided between items of quoted text and a list of documents that have quoted that text. From this list the full text of a quoting document may be displayed.
    Type: Grant
    Filed: March 26, 2009
    Date of Patent: July 1, 2014
    Assignee: Geronimo Development
    Inventor: Orin Russell Armstrong
  • Patent number: 8768909
    Abstract: The present invention includes systems and methods for retrieving information via a flexible and consistent targeted search model that employs interactive multi-prefix, multi-tier and dynamic menu information retrieval techniques (including predictive text techniques to facilitate the generation of targeted ads) that provide context-specific functionality tailored to particular information channels, as well as to records within or across such channels, and other known state information. Users are presented with a consistent search interface among multiple tiers across and within a large domain of information sources, and need not learn different or special search syntax. A thin-client server-controlled architecture enables users of resource-constrained mobile communications devices to locate targeted information more quickly by entering fewer keystrokes and performing fewer query iterations and web page refreshes, which in turn reduces required network bandwidth.
    Type: Grant
    Filed: May 11, 2010
    Date of Patent: July 1, 2014
    Assignee: Tropare, Inc.
    Inventors: G. Gregory Carpenter, Timothy L. Kay
  • Patent number: 8762365
    Abstract: Disclosed are various embodiments for classifying network sites into site categories. A network site is classified into at least one of a plurality of categories based at least in part on similarity. Query popularity, query competitiveness, and/or query importance may be used in determining similarity. The similarity is measured between a first plurality of search queries and a second plurality of search queries. The first plurality of search queries is one that produced first search results that include the network site. The second plurality of search queries is one that produced second search results that include other network sites. Each of the other network sites has a respective category. The query-similarity based scoring may be combined with more scoring based on crawling and processing network page or network site contents.
    Type: Grant
    Filed: August 5, 2011
    Date of Patent: June 24, 2014
    Assignee: Amazon Technologies, Inc.
    Inventors: Soo-Min Pantel, Amber Roy Chowdhury
  • Patent number: 8756213
    Abstract: A system, method, and computer program product are provided for crawling a website based on a scheme of the website. In use, a difference between a first content and second content of a website is identified. Additionally, a scheme of the website is identified based on the difference. Furthermore, the website is crawled based on the scheme.
    Type: Grant
    Filed: July 10, 2008
    Date of Patent: June 17, 2014
    Assignee: McAfee, Inc.
    Inventor: Gabriel Richard Pack
  • Publication number: 20140164351
    Abstract: Methods and systems are described for processing and displaying content. Web page data for a first web page is received from a remote system, wherein the web page is to be displayed on a terminal associated with a user. An automatic identification is performed of a first content in the first web page data. A user-defined profile is accessed. A second content is automatically selected based at least in part on the user profile. The first content is replaced with the second content so that if the first web page is displayed on the terminal associated with the user, the second content is displayed and the first content is not displayed.
    Type: Application
    Filed: October 3, 2013
    Publication date: June 12, 2014
    Applicant: AD-VANTAGE NETWORKS, LLC
    Inventors: David Grant, John W. Grant, Sanjeev Kuwadekar
  • Publication number: 20140156630
    Abstract: A method includes receiving a request to generate data which describes the data. A database of seed content and an algorithms database are searched. If both seed content and an algorithm are found, the algorithm is applied to the seed content, thereby generating data. Some embodiments may include advertising a content generation service. Users may register for the service.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 5, 2014
    Applicant: DELL PRODUCTS, LP
    Inventors: Jianwen Yin, Li Jun Zhou, Thomas P. Maddox, Ryan D. King, Tsen-Loong Peng
  • Patent number: 8744839
    Abstract: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of text segments to form a text data combination set; determining an intersection of the candidate word set and the text data combination set, the intersection comprising a plurality of text data combinations; determining a plurality of designated characteristic values for the plurality of text data combinations; based at least in part on the plurality of designated characteristic values and according to at least a criterion, recognizing among the plurality of text data combinations target words whose characteristic values fulfill the criterion.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: June 3, 2014
    Assignee: Alibaba Group Holding Limited
    Inventors: Haibo Sun, Yang Yang, Yining Chen
  • Publication number: 20140149383
    Abstract: Enhanced computer- and network-based methods, systems, and techniques are provided for retrieving more accurate and responsive search results when searching content for a designated entity using an off-the-shelf keyword-based search engine. For example, the embodiments described herein may be used to improve search results by eliminating off-topic results when presenting queries to an existing keyword-based search engine invoked by means of an API from an intermediating application. Example embodiments provide a Keyword-Based Search Enhancement System (“KBSES”), which enables intermediating applications to obtain information more closely related to user queries by enhancing such queries, on behalf of the user, with disambiguating information when deemed necessary.
    Type: Application
    Filed: January 31, 2014
    Publication date: May 29, 2014
    Applicant: Vulcan Inc.
    Inventors: Ted Diamond, Jisheng Liang, Jonathan Reichhold, Krzysztof Koperski
  • Patent number: 8732150
    Abstract: Disclosed are systems, apparatus, methods, and computer readable media for suppressing network feed activities using an information feed in an on-demand database service environment. In one embodiment, a message is received, including data indicative of a user action. An entity associated with the user action is identified, where the entity is a type of record stored in a database. A type of the entity is identified. It is determined whether the entity type is a prohibited entity type. When the entity type is not a prohibited entity type, the message data is saved to one or more tables in the database. The tables are configured to store feed items of an information feed capable of being displayed on a device. When the entity type is a prohibited entity type, the saving of the message data, to the one or more tables in the database configured to store the feed items, is prohibited.
    Type: Grant
    Filed: February 10, 2011
    Date of Patent: May 20, 2014
    Assignee: salesforce.com, inc.
    Inventors: William Gradin, Matthew Davidchuk, Qiu Ma, Leonid Zemskov, Amy Palke
  • Patent number: 8732166
    Abstract: A strategy is described for delivering requested books to users along with customized bookmarks or other functional objects. An exemplary bookmark can include various informational items, including an informational item that describes the delivered book, an informational item that rates the desired book, an informational item that recommends one or more other books based on various factors, and an informational item that provides a call-to-action. The call-to-action encourages the user to access an electronic service to review the delivered book, purchase one or more additional books, or take some other action. The electronic service can provide accounting which identifies and registers user actions that are motivationally linked to the information imparted by the bookmarks.
    Type: Grant
    Filed: December 14, 2006
    Date of Patent: May 20, 2014
    Assignee: Amazon Technologies, Inc.
    Inventor: William Alexander Strand
  • Patent number: 8725719
    Abstract: In accordance with certain embodiments, requests to collect structured data in a web page and to subscribe to that structured data are received. This structured data is stored in a data store to allow offline use of the structured data. In accordance with other embodiments, a computing device displays multiple links each of which identifies a different one of multiple web pages. Additionally, the multiple pages include structured data. The display of these multiple links is altered as the computing device detects changes to the structured data in the web pages. In accordance with other embodiments, a web page includes structured data that has been subscribed to. The computing device detects changes to the web page, and notifies a user of a change to the web page only if the change is a change to the structured data and not a change to other portions of the web page.
    Type: Grant
    Filed: February 13, 2007
    Date of Patent: May 13, 2014
    Assignee: Microsoft Corporation
    Inventors: Jane T. Kim, Walter VonKoch, Sean O. Lyndersay, Benjamin N. Truelove, Miladin Pavlicic
  • Publication number: 20140129542
    Abstract: A method of producing search results is disclosed. The method comprises, at a computerized search engine system distinct from a client system: receiving a search request associated with a user from the client system, the search request having one or more search terms; obtaining a user profile corresponding to the user, where the user profile is generated based in part on the user's prior computing activities, comprising one or more of browsing, searching, and messaging; obtaining search results for the search request; generating a personalized snippet for at least one of the search results in accordance with the obtained user profile, the snippet comprising a text portion of the search result chosen based on at least one or more search terms and one or more terms of the obtained user profile; and transmitting the search results and personalized snippet to the client system for display.
    Type: Application
    Filed: January 13, 2014
    Publication date: May 8, 2014
    Applicant: Google Inc.
    Inventors: Taher H. Haveliwala, Sepandar D. Kamvar
  • Publication number: 20140122458
    Abstract: Anchor images and information associated therewith are accumulated during a Web crawling operation. One or more rules are applied to the accumulated candidate anchor images to filter out candidate anchor images that are not appropriate for use as the anchor image for a particular target video. The remaining candidate anchor image is then selected as the anchor image for the particular video.
    Type: Application
    Filed: January 3, 2014
    Publication date: May 1, 2014
    Applicant: Microsoft Corporation
    Inventors: Xiao Kong, Wei Wang, Rui Cai, Haifeng Li, Yanfeng Sun
  • Publication number: 20140114947
    Abstract: Computer systems and methods allow users to annotate content items found in a corpus such as the World Wide Web. Annotations, which can include any descriptive and/or evaluative metadata related to a document, are collected from a user and stored in association with that user. Users are able to annotate and view their annotations for any document they encounter while interacting with the corpus, including hits returned in a search of the corpus. Users are also able to search their annotations or to limit searches to documents they have annotated. Metadata from annotations can also be aggregated across users and aggregated metadata applied in generating search results.
    Type: Application
    Filed: December 30, 2013
    Publication date: April 24, 2014
    Applicant: Yahoo! Inc.
    Inventors: Eckart Walther, Qi Lu, David Ku, Kevin Lee, Chung-Man Tam, Ali Diab
  • Publication number: 20140108377
    Abstract: Methods, systems, and computer readable media for locating a lost animal are disclosed. According to one aspect, a system includes a pet verification website host server that is configured to receive an uploaded video media file associated with an animal from a registered user, index the video media file in accordance with one or more animal feature criteria, receive a search query that includes the one or more animal feature criteria from a guest user, and provide access to one or more video media files associated with the search query to the guest user.
    Type: Application
    Filed: October 15, 2013
    Publication date: April 17, 2014
    Inventor: Stephen M. West
  • Patent number: 8688681
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying web hosting entities. In one aspect, a system includes one or more computers programmed to perform operations including maintaining an Internet Protocol (IP) address history for each hostname in a plurality of hostnames. Each IP address history is a time series of IP addresses. The operations further include organizing the hostnames into a collection of groups so that each hostname of the plurality of hostnames is a member of exactly one group in the collection of groups. Each group has a kernel calculated from the IP address histories of the members of the group, and the IP address history of each member of the group is within a threshold distance of the kernel of the group.
    Type: Grant
    Filed: June 17, 2010
    Date of Patent: April 1, 2014
    Assignee: Google Inc.
    Inventors: Li Xiao, Arup Mukherjee
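The grouping described in the abstract above can be illustrated with a greedy sketch. Several simplifications are assumed here and are not from the patent: histories are equal-length lists with one IP address per time step, distance is the fraction of time steps that differ, and a group's kernel is simply the history of its first member rather than something calculated from all members. With a fixed kernel, every member stays within the threshold distance of it by construction.

```python
def history_distance(a, b):
    """Fraction of time steps at which two equal-length IP histories differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def group_hostnames(histories, threshold=0.25):
    """Greedily place each hostname in exactly one group.

    histories maps hostname -> list of IP addresses, one per time step.
    Simplification: a group's kernel is the history of its first member.
    """
    groups = []  # each group is {"kernel": [...], "members": [...]}
    for host, history in histories.items():
        for group in groups:
            if history_distance(history, group["kernel"]) <= threshold:
                group["members"].append(host)
                break
        else:
            # No existing kernel is close enough, so start a new group.
            groups.append({"kernel": history, "members": [host]})
    return groups
```

Hostnames that end up in the same group have moved between IP addresses together, which is the signal the abstract associates with a shared web hosting entity.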
  • Patent number: 8688521
    Abstract: A system and method to facilitate matching of content to advertising information in a network are described. A request for advertising information is received over a network, the advertising information to be displayed for a user entity in association with content information within a web page requested by the user entity. Advertising information related to one or more themes of the content information on the web page is further determined, the themes representing subject matter contextually related to the content information. Advertisements are further selected from the advertising information based on keywords and metadata stored within the web page and based on a set of predetermined parameters stored within the data storage module. The selected advertisements are further ranked to obtain a ranked list of advertisements.
    Type: Grant
    Filed: July 20, 2007
    Date of Patent: April 1, 2014
    Assignee: Yahoo! Inc.
    Inventors: Andrei Zary Broder, Marcus Felipe Fontoura, Vanja Josifovski, Lance Alan Riedel
  • Patent number: 8688860
    Abstract: A method for migrating information, and a migrator for migrating information, are disclosed. The method may include extracting organizational information from at least two service providers, accessing a first at least one of the at least two service providers upon selection of a migration selection interface by the user, receiving of a first plurality of information related to the user from one of the service providers, accessing a second at least one of the at least two service providers, and writing the first plurality of information to the second at least one of the at least two service providers.
    Type: Grant
    Filed: October 31, 2013
    Date of Patent: April 1, 2014
    Assignee: LinkedIn Corporation
    Inventors: Tomy K. Isaac, Mark Kasiraja
  • Patent number: 8682723
    Abstract: Conversations in an online content universe are monitored. A social analysis module analyzes individual conversations between publishers in the online content universe. Publishers that influence a conversation are identified.
    Type: Grant
    Filed: September 14, 2009
    Date of Patent: March 25, 2014
    Assignee: Twelvefold Media Inc.
    Inventors: Todd Parsons, Mitch Ratcliffe, Rob Crumpler, Will Kessler, Kurt Freytag
  • Patent number: 8682883
    Abstract: Embodiments of the present invention relate to systems and methods for determining sets of products which are similar to each other in terms of consumers' wants and needs. Queries are performed on a particular product. Documents relating to the query are received and stored. A dictionary is created from the received documents, whereby the documents, which are text files, are scrubbed of certain data to create a scrubbed text file. Topic modeling is then performed on the cleansed text file. Various methods can be used to perform topic modeling, including, but not limited to, latent semantic analysis, nonnegative matrix factorization, and singular value decomposition.
    Type: Grant
    Filed: April 16, 2012
    Date of Patent: March 25, 2014
    Assignee: Predictix LLC
    Inventors: Loren Williams, Emir Pasalic, Nikolaos Vasiloglou
  • Publication number: 20140074816
    Abstract: The present invention provides a method and apparatus for generating a query candidate set. The method comprises automatically tagging a sequence of words in a digital document to obtain a sequence of tags, comparing the sequence of tags with one or more reference sequences and including the sequence of words in the query candidate set if the sequence of tags matches the one or more reference sequences. Each tag of the sequence of tags represents a part of speech.
    Type: Application
    Filed: June 25, 2013
    Publication date: March 13, 2014
    Inventors: Kalpana Banerjee, Surabhi Khandavalli, Vishal Shah, Gaurav Ruhela
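    Illustration: the tag-and-match step in this abstract can be sketched roughly as follows. This is a minimal sketch, not the applicants' implementation: the toy lexicon `TOY_LEXICON`, the reference sequences, and the function names are all assumptions made for the example, and a real system would use a trained part-of-speech tagger.

```python
# Toy lexicon standing in for a real part-of-speech tagger (assumption).
TOY_LEXICON = {
    "the": "DET", "fast": "ADJ", "web": "NOUN",
    "crawler": "NOUN", "indexes": "VERB", "pages": "NOUN",
}

# Hypothetical reference tag sequences that qualify a word sequence as a query candidate.
REFERENCE_SEQUENCES = {
    ("ADJ", "NOUN"),
    ("NOUN", "NOUN"),
    ("ADJ", "NOUN", "NOUN"),
}


def tag_words(words):
    """Return one part-of-speech tag per word; unknown words get the tag 'X'."""
    return tuple(TOY_LEXICON.get(w.lower(), "X") for w in words)


def query_candidates(text, max_len=3):
    """Collect word sequences whose tag sequence matches a reference sequence."""
    words = text.split()
    candidates = []
    for n in range(2, max_len + 1):
        for i in range(len(words) - n + 1):
            window = words[i:i + n]
            if tag_words(window) in REFERENCE_SEQUENCES:
                candidates.append(" ".join(window))
    return candidates


result = query_candidates("the fast web crawler indexes pages")
```

    Here `result` holds the two-word matches first, followed by the three-word match, because shorter windows are scanned before longer ones.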
  • Patent number: 8671090
    Abstract: A method of utilizing a Web Service folder interface. A user defines a folder in a local folder directory as a Web Services enabled folder. The folder includes file data and metadata corresponding to the file data. The metadata includes a configurable Web Services type property that corresponds to a remote Web Service. The metadata also includes a configurable data handling property that includes one or more allowable file formats. When a user submits the file data to the remote Web Service by selecting an option in a pull down menu of a graphical user interface (GUI) or dropping the file data in a local output folder, the operating system (OS) sends the file data to the remote Web Service. The OS automatically converts an output file received from the remote Web Service into one of the allowable file formats and updates the local file data with the output file.
    Type: Grant
    Filed: August 29, 2007
    Date of Patent: March 11, 2014
    Assignee: International Business Machines Corporation
    Inventors: Indran Naick, Jeffrey K. Wilson
  • Patent number: 8666964
    Abstract: Determining a schedule for recrawling pages is disclosed. A crawling schedule that specifies a due date at which each page is to be crawled is determined according to a first scheme. A set of pages that includes one or more pages each of which has a due date that has passed is determined. The set of pages is ordered according to a second scheme.
    Type: Grant
    Filed: April 25, 2005
    Date of Patent: March 4, 2014
    Assignee: Google Inc.
    Inventor: Jesse L. Alpert
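    Illustration: the abstract names two schemes without specifying either. The sketch below assumes, purely for illustration, that the first scheme sets a due date one estimated change interval after the last crawl and that the second scheme orders overdue pages by a hypothetical importance score. The `Page` fields and function names are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class Page:
    url: str
    last_crawled: float      # time of the last crawl, in hours
    change_interval: float   # estimated hours between content changes
    importance: float        # hypothetical importance score


def due_date(page):
    """First scheme (assumed): due one change interval after the last crawl."""
    return page.last_crawled + page.change_interval


def overdue_in_crawl_order(pages, now):
    """Select pages whose due date has passed, then order them by a
    second scheme (assumed here: highest importance first)."""
    overdue = [p for p in pages if due_date(p) <= now]
    return sorted(overdue, key=lambda p: p.importance, reverse=True)


pages = [
    Page("a.example/news", last_crawled=0.0, change_interval=1.0, importance=0.4),
    Page("b.example/home", last_crawled=0.0, change_interval=2.0, importance=0.9),
    Page("c.example/about", last_crawled=0.0, change_interval=48.0, importance=0.7),
]
order = [p.url for p in overdue_in_crawl_order(pages, now=3.0)]
```

    Separating the two functions mirrors the abstract's split: when a page becomes due and which overdue page is fetched first are decided independently.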
  • Patent number: 8666819
    Abstract: A system and method to facilitate classification and storage of events in a network are described. An event and associated content information are received from an entity over a network. The content information is further analyzed to determine one or more themes representing subject matter related to the content information. The event is further classified according to the themes into one or more corresponding categories. Finally, the event is stored into one or more corresponding databases of a data storage module according to the one or more corresponding categories.
    Type: Grant
    Filed: July 20, 2007
    Date of Patent: March 4, 2014
    Assignee: Yahoo! Overture
    Inventors: Andrei Zary Broder, Marcus Felipe Fontoura, Vanja Josifovski, Lance Alan Riedel
  • Publication number: 20140059034
    Abstract: A computer implemented method for displaying a plurality of web pages within a single web browsing display area includes determining a Uniform Resource Locator (URL) for each of the plurality of web pages to be displayed. Each of the URLs may be determined from user inputs or predefined settings. The method may also include allocating a display region within the web browsing display area to define an allocated display region and displaying the one of the plurality of web pages within the allocated display region.
    Type: Application
    Filed: November 1, 2013
    Publication date: February 27, 2014
    Applicant: Exceedland Incorporated
    Inventor: Quanying Wang
  • Publication number: 20140052709
    Abstract: A document information management system in which a search-engine-compatible interface unit allows a word in a document displayed on the screen to be specified, transfers the specified word to a search engine as a keyword to be used in the search engine, receives a search result from the search engine, and displays the search result on the screen, while a browser-compatible interface unit performs a search (a keyword search and/or global search) by using the keyword transferred from a browser and transfers a search result to the browser.
    Type: Application
    Filed: September 11, 2013
    Publication date: February 20, 2014
    Inventors: Takashi YANO, Yasuhiro TABATA, Hisashi ISHIJIMA
  • Patent number: 8655872
    Abstract: Systems and methods are provided for implementing searches using contextual information associated with a Web page (or other document) that a user is viewing when a query is entered. The page includes a contextual search interface that has an associated context vector representing content of the page. When the user submits a search query via the contextual search interface, the query and the context vector are both provided to the query processor and used in responding to the query.
    Type: Grant
    Filed: October 15, 2008
    Date of Patent: February 18, 2014
    Assignee: Yahoo! Inc.
    Inventor: Reiner Kraft
  • Publication number: 20140046926
    Abstract: The invention described herein solves the challenges encountered in searching for clinical and genomic information from multiple data sources. Systems, methods, and devices of the invention allow a user to search a number of dissimilar information sources simultaneously, and view, process, and perform correlations on the information. The invention uses faceted search to process clinical values, genomic data, subject characteristics, and population characteristics, thereby providing a user with an array of information useful for monitoring or improving the state of health of a subject or a subject population. The invention allows a user to evaluate clinical and research information in a subject-centric way, and analyze information at either the individual or the population level.
    Type: Application
    Filed: February 5, 2013
    Publication date: February 13, 2014
    Applicant: MyCare, LLC
    Inventor: MyCare, LLC
  • Patent number: 8650177
    Abstract: In an example, disclosed is a machine automated method of identifying a set of skills. In some examples, the method includes extracting a plurality of skill seed phrases from a plurality of member profiles of a social networking site, creating a plurality of disambiguated skill seed phrases by disambiguating the plurality of skill seed phrases using one or more computer processors, and de-duplicating the plurality of disambiguated skill seed phrases to create a plurality of de-duplicated skill seed phrases.
    Type: Grant
    Filed: January 24, 2012
    Date of Patent: February 11, 2014
    Assignee: LinkedIn Corporation
    Inventors: Peter N. Skomoroch, Matthew T. Hayes, Abhishek Gupta, Dhanurjay A. S. Patil
  • Patent number: 8639680
    Abstract: A method for creating a hidden text data index for ranking computerized query search results that includes generating a render tree based on a document object model (DOM) tree for a web page. The render tree includes nodes that correspond to text that will be visually displayed by a client device when executed. The method includes comparing nodes corresponding to text of the DOM tree with the nodes corresponding to text of the render tree to identify the nodes in the DOM tree that will not be visually displayed when executed by the client device. The method also includes creating a hidden text data index for the nodes corresponding to text of the DOM tree not in the render tree. The hidden text data index identifies nodes corresponding to text of the DOM tree as hidden that will not be visually displayed when executed by the client device.
    Type: Grant
    Filed: May 7, 2012
    Date of Patent: January 28, 2014
    Assignee: Google Inc.
    Inventors: Peter Ciccolo, Michael Edward Flaster
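    Illustration: the DOM-versus-render-tree comparison can be sketched with plain dictionaries. This is a simplified sketch under stated assumptions: nodes carry a hypothetical `id`, the only rendering rule modeled is `display:none`, and the function names are invented for the example. A real browser's render tree construction is far more involved.

```python
def iter_nodes(node):
    """Yield a node and all of its descendants, depth first."""
    yield node
    for child in node.get("children", []):
        yield from iter_nodes(child)


def build_render_tree(node):
    """Copy the DOM, dropping subtrees styled display:none (simplified rule)."""
    if node.get("style") == "display:none":
        return None
    kept = [build_render_tree(child) for child in node.get("children", [])]
    return {**node, "children": [child for child in kept if child is not None]}


def hidden_text_index(dom):
    """Map the id of each DOM text node that is absent from the render tree to its text."""
    render = build_render_tree(dom)
    rendered_ids = set()
    if render is not None:
        rendered_ids = {n["id"] for n in iter_nodes(render) if n.get("text")}
    return {
        n["id"]: n["text"]
        for n in iter_nodes(dom)
        if n.get("text") and n["id"] not in rendered_ids
    }


dom = {"id": "body", "children": [
    {"id": "h1", "text": "Cheap flights"},
    {"id": "spam", "style": "display:none", "children": [
        {"id": "kw", "text": "flights flights flights"},
    ]},
    {"id": "p", "text": "Book today."},
]}
index = hidden_text_index(dom)
```

    In this example only the text node nested under the `display:none` element ends up in `index`; a ranking stage could then discount or ignore that text.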
  • Patent number: 8631159
    Abstract: A method for migrating information, and a migrator for migrating information, are disclosed. The method may include extracting organizational information from at least two service providers, accessing a first at least one of the at least two service providers upon selection of a migration selection interface by the user, receiving a first plurality of information related to the user from one of the service providers, accessing a second at least one of the at least two service providers, and writing the first plurality of information to the second at least one of the at least two service providers.
    Type: Grant
    Filed: April 12, 2010
    Date of Patent: January 14, 2014
    Assignee: LinkedIn Corporation
    Inventors: Tomy K. Isaac, Mark Kasiraja
  • Patent number: 8630996
    Abstract: A data management system (102) has a memory (102B-C), and a processor (102A) coupled thereto. The processor is programmed to extract (206) historical data from a historical database according to predetermined extraction criteria, search (208) for one or more potential duplicate entries in the historical data according to a portion of selection criteria used for generating the historical database, and submit (214) a notification when one or more potential duplicate entries have been identified.
    Type: Grant
    Filed: May 5, 2005
    Date of Patent: January 14, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventor: David Bilotti
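    Illustration: one way to read the duplicate search is as grouping extracted records on a subset of the selection fields. The sketch below assumes exactly that; the record layout, the `key_fields` choice, and the text of the notification are hypothetical and not taken from the patent.

```python
from collections import defaultdict


def find_potential_duplicates(records, key_fields):
    """Group records by a subset of fields and return the id groups that
    contain more than one record (potential duplicates)."""
    groups = defaultdict(list)
    for rec in records:
        key = tuple(rec[field] for field in key_fields)
        groups[key].append(rec["id"])
    return [ids for ids in groups.values() if len(ids) > 1]


def build_notifications(duplicate_groups):
    """Produce one notification message per group of potential duplicates."""
    return [f"Potential duplicate entries: {ids}" for ids in duplicate_groups]


# Hypothetical historical data already extracted from the database.
history = [
    {"id": 1, "account": "A-17", "date": "2005-01-03", "amount": 40},
    {"id": 2, "account": "A-17", "date": "2005-01-03", "amount": 40},
    {"id": 3, "account": "B-02", "date": "2005-01-04", "amount": 15},
]
duplicates = find_potential_duplicates(history, key_fields=("account", "date"))
messages = build_notifications(duplicates)
```

    Matching on only part of the criteria flags candidates rather than confirmed duplicates, which is why the result feeds a notification instead of an automatic deletion.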
  • Patent number: 8626740
    Abstract: Determining the relevance of a page to a topic in a hierarchy is disclosed. A plurality of paths that include arrivals at the page is determined. A proportion of the paths that include relevant arrivals at the page is determined. The relevance of the page is then determined based at least in part on the proportion.
    Type: Grant
    Filed: January 13, 2006
    Date of Patent: January 7, 2014
    Assignee: Wal-Mart Stores, Inc.
    Inventors: Venky Harinarayan, Anand Rajaraman
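    Illustration: the proportion described here can be computed in a few lines once "relevant arrival" is defined. The abstract does not define it, so the sketch below assumes an arrival is relevant when the page visited immediately before it is already known to be about the topic. The path data, page names, and function name are hypothetical.

```python
def page_relevance(paths, page, topic_pages):
    """Fraction of the paths arriving at `page` that contain a relevant arrival.

    Assumed rule: an arrival is relevant when the immediately preceding
    page in the path belongs to `topic_pages`.
    """
    # Only paths that reach the page from some earlier page count as arrivals.
    arriving = [path for path in paths if page in path[1:]]
    if not arriving:
        return 0.0

    def has_relevant_arrival(path):
        return any(
            cur == page and prev in topic_pages
            for prev, cur in zip(path, path[1:])
        )

    relevant = sum(1 for path in arriving if has_relevant_arrival(path))
    return relevant / len(arriving)


paths = [
    ["cameras", "lenses", "tripods"],
    ["home", "tripods"],
    ["lenses", "tripods", "cart"],
    ["home", "cart"],
]
score = page_relevance(paths, page="tripods", topic_pages={"cameras", "lenses"})
```

    With these sample paths, three paths arrive at `tripods` and two of those arrive from an on-topic page, so `score` is two thirds.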
  • Publication number: 20140006377
    Abstract: A method and system for social media ecosystem searching. A desired person can be searched for from public search engines and social media sites directly by name and/or by unique search keywords and search categories created and publicly published by the desired person, a social media index of the desired person or a social commerce connection associated with the desired person. The search results are publicly viewable. However, communication with the desired person located within the social media ecosystem is via a private system in which a searcher must provide login information to privately communicate with the desired person. The private system helps ensure that social media index values and social commerce connections are properly established, recorded and updated for the desired person and provides a layer of security and privacy. The social media searching ecosystem is provided on a cloud communications network for mobile and non-mobile devices.
    Type: Application
    Filed: August 27, 2013
    Publication date: January 2, 2014
    Inventor: Jon Anthony ASTORE
  • Patent number: 8620895
    Abstract: During a data-access technique, a query that is associated with an organizational accounting code is used to generate a set of queries for business databases. In particular, when generating the set of queries, the organizational accounting code is mapped to a set of database-specific accounting codes using a reclassification list. After receiving answers to the set of queries (which are associated with the set of database-specific accounting codes) from the business databases, the answers are presented to the user. In this way, the user can access the business databases, which may have incompatible database-specific accounting codes, from a single environment with little or no additional effort or expense.
    Type: Grant
    Filed: November 21, 2011
    Date of Patent: December 31, 2013
    Assignee: Intuit Inc.
    Inventors: David F. Lish, Memet Firat Ozkan, Alan M. Poulin, Jason K. De Mello, Johan A. Johansson, Kathleen P. Russell
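    Illustration: the reclassification step amounts to a lookup from one organizational code to per-database codes, followed by one query per database. The sketch below uses in-memory dictionaries in place of real business databases; the code values, database names, and function names are all hypothetical.

```python
# Hypothetical reclassification list: organizational code -> database-specific codes.
RECLASSIFICATION = {
    "ORG-TRAVEL": {"payroll_db": "6100", "expenses_db": "TRV"},
}

# In-memory stand-ins for business databases with incompatible accounting codes.
DATABASES = {
    "payroll_db": {"6100": 1200.0, "6200": 50.0},
    "expenses_db": {"TRV": 830.5, "OFC": 99.0},
}


def generate_queries(org_code):
    """Map one organizational code to a (database, database-specific code) query per database."""
    mapping = RECLASSIFICATION.get(org_code, {})
    return sorted(mapping.items())


def run_query(org_code):
    """Send each generated query to its database and collect the answers."""
    return {db: DATABASES[db].get(code) for db, code in generate_queries(org_code)}


answers = run_query("ORG-TRAVEL")
```

    The caller works only with the organizational code; the per-database codes stay inside the reclassification list, which is the single place to update when a database changes its coding.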
  • Publication number: 20130339338
    Abstract: A method and system for associating labels and attribute values with items in a collection of data. Providers can associate attributes and labels with their data, or attributes and labels can be added to existing data. A preferred embodiment allows a content provider to upload data and to attach their own custom labels and attributes to items or to use predefined labels and attributes. Providers can upload data using a user interface or a bulk upload mechanism.
    Type: Application
    Filed: August 23, 2013
    Publication date: December 19, 2013
    Applicant: Google Inc.
    Inventors: Bindu Reddy, Marshall Spight, Ning Mosberger