Category Specific Web Crawling Patents (Class 707/710)
  • Patent number: 7904441
    Abstract: An apparatus and method of recovering a final display are provided. The apparatus includes a query-string-creating module creating query strings in response to a cursor-request message, a query-string-controlling module creating a first cursor as a result of processing the query strings, and returning the created first cursor to the query-string-creating module, and a cursor-recovery module storing information about the first cursor and recovering information about a second cursor in response to the cursor-request message.
    Type: Grant
    Filed: November 7, 2007
    Date of Patent: March 8, 2011
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hee-Gyu Jin, Kyoung-Gu Woo
  • Publication number: 20110055195
    Abstract: A computer-implemented system and method for keyword extraction and contextual advertisement generation are disclosed. The system in an example embodiment includes a keyword extraction service to receive from a consumer application a request for activation of a keyword extraction service via an application programming interface, the request including an identity of a content source, the request further including an identification of a particular extraction process to be used by the keyword extraction service on the identified content source; determine if the keyword extraction service has already processed the identified content source and retained extracted keywords in a data store; extract keywords from the identified content source using the particular extraction process identified in the request; and make the extracted keywords accessible to the consumer application.
    Type: Application
    Filed: November 5, 2010
    Publication date: March 3, 2011
    Applicant: eBay Inc.
    Inventors: Alec Reitter, Barb Chang, Ken Sun, Raghav Gupta, Alvaro Bolivar, Alan Lewis
  • Patent number: 7885951
    Abstract: A computer-related and/or business type method is presented for embedding one or more media hotspots within a digital media file and, in response to interaction from a separate target entity, such as via an associating request, associating one or more resultant actions with the media hotspot(s). In exchange for associating the one or more resultant actions with the media hotspot(s), an interactive media service entity being affiliated with a web site displaying the digital media file and/or a user being affiliated with the digital media file itself is compensated based upon at least one compensation plan.
    Type: Grant
    Filed: February 15, 2008
    Date of Patent: February 8, 2011
    Assignee: LMR Inventions, LLC
    Inventor: Leigh Rothschild
  • Patent number: 7885952
    Abstract: The subject disclosure pertains to systems and methods that facilitate detection of cloaked web pages. Commercial value of search terms and/or queries can be indicative of the likelihood that web pages associated with the keywords or queries are cloaked. Commercial value can be determined based upon popularity of terms and/or advertisement market value as established based upon advertising revenue, fees and the like. Commercial value can be utilized in conjunction with term frequency difference analysis to identify a cloaked page automatically. In addition, commercial values of terms associated with web pages can be used to order or prioritize web pages for further analysis.
    Type: Grant
    Filed: December 20, 2006
    Date of Patent: February 8, 2011
    Assignee: Microsoft Corporation
    Inventors: Kumar H. Chellapilla, David M. Chickering
  • Publication number: 20110025710
    Abstract: Systems and methods are described for determining manipulation history among a plurality of images. The described techniques include selecting a pair of images from the plurality of images, detecting one or more manipulations operable to transform one of the images to the other, and based on the manipulations detected, determining a parent-child relationship between the pair or pairs of images. The described techniques can further include repeating the selecting two images, detecting manipulations, and determining the parent-child relationship for each pairs of images in the plurality of images, constructing a visual migration map for the images, and presenting the visual migration map in a user readable format.
    Type: Application
    Filed: August 23, 2010
    Publication date: February 3, 2011
    Applicant: The Trustees of Columbia University In The City Of New York
    Inventors: Lyndon Kennedy, Shih-Fu Chang
  • Publication number: 20110015497
    Abstract: An apparatus and method for measuring a person's biometric data as well as associated data and for using that data to determine the person's talents and well-being state, as well as predicting an optimal career path for the person. Biometric data is measured using a sensor, a memory configured to store the biometric signals, a database configured to store and retrieve profiles, and a processor configured to compare biometric data as well as associated data with anonymous profiles stored in the database and create a profile for the person.
    Type: Application
    Filed: July 16, 2009
    Publication date: January 20, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Christian Eggenberger, Peter Kenneth Malkin, Andreas Johannes Schindler, Jeffrey William Mersereau
  • Patent number: 7873640
    Abstract: A method, apparatus and computer program product provides for a semantic analyzer to produce and rank semantic terms to reflect their relationship to the theme and topics of a document. The text and the document can have no relationship to any pre-selected keywords before the semantic analyzer performs text extraction. The semantic analyzer extracts text from a document and performs semantic analysis on the extracted text. The semantic analyzer provides a plurality of ranked semantic terms as a result of the semantic analysis and associates semantic terms with the document as semantic keywords. The semantic terms define content to be presented with the document where the content is an advertisement, a link to a remote information resource or a second document.
    Type: Grant
    Filed: March 27, 2007
    Date of Patent: January 18, 2011
    Assignee: Adobe Systems Incorporated
    Inventors: Walter Chang, Nadia Ghamrawi
  • Patent number: 7870158
    Abstract: A remote access medical image exchange system utilizes a decentralized, i.e., self-organizing, distribution system combined with bid queues to establish a market place which allows for continuously negotiated prices with control (over who reads the images, when they are read and what the fee will be for such a reading) being placed in the hands of the patient/gatekeeper and the diagnostic physician.
    Type: Grant
    Filed: January 5, 2007
    Date of Patent: January 11, 2011
    Assignee: Integrated Claims Systems, Inc.
    Inventor: Andrew L DiRienzo
  • Patent number: 7860857
    Abstract: The invention provides, in one aspect, human machine interface (HMI) methods and apparatus that permit users to search and/or view plant and other real-time process automation data in a manner similar to that with which they search and/or view pages on the Internet (web). Related aspects of the invention provide such methods and apparatus as permit users to search and/or view such real-time process automation data concurrently with such Internet web pages. Further related aspects of the invention provide such methods and apparatus as permit users to search and/or view such real-time process automation data concurrently with business data maintained in pages on an enterprise network (e.g., a LAN, WAN or otherwise).
    Type: Grant
    Filed: March 30, 2007
    Date of Patent: December 28, 2010
    Assignee: Invensys Systems, Inc.
    Inventors: Harris D. Kagan, David Hardin
  • Publication number: 20100323336
    Abstract: An electronic system generates and manages multiple knowledge bases of medical students and practitioners. The knowledge bases are organized according to and for the function of specific limited medical problems. Data regarding each problem cross-references both basic sciences and clinical courses. Users are able to create their own knowledge base with the use of teaching data, their own user generated data, and third party user generated data. Data is dynamically updated, and the knowledge bases support the future medical practice of students.
    Type: Application
    Filed: June 19, 2009
    Publication date: December 23, 2010
    Applicant: ALERT LIFE SCIENCES COMPUTING, S.A.
    Inventor: Manuel Jorge Vaz da Cunha Guimaraes
  • Patent number: 7853583
    Abstract: Embodiments of the present invention provide systems, methods and computer program products for generating search results comprising web documents with associated expert information. One embodiment of a method for generating such search results includes receiving one or more search queries, selecting one of the one or more search queries, determining one or more categories of web documents responsive to the selected search query and crawling a web graph of linked web documents to identify one or more web documents tagged as within the one or more categories responsive to the selected search query. The method further includes generating a result set of the one or more web documents identified as within the one or more categories responsive to the selected search query, ranking the result set and generating a list of ranked search results responsive to the selected search.
    Type: Grant
    Filed: December 27, 2007
    Date of Patent: December 14, 2010
    Assignee: Yahoo! Inc.
    Inventor: Joshua Schachter
  • Patent number: 7844591
    Abstract: A method for displaying an image with search results is described, including detecting a trigger related to a search request, selecting an image and retrieving the image in response to detecting the trigger, determining a search result in response to detecting the trigger, presenting the search result and the image with the search result.
    Type: Grant
    Filed: October 12, 2006
    Date of Patent: November 30, 2010
    Assignee: Adobe Systems Incorporated
    Inventors: Tyler J. Lettau, Andrew Borovsky
  • Patent number: 7840620
    Abstract: A playlist generator implements a net-based playlist generation process that comprises a multi-stage, hierarchical process. At a highest hierarchical level, the playlist generator applies parameters corresponding to a user's “general” preferences, wherein the parameters are relatively constant. At a lower level of the hierarchy, the playlist generator applies parameters corresponding to the user's “specific” preferences, wherein the parameters are dynamic time-dependent or event-dependent. The playlist generator uses the high-level parameters to generate a subset of material from a global collection of material, and applies the lower-level preferences to this subset of material in response to a change in the user's immediate preferences.
    Type: Grant
    Filed: January 17, 2005
    Date of Patent: November 23, 2010
    Assignee: Koninklijke Philips Electronics N.V.
    Inventors: Fabio Vignoli, Steffen Clarence Pauws
  • Publication number: 20100291518
    Abstract: The present invention discloses an interactive digital learning system and method using multiple representations to assist in geometry proofs. The system of the present invention links to a computer and comprises a database, a problem representation unit, a proof representation unit and a visual representation unit. The database respectively links to the computer, problem representation unit, proof representation unit and visual representation unit and has a geometry proof problem containing a given condition, a prove statement, a static geometric figure and a solution. The problem representation unit displays the geometry proof problem. The solution is presented in a formal proof unit and a proof tree unit in form of short-answer questions, multi-choice questions and logic arrangement tests to train students. The visual representation unit presents the static geometric figure and a dynamic geometric figure to deepen the students' comprehension on geometry proof problem.
    Type: Application
    Filed: May 12, 2009
    Publication date: November 18, 2010
    Inventors: Wing-Kwong WONG, Hsi-Hsun Yang, Sheng-Kai Yin
  • Patent number: 7836040
    Abstract: A method and system for creating a search result list, which can simplify a system configuration by searching a single database for search information, and also can display search information on a plurality of display areas with only one keyword purchase. According to the present invention, there may be provided a method and system for creating a search result list, which can differ from a conventional method and system of providing an individual database for each of display areas and thereby effectively managing search information, and can enroll a keyword in a single database according to one keyword purchase and thereby display search information on a plurality of display areas.
    Type: Grant
    Filed: April 20, 2007
    Date of Patent: November 16, 2010
    Assignee: NHN Business Platform Corporation
    Inventor: Woosung Lee
  • Patent number: 7831586
    Abstract: A computer-implemented system and method for keyword extraction and contextual advertisement generation are disclosed. The system in an example embodiment includes a keyword extraction service to receive from a consumer application a request for activation of a keyword extraction service via an application programming interface, the request including an identity of a content source, the request further including an identification of a particular extraction process to be used by the keyword extraction service on the identified content source; determine if the keyword extraction service has already processed the identified content source and retained extracted keywords in a data store; extract keywords from the identified content source using the particular extraction process identified in the request; and make the extracted keywords accessible to the consumer application.
    Type: Grant
    Filed: December 27, 2006
    Date of Patent: November 9, 2010
    Assignee: eBay Inc.
    Inventors: Alec Reitter, Barb Chang, Ken Sun, Raghav Gupta, Alvaro Bolivar, Alan Lewis
  • Patent number: 7827274
    Abstract: A method can be used to profile a user using network addresses, category information, and demographic data when the user requested or received information from those network addresses. A table can be created that includes the user identifier, category information, and demographic data. The user profile can be generated and based at least in part on the user identifier, category information, and at least some of the demographic data.
    Type: Grant
    Filed: March 29, 2007
    Date of Patent: November 2, 2010
    Assignee: Vignette Software LLC
    Inventor: Sean M. McCullough
  • Patent number: 7827403
    Abstract: One embodiment of the present invention provides a system that decrypts an encrypted column in a row. During operation, the system receives the encrypted column in the row. The system then determines a security domain associated with the encrypted column in the row, wherein the security domain represents a set of columns in rows encrypted using the same key. Next, the system determines a key associated with the security domain. The system then decrypts the encrypted column in the row using the key. Note that using a security domain to represent a set of columns in rows enables the database to grant access to data within the database at arbitrary levels of granularity.
    Type: Grant
    Filed: April 13, 2005
    Date of Patent: November 2, 2010
    Assignee: Oracle International Corporation
    Inventors: Daniel ManHung Wong, Chon Hei Lei
  • Patent number: 7827161
    Abstract: According to one aspect of the present invention, there is provided a method of identifying a document in a support automation system in response to receiving diagnostic data, the documents being stored in a database, comprising analysing the diagnostic data and retrieving, in response to the analysis, one more keywords, and searching the database using the one or more retrieved keywords to identifying one or more documents therein.
    Type: Grant
    Filed: March 12, 2007
    Date of Patent: November 2, 2010
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Yassine Faihe
  • Patent number: 7822743
    Abstract: A technique is described for delivering contextual information to end users of a data network which includes at least one client system associated with an end user. According to a specific embodiment, the technique of the present invention provides a contextual-based platform for delivering to an end user in real-time proactive, personalized, contextual information relating to web page content currently being displayed to the user.
    Type: Grant
    Filed: August 10, 2007
    Date of Patent: October 26, 2010
    Assignee: Kontera Technologies, Inc.
    Inventors: Assaf Henkin, Yoav Shaham, Henit Vitos, Benny Friedman, Itai Brickner
  • Patent number: 7814085
    Abstract: A system and method for scoring documents is described. One or more documents are identified responsive to a search criteria. A text match score indicating a quality of match of the identified documents is determined. A category match score is determined over categories. A document-categories score is determined indicating a quality of match between an identified document and a plurality of categories. A search criteria-categories score is determined indicating a quality of match between the search criteria and the categories. An overall score is determined based on the text match score and the category match score.
    Type: Grant
    Filed: February 26, 2004
    Date of Patent: October 12, 2010
    Assignee: Google Inc.
    Inventors: Karl Pfleger, Brian Larson
  • Publication number: 20100235475
    Abstract: A system and method for tracking and identifying digital content distributors using file sharing networks. The system monitors distribution networks, logs pertinent network and distributor information, generates network statistics, gathers evidence of content distribution, and notifies interested parties of the availability of content on file sharing networks.
    Type: Application
    Filed: March 30, 2010
    Publication date: September 16, 2010
    Inventors: Mark M. Ishikawa, Travis Hill, Lawrence Low
  • Publication number: 20100235343
    Abstract: Exemplary methods, computer-readable media, and systems are presented for learning to recommend questions and other user-generated submissions to community sites based on user ratings. The size of available training data is enlarged by taking into consideration questions without user ratings, which in turn benefits the learned model. Question or other user-generated submissions are obtained by crawling Internet-accessible Web sites including community sites. Questions and other submissions, even when not tagged, voted or indicated as “popular” or “interesting” by users are quantitatively indentified as “interesting.
    Type: Application
    Filed: September 29, 2009
    Publication date: September 16, 2010
    Applicant: Microsoft Corporation
    Inventors: Yunbo Cao, Chin-Yew Lin, Young-In Song
  • Publication number: 20100228720
    Abstract: A mobile wireless communications device includes a processor cooperating with a wireless transceiver for communicating with a remote server to search a markup language file of at least one web page stored on a remote web server to determine addresses of a plurality of web feeds on the at the least one web page. The processor also cooperates with an input device and a display to display on the display a list of the plurality of web feeds and to generate a list of selected web feeds based upon the input device. Information from the selected web feeds is downloaded using the wireless transceiver and based upon respective addresses of the selected web feeds. The downloaded information from the selected web feeds is displayed on the display.
    Type: Application
    Filed: February 26, 2009
    Publication date: September 9, 2010
    Applicant: Research In Motion Limited
    Inventors: Chris Wormald, Gerhard D. Klassen, Kalu Kalu
  • Patent number: 7792818
    Abstract: Described herein are methods for creating categorized documents, categorizing documents in a distributed database and categorizing Resulting Pages. Also described herein is an apparatus for searching a distributed database. The method for creating categorized documents generally comprises: initially assuming all documents are of type 1; filtering out all type 2 documents and placing them in a first category; filtering out all type 3 documents and placing them in a second category; and defining all remaining documents as type 4 documents and placing all type 4 documents in a third category. The apparatus for searching a distributed database generally comprises at least one memory device; a computing apparatus; an indexer; a transactional score generator; and a category assignor; a search server; and a user interface in communication with the search server.
    Type: Grant
    Filed: April 28, 2006
    Date of Patent: September 7, 2010
    Assignee: Overture Services, Inc.
    Inventors: Daniel C. Fain, Paul T. Ryan, Peter Savich
  • Publication number: 20100222709
    Abstract: A method for determining the biological age of the companion animal. A companion animal ambulates from a first region to a second region of a pressure detection unit and the footfall data is utilized in the determination of the biological age of the companion animal.
    Type: Application
    Filed: March 2, 2009
    Publication date: September 2, 2010
    Inventors: Allan John Lepine, Dennis Richard Ditmer, Lori Lee Halsey, John Russell Burr
  • Patent number: 7788254
    Abstract: A collection of web pages is modeled as a directed graph, in which the nodes of the graph are the web pages and directed edges are hyperlinks. Web pages can also be represented by content, or by other features, to obtain a similarity graph over the web pages, where nodes again denote the web pages and the links or edges between each pair of nodes is weighted by a corresponding similarity between those two nodes. A random walk is defined for each graph, and a mixture of the random walks is obtained for the set of graphs. The collection of web pages is then analyzed based on the mixture to obtain a web page analysis result. The web page analysis result can be, for example, clustering of the web pages to discover web communities, classifying or categorizing the web pages, or spam detection indicating whether a given web page is spam or content.
    Type: Grant
    Filed: September 14, 2007
    Date of Patent: August 31, 2010
    Assignee: Microsoft Corporation
    Inventors: Christopher J. C. Burges, Dengyong Zhou
  • Publication number: 20100217757
    Abstract: A computerized search engine for use in association with one or more networked social sites is disclosed. The computerized search engine includes a widgetized avatar representative of a user of at least two of the networked social sites, a crawler that crawls each of the at least two networked social sites for modification of information related to one or more contacts of the user on at least one of the at least two networked social sites, and a display of search results. The display includes the modified information from the at least two networked social sites.
    Type: Application
    Filed: February 22, 2010
    Publication date: August 26, 2010
    Inventor: Robb Fujioka
  • Patent number: 7783698
    Abstract: The claimed subject matter provides systems and/or methods that facilitate providing a generalized web service. An interface component can obtain data from a client component. Additionally, a general web service component can store the data with user selected access permissions and enable retrieving and modifying the data from any location. The general web service component can employ a centralized infrastructure, a peer-to-peer infrastructure built upon any number of client components, or a combination thereof.
    Type: Grant
    Filed: April 18, 2006
    Date of Patent: August 24, 2010
    Assignee: Microsoft Corporation
    Inventor: Kamal Jain
  • Patent number: 7783617
    Abstract: A computer implemented method of searching personals ads comprising: performing a criteria search to identify one or more personals ads; and performing an affinity search to identify personals ads having an affinity to at least one of the personals ads identified by the criteria search.
    Type: Grant
    Filed: June 25, 2003
    Date of Patent: August 24, 2010
    Assignee: Yahoo! Inc.
    Inventors: Guotao Lu, Jagdish Chand, Bryan Call, Andy Scott, Roger Urrabazo
  • Patent number: 7769766
    Abstract: A method and an apparatus to store content rating information have been disclosed. In one embodiment, the method includes receiving a user request to access a web page, sending a domain name system (DNS) request to a first one of a plurality of DNS servers from a content filtering client to get content rating information of the web page in response to the user request, and receiving from the first one DNS server a DNS response containing the content rating information to the content filtering client. Other embodiments have been claimed and described.
    Type: Grant
    Filed: May 24, 2004
    Date of Patent: August 3, 2010
    Assignee: SonicWALL, Inc.
    Inventors: Alex M. Dubrovsky, Nikolay V. Popov, Alexander Shor, Roman Yanovsky, Shunhui Zhu, Boris Yanovsky
  • Publication number: 20100191071
    Abstract: A method of patient assessment, treatment, and outcome modeling is disclosed. The method includes obtaining patient characteristic information from a current patient, defining a plurality of therapeutic factors based on the characteristic information of the current patient, and weighting the therapeutic factors. The method also includes accessing at least one database having medical records of prior patients, the medical records including prior patient characteristic information, prior patient treatment plan, and prior patient outcome, comparing the weighted factors of the current patient to the medical records of the prior patients to identify one or more relevant prior patient records, and retrieving at least a portion of the relevant prior patient records, the portion including at least the prior patient treatment plan and the prior patient outcome.
    Type: Application
    Filed: January 23, 2009
    Publication date: July 29, 2010
    Applicant: WARSAW ORTHOPEDIC, INC.
    Inventors: Kent M. Anderson, Matthew M. Morrison, Thomas Carls, Eric C. Lange, David W. Poley, Patrick Farrell Turner, Scott James Drapeau, Michael Allen Ferguson
  • Publication number: 20100174686
    Abstract: A system of reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
    Type: Application
    Filed: March 16, 2010
    Publication date: July 8, 2010
    Inventors: Anurag Acharya, Arvind Jain, Arup Mukherjee
  • Patent number: 7752200
    Abstract: A method and system for identifying search terms for placing advertisements along with search results is provided. The advertisement system selects a description of an item that is to be advertised. The advertisement system then retrieves documents that match the selected description. The advertisement system generates a score for each word of the retrieved documents that indicates relatedness of the word to the item to be advertised. After generating the scores for the words, the advertisement system identifies phrases of the words within the documents that are related to the item. The advertisement system then generates search terms for the item to be advertised from the identified phrases. The advertisement system submits the search terms and an advertisement to a search engines service for placement of a paid-for advertisement for the item.
    Type: Grant
    Filed: August 9, 2004
    Date of Patent: July 6, 2010
    Assignee: Amazon Technologies, Inc.
    Inventors: Nathaniel B. Scholl, Alexander W. DeNeui
  • Patent number: 7747600
    Abstract: A computer-implementable method and system for performing a multi-level search. The method includes performing a primary search that involves executing a query submitted by a user, and returning primary search results (a list of documents, for example). The method further includes automatically performing a secondary search. The secondary search involves identifying at least one third-party source of information based on the query, and automatically assessing a semantic interpretation of the query. The secondary search utilizes the identified at least one third-party source of information and the semantic interpretation of the query to derive secondary search results, which are displayed along with the primary search results.
    Type: Grant
    Filed: June 13, 2007
    Date of Patent: June 29, 2010
    Assignee: Microsoft Corporation
    Inventors: Krysta Svore, Chris Burges, Silviu-Petru Cucerzan
  • Patent number: 7743047
    Abstract: The concept of variability pertains to whether users exhibit consistent search interaction patterns, for example, in terms of interaction flow or information targeted. Methods are provided for analyzing variability, and then adapting search-related functionality (e.g., processes and/or interfaces) to account for variability characteristics, for example, to account for predictable search interaction behavior.
    Type: Grant
    Filed: September 26, 2007
    Date of Patent: June 22, 2010
    Assignee: Microsoft Corporation
    Inventors: Ryen White, Eric Brill, Steven Drucker, Christopher Burges
  • Patent number: 7743046
    Abstract: A precision information collating device is disclosed. The device comprises means for inputting keywords and key phrases and defining level of traverse for establishing interlinks and interdependencies between keywords and key phrases entered and using previously stored interlinks and interdependencies to form a map. The device facilitates searching for keywords and key phrases and filtering out unwanted results, indexing, rating and posting of the result obtained and means for browsing the result.
    Type: Grant
    Filed: April 18, 2006
    Date of Patent: June 22, 2010
    Assignee: Tata Consultancy Services Ltd
    Inventors: Kumar Anand, Nori Kesav Vithal, Mandaleeka Guru Prasada Lakshmi Narayana
  • Publication number: 20100153359
    Abstract: The project represents an instrument that, by a filing of historical-icono graphic information and a software, provides a innovative support to the study, to the filing and to the diagnostics of manufacts in the field of art. The software researchs, within a database containing the files of all of those attributes that, in the long run, have characterized the iconography of a subject. These attributes, then, are often typical of a certain historical period and of a localized geographical area; therefore, they are underlined because they are a precious value for the manufact's diagnostics. For example, a saint can be represented with some attributes up to a certain age and then, maybe as a consequence of what decided during a Council, from that moment onwards the representation method changes: this element is very significant for the datation.
    Type: Application
    Filed: November 6, 2007
    Publication date: June 17, 2010
    Inventor: Sara Penco
  • Publication number: 20100153360
    Abstract: A method and system for discovering a control event from electronically published documents is provided, in which a control program on a computer identifies electronically published documents stored in a plurality of network servers which potentially contain control events relevant to the control of goods and/or services, the control events being identified by reference to a user interest database containing user interest identifiers. Identified documents are analyzed by a classification program to determine whether control events are present, referring to a control event database. A control event classification is assigned to documents determined to contain at least one discovered control event, the assigned control event classification and information identifying the associated document is stored in a classification database, and a report of discovery of documents containing control events is be provided to a user.
    Type: Application
    Filed: December 8, 2009
    Publication date: June 17, 2010
    Applicant: Decernis, LLC
    Inventors: Patrick Blackmon WALDO, Andrew B. Waldo
  • Publication number: 20100153361
    Abstract: Certain embodiments of the present invention provide a system for clinical decision support including a crawler agent component. The crawler agent component is adapted to receive a search parameter. The search parameter specifies a criteria for evidence data to be searched for. The crawler agent component is adapted to initiate a search of a plurality of evidence sources based at least in part on the search parameter. The search identifies the evidence data. The evidence data is utilized by the clinical decision support system to provide decision support to a healthcare provider for a patient.
    Type: Application
    Filed: February 17, 2010
    Publication date: June 17, 2010
    Inventor: Shrikant L. Deshpande
  • Patent number: 7739258
    Abstract: One embodiment of the present invention provides a system that facilitates crawling through web-based forms to gather information to facilitate subsequent searches through content which is accessible though the web-based forms. During operation, the system first obtains web-based forms to be searched. Note that the system can obtain these web-based forms from a number of sources. For example, the system can crawl through web sites to identify web-based forms, the system can receive manually provided web-based forms, or the system can find web-based forms through methods other than crawling. Next, the system creates database entries for the identified forms. This involves obtaining and storing metadata describing the identified forms into database entries and then storing these database entries in a form database to facilitate searches through content which is accessible through the identified forms.
    Type: Grant
    Filed: April 5, 2006
    Date of Patent: June 15, 2010
    Assignee: Google Inc.
    Inventors: Alon Y. Halevy, Jayant Madhavan, David H. Ko
  • Publication number: 20100145927
    Abstract: A method and system for enhancing the relevance and usefulness of information searches, such as web searches, by introducing individual and shared user's judgment; first, to define the universe of the search, automatically internalizing the content of that universe (via a copyright-compliant system) in an automatically updated repository that can integrate other (internally generated or imported) content and enable sharing according to user preferences; and, secondly, to organize the internalized content through tagging, book marking and filtering.
    Type: Application
    Filed: January 9, 2008
    Publication date: June 10, 2010
    Inventors: Kiron Kasbekar, Chirag Kasbekar, Ghulam Mustafa
  • Patent number: 7730050
    Abstract: An information retrieval apparatus includes a display which displays document information, an input unit which adds additional information to the document information displayed the display, a first storage which stores mark symbol information specifying a particular symbol used for marking, a detector which detects an input from the input unit and decides whether or not the input additional information is identical to or similar to the mark symbol information stored in the first storage, a second storage which stores the mark symbol information and the additional information which is decided that it is similar to the mark symbol information by the detector, associating with the mark symbol information, and a retrieval unit which retrieves the mark symbol information and the additional information associated with the mark symbol information from the second storage, and a retrieval result by the retrieval unit is displayed on the display.
    Type: Grant
    Filed: March 20, 2007
    Date of Patent: June 1, 2010
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Naoki Iketani, Hideo Umeki, Kenta Cho, Sogo Tsuboi, Masayuki Okamoto
  • Publication number: 20100131797
    Abstract: A system and method for assessing and remedying accessibility of websites is provided. The method includes receiving a website address for assessment, an accessibility guideline and level of assessment to be performed from the user. The method further includes crawling the website for extracting information. The information comprises HTML tags used in designing a webpage. Thereafter, the website is scanned for checking conformance to one or more accessibility parameters. Finally, one or more assessment reports are provided to the user.
    Type: Application
    Filed: November 13, 2009
    Publication date: May 27, 2010
    Applicant: INFOSYS TECHNOLOGIES LIMITED
    Inventors: Jai GANESH, Navin KASA, Shaurabh BHARTI, Srinivas PADMANABHUNI, Mayank MATHUR, Ajay KOLHATKAR, Shrirang Prakash SAHASRABUDHE
  • Publication number: 20100131489
    Abstract: A method for performing Web searches from a mobile device comprises tracking the personal interactions between the user of the mobile device and initiator of a search process and his social relations, and ranking information retrieved from the World Wide Web according to its connection with said social relations.
    Type: Application
    Filed: November 24, 2008
    Publication date: May 27, 2010
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Claudia V. Goldman-Shenhar, Zvika Rubinstein
  • Patent number: 7720835
    Abstract: TruCast is a method for management, by way of gathering, storing, analyzing, tracking, sorting, determining the relevance of, visualizing, and responding to all available consumer generated media. Some examples of consumer generated media include web logs or “blogs”, mobile phone blogs or “mo-blogs”, forums, electronic discussion messages, Usenet, message boards, BBS emulating services, product review and discussion web sites, online retail sites that support customer comments, social networks, media repositories, and digital libraries. Any web hosted system for the persistent public storage of human commentary is a potential target for this method. The system is comprised of a coordinated software and hardware system designed to perform management, collection, storage, analysis, workflow, visualization, and response tasks upon this media.
    Type: Grant
    Filed: May 7, 2007
    Date of Patent: May 18, 2010
    Assignee: Visible Technologies LLC
    Inventors: Miles Ward, Jim Webber, Dean Michael Graziano
  • Patent number: 7707203
    Abstract: A computer system and method for capture, managing and presenting data obtained from various often unrelated postings via the Internet for examination by a user. This system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on the corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators and processing modules. The processing modules in the platform include scraping management module analyzing selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: April 27, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Joseph Ting
  • Patent number: 7707198
    Abstract: A method and apparatus for enabling a user to access media objects, such as images, from a website without requiring the user to enter the website. In one embodiment, a search engine searches for websites that match a submitted search term. A selected URL to one of the resulting websites is submitted to a harvester that accesses a web page of the selected website and identifies a media object of the web page. The harvester determines the characteristic(s) of the media object, such as a dimension, an aspect ratio, a proximity to other media objects, etc. The harvester determines a second media object with substantially the same characteristic(s). The determined media objects, or subportions, are rendered in a client user interface. Relationships are mapped between a selected media object and projects that use the object. Manipulating a rendered portion causes a related operation on the whole media object.
    Type: Grant
    Filed: December 12, 2006
    Date of Patent: April 27, 2010
    Assignee: Yahoo! Inc.
    Inventors: Karon A. Weber, Samantha M. Tripodi, David Ayman Shamma
  • Patent number: 7702675
    Abstract: A method and system for creating a database of categorized web feeds for facilitating web feed organization is disclosed. One exemplary method includes ascertaining a web feed identifier and searching a categorized collection of websites to determine a hierarchical folder path for the web feed identifier. For example, the Open Directory Project may be searched to determine an appropriate hierarchical folder path for the web feed identifier. The web feed identifier is placed in a corresponding hierarchical folder path within the database of categorized web feeds. The database of categorized web feeds can then be used as a tool for organizing web feeds on a client computer.
    Type: Grant
    Filed: November 11, 2006
    Date of Patent: April 20, 2010
    Assignee: AOL Inc.
    Inventors: Aditya Khosla, Brock D. LaPorte, Alberto Cobas, Colin Chang
  • Patent number: 7702674
    Abstract: A computer system and method for capture, managing and presenting data obtained from various often unrelated postings via the Internet for examination by a user. This system includes a scraping module having one or more scraping engines operable to scrape information data sets from listings on the corporate sites and web sites, direct feeds, and other sources, wherein the scraping module receives and stores the scraped listing information data sets in a database. The system also has a management platform coordinating all operation of and communication between the sources, system administrators and processing modules. The processing modules in the platform include scraping management module analyzing selected scraped data stored in the database, and a categorization module that examines and categorizes each data set stored in the database into one or more of a predetermined set of categories and returns categorized data sets to the database.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: April 20, 2010
    Assignee: Yahoo! Inc.
    Inventors: Adam Hyder, Sandeep Khanna, Joseph Ting