By Querying, E.g., Search Engines Or Meta-search Engines, Crawling Techniques, Push Systems, Etc. (epo) Patents (Class 707/E17.108)
  • Publication number: 20120239637
    Abstract: A new approach is proposed that contemplates systems and methods to examine and determine quality of objects cited by citations in a search result based on a citation graph that includes citing subjects, citations, and cited objects. First, influence scores of a plurality of subjects/sources that compose the citations of the objects in the search result are calculated. The quality of the objects cited by the subjects can then be determined by examining the influence scores for the subjects of the citations. Finally, the cited objects selected can be presented to a user or provided to a thirty party for further processing together with the relevant citations and citing subjects.
    Type: Application
    Filed: June 13, 2011
    Publication date: September 20, 2012
    Inventors: Vipul Ved Prakash, Lun Ted Cui, Rishab Aiyer Ghosh, Thomas James Emerson
  • Publication number: 20120239636
    Abstract: The invention relates to a method and system for processing complex queries each corresponding to a plurality of components to be combined. Elements corresponding to these components are searched. The elements are of different element types and are originally described by definition data having heterogeneous data structures. A conversion step transforms the heterogeneous definition data into converted definition data of one single format. An engine then retrieves elements and builds associations of elements matching the query parameters based on the converted definition data, thereby providing with a reply to the query in an optimized manner.
    Type: Application
    Filed: March 24, 2011
    Publication date: September 20, 2012
    Inventors: David Pauchet, Benoît Janin, Rudy Daniello, Thierry Dufresne, Marc Patoureaux
  • Publication number: 20120239639
    Abstract: A search engine to index web content with user content. A server computer receives, from a first client computer operated by a first user, an identification of first web content displayed by a web browser of the first client computer in a main browser window. The identification of the first web content is transmitted by the first user to the server computer via a user interface separate from the main browser window. The server computer then indexes the first web content. In response to receiving a search query from a web browser of a second client computer operated by a second user, the server computer transmits search results to the web browser of the second client computer. The search results include the first web content identified by the first user in a position relative to identifications of other web content received from other users.
    Type: Application
    Filed: March 14, 2012
    Publication date: September 20, 2012
    Applicant: SLANGWHO, INC.
    Inventors: John S. Shriber, Roman Zaks
  • Publication number: 20120239640
    Abstract: A mobile station is arranged to determine its location, which is searched against street addresses from a database, and at least one matching street address is retrieved. The street address is searched on the mobile station and/or over the network. The search engine conducts a search in the mobile station file system and/or the Internet and/or a file system over the network with the at least one query term, —at least one search result is arranged to be displayed to user on the screen of the mobile station. This facilitates on-demand effortless Mobile Internet Search that allows the users to access opportunities that they did not know about, or would not have had time to find out about with minimum effort as the software of the mobile phone is scanning the Internet and information pages for these opportunities and displaying the results dynamically on the mobile phone screen.
    Type: Application
    Filed: April 17, 2012
    Publication date: September 20, 2012
    Inventor: Mikko VÄÄNÄNEN
  • Publication number: 20120239497
    Abstract: In various exemplary embodiments, a system, a method, and a related non-transitory computer-readable storage medium, of targeting advertisements based on a lifestyle change for an individual is disclosed. The method includes scanning a plurality of social sites to determine the lifestyle change and making a determination whether one or more keywords related to the lifestyle change are found in at least one of the plurality of social sites. Based on the determination that the one or more keywords are found in at least one of the plurality of social sites, the method continues with querying an end-user of the at least one of the plurality of social sites, determining an event related to the lifestyle change, matching an advertisement targeted to the event, and electronically sending the targeted advertisement to either the end-user or the individual.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 20, 2012
    Applicant: eBay Inc.
    Inventor: Frank Anthony Nuzzi
  • Publication number: 20120239667
    Abstract: The keyword extraction technique described herein extracts keywords from Uniform Resource Locators (URLs) in web logs. The technique leverages the content and the structure of URLs to extract relevant keywords. First, a URL is divided into multiple components based on its structure. A set of keywords are extracted from each component of the URL independently with the help of a controlled vocabulary. Then a second set of keywords are generated by forming combinations of terms from different segments of the URL. Only those combinations which are present in the controlled vocabulary are retained as keywords. Finally, the keywords are scored with a function which took into account of a wide set of features.
    Type: Application
    Filed: March 15, 2011
    Publication date: September 20, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Santosh R. Vysyaraju, Uppinakuduru Raghavendra Udupa, Abhijit N. Bhole, Guy Dassa, Weiguo Liu, Qing Xiao
  • Publication number: 20120239519
    Abstract: Some embodiments provide personalized search services. The personalized search services operate to contextualize what is being searched for and to provide at least some search results in that context that are specific to the user to whom the results are to be provided.
    Type: Application
    Filed: March 15, 2011
    Publication date: September 20, 2012
    Applicant: eBay Inc.
    Inventors: Xiaoyuan Wu, Sunil Mohan
  • Patent number: 8271425
    Abstract: In an image processing system including a plurality of image processing devices, each image processing device includes an authenticating unit, a function implementing unit and a function specifying unit. The authenticating unit performs user authentication. The function specifying unit specifies a specific image processing function in another image processing device for each of the users. The function implementing unit implements the specific image processing function for the user already logging in the authenticating unit.
    Type: Grant
    Filed: October 20, 2005
    Date of Patent: September 18, 2012
    Assignee: Konica Minolta Business Technologies, Inc.
    Inventor: Kenji Matsuhara
  • Publication number: 20120233096
    Abstract: Historical usage data related to user queries and training properties for a plurality of web pages is received and utilized to train a mathematical model to predict the likelihood of retrieval of a web page during a web search. Properties are extracted from the plurality of web pages in the index and the mathematical model is applied to the properties for each web page to calculate a sortrank value. The index is reordered based on the sortrank value such that the web pages most likely to be retrieved by a user submitting a search query appear first in the index. After a search query is received from a user the index is traversed in an order determined by the sortrank value. Responsive web pages are presented to the user in an order determined by a search engine ranking algorithm.
    Type: Application
    Filed: March 7, 2011
    Publication date: September 13, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: ATUL KUMAR GUPTA, ANNA V. TIMASHEVA, YUAN WANG, RAJKIRAN PANUGANTI, GARGI GHOSH, CHAOPING QIN, YASSER GANJISAFFAR, GIRISH KUMAR, HONGYAN ZHOU
  • Publication number: 20120233147
    Abstract: Indexing and searching features are provided including associated system, methods, and other implementations. A computing system of an embodiment is configured to reuse or repurpose physical index fields for different tenants as part of providing efficient and scalable indexing and searching services. A method of one embodiment operates to provide an indexed data structure that includes a number of reusable index fields that are shared and used to index information associated with a plurality of tenants. Other embodiments are included.
    Type: Application
    Filed: March 11, 2011
    Publication date: September 13, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Helge Grenager Solheim, Øystein Fledsberg, Evan Matthew Roark, Michael Susaeg
  • Publication number: 20120233144
    Abstract: Various embodiments described herein provide systems, methods, and software to automatically reorder search results presented to users based on information specific to the user or the computing environment of the user. Some embodiments include a data store holding user or environment specific data that is used to identify search results that are more likely to be relevant to the user. These and other embodiments are described in greater detail herein.
    Type: Application
    Filed: May 18, 2012
    Publication date: September 13, 2012
    Inventors: Barbara Rosario, William Noah Schilit
  • Publication number: 20120233140
    Abstract: A model generation module is described herein for using a machine learning technique to generate a model for use by a search engine. The model assists the search engine in generating alterations of search queries, so as to improve the relevance and performance of the search queries. The model includes a plurality of features having weights and levels of uncertainty associated therewith, where each feature defines a rule for altering a search query in a defined manner when a context condition, specified by the rule, is present. The model generation module generates the model based on user behavior information, including query reformulation information and user preference information. The query reformulation information indicates query reformulations made by at least one agent (such as users). The preference information indicates at extent to which the users were satisfied with the query reformulations.
    Type: Application
    Filed: March 9, 2011
    Publication date: September 13, 2012
    Applicant: Microsoft Corporation
    Inventors: Kevyn B. Collins-Thompson, Ni Lao
  • Publication number: 20120233143
    Abstract: Systems and method for providing an image-based search interface. In one embodiment, for example, there is provided a method comprising displaying an image, and upon a user's activation of the image, presenting to the user a pre-populated search interface. There is also provided an image processing method for providing a web user with a pre-populated search interface, comprising: (a) receiving an image from a source; (b) analyzing the image to identify the subject matter within the image; (c) generating a search tag based on the subject matter within the image; and (d) sending the search tag to the source. In one embodiment, the systems and methods described herein are used in computer-implemented advertising.
    Type: Application
    Filed: February 16, 2012
    Publication date: September 13, 2012
    Inventor: James R. Everingham
  • Publication number: 20120233181
    Abstract: The present invention may provide a system, method and computer program for searching a sub-domain that involves linking to other sub-domains. In particular, the present invention may provide a sub-domain search that ranks and/or weighs search results based on relevance, such relevance being determined by a calculation operating on a broader range of domain and/or sub-domains than a single sub-domain. The present invention may weight and score a search of web pages in a sub-domain based on the content of web pages in other sub-domains.
    Type: Application
    Filed: September 24, 2010
    Publication date: September 13, 2012
    Inventors: Shady Shehata, Fakhri Karray, Mohamed Salem Kamel
  • Publication number: 20120233141
    Abstract: An apparatus is provided that includes a processor and memory storing executable instructions that in response to execution by the processor cause the apparatus to at least perform a number of operations. The apparatus is caused to query a search engine index based upon a user query including a search term, and based upon one or more synonyms of the term. The search engine index is of a database of unstructured, free-text reports, and the search engine index is queried to locate identifiers of image studies associated with reports including the search term or synonym(s). The apparatus may be caused to sort or filter the search results based on patient information from the image studies of the located identifiers, receive user selection of one or more image studies from the search results, and retrieve the selected one or more image studies.
    Type: Application
    Filed: March 9, 2011
    Publication date: September 13, 2012
    Inventor: Stephen Lambie
  • Publication number: 20120233145
    Abstract: To provide an improved user experience for users of a web browser, embodiments of the invention save queries entered by a user via the web browser. The queries may be for execution on different network search services, such as search engines, social networks, message posting services, and the like. At various times in the future, the web browser then executes the saved search queries on their corresponding network search services, identifies search results that are new and highly relevant to the user, and provides the identified search results to the user.
    Type: Application
    Filed: May 22, 2012
    Publication date: September 13, 2012
    Applicant: RockMelt, Inc.
    Inventors: Timothy Howes, Eric Vishria
  • Publication number: 20120226708
    Abstract: Media collections (MC) service embodiments are presented which generally facilitate access to diverse forms of media by resolving an identifier tuple assigned to a content item into a set of one or more Uniform Resource Identifiers (URIs) which point to an instance of the content item. This scheme supports the upload and query of collections of media elements such as images, audio, video, deep zoom images, photosynth and so on. In addition, the foregoing scheme affords a standard way to bind to media that persists, and makes it easier to author and play content while being flexible about where the media is located.
    Type: Application
    Filed: March 1, 2011
    Publication date: September 6, 2012
    Applicant: Microsoft Corporation
    Inventors: Gopal Ranganatha Srinivasa, Joseph M. Joy
  • Publication number: 20120226696
    Abstract: In various embodiments, a transcript that represents a media file is created. Keyword candidates that may represent topics and/or content associated with the media content are then be extracted from the transcript. Furthermore, a keyword set may be generated for the media content utilizing a mutual information criteria. In other embodiments, one or more queries may be generated based at least in part on the transcript, and a plurality of web documents may be retrieved based at least in part on the one or more queries. Additional keyword candidates may be extracted from each web document and then ranked. A subset of the keyword candidates may then be selected to form a keyword set associated with the media content.
    Type: Application
    Filed: March 4, 2011
    Publication date: September 6, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Albert Joseph Kishan Thambiratnam, Sha Meng, Gang Li, Frank Torsten Bernd Seide
  • Publication number: 20120226675
    Abstract: A computer identifies multiple resource identifiers in accordance with a first set of predefined criteria for selecting a respective document that satisfies user-specified search keywords from a user. Each resource identifier corresponds to a document at a respective data source. For at least one of the resource identifiers, the computer retrieves the corresponding document from the respective document source; identifies within the retrieved document a chunk by applying a second set of predefined criteria to the retrieved document; and provides the identified chunk and a link to the identified chunk within the document for display to the user. The first set of predefined criteria requires that at least a first subset of the search keywords be found within an identified respective document, and the second set of predefined criteria requires that at least a second subset of the search keywords be found within an identified chunk.
    Type: Application
    Filed: March 27, 2012
    Publication date: September 6, 2012
    Inventors: Jeffrey Matthew Dexter, Robert Smik
  • Publication number: 20120226678
    Abstract: Methods for optimizing social media are disclosed. Such methods may include identifying at least one keyword utilized for at least one webpage, identifying social media correspondence referencing the at least one keyword, analyzing content collected from the social media to determine a frequency of references to the at least one keyword and generating at least one report including information based on the analysis. The report may include recommendations for optimizing social media by, for example, increasing visibility by using high-performing keywords. Systems for performing the methods are also disclosed.
    Type: Application
    Filed: March 1, 2012
    Publication date: September 6, 2012
    Applicant: BRIGHTEDGE TECHNOLOGIES, INC.
    Inventors: LEMUEL S. PARK, JIMMY YU, SAMMY YU, EMEKA AJOKU, THOMAS J. ZIOLA
  • Publication number: 20120226661
    Abstract: Documents are replicated among servers comprising a search engine based on the value of each document by approximating its value as one of the top search results for one or more exemplary queries. Documents are allocated among servers comprising a search engine by calculating a relevance value for each document and then distributing the documents evenly to the servers. A subset of servers are selected from among a plurality of servers comprising a search engine using term-based, server-specific histograms reflecting the number of instances of the term in each document allocated to each server, and then selecting servers to service a query based on the documents on those servers.
    Type: Application
    Filed: March 3, 2011
    Publication date: September 6, 2012
    Applicant: Microsoft Corporation
    Inventors: Krishnaram N. G. Kenthapadi, Shuai Ding, Sreenivas Gollapudi, Samuel Ieong, Alexandros Ntoulas
  • Publication number: 20120226676
    Abstract: A method for adaptation of a free text query to a customized query. The method comprises selecting at least one resource from a plurality of resources of information for responding to a query received from a user device; performing an analysis of the received query; performing at least one of: customizing the query to meet an input query format of a selected at least one resource, or and providing an input query that is transformed to meet an input requirement of the selected at least one resource; and sending the customized query to the selected at least one resource, wherein each of the selected at least one resource receives an appropriately customized query format.
    Type: Application
    Filed: May 17, 2012
    Publication date: September 6, 2012
    Applicant: DOAT MEDIA LTD.
    Inventors: Rami Kasterstein, Amihay Ben-David, Joey Joseph Simhon
  • Publication number: 20120226677
    Abstract: Examples of methods, systems, and computer-readable media for detection of sensitive information are described using multiple techniques. The techniques may include applying pre-defined field structure layouts to records, applying simple template structure to records as a single field, and inferring data structure by building a map of potential packed decimal locations. The resulting information may then be analyzed for detection of sensitive information.
    Type: Application
    Filed: March 1, 2011
    Publication date: September 6, 2012
    Applicant: Xbridge Systems, Inc.
    Inventors: Benjamin R. Bolton, James M. Sagawa
  • Publication number: 20120221547
    Abstract: Embodiments of the present invention are directed to automated information-search and information-retrieval systems that provide information, on a continuous or periodic basis, to users or subscribers. In one embodiment of the present invention, information is gathered from a user's computer, or from computers accessible from the user's computer, on an essentially continuous basis in order to provide a database of information from which meaningful and focused search queries can be automatically constructed. The search queries are then employed to find, on behalf of the user or subscriber, current information useful to, and needed by, the user or subscriber.
    Type: Application
    Filed: February 24, 2012
    Publication date: August 30, 2012
    Applicant: GIST INC. FKA MINEBOX INC.
    Inventors: Stephen G. HALL, Thomas A. McCann, III, Timothy David CASE, Adam LOVING, Matthew HARTZLER, Tobias James Padilla
  • Publication number: 20120221539
    Abstract: Embodiments provide methods and systems for encoding and decoding variable-length data, which may include methods for encoding and decoding search engine posting lists. Embodiments may include different encoding formats including group unary, packed unary, and/or packed binary formats. Some embodiments may utilize single instruction multiple data (SIMD) instructions that may perform a parallel shuffle operation on encoded data as part of the decoding processes. Some embodiments may utilize lookup tables to determine shuffle sequences and/or masks and/or shifts to be utilized in the decoding processes. Some embodiments may utilize hybrid formats.
    Type: Application
    Filed: March 31, 2011
    Publication date: August 30, 2012
    Applicant: A9.com, Inc.
    Inventors: Daniel E. Rose, Alexander A. Stepanov, Anil Ramesh Gangolli, Paramjit S. Oberoi, Ryan Jacob Ernst
  • Publication number: 20120221543
    Abstract: Systems and methods are provided for improved web searching. In one implementation, suggested search queries are provided based on previous search queries and click data. A weighted bi-partite graph or index may be used to identify related search queries based on overlapping clicked URLs. According to a method, query-click log data of a search engine is processed to generate sets of suggested search queries, data corresponding to each suggested search query, and a set of clicked URLs related to each suggested search query. Additionally, or independently, methods may be provided for contextually correcting spelling errors within sets of suggested search queries using a contextual algorithm, and/or identifying and discarding sets of suggested search queries and URLs that lead to restricted material, such as restricted content and related URLs.
    Type: Application
    Filed: May 7, 2012
    Publication date: August 30, 2012
    Inventors: Sean Christopher TIMM, Sudhir Achuthan
  • Publication number: 20120221545
    Abstract: Desired content, metadata, or both can be isolated from the full content of social media websites having content-rich pages. Achieving this can include obtaining from the content-rich pages a language-independent representation having a hierarchical structure of nodes and then generating a node representation for each node. Feature vectors for the nodes are generated and a label is assigned to each node representation according to a schema. Assignment can occur by executing a trained classification algorithm on the feature vectors. The schema has schema elements and each schema element corresponds to a label. For each schema element, all node representations having matching labels are gathered and then one node representation is elected from among those with matching labels to be assigned to a schema element field in a template. The template can be applied to extract desired content, metadata, or both according to the schema from all the content-rich pages.
    Type: Application
    Filed: February 28, 2011
    Publication date: August 30, 2012
    Applicant: BATTELLE MEMORIAL INSTITUTE
    Inventors: Eric B. Bell, Shawn J. Bohn, Andrew J. Cowell, Michelle L. Gregory, Eric J. Marshall, Deborah A. Payne
  • Publication number: 20120221546
    Abstract: A method for facilitating Web content aggregation initiated by a client is disclosed. A Web site aggregation list is created. At least one Web site in the aggregation list is spidered from a user-identified computer. At least one attribute of content of the at least one spidered Web site is merged with at least one attribute of content of another Web site. The merged attributes are displayed to a user.
    Type: Application
    Filed: February 23, 2012
    Publication date: August 30, 2012
    Inventors: Lawrence C. Rafsky, Robert E. Ungar, Thomas B. Donchez, Jonathan A. Marshall
  • Publication number: 20120221540
    Abstract: Embodiments provide methods and systems for encoding and decoding variable-length data, which may include methods for encoding and decoding search engine posting lists. Embodiments may include different encoding formats including group unary, packed unary, and/or packed binary formats. Some embodiments may utilize single instruction multiple data (SIMD) instructions that may perform a parallel shuffle operation on encoded data as part of the decoding processes. Some embodiments may utilize lookup tables to determine shuffle sequences and/or masks and/or shifts to be utilized in the decoding processes. Some embodiments may utilize hybrid formats.
    Type: Application
    Filed: March 31, 2011
    Publication date: August 30, 2012
    Applicant: A9.com, Inc.
    Inventors: Daniel E. Rose, Alexander A. Stepanov, Anil Ramesh Gangolli, Paramjit S. Oberoi, Ryan Jacob Ernst
  • Publication number: 20120215759
    Abstract: A building automation system (BAS) comprising a plurality of end devices, at least one communication network, and a server engine comprising a data harvester. The end devices are each associated with at least one of a space, a system, or a subsystem for at least a portion of a building or a campus. The communication network communicatively couples to at least a portion of the plurality of end devices to the server engine. In one embodiment, the server engine is adapted to dynamically implement the data harvesting capability to periodically establish communications with, to receive and store data about, end devices and to selectively control the utilization of the communication network in order to prevent overrun or data loss. Methods of handling log collection from end devices in a building automation system (BAS) based upon a distributed schedule provided by a user or a priority scheme are also disclosed.
    Type: Application
    Filed: April 25, 2012
    Publication date: August 23, 2012
    Inventors: Sean M. McCoy, Shane M. Gydesen, Christopher M. Markus
  • Publication number: 20120215758
    Abstract: In one embodiment, a method includes generating, by a computer system, a search-engine query from stored identity-theft nomenclature. The method also includes querying, by the computer system, at least one search engine via the search-engine query. Further, the method includes crawling, by the computer system, at least one computer-network resource identified via the querying. In addition, the method includes collecting, by the computer system, identity-theft information from the at least one computer-network resource. Additionally, the method includes processing, by the computer system, the identity-theft information for compromised personally-identifying information (PII).
    Type: Application
    Filed: February 16, 2012
    Publication date: August 23, 2012
    Inventors: Harold E. Gottschalk, JR., Michael Caldwell, Joel Carleton
  • Publication number: 20120215761
    Abstract: Embodiments of the present invention are directed to automated information-search and information-retrieval systems that provide information, on a continuous or periodic basis, to users or subscribers. In one embodiment of the present invention, information is gathered from a user's computer, or from computers accessible from the user's computer, on an essentially continuous basis in order to provide a database of information from which meaningful and focused search queries can be automatically constructed. The search queries are then employed to find, on behalf of the user or subscriber, current information useful to, and needed by, the user or subscriber.
    Type: Application
    Filed: February 24, 2012
    Publication date: August 23, 2012
    Applicant: GIST INC. FKA MINEBOX INC.
    Inventors: Stephen G. HALL, Thomas A. McCann, III, Timothy David CASE, Adam LOVING, Matthew HARTZLER, Tobias James Padilla
  • Publication number: 20120215756
    Abstract: A gateway is provided that includes an integration gateway portion, a domain gateway portion, and a hyper-memory portion is provided. The integration gateway portion has an integration rules engine, a search engine, and a first virtual machine. The domain gateway portion has a domain rules engine. The hyper-memory portion has a hyper-memory engine, a hyper-memory, and a second virtual machine. The integration portion accesses a database via the integration rules engine and the first virtual machine or via the search engine and the first virtual machine. The domain gateway portion accesses datasets of the database that are resident in the hyper-memory via the domain objects rules engine and the hyper-memory engine or via the search engine, the second virtual machine, and the hyper-memory engine.
    Type: Application
    Filed: May 1, 2012
    Publication date: August 23, 2012
    Inventors: Scott Edward Fraser, Suresh Venkata Muppalla
  • Publication number: 20120215755
    Abstract: Techniques are disclosed for providing a domain-aware snippet for a search result. A uniform resource locator (URL) is identified for a search result obtained in response to a search query, and it is determined that the URL corresponds to a single domain that has a plurality of web pages that are generated using a template that is common to each of the web pages in the domain. The template comprises a hypertext markup language (HTML) layout pattern that includes multiple sections shared by the web pages. A ranking value is assigned to the multiple sections and is used to identify a first section of the template that is relevant to the search query. A snippet is provided to a user for the search result; the snippet includes at least a portion of text from the first section.
    Type: Application
    Filed: May 1, 2012
    Publication date: August 23, 2012
    Applicant: MICROSOFT COPORATION
    Inventors: Girish KUMAR, Fang LIU
  • Publication number: 20120215757
    Abstract: A crawler including a document retriever configured to retrieve a first computer-based document, a link identifier configured to identify an actual string within the computer-based document as being a hyperlink-type string, and a static analyzer configured to perform static analysis of an operation on a variable within the first computer-based document to identify a possible string value of the variable as being a hyperlink-type string, where any of the strings indicate a location of at least a second computer-based document.
    Type: Application
    Filed: February 22, 2011
    Publication date: August 23, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Omri Weisman, Yinnon Avraham Haviv, Adi Sharabani, Omer Tripp, Marco Pistoia, Takaaki Tateishi, Guy Podjarny
  • Publication number: 20120215760
    Abstract: One example embodiment includes a method for indexing online references of an entity. The method includes identifying one or more channels of the Internet to be searched for references to an entity and identifying one or more signals to be evaluated within each of the one or more channels. The method also includes crawling the Internet for online references to the entity, wherein crawling the Internet comprises searching the one or more channels of the Internet for references to the entity and evaluating the one or more signals. The method further includes constructing a reverse index of the references, wherein the reverse index is based on each channel in which a reference is found and the one or more signals evaluated for the reference.
    Type: Application
    Filed: April 27, 2012
    Publication date: August 23, 2012
    Applicant: BRIGHTEDGE TECHNOLOGIES, INC.
    Inventors: Lemuel S. PARK, Jimmy YU
  • Publication number: 20120215533
    Abstract: A method of and system for error correction in multiple input modality search engines is presented. A method of processing input information based on an information type of the input information includes receiving input information for performing a search for identifying at least one item desired by a user and determining an information type associated with the input information. The method also includes forming a query input for identifying the at least one item desired by the user based on the input information and on the information type. The method further includes submitting the query input to at least one search engine system.
    Type: Application
    Filed: January 25, 2012
    Publication date: August 23, 2012
    Applicant: Veveo, Inc.
    Inventors: Murali Aravamudan, Pankaj Garg, Rakesh Barve, Ajit Rajasekharan
  • Publication number: 20120215762
    Abstract: Embodiments of the present invention are directed to automated information-search and information-retrieval systems that provide information, on a continuous or periodic basis, to users or subscribers. In one embodiment of the present invention, information is gathered from a user's computer, or from computers accessible from the user's computer, on an essentially continuous basis in order to provide a database of information from which meaningful and focused search queries can be automatically constructed. The search queries are then employed to find, on behalf of the user or subscriber, current information useful to, and needed by, the user or subscriber.
    Type: Application
    Filed: February 24, 2012
    Publication date: August 23, 2012
    Applicant: GIST INC. FKA MINEBOX INC.
    Inventors: Stephen G. HALL, Thomas A. McCann, III, Timothy David CASE, Adam LOVING, Matthew HARTZLER, Tobias James Padilla
  • Publication number: 20120209826
    Abstract: An approach is provided for providing location based information according to a predetermined format. A location information manager associates location information with web content. The location information manager also causes, at least in part, publication of the web content and the associated location information according to a predetermined format, wherein the predetermined format facilitates, at least in part, discovery of the location information.
    Type: Application
    Filed: February 10, 2011
    Publication date: August 16, 2012
    Applicant: Nokia Corporation
    Inventor: Petros Belimpasakis
  • Publication number: 20120209828
    Abstract: The present invention includes: acquiring plural web pages of an identical category into which targets stated in the web pages are classified (S1); acquiring an attribute-related term related to an attribute of the targets stated in the web pages or an attribute description pattern used to describe the attribute of the targets as initial data (S2); extracting the attribute-related term of the attribute matching the attribute description pattern from the plural web pages (S3); and extracting an attribute description pattern matching the attribute-related term from plural web pages (S4).
    Type: Application
    Filed: February 28, 2011
    Publication date: August 16, 2012
    Applicant: RAKUTEN, INC.
    Inventors: Takamasa Takenaka, Satoshi Sekine
  • Publication number: 20120209827
    Abstract: A method, apparatus, article of manufacture for generating a media program database having a plurality of media programs is disclosed. In one embodiment, the method is comprises the steps of receiving first media program metadata from a first source, searching the Internet to find second media program metadata from a second source distinct from the first source, determining if the first media program metadata and the second media program metadata are associated with the same media program, merging the first media program metadata and the second media program metadata if the first media program metadata and the second media program metadata are associated with the same media program, and storing the merged first media program metadata and second media program metadata in the media program database.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 16, 2012
    Applicant: HULU LLC
    Inventors: Zhibing Wang, Yizhe Tang, Qian Chang, Ting-hao Yang
  • Publication number: 20120209986
    Abstract: A method and system are disclosed for monitoring user interactions and generating proactive responses thereto within a social media environment. Social media interactions are monitored, collected, and processed to determine whether they contain content outside of a threshold. If so, they are processed to determine the content causing the content to be outside of the threshold. Once the issues have been determined, proactive actions are performed to counteract the affect of the content.
    Type: Application
    Filed: February 15, 2011
    Publication date: August 16, 2012
    Inventors: Shesha Shah, Rajiv Narang
  • Patent number: 8244711
    Abstract: A system, method, and apparatus for information retrieval are provided. Embodiments of the present invention may generate data structures that may be used to process user queries. According to embodiments of the present invention, a processor component configured to perform the operations of an indexing module and a storage module, the indexing module configured to generate a term list and a term-file matrix from information stored on the storage module, the indexing module further configured to generate an adjacency matrix from the one or more files, wherein the adjacency matrix represents a relationship of the one or more terms in each of the one or more files; and the indexing module further configured to generate a probability matrix using the adjacency matrix and a one-step or two-step random walk.
    Type: Grant
    Filed: September 28, 2009
    Date of Patent: August 14, 2012
    Inventor: Chin Lung Fong
  • Publication number: 20120203752
    Abstract: A classification method includes constructing queries from category descriptors representing categories of a taxonomy of hierarchically organized categories. The query constructed for a category c includes a query component based on descriptors of the category c and at least one query component based on descriptors of an ancestor or descendant category of the category c. A documents database is queried using the constructed queries to retrieve pseudo-relevant documents. Language models for the categories of the taxonomy are extracted from the pseudo-relevant documents by inferring a hierarchical topic model representing the taxonomy. An input document is classified by optimizing mixture weights of a weighted combination of categories of the hierarchical topic model respective to the input document.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: XEROX CORPORATION
    Inventors: Viet Ha-Thuc, Jean-Michel Renders
  • Publication number: 20120203754
    Abstract: A pattern matching accelerator (PMA) for assisting software threads to find the presence and location of strings in an input data stream that match a given pattern. The patterns are defined using regular expressions that are compiled into a data structure comprised of rules subsequently processed by the PMA. The patterns to be searched in the input stream are defined by the user as a set of regular expressions. The patterns to be searched are grouped in pattern context sets. The sets of regular expressions which define the pattern context sets are compiled to generate a rules structure used by the PMA hardware. The rules are compiled before search run time and stored in main memory, in rule cache memory within the PMA or a combination thereof. For each input character, the PMA executes the search and returns the search results.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Giora Biran, Christoph Hagleitner, Timothy H. Heil, Jan Van Lunteren
  • Publication number: 20120203759
    Abstract: The present invention relates to a method for writing a newly recognized image. The method includes the steps of: (a) comparing pre-stored image in the image database with a queried image; (b) storing the queried image onto a database for unrecognized images if there is no image similar to the queried image; (c) grouping the images in the database for unrecognized images based on degrees of similarity thereamong; and (d) comparing, if a specific image and its tag information are inputted, the specific image with some images included in a specific set of images among the organized sets of the images, determining whether there is any image in the specific set of images which has a degree of similarity exceeding the pre-set value and allowing images determined to have degrees of similarity exceeding the pre-set value with the tag information to be automatically written onto the image database.
    Type: Application
    Filed: November 17, 2011
    Publication date: August 9, 2012
    Applicant: OLAWORKS, INC.
    Inventors: Tae Hoon Kim, Min Je Park, Song Ki Choi
  • Publication number: 20120203757
    Abstract: Methods, apparatus, and articles of manufacture to measure search results are disclosed. A disclosed example method to measure search results includes identifying a preview event for a search result associated with a search query, and storing the preview event in association with a search engine identifier and a web page identifier.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Inventor: Balaji Ravindran
  • Publication number: 20120203751
    Abstract: An approach is provided with a search request including search terms and a user identified as a member of a common group. A search engine receives search results based on the search request and as set of previously searched data corresponding to the group of users by comparing with the search terms. The comparison results in refined search results that are displayed. A further approach is provided with a search request with search terms being compared against group historical search data to identify historical search terms as well as historical search actions. A search action request corresponding to one of the historical actions is received and executed by the information handling system.
    Type: Application
    Filed: February 7, 2011
    Publication date: August 9, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dhanashree Srivastava, Amy Delphine Travis
  • Publication number: 20120203755
    Abstract: A pattern matching accelerator (PMA) for assisting software threads to find the presence and location of strings in an input data stream that match a given pattern. The patterns are defined using regular expressions that are compiled into a data structure comprised of rules subsequently processed by the PMA. The patterns to be searched in the input stream are defined by the user as a set of regular expressions. The patterns to be searched are grouped in pattern context sets. The sets of regular expressions which define the pattern context sets are compiled to generate a rules structure used by the PMA hardware. The rules are compiled before search run time and stored in main memory, in rule cache memory within the PMA or a combination thereof. For each input character, the PMA executes the search and returns the search results.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Giora Biran, Christoph Hagleitner, Timothy H. Heil, Jan Van Lunteren
  • Publication number: 20120203750
    Abstract: A mobile station is arranged to determine its location, which is searched against street addresses from a database, and at least one matching street address is retrieved. The street address is searched on the mobile station and/or over the network. The search engine conducts a search in the mobile station file system and/or the Internet and/or a file system over the network with the at least one query term, —at least one search result is arranged to be displayed to user on the screen of the mobile station. This facilitates on-demand effortless Mobile Internet Search that allows the users to access opportunities that they did not know about, or would not have had time to find out about with minimum effort as the software of the mobile phone is scanning the Internet and information pages for these opportunities and displaying the results dynamically on the mobile phone screen.
    Type: Application
    Filed: February 4, 2011
    Publication date: August 9, 2012
    Inventor: Mikko VÄÄNÄNEN