Patents by Inventor Jasmine Novak

Jasmine Novak has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9053186
    Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.
    Type: Grant
    Filed: December 6, 2013
    Date of Patent: June 9, 2015
    Assignee: Yahoo! Inc.
    Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
  • Patent number: 8914729
    Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.
    Type: Grant
    Filed: October 30, 2006
    Date of Patent: December 16, 2014
    Assignee: Yahoo! Inc.
    Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
  • Publication number: 20140164344
    Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.
    Type: Application
    Filed: December 6, 2013
    Publication date: June 12, 2014
    Applicant: Yahoo! Inc.
    Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
  • Patent number: 8612409
    Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.
    Type: Grant
    Filed: December 18, 2007
    Date of Patent: December 17, 2013
    Assignee: Yahoo! Inc.
    Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
  • Patent number: 8209335
    Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.
    Type: Grant
    Filed: September 20, 2005
    Date of Patent: June 26, 2012
    Assignee: International Business Machines Corporation
    Inventor: Jasmine Novak
  • Patent number: 8001144
    Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.
    Type: Grant
    Filed: March 26, 2008
    Date of Patent: August 16, 2011
    Assignee: International Business Machines Corporation
    Inventor: Jasmine Novak
  • Publication number: 20100318533
    Abstract: A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which point to the target page and are within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. Anchor text of the external anchors are collected, weighted, stored, and used to enrich document presentations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.
    Type: Application
    Filed: June 10, 2009
    Publication date: December 16, 2010
    Applicant: YAHOO! INC.
    Inventors: Jasmine Novak, Donald Metzler, Hang Cui, Srihari Reddy, Emre Velipasaoglu
  • Patent number: 7739275
    Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.
    Type: Grant
    Filed: May 19, 2006
    Date of Patent: June 15, 2010
    Assignee: Yahoo! Inc.
    Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
  • Patent number: 7725346
    Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.
    Type: Grant
    Filed: July 27, 2005
    Date of Patent: May 25, 2010
    Assignee: International Business Machines Corporation
    Inventors: Daniel Frederick Gruhl, Ramanathan Vaidhyanath Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew S. Tomkins
  • Patent number: 7581184
    Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.
    Type: Grant
    Filed: May 19, 2006
    Date of Patent: August 25, 2009
    Assignee: Yahoo! Inc.
    Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
  • Publication number: 20090157651
    Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.
    Type: Application
    Filed: December 18, 2007
    Publication date: June 18, 2009
    Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
  • Publication number: 20080177740
    Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.
    Type: Application
    Filed: March 26, 2008
    Publication date: July 24, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Jasmine Novak
  • Publication number: 20080104521
    Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.
    Type: Application
    Filed: October 30, 2006
    Publication date: May 1, 2008
    Applicant: Yahoo! Inc.
    Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
  • Publication number: 20070283290
    Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.
    Type: Application
    Filed: May 19, 2006
    Publication date: December 6, 2007
    Applicant: Yahoo! Inc.
    Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
  • Publication number: 20070271270
    Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.
    Type: Application
    Filed: May 19, 2006
    Publication date: November 22, 2007
    Applicant: Yahoo! Inc.
    Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
  • Patent number: 7257577
    Abstract: A modular scoring system using rank aggregation merges search results into an ordered list of results using many different features of documents. The ranking functions of the present system can easily be customized to the needs of a particular corpus or collection of users such as an intranet. Rank aggregation is independent of the underlying score distributions between the different factors, and can be applied to merge any set of ranking functions. Rank aggregation holds the advantage of combining the influence of many different heuristic factors in a robust way to produce high-quality results for queries. The modular scoring system combines factors such as indegree, page ranking, URL length, proximity to the root server of an intranet, etc, to form a single ordering on web pages that closely obeys the individual orderings, but also mediates between the collective wisdom of individual heuristics.
    Type: Grant
    Filed: May 7, 2004
    Date of Patent: August 14, 2007
    Assignee: International Business Machines Corporation
    Inventors: Ronald Fagin, Kevin Snow McCurley, Jasmine Novak, Shanmugasundram Ravikumar, Dandapani Sivakumar, John Anthony Tomlin, David Paul Williamson
  • Publication number: 20070067320
    Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.
    Type: Application
    Filed: September 20, 2005
    Publication date: March 22, 2007
    Applicant: International Business Machines Corporation
    Inventor: Jasmine Novak
  • Publication number: 20070067289
    Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.
    Type: Application
    Filed: September 20, 2005
    Publication date: March 22, 2007
    Applicant: International Business Machines Corporation
    Inventor: Jasmine Novak
  • Publication number: 20070067157
    Abstract: A phrase extraction system combines a dictionary method, a statistical/heuristic approach, and a set of pruning steps to extract frequently occurring and interesting phrases from a corpus. The system finds the “top k” phrases in a corpus, where k is an adjustable parameter. For a time-varying corpus, the system uses historical statistics to extract new and increasingly frequent phrases. The system finds interesting phrases that occur near a set of user-designated phrases. The system uses these designated phrases as anchor phrases to identify phrases that occur near the anchor phrases. The system finds frequently occurring and interesting phrases in a time-varying corpus is changing in time, as in finding frequent phrases in an on-going, long term document feed or continuous, regular web crawl.
    Type: Application
    Filed: September 22, 2005
    Publication date: March 22, 2007
    Inventors: Vinay Kaku, Keiko Kurita, Carlton Niblack, Jasmine Novak, Zengyan Zhang
  • Publication number: 20070027741
    Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.
    Type: Application
    Filed: July 27, 2005
    Publication date: February 1, 2007
    Inventors: Daniel Gruhl, Ramanathan Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew Tomkins