Patents by Inventor Jasmine Novak
Jasmine Novak has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9053186Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.Type: GrantFiled: December 6, 2013Date of Patent: June 9, 2015Assignee: Yahoo! Inc.Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
-
Patent number: 8914729Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.Type: GrantFiled: October 30, 2006Date of Patent: December 16, 2014Assignee: Yahoo! Inc.Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
-
Publication number: 20140164344Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.Type: ApplicationFiled: December 6, 2013Publication date: June 12, 2014Applicant: Yahoo! Inc.Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
-
Patent number: 8612409Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.Type: GrantFiled: December 18, 2007Date of Patent: December 17, 2013Assignee: Yahoo! Inc.Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
-
Patent number: 8209335Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.Type: GrantFiled: September 20, 2005Date of Patent: June 26, 2012Assignee: International Business Machines CorporationInventor: Jasmine Novak
-
Patent number: 8001144Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.Type: GrantFiled: March 26, 2008Date of Patent: August 16, 2011Assignee: International Business Machines CorporationInventor: Jasmine Novak
-
Publication number: 20100318533Abstract: A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which point to the target page and are within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. Anchor text of the external anchors are collected, weighted, stored, and used to enrich document presentations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.Type: ApplicationFiled: June 10, 2009Publication date: December 16, 2010Applicant: YAHOO! INC.Inventors: Jasmine Novak, Donald Metzler, Hang Cui, Srihari Reddy, Emre Velipasaoglu
-
Patent number: 7739275Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.Type: GrantFiled: May 19, 2006Date of Patent: June 15, 2010Assignee: Yahoo! Inc.Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
-
Patent number: 7725346Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.Type: GrantFiled: July 27, 2005Date of Patent: May 25, 2010Assignee: International Business Machines CorporationInventors: Daniel Frederick Gruhl, Ramanathan Vaidhyanath Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew S. Tomkins
-
Patent number: 7581184Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.Type: GrantFiled: May 19, 2006Date of Patent: August 25, 2009Assignee: Yahoo! Inc.Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
-
Publication number: 20090157651Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.Type: ApplicationFiled: December 18, 2007Publication date: June 18, 2009Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
-
Publication number: 20080177740Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.Type: ApplicationFiled: March 26, 2008Publication date: July 24, 2008Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventor: Jasmine Novak
-
Publication number: 20080104521Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.Type: ApplicationFiled: October 30, 2006Publication date: May 1, 2008Applicant: Yahoo! Inc.Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
-
Publication number: 20070283290Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.Type: ApplicationFiled: May 19, 2006Publication date: December 6, 2007Applicant: Yahoo! Inc.Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
-
Publication number: 20070271270Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.Type: ApplicationFiled: May 19, 2006Publication date: November 22, 2007Applicant: Yahoo! Inc.Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
-
Patent number: 7257577Abstract: A modular scoring system using rank aggregation merges search results into an ordered list of results using many different features of documents. The ranking functions of the present system can easily be customized to the needs of a particular corpus or collection of users such as an intranet. Rank aggregation is independent of the underlying score distributions between the different factors, and can be applied to merge any set of ranking functions. Rank aggregation holds the advantage of combining the influence of many different heuristic factors in a robust way to produce high-quality results for queries. The modular scoring system combines factors such as indegree, page ranking, URL length, proximity to the root server of an intranet, etc, to form a single ordering on web pages that closely obeys the individual orderings, but also mediates between the collective wisdom of individual heuristics.Type: GrantFiled: May 7, 2004Date of Patent: August 14, 2007Assignee: International Business Machines CorporationInventors: Ronald Fagin, Kevin Snow McCurley, Jasmine Novak, Shanmugasundram Ravikumar, Dandapani Sivakumar, John Anthony Tomlin, David Paul Williamson
-
Publication number: 20070067157Abstract: A phrase extraction system combines a dictionary method, a statistical/heuristic approach, and a set of pruning steps to extract frequently occurring and interesting phrases from a corpus. The system finds the “top k” phrases in a corpus, where k is an adjustable parameter. For a time-varying corpus, the system uses historical statistics to extract new and increasingly frequent phrases. The system finds interesting phrases that occur near a set of user-designated phrases. The system uses these designated phrases as anchor phrases to identify phrases that occur near the anchor phrases. The system finds frequently occurring and interesting phrases in a time-varying corpus is changing in time, as in finding frequent phrases in an on-going, long term document feed or continuous, regular web crawl.Type: ApplicationFiled: September 22, 2005Publication date: March 22, 2007Inventors: Vinay Kaku, Keiko Kurita, Carlton Niblack, Jasmine Novak, Zengyan Zhang
-
Publication number: 20070067289Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.Type: ApplicationFiled: September 20, 2005Publication date: March 22, 2007Applicant: International Business Machines CorporationInventor: Jasmine Novak
-
Publication number: 20070067320Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.Type: ApplicationFiled: September 20, 2005Publication date: March 22, 2007Applicant: International Business Machines CorporationInventor: Jasmine Novak
-
Publication number: 20070027741Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.Type: ApplicationFiled: July 27, 2005Publication date: February 1, 2007Inventors: Daniel Gruhl, Ramanathan Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew Tomkins