Patents by Inventor Jasmine Novak

Jasmine Novak has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for detecting and explaining bursty stream events in targeted groups

Patent number: 9053186

Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.

Type: Grant

Filed: December 6, 2013

Date of Patent: June 9, 2015

Assignee: Yahoo! Inc.

Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
Methods and systems for providing a customizable guide for navigating a corpus of content

Patent number: 8914729

Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.

Type: Grant

Filed: October 30, 2006

Date of Patent: December 16, 2014

Assignee: Yahoo! Inc.

Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
METHOD AND APPARATUS FOR DETECTING AND EXPLAINING BURSTY STREAM EVENTS IN TARGETED GROUPS

Publication number: 20140164344

Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.

Type: Application

Filed: December 6, 2013

Publication date: June 12, 2014

Applicant: Yahoo! Inc.

Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
Method and apparatus for detecting and explaining bursty stream events in targeted groups

Patent number: 8612409

Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.

Type: Grant

Filed: December 18, 2007

Date of Patent: December 17, 2013

Assignee: Yahoo! Inc.

Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
Extracting informative phrases from unstructured text

Patent number: 8209335

Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.

Type: Grant

Filed: September 20, 2005

Date of Patent: June 26, 2012

Assignee: International Business Machines Corporation

Inventor: Jasmine Novak
Detecting relationships in unstructured text

Patent number: 8001144

Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.

Type: Grant

Filed: March 26, 2008

Date of Patent: August 16, 2011

Assignee: International Business Machines Corporation

Inventor: Jasmine Novak
ENRICHED DOCUMENT REPRESENTATIONS USING AGGREGATED ANCHOR TEXT

Publication number: 20100318533

Abstract: A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which point to the target page and are within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. Anchor text of the external anchors are collected, weighted, stored, and used to enrich document presentations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.

Type: Application

Filed: June 10, 2009

Publication date: December 16, 2010

Applicant: YAHOO! INC.

Inventors: Jasmine Novak, Donald Metzler, Hang Cui, Srihari Reddy, Emre Velipasaoglu
System and method for selecting object metadata evolving over time

Patent number: 7739275

Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

Type: Grant

Filed: May 19, 2006

Date of Patent: June 15, 2010

Assignee: Yahoo! Inc.

Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
Method and computer program product for predicting sales from online public discussions

Patent number: 7725346

Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.

Type: Grant

Filed: July 27, 2005

Date of Patent: May 25, 2010

Assignee: International Business Machines Corporation

Inventors: Daniel Frederick Gruhl, Ramanathan Vaidhyanath Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew S. Tomkins
System and method for visualizing the temporal evolution of object metadata

Patent number: 7581184

Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

Type: Grant

Filed: May 19, 2006

Date of Patent: August 25, 2009

Assignee: Yahoo! Inc.

Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
Method and Apparatus for Detecting and Explaining Bursty Stream Events in Targeted Groups

Publication number: 20090157651

Abstract: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.

Type: Application

Filed: December 18, 2007

Publication date: June 18, 2009

Inventors: Andrew Tomkins, Shanmugasundaram Ravikumar, Cameron Marlow, Jasmine Novak
DETECTING RELATIONSHIPS IN UNSTRUCTURED TEXT

Publication number: 20080177740

Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.

Type: Application

Filed: March 26, 2008

Publication date: July 24, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Jasmine Novak
Methods and systems for providing a customizable guide for navigating a corpus of content

Publication number: 20080104521

Abstract: Methods and systems are described for navigating a corpus of content items stored in one or more information repositories within a distributed communications system. The content items may include video feeds, audio feeds, television broadcasts, website, a web log or the like. Using any browser application, the user views content items presented in an interactive scroll display. Modules are included for filtering, sorting and sampling content to ensure delivery of relevant high-quality content to the user. A feedback mechanism is also included for refining relevance filtering, sorting and sampling processes.

Type: Application

Filed: October 30, 2006

Publication date: May 1, 2008

Applicant: Yahoo! Inc.

Inventors: Micah Dubinko, Joseph Magnani, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins
System and method for visualizing the temporal evolution of object metadata

Publication number: 20070283290

Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

Type: Application

Filed: May 19, 2006

Publication date: December 6, 2007

Applicant: Yahoo! Inc.

Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
System and method for selecting object metadata evolving over time

Publication number: 20070271270

Abstract: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

Type: Application

Filed: May 19, 2006

Publication date: November 22, 2007

Applicant: Yahoo! Inc.

Inventors: Micah Joel Dubinko, Shanmugasundaram Ravikumar, Joseph Andrew Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins
System, method and service for ranking search results using a modular scoring system

Patent number: 7257577

Abstract: A modular scoring system using rank aggregation merges search results into an ordered list of results using many different features of documents. The ranking functions of the present system can easily be customized to the needs of a particular corpus or collection of users such as an intranet. Rank aggregation is independent of the underlying score distributions between the different factors, and can be applied to merge any set of ranking functions. Rank aggregation holds the advantage of combining the influence of many different heuristic factors in a robust way to produce high-quality results for queries. The modular scoring system combines factors such as indegree, page ranking, URL length, proximity to the root server of an intranet, etc, to form a single ordering on web pages that closely obeys the individual orderings, but also mediates between the collective wisdom of individual heuristics.

Type: Grant

Filed: May 7, 2004

Date of Patent: August 14, 2007

Assignee: International Business Machines Corporation

Inventors: Ronald Fagin, Kevin Snow McCurley, Jasmine Novak, Shanmugasundram Ravikumar, Dandapani Sivakumar, John Anthony Tomlin, David Paul Williamson
System and method for automatically extracting interesting phrases in a large dynamic corpus

Publication number: 20070067157

Abstract: A phrase extraction system combines a dictionary method, a statistical/heuristic approach, and a set of pruning steps to extract frequently occurring and interesting phrases from a corpus. The system finds the “top k” phrases in a corpus, where k is an adjustable parameter. For a time-varying corpus, the system uses historical statistics to extract new and increasingly frequent phrases. The system finds interesting phrases that occur near a set of user-designated phrases. The system uses these designated phrases as anchor phrases to identify phrases that occur near the anchor phrases. The system finds frequently occurring and interesting phrases in a time-varying corpus is changing in time, as in finding frequent phrases in an on-going, long term document feed or continuous, regular web crawl.

Type: Application

Filed: September 22, 2005

Publication date: March 22, 2007

Inventors: Vinay Kaku, Keiko Kurita, Carlton Niblack, Jasmine Novak, Zengyan Zhang
Extracting informative phrases from unstructured text

Publication number: 20070067289

Abstract: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole.

Type: Application

Filed: September 20, 2005

Publication date: March 22, 2007

Applicant: International Business Machines Corporation

Inventor: Jasmine Novak
Detecting relationships in unstructured text

Publication number: 20070067320

Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.

Type: Application

Filed: September 20, 2005

Publication date: March 22, 2007

Applicant: International Business Machines Corporation

Inventor: Jasmine Novak
System, service, and method for predicting sales from online public discussions

Publication number: 20070027741

Abstract: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.

Type: Application

Filed: July 27, 2005

Publication date: February 1, 2007

Inventors: Daniel Gruhl, Ramanathan Guha, Jasmine Novak, Shanmugasundaram Ravikumar, Andrew Tomkins

1 2 next