Patents by Inventor Sumit Basu

Sumit Basu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20100288106
    Abstract: Relating higher-level descriptive musical metadata to lower-level musical elements to enable creation of a song map, song model, backing track, or the like. The musical elements are queried based on input metadata to create a set of musical elements of varying types such as notes, chords, song structures, and the like. The set of musical elements is provided to a user for selection of particular musical elements The selected musical elements represent the song model.
    Type: Application
    Filed: July 27, 2010
    Publication date: November 18, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Adil Ahmed Sherwani, Chad Gibson, Sumit Basu
  • Patent number: 7790974
    Abstract: Relating higher-level descriptive musical metadata to lower-level musical elements to enable creation of a song map, song model, backing track, or the like. The musical elements are queried based on input metadata to create a set of musical elements of varying types such as notes, chords, song structures, and the like. The set of musical elements is provided to a user for selection of particular musical elements The selected musical elements represent the song model.
    Type: Grant
    Filed: May 1, 2006
    Date of Patent: September 7, 2010
    Assignee: Microsoft Corporation
    Inventors: Adil Ahmed Sherwani, Chad Gibson, Sumit Basu
  • Publication number: 20100192755
    Abstract: A graphical user interface for facilitating generation of an accompaniment for a recorded audio melody is described. A Hidden Markov Model, trained with blended chord transition matrices and melody observation matrices, is used for providing the accompaniment for the recorded audio melody. The recorded audio melody includes segments. Frequency analysis of the recorded audio melody is performed. The total duration of each fundamental frequency within a segment of the melody is summed. Based on the summing, a probability for each possible chord for each segment is computed. Based on the computed probabilities, a set of chords are selected for the segments. The chords are displayed on a chord chart of the graphical user interface. The graphical user interface facilitates various manipulations using the chords and/or controls, and generation of a new accompaniment for a recorded audio melody based on the manipulations.
    Type: Application
    Filed: April 12, 2010
    Publication date: August 5, 2010
    Applicant: Microsoft Corporation
    Inventors: Dan Morris, Sumit Basu, Ian Simon
  • Publication number: 20100162151
    Abstract: Techniques to organize information on a computing device using movable objects are described. A computer system may include a display operative to present a graphical user interface with a pointer to select one or more movable objects and position the movable objects at various target positions on the graphical user interface, an input device operative to receive selected movable objects and user movement to position the selected movable objects at a target position on the graphical user interface, and an object position component operative to anchor the selected movable objects at the target position using an anchor element to form a group of anchored objects, and arrange the group of anchored objects in a visual pattern relative to the anchor element. Other embodiments are described and claimed.
    Type: Application
    Filed: December 19, 2008
    Publication date: June 24, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Gregory G. Class, Eliot J. Graff, Connie Missimer, Julie A. Guinn, Sumit Basu
  • Patent number: 7737354
    Abstract: A “Concatenative Synthesizer” applies concatenative synthesis to create a musical output from a database of musical notes and an input musical score (such as a MIDI score or other computer readable musical score format). In various embodiments, the musical output is either a music score, or an analog or digital audio file. This musical output is constructed by evaluating the database of musical notes to identify sets of candidate notes for each note of the input musical score. An “optimal path” through candidate notes is identified by minimizing an overall cost function through the candidate notes relative to the input musical score. The musical output is then constructed by concatenating the selected candidate notes. In further embodiments, the database of musical notes is generated from any desired musical genre, performer, performance, or instrument. Furthermore, notes in the database may be modified to better fit notes of the input musical score.
    Type: Grant
    Filed: June 15, 2006
    Date of Patent: June 15, 2010
    Assignee: Microsoft Corporation
    Inventors: Sumit Basu, Ian Simon, David Salesin, Maneesh Agrawala, Adil Sherwani, Chad Gibson
  • Publication number: 20100110412
    Abstract: Systems and methods for localization and mapping an environment using a measurement device are provided. One system includes a vehicle, first and second measurement devices configured to generate range/azimuth data for reference landmarks in the environment, a processor, and memory. The memory includes a map module storing data representing a map of the environment including known coordinates for the plurality of reference landmarks. One method includes generating a map of the environment based on known coordinates for each of the plurality of reference landmarks, receiving range and azimuth data for each of a plurality of candidate reference landmarks from a measurement device, and identifying each of the plurality of reference landmarks in the plurality of candidate reference landmark. The method further includes associating the plurality of candidate reference landmarks with the plurality of reference landmarks in the map and localizing a mobile platform based on the associating step.
    Type: Application
    Filed: October 31, 2008
    Publication date: May 6, 2010
    Applicant: HONEYWELL INTERNATIONAL INC.
    Inventors: Sumit Basu, Bob Touchton, Dave Galbraith, Kaniyanoor Srinivasan Srikanth, Vijayendra Setty
  • Patent number: 7705231
    Abstract: A method for generating an accompaniment for a recorded audio melody includes providing a recorded audio melody that includes segments; performing a frequency analysis of the recorded audio melody; summing the total duration of each fundamental frequency within a segment of the melody; based on the summing, computing a probability for each possible chord for each segment; based on the computed probabilities, selecting a set of chords for the segments; and outputting the set of chords as an accompaniment for the recorded audio melody. Various other methods, devices, systems, etc. are also disclosed.
    Type: Grant
    Filed: November 27, 2007
    Date of Patent: April 27, 2010
    Assignee: Microsoft Corporation
    Inventors: Dan Morris, Sumit Basu, Ian Simon
  • Publication number: 20100088143
    Abstract: The described implementations relate to calendar event scheduling. One system includes a storage component configured to store scheduling constraints relating to at least one user. The system also includes a declarative calendar component configured to automatically schedule declarative calendar events for the at least one user based upon the scheduling constraints.
    Type: Application
    Filed: October 7, 2008
    Publication date: April 8, 2010
    Applicant: Microsoft Corporation
    Inventors: John C. Platt, Jonathan J. Cadiz, Joseph L. Hellerstein, Brian D. Williams, Sumit Basu
  • Publication number: 20100082516
    Abstract: An illustrative frustration processing system modifies the operation of a target system to improve its performance. In one case, the frustration processing system receives express indications that a user is frustrated in the course of interacting with the target system. The frustration processing system responds to these indications by modifying the operation of the target system to reduce the likelihood that the user will be frustrated in the future. The frustration processing system can modify the operation of the target system by applying a policy to the target system. The policy, in turn, is created using a prediction model. The prediction model predicts when a user is likely to be frustrated based on the user's prior indications of frustration.
    Type: Application
    Filed: September 29, 2008
    Publication date: April 1, 2010
    Applicant: Microsoft Corporation
    Inventors: Sumit Basu, John D. Dunagan, Kevin K. Duh, Kiran-Kumar Muniswamy-Reddy
  • Patent number: 7650317
    Abstract: An active learning framework is provided to extract information from particular fields from a variety of protocols. Extraction is performed in an unknown protocol, in which the user presents the system with a small number of labeled instances. The system then automatically generates an abundance of features and negative examples. A boosting approach is then used for feature selection and classifier combination. The system then displays its results for the user to correct and/or add new examples. The process can be iterated until the user is satisfied with the performance of the extraction capabilities provided by the classifiers generated by the system.
    Type: Grant
    Filed: December 6, 2006
    Date of Patent: January 19, 2010
    Assignee: Microsoft Corporation
    Inventors: Sumit Basu, Karthik Gopalratnam, John David Dunagan, Jiahe Helen Wang
  • Publication number: 20090319449
    Abstract: An overwhelming number of articles are available everyday via the internet. Unfortunately, it is impossible to peruse more than a handful, and it is difficult to ascertain an article's social context. The techniques disclosed herein address this problem by harnessing implicit and explicit contextual information from social media. By extracting text surrounding a hyperlink to an article in a post and assessing the article as a function of content surrounding the hyperlink, an article's social context is determined and presented. Additionally, articles that are sufficiently similar in content may be grouped to establish a many-to-one relationship between posts and an article, creating a more accurate assessment.
    Type: Application
    Filed: June 21, 2008
    Publication date: December 24, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Michael Gamon, Sumit Basu, Dmitriy A. Belenko, Danyel A. Fisher, Arnd C. Konig, Matthew F. Hurst
  • Patent number: 7634405
    Abstract: The subject invention leverages spectral “palettes” or representations of an input sequence to provide recognition and/or synthesizing of a class of data. The class can include, but is not limited to, individual events, distributions of events, and/or environments relating to the input sequence. The representations are compressed versions of the data that utilize a substantially smaller amount of system resources to store and/or manipulate. Segments of the palettes are employed to facilitate in reconstruction of an event occurring in the input sequence. This provides an efficient means to recognize events, even when they occur in complex environments. The palettes themselves are constructed or “trained” utilizing any number of data compression techniques such as, for example, epitomes, vector quantization, and/or Huffman codes and the like.
    Type: Grant
    Filed: January 24, 2005
    Date of Patent: December 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Sumit Basu, Nebojsa Jojic, Ashish Kapoor
  • Patent number: 7596498
    Abstract: Conversations that take place over an electronically recordable channel are analyzed by constructing a set of features from the speech of two participants in the conversation. The set of features is applied to a model or a plurality of models to determine the likelihood of the set of features for each model. These likelihoods are then used to classify the conversation into categories, provide real-time monitoring of the conversation, and/or identify anomalous conversations.
    Type: Grant
    Filed: September 2, 2005
    Date of Patent: September 29, 2009
    Assignee: Microsoft Corporation
    Inventors: Sumit Basu, Mauricio Gonzalez de la Fuente
  • Patent number: 7594177
    Abstract: A “Video Browser” provides an intuitive user interface for indexing, and interactive visual browsing, of particular elements within a video recording. In general, the Video Browser operates by first generating a set of one or more mosaic images from the video recording. In one embodiment, these mosaics are further clustered using an adjustable similarity threshold. User selection of a particular video mosaic then initiates a playback of corresponding video frames. However, in contrast to conventional mosaicking schemes which simply play back the set of frames used to construct the mosaic, the Video Browser provides a playback of only those individual frames within which a particular point selected within the image mosaic was observed. Consequently, user selection of a point in one of the image mosaics serves to provide a targeted playback of only those frames of interest, rather than playing back the entire image sequence used to generate the mosaic.
    Type: Grant
    Filed: December 8, 2004
    Date of Patent: September 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Nebojsa Jojic, Sumit Basu
  • Patent number: 7548662
    Abstract: A system and process for creating an apparently higher resolution image on a display exhibiting a lower resolution is presented. The basic idea is to make multiple decimated versions of an image at different offsets in a smooth path (all of which will contain different bits of detail), and then animate through the resulting decimated images (i.e., show them in rapid succession). The viewer sees what looks like a higher-resolution image moving in a smooth path. The viewer sees this since the human eye is capable of integrating details over the continuous motion. Thus, images such as text enjoy an enhanced legibility.
    Type: Grant
    Filed: January 21, 2005
    Date of Patent: June 16, 2009
    Assignee: Microsoft Corporation
    Inventors: Sumit Basu, Patrick Baudisch
  • Publication number: 20090064851
    Abstract: A method for generating an accompaniment for a recorded audio melody includes providing a recorded audio melody that includes segments; performing a frequency analysis of the recorded audio melody; summing the total duration of each fundamental frequency within a segment of the melody; based on the summing, computing a probability for each possible chord for each segment; based on the computed probabilities, selecting a set of chords for the segments; and outputting the set of chords as an accompaniment for the recorded audio melody. Various other methods, devices, systems, etc. are also disclosed.
    Type: Application
    Filed: November 27, 2007
    Publication date: March 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Dan Morris, Sumit Basu, Ian Simon
  • Patent number: 7502480
    Abstract: A system and method for implementing a flat volume model for software application audio signals is provided. A computer system generates a flat volume characteristics tree based upon current hardware/software values from an audio setting hierarchy. The computer system processes the flat volume characteristics tree and converts the optimized values into new audio hierarchy settings. The computer system can generate user interfaces representative of the flat volume settings for the software application. Adjustments to the flat volume settings result in modification to the flat volume characteristics tree and the audio setting hierarchy.
    Type: Grant
    Filed: June 30, 2004
    Date of Patent: March 10, 2009
    Assignee: Microsoft Corporation
    Inventors: Patrick M. Baudisch, Sumit Basu
  • Publication number: 20090006085
    Abstract: An automated voice message or caller prioritization system that extracts words, prosody, and/or metadata from a voice input. The data extracted is classified with a statistical classifier into groups of interest. These groups could indicate the likelihood that a call is urgent versus nonurgent, from someone the user knows well versus someone that the user only knows casually or not at all, from someone using a mobile phone versus a landline, or a business call versus a personal calls. The system then can determine an action based on results of the groups, including the display of likely category labels on the message. Call handling and display actions can be defined by user preferences.
    Type: Application
    Filed: June 29, 2007
    Publication date: January 1, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Eric J. Horvitz, Ashish Kapoor, Sumit Basu
  • Publication number: 20080300872
    Abstract: Providing for browsing a summary of content formed of keywords that can scale to a user-defined level of detail is disclosed herein. Components of a system can include a summarization component that extracts keywords related to the content and associates the keywords with portions thereof, and a zooming component that displays a number of keywords based on a keyword/keyphrase relevance rank and a zoom factor. Additionally, a speech to text component can translate speech associated with the content into text, wherein the keywords are extracted from the translated text. Consequently, the claimed subject matter can present a variable hierarchy of keywords to form a scalable summary of such recorded content.
    Type: Application
    Filed: May 31, 2007
    Publication date: December 4, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Sumit Basu, Surabhi Gupta, John C. Platt, Patrick Nguyen, Milind V. Mahajan
  • Patent number: 7458029
    Abstract: A system and process for controlling common information displays, referred to as shared displays, is presented. The system and process allows multiple modes of input using a set of modules that accept and display data from a variety of sources. Input modules are able to understand data from a single mode of communication and to be able to generate messages as output accordingly. An optional translation module takes discrete message units and converts them into commands or requests that can be processed by a logic module. The logic module includes any application that is running on the shared display. A layout module lays out the information output by the logic module and a display module takes the layout data and converts the information to a form that can be readily displayed on a display device.
    Type: Grant
    Filed: January 15, 2004
    Date of Patent: November 25, 2008
    Assignee: Microsoft Corporation
    Inventors: Maneesh Agrawala, Sumit Basu, Steven M. Drucker, Ronald Keith Logan, Trausti Thor Kristjansson, Tim Paek, Kentaro Toyama, Andrew David Wilson