Patents by Inventor Dusan Macho

Dusan Macho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9646610
    Abstract: An apparatus, method, and computer program for initiating a word spotting algorithm (220) on one or more wireless communication devices in a first power mode to detect a keyword data sequence (224) embedded within a sampled audio signal (222). In response to detecting the keyword data sequence (226), the word spotting algorithm is terminated and a plurality of identification algorithms (230) consisting of speech, voice, image recognition and a predetermined isolation time criterion are initiated on the one or more wireless communication devices operating in a second power mode to detect the presence of identification data (240). If identification data is detected on a particular wireless communication device it is activated to accept speech and/or voice commands (242).
    Type: Grant
    Filed: October 30, 2012
    Date of Patent: May 9, 2017
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventor: Dusan Macho
  • Patent number: 9158959
    Abstract: A user is identified and an in-place personalized interactive display provided by detecting, via a first imaging system, one or more unique characteristics of a user's palm, identifying the user via the one or more unique characteristics and a database containing mappings between detectable unique characteristics and user identities, retrieving user-specific interactive content as a function of the identity of the user, projecting, via a second imaging system, the user-specific interactive content onto the user's palm, and detecting, via a third imaging system, a user's interaction with the projected user-specific interactive content. The user may be identified by transmitting the one or more unique characteristics to a remote authentication server and receiving, in response, an identity of the user. User-specific content as a function of the identity of the user may be retrieved from a remote interactive content server.
    Type: Grant
    Filed: July 17, 2013
    Date of Patent: October 13, 2015
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventor: Dusan Macho
  • Patent number: 8959022
    Abstract: A method for determining a relatedness between a query video and a database video is provided. A processor extracts an audio stream from the query video to produce a query audio stream, extracts an audio stream from the database video to produce a database audio stream, produces a first-sized snippet from the query audio stream, and produces a first-sized snippet from the database audio stream. An estimation is made of a first most probable sequence of latent evidence probability vectors generating the first-sized audio snippet of the query audio stream. An estimation is made of a second most probable sequence of latent evidence probability vectors generating the first-sized audio snippet of the database audio stream. A similarity is measured between the first sequence and the second sequence producing a score of relatedness between the two snippets. Finally a relatedness is determined between the query video and a database video.
    Type: Grant
    Filed: November 19, 2012
    Date of Patent: February 17, 2015
    Assignee: Motorola Solutions, Inc.
    Inventors: Yang M. Cheng, Dusan Macho
  • Publication number: 20150023567
    Abstract: A user is identified and an in-place personalized interactive display provided by detecting, via a first imaging system, one or more unique characteristics of a user's palm, identifying the user via the one or more unique characteristics and a database containing mappings between detectable unique characteristics and user identities, retrieving user-specific interactive content as a function of the identity of the user, projecting, via a second imaging system, the user-specific interactive content onto the user's palm, and detecting, via a third imaging system, a user's interaction with the projected user-specific interactive content. The user may be identified by transmitting the one or more unique characteristics to a remote authentication server and receiving, in response, an identity of the user. User-specific content as a function of the identity of the user may be retrieved from a remote interactive content server.
    Type: Application
    Filed: July 17, 2013
    Publication date: January 22, 2015
    Inventor: DUSAN MACHO
  • Publication number: 20140181083
    Abstract: A method and user terminal are provided that graphically formulate a search query. The method and user terminal display, via a display screen, a multi-dimensional graphical representation of a search query space, receive a plurality of parameters from a user, wherein the parameters define the search query space, position a multi-dimensional icon in the multi-dimensional representation of the search query space, associate one or more of a keyword and multimedia content with the icon, and generate a search query based on the keyword and the position of the icon in the multi-dimensional representation of the search query space. The method and user terminal further may graphically display the results of the corresponding database search, wherein the retrieved content is displayed as one or more icons positioned in a multi-dimensional graph having a plurality of axes associated with the plurality of parameters defining a context of the search query.
    Type: Application
    Filed: December 21, 2012
    Publication date: June 26, 2014
    Applicant: MOTOROLA SOLUTIONS, INC.
    Inventors: DUSAN MACHO, KENNETH W. DOUROS, SAMEER B. TOTEY
  • Publication number: 20140122087
    Abstract: An apparatus, method, and computer program for initiating a word spotting algorithm (220) on one or more wireless communication devices in a first power mode to detect a keyword data sequence (224) embedded within a sampled audio signal (222). In response to detecting the keyword data sequence (226), the word spotting algorithm is terminated and a plurality of identification algorithms (230) are initiated on the one or more wireless communication devices operating in a second power mode to detect the presence of identification data (240). If identification data is detected on a particular wireless communication device it is activated to accept speech and/or voice commands (242). On the other hand, if identification data is not detected, the plurality of identification algorithms are terminated, and the word spotting algorithm is reinitiated on the one or more wireless communication devices that are then operating in the first power mode (244).
    Type: Application
    Filed: October 30, 2012
    Publication date: May 1, 2014
    Applicant: MOTOROLA SOLUTIONS, INC.
    Inventor: DUSAN MACHO
  • Publication number: 20140009682
    Abstract: A method for determining a relatedness between a query video and a database video is provided. A processor extracts an audio stream from the query video to produce a query audio stream, extracts an audio stream from the database video to produce a database audio stream, produces a first-sized snippet from the query audio stream, and produces a first-sized snippet from the database audio stream. An estimation is made of a first most probable sequence of latent evidence probability vectors generating the first-sized audio snippet of the query audio stream. An estimation is made of a second most probable sequence of latent evidence probability vectors generating the first-sized audio snippet of the database audio stream. A similarity is measured between the first sequence and the second sequence producing a score of relatedness between the two snippets. Finally a relatedness is determined between the query video and a database video.
    Type: Application
    Filed: November 19, 2012
    Publication date: January 9, 2014
    Applicant: MOTOROLA SOLUTIONS, INC.
    Inventors: YANG M. CHENG, DUSAN MACHO
  • Patent number: 8442823
    Abstract: A method of performing a search of a database of speakers, includes: receiving a query speech sample spoken by a query speaker; deriving a query utterance from the query speech sample; extracting query utterance statistics from the query utterance; performing Kernelized Locality-Sensitive Hashing (KLSH) using a kernel function, the KLSH using as input the query utterance statistics and utterance statistics extracted from a plurality of utterances included in a database of speakers in order to select a subset of the plurality of utterances; and comparing, using an utterance comparison equation, the query utterance statistics to the utterance statistics for each utterance in the subset to generate a list of speakers from the database of utterances having a highest similarity to the query speaker.
    Type: Grant
    Filed: October 19, 2010
    Date of Patent: May 14, 2013
    Assignee: Motorola Solutions, Inc.
    Inventors: Woojay Jeon, Yan-Ming Cheng, Changxue Ma, Dusan Macho
  • Publication number: 20120095764
    Abstract: A method of performing a search of a database of speakers, includes: receiving a query speech sample spoken by a query speaker; deriving a query utterance from the query speech sample; extracting query utterance statistics from the query utterance; performing Kernelized Locality-Sensitive Hashing (KLSH) using a kernel function, the KLSH using as input the query utterance statistics and utterance statistics extracted from a plurality of utterances included in a database of speakers in order to select a subset of the plurality of utterances; and comparing, using an utterance comparison equation, the query utterance statistics to the utterance statistics for each utterance in the subset to generate a list of speakers from the database of utterances having a highest similarity to the query speaker.
    Type: Application
    Filed: October 19, 2010
    Publication date: April 19, 2012
    Applicant: MOTOROLA, INC.
    Inventors: WOOJAY JEON, YAN-MING CHENG, CHANGXUE MA, DUSAN MACHO
  • Publication number: 20080147389
    Abstract: A method and apparatus for robust speech activity detection is disclosed. The method may include calculating autocorrelations by filtering input signals using order statistic filtering, averaging the autocorrelations over a time period, obtaining a voiced speech feature from the averaged autocorrelations, classifying the input signal as one of speech and non-speech based on the obtained voiced speech feature, and outputting only the classified speech signals or the input signals along with the speech/non-speech classification information, to an automated speech recognizer.
    Type: Application
    Filed: December 15, 2006
    Publication date: June 19, 2008
    Applicant: Motorola, Inc.
    Inventor: Dusan Macho
  • Patent number: 6678656
    Abstract: A voice sample characterization front-end suitable for use in a distributed speech recognition context. A digitized voice sample 31 is split between a low frequency path 32 and a high frequency path 33. Both paths are used to determine spectral content suitable for use when determining speech recognition parameters (such as cepstral coefficients) that characterize the speech sample for recognition purposes. The low frequency path 32 has a thorough noise reduction capability. In one embodiment, the results of this noise reduction are used by the high frequency path 33 to aid in de-noising without requiring the same level of resource capacity as used by the low frequency path 32.
    Type: Grant
    Filed: January 30, 2002
    Date of Patent: January 13, 2004
    Assignee: Motorola, Inc.
    Inventors: Dusan Macho, Yan Ming Cheng
  • Publication number: 20030144834
    Abstract: A voice sample characterization front-end suitable for use in a distributed speech recognition context. A digitized voice sample 31 is split between a low frequency path 32 and a high frequency path 33. Both paths are used to determine spectral content suitable for use when determining speech recognition parameters (such as cepstral coefficients) that characterize the speech sample for recognition purposes. The low frequency path 32 has a thorough noise reduction capability. In one embodiment, the results of this noise reduction are used by the high frequency path 33 to aid in de-noising without requiring the same level of resource capacity as used by the low frequency path 32.
    Type: Application
    Filed: January 30, 2002
    Publication date: July 31, 2003
    Applicant: Motorola, Inc.
    Inventors: Dusan Macho, Yan Ming Cheng
  • Patent number: 6480821
    Abstract: A system for enhancing the signal-to-noise ratio of a speech signal is disclosed. A plurality of local energy maximums associated with a speech signal are determined. Presumably, each of these local energy maximums defines a speech pitch period. Typically, human pitch periods are approximately 100-400 Hz depending on the sex and age of the speaker. Because human speech typically includes more energy near the beginning of a pitch period than at the end of the pitch period, and background noise tends to remain relatively constant throughout the pitch period, the speech signal may be enhanced by increasing the energy associated with the beginning of the pitch period and/or by decreasing the energy associated with the end of the pitch period. Preferably, the amount of energy increase in the earlier portion of the pitch period is approximately equal to the amount of energy reduction in the later portion of the pitch period. In this manner, the total energy remains constant.
    Type: Grant
    Filed: January 31, 2001
    Date of Patent: November 12, 2002
    Assignee: Motorola, Inc.
    Inventors: Dusan Macho, Yan Ming Cheng
  • Publication number: 20020103640
    Abstract: A system for enhancing the signal-to-noise ratio of a speech signal is disclosed. A plurality of local energy maximums associated with a speech signal are determined. Presumably, each of these local energy maximums defines a speech pitch period. Typically, human pitch periods are approximately 100-400 Hz depending on the sex and age of the speaker. Because human speech typically includes more energy near the beginning of a pitch period than at the end of the pitch period, and background noise tends to remain relatively constant throughout the pitch period, the speech signal may be enhanced by increasing the energy associated with the beginning of the pitch period and/or by decreasing the energy associated with the end of the pitch period. Preferably, the amount of energy increase in the earlier portion of the pitch period is approximately equal to the amount of energy reduction in the later portion of the pitch period. In this manner, the total energy remains constant.
    Type: Application
    Filed: January 31, 2001
    Publication date: August 1, 2002
    Inventors: Dusan Macho, Yan Ming Cheng
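
Illustrative note: the control flow described in the abstract of publication 20140122087 (word spotting in a first power mode, identification in a second power mode, then activation or fallback) can be read as a small state machine. The Python sketch below is an illustration under that reading only, not the patented implementation. The names Mode and step, and the two boolean detector inputs, are assumptions introduced here and do not appear in the filing.

```python
from enum import Enum, auto


class Mode(Enum):
    WORD_SPOTTING = auto()  # first power mode: only the word spotter runs
    IDENTIFYING = auto()    # second power mode: identification algorithms run
    ACTIVE = auto()         # device accepts speech and/or voice commands


def step(mode, keyword_detected=False, identification_detected=False):
    """Return the next mode for one device, given the current detector outputs.

    Both flags are hypothetical stand-ins for the detectors named in the
    abstract; how they are computed is outside this sketch.
    """
    if mode is Mode.WORD_SPOTTING:
        # Keyword found: word spotting ends and identification begins.
        return Mode.IDENTIFYING if keyword_detected else Mode.WORD_SPOTTING
    if mode is Mode.IDENTIFYING:
        # Identification data found: activate. Otherwise identification
        # ends and word spotting resumes in the first power mode.
        return Mode.ACTIVE if identification_detected else Mode.WORD_SPOTTING
    # ACTIVE: the abstract does not say what ends this state, so it is kept.
    return mode
```

For example, a device that hears the keyword but then finds no identification data goes from WORD_SPOTTING to IDENTIFYING and back to WORD_SPOTTING.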