Patents Assigned to Nexidia, Inc.
  • Publication number: 20130035936
    Abstract: A transcription system is applicable to transcription for a language in which there is limited pronunciation and/or acoustic data. A transcription station is configured using pronunciation data and acoustic data for use with the language. The pronunciation data and/or the acoustic data is initially from another dialect of a language, another language from a language group, or is universal (e.g., not specific to any particular language). A partial transcription of the audio recording is accepted via the transcription station (e.g., from a transcriptionist). One or more repetitions of one or more portions of the partial transcription are identified in the audio recording, and can be accepted during transcription. The pronunciation data and/or the acoustic data is updated in a bootstrapping manner during transcription, thereby improving the efficiency of the transcription process.
    Type: Application
    Filed: August 1, 2012
    Publication date: February 7, 2013
    Applicant: Nexidia Inc.
    Inventors: Jacob B. Garland, Marsal Gavalda
  • Patent number: 8311828
    Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are identified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.
    Type: Grant
    Filed: August 27, 2008
    Date of Patent: November 13, 2012
    Assignee: Nexidia Inc.
    Inventors: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
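    Illustrative sketch (not taken from the patent): the inverted-index idea in the abstract above can be pictured roughly as follows. The function names, the (unit, time) detection format, and the max_gap parameter are all assumptions made for this example.

```python
from collections import defaultdict

def build_index(detections):
    """Map each subword unit to a sorted list of times (seconds).

    `detections` is a hypothetical list of (unit, time) pairs, standing in
    for whatever a wordspotter would emit over the audio corpus.
    """
    index = defaultdict(list)
    for unit, t in detections:
        index[unit].append(t)
    for times in index.values():
        times.sort()
    return index

def locate_query(index, units, max_gap=0.5):
    """Return start times where the query's units occur in order,
    each within `max_gap` seconds after the previous one (assumed rule)."""
    hits = []
    for start in index.get(units[0], []):
        t = start
        for unit in units[1:]:
            later = [u for u in index.get(unit, []) if t < u <= t + max_gap]
            if not later:
                break  # chain broken: this start time is not a match
            t = later[0]
        else:
            hits.append(start)
    return hits

detections = [("k ae", 1.0), ("ae t", 1.2), ("k ae", 5.0), ("d ao", 5.3)]
index = build_index(detections)
hits = locate_query(index, ["k ae", "ae t"])
```

    Here the query is looked up in the pre-built index only; the audio itself is not rescanned.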
  • Publication number: 20120284026
    Abstract: In an aspect, in general, a method for computer assisted speaker authentication in a voice communication session includes establishing a voice communication session between a first speaker and an agent, accepting a first voice signal from the first speaker, determining a voice characteristic measure of the first voice signal, including characterizing a similarity of the first voice signal to each of one or more stored characterizations of voice signals previously acquired from one or more known speakers, and providing an interface to the agent during the voice communication session between the agent and the first speaker, including presenting an indicator based on the determined voice characteristic measure to the agent.
    Type: Application
    Filed: May 6, 2011
    Publication date: November 8, 2012
    Applicant: Nexidia Inc.
    Inventors: Peter S. Cardillo, Marsal Gavalda
  • Publication number: 20120278071
    Abstract: A transcription system automates the control of the playback of the audio to accommodate the user's ability to transcribe the words spoken. In some examples, a delay between playback and typed input is estimated by processing the typed words using a wordspotting approach. The estimated delay is used as an input to an automated speed control, for example, to maintain a target or maximum delay between playback and typed input.
    Type: Application
    Filed: April 29, 2011
    Publication date: November 1, 2012
    Applicant: Nexidia Inc.
    Inventors: Jacob B. Garland, Marsal Gavalda
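    Illustrative sketch (not taken from the application): one simple way to turn an estimated delay into a playback rate is a clamped proportional rule. The function name, the gain, the target delay, and the rate limits below are assumptions chosen for the example; the abstract does not specify a control law.

```python
def playback_rate(delay_sec, target_delay=2.0, gain=0.25,
                  min_rate=0.5, max_rate=1.5):
    """Slow playback when the typist lags beyond the target delay,
    speed it up when the typist is keeping close to the audio."""
    error = delay_sec - target_delay      # positive: typist is falling behind
    rate = 1.0 - gain * error             # proportional correction around 1.0x
    return max(min_rate, min(max_rate, rate))

on_target = playback_rate(2.0)
lagging = playback_rate(4.0)
far_behind = playback_rate(10.0)
```

    The clamp keeps the audio intelligible at both extremes, which is a design choice of this sketch.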
  • Patent number: 8170873
    Abstract: An approach to comparing events in word spotting, such as comparing putative and reference instances of a keyword, makes use of a set of models of subword units. For each of two acoustic events and for each of a series of times in each of the events, a probability associated with each of the models of the set of subword units is computed. Then, a quantity characterizing a comparison of the two acoustic events, one occurring in each of the two acoustic signals, is computed using the computed probabilities associated with each of the models.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: May 1, 2012
    Assignee: Nexidia Inc.
    Inventor: Robert W. Morris
  • Publication number: 20120059656
    Abstract: A method for determining a similarity between a first audio source and a second audio source includes: for the first audio source, determining a first frequency of occurrence for each of a plurality of phoneme sequences and determining a first weighted frequency for each of the plurality of phoneme sequences based on the first frequency of occurrence for the phoneme sequence; for the second audio source, determining a second frequency of occurrence for each of a plurality of phoneme sequences and determining a second weighted frequency for each of the plurality of phoneme sequences based on the second frequency of occurrence for the phoneme sequence; comparing the first weighted frequency for each phoneme sequence with the second weighted frequency for the corresponding phoneme sequence; and generating a similarity score representative of a similarity between the first audio source and the second audio source based on the results of the comparing.
    Type: Application
    Filed: August 30, 2011
    Publication date: March 8, 2012
    Applicant: Nexidia Inc.
    Inventors: Jacob B. Garland, Jon A. Arrowood, Drew Lanham, Marsal Gavalda
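    Illustrative sketch (not taken from the application): the abstract above describes weighted phoneme-sequence frequencies and a similarity score without fixing the formulas. The example below assumes log-scaled counts of phoneme bigrams and a cosine comparison; both choices, and all names, are assumptions.

```python
import math
from collections import Counter

def weighted_frequencies(phonemes, n=2):
    """Count phoneme n-grams and apply an assumed 1 + log(count) weighting."""
    counts = Counter(tuple(phonemes[i:i + n])
                     for i in range(len(phonemes) - n + 1))
    return {seq: 1.0 + math.log(c) for seq, c in counts.items()}

def similarity(a, b):
    """Cosine similarity between two weighted-frequency dictionaries."""
    dot = sum(w * b.get(seq, 0.0) for seq, w in a.items())
    norm = (math.sqrt(sum(w * w for w in a.values()))
            * math.sqrt(sum(w * w for w in b.values())))
    return dot / norm if norm else 0.0

first = weighted_frequencies("k ae t s ae t".split())
second = weighted_frequencies("d ao g".split())
same_score = similarity(first, first)
different_score = similarity(first, second)
```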
  • Publication number: 20120010736
    Abstract: A method for detecting sections of a known input in an unknown input includes processing the known input to form a series of discrete-valued feature values associated with corresponding time locations in the known input. Index data associating a plurality of the feature values each with one or more time locations in the known input is then formed. The unknown input is processed to form a series of discrete-valued feature values. A time offset between the unknown input and the known input is determined by determining time locations in the known input associated with the feature values of the unknown input. Determining the time offset may include maintaining a distribution of time offsets based on successive determined time locations of the feature values of the unknown input.
    Type: Application
    Filed: July 9, 2010
    Publication date: January 12, 2012
    Applicant: Nexidia Inc.
    Inventors: Peter S. Cardillo, Marsal Gavalda
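    Illustrative sketch (not taken from the application): the index-and-offset-distribution idea above can be pictured as a voting scheme. The sketch assumes one integer feature value per frame and picks the most frequent offset; the names and the sample data are hypothetical.

```python
from collections import Counter, defaultdict

def build_feature_index(known):
    """Map each discrete feature value to the frame positions where it
    occurs in the known input."""
    index = defaultdict(list)
    for t, value in enumerate(known):
        index[value].append(t)
    return index

def estimate_offset(index, unknown):
    """Accumulate a distribution of (known time - unknown time) offsets
    and return the most common one, or None if nothing matched."""
    votes = Counter()
    for t_unknown, value in enumerate(unknown):
        for t_known in index.get(value, []):
            votes[t_known - t_unknown] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]

known = [7, 3, 9, 4, 4, 1, 8, 2, 6, 5]
unknown = known[4:8]                      # an excerpt starting at frame 4
offset = estimate_offset(build_feature_index(known), unknown)
```

    The repeated value 4 casts one stray vote for offset 3, but the true offset collects a vote from every frame of the excerpt.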
  • Patent number: 8051086
    Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.
    Type: Grant
    Filed: June 24, 2009
    Date of Patent: November 1, 2011
    Assignee: Nexidia Inc.
    Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
  • Publication number: 20110216905
    Abstract: Techniques implemented as systems, methods, and apparatuses, including computer program products, for logging multi-channel audio signals.
    Type: Application
    Filed: March 5, 2010
    Publication date: September 8, 2011
    Applicant: Nexidia Inc.
    Inventors: Marsal Gavalda, Mark Finlay
  • Publication number: 20110125499
    Abstract: Systems, methods, and apparatus, including computer program products for accepting a predetermined vocabulary-dependent characterization of a set of audio signals, the predetermined characterization including an identification of putative occurrences of each of a plurality of vocabulary items in the set of audio signals, the plurality of vocabulary items included in the vocabulary; accepting a new vocabulary item not included in the vocabulary; accepting putative occurrences of the new vocabulary item in the set of audio signals; and generating, by an analysis engine of a speech processing system, an augmented characterization of the set of audio signals based on the identified putative occurrences of the new vocabulary item.
    Type: Application
    Filed: November 24, 2009
    Publication date: May 26, 2011
    Applicant: Nexidia Inc.
    Inventors: Kenneth King Griggs, Jon A. Arrowood
  • Patent number: 7949527
    Abstract: This invention relates to processing of audio files, and more specifically, to an improved technique of searching audio. More particularly, a method and system for processing audio using a multi-stage searching process is disclosed.
    Type: Grant
    Filed: December 19, 2007
    Date of Patent: May 24, 2011
    Assignee: Nexidia, Inc.
    Inventors: Jon A. Arrowood, Robert W. Morris, Kenneth K. Griggs
  • Patent number: 7904296
    Abstract: An approach to wordspotting (180) using query data from one or more spoken instances of a query (140). The query data is processed to determine a representation of the query (160) that defines multiple sequences of subword (130) units each representing the query. Then putative instances of the query (190) are located in input data from an audio signal using the determined representation of the query.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: March 8, 2011
    Assignee: Nexidia Inc.
    Inventor: Robert W. Morris
  • Publication number: 20110044447
    Abstract: Techniques for processing data representative of text associated with one or more content sources to generate a specification of a set of keyphrases of interest; processing a first set of audio signals collected during a first time period to generate first data characterizing putative occurrences of one or more keyphrases of the set in the first set of audio signals; evaluating the first data to generate keyphrase-specific comparison values for the first set of audio signals; deriving first trending data between the first set of audio signals and a second set of audio signals based in part on an analysis of the keyphrase-specific comparison values for the first set of audio signals relative to stored keyphrase-specific baseline values; and generating a visual representation of at least some of the first trending data and causing the visual representation of the first trending data to be presented on a display terminal.
    Type: Application
    Filed: August 21, 2009
    Publication date: February 24, 2011
    Applicant: Nexidia Inc.
    Inventors: Robert W. Morris, Marsal Gavalda, Peter S. Cardillo, Jon A. Arrowood
  • Publication number: 20110037766
    Abstract: Systems and methods are provided for using cluster maps in managing multimedia content including, for example, analyzing audio files stored at a call center. Very generally, a cluster map can be used as an effective tool for visualizing condensed information and for improving the understanding of the characteristics and relationships of the data under study. For example, a set of nodes can be displayed in a cluster map as corresponding to a set of information objects. Each information object may represent the result of a respective query conducted against the data. In some embodiments, multiple relationships between various information objects (such as between different query results) can be displayed simultaneously as graphical links in the map, making data comparison and exploration easier and more intuitive.
    Type: Application
    Filed: August 17, 2010
    Publication date: February 17, 2011
    Applicant: Nexidia Inc.
    Inventors: Scott A. Judy, Marsal Gavalda
  • Publication number: 20110033036
    Abstract: Some general aspects of the invention relate to systems and methods for improving contact center agent performance, for instance, by integrating real-time call monitoring with speech analytics to present agents with information useful to the handling of the current calls. In some implementations, phonetically based speech analysis techniques are applied to process live audio streams to identify key words and/or phrases of relevance, based on which knowledge articles can be selectively presented to agents to drive more efficient business processes.
    Type: Application
    Filed: July 16, 2010
    Publication date: February 10, 2011
    Applicant: Nexidia Inc.
    Inventors: Gordon Edwards, John Willcutts, Jon W. Ezrine, Marsal Gavalda
  • Publication number: 20100329437
    Abstract: A method includes accepting, via an input interface, a caller identifier parameter and a target value of at least one call series parameter; identifying, using a data processor, a plurality of calls each associated with the caller identifier parameter from amongst a set of calls stored in a call center database; analyzing, using the data processor, the identified plurality of calls to determine a value of the at least one call series parameter for the identified plurality of calls; comparing, using the data processor, the determined value of the at least one call series parameter with the target value of the at least one call series parameter; and defining, using the data processor, the identified plurality of calls as a call series based at least in part on results of the comparing.
    Type: Application
    Filed: June 24, 2010
    Publication date: December 30, 2010
    Applicant: Nexidia Inc.
    Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
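    Illustrative sketch (not taken from the application): the steps in the abstract above (select calls by caller identifier, compute a call series parameter, compare it with a target) can be pictured as below. The sketch assumes the parameter is simply the number of calls and that the comparison is "at least the target"; the record layout and names are hypothetical.

```python
def find_call_series(calls, caller_id, target_count):
    """Return the caller's calls as a series if there are at least
    `target_count` of them, otherwise None."""
    # Step 1: identify calls associated with the caller identifier.
    matching = [c for c in calls if c["caller_id"] == caller_id]
    # Step 2: determine the value of the (assumed) call series parameter.
    value = len(matching)
    # Step 3: compare with the target and define the series accordingly.
    return matching if value >= target_count else None

calls = [
    {"caller_id": "A", "day": 1},
    {"caller_id": "B", "day": 1},
    {"caller_id": "A", "day": 2},
    {"caller_id": "A", "day": 4},
]
series_a = find_call_series(calls, "A", 3)
series_b = find_call_series(calls, "B", 2)
```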
  • Publication number: 20100332225
    Abstract: Some general aspects relate to systems and methods for media processing. One aspect, for example, relates to a method for aligning a multimedia recording with a transcript. A group of search terms is formed from the transcript, with each search term being associated with a location within the transcript. Putative locations of the search terms are determined in a time interval of the multimedia recording. For each search term, zero or more putative locations are determined and, for at least some of the search terms, multiple putative locations are determined in the time interval of the multimedia recording. According to a first sequencing constraint, a first representation of a group of sequences each of a subset of the putative locations of the search terms is formed. A second representation of a group of sequences each of a subset of the search terms is formed. Using the first and the second representations, the time interval of the multimedia recording is partially aligned with the transcript.
    Type: Application
    Filed: June 29, 2009
    Publication date: December 30, 2010
    Applicant: Nexidia Inc.
    Inventors: Jon A. Arrowood, Kenneth King Griggs, Marsal Gavalda, Robert W. Morris
  • Publication number: 20100332477
    Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.
    Type: Application
    Filed: June 24, 2009
    Publication date: December 30, 2010
    Applicant: Nexidia Inc.
    Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
  • Publication number: 20100299131
    Abstract: Some general aspects relate to systems, software, and methods for media processing. In one aspect, a script associated with a multimedia recording is accepted, wherein the script includes dialogue, speaker indications and video event indications. A group of search terms are formed from the dialogue, with each search term being associated with a location within the script. Zero or more putative locations of each of the search terms are identified in a time interval of the multimedia recording. For at least some of the search terms, multiple putative locations are identified in the time interval of the multimedia recording. The time interval of the multimedia recording and the script are partially aligned using the determined putative locations of the search terms and one or more of the following: a result of matching audio characteristics of the multimedia recording with the speaker indications, and a result of matching video characteristics of the multimedia recording with the video event indications.
    Type: Application
    Filed: May 21, 2009
    Publication date: November 25, 2010
    Applicant: Nexidia Inc.
    Inventors: Drew Lanham, Daryl Kip Watters, Marsal Gavalda
  • Publication number: 20100274667
    Abstract: A computer-implemented method provides access to multimedia content, which includes units of content that include audio components. Meta data for the units of content is formed to represent an association of key phrases detected in the audio components and the units. In some examples, forming the meta data includes determining a candidate set of key phrases associated with the unit of multimedia and searching for the presence of the candidate key phrases in the audio components. Forming the meta data then includes forming data representing the presence of key phrases in the audio components.
    Type: Application
    Filed: April 24, 2009
    Publication date: October 28, 2010
    Applicant: Nexidia Inc.
    Inventors: Drew Lanham, Marsal Gavalda, John Willcutts, Gordon Edwards