Patents Assigned to Nexidia, Inc.
-
Publication number: 20130035936Abstract: A transcription system is applicable to transcription for a language in which there is limited pronunciation and/or acoustic data. A transcription station is configured using pronunciation data and acoustic data for use with the language. The pronunciation data and/or the acoustic data is initially from another dialect of a language, another language from a language group, or is universal (e.g., not specific to any particular language). A partial transcription of the audio recording is accepted via the transcription station (e.g., from a transcriptionist). One or more repetitions of one or more portions of the partial transcription are identified in the audio recording, and can be accepted during transcription. The pronunciation data and/or the acoustic data is updated in a bootstrapping manner during transcription, thereby improving the efficiency of the transcription process.Type: ApplicationFiled: August 1, 2012Publication date: February 7, 2013Applicant: Nexidia Inc.Inventors: Jacob B. Garland, Marsal Gavalda
-
Patent number: 8311828Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are indentified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.Type: GrantFiled: August 27, 2008Date of Patent: November 13, 2012Assignee: Nexidia Inc.Inventors: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
-
Publication number: 20120284026Abstract: In an aspect, in general, a method for computer assisted speaker authentication in a voice communication session includes establishing a voice communication session between a first speaker and an agent, accepting a first voice signal from the first speaker, determining a voice characteristic measure of the first voice signal, including characterizing a similarity of the first voice signal to each of one or more stored characterizations of voice signals previously acquired from one or more known speakers, and providing an interface to the agent during the voice communication session between the agent and the first speaker, including presenting an indicator based on the determined voice characteristic measure to the agent.Type: ApplicationFiled: May 6, 2011Publication date: November 8, 2012Applicant: Nexidia Inc.Inventors: Peter S. Cardillo, Marsal Gavalda
-
Publication number: 20120278071Abstract: A transcription system automates the control of the playback of the audio to accommodate the user's ability to transcribe the words spoken. In some examples, a delay between playback and typed input is estimated by processing the typed words using a wordspotting approach. The estimated delay is used as in input to an automated speed control, for example, to maintain a target or maximum delay between playback and typed input.Type: ApplicationFiled: April 29, 2011Publication date: November 1, 2012Applicant: Nexidia Inc.Inventors: Jacob B. Garland, Marsal Gavalda
-
Patent number: 8170873Abstract: An approach to comparing events in word spotting, such as comparing putative and reference instances of a keyword, makes use of a set of models of subword units. For each of two acoustic events and for each of a series of times in each of the events, a probability associated with each of the models of the set of subword units is computed. Then, a quantity characterizing a comparison of the two acoustic events, one occurring in each of the two acoustic signals, is computed using the computed probabilities associated with each of the models.Type: GrantFiled: July 22, 2004Date of Patent: May 1, 2012Assignee: Nexidia Inc.Inventor: Robert W. Morris
-
Publication number: 20120059656Abstract: A method for determining a similarity between a first audio source and a second audio source includes: for the first audio source, determining a first frequency of occurrence for each of a plurality of phoneme sequences and determining a first weighted frequency for each of the plurality of phoneme sequences based on the first frequency of occurrence for the phoneme sequence; for the second audio source, determining a second frequency of occurrence for each of a plurality of phoneme sequences and determining a second weighted frequency for each of the plurality of phoneme sequences based on the second frequency of occurrence for the phoneme sequence; comparing the first weighted frequency for each phoneme sequence with the second weighted frequency for the corresponding phoneme sequence; and generating a similarity score representative of a similarity between the first audio source and the second audio source based on the results of the comparing.Type: ApplicationFiled: August 30, 2011Publication date: March 8, 2012Applicant: Nexidia Inc.Inventors: Jacob B. Garland, Jon A. Arrowood, Drew Lanham, Marsal Gavalda
-
Publication number: 20120010736Abstract: A method for detecting sections of a known input in an unknown input includes processing the known input to form a series of discrete-valued feature values associated with corresponding time locations in the known input. Index data associating a plurality of the feature values each with one or more time locations in the known input is then formed. The unknown input is processed to form a series of discrete-valued features values. A time offset between the unknown input and the known input is determined by determining time locations in the known input associated with the feature values of the unknown input. Determining the time offset may include maintaining a distribution of time offsets based on successive determined time locations of the feature values of the unknown input.Type: ApplicationFiled: July 9, 2010Publication date: January 12, 2012Applicant: Nexidia Inc.Inventors: Peter S. Cardillo, Marsal Gavalda
-
Patent number: 8051086Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.Type: GrantFiled: June 24, 2009Date of Patent: November 1, 2011Assignee: Nexidia Inc.Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
-
Publication number: 20110216905Abstract: Techniques implemented as systems, methods, and apparatuses, including computer program products, for logging multi-channel audio signals.Type: ApplicationFiled: March 5, 2010Publication date: September 8, 2011Applicant: Nexidia Inc.Inventors: Marsal Gavalda, Mark Finlay
-
Publication number: 20110125499Abstract: Systems, methods, and apparatus, including computer program products for accepting a predetermined vocabulary-dependent characterization of a set of audio signals, the predetermined characterization including an identification of putative occurrences of each of a plurality of vocabulary items in the set of audio signals, the plurality of vocabulary items included in the vocabulary; accepting a new vocabulary item not included in the vocabulary; accepting putative occurrences of the new vocabulary item in the set of audio signals; and generating, by an analysis engine of a speech processing system, an augmented characterization of the set of audio signals based on the identified putative occurrences of the new vocabulary item.Type: ApplicationFiled: November 24, 2009Publication date: May 26, 2011Applicant: Nexidia Inc.Inventors: Kenneth King Griggs, Jon A. Arrowood
-
Patent number: 7949527Abstract: This invention relates to processing of audio files, and more specifically, to an improved technique of searching audio. More particularly, a method and system for processing audio using a multi-stage searching process is disclosed.Type: GrantFiled: December 19, 2007Date of Patent: May 24, 2011Assignee: Nexidia, Inc.Inventors: Jon A. Arrowood, Robert W. Morris, Kenneth K. Griggs
-
Patent number: 7904296Abstract: An approach to wordspotting (180) using query data from one or more spoken instance of a query (140). The query data is processed to determining a representation of the query (160) that defines multiple sequences of subword (130) units each representing the query. Then putative instances of the query (190) are located in input data from an audio signal using the determined representation of the query.Type: GrantFiled: July 22, 2004Date of Patent: March 8, 2011Assignee: Nexidia Inc.Inventor: Robert W. Morris
-
Publication number: 20110044447Abstract: Techniques for processing data representative of text associated with one or more content sources to generate a specification of a set of keyphrases of interest; processing a first set of audio signals collected during a first time period to generate first data characterizing putative occurrences of one or more keyphrases of the set in the first set of audio signals; evaluating the first data to generate keyphrase-specific comparison values for the first set of audio signals; deriving first trending data between the first set of audio signals and a second set of audio signals based in part on an analysis of the keyphrase-specific comparison values for the first set of audio signals relative to stored keyphrase-specific baseline values; and generating a visual representation of at least some of the first trending data and causing the visual representation of the first trending data to be presented on a display terminal.Type: ApplicationFiled: August 21, 2009Publication date: February 24, 2011Applicant: Nexidia Inc.Inventors: Robert W. Morris, Marsal Gavalda, Peter S. Cardillo, Jon A. Arrowood
-
Publication number: 20110037766Abstract: Systems and methods are providing for using cluster maps in managing multimedia content including, for example, analyzing audio files stored at a call center. Very generally, a cluster map can be used as an effective tool for visualizing condensed information and for improving the understanding of the characteristics and relationships of the data under study. For example, a set of nodes can be displayed in a cluster map as corresponding to a set of information objects. Each information object may represent the result of a respective query conducted against the data. In some embodiments, multiple relationships between various information objects (such as between different query results) can be displayed simultaneously as graphical links in the map, making data comparison and exploration easier and more intuitive.Type: ApplicationFiled: August 17, 2010Publication date: February 17, 2011Applicant: Nexidia Inc.Inventors: Scott A. Judy, Marsal Gavalda
-
Publication number: 20110033036Abstract: Some general aspects of the invention relate to systems and methods for improving contact center agent performance, for instance, by integrating real-time call monitoring with speech analytics to present agents with information useful to the handling of the current calls. In some implementations, phonetically based speech analysis techniques are applied to process live audio streams to identify key words and/or phrases of relevance, based on which knowledge articles can be selectively presented to agents to drive more efficient business processes.Type: ApplicationFiled: July 16, 2010Publication date: February 10, 2011Applicant: Nexidia Inc.Inventors: Gordon Edwards, John Willcutts, Jon W. Ezrine, Marsal Gavalda
-
Publication number: 20100329437Abstract: A method includes accepting, via an input interface, a caller identifier parameter and a target value of at least one call series parameter; identifying, using a data processor, a plurality of calls each associated with the caller identifier parameter from amongst a set of calls stored in a call center database; analyzing, using the data processor, the identified plurality of calls to determine a value of the at least one call series parameter for the identified plurality of calls; comparing, using the data processor, the determined value of the at least one call series parameter with the target value of the at least one call series parameter; and defining, using the data processor, the identified plurality of calls as a call series based at least in part on results of the comparing.Type: ApplicationFiled: June 24, 2010Publication date: December 30, 2010Applicant: Nexidia Inc.Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
-
Publication number: 20100332225Abstract: Some general aspects relate to systems and methods for media processing. One aspect, for example, relates to a method for aligning multimedia recording with a transcript. A group of search terms are formed from the transcript, with each search term being associated with a location within the transcript. Putative locations of the search terms are determined in a time interval of the multimedia recording. For each search term, zero or more putative locations are determined and, for at least some of the search terms, multiple putative locations are determined in the time interval of the multimedia recording. According to a first sequencing constraint, a first representation of a group of sequences each of a subset of the putative locations of the search terms is formed. A second representation of a group of sequences each of a subset of the search terms is formed. Using the first and the second representations, the time interval of the multimedia recording is partially aligned with the transcript.Type: ApplicationFiled: June 29, 2009Publication date: December 30, 2010Applicant: Nexidia Inc.Inventors: Jon A. Arrowood, Kenneth King Griggs, Marsal Gavalda, Robert W. Morris
-
Publication number: 20100332477Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.Type: ApplicationFiled: June 24, 2009Publication date: December 30, 2010Applicant: Nexidia Inc.Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
-
Publication number: 20100299131Abstract: Some general aspects relate to systems, software, and methods for media processing. In one aspect, a script associated with a multimedia recording is accepted, wherein the script includes dialogue, speaker indications and video event indications. A group of search terms are formed from the dialogue, with each search term being associated with a location within the script. Zero or more putative locations of each of the search terms are identified in a time interval of the multimedia recording. For at least some of the search terms, multiple putative locations are identified in the time interval of the multimedia recording. The time interval of the multimedia recording and the script are partially aligned using the determined putative locations of the search terms and one or more of the following: a result of matching audio characteristics of the multimedia recording with the speaker indications, and a result of matching video characteristics of the multimedia recording with the video event indications.Type: ApplicationFiled: May 21, 2009Publication date: November 25, 2010Applicant: Nexidia Inc.Inventors: Drew Lanham, Daryl Kip Watters, Marsal Gavalda
-
Publication number: 20100274667Abstract: A computer-implemented method provides access to multimedia content, which include units of content that include audio components. Meta data for the units of content is formed to an association of key phrases detected in the audio components and the units. In some examples, forming the meta data includes determining a candidate set of key phrases associated with the unit of multimedia and searching for the presence of the candidate key phrases in the audio components. Forming the meta data then includes forming data representing the presence of key phrases in the audio components.Type: ApplicationFiled: April 24, 2009Publication date: October 28, 2010Applicant: Nexidia Inc.Inventors: Drew Lanham, Marsal Gavalda, John Willcutts, Gordon Edwards