Patents Assigned to Nexidia, Inc.

LANGUAGE TRANSCRIPTION

Publication number: 20130035936

Abstract: A transcription system is applicable to transcription for a language in which there is limited pronunciation and/or acoustic data. A transcription station is configured using pronunciation data and acoustic data for use with the language. The pronunciation data and/or the acoustic data is initially from another dialect of a language, another language from a language group, or is universal (e.g., not specific to any particular language). A partial transcription of the audio recording is accepted via the transcription station (e.g., from a transcriptionist). One or more repetitions of one or more portions of the partial transcription are identified in the audio recording, and can be accepted during transcription. The pronunciation data and/or the acoustic data is updated in a bootstrapping manner during transcription, thereby improving the efficiency of the transcription process.

Type: Application

Filed: August 1, 2012

Publication date: February 7, 2013

Applicant: Nexidia Inc.

Inventors: Jacob B. Garland, Marsal Gavalda
Keyword spotting using a phoneme-sequence index

Patent number: 8311828

Abstract: In some aspects, a wordspotter is used to locate occurrences in an audio corpus of each of a set of predetermined subword units, which may be phoneme sequences. To locate a query (e.g., a keyword or phrase) in the audio corpus, constituent subword units in the query are identified and then locations of those subwords are determined based on the locations of those subword units determined earlier by the wordspotter, for example, using a pre-built inverted index that maps subword units to their locations.

Type: Grant

Filed: August 27, 2008

Date of Patent: November 13, 2012

Assignee: Nexidia Inc.

Inventors: Jon A. Arrowood, Robert W. Morris, Mark Finlay, Scott A. Judy
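
The inverted-index lookup described in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the subword labels, the hit times, the fixed spacing between consecutive units (`step`), and the tolerance (`tol`) are all hypothetical values chosen for illustration.

```python
from collections import defaultdict

# Hypothetical wordspotter output: (subword unit, start time in seconds).
hits = [("k-ae", 1.00), ("ae-t", 1.08), ("k-ae", 5.20), ("ae-b", 5.28)]


def build_index(hits):
    """Build an inverted index mapping each subword unit to its start times."""
    index = defaultdict(list)
    for unit, start in hits:
        index[unit].append(start)
    return index


def locate_query(index, units, step=0.08, tol=0.02):
    """Return start times where the units occur in order.

    Assumes (for illustration only) that consecutive units are spaced
    roughly `step` seconds apart, within `tol` seconds.
    """
    results = []
    for start in index.get(units[0], []):
        if all(
            any(abs(t - (start + i * step)) <= tol for t in index.get(unit, []))
            for i, unit in enumerate(units[1:], start=1)
        ):
            results.append(start)
    return results


index = build_index(hits)
print(locate_query(index, ["k-ae", "ae-t"]))
```

The query is never searched for in the audio directly; only the precomputed index is consulted, which is the point the abstract makes about reusing the wordspotter's earlier results.
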
SPEAKER VERIFICATION SYSTEM

Publication number: 20120284026

Abstract: In an aspect, in general, a method for computer assisted speaker authentication in a voice communication session includes establishing a voice communication session between a first speaker and an agent, accepting a first voice signal from the first speaker, determining a voice characteristic measure of the first voice signal, including characterizing a similarity of the first voice signal to each of one or more stored characterizations of voice signals previously acquired from one or more known speakers, and providing an interface to the agent during the voice communication session between the agent and the first speaker, including presenting an indicator based on the determined voice characteristic measure to the agent.

Type: Application

Filed: May 6, 2011

Publication date: November 8, 2012

Applicant: Nexidia Inc.

Inventors: Peter S. Cardillo, Marsal Gavalda
TRANSCRIPTION SYSTEM

Publication number: 20120278071

Abstract: A transcription system automates the control of the playback of the audio to accommodate the user's ability to transcribe the words spoken. In some examples, a delay between playback and typed input is estimated by processing the typed words using a wordspotting approach. The estimated delay is used as an input to an automated speed control, for example, to maintain a target or maximum delay between playback and typed input.

Type: Application

Filed: April 29, 2011

Publication date: November 1, 2012

Applicant: Nexidia Inc.

Inventors: Jacob B. Garland, Marsal Gavalda
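
As a rough sketch of the delay-driven speed control described in the abstract above, the following uses a simple proportional rule. The abstract does not say which control law is used, so the rule itself, the function names, and the values of `target_delay`, `gain`, `lo`, and `hi` are assumptions. The audio position of the last typed word is taken as given here; in the abstract it would come from wordspotting.

```python
def estimate_delay(playback_pos, last_typed_word_pos):
    """Seconds between the playback position and the audio time of the
    last typed word (never negative)."""
    return max(0.0, playback_pos - last_typed_word_pos)


def next_speed(speed, delay, target_delay=2.0, gain=0.1, lo=0.5, hi=1.5):
    """Slow playback when the typist lags beyond the target delay and
    speed it up when the typist is ahead of it (proportional rule,
    clamped to the range [lo, hi])."""
    adjusted = speed - gain * (delay - target_delay)
    return max(lo, min(hi, adjusted))


speed = 1.0
# Hypothetical (playback position, last typed word position) samples.
for playback_pos, typed_pos in [(10.0, 9.0), (20.0, 15.0), (30.0, 22.0)]:
    delay = estimate_delay(playback_pos, typed_pos)
    speed = next_speed(speed, delay)
    print(f"delay={delay:.1f}s -> speed={speed:.2f}x")
```
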
Comparing events in word spotting

Patent number: 8170873

Abstract: An approach to comparing events in word spotting, such as comparing putative and reference instances of a keyword, makes use of a set of models of subword units. For each of two acoustic events and for each of a series of times in each of the events, a probability associated with each of the models of the set of subword units is computed. Then, a quantity characterizing a comparison of the two acoustic events, one occurring in each of the two acoustic signals, is computed using the computed probabilities associated with each of the models.

Type: Grant

Filed: July 22, 2004

Date of Patent: May 1, 2012

Assignee: Nexidia Inc.

Inventor: Robert W. Morris
Speech Signal Similarity

Publication number: 20120059656

Abstract: A method for determining a similarity between a first audio source and a second audio source includes: for the first audio source, determining a first frequency of occurrence for each of a plurality of phoneme sequences and determining a first weighted frequency for each of the plurality of phoneme sequences based on the first frequency of occurrence for the phoneme sequence; for the second audio source, determining a second frequency of occurrence for each of a plurality of phoneme sequences and determining a second weighted frequency for each of the plurality of phoneme sequences based on the second frequency of occurrence for the phoneme sequence; comparing the first weighted frequency for each phoneme sequence with the second weighted frequency for the corresponding phoneme sequence; and generating a similarity score representative of a similarity between the first audio source and the second audio source based on the results of the comparing.

Type: Application

Filed: August 30, 2011

Publication date: March 8, 2012

Applicant: Nexidia Inc.

Inventors: Jacob B. Garland, Jon A. Arrowood, Drew Lanham, Marsal Gavalda
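
The comparison of weighted phoneme-sequence frequencies described in the abstract above might look like the sketch below. The abstract does not name a weighting or a comparison function; logarithmic weighting and cosine similarity are assumptions made here, as are the phoneme labels and the use of bigrams as the phoneme sequences.

```python
import math
from collections import Counter


def sequence_counts(phonemes, n=2):
    """Frequency of occurrence of each phoneme n-gram in one source."""
    return Counter(tuple(phonemes[i:i + n]) for i in range(len(phonemes) - n + 1))


def weighted(counts):
    """Weighted frequency per sequence (log damping is an assumption)."""
    return {seq: 1.0 + math.log(count) for seq, count in counts.items()}


def similarity(source_a, source_b, n=2):
    """Cosine similarity between the weighted frequencies of two sources."""
    wa = weighted(sequence_counts(source_a, n))
    wb = weighted(sequence_counts(source_b, n))
    dot = sum(w * wb.get(seq, 0.0) for seq, w in wa.items())
    norm_a = math.sqrt(sum(w * w for w in wa.values()))
    norm_b = math.sqrt(sum(w * w for w in wb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical phoneme streams for two audio sources.
first = ["h", "eh", "l", "ow", "w", "er", "l", "d"]
second = ["h", "eh", "l", "ow", "dh", "eh", "r"]
print(round(similarity(first, second), 3))
```
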
SPOTTING MULTIMEDIA

Publication number: 20120010736

Abstract: A method for detecting sections of a known input in an unknown input includes processing the known input to form a series of discrete-valued feature values associated with corresponding time locations in the known input. Index data associating a plurality of the feature values each with one or more time locations in the known input is then formed. The unknown input is processed to form a series of discrete-valued feature values. A time offset between the unknown input and the known input is determined by determining time locations in the known input associated with the feature values of the unknown input. Determining the time offset may include maintaining a distribution of time offsets based on successive determined time locations of the feature values of the unknown input.

Type: Application

Filed: July 9, 2010

Publication date: January 12, 2012

Applicant: Nexidia Inc.

Inventors: Peter S. Cardillo, Marsal Gavalda
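
The offset estimation described in the abstract above can be sketched as an index lookup followed by a vote over candidate offsets. The feature values below are made-up integers, one per frame, and using the most frequent offset as the estimate is an assumption about how the "distribution of time offsets" might be used.

```python
from collections import Counter, defaultdict


def build_feature_index(known):
    """Map each discrete feature value to its time locations (frame
    numbers) in the known input."""
    index = defaultdict(list)
    for t, value in enumerate(known):
        index[value].append(t)
    return index


def estimate_offset(index, unknown):
    """Accumulate a distribution of time offsets and return the most
    common one as (offset, votes), or None if nothing matched."""
    votes = Counter()
    for t_unknown, value in enumerate(unknown):
        for t_known in index.get(value, []):
            votes[t_known - t_unknown] += 1
    return votes.most_common(1)[0] if votes else None


# Hypothetical feature sequences: the unknown clip is a section of the known input.
known = [3, 7, 7, 1, 4, 9, 2, 7, 5]
unknown = [4, 9, 2, 7]
print(estimate_offset(build_feature_index(known), unknown))
```

Repeated feature values (here, `7`) produce spurious offsets, but they are outvoted because only the true offset is supported by every frame of the unknown input.
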
Enhancing call center performance

Patent number: 8051086

Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.

Type: Grant

Filed: June 24, 2009

Date of Patent: November 1, 2011

Assignee: Nexidia Inc.

Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
CHANNEL COMPRESSION

Publication number: 20110216905

Abstract: Techniques implemented as systems, methods, and apparatuses, including computer program products, for logging multi-channel audio signals.

Type: Application

Filed: March 5, 2010

Publication date: September 8, 2011

Applicant: Nexidia Inc.

Inventors: Marsal Gavalda, Mark Finlay
SPEECH RECOGNITION

Publication number: 20110125499

Abstract: Systems, methods, and apparatus, including computer program products for accepting a predetermined vocabulary-dependent characterization of a set of audio signals, the predetermined characterization including an identification of putative occurrences of each of a plurality of vocabulary items in the set of audio signals, the plurality of vocabulary items included in the vocabulary; accepting a new vocabulary item not included in the vocabulary; accepting putative occurrences of the new vocabulary item in the set of audio signals; and generating, by an analysis engine of a speech processing system, an augmented characterization of the set of audio signals based on the identified putative occurrences of the new vocabulary item.

Type: Application

Filed: November 24, 2009

Publication date: May 26, 2011

Applicant: Nexidia Inc.

Inventors: Kenneth King Griggs, Jon A. Arrowood
Multiresolution searching

Patent number: 7949527

Abstract: This invention relates to processing of audio files, and more specifically, to an improved technique of searching audio. More particularly, a method and system for processing audio using a multi-stage searching process is disclosed.

Type: Grant

Filed: December 19, 2007

Date of Patent: May 24, 2011

Assignee: Nexidia, Inc.

Inventors: Jon A. Arrowood, Robert W. Morris, Kenneth K. Griggs
Spoken word spotting queries

Patent number: 7904296

Abstract: An approach to wordspotting (180) using query data from one or more spoken instances of a query (140). The query data is processed to determine a representation of the query (160) that defines multiple sequences of subword (130) units each representing the query. Then putative instances of the query (190) are located in input data from an audio signal using the determined representation of the query.

Type: Grant

Filed: July 22, 2004

Date of Patent: March 8, 2011

Assignee: Nexidia Inc.

Inventor: Robert W. Morris
TREND DISCOVERY IN AUDIO SIGNALS

Publication number: 20110044447

Abstract: Techniques for processing data representative of text associated with one or more content sources to generate a specification of a set of keyphrases of interest; processing a first set of audio signals collected during a first time period to generate first data characterizing putative occurrences of one or more keyphrases of the set in the first set of audio signals; evaluating the first data to generate keyphrase-specific comparison values for the first set of audio signals; deriving first trending data between the first set of audio signals and a second set of audio signals based in part on an analysis of the keyphrase-specific comparison values for the first set of audio signals relative to stored keyphrase-specific baseline values; and generating a visual representation of at least some of the first trending data and causing the visual representation of the first trending data to be presented on a display terminal.

Type: Application

Filed: August 21, 2009

Publication date: February 24, 2011

Applicant: Nexidia Inc.

Inventors: Robert W. Morris, Marsal Gavalda, Peter S. Cardillo, Jon A. Arrowood
CLUSTER MAP DISPLAY

Publication number: 20110037766

Abstract: Systems and methods are provided for using cluster maps in managing multimedia content including, for example, analyzing audio files stored at a call center. Very generally, a cluster map can be used as an effective tool for visualizing condensed information and for improving the understanding of the characteristics and relationships of the data under study. For example, a set of nodes can be displayed in a cluster map as corresponding to a set of information objects. Each information object may represent the result of a respective query conducted against the data. In some embodiments, multiple relationships between various information objects (such as between different query results) can be displayed simultaneously as graphical links in the map, making data comparison and exploration easier and more intuitive.

Type: Application

Filed: August 17, 2010

Publication date: February 17, 2011

Applicant: Nexidia Inc.

Inventors: Scott A. Judy, Marsal Gavalda
REAL-TIME AGENT ASSISTANCE

Publication number: 20110033036

Abstract: Some general aspects of the invention relate to systems and methods for improving contact center agent performance, for instance, by integrating real-time call monitoring with speech analytics to present agents with information useful to the handling of the current calls. In some implementations, phonetically based speech analysis techniques are applied to process live audio streams to identify key words and/or phrases of relevance, based on which knowledge articles can be selectively presented to agents to drive more efficient business processes.

Type: Application

Filed: July 16, 2010

Publication date: February 10, 2011

Applicant: Nexidia Inc.

Inventors: Gordon Edwards, John Willcutts, Jon W. Ezrine, Marsal Gavalda
Enterprise Speech Intelligence Analysis

Publication number: 20100329437

Abstract: A method includes accepting, via an input interface, a caller identifier parameter and a target value of at least one call series parameter; identifying, using a data processor, a plurality of calls each associated with the caller identifier parameter from amongst a set of calls stored in a call center database; analyzing, using the data processor, the identified plurality of calls to determine a value of the at least one call series parameter for the identified plurality of calls; comparing, using the data processor, the determined value of the at least one call series parameter with the target value of the at least one call series parameter; and defining, using the data processor, the identified plurality of calls as a call series based at least in part on results of the comparing.

Type: Application

Filed: June 24, 2010

Publication date: December 30, 2010

Applicant: Nexidia Inc.

Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
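
The steps in the abstract above (identify calls by caller identifier, measure a call series parameter, compare it with a target, define the series) can be sketched as follows. The choice of "time span between first and last call" as the call series parameter, the record layout, and the function name are illustrative assumptions, not details taken from the application.

```python
from datetime import datetime, timedelta

# Hypothetical call records from a call center database.
calls = [
    {"caller": "555-0100", "start": datetime(2010, 6, 1, 9, 0)},
    {"caller": "555-0199", "start": datetime(2010, 6, 2, 14, 30)},
    {"caller": "555-0100", "start": datetime(2010, 6, 3, 11, 15)},
    {"caller": "555-0100", "start": datetime(2010, 6, 20, 16, 45)},
]


def find_call_series(calls, caller_id, target_span):
    """Return the caller's calls as a call series if the span between the
    first and last call is within the target, otherwise None."""
    matched = sorted(
        (call for call in calls if call["caller"] == caller_id),
        key=lambda call: call["start"],
    )
    if len(matched) < 2:
        return None  # a series needs more than one call
    span = matched[-1]["start"] - matched[0]["start"]
    return matched if span <= target_span else None


series = find_call_series(calls, "555-0100", timedelta(days=30))
print(len(series) if series else "no series")
```
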
TRANSCRIPT ALIGNMENT

Publication number: 20100332225

Abstract: Some general aspects relate to systems and methods for media processing. One aspect, for example, relates to a method for aligning a multimedia recording with a transcript. A group of search terms are formed from the transcript, with each search term being associated with a location within the transcript. Putative locations of the search terms are determined in a time interval of the multimedia recording. For each search term, zero or more putative locations are determined and, for at least some of the search terms, multiple putative locations are determined in the time interval of the multimedia recording. According to a first sequencing constraint, a first representation of a group of sequences each of a subset of the putative locations of the search terms is formed. A second representation of a group of sequences each of a subset of the search terms is formed. Using the first and the second representations, the time interval of the multimedia recording is partially aligned with the transcript.

Type: Application

Filed: June 29, 2009

Publication date: December 30, 2010

Applicant: Nexidia Inc.

Inventors: Jon A. Arrowood, Kenneth King Griggs, Marsal Gavalda, Robert W. Morris
Enhancing Call Center Performance

Publication number: 20100332477

Abstract: Some general aspects of the invention relate to systems and methods of processing data, for example, to improve customer interactions. One aspect, in particular, relates to a computer-implemented method that includes accepting user input for analysis of a database having media data and metadata. The media data includes a group of audio recordings and the metadata includes descriptive information of the group of audio recordings. A representation of a set of call series is formed based on user input, and processed to generate an analysis report. A visual representation of the analysis report is formed for presentation to a user.

Type: Application

Filed: June 24, 2009

Publication date: December 30, 2010

Applicant: Nexidia Inc.

Inventors: Christopher J. Jeffs, Marsal Gavalda, Philip Kyle Pledger, Robert Troy Surdick
TRANSCRIPT ALIGNMENT

Publication number: 20100299131

Abstract: Some general aspects relate to systems, software, and methods for media processing. In one aspect, a script associated with a multimedia recording is accepted, wherein the script includes dialogue, speaker indications and video event indications. A group of search terms are formed from the dialogue, with each search term being associated with a location within the script. Zero or more putative locations of each of the search terms are identified in a time interval of the multimedia recording. For at least some of the search terms, multiple putative locations are identified in the time interval of the multimedia recording. The time interval of the multimedia recording and the script are partially aligned using the determined putative locations of the search terms and one or more of the following: a result of matching audio characteristics of the multimedia recording with the speaker indications, and a result of matching video characteristics of the multimedia recording with the video event indications.

Type: Application

Filed: May 21, 2009

Publication date: November 25, 2010

Applicant: Nexidia Inc.

Inventors: Drew Lanham, Daryl Kip Watters, Marsal Gavalda
MULTIMEDIA ACCESS

Publication number: 20100274667

Abstract: A computer-implemented method provides access to multimedia content, which includes units of content that include audio components. Meta data for the units of content is formed to an association of key phrases detected in the audio components and the units. In some examples, forming the meta data includes determining a candidate set of key phrases associated with the unit of multimedia and searching for the presence of the candidate key phrases in the audio components. Forming the meta data then includes forming data representing the presence of key phrases in the audio components.

Type: Application

Filed: April 24, 2009

Publication date: October 28, 2010

Applicant: Nexidia Inc.

Inventors: Drew Lanham, Marsal Gavalda, John Willcutts, Gordon Edwards
