Preliminary Matching Patents (Class 704/247)
  • Patent number: 10262660
    Abstract: A method includes detecting an event published to a workflow activity by a voice-based dialog view, wherein the event indicates a state of asset retrieval, navigating to a built-in asset retrieval workflow activity, retrieving an asset, and dismissing the workflow activity to revert to a workflow activity associated with the voice-based dialog view.
    Type: Grant
    Filed: January 5, 2016
    Date of Patent: April 16, 2019
    Assignee: HAND HELD PRODUCTS, INC.
    Inventors: Shawn Zabel, Jeffrey Pike, Brian Bender, Dennis Doubleday, Mark Murawski
  • Patent number: 10249306
    Abstract: A speaker identification device includes: a primary speaker identification unit that computes, for each pre-stored registered speaker, a score that indicates the similarity between input speech and speech of the registered speakers; a similar speaker selection unit that selects a plurality of the registered speakers as similar speakers according to how high their scores are; a learning unit that creates a classifier for each similar speaker by treating the speech of that similar speaker as a positive instance and the speech of the other similar speakers as negative instances; and a secondary speaker identification unit that computes, for each classifier, a score of the classifier with respect to the input speech, and outputs an identification result.
    Type: Grant
    Filed: January 16, 2014
    Date of Patent: April 2, 2019
    Assignee: NEC CORPORATION
    Inventors: Masahiro Tani, Takafumi Koshinaka, Yoshifumi Onishi, Shigeru Sawada
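    Sketch: a minimal Python sketch of the two-stage identification flow described in this abstract, assuming fixed-length speaker embeddings are already available; cosine similarity and logistic-regression classifiers are illustrative stand-ins, not choices taken from the patent.
```python
# Two-stage speaker identification sketch (illustrative; embeddings, cosine
# similarity and logistic regression are assumptions, not the patent's choices).
import numpy as np
from sklearn.linear_model import LogisticRegression

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def identify(input_emb, enrolled, n_similar=3):
    """enrolled: dict speaker_id -> list of embedding vectors (one per utterance)."""
    # Primary stage: score every registered speaker against the input speech.
    primary = {spk: max(cosine(input_emb, e) for e in embs)
               for spk, embs in enrolled.items()}
    # Select the n_similar highest-scoring registered speakers as "similar speakers".
    similar = sorted(primary, key=primary.get, reverse=True)[:n_similar]
    # Learning stage: one binary classifier per similar speaker (that speaker's
    # utterances are positives, the other similar speakers' utterances negatives).
    secondary = {}
    for spk in similar:
        X = [e for other in similar for e in enrolled[other]]
        y = [1 if other == spk else 0 for other in similar for _ in enrolled[other]]
        clf = LogisticRegression(max_iter=1000).fit(np.array(X), np.array(y))
        # Secondary stage: score each classifier on the input utterance.
        secondary[spk] = float(clf.predict_proba(input_emb.reshape(1, -1))[0, 1])
    return max(secondary, key=secondary.get), secondary
```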
  • Patent number: 10140992
    Abstract: Systems, computer-implemented methods, and tangible computer-readable media are provided for voice authentication. The method includes receiving, on a mobile device, a speech sample from a user as part of a request for a restricted-access resource separate from the mobile device. When the user has previously established an identity with the mobile device, the method includes transmitting the speech sample along with the request to an authentication server which compares the speech sample to a previously established speech profile associated with the user and providing access to the restricted-access resource based on a response to the request from the authentication server if the speech sample from the user matches the speech profile on the authentication server with a minimum certainty threshold.
    Type: Grant
    Filed: April 6, 2017
    Date of Patent: November 27, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventor: Saurabh Kumar
  • Patent number: 10068445
    Abstract: Systems and methods of a security system are provided, including detecting, by a sensor, a sound event, and selecting, by a processor coupled to the sensor, at least a portion of sound data captured by the sensor that corresponds to at least one sound feature of the detected sound event. The systems and methods include classifying the at least one sound feature into one or more sound categories, and determining, by a processor, based upon a database of home-specific sound data, whether the at least one sound feature is a human-generated sound. A notification can be transmitted to a computing device according to the sound event.
    Type: Grant
    Filed: June 24, 2015
    Date of Patent: September 4, 2018
    Assignee: Google LLC
    Inventors: Rajeev Conrad Nongpiur, Michael Dixon
  • Patent number: 9691410
    Abstract: The present invention relates to a frequency band extending device and method, an encoding device and method, a decoding device and method, and a program, whereby music signals can be played with higher sound quality due to the extension of frequency bands. A bandpass filter 13 divides an input signal into multiple sub-band signals, a feature amount calculating circuit 14 calculates feature amount using at least one of the multiple divided sub-band signals and the input signal, a high frequency sub-band power estimating circuit 15 calculates an estimated value of a high frequency sub-band power based on the calculated feature amount, a high frequency signal generating circuit 16 generates a high frequency signal component based on the multiple sub-band signals divided by the bandpass filter 13, and the estimated value of the high frequency sub-band power calculated by the high frequency sub-band power estimating circuit 15.
    Type: Grant
    Filed: September 30, 2015
    Date of Patent: June 27, 2017
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Yuhki Mitsufuji
  • Patent number: 9672841
    Abstract: The present document relates to a voice activity detection (VAD) method, methods used for voice activity detection, and an apparatus thereof. The VAD method includes: obtaining sub-band signals and spectrum amplitudes of a current frame; computing values of an energy feature and a spectral centroid feature of the current frame according to the sub-band signals; computing a signal-to-noise ratio parameter of the current frame according to a background noise energy estimated from a previous frame, an energy of SNR sub-bands, and an energy feature of the current frame; and computing a VAD decision result according to a tonality signal flag, the signal-to-noise ratio parameter, the spectral centroid feature, and the frame energy feature. The methods and apparatus of the present document can improve the accuracy of detecting non-stationary noise (such as office noise) and music.
    Type: Grant
    Filed: June 30, 2015
    Date of Patent: June 6, 2017
    Assignee: ZTE Corporation
    Inventors: Dongping Jiang, Hao Yuan, Changbao Zhu
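    Sketch: a toy per-frame decision combining the three features named in the abstract (frame energy, spectral centroid, signal-to-noise ratio); the thresholds, the simple noise-energy update, and the omission of the tonality flag are simplifications for illustration.
```python
# Toy voice-activity decision from frame energy, spectral centroid and SNR.
# Thresholds are illustrative placeholders; the patent's decision also uses a
# tonality signal flag and more elaborate sub-band processing.
import numpy as np

def vad_frame(frame, noise_energy, sr=16000,
              snr_thresh_db=6.0, centroid_thresh_hz=300.0, min_energy=1e-6):
    spectrum = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    energy = float(np.sum(np.asarray(frame, dtype=float) ** 2)) + min_energy
    centroid = float(np.sum(freqs * spectrum) / (np.sum(spectrum) + 1e-12))
    snr_db = 10.0 * np.log10(energy / (noise_energy + min_energy))
    is_speech = snr_db > snr_thresh_db and centroid > centroid_thresh_hz
    if not is_speech:
        # Update the background-noise estimate only on frames judged as noise.
        noise_energy = 0.95 * noise_energy + 0.05 * energy
    return is_speech, noise_energy
```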
  • Patent number: 9584562
    Abstract: An embodiment of a method and apparatus for provisioning of a communication device includes receiving a registration request from a first communication device. The registration request includes an address associated with the first communication device. The method further includes registering the first communication device in response to receiving the registration request, placing a call request to the first communication device, and establishing a call session with the first communication device. The method further includes prompting a user of the first communication device for a user identifier, and receiving a user identifier from the user of the first communication device. The method still further includes sending one or more configuration parameters associated with the user identifier to the first communication device. The one or more configuration parameters are operable to configure the first communication device.
    Type: Grant
    Filed: November 17, 2014
    Date of Patent: February 28, 2017
    Assignee: CenturyLink Intellectual Property LLC
    Inventors: Mike A. Roberts, Shekhar Gupta, Jim Kevin Edwards
  • Patent number: 9489950
    Abstract: Embodiments of systems and methods for speaker verification are provided. In various embodiments, a method includes receiving an utterance from a speaker and determining a text-independent speaker verification score and a text-dependent speaker verification score in response to the utterance. Various embodiments include a system for speaker verification, the system comprising an audio receiving device for receiving an utterance from a speaker and converting the utterance to an utterance signal, and a processor coupled to the audio receiving device for determining speaker verification in response to the utterance signal, wherein the processor determines speaker verification in response to a UBM-independent speaker-normalized score.
    Type: Grant
    Filed: May 23, 2013
    Date of Patent: November 8, 2016
    Assignee: Agency for Science, Technology and Research
    Inventors: Anthony Larcher, Kong Aik Lee, Bin Ma, Thai Ngoc Thuy Huong
  • Patent number: 9396730
    Abstract: Systems and methods for determining an identity of an individual are provided. Audio may be received that includes a key phrase spoken by the individual, and the key phrase may include an identifier spoken by the individual. A key phrase voice print and key phrase text corresponding to the audio may be obtained. The key phrase text may include text corresponding to the identifier spoken by the individual. Voice prints may be retrieved based on the text corresponding to the identifier, and the voice prints may be provided to a voice biometric engine for comparison to the key phrase voice print. The individual may be authenticated based on a comparison of the key phrase voice print to the voice prints. The identifier may include a first name and a last name of the individual.
    Type: Grant
    Filed: September 30, 2013
    Date of Patent: July 19, 2016
    Assignee: Bank of America Corporation
    Inventors: David Karpey, Mark Pender
  • Patent number: 9251808
    Abstract: According to one embodiment, a speaker clustering apparatus includes a clustering unit, an extraction unit, and an error detection unit. The clustering unit is configured to extract acoustic features for speakers from an acoustic signal, and to cluster utterances included in the acoustic signal into the speakers by using the acoustic features. The extraction unit is configured to acquire character strings representing contents of the utterances, and to extract linguistic features of the speakers by using the character strings. The error detection unit is configured to decide that, when one of the character strings does not fit the linguistic feature of the speaker into which the corresponding utterance is clustered, that utterance was erroneously clustered by the clustering unit.
    Type: Grant
    Filed: March 6, 2012
    Date of Patent: February 2, 2016
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Tomoo Ikeda, Manabu Nagao, Osamu Nishiyama, Hirokazu Suzuki, Koji Ueno, Nobuhiro Shimogori
  • Patent number: 9237232
    Abstract: Systems and methods for analyzing digital recordings of the human voice in order to find characteristics unique to an individual. A biometrics engine may use an analytics service in a contact center to supply audio streams based on configured rules and providers for biometric detection. The analytics service may provide call audio data and attributes to connected engines based on a provider-set of selection rules. The connected providers send call audio data and attributes through the analytics service. The engines are notified when a new call is available for processing and can then retrieve chunks of audio data and call attributes by polling an analytics service interface. A mathematical model of the human vocal tract in the call audio data is created and/or matched against existing models. The result is analogous to a fingerprint, i.e., a pattern unique to an individual to within some level of probability.
    Type: Grant
    Filed: March 14, 2014
    Date of Patent: January 12, 2016
    Assignee: VERINT AMERICAS INC.
    Inventors: Jamie Richard Williams, Robert John Barnes, Ian Linsdell, Scott M. Bluman
  • Patent number: 9123182
    Abstract: Computer program products, methods, systems, etc. for generating an animated preview of a number of images are disclosed. A selection of a group of images is received. A set of digital images from the group of images are identified as being representative of the group. At least some portion of the identified set of representative digital images from the group is then used to create an animated image. The animated image serves as a preview of the group of images, such that, when a user browses the images and sees the preview associated with a corresponding folder or directory, the user is able to quickly and easily associate the images in the group with a particular event and identify contents of the folder or directory.
    Type: Grant
    Filed: November 5, 2010
    Date of Patent: September 1, 2015
    Assignee: Adobe Systems Incorporated
    Inventor: Sanjeev Kumar Biswas
  • Patent number: 9064494
    Abstract: The present invention relates to a method of and a device (30) for automatic recognition of given keywords and/or terms within voice data (25) of a talk between at least two participants, said voice data (25) being continuously compared during said talk with said given keywords and/or terms (26) with regard to the occurrence of said given keywords and/or terms (26) within said voice data (25). In order to provide a solution which guarantees that topics which should be part of a talk are actually dealt with during said talk, the method is characterized in that a visualized representation (48) of the results (45) of said comparison is presented to at least one participant during the talk.
    Type: Grant
    Filed: September 13, 2010
    Date of Patent: June 23, 2015
    Assignee: VODAFONE GMBH
    Inventor: Stefan Holtel
  • Patent number: 9043210
    Abstract: A biometric voice command and control switching device has a microphone assembly for receiving a currently spoken challenge utterance and a reference utterance, and a voice processing circuit for creating electronic signals indicative thereof. The device further includes a memory for storing the electronic signals, and a processor for comparing the electronic signals to determine if there is a match. If there is a match, an interface circuit enables the operable control of the controlled device.
    Type: Grant
    Filed: October 2, 2012
    Date of Patent: May 26, 2015
    Assignee: Voice Security Systems, Inc.
    Inventors: Sherrie Adcock, Kent Robinson, Brad Clements, Mark Keith Brockelsby, William Keith Brockelsby
  • Patent number: 9037470
    Abstract: Apparatus and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice interaction. In one aspect of the invention, a communications system includes a user interface, a communications network, and a call center having an automatic speech recognition component. In other aspects of the invention, a script compliance method includes the steps of conducting a voice interaction between an agent and a client and evaluating the voice interaction with an automatic speech recognition component adapted to analyze the voice interaction and determine whether the agent has adequately followed the script. In yet still further aspects of the invention, the duration of a given interaction can be analyzed, either apart from or in combination with the script compliance analysis above, to seek to identify instances of agent non-compliance, of fraud, or of quality-analysis issues.
    Type: Grant
    Filed: June 25, 2014
    Date of Patent: May 19, 2015
    Assignee: West Business Solutions, LLC
    Inventors: Mark J. Pettay, Fonda J. Narke
  • Patent number: 9026447
    Abstract: A first communication path for receiving a communication is established. The communication includes speech, which is processed. A speech pattern is identified as including a voice-command. A portion of the speech pattern is determined as including the voice-command. That portion of the speech pattern is separated from the speech pattern and compared with a second speech pattern. If the two speech patterns match or resemble each other, the portion of the speech pattern is accepted as the voice-command. An operation corresponding to the voice-command is determined and performed. The operation may perform an operation on a remote device, forward the voice-command to a remote device, or notify a user. The operation may create a second communication path that may allow a headset to join in a communication between another headset and a communication device, several headsets to communicate with each other, or a headset to communicate with several communication devices.
    Type: Grant
    Filed: January 25, 2008
    Date of Patent: May 5, 2015
    Assignee: CenturyLink Intellectual Property LLC
    Inventors: Erik Geldbach, Kelsyn D. Rooks, Sr., Shane M. Smith, Mark Wilmoth
  • Patent number: 9015043
    Abstract: A computer-implemented method includes receiving an electronic representation of one or more human voices, recognizing words in a first portion of the electronic representation of the one or more human voices, and sending suggested search terms to a display device for display to a user in a text format. The suggested search terms are based on the recognized words in the first portion of the electronic representation of the one or more human voices. A search query is received from the user, which includes one or more of the suggested search terms that were displayed to the user.
    Type: Grant
    Filed: October 1, 2010
    Date of Patent: April 21, 2015
    Assignee: Google Inc.
    Inventor: Scott Jenson
  • Patent number: 9009025
    Abstract: In some implementations, a digital work provider may provide language model information related to a plurality of different contexts, such as a plurality of different digital works. For example, the language model information may include language model difference information identifying a plurality of sequences of one or more words in a digital work that have probabilities of occurrence that differ from probabilities of occurrence in a base language model by a threshold amount. The language model difference information corresponding to a particular context may be used in conjunction with the base language model to recognize an utterance made by a user of a user device. In some examples, the recognition is performed on the user device. In other examples, the utterance and associated context information are sent over a network to a recognition computing device that performs the recognition.
    Type: Grant
    Filed: December 27, 2011
    Date of Patent: April 14, 2015
    Assignee: Amazon Technologies, Inc.
    Inventor: Brandon W. Porter
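    Sketch: a minimal illustration of keeping only the n-grams whose probability in a context-specific (e.g. per-book) language model differs from the base model by a threshold, and falling back to the base model otherwise; the dictionary representation and the fallback log-probability are assumptions, not the patent's data structures.
```python
# Overlay of context-specific language-model difference information on a base
# model (n-gram -> log-probability dictionaries are an illustrative assumption).
import math

UNSEEN_LOGPROB = math.log(1e-7)   # illustrative fallback for unseen n-grams

def build_difference_info(base_lm, context_lm, threshold=1.0):
    """Keep only the n-grams whose context log-probability differs from the
    base model by at least `threshold`; everything else uses the base model."""
    return {ng: lp for ng, lp in context_lm.items()
            if abs(lp - base_lm.get(ng, UNSEEN_LOGPROB)) >= threshold}

def score(ngram, base_lm, diff_info):
    """Look up the compact difference information first, then fall back."""
    return diff_info.get(ngram, base_lm.get(ngram, UNSEEN_LOGPROB))
```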
  • Patent number: 9002710
    Abstract: The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.
    Type: Grant
    Filed: September 12, 2012
    Date of Patent: April 7, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Alwin B. Carus, Larissa Lapshina, Raghu Vemula
  • Patent number: 8996368
    Abstract: A feature transform for speech recognition is described. An input speech utterance is processed to produce a sequence of representative speech vectors. A time-synchronous speech recognition pass is performed using a decoding search to determine a recognition output corresponding to the speech input. The decoding search includes, for each speech vector after some first threshold number of speech vectors, estimating a feature transform based on the preceding speech vectors in the utterance and partial decoding results of the decoding search. The current speech vector is then adjusted based on the current feature transform, and the adjusted speech vector is used in a current frame of the decoding search.
    Type: Grant
    Filed: February 22, 2010
    Date of Patent: March 31, 2015
    Assignee: Nuance Communications, Inc.
    Inventor: Daniel Willett
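    Sketch: a greatly simplified stand-in for the per-utterance feature transform described above, here a running mean/variance normalization estimated from only the preceding speech vectors once a threshold number of frames has been seen; the patent's transform is additionally estimated from partial decoding results, which is omitted here.
```python
# Simplified incremental feature adjustment during decoding: each speech vector
# is normalized with statistics of only the frames that precede it. This is a
# stand-in, not the patent's transform (which also uses partial decoding results).
import numpy as np

def adapt_stream(speech_vectors, min_frames=20):
    """Yield adjusted speech vectors for use by the decoding search."""
    history = []
    for v in speech_vectors:
        if len(history) >= min_frames:
            mean = np.mean(history, axis=0)
            std = np.std(history, axis=0) + 1e-6
            yield (v - mean) / std        # adjusted vector for the current frame
        else:
            yield v                       # too little context yet: pass through
        history.append(np.asarray(v, dtype=float))
```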
  • Patent number: 8996387
    Abstract: To clear transaction data selected for processing, a transaction acoustic signal (003; 103; 203) is generated in a portable data carrier (1) (S007; S107; S207); when this signal is reproduced acoustically by an end device (10), at least the transaction data selected for processing are reproduced acoustically superimposed with a melody specific to a user of the data carrier (1) (S009; S109; S209). The generated transaction acoustic signal (003; 103; 203) is transferred electronically to an end device (10) (S108; S208), which processes the selected transaction data (S011; S121; S216) only when the user of the data carrier (1) confirms to the end device (10) an at least partial match both of the acoustically reproduced melody with the user-specific melody and of the acoustically reproduced transaction data with the selected transaction data (S010; S110, S116; S210).
    Type: Grant
    Filed: September 8, 2009
    Date of Patent: March 31, 2015
    Assignee: Giesecke & Devrient GmbH
    Inventors: Thomas Stocker, Michael Baldischweiler
  • Patent number: 8990071
    Abstract: A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The method also includes translating speech input to one or more DTMF tones and communicating the one or more DTMF tones to the communication partner, if the communication partner expects DTMF input.
    Type: Grant
    Filed: March 29, 2010
    Date of Patent: March 24, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yun-Cheng Ju, Stefanie Tomko, Frank Liu, Ivan Tashev
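    Sketch: once the system has decided that the communication partner expects DTMF input, recognized digit words can be mapped to standard DTMF tone pairs; the word list and the tone/gap durations below are illustrative, and the speech recognition step is assumed to have already produced the words.
```python
# Mapping recognized digit words to standard DTMF tone pairs.
import numpy as np

DTMF_FREQS = {            # standard DTMF low/high frequency pairs in Hz
    "1": (697, 1209), "2": (697, 1336), "3": (697, 1477),
    "4": (770, 1209), "5": (770, 1336), "6": (770, 1477),
    "7": (852, 1209), "8": (852, 1336), "9": (852, 1477),
    "*": (941, 1209), "0": (941, 1336), "#": (941, 1477),
}
WORD_TO_KEY = {"zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
               "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
               "star": "*", "pound": "#"}

def words_to_dtmf(words, sr=8000, tone_s=0.2, gap_s=0.05):
    """Turn recognized digit words into a DTMF audio signal (numpy samples)."""
    t = np.arange(int(sr * tone_s)) / sr
    gap = np.zeros(int(sr * gap_s))
    out = []
    for w in words:
        lo, hi = DTMF_FREQS[WORD_TO_KEY[w.lower()]]
        out.append(0.5 * np.sin(2 * np.pi * lo * t) + 0.5 * np.sin(2 * np.pi * hi * t))
        out.append(gap)
    return np.concatenate(out)

# e.g. tones = words_to_dtmf(["one", "two", "three", "pound"])
```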
  • Patent number: 8983837
    Abstract: A computerized alert mode management method of a communication device, the communication device includes a sound capture unit. Vocal sounds of the environment around the communication device are captured at regular intervals using the sound capture unit. Voice characteristic information of the captured vocal sounds is extracted using a speech recognition method and/or a voice recognition method. The communication device is controlled to work at one of a plurality of predetermined alert modes according to the extracted voice characteristic information.
    Type: Grant
    Filed: December 6, 2012
    Date of Patent: March 17, 2015
    Assignee: Hon Hai Precision Industry Co., Ltd.
    Inventor: Tsung-Jen Chuang
  • Patent number: 8977547
    Abstract: A voice recognition system includes: a voice input unit 11 for inputting a voice uttered a plurality of times; a registering voice data storage unit 12 for storing voice data uttered the plurality of times and input into the voice input unit 11; an utterance stability verification unit 13 for determining a similarity between the voice data uttered the plurality of times that are read from the registering voice data storage unit 12, and determining that registration of the voice data is acceptable when the similarity is greater than a threshold T1; and a standard pattern creation unit 14 for creating a standard pattern by using the voice data where the utterance stability verification unit 13 determines that registration is acceptable.
    Type: Grant
    Filed: October 8, 2009
    Date of Patent: March 10, 2015
    Assignee: Mitsubishi Electric Corporation
    Inventors: Michihiro Yamazaki, Jun Ishii, Hiroki Sakashita, Kazuyuki Nogi
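    Sketch: a minimal stability check before registering a voice pattern, assuming one fixed-length feature vector per repetition and cosine similarity; real systems typically compare frame sequences (e.g. with DTW), and the threshold below is only a placeholder for the patent's T1.
```python
# Utterance-stability check before registration (illustrative assumptions:
# fixed-length feature vectors, cosine similarity, placeholder threshold).
import numpy as np

def try_register(utterance_feats, threshold=0.85):
    """utterance_feats: one fixed-length feature vector per repetition."""
    if len(utterance_feats) < 2:
        return None                      # need repeated utterances to judge stability
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
    sims = [cos(utterance_feats[i], utterance_feats[j])
            for i in range(len(utterance_feats))
            for j in range(i + 1, len(utterance_feats))]
    if min(sims) <= threshold:
        return None                      # unstable: registration rejected, ask user to retry
    # Registration accepted: the standard pattern is the mean of the repetitions.
    return np.mean(np.stack(utterance_feats), axis=0)
```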
  • Patent number: 8977549
    Abstract: A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a speech recognition engine, an action classification engine, and a control module. The control module causes the system to execute an inventive method wherein the speech recognition and action classification models may be recursively optimized on an unisolated performance metric that is pertinent to the overall performance of the natural language business system, as opposed to the isolated model-specific criteria previously employed.
    Type: Grant
    Filed: September 26, 2013
    Date of Patent: March 10, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Sabine V. Deligne, Yuqing Gao, Vaibhava Goel, Hong-Kwang Kuo, Cheng Wu
  • Patent number: 8976943
    Abstract: Provided is a method and a telephone-based system with voice-verification capabilities that enable a user to safely and securely conduct transactions with his or her online financial transaction program account over the phone in a convenient and user-friendly fashion, without having to depend on an internet connection.
    Type: Grant
    Filed: September 25, 2012
    Date of Patent: March 10, 2015
    Assignee: Ebay Inc.
    Inventor: Will Tonini
  • Publication number: 20150046162
    Abstract: Device, system, and method of liveness detection using voice biometrics. For example, a method comprises: generating a first matching score based on a comparison between: (a) a voice-print from a first text-dependent audio sample received at an enrollment stage, and (b) a second text-dependent audio sample received at an authentication stage; generating a second matching score based on a text-independent audio sample; and generating a liveness score by taking into account at least the first matching score and the second matching score.
    Type: Application
    Filed: October 27, 2014
    Publication date: February 12, 2015
    Applicant: Nuance Communications, Inc.
    Inventors: ALMOG ALEY-RAZ, Nir Moshe Krause, Michael Itzhak Salmon, Ran Yehoshua Gazit
  • Patent number: 8948350
    Abstract: Disclosed is a secure telephone call management system for authenticating users of a telephone system in an institutional facility. Authentication of the users is accomplished by using a personal identification number, preferably in conjunction with speaker independent voice recognition and speaker dependent voice identification. When a user first enters the system, the user speaks his or her name which is used as a sample voice print. During each subsequent use of the system, the user is required to speak his or her name. Voice identification software is used to verify that the provided speech matches the sample voice print. The secure system includes accounting software to limit access based on funds in a user's account or other related limitations. Management software implements widespread or local changes to the system and can modify or set any number of user account parameters.
    Type: Grant
    Filed: July 11, 2008
    Date of Patent: February 3, 2015
    Assignee: Global Tel*Link Corporation
    Inventor: Stephen L. Hodge
  • Patent number: 8938382
    Abstract: An item of information (212) is transmitted to a distal computer (220), translated to a different sense modality and/or language (222) in substantially real time, and the translation (222) is transmitted back to the location (211) from which the item was sent. The device sending the item is preferably a wireless device, and more preferably a cellular or other telephone (210). The device receiving the translation is also preferably a wireless device, and more preferably a cellular or other telephone, and may advantageously be the same device as the sending device. The item of information (212) preferably comprises a sentence of human speech having at least ten words, and the translation is a written expression of the sentence. All of the steps of transmitting the item of information, executing the program code, and transmitting the translated information preferably occur in less than 60 seconds of elapsed time.
    Type: Grant
    Filed: March 21, 2012
    Date of Patent: January 20, 2015
    Assignee: Ulloa Research Limited Liability Company
    Inventor: Robert D. Fish
  • Patent number: 8938388
    Abstract: Maintaining and supplying a plurality of speech models is provided. A plurality of speech models and metadata for each speech model are stored. A query for a speech model is received from a source. The query includes one or more conditions. The speech model with metadata most closely matching the supplied one or more conditions is determined. The determined speech model is provided to the source. A refined speech model is received from the source, and the refined speech model is stored.
    Type: Grant
    Filed: July 9, 2012
    Date of Patent: January 20, 2015
    Assignee: International Business Machines Corporation
    Inventors: Bin Jia, Ying Liu, E. Feng Lu, Jia Wu, Zhen Zhang
  • Patent number: 8930191
    Abstract: Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A user request is received, the user request including at least a speech input received from a user. In response to the user request, (1) an echo of the speech input based on a textual interpretation of the speech input, and (2) a paraphrase of the user request based at least in part on a respective semantic interpretation of the speech input are presented to the user.
    Type: Grant
    Filed: March 4, 2013
    Date of Patent: January 6, 2015
    Assignee: Apple Inc.
    Inventors: Thomas Robert Gruber, Harry Joseph Saddler, Adam John Cheyer, Dag Kittlaus, Christopher Dean Brigham, Richard Donald Giuli, Didier Rene Guzzoni, Marcello Bastea-Forte
  • Patent number: 8930187
    Abstract: An apparatus for utilizing textual data and acoustic data corresponding to speech data to detect sentiment may include a processor and memory storing executable computer code causing the apparatus to at least perform operations including evaluating textual data and acoustic data corresponding to voice data associated with captured speech content. The computer program code may further cause the apparatus to analyze the textual data and the acoustic data to detect whether the textual data or the acoustic data includes one or more words indicating at least one sentiment of a user that spoke the speech content. The computer program code may further cause the apparatus to assign at least one predefined sentiment to at least one of the words in response to detecting that the word(s) indicates the sentiment of the user. Corresponding methods and computer program products are also provided.
    Type: Grant
    Filed: January 3, 2012
    Date of Patent: January 6, 2015
    Assignee: Nokia Corporation
    Inventors: Imre Attila Kiss, Joseph Polifroni, Francois Mairesse, Mark Adler
  • Patent number: 8924197
    Abstract: Disclosed are systems, methods, and computer readable media for converting a natural language query into a logical query. The method embodiment comprises receiving a natural language query and converting the natural language query using an extensible engine to generate a logical query, the extensible engine being linked to a toolkit and a knowledge base. In one embodiment, a natural language query can be processed in a domain-independent method to generate a logical query.
    Type: Grant
    Filed: October 30, 2007
    Date of Patent: December 30, 2014
    Assignee: Semantifi, Inc.
    Inventors: Sreenivasa Rao Pragada, Viswanath Dasari, Abhijit A Patil
  • Patent number: 8924214
    Abstract: A method for detecting and recognizing speech is provided that remotely detects body motions from a speaker during vocalization with one or more radar sensors. Specifically, the radar sensors include a transmit aperture that transmits one or more waveforms towards the speaker, and each of the waveforms has a distinct wavelength. A receiver aperture is configured to receive the scattered radio frequency energy from the speaker. Doppler signals correlated with the speaker vocalization are extracted with a receiver. Digital signal processors are configured to develop feature vectors utilizing the vocalization Doppler signals, and words associated with the feature vectors are recognized with a word classifier.
    Type: Grant
    Filed: June 7, 2011
    Date of Patent: December 30, 2014
    Assignee: The United States of America, as represented by the Secretary of the Navy
    Inventors: Jefferson M Willey, Todd Stephenson, Hugh Faust, James P. Hansen, George J Linde, Carol Chang, Justin Nevitt, James A Ballas, Thomas Herne Crystal, Vincent Michael Stanford, Jean W. De Graaf
  • Patent number: 8918406
    Abstract: A method of processing content files may include receiving a content file, employing processing circuitry to determine an identity score of a source of at least a portion of the content file, to determine a word score for the content file and to determine a metadata score for the content file, determining a composite priority score based on the identity score, the word score and the metadata score, and associating the composite priority score with the content file for electronic provision of the content file together with the composite priority score to a human analyst.
    Type: Grant
    Filed: December 14, 2012
    Date of Patent: December 23, 2014
    Assignee: Second Wind Consulting LLC
    Inventor: Donna Rober
  • Patent number: 8918316
    Abstract: The content of a media program is recognized by analyzing its audio content to extract therefrom prescribed features, which are compared to a database of features associated with identified content. The identity of the content within the database that has features that most closely match the features of the media program being played is supplied as the identity of the program being played. The features are extracted from a frequency domain version of the media program by a) filtering the coefficients to reduce their number, e.g., using triangular filters; b) grouping a number of consecutive outputs of triangular filters into segments; and c) selecting those segments that meet prescribed criteria, such as those segments that have the largest minimum segment energy with prescribed constraints that prevent the segments from being too close to each other. The triangular filters may be log-spaced and their output may be normalized.
    Type: Grant
    Filed: July 29, 2003
    Date of Patent: December 23, 2014
    Assignee: Alcatel Lucent
    Inventors: Jan I Ben, Christopher J Burges, Madjid Sam Mousavi, Craig R. Nohl
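    Sketch: the two steps of the fingerprinting front end that are easiest to illustrate, a log-spaced triangular filterbank that reduces the spectrum to a few coefficients, and greedy selection of time segments with the largest minimum energy subject to a minimum spacing; the filter count, segment length, and spacing below are placeholders, not the patent's parameters.
```python
# Illustrative pieces of the audio-fingerprinting front end described above.
import numpy as np

def triangular_filterbank(n_fft_bins, n_filters=24):
    """Log-spaced triangular filters that reduce a spectrum to a few coefficients."""
    edges = np.unique(np.round(
        np.logspace(0, np.log10(n_fft_bins - 1), n_filters + 2)).astype(int))
    fb = np.zeros((len(edges) - 2, n_fft_bins))
    for i in range(len(edges) - 2):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        fb[i, lo:mid + 1] = np.linspace(0.0, 1.0, mid - lo + 1)   # rising edge
        fb[i, mid:hi + 1] = np.linspace(1.0, 0.0, hi - mid + 1)   # falling edge
    return fb

def select_segments(filter_outputs, seg_len=10, n_select=5, min_gap=20):
    """Group consecutive frames of filterbank output into segments and pick those
    with the largest minimum per-frame energy, keeping them apart in time."""
    frame_energy = filter_outputs.sum(axis=1)
    starts = range(len(frame_energy) - seg_len + 1)
    ranked = sorted(starts, key=lambda s: frame_energy[s:s + seg_len].min(),
                    reverse=True)
    chosen = []
    for s in ranked:
        if all(abs(s - c) >= min_gap for c in chosen):
            chosen.append(s)
        if len(chosen) == n_select:
            break
    return sorted(chosen)
```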
  • Patent number: 8918320
    Abstract: An apparatus for generating a review based in part on detected sentiment may include a processor and memory storing executable computer code causing the apparatus to at least perform operations including determining a location(s) of the apparatus and a time(s) that the location(s) was determined responsive to capturing voice data of speech content associated with spoken reviews of entities. The computer program code may further cause the apparatus to analyze textual and acoustic data corresponding to the voice data to detect whether the textual or acoustic data includes words indicating a sentiment(s) of a user speaking the speech content. The computer program code may further cause the apparatus to generate a review of an entity corresponding to a spoken review(s) based on assigning a predefined sentiment to a word(s) responsive to detecting that the word indicates the sentiment of the user. Corresponding methods and computer program products are also provided.
    Type: Grant
    Filed: January 3, 2012
    Date of Patent: December 23, 2014
    Assignee: Nokia Corporation
    Inventors: Mark Adler, Imre Attila Kiss, Francois Mairesse, Joseph Polifroni
  • Patent number: 8914290
    Abstract: Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
    Type: Grant
    Filed: May 18, 2012
    Date of Patent: December 16, 2014
    Assignee: Vocollect, Inc.
    Inventors: James Hendrickson, Debra Drylie Scott, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
  • Patent number: 8903727
    Abstract: A machine, system and method for user-guided teaching and modifications of voice commands and actions to be executed by a conversational learning system. The machine includes a system bus for communicating data and control signals received from the conversational learning system to a computer system, a vehicle data and control bus for connecting devices and sensors in the machine, a bridge module for connecting the vehicle data and control bus to the system bus, machine subsystems coupled to the vehicle data and control bus having a respective user interface for receiving a voice command or input signal from a user, a memory coupled to the system bus for storing action command sequences learned for a new voice command and a processing unit coupled to the system bus for automatically executing the action command sequences learned when the new voice command is spoken.
    Type: Grant
    Filed: March 6, 2013
    Date of Patent: December 2, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Liam David Comerford, Mahesh Viswanathan
  • Patent number: 8892446
    Abstract: Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A user request is received, the user request including at least a speech input received from the user. The user request is processed to obtain a representation of user intent, where the representation of user intent associates the user request with a task flow operationalizing a requested task, and the task flow is operable to invoke a plurality of services each supporting functions according to a respective plurality of service parameters. Based on the representation of user intent, one or more relevant task parameters are identified from a plurality of task parameters of the task flow. A subset of the plurality of services are selectively invoked during execution of the task flow, where the selectively invoked subset of the plurality of services support functions according to the identified one or more relevant task parameters.
    Type: Grant
    Filed: December 21, 2012
    Date of Patent: November 18, 2014
    Assignee: Apple Inc.
    Inventors: Adam John Cheyer, Didier Rene Guzzoni, Thomas Robert Gruber, Christopher Dean Brigham
  • Patent number: 8874442
    Abstract: Device, system, and method of liveness detection using voice biometrics. For example, a method comprises: generating a first matching score based on a comparison between: (a) a voice-print from a first text-dependent audio sample received at an enrollment stage, and (b) a second text-dependent audio sample received at an authentication stage; generating a second matching score based on a text-independent audio sample; and generating a liveness score by taking into account at least the first matching score and the second matching score.
    Type: Grant
    Filed: April 17, 2013
    Date of Patent: October 28, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Almog Aley-Raz, Nir Moshe Krause, Michael Itzhak Salmon, Ran Yehoshua Gazit
  • Patent number: 8874440
    Abstract: A speech detection apparatus and method are provided. The speech detection apparatus and method determine whether a frame is speech or not using feature information extracted from an input signal. The speech detection apparatus may estimate a situation related to an input frame and determine which feature information is required for speech detection for the input frame in the estimated situation. The speech detection apparatus may detect a speech signal using dynamic feature information that may be more suitable to the situation of a particular frame, instead of using the same feature information for each and every frame.
    Type: Grant
    Filed: April 16, 2010
    Date of Patent: October 28, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Chi-youn Park, Nam-hoon Kim, Jeong-mi Cho
  • Patent number: 8831942
    Abstract: A method is provided for identifying a gender of a speaker. The method steps include obtaining speech data of the speaker, extracting vowel-like speech frames from the speech data, analyzing the vowel-like speech frames to generate a feature vector having pitch values corresponding to the vowel-like frames, analyzing the pitch values to generate a most frequent pitch value, determining, in response to the most frequent pitch value being between a first pre-determined threshold and a second pre-determined threshold, an output of a male Gaussian Mixture Model (GMM) and an output of a female GMM using the pitch values as inputs to the male GMM and the female GMM, and identifying the gender of the speaker by comparing the output of the male GMM and the output of the female GMM based on a pre-determined criterion.
    Type: Grant
    Filed: March 19, 2010
    Date of Patent: September 9, 2014
    Assignee: Narus, Inc.
    Inventor: Antonio Nucci
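    Sketch: the most-frequent pitch value gates whether the male/female GMM comparison is needed; the histogram binning, the 145–175 Hz ambiguity band, and the decisions outside that band are illustrative assumptions rather than the patent's thresholds, and pitch extraction from vowel-like frames is assumed to have already happened.
```python
# Gender identification from per-frame pitch values (illustrative thresholds).
import numpy as np
from sklearn.mixture import GaussianMixture

def identify_gender(pitch_values, male_gmm, female_gmm, low=145.0, high=175.0):
    """pitch_values: per-frame pitch estimates (Hz) from vowel-like frames."""
    hist, edges = np.histogram(pitch_values, bins=50)
    peak = int(np.argmax(hist))
    most_frequent = 0.5 * (edges[peak] + edges[peak + 1])
    if most_frequent < low:
        return "male"
    if most_frequent > high:
        return "female"
    # Ambiguous range: compare average log-likelihoods under the two GMMs.
    X = np.asarray(pitch_values, dtype=float).reshape(-1, 1)
    return "male" if male_gmm.score(X) > female_gmm.score(X) else "female"

# The GMMs would be trained offline on pitch values from labeled speakers, e.g.
# male_gmm = GaussianMixture(n_components=4).fit(np.asarray(male_pitches).reshape(-1, 1))
```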
  • Patent number: 8825482
    Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video cameras, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creates user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.
    Type: Grant
    Filed: September 15, 2006
    Date of Patent: September 2, 2014
    Assignee: Sony Computer Entertainment Inc.
    Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric Larsen, Xiaodong Mao
  • Patent number: 8818816
    Abstract: A voice recognition device includes a voice input unit 11 for inputting a voice of an uttered button name to convert the voice into an electric signal, a voice recognition processing unit 12 for performing a voice recognition process according to a sound signal sent thereto, as the electric signal, from the voice input unit, a button candidate detecting unit 13 for detecting, as a button candidate, a button having a button name which partially matches a voice recognition result acquired by the voice recognition processing unit, a display control unit 15 for, when a plurality of candidate buttons are detected by the button candidate detecting unit, producing a screen showing a state in which at least one of the plurality of button candidates is selected, and a display unit 16 for displaying the screen produced by the display control unit.
    Type: Grant
    Filed: April 23, 2009
    Date of Patent: August 26, 2014
    Assignee: Mitsubishi Electric Corporation
    Inventors: Yuzuru Inoue, Takayoshi Chikuri, Yuki Furumoto
  • Patent number: 8818807
    Abstract: This invention describes methods for implementing human speech recognition. The methods described here use sub-events, which are sounds between spaces (typically a fully spoken word), that are then compared with a library of sub-events. Each sub-event is packaged with its own speech recognition function as an individual unit. This invention illustrates how this model can be used as a Large Vocabulary Speech Recognition System.
    Type: Grant
    Filed: May 24, 2010
    Date of Patent: August 26, 2014
    Inventor: Darrell Poirier
  • Patent number: 8812326
    Abstract: A computer-driven device assists a user in self-regulating speech control of the device. The device processes an input signal representing human speech to compute acoustic signal quality indicators indicating conditions likely to be problematic to speech recognition, and advises the user of those conditions.
    Type: Grant
    Filed: August 6, 2013
    Date of Patent: August 19, 2014
    Assignee: Promptu Systems Corporation
    Inventors: Naren Chittar, Vikas Gulati, Matthew Pratt, Harry Printz
  • Patent number: 8812318
    Abstract: One-to-many comparisons of callers' voice prints with known voice prints to identify any matches between them. When a customer communicates with a particular entity, such as a customer service center, the system makes a recording of the real-time call including both the customer's and agent's voices. The system segments the recording to extract at least a portion of the customer's voice to create a customer voice print, and it formats the segmented voice print for network transmission to a server. The server compares the customer's voice print with multiple known voice prints to determine any matches, meaning that the customer's voice print and one of the known voice prints are likely from the same person. The identification of any matches can be used for a variety of purposes, such as determining whether to authorize a transaction requested by the customer.
    Type: Grant
    Filed: February 6, 2012
    Date of Patent: August 19, 2014
    Assignee: III Holdings 1, LLC
    Inventors: Vicki Broman, Vernon Marshall, Seshasayee Bellamkonda, Marcel Leyva, Cynthia Hanson
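    Sketch: a minimal one-to-many comparison of a caller's voice print against a database of known prints, assuming fixed-length voice-print vectors and cosine similarity; the match threshold is a placeholder, and the recording/segmentation steps described in the abstract are assumed to have already produced the caller's print.
```python
# One-to-many voice-print comparison against a database of known prints.
import numpy as np

def find_matches(customer_print, known_prints, threshold=0.8):
    """known_prints: dict person_id -> voice-print vector.
    Returns (score, person_id) pairs above the threshold, best match first."""
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
    scores = [(cos(customer_print, vp), pid) for pid, vp in known_prints.items()]
    return sorted((s for s in scores if s[0] >= threshold), reverse=True)
```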
  • Patent number: 8804973
    Abstract: In an example signal clustering apparatus, a feature of a signal is divided into segments. A first feature vector of each segment is calculated, the first feature vector having a plurality of elements corresponding to each reference model. A value of an element attenuates when a feature of the segment shifts from a center of a distribution of the reference model corresponding to the element. A similarity between two reference models is calculated. A second feature vector of each segment is calculated, the second feature vector having a plurality of elements corresponding to each reference model. A value of an element is a weighted sum, and segments whose second feature vectors have similar element values are clustered into one class.
    Type: Grant
    Filed: March 19, 2012
    Date of Patent: August 12, 2014
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Makoto Hirohata, Kazunori Imoto, Hisashi Aoki
  • Patent number: 8805685
    Abstract: Disclosed herein are systems, methods, and tangible computer readable-media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrate little variance over time or are the same, and verifying the plurality of speech samples if the plurality of speech samples demonstrates sufficient variance over time. One embodiment further adds that each of the plurality of speech samples is collected at different times or in different contexts. In other embodiments, variance is based on a pre-determined threshold or the threshold for variance is adjusted based on a need for authentication certainty. In another embodiment, if the initial comparison is inconclusive, additional speech samples are received.
    Type: Grant
    Filed: August 5, 2013
    Date of Patent: August 12, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventor: Horst J. Schroeter
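    Sketch: repeated samples of the same phrase are compared pairwise and verification is denied when they show almost no variance, which suggests replayed or synthetic audio; fixed-length feature vectors, the normalized-distance measure, and both thresholds are illustrative assumptions rather than the patent's criteria.
```python
# Liveness check from repeated utterances of the same phrase.
import numpy as np

def verify_liveness(sample_feats, min_var=0.02, max_var=0.5):
    """sample_feats: one fixed-length feature vector per repeated utterance."""
    if len(sample_feats) < 2:
        return False                      # need several samples to judge variance
    diffs = []
    for i in range(len(sample_feats)):
        for j in range(i + 1, len(sample_feats)):
            a, b = sample_feats[i], sample_feats[j]
            diffs.append(float(np.linalg.norm(a - b) / (np.linalg.norm(a) + 1e-9)))
    mean_diff = float(np.mean(diffs))
    if mean_diff < min_var:
        return False                      # too little variance: likely replay or synthesis
    return mean_diff < max_var            # natural variance, but still the same phrase
```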