Voice Control Of Transmission Direction Patents (Class 379/388.04)

Voice switching by attenuation/amplification (Class 379/388.05)

Comparing signal level of receiving and transmitting circuits (Class 379/388.06)

Controlling acoustic feedback (Class 379/388.07)

Unvoiced voiced decision for speech processing cross reference to related applications

Patent number: 11328739

Abstract: Method and apparatus for speech processing are disclosed. A first unvoicing parameter for a first frame of a speech signal is determined, and further smoothed based on a second unvoicing parameter for a second frame prior to the first frame. A difference between the first unvoicing parameter and the smoothed unvoicing parameter for the first subframe is computed and an unvoiced/voiced classification of the first frame is determined using the computed difference as a decision parameter. Further processing, such as bandwidth extension (BWE), is performed based on the classification of the first frame.

Type: Grant

Filed: July 9, 2019

Date of Patent: May 10, 2022

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventor: Yang Gao
Smart network device and method thereof

Patent number: 10636416

Abstract: A network device is connected to a user device and includes a processor and a memory storing executable code executed by the processor. The network device is configured to receive first keyword data and speech data followed by the first keyword data; determine whether the first keyword data corresponds to a first keyword; in response to determining that the first keyword data corresponds to the first keyword, recognize word information from the speech data to generate at least one word recognition result; send the at least one word recognition result through a first communication path to a first network; and in response to determining that the first keyword data corresponds to a second keyword, stop recognizing the word information from the speech data followed by the first keyword data, and send the speech data through a second communication path to the first user device.

Type: Grant

Filed: November 15, 2018

Date of Patent: April 28, 2020

Assignee: WISTRON NEWEB CORPORATION

Inventors: Yee-Lee Shyong, Chen-Chao Chang, Ying-Hui Liang
Electronic device and method for recognizing speech

Patent number: 10540995

Abstract: An electronic device and a method for recognizing a speech are provided. The method for recognizing a speech by an electronic device includes: receiving sounds generated from a sound source through a plurality of microphones; calculating power values from a plurality of audio signals generated by performing signal processing on each sound input through the plurality of microphones and calculating direction information on the sound source based on the calculated power values and storing the calculated direction information; and performing the speech recognition on a speech section included in the audio signal based on the direction information on the sound source. As a result, the electronic device may correctly detect only a speech section from an audio signal while improving a speech section detection related processing speed.

Type: Grant

Filed: November 1, 2016

Date of Patent: January 21, 2020

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Ki-hoon Shin
Telephone user interface providing enhanced call blocking

Patent number: 10320977

Abstract: Aspects of the subject disclosure may include, for example, a method in which a device comprising a processor detects a communication session between a calling device and a called device, and receives a motion signal from the called device; the motion signal is generated at a motion sensor of the called device during or after the communication session. The device analyzes the motion signal to determine whether a portion of the motion signal corresponds to a preselected motion of the called device and whether a subsequent call from the calling device accordingly is to be blocked. Responsive to a determination that the subsequent call is to be blocked, the device also updates a list of blocked caller identifiers associated with the called device to add an identifier associated with the calling device. Other embodiments are disclosed.

Type: Grant

Filed: April 13, 2017

Date of Patent: June 11, 2019

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Kim Brackett, Liaqat Ali, William Morris, IV
Unvoiced/voiced decision for speech processing

Patent number: 10043539

Abstract: A method for speech processing includes determining an unvoicing parameter for a first frame of a speech signal and determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame. The unvoicing parameter reflects a speech characteristic of the first frame. The smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame. The method further includes computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame, and determining a classification of the first frame according to the computed difference. The classification includes unvoiced speech or voiced speech. The first frame is processed in accordance with the classification of the first frame.
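
The smoothing and decision steps in this abstract can be illustrated with a minimal sketch. The weight values, the threshold, the sign convention of the decision, and the function names `smooth_unvoicing` and `classify_frames` are illustrative assumptions; the abstract states only that the previous smoothed value is weighted less heavily when it exceeds the current parameter and that the difference drives the classification.

```python
def smooth_unvoicing(p_current, p_smoothed_prev, slow_weight=0.9, fast_weight=0.5):
    """Blend the current unvoicing parameter with the previous smoothed value.

    The previous smoothed value gets the smaller weight (fast_weight) when it
    is greater than the current parameter. Both weights are assumed values.
    """
    w = fast_weight if p_smoothed_prev > p_current else slow_weight
    return w * p_smoothed_prev + (1.0 - w) * p_current


def classify_frames(params, threshold=0.1):
    """Label each frame from the difference between raw and smoothed parameter.

    Assumption: a difference above `threshold` means unvoiced speech.
    """
    smoothed = params[0]  # seed the smoother with the first frame (assumption)
    labels = []
    for p in params:
        smoothed = smooth_unvoicing(p, smoothed)
        diff = p - smoothed
        labels.append("unvoiced" if diff > threshold else "voiced")
    return labels


if __name__ == "__main__":
    print(classify_frames([0.2, 0.2, 0.9, 0.9, 0.1]))
```

The asymmetric weights let the smoothed value fall quickly when the raw parameter drops, while rising slowly when it jumps, so a sudden increase stands out in the difference.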

Type: Grant

Filed: December 27, 2016

Date of Patent: August 7, 2018

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Unvoiced/voiced decision for speech processing

Patent number: 9570093

Abstract: In accordance with an embodiment of the present invention, a method for speech processing includes determining an unvoicing/voicing parameter reflecting a characteristic of unvoiced/voicing speech in a current frame of a speech signal comprising a plurality of frames. A smoothed unvoicing/voicing parameter is determined to include information of the unvoicing/voicing parameter in a frame prior to the current frame of the speech signal. A difference between the unvoicing/voicing parameter and the smoothed unvoicing/voicing parameter is computed. The method further includes generating an unvoiced/voiced decision point for determining whether the current frame comprises unvoiced speech or voiced speech using the computed difference as a decision parameter.

Type: Grant

Filed: September 3, 2014

Date of Patent: February 14, 2017

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Conference mixing using turbo-VAD

Patent number: 9246962

Abstract: A conference mixer includes a unit configured to receive a plurality of input streams, a spectral voice activity detection (VAD) unit configured to, for each of the input streams, generate and output a spectral VAD decision indicating whether a frame including data packets is voice, a turbo VAD unit configured to generate and output a turbo VAD decision that indicates for a frame including data packets which input stream is active, the turbo VAD decision being based on the spectral VAD decisions and a power-based decision indicating whether an estimated instantaneous power level of a frame including data packets is greater than a power threshold, and a finite state machine (FSM) unit configured to select which of the input streams to output as an active stream based on a plurality of the turbo VAD decisions, the turbo VAD decision being based in part on feedback provided by the FSM.

Type: Grant

Filed: March 19, 2015

Date of Patent: January 26, 2016

Assignee: Marvell World Trade Ltd.

Inventors: Anatoli Plotnikov, Timor Kardashov, Maxim Kovalenko
Non-linear post-processing control in stereo acoustic echo cancellation

Patent number: 9123324

Abstract: Methods, systems, and apparatus are provided for multiple-input multiple-output acoustic echo cancellation. A multiple-input multiple-output acoustic echo canceller (MIMO AEC) is provided as a high quality echo canceller for voice and/or audio communication over a network (e.g., packet switched network). The MIMO AEC is an extension of, as well as an application/usage of a single-input single-output acoustic echo canceller (“mono AEC”). The MIMO AEC is an extension of the mono AEC in that the code/theory underlying the mono AEC is adjusted for use with multiple channels. The manner in which AEC is applied (e.g., on each microphone signal using separate mono-AECs) is an application of mono-AECs.

Type: Grant

Filed: February 28, 2013

Date of Patent: September 1, 2015

Assignee: GOOGLE INC.

Inventor: Bjorn Volcker
Identifying caller preferences based on voice print analysis

Patent number: 8532268

Abstract: A call directing system receives an incoming call from a caller. The caller is prompted to speak, thus enabling a prosody analyzer to generate an analysis of a prosody of the caller's voice. This analysis provides a basis for generating a caller profile that describes caller preferences of the caller. Based on the analysis of the prosody of the caller's voice and the generated caller profile, the call is directed to a particular call recipient.

Type: Grant

Filed: July 18, 2012

Date of Patent: September 10, 2013

Assignee: International Business Machines Corporation

Inventors: Peeyush Jaiswal, Naveen Narayan
Volume adjustment for multiple voice over internet protocol streams

Patent number: 8483409

Abstract: Systems and methods for managing the volume of multiple VoIP streams are disclosed. The system includes a VoIP server configured to receive an input audio stream from a first VoIP handset, create separate output audio streams from the input audio stream for transmission to second and third VoIP handsets, and to connect to a communications network. The system also includes a volume control table coupled to the VoIP server, the volume control table including records of volume adjustments made during prior conversations between the two or more VoIP handsets. The VoIP server is further configured to create the separate output audio streams such that one or more of the output streams has a volume that is different than the input audio stream based on the records.
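
As a rough illustration of the volume control table, the sketch below keys stored adjustments by (speaker, listener) pair and applies them when one input stream is fanned out to several listeners. The class `VolumeControlTable`, the function `fan_out`, the linear gain representation, and the default gain of 1.0 are all assumptions made for the example, not details from the patent.

```python
class VolumeControlTable:
    """Hypothetical table of per-pair gains remembered from prior conversations."""

    def __init__(self):
        self._gains = {}

    def record_adjustment(self, speaker, listener, gain):
        # Store the most recent adjustment a listener made for a given speaker.
        self._gains[(speaker, listener)] = gain

    def gain_for(self, speaker, listener):
        # Pairs with no recorded adjustment pass audio through unchanged.
        return self._gains.get((speaker, listener), 1.0)


def fan_out(samples, speaker, listeners, table):
    """Create one output stream per listener, scaled by that pair's stored gain."""
    return {
        listener: [s * table.gain_for(speaker, listener) for s in samples]
        for listener in listeners
    }


if __name__ == "__main__":
    table = VolumeControlTable()
    table.record_adjustment("handset_a", "handset_b", 0.5)
    print(fan_out([1.0, -2.0], "handset_a", ["handset_b", "handset_c"], table))
```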

Type: Grant

Filed: June 23, 2008

Date of Patent: July 9, 2013

Assignee: International Business Machines Corporation

Inventor: Nicholas F. Campion
Voice recognition system, method, and program

Patent number: 8417518

Abstract: A voice recognition system comprises: a voice input unit that receives an input signal from a voice input element and outputs it; a voice detection unit that detects an utterance segment in the input signal; a voice recognition unit that performs voice recognition for the utterance segment; and a control unit that outputs a control signal to at least one of the voice input unit and the voice detection unit and suppresses a detection frequency if the detection frequency satisfies a predetermined condition.

Type: Grant

Filed: February 27, 2008

Date of Patent: April 9, 2013

Assignee: NEC Corporation

Inventor: Toru Iwasawa
Identifying caller preferences based on voice print analysis

Patent number: 8249225

Abstract: A call directing system receives an incoming call from a caller. The caller is prompted to speak, thus enabling a prosody analyzer to generate an analysis of a prosody of the caller's voice. This analysis provides a basis for generating a caller profile that describes caller preferences of the caller. Based on the analysis of the prosody of the caller's voice and the generated caller profile, the call is directed to a particular call recipient.

Type: Grant

Filed: March 14, 2008

Date of Patent: August 21, 2012

Assignee: International Business Machines Corporation

Inventors: Peeyush Jaiswal, Naveen Narayan
Communication apparatus capable of adjusting volume of voice to be reproduced

Patent number: 8023640

Abstract: A communication apparatus through which voice communication can be performed, including: a line interface portion which receives and transmits data from and to a line and which includes a data access arrangement device in which is incorporated a detection circuit that detects at least one of a line voltage and a line current; a voice reproducing device which reproduces a voice that is based on voice communication data transmitted from the line to the line interface portion; and a volume adjuster which adjusts reproduction volume with which the voice is to be reproduced by the voice reproducing device, on the basis of a detected value of the detection circuit.

Type: Grant

Filed: March 29, 2005

Date of Patent: September 20, 2011

Assignee: Brother Kogyo Kabushiki Kaisha

Inventor: Tomohiro Ito
Information processing system

Patent number: 7760868

Abstract: In an information processing system provided with a camera and a microphone, for transmitting and receiving information of a user to and from another information processing system through a transmission line, image data of the user obtained by a camera is stored in a memory in advance. When one user communicates with another user, image data of the one user is obtained by the camera and is synthesized with the image data stored in the memory in advance by image processing. The clothes, hair, background, make-up, etc. of the one user are made to differ from reality, and the image data are transmitted to the other user in the communication.

Type: Grant

Filed: March 15, 2004

Date of Patent: July 20, 2010

Assignee: Semiconductor Energy Laboratory Co., Ltd

Inventors: Yuji Kawasaki, Jun Koyama, Futoshi Ishi, Shunpei Yamazaki
SYSTEMS AND METHODS FOR HALF-DUPLEX SPEAKERPHONES AND OTHER TWO-WAY COMMUNICATION DEVICES

Publication number: 20100119055

Abstract: An apparatus to enable half-duplexing capabilities in a two-way communication device is disclosed. The apparatus estimates the signal power and background noise of a first input signal and a second input signal during approximately the same period. The apparatus further provides at least one control signal based on the result of one or more determinations. These determinations may include whether the estimated signal power of at least one of the first and second input signals exceeds a threshold value; whether the estimated signal power of the first input signal exceeds the sum of a first threshold value and the estimated background noise of the first input signal; and whether the estimated signal power of the second input signal exceeds the sum of a second threshold value and the estimated background noise of the second input signal. Other embodiments for use with two-way communication devices and related methods are also disclosed.
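
The determinations listed in this abstract can be sketched as a small decision function. Everything beyond the comparison "signal power exceeds threshold plus background noise" is an assumption for illustration: the names `ChannelEstimate` and `half_duplex_control`, the dB-style units, the threshold values, and the rule used when both directions are active.

```python
from dataclasses import dataclass


@dataclass
class ChannelEstimate:
    signal_power: float  # estimated signal power (assumed to be in dB)
    noise_power: float   # estimated background noise (same units)


def is_active(est, threshold):
    """True when signal power exceeds the threshold plus the noise estimate."""
    return est.signal_power > threshold + est.noise_power


def half_duplex_control(near, far, near_threshold=6.0, far_threshold=6.0):
    """Return a control signal: 'transmit', 'receive', or 'idle'."""
    near_active = is_active(near, near_threshold)
    far_active = is_active(far, far_threshold)
    if near_active and not far_active:
        return "transmit"
    if far_active and not near_active:
        return "receive"
    if near_active and far_active:
        # Assumed tie-break: favor the side with the larger margin over noise.
        near_margin = near.signal_power - near.noise_power
        far_margin = far.signal_power - far.noise_power
        return "transmit" if near_margin >= far_margin else "receive"
    return "idle"


if __name__ == "__main__":
    print(half_duplex_control(ChannelEstimate(30.0, 10.0), ChannelEstimate(12.0, 10.0)))
```

Adding the noise estimate to the threshold keeps a noisy room from being mistaken for speech, which is the reason both quantities are estimated over approximately the same period.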

Type: Application

Filed: November 17, 2008

Publication date: May 13, 2010

Inventors: Pengfei Zhang, Caogang Yu
Voice transmission device and voice transmission system

Patent number: 7181000

Abstract: A voice transmission device has a tandem pass through function in an STM, ATM, IP network, and a noise canceller (21) is provided at a latter stage of an echo canceller (12) in a coding part, and in a case where a multistage connection state does not occur, noise removal is performed to carry out efficient coding, and in a case where a relay is performed in the multistage connection state, switching is performed to stop the operation of the noise canceller (21), and voice deterioration by redundant decoding and coding is prevented, and therefore, even in a cellular phone having no noise cancel function, or the like, and at both a normal time and a time of a tandem pass through state, high quality voice transmission can be performed, and further, since an unvoiced portion in voice data is increased by removing a noise component, a portion subjected to coding becomes small, and transmission amount of the line is reduced.

Type: Grant

Filed: April 4, 2003

Date of Patent: February 20, 2007

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Nobuyoshi Horie
Speakerphone having improved outbound audio quality

Patent number: 7123714

Abstract: A telephone (310) and a method for providing outbound audio when the telephone is operating in a speakerphone mode. A first data unit (350) including a first unit type identifier (360) can be received by the telephone. The first unit type identifier can be an indicator of a type of audio data contained in the first data unit. For instance, the first unit type identifier can indicate whether the audio data is music or non-music audio data. If the first unit type identifier has a first value, for example a value indicating that the audio data is music data, unmuted outbound audio reproduced from the first data unit can be provided and voice activity detection can be disabled. Additionally, inbound audio can be muted.

Type: Grant

Filed: August 25, 2004

Date of Patent: October 17, 2006

Assignee: Motorola, Inc.

Inventors: Marc A. Boillot, Ali Behboodian, Pratik V. Desai
Communication terminals with a dual use speaker for sensing background noise and generating sound, and related methods and computer program products

Patent number: 7054436

Abstract: Communication terminals, methods, and computer program products are provided that sense background noise via a speaker that is also used to generate sounds. In some methods of operating a communication terminal, a speaker signal is supplied to a first speaker to generate sound therefrom, and a noise sensing signal is received from the first speaker. The noise sensing signal includes a contribution associated with background noise that is incident to the first speaker. Presence of the background noise in the noise sensing signal is determined.

Type: Grant

Filed: August 2, 2004

Date of Patent: May 30, 2006

Assignee: Sony Ericsson Mobile Communication, AB

Inventor: Fredrik Stenmark
Automatic volume adjustment of voice transmitted over a communication device

Patent number: 7023984

Abstract: A system and method for adjusting the volume level of a communications device in response to a voice volume/ambient noise relationship. Ambient noise and voice volume are sampled and compared to a predetermined ambient noise to voice volume relationship. Voice volume is adjusted up or down in response to a control signal that is generated in view of the comparison between the sampled voice volume to ambient noise relationship and the predetermined ambient noise to voice volume relationship.
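
One way to picture the comparison in this abstract is as a margin check between sampled voice level and sampled ambient noise. In the sketch below, the predetermined relationship is modeled as a target voice-over-noise margin in dB with a small deadband; that model, the numeric defaults, and the function name `volume_control_signal` are assumptions rather than details from the patent.

```python
def volume_control_signal(voice_level_db, noise_level_db,
                          target_margin_db=15.0, deadband_db=2.0):
    """Return 'up', 'down', or 'hold' for the voice volume.

    The sampled voice-to-noise margin is compared against a predetermined
    target margin; the deadband (an assumption) avoids constant small changes.
    """
    margin = voice_level_db - noise_level_db
    error = target_margin_db - margin
    if error > deadband_db:
        return "up"
    if error < -deadband_db:
        return "down"
    return "hold"


if __name__ == "__main__":
    print(volume_control_signal(60.0, 55.0))  # voice barely above noise
```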

Type: Grant

Filed: March 21, 2002

Date of Patent: April 4, 2006

Assignee: BellSouth Intellectual Property Corp.

Inventors: Shannon M. Short, William A. Hartselle, Vernon Meadows
Method of acoustic echo cancellation in full-duplex hands free audio conferencing with spatial directivity

Patent number: 6990193

Abstract: A method is set forth of controlling an acoustic echo canceller at the output of a beamformer in an audio conferencing device. Information is saved to, and retrieved from, memory that characterizes each of a finite number of look directions, or regions of focus, covering the entire spatial span of the conferencing device. Each time a change occurs from a first look direction to a second look direction, information relating to the workspace captured by the acoustic echo canceller is saved for the first look direction, and previously saved information for the second look direction is retrieved from memory. The acoustic echo cancellation then takes place for the second look direction with the retrieved information.
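
The save-and-retrieve behavior described here resembles a per-direction state cache. The sketch below is a simplified illustration: the class name `DirectionalEchoCanceller`, the choice of adaptive filter taps as the saved "workspace", and the zero initialization are assumptions, and no actual echo cancellation is performed.

```python
class DirectionalEchoCanceller:
    """Keeps one saved echo-canceller state per look direction (hypothetical)."""

    def __init__(self, num_directions, num_taps):
        # One stored tap vector per look direction, all starting at zero.
        self._saved = {d: [0.0] * num_taps for d in range(num_directions)}
        self.direction = 0
        self.taps = list(self._saved[0])  # working state for the active direction

    def switch_direction(self, new_direction):
        if new_direction not in self._saved:
            raise ValueError("unknown look direction")
        if new_direction == self.direction:
            return
        # Save the workspace for the direction being left ...
        self._saved[self.direction] = list(self.taps)
        # ... and resume from what was previously stored for the new one.
        self.taps = list(self._saved[new_direction])
        self.direction = new_direction


if __name__ == "__main__":
    aec = DirectionalEchoCanceller(num_directions=3, num_taps=2)
    aec.taps = [0.5, -0.2]   # pretend adaptation happened for direction 0
    aec.switch_direction(1)
    aec.switch_direction(0)
    print(aec.taps)
```

Restoring the stored state means the canceller does not have to reconverge from scratch each time the beamformer returns to a direction it has already seen.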

Type: Grant

Filed: November 29, 2002

Date of Patent: January 24, 2006

Assignee: Mitel Knowledge Corporation

Inventors: Franck Beaucoup, Michael Tetelbaum
Detection of both voice and tones using Goertzel filters

Patent number: 6950511

Abstract: A plurality of Goertzel filters whose operating frequencies are distributed across the voice baseband are used to detect voice and control tones in a signal. Filters operating at frequencies of control tones and detecting that most of the signal energy occurs at those frequencies indicate presence of the control tones. At least three of the filters detecting that about 10% to 20% of the signal energy occurs at each of their operating frequencies indicate presence of voice. The total energy detected in the signal being below a noise threshold indicates presence of noise or silence.
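
A minimal sketch of the decision logic in this abstract follows, using the standard Goertzel recurrence to measure energy at selected frequencies. The frequency lists, the 0.5 cutoff used for "most of the energy", the noise threshold, the order of the checks, and the function names are illustrative assumptions; only the 10% to 20% band and the three-filter minimum come from the abstract.

```python
import math


def goertzel_power(samples, sample_rate, freq):
    """Squared magnitude of the signal component at `freq` (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def classify(samples, sample_rate, voice_freqs, tone_freqs, noise_threshold=1e-4):
    """Return 'noise_or_silence', 'tone', 'voice', or 'unknown'."""
    n = len(samples)
    energy = sum(x * x for x in samples)
    if n == 0 or energy / n < noise_threshold:
        return "noise_or_silence"

    def fraction(freq):
        # Share of total signal energy at `freq` (a real sinusoid splits its
        # energy between the positive and negative frequency, hence the 2).
        return 2.0 * goertzel_power(samples, sample_rate, freq) / (n * energy)

    if sum(fraction(f) for f in tone_freqs) > 0.5:  # assumed cutoff for "most"
        return "tone"
    voiced_bands = sum(1 for f in voice_freqs if 0.10 <= fraction(f) <= 0.20)
    return "voice" if voiced_bands >= 3 else "unknown"


if __name__ == "__main__":
    rate, n = 8000, 800
    tone = [math.sin(2 * math.pi * 2100 * i / rate) for i in range(n)]
    print(classify(tone, rate, [300, 600, 900, 1200, 1500, 1800], [2100]))
```

Each Goertzel filter costs one multiply and two additions per sample, which is why a small bank of them is attractive compared with a full FFT when only a handful of frequencies matter.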

Type: Grant

Filed: November 13, 2003

Date of Patent: September 27, 2005

Assignee: Avaya Technology Corp.

Inventors: Sharmistha Das, Matthew McShea
Method and apparatus for reducing echo feedback in a communication system

Patent number: 6947773

Abstract: A method and apparatus for reducing echo feedback in a wireless communication system (10) is accomplished when a receiving communication unit (24) senses a feedback signal, or echo, via an ancillary communication path. The receiving communication unit is a targeted recipient of an original audio signal generated by a transmitting communication unit (22), where the original audio signal (42) is conveyed to the receiving communication unit. Upon detecting the feedback signal and determining that it exceeds a feedback threshold, the receiving communication unit attenuates an audible output of the original signal to reduce echo to the transmitting communication unit. In addition, the receiving communication unit and/or the transmitting communication unit include an echo canceller to further minimize the echo within the digital communication system.

Type: Grant

Filed: April 5, 2004

Date of Patent: September 20, 2005

Assignee: Motorola, Inc.

Inventors: Robert Novorita, Eric Ziolko, Gary Grube
Method and apparatus for personalized conference and hands-free telephony using audio beaming

Patent number: 6937718

Abstract: The present invention is directed to the provision of a personalized speaker phone and hands-free telephony. In particular, the present invention allows communications to be output along a narrowly defined path, rather than being broadcast. In this way, a private voice communication signal can be provided to a user, even though the user is not holding the output device to the user's ear. Furthermore, by providing audible signals along narrowly defined paths, different audible signals may be provided to users at the same location, without interfering with one another.

Type: Grant

Filed: September 4, 2002

Date of Patent: August 30, 2005

Assignee: Avaya Technology Corp.

Inventor: Alexander Martin Scholte
Telephone having four VAD circuits

Patent number: 6754337

Abstract: Voice activity is detected by comparing a signal with two thresholds and producing data representing the energy of the signal. The data, in binary form, is compared with thresholds to determine voice activity. In accordance with another aspect of the invention, the thresholds are adjusted based upon statistical information. In accordance with another aspect of the invention, the data can be weighted to provide an indication of the quasi-RMS energy of an input signal. In accordance with another aspect of the invention, voice activity detectors, individually weighted, are provided at each input and each output of a telephone for reliably controlling echo cancelling circuitry within the telephone.

Type: Grant

Filed: January 25, 2002

Date of Patent: June 22, 2004

Assignee: Acoustic Technologies, Inc.

Inventors: Steven M. Domer, Kellie Michele Vanda
Detection of the speech activity of a source

Patent number: 6707910

Abstract: The scope of the present invention is a device for detecting the source of a voice, which device comprises microphone means (2; 2a, 2b, 2M) for receiving a voice signal and detecting means for detecting the voice from the received voice signal. The device comprises means (15, 17) for determining the direction of arrival of the received signal, means (17) for storing the assumed direction of arrival of the voice of a certain source and means (18) for comparing the direction of arrival of said received signal with said assumed direction of arrival. The device further comprises means (18) for indicating that the source of the voice is said certain source when the comparison proves that the direction of arrival of said received signal matches with said assumed direction of arrival within a certain tolerance.
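
The comparison step in this abstract amounts to checking whether a measured direction of arrival falls within a tolerance of a stored direction. The sketch below assumes angles in degrees with wrap-around at 360, a default tolerance of 10 degrees, and the hypothetical names `angular_difference` and `is_expected_source`; how the direction itself is estimated from the microphones is not shown.

```python
def angular_difference(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


def is_expected_source(measured_doa_deg, assumed_doa_deg, tolerance_deg=10.0):
    """True when the measured direction matches the stored direction within tolerance."""
    return angular_difference(measured_doa_deg, assumed_doa_deg) <= tolerance_deg


if __name__ == "__main__":
    # 355 degrees and 2 degrees are only 7 degrees apart across the wrap-around.
    print(is_expected_source(355.0, 2.0))
```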

Type: Grant

Filed: September 2, 1998

Date of Patent: March 16, 2004

Assignee: Nokia Mobile Phones Ltd.

Inventors: Päivi Valve, Juha Häkkinen
Architectural sound enhancement with DTMF control

Publication number: 20030142814

Abstract: A unique, fully integrated, fully programmable, and highly flexible sound distribution system and methodology for providing masking sound, background music, and paging capabilities in up to eight zones of a building or space is provided. The methodology embodied in the system includes internal masking sounds that are uniquely pre-filtered to provide efficient and effective masking of distracting sounds within selectable zones of the space with a minimum masking sound dB sound level and with a pleasant sounding and non-annoying masking sound. The system also incorporates the capacity to be controlled from a remote or local telephone to adjust the volume level in any zone serviced by the system by issuing appropriate DTMF codes from the telephone's keypad. Unique bi-tone diagnostic functions are provided for assuring that the entire system is correctly wired and installed and for troubleshooting operational anomalies.

Type: Application

Filed: March 28, 2002

Publication date: July 31, 2003

Inventors: Kenneth P. Roy, Thomas J. Johnson, Ronald Fuller, Steve Dove
Radio telecommunication terminal

Patent number: 6564072

Abstract: The invention relates to a mobile radio terminal including an audio transducer (5), an outlet (6) of which is oriented towards the front face of the terminal and provides earpiece, loudspeaker and ringer functions, the terminal further including, coupled to the transducer, means (R1, 9′, 11, 12) for adjusting the level of the sound wave emitted by the transducer in a main direction (D) substantially perpendicular to the front face, the adjustment being effected regardless of the position of the terminal relative to a user.

Type: Grant

Filed: November 3, 1999

Date of Patent: May 13, 2003

Assignee: Alcatel

Inventors: Luc Attimont, Jannick Bodin
Method for screening active voice mail messages

Patent number: 6529587

Abstract: A method for screening an active incoming voice mail message broadcasts the incoming message in real time on a speaker in or associated with the subscriber's telephone set upon and concurrent with receipt of the same by the voice mail system. Upon detection during the broadcast of an interrupt request provided by the subscriber via the subscriber's telephone set, the calling party is connected with the subscriber, and normal recording of the message by the voice mail system is discontinued.

Type: Grant

Filed: April 27, 1999

Date of Patent: March 4, 2003

Assignee: Agere Systems Inc.

Inventors: Joseph M. Cannon, Donald Alfred Fleck, James A. Johanson, Philip David Mooney
Method of displaying alternating transmitting and receiving phases of voice communication in a mobile phone in a speakerphone mode

Patent number: 6473629

Abstract: A method of displaying alternating transmitting and receiving phases of voice communication in a mobile phone with a display when it is operated in the speakerphone mode includes the steps of comparing the intensity of a received voice signal from a caller with that of a transmitted voice signal, outputting the received voice signal through the speaker of the mobile phone while displaying a visual indication representing the receiving phase on the display if the intensity of the received voice signal is stronger than that of the transmitted voice signal, and sending the transmitted voice signal to the caller while displaying a visual indication representing the transmitting phase on the display if the intensity of the received voice signal is weaker than that of the transmitted voice signal.

Type: Grant

Filed: January 12, 2000

Date of Patent: October 29, 2002

Assignee: Samsung Electronics, Co., Ltd.

Inventor: Yun-Seok Chang
Method for automatically assisting unaided voice communication

Patent number: 6353732

Abstract: Automatic assistance to unaided voice communication is provided by activating and deactivating a wireless communication link as needed (300). A communication device, associated with a particular individual, monitors to determine whether unaided communication occurring with another individual is satisfactory according to a predetermined criteria (310, 320, 330). The communication device automatically switches to provide aided communication, when the unaided communication is not satisfactory. Preferably, an open communication link is established between the communication device and one associated with the other individual when sound reception characteristics or separation characteristics do not meet a particular criteria (340, 350, 360, 370).

Type: Grant

Filed: June 8, 1998

Date of Patent: March 5, 2002

Assignee: Motorola, Inc.

Inventor: Joseph L. Dvorak