WEARABLE HEADSET WITH SELF-CONTAINED VOCAL FEEDBACK AND VOCAL COMMAND
A headset includes a wearable body, first and second earphones extending from the wearable body, controls for controlling an external communication/multimedia device wirelessly, a microphone for picking up vocal data from a user of the headset system and a signal processing unit. The signal processing unit includes circuitry for processing the vocal data into a distinctly audible vocal feedback signal, circuitry for enhancing the vocal feedback signal thereby producing an enhanced vocal feedback signal and circuitry for mixing the enhanced vocal feedback signal with audio signals originating from the external communication/multimedia device, thereby producing a mixed output signal and then sending the mixed output signal to the user via the earphones. The external communication/multimedia device comprises a vocal command application and the headset further comprises a vocal command control for sending vocal commands to the external communication/multimedia device and to the vocal command application.
Latest ONVOCAL, INC. Patents:
This application is a continuation of U.S. Non-Provisional application Ser. No. 13/928,535 filed on Jun. 27, 2013, which is a continuation of U.S. Non-Provisional application Ser. No. 12/539,009 filed on Aug. 11, 2009 and issued as U.S. Pat. No. 8,498,425 on Jul. 30, 2013, which is commonly assigned, and the contents of which are expressly incorporated herein by reference. This application also claims the benefit of U.S. Provisional Application No. 61/088,417 filed on Aug. 13, 2008, which is commonly assigned, and the contents of which are expressly incorporated herein by reference. This application claims also the benefit of U.S. Provisional Application No. 61/149,372 filed on Feb. 3, 2009, which is commonly assigned, and the contents of which are expressly incorporated herein by reference.
FIELD OF THE INVENTIONThe present invention relates to a wearable headset and in particular to a wearable stereo headset with self-contained vocal feedback and vocal command.
BACKGROUND OF THE INVENTIONHeadsets or headphones are typically used in connection with communication and multimedia devices in order to listen to audio signals produced by or transferred from these devices. Examples of such communication and multimedia devices include mobile phones, radio receivers, portable music players like CD players and MP3 players.
SUMMARY OF THE INVENTIONThe present invention relates to a wearable wireless headset and in particular to a wearable stereo wireless headset with focused vocal feedback and vocal command of the user wearing the device. The invention is designed to focus the microphone on the user's vocal frequency spectrum and to provide real-time (zero time delay) distinctly audible feedback to the user so that he can adjust his own vocal volume, pitch, tone and ambient environment noise, among others, accordingly. The invention also includes a vocal command button for capturing and transferring voice stream data wirelessly to a communication/multimedia device, for voice to text recognition, voice application control, and voice authentication. The voice stream data are executed by a vocal command application located on the communication/multimedia device or on a remote server to which the device is connected either via a wired or a wireless connection. The invention may be used for interacting with and controlling various mobile device functions including controlling phone calls and music player, and other devices with Bluetooth or adaptors to Bluetooth. The invention provides valuable vocal feedback of the user to enhance his/her experience in making phone calls for more clear and accurate conversations with either humans or computer voice recognition systems. The invention may also be used for entertainment or music training by providing accurate vocal feedback to a singer, or in leisure to someone who wants to karaoke with the songs he/she is listening to. The invention may also be used in connection with learning languages, to help the learner with vocal feedback in practicing with audio training material. The invention focuses on the vocals only and cuts away ambient noise and therefore can also be used widely for more accurate voice dictation and voice commands in conjunction with other mobile devices, PCs, video games and other interactive devices. The invention may be connected to any device with synchronized wireless transmission such as Bluetooth. It may also be connected directly via a standard stereo jack, or stereo jack with microphone support to many mobile phones. Most applications would be connected with other Bluetooth enabled devices, or via Bluetooth adapted devices.
In general, in one aspect, the invention features a headset including a wearable body, first 30 and second earphones extending from the wearable body, controls for controlling an external communication/multimedia device wirelessly, a microphone for picking up vocal data from a user of the headset system and a signal processing unit. The signal processing unit includes circuitry for processing the vocal data into a distinctly audible vocal feedback signal, circuitry for enhancing the vocal feedback signal thereby producing an enhanced vocal feedback signal and circuitry for mixing the enhanced vocal feedback signal with audio signals originating from the external communication/multimedia device, thereby producing a mixed output signal and then sending the mixed output signal to the user via the earphones.
Implementations of this aspect of the invention may include one or more of the following 10 features. The external communication/multimedia device comprises a vocal command application and the headset further comprises a vocal command control for sending vocal commands to the external communication/multimedia device and to the vocal command application. The wearable body is configured to be worn around the back of the user's neck and comprises bendable and flexible material. The wearable body comprises a U-shaped frame that conforms to the back of the user's neck and includes first and second ends and the microphone extends from one of the U-shaped frame ends. The U-shaped frame is foldable and comprises adjustable length. The first and second earphones are contained and concealed within first and second openings in the first and second ends of the U-shaped frame, respectively. The earphones are pulled out from the openings when in use and retract back into the openings when not in use. The external communication/multimedia device may be a mobile phone, MP3 player, portable music player, personal digital assistant, personal computer, or television set. The signal processing unit is contained within the U-shaped frame and comprises an interface, a signal processor, filters, a mixer, a battery, and a recorder. During operation the headset communicates with and sends and receives data to and from the external communication/multimedia device via the interface. The interface comprises a wireless interface and the wireless interface may be a Bluetooth interface. The enhancement circuitry comprises filters for enhancing the user's vocal frequencies, filters for reducing ambient noise and controls for adjusting the volume of the enhanced vocal feedback 30 signal relative to the volume of the audio signals originating from the communication/multimedia device. Activating and holding the vocal command control on sends a vocal command activation signal from the headset to the external communication/multimedia device, wakes up the vocal command application and sends voice data to the vocal command application via the interface. The vocal command application comprises means for converting the voice data into text or pattern, locally or remotely via a server. The vocal command application further translates the text or pattern into commands that perform one or more functions by the vocal command application and other applications on the external communication/multimedia device. The signal processing unit further comprises a recorder and upon waking up of the vocal command application, a first alert tone is sent to the earphones indicating that recording of voice data in the recorder has commenced. Releasing of the vocal command control stops the recording and then sends a second alert tone to the earphones indicating that the recording is stopped.
In general, in another aspect, the invention features a system comprising a headset and a communication/multimedia device. The headset communicates with the communication/multimedia device wirelessly and includes a wearable body, first and second earphones extending from the wearable body, controls for controlling the external communication/multimedia device wirelessly, a microphone for picking up vocal data from a user of the headset, a signal processing unit and a vocal command control. The external communication/multimedia device comprises a vocal command application and the vocal command control sends vocal commands to the external communication/multimedia device and to the vocal command application.
Implementations of this aspect of the invention may include one or more of the following features. The signal processing unit includes circuitry for processing the vocal data into a vocal feedback signal, circuitry for enhancing the vocal feedback signal thereby producing an enhanced vocal feedback signal and circuitry for mixing the enhanced vocal feedback signal with audio signals originating from the external communication/multimedia device, thereby producing a mixed output signal and then sending the mixed output signal to the user via the earphones. The system may further include a remote server and the communication/multimedia device comprises means for communicating with and sending the vocal commands voice data to the remote server via a network and the server comprises means for converting voice data contained in the vocal commands into text or pattern and then return the text or pattern back to the vocal command application for executing commands translated from the text or pattern. The external communication/multimedia device further comprises a voice recognition application and the voice recognition application receives voice data from the headset and converts them into text or pattern. The vocal command application translates the text or pattern into commands that perform one or more functions by the vocal command application and other applications on the external communication/multimedia device. The remote server comprises an authentication application for recognizing and authenticating the user from the voice data. The commands may be “call a person”, “e-mail a person”, “search a content”, “text a person” or “Goto a location” based on Global Positioning System (GPS).
In general, in another aspect, the invention features a method for issuing vocal commands to a communication/multimedia device via a headset. The method includes the following steps. First, providing a headset comprising a wearable body, first and second ear phones extending from the wearable body, controls for controlling the external communication/multimedia device wirelessly, a microphone for picking up vocal data from a user of the headset, a signal processing unit and vocal command control for sending voice data to the communication/multimedia device. Next, activating the communication/multimedia device's vocal command mode by turning and holding the vocal command control on. Next, sending a wake-up signal to the communication/multimedia device via an interface of the headset for waking up a vocal command application (VCA) and begin recording of vocal command data. Upon start of recording of voice command, sending a first alert sound to the earphones indicating that recording of voice data has commenced. Next, speaking voice commands into the microphone and recording the captured voice data. Next, releasing the vocal command control, thereby stopping the recording of the captured voice data and sending a second alert sound to the earphones indicating that recording of voice data has stopped. The voice data can be sent to the VCA as a stream while recording continues or as a block of data after recording stops. Finally, sending the recorded voice data to the communication/multimedia device and the VCA for processing. Alternatively, the recorded data are sent by the VCA to a remote server via the communication/multimedia device for further processing. The processing of the recorded voice data by the VCA or the remote server includes applying voice recognition software and converting the voice data into text or pattern and executing commands contained in the voice data by the VCA. The signal processing unit may comprise circuitry for processing the vocal data into a vocal feedback signal, circuitry for enhancing the vocal feedback signal thereby producing an enhanced vocal feedback signal and circuitry for mixing the enhanced vocal feedback signal with audio signals originating from the external communications/multimedia device, thereby producing a mixed output signal and then sending the mixed output signal to the user via the earphones.
The ability to hear one's own vocals accurately is critical to musicians performing well. This is why feedback is set up for the musician either by speakers in front of them, or via headsets, or earpieces. Microphones to localize and amplify the vocals of a singer through the feedback systems allow for accurate pitch control, volume control and tone control by the singer. While professional musicians take great care in setting up systems that enhance their vocal “output” for their audiences receiving their “input”, there is no feedback control for general consumers to enhance their vocal output to their “audience” on the other end of their mobile phones. Mobile phone users suffer from not knowing how well or poorly their own microphone position, their volume or ambient noise are affecting the quality of their communication with parties on the other line (human or machine: i.e. phone calls, or to IVR systems, voice recognition systems from many customer services solutions, as well as voice commands, and voice to text dictation, and so forth,) which causes annoyance at best, and miscommunication at worst. Without proper vocal feedback, it is very hard to adjust one's microphone, pitch and volume to provide accurate and clean voice signal. The next generation of voice and speech technology requires clean and accurate voice commands and thus requires vocal feedback on standard headsets.
For language education, one of the most important things for accurate pronunciation and tone control is the ability to have real-time feedback of one's vocals. This invention allows for accurate feedback, which helps the learner control its pitch and learn better.
For leisure and entertainment, we all have seen how popular Karaoke has become. The added benefit of this invention provides everyone with portable music player with a personal Karaoke device.
Prior art headsets or headphones are typically used in connection with communication and multimedia devices in order to listen to audio signals produced by or transferred from these devices. Examples of such communication and multimedia devices include mobile phones, radio receivers, portable music players like CD players and MP3 players. None of the prior art headsets provide vocal feedback in real time or vocal command.
There are headsets with a microphone that can be turned on to hear ambient noise better while music is playing, but they are not intended for vocal feedback of the user. There are even noise cancellation headphones with microphones to pick up and cancel ambient noise, but none for picking up vocals and enhancing the vocal sounds. There are professional musician wireless feedback systems for vocal feedback, but none are self-contained single board headsets that can be used with mobile phones. They do not control mobile phones and MP3 players to play, track forward and backward, answer phone calls, hang up, or send vocal commands to mobile phones. These systems are built on separate circuit boards and are combined with separate transceivers that go to a mixing board or an alternative source, and then via a transceiver back to the earpiece. Therefore, they are not designed for this invention's intended purpose.
This invention is also unique in its form factor, in that it is completely wearable around the back of the neck as the support structure with ear phones that extend to the ears. Other wireless headsets typically fit around the ear, or are worn over the head, or around the back of the head. No wireless headsets today are worn on the neck which provides stability, support, comfort, and is ideal for the mobile professional or for many forms of exercises such as jogging, skiing, biking and exercise class instruction, among others.
Referring to
Referring to
Referring to
In another embodiment, the voice stream data are passed to an authentication server, which identifies and matches the user's voice patterns for authentication, and thus allowing an application or data to be used by the user, or a transaction to be processed.
In another embodiment the user initiates the vocal command mode by pressing a button 92 on the communication/multimedia device 90 which causes the same sequence of events as described above. Button 92 may be a physical button or a soft button on a touch screen in the application. When the button 92 is pressed an alert sound is transmitted to the ear pieces and recording starts. The captured voice data are streamed or sent as a file to application 200. Upon completion of the recording, the button 92 is released, an alert sound is sent to the ear pieces and the recording stops.
As was described above, pressing the vocal command button 190 causes a signal to be sent to the mobile phone 90 and activates application 200 so that it is ready to receive a voice command. A signal is sent back to the headset device 100 to alert that it is “Ready to Listen”. The user speaks and the voice command is captured in an audio file which is then sent to application 200.
Application 200 then sends the audio file to a server 50 for voice recognition. The recognized command returns back to application 200 where it is interpreted. Subsequently an action is taken by the application. Examples of voice commands and follow-up actions include the following:
1) “Call John Smith”—the application dials John Smith's number in the phone contact list.
2) “Email or Text John Smith, Subject Meeting Tomorrow”—the application initiates an email application, fills in the contact email address and subject heading, and then waits to do Voice To Text for the rest of the email.
3) “Search Sushi Restaurant Downtown Boston” the application initiates Yahoo One Search or other browser and searches for results.
The user may control the mobile device via additional control buttons 93 located on the device 90 or via control buttons 170 integrated in the headset 100, as shown in
Other embodiments of the headset include one or more of the following. The U-shaped frame 104 may be foldable in one or more than two locations. The frame may also include electronic circuitry to allow for the various size adjustments. Frame 104 may have an ergonomic design and may be supported on top of the user's head 50, around the back of his head, around and/or on top of the ears. The headset may include a memory for storing music or other information. Signal mixer 160 may also be part of the DSP 120. The communication/multimedia device 90 may be an MP3 player, iphone, ipod, PDA, mobile phone, personal computer, television set, or any other wireless or wired multimedia device. The microphone 130 may be a high quality microphone and the headset may be stereo or mono headset. In one example the microphone is a 4 mm microphone with a pre-amplifier. The wireless interface may be Bluetooth hands-free, Bluetooth A2DP (stereo music), Bluetooth AVRCP (stereo gaming), infrared, or any other wireless format. In one example the wireless interface is a BlueCore7 provided by Cambridge Silicon Radio (CSR) of Cambridge, UK. The headset may include digital encryption for secure conversations. The communication/multimedia device may be incorporated within the headset. The microphone may be telescopic, rotatable, and/or detachable.
Several embodiments of the present invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.
Claims
1. A headset comprising:
- a headset comprising a wearable body, and first and second earphones extending from said wearable body;
- a wireless interface disposed in relation to the wearable body, interfacing an external communication device configured to be controlled wirelessly via commands from said headset, said external communication device comprising a vocal command application and wherein said headset further comprises a vocal command control sending vocal commands wirelessly via said wireless interface to said external communication device and to said vocal command application, wherein activating said vocal command control sends a vocal command activation signal from said headset to said external communication device, wakes up said vocal command application and sends voice data to said vocal command application via said interface, and wherein the vocal command application translates the vocal commands to perform one or more functions by said vocal command application;
- a microphone picking up the voice data from a user of the headset;
- a signal processing unit installed in said headset and comprising, signal processing circuitry processing said voice data into a vocal feedback signal, and enhancing said vocal feedback signal in a vocal frequency range thereby producing an enhanced vocal feedback signal; mixer circuitry mixing said enhanced vocal feedback signal with audio signals originating from said external communication device, thereby producing a mixed output signal including the user's enhanced vocal feedback signal; communication circuitry sending the mixed output signal to said earphones; and
- a control adjusting the volume of said enhanced vocal feedback signal relative to volume of audio signals originating from said external communication device.
2. The headset of claim 1 wherein the vocal frequency range enhanced to produce the enhanced vocal feedback signal is approximately 200 Hertz to 5000 Hertz.
3. The headset of claim 1 wherein said signal processing unit further comprises a recorder for recording voice data.
4. The headset of claim 3 wherein upon starting of recording, a first alert tone is sent to said earphones indicating that recording of voice data has commenced and wherein releasing of said vocal command control stops said recording and then sends a second alert tone to said earphones indicating that the recording is stopped.
5. The headset of claim 1 wherein said wearable body comprises a U-shaped frame that conforms to the user's neck and comprises first and second ends and wherein said microphone extends from at least one of said U-shaped frame ends.
6. The headset of claim 5 wherein said U-shaped frame is foldable and comprises adjustable length.
7. The headset of claim 1 wherein said first and second earphones are concealable within first and second openings in said wearable body
8. The headset of claim 1 wherein said wearable body comprises a U-shaped frame that conforms to the user's neck and wherein said first and second earphones are concealable within first and second openings in first and second ends of said U-shaped frame, respectively, and wherein said earphones are pulled out from said openings when in use and retract back into said openings when not in use.
9. The headset of claim 1 wherein said signal processing circuitry comprises filters enhancing said user's vocal frequencies.
10. A headset system comprising:
- a headset comprising a wearable body, first and second earphones extending from said wearable body and a wireless interface;
- an external communication device configured to be controlled wirelessly via commands, including vocal commands, from said headset through said wireless interface, said external communication device comprising a vocal command application and wherein said headset sends vocal commands wirelessly via said wireless interface to said vocal command application, wherein sending a vocal command activation signal from said headset to said external communication device, wakes up said vocal command application and sends voice data to said vocal command application via said wireless interface, and wherein the vocal command application translates the vocal commands and performs one or more functions;
- a microphone picking up the voice data from a user of the headset;
- a signal processing unit installed in said headset and comprising signal processing circuitry processing said voice data into a vocal feedback signal, and enhancing said vocal feedback signal in a vocal frequency range producing an enhanced vocal feedback signal, said signal processing circuitry mixing said enhanced vocal feedback signal with audio signals originating from said external communication device producing a mixed output signal including the user's enhanced vocal feedback signal;
- mixer circuitry sending the mixed output signal to said earphones;
- a control adjusting the volume of said enhanced vocal feedback signal relative to volume of audio signals originating from said external communication device; and
- a recorder for recording voice data, wherein upon starting of recording, a first alert tone is sent to said earphones indicating that recording of voice data has commenced and wherein deactivating of said vocal command control stops said recording and then sends a second alert tone to said earphones indicating that the recording is stopped.
11. The headset system of claim 10 wherein said external communication device comprises one of mobile phone, MP3 player, portable music player, personal digital assistant, personal computer, or television set.
12. The headset system of claim 10 wherein said wireless interface comprises a Bluetooth Interface.
13. The headset system of claim 10 wherein said signal processing unit is contained within said U-shaped frame and comprises said wireless interface, a signal processor, filters, a mixer, a battery and a recorder.
14. The headset system of claim 10 further comprising a remote server and wherein said communication device communicates with and sends said vocal commands voice data to said remote server via a network and wherein said server converts voice data contained in said vocal commands into text or pattern and then returns said text or pattern back to said vocal command application for executing commands translated from said text or pattern.
15. The headset system of claim 10 wherein said external communication device further comprises a voice recognition application and said voice recognition application receives voice data from said headset and converts said voice data into text or pattern and wherein said vocal command application translates said text or pattern into vocal commands that perform one or more functions by said vocal command application and other applications on said external communication device.
16. The headset system of claim 14 wherein said remote server comprises an authentication application for recognizing and authenticating said user from said voice data.
17. The headset system of claim 10 wherein said vocal commands comprise one of “call a person”, “e-mail a person”, “search a content”, “text a person” or “Goto a location” based on GPS.
18. The headset system of claim 10 wherein said signal processing circuitry comprises filters enhancing said user's vocal frequencies.
19. The headset system of claim 10 wherein the vocal frequency range enhanced to produce the enhanced vocal feedback signal is approximately 200 Hertz to 5000 Hertz.
Type: Application
Filed: Sep 11, 2017
Publication Date: Jan 11, 2018
Applicant: ONVOCAL, INC. (Northborough, MA)
Inventor: WILL WANG GRAYLIN (Winchester, MA)
Application Number: 15/700,956