COMMUNICATION METHODS AND APPARATUS FOR ONLINE GAMES

Info

Publication number: 20140194201
Type: Application
Filed: Mar 11, 2014
Publication Date: Jul 10, 2014
Applicant: ROCKSTAR BIDCO, LP (New York, NY)
Inventors: Ian Domville (Wallingford), Henry Rawlinson (Maidenhead)
Application Number: 14/204,242

Abstract

By providing voice recognition in a gaming environment, an IVR server may operate to revoice a player with a voice tone pitch and accent more appropriate to the player's character. This improves the quality of the illusion of the game play. Furthermore, accurate volume may be applied to particular voices dependent on their location relative to the player and the game may be controlled using voice commands.

Description

Description

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of copending patent application Ser. No. 11/537,040, filed Sep. 29, 2006, the content of which is herein incorporated by reference in its entirety.

FIELD OF THE INVENTION

The present invention relates to online gaming and in particular, but not exclusively, to the provision of IVR (Interactive Voice Response) enhancement of multiplayer online role-playing games.

BACKGROUND OF THE INVENTION

Multiplayer online role-playing games enable large numbers of players to participate in a single game. Some of the more popular games include World of Warcraft, Everquest II, Runescape and Star Wars Galaxies.

A key element of these games is that players can role-play a character that is very different to their real self. For example, a character may be a different age, sex or race. Fantasy role-playing games often allow players to play, as non-human characters such as a dwarf or an elf. In both cases, players typically appear to each other in the game as the characters they are playing.

Within the context of the games, players routinely interact with each other. Players can cooperate and/or compete with each other to achieve their aims. In many cases, cooperation between several players is essential in order for certain objectives to be achieved and competition between teams of players is not uncommon.

Communication between players is typically achieved using text-based communications such as chat or private messages. Many games apply profanity filters to these messages, but it is relatively easy for players to bypass them with creative spellings or by using non-alphabetic characters to construct letters.

Players can also interact with computer controlled non-player characters (s) in which dialogue between a player and an will be a textual interaction; with the player selecting from a range of possible responses at each stage of the dialogue.

In addition, some groups of players use 3rd party voice conferencing services to coordinate their activities when operating as a team.

However, if a player is very different to the character they are role-playing, for example, a different sex, this direct voice communication inevitably detracts from the illusion that a player wishes to project.

SUMMARY OF THE INVENTION

The present invention allows players to establish voice communications with an IVR system, which performs real-time voice recognition and reproduces the player's speech using a voice appropriate to the player's character. Typically, the voice will have been chosen by the player from an appropriate selection offered by the game during the character creation phase.

Since each player will individually log in to play the game, a user-specific speaker profile can be established and used to improve the effectiveness of the speech recognition, enabling a wider vocabulary to be used. As part of the player's registration process, a player is asked to provide some simple speech samples of known text, enabling a speech profile to be established. In addition, players may be provided with a facility that allows them to further coach the system with additional 20 words, including names of other characters.

The speech output from the IVR process can be used as the player's voice in a number of ways.

A private dialogue between two players. This requires the modified voice to be delivered to the other player's computer or gaming console.

A closed dialogue between several players. This requires either a dedicated conference facility or a mixing facility on each player's PC able to combine multiple voice streams.

An open dialogue between all players in a locale. Since each player should 30 only hear the voices of other characters close to them and this will often be a unique combination for every player, it is necessary to provide a dedicated voice stream mixing facility for each player. This can be network-based, but a mixing facility on each PC is likely to offer the best solution. As a further refinement, distant voices can be reduced in volume to deliver a more realistic experience.

The speech can also be reproduced as text to provide confirmation that the speech has been correctly recognised. If errors in recognition do occur the player can use a coaching system to reduce the chances of the same error being repeated.

Similarly, a text to speech facility can be provided to allow the player to type in text and have it spoken by their character.

As a further refinement, a profanity filter can be applied to the speech before it is regenerated as the modified voice, enabling obscene language to be replaced with an alternative such as silence. Unlike profanity filters applied to text messages, it is impossible to bypass a speech filter using creative spelling of words.

The IVR system can also be used to enable players to interact verbally with s. The IVR system can identify key words in the player's speech and use these to select an appropriate response by the. The use of natural language analysis provides a more fluid dialogue, enhancing the player's experience.

According to a first aspect, the invention provides communication apparatus for an online gaming network comprising a voice input, voice recognition means, voice generation means and voice output means connectable to a voice bridge, the voice recognition means being operable to recognise voice received via the voice input and to generate voice data representative of recognised words, the voice generation means being operable to regenerate voice from the voice data using a predetermined voice type and to pass the regenerated voice to the voice output.

In a second aspect, the invention provides a method of communicating in an online game comprising, receiving speech from a game player, performing speech recognition on the received speech, regenerating the speech using a predetermined speech type and passing the regenerated speech to at least one other player in the online game.

Other aspects and features of the present invention will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic block diagram of a gaming system including apparatus in accordance with the invention;

FIG. 2 is a schematic block diagram of an IVR server in accordance with the invention; and

FIG. 3 is a schematic block diagram of a game server in accordance with the invention

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

With reference to FIG. 1, a terminal 2 such as an IBM compatible PC or games console is connected to a game server 4 in the conventional way. Typically this will be a connection via a broadband internet connection. The game server operates to co-ordinate the activities of a plurality of players connected via a plurality of terminals 2 (not shown). Control signals pass between the game server and the terminal. The terminal passes control signals to the game server indicating the activities of the role played by the player and control signals pass back from the game server to the terminal to indicate to the terminal 2 what images to render and sounds to create based on the activities of the players and non-player characters.

In use, voice output from the player is passed to an IVR server 6. This is described in more detail below. The IVR server 6 controls an audio bridge 8 which feeds sound back to the terminal 2. The game server 4 also may control the activity of the audio bridge 8 as described below. The audio bridge 8 may be dedicated to a single player or may operate to provide audio feedback to more than one player.

With reference also to FIG. 2, the IVR server 6 serves to recognise the voice output of a player operating at terminal 2. A voice recognition unit 8 converts the recognised voice into a normalised internal format such as text. Optionally the internal format may be checked for profanity in a profanity filter 10. The filter may block profane comments and words. A revoicer 12 converts the text back to audible voice data which is then fed to the audio bridge for onward transmission to players. The revoicer voices the players according to characteristics selected by the player and/or game server. Thus the revoiced sounds typically will sound very different to the original sounds produced by the player prior to voice recognition.

Optionally, the internal format produced by the voice recognition unit may also be passed back to the game server. In this way certain aspects of the game may be controlled using voice commands.

With reference to FIG. 3, the game server 4 optionally may control the audio bridge 8. If, for example, a dedicated audio bridge is used for a single player then the game server 4 may accurately control which voices of other players or NPCs may be heard by the player. Also, volume control and other effects such as reverberation and echo may be applied to particular voices depending on the distance and location of the voice in the game, relative to the player. This information is know by the game server and may be processed by audio control unit 14 to provide realistic audio control. Alternatively, an audio bridge may be shared by a group of people for example a group of people located in a particular unit within the game such as a room. In this case, all people in the room may be able to hear one another at the same volume. This simplifies audio control and also reduces the number of bridges required.

Words may be taught to the IVR server 6 via the game client operating on the terminal 2. The game client may include a coaching or training module which allows the player to pronounce particular words which are not initially understood by the IVR server 6.

In this way, the realism of the illusion of the game is maintained by providing voices which match those of a particular player/character. Furthermore, the option of voice control of the game is also provided, improved profanity filtering which cannot be bypassed by unusual spellings or use of characters and furthermore, realistic voice volumes may also be applied. These features greatly enhance the accuracy of the game experience.

Claims

1. A method of communicating in an online game, comprising:

receiving speech from a player of the online game;

performing speech recognition on the received speech;

generating a representation of the recognized speech as speech data in non-speech format; and

regenerating speech from the speech data in the non-speech format for presentation to at least one other player of the online game, the regenerated speech having a predetermined speech characteristics.

2. The method of claim 1, wherein receiving the speech from the player of the online game is performed at a user device operated by the player of the online game.

3. The method of claim 2, wherein performing speech recognition on the received speech is performed at a server coupled to the user device by a communication network.

4. The method of claim 2, wherein generating a representation of the recognized speech as speech data in an non-speech format is performed at a server coupled to the user device by a communication network.

5. The method of claim 2, wherein regenerating speech from the speech data in the non-speech format for presentation to at least one other player of the online game is performed at a server coupled to the user device by a communication network.

6. The method of claim 1, comprising presenting the regenerated speech to multiple other players of the online game.

7. The method of claim 1, comprising not presenting the regenerated speech to at least one other player of the online game.

8. The method of claim 1, wherein regenerating speech from the speech data in the non-speech format comprises processing the speech for presentation to a particular player of the online game based on data indicating a respective location of the particular player in the game.

9. The method of claim 8, wherein processing the speech for presentation to a particular player of the online game based on data indicating a respective location of the particular player in the game comprises adjusting at least one of:

volume of the speech for presentation to the particular player;

reverberation of the speech for presentation to the particular player; and

echo of the speech for presentation to the particular player.

10. The method of claim 1, wherein performing speech recognition on the received speech comprises performing speech recognition on the received speech using a speech recognition process which is specific to the player of the online game from whom the speech is received.

11. The method of claim 10, comprising training the speech recognition process which is specific to the player of the online game from whom the speech is received based on speech received from the player of the online game during a training process.

12. The method of claim 1, comprising:

performing speech recognition on speech received from each of a plurality of players of the online game to provide a respective recognized speech stream from each respective player;

generating a respective representation of each respective recognized speech stream as a respective stream of speech data in non-speech format; and

regenerating a respective speech stream from each respective stream of speech data in the non-speech format;

mixing a respective selection of the regenerated speech streams for presentation to each respective player of the game.

13. The method of claim 12, wherein performing speech recognition on speech received from each of the plurality of players of the online game comprises performing speech recognition on the speech received from each respective player using a respective speech recognition process which is specific to the respective player of the online game from whom the speech is received.

14. The method of claim 13, comprising training each respective speech recognition process based on respective speech received from each respective player during a respective training process.

15. The method of claim 1, comprising selecting at least some of the predetermined speech characteristics based on at least one input received from the player of the online game.

16. The method of claim 15, wherein selecting at least some of the predetermined speech characteristics based on input received from the player of the online game comprises selecting at least some of the predetermined speech characteristics based on at least one input received from the player of the online game during setup of the online game.

17. The method of claim 15, wherein selecting at least some of the predetermined speech characteristics based on input received from the player of the online game comprises selecting at least some of the predetermined speech characteristics based on at least one input received from the player of the online game during play of the online game.

18. The method of claim 1, comprising processing the recognized speech to block at least one predetermined word.

19. The method of claim 1, comprising processing the recognized speech to replace at least one predetermined word with another predetermined word.

20. The method of claim 2, comprising controlling at least one aspect of the online game based on at least some of the speech data in the non-speech format.

21. The method of claim 1, wherein the non-speech format is a text format.

22. A method of controlling an online game, comprising:

receiving speech from a player of the online game;

performing speech recognition on the received speech;

generating a representation of the recognized speech as speech data in an internal format; and

controlling at least one aspect of the online game based on at least some of the speech data in the internal format.

23. The method of claim 22, wherein receiving the speech from the player of the online game is performed at a user device operated by the player of the online game.

24. The method of claim 23, wherein performing speech recognition on the received speech is performed at a server coupled to the user device by a communication network.

25. The method of claim 23, wherein generating a representation of the recognized speech as speech data in an non-speech format is performed at a server coupled to the user device by a communication network.

26. The method of claim 23, wherein regenerating speech from the speech data in the non-speech format for presentation to at least one other player of the online game is performed at a server coupled to the user device by a communication network.

27. The method of claim 22, wherein performing speech recognition on the received speech comprises performing speech recognition on the received speech using a speech recognition process which is specific to the player of the online game from whom the speech is received.

28. The method of claim 27, comprising training the speech recognition process which is specific to the player of the online game from whom the speech is received based on speech received from the player of the online game during a training process.