Patents by Inventor Xavier Menendez-Pidal
Xavier Menendez-Pidal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11524245Abstract: Methods and systems for improving engagement metrics of a spectator include identifying a group of spectators watching game play of a video game and generating an aggregate group profile for the group. Engagement metrics for the group are analyzed to identify engagement level of the group toward the game play of the player. One or more suggestions are provided to adjust game play of the video game so as to improve engagement level of the group toward the game play of the video game.Type: GrantFiled: June 19, 2020Date of Patent: December 13, 2022Assignee: Sony Interactive Entertainment Inc.Inventors: Steven Osman, Saket Kumar, Yuichiro Nakamura, Katrine Chow, Xavier Menendez-Pidal
-
Patent number: 11406907Abstract: A method is provided, including the following operations: analyzing gameplay data and spectator data from previous sessions of a video game, wherein the analyzing is configured to correlate a spectator preference to a gameplay condition of the video game; using the correlated spectator preference to generate an in-game recommendation for the gameplay condition; identifying the gameplay condition occurring in a current session of the video game; responsive to identifying the gameplay condition occurring in the current session, then presenting the in-game recommendations to a player of the current session of the video game.Type: GrantFiled: March 31, 2020Date of Patent: August 9, 2022Assignee: Sony Interactive Entertainment Inc.Inventors: Katrine Chow, Saket Kumar, Steven Osman, Yuichiro Nakamura, Xavier Menendez-Pidal
-
Publication number: 20210394073Abstract: Methods and systems for improving engagement metrics of a spectator include identifying a group of spectators watching game play of a video game and generating an aggregate group profile for the group. Engagement metrics for the group are analyzed to identify engagement level of the group toward the game play of the player. One or more suggestions are provided to adjust game play of the video game so as to improve engagement level of the group toward the game play of the video game.Type: ApplicationFiled: June 19, 2020Publication date: December 23, 2021Inventors: Steven Osman, Saket Kumar, Yuichiro Nakamura, Katrine Chow, Xavier Menendez-Pidal
-
Publication number: 20210299580Abstract: A method is provided, including the following operations: analyzing gameplay data and spectator data from previous sessions of a video game, wherein the analyzing is configured to correlate a spectator preference to a gameplay condition of the video game; using the correlated spectator preference to generate an in-game recommendation for the gameplay condition; identifying the gameplay condition occurring in a current session of the video game; responsive to identifying the gameplay condition occurring in the current session, then presenting the in-game recommendations to a player of the current session of the video game.Type: ApplicationFiled: March 31, 2020Publication date: September 30, 2021Inventors: Katrine Chow, Saket Kumar, Steven Osman, Yuichiro Nakamura, Xavier Menendez-Pidal
-
Patent number: 10714076Abstract: A method for improved initialization of speech recognition system comprises mapping a trained hidden markov model based recognition node network (HMM) to a Connectionist Temporal Classification (CTC) based node label scheme. The central state of each frame in the HMM are mapped to CTC-labeled output nodes and the non-central states of each frame are mapped to CTC-blank nodes to generate a CTC-labeled HMM and each central state represents a phoneme from human speech detected and extracted by a computing device. Next the CTC-labeled HMM is trained using a cost function, wherein the cost function is not part of a CTC cost function. Finally the CTC-labeled HMM is trained using a CTC cost function to produce a CTC node network. The CTC node network may be iteratively trained by repeating the initialization steps.Type: GrantFiled: July 10, 2017Date of Patent: July 14, 2020Assignee: Sony Interactive Entertainment Inc.Inventors: Xavier Menendez-Pidal, Ruxin Chen
-
Publication number: 20190341025Abstract: A system and method for multimodal classification of user characteristics is described. The method comprises receiving audio and other inputs, extracting fundamental frequency information from the audio input, extracting other feature information from the video input, classifying the fundamental frequency information, textual information and video feature information using the multimodal neural network.Type: ApplicationFiled: April 15, 2019Publication date: November 7, 2019Inventors: Masanori Omote, Ruxin Chen, Xavier Menendez-Pidal, Jaekwon Yoo, Koji Tashiro, Sudha Krishnamurthy, Komath Naveen Kumar
-
Patent number: 10376785Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video camera, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.Type: GrantFiled: June 30, 2016Date of Patent: August 13, 2019Assignee: SONY INTERACTIVE ENTERTAINMENT INC.Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric J. Larsen, Xiaodong Mao
-
Publication number: 20190013015Abstract: A method for improved initialization of speech recognition system comprises mapping a trained hidden markov model based recognition node network (HMM) to a Connectionist Temporal Classification (CTC) based node label scheme. The central state of each frame in the HMM are mapped to CTC-labeled output nodes and the non-central states of each frame are mapped to CTC-blank nodes to generate a CTC-labeled HMM and each central state represents a phoneme from human speech detected and extracted by a computing device. Next the CTC-labeled HMM is trained using a cost function, wherein the cost function is not part of a CTC cost function. Finally the CTC-labeled HMM is trained using a CTC cost function to produce a CTC node network. The CTC node network may be iteratively trained by repeating the initialization steps.Type: ApplicationFiled: July 10, 2017Publication date: January 10, 2019Inventors: Xavier Menendez-Pidal, Ruxin Chen
-
Publication number: 20160310847Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video camera, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.Type: ApplicationFiled: June 30, 2016Publication date: October 27, 2016Applicant: Sony Interactive Entertainment Inc.Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric J. Larsen, Xiaodong Mao
-
Patent number: 9405363Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video camera, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.Type: GrantFiled: August 13, 2014Date of Patent: August 2, 2016Assignee: SONY INTERACTIVE ENTERTAINMENT INC. (SIEI)Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric J. Larsen, Xiaodong Mao
-
Publication number: 20140347272Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video camera, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.Type: ApplicationFiled: August 13, 2014Publication date: November 27, 2014Applicant: Sony Computer Entertainment Inc.Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric J. Larsen, Xiaodong Mao
-
Patent number: 8825482Abstract: Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video camera, GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.Type: GrantFiled: September 15, 2006Date of Patent: September 2, 2014Assignee: Sony Computer Entertainment Inc.Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Steven Osman, Ruxin Chen, Rishi Deshpande, Care Michaud-Wideman, Richard Marks, Eric Larsen, Xiaodong Mao
-
Patent number: 8788256Abstract: Computer implemented speech processing generates one or more pronunciations of an input word in a first language by a non-native speaker of the first language who is a native speaker of a second language. The input word is converted into one or more pronunciations. Each pronunciation includes one or more phonemes selected from a set of phonemes associated with the second language. Each pronunciation is associated with the input word in an entry in a computer database. Each pronunciation in the database is associated with information identifying a pronunciation language and/or a phoneme language.Type: GrantFiled: February 2, 2010Date of Patent: July 22, 2014Assignee: Sony Computer Entertainment Inc.Inventors: Ruxin Chen, Gustavo Hernandez-Abrego, Masanori Omote, Xavier Menendez-Pidal
-
Patent number: 8719023Abstract: An apparatus to improve robustness to environmental changes of a context dependent speech recognizer for an application, that includes a training database to store sounds for speech recognition training, a dictionary to store words supported by the speech recognizer, and a speech recognizer training module to train a set of one or more multiple state Hidden Markov Models (HMMs) with use of the training database and the dictionary. The speech recognizer training module performs a non-uniform state clustering process on each of the states of each HMM, which includes using a different non-uniform cluster threshold for at least some of the states of each HMM to more heavily cluster and correspondingly reduce a number of observation distributions for those of the states of each HMM that are less empirically affected by one or more contextual dependencies.Type: GrantFiled: May 21, 2010Date of Patent: May 6, 2014Assignee: Sony Computer Entertainment Inc.Inventors: Xavier Menendez-Pidal, Ruxin Chen
-
Publication number: 20110288869Abstract: An apparatus to improve robustness to environmental changes of a context dependent speech recognizer for an application, that includes a training database to store sounds for speech recognition training, a dictionary to store words supported by the speech recognizer, and a speech recognizer training module to train a set of one or more multiple state Hidden Markov Models (HMMs) with use of the training database and the dictionary. The speech recognizer training module performs a non-uniform state clustering process on each of the states of each HMM, which includes using a different non-uniform cluster threshold for at least some of the states of each HMM to more heavily cluster and correspondingly reduce a number of observation distributions for those of the states of each HMM that are less empirically affected by one or more contextual dependencies.Type: ApplicationFiled: May 21, 2010Publication date: November 24, 2011Inventors: Xavier Menendez-Pidal, Ruxin Chen
-
Publication number: 20100211376Abstract: Computer implemented speech processing generates one or more pronunciations of an input word in a first language by a non-native speaker of the first language who is a native speaker of a second language. The input word is converted into one or more pronunciations. Each pronunciation includes one or more phonemes selected from a set of phonemes associated with the second language. Each pronunciation is associated with the input word in an entry in a computer database. Each pronunciation in the database is associated with information identifying a pronunciation language and/or a phoneme language.Type: ApplicationFiled: February 2, 2010Publication date: August 19, 2010Applicant: Sony Computer Entertainment Inc.Inventors: Ruxin Chen, Gustavo Hernandez-Abrego, Masanori Omote, Xavier Menendez-Pidal
-
Patent number: 7716047Abstract: A system and method for an automatic set-up of speech recognition engines may include a speech recognizer configured to perform speech recognition procedures to identify input speech data according to one or more operating parameters. A merit manager may be utilized to automatically calculate merit values corresponding to the foregoing recognition procedures. These merit values may incorporate recognition accuracy information, recognition speed information, and a user-specified weighting factor that shifts the relative effect of the recognition accuracy information and the recognition speed information on the merit values. The merit manager may then automatically perform a merit value optimization procedure to select operating parameters that correspond to an optimal one of the merit values.Type: GrantFiled: March 31, 2003Date of Patent: May 11, 2010Assignees: Sony Corporation, Sony Electronics Inc.Inventors: Gustavo Hernandez-Abrego, Xavier Menendez-Pidal, Thomas Kemp, Katsuki Minamino, Helmut Lucke
-
Patent number: 7502731Abstract: The present invention comprises a system and method for speech recognition utilizing a multi-language dictionary, and may include a recognizer that is configured to compare input speech data to a series of dictionary entries from the multi-language dictionary to detect a recognized phrase or command. The multi-language dictionary may be implemented with a mixed-language technique that utilizes dictionary entries which incorporate multiple different languages such as Cantonese and English. The speech recognizer may thus advantageously achieve more accurate speech recognition accuracy in an efficient and compact manner.Type: GrantFiled: August 11, 2003Date of Patent: March 10, 2009Assignees: Sony Corporation, Sony Electronics Inc.Inventors: Michael Emonts, Xavier Menendez-Pidal, Lex Olorenshaw
-
Patent number: 7467086Abstract: A system and method for effectively performing speech recognition procedures includes enhanced demiphone acoustic models that a speech recognition engine utilizes to perform the speech recognition procedures. The enhanced demiphone acoustic models each have three states that are collectively arranged to form a preceding demiphone and a succeeding demiphone. An acoustic model generator may utilize a decision tree for analyzing speech context information from a training database. The acoustic model generator then effectively configures each of the enhanced demiphone acoustic models as either a succeeding-dominant enhanced demiphone acoustic model or a preceding-dominant enhanced demiphone acoustic model to accurately model speech characteristics.Type: GrantFiled: December 16, 2004Date of Patent: December 16, 2008Assignees: Sony Corporation, Sony Electronics Inc.Inventors: Xavier Menendez-Pidal, Lex S. Olorenshaw, Gustavo Hernandez Abrego
-
Patent number: 7392186Abstract: A system and method for effectively implementing an optimized language model for speech recognition includes initial language models each created by combining source models according to selectable interpolation coefficients that define proportional relationships for combining the source models. A rescoring module iteratively utilizes the initial language models to process input development data for calculating word-error rates that each correspond to a different one of the initial language models. An optimized language model is then selected from the initial language models by identifying an optimal word-error rate from among the foregoing word-error rates. The speech recognizer may then utilize the optimized language model for effectively performing various speech recognition procedures.Type: GrantFiled: March 30, 2004Date of Patent: June 24, 2008Assignees: Sony Corporation, Sony Electronics Inc.Inventors: Lei Duan, Gustavo Abrego, Xavier Menendez-Pidal, Lex Olorenshaw