Patents by Inventor Kazumi Aoyama

Kazumi Aoyama has filed patent applications for the following inventions. This listing includes both pending applications and patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210027779
    Abstract: [Technical Problem] To more highly accurately detect an operation target. [Solution to Problem] Provided is an information processing device including: a determination unit configured to determine whether an object that outputs voice is a dialogue target related to voice dialogue based on a result of recognition of an input image; and a dialogue function unit configured to perform control related to the voice dialogue based on determination by the determination unit. The dialogue function unit provides a voice dialogue function to the object based on a fact that it is determined by the determination unit that the object is the dialogue target. Additionally provided is an information processing method executed by a processor, the method including: determining whether an object that outputs voice is a dialogue target related to voice dialogue based on a result of recognition of an input image; and performing control related to the voice dialogue based on a result of the determining.
    Type: Application
    Filed: January 23, 2019
    Publication date: January 28, 2021
    Inventors: Hiromi KURASAWA, Kazumi AOYAMA, Yasuharu ASANO
  • Publication number: 20200219487
    Abstract: The present technology relates to an information processing apparatus and an information processing method that make it possible to provide a speech interaction of higher convenience. An information processing apparatus including a processor is provided, thereby making it possible to provide a speech interaction of higher convenience. The processor couples utterances before and after a pause included in an utterance of a user in accordance with a matching degree between the utterances before and after in terms of a semantic unit. For example, the present technology is applicable to a speech dialog system.
    Type: Application
    Filed: July 27, 2018
    Publication date: July 9, 2020
    Applicant: Sony Corporation
    Inventors: Takashi SHIBUYA, Kazumi AOYAMA, Katsuki MINAMINO
  • Patent number: 10110817
    Abstract: Provided is an image processing device including an acquisition unit configured to acquire information on an imaging position and an imaging direction in units of frame images that constitute a moving image obtained through capturing by an imaging unit, a converted image generation unit configured to generate converted images having different imaging directions for each frame image that constitutes the moving image based on the frame image itself and preceding and succeeding frame images of the frame image, an evaluation value calculation unit configured to calculate an evaluation value for each converted moving image constituted by combining the converted image and the original frame image, the evaluation value being used to evaluate a blur between the converted images or between the original frame images, and a selection unit configured to select a converted moving image with less blur based on an evaluation value calculated by the evaluation value calculation unit.
    Type: Grant
    Filed: December 27, 2016
    Date of Patent: October 23, 2018
    Assignee: Sony Corporation
    Inventors: Yasuhiro Sutou, Hideki Shimomura, Atsushi Okubo, Kazumi Aoyama, Akichika Tanaka
  • Patent number: 9830352
    Abstract: There is provided an information processing device including a storage unit configured to store identification data and attribute data of each of a plurality of pieces of content, the attribute data being associated with the identification data, and a retrieval unit configured to specify attribute data corresponding to a retrieval key and perform retrieval of identification data related to another attribute data associated with identification data related to the specified attribute data.
    Type: Grant
    Filed: December 31, 2014
    Date of Patent: November 28, 2017
    Assignee: Sony Corporation
    Inventors: Michael Spranger, Kazumi Aoyama, Mario Tokoro, Tetsu Natsume, Katsuki Minamino
  • Publication number: 20170337921
    Abstract: There is provided an information processing device to control response to a sound input in a preferred mode corresponding to a change in a situation or a user, the information processing device including: a control unit configured to control output of a response to speech of a user in accordance with acquired information regarding a speech state of the user.
    Type: Application
    Filed: November 26, 2015
    Publication date: November 23, 2017
    Applicant: SONY CORPORATION
    Inventors: Kazumi AOYAMA, Yoko ITO
  • Publication number: 20170111586
    Abstract: Provided is an image processing device including an acquisition unit configured to acquire information on an imaging position and an imaging direction in units of frame images that constitute a moving image obtained through capturing by an imaging unit, a converted image generation unit configured to generate converted images having different imaging directions for each frame image that constitutes the moving image based on the frame image itself and preceding and succeeding frame images of the frame image, an evaluation value calculation unit configured to calculate an evaluation value for each converted moving image constituted by combining the converted image and the original frame image, the evaluation value being used to evaluate a blur between the converted images or between the original frame images, and a selection unit configured to select a converted moving image with less blur based on an evaluation value calculated by the evaluation value calculation unit.
    Type: Application
    Filed: December 27, 2016
    Publication date: April 20, 2017
    Applicant: SONY CORPORATION
    Inventors: Yasuhiro SUTOU, Hideki SHIMOMURA, Atsushi OKUBO, Kazumi AOYAMA, Akichika TANAKA
  • Patent number: 9569823
    Abstract: Provided is an image processing device including an acquisition unit configured to acquire information on an imaging position and an imaging direction in units of frame images that constitute a moving image obtained through capturing by an imaging unit, a converted image generation unit configured to generate converted images having different imaging directions for each frame image that constitutes the moving image based on the frame image itself and preceding and succeeding frame images of the frame image, an evaluation value calculation unit configured to calculate an evaluation value for each converted moving image constituted by combining the converted image and the original frame image, the evaluation value being used to evaluate a blur between the converted images or between the original frame images, and a selection unit configured to select a converted moving image with less blur based on an evaluation value calculated by the evaluation value calculation unit.
    Type: Grant
    Filed: December 12, 2013
    Date of Patent: February 14, 2017
    Assignee: Sony Corporation
    Inventors: Yasuhiro Sutou, Hideki Shimomura, Atsushi Okubo, Kazumi Aoyama, Akichika Tanaka
  • Publication number: 20150348243
    Abstract: Provided is an image processing device including an acquisition unit configured to acquire information on an imaging position and an imaging direction in units of frame images that constitute a moving image obtained through capturing by an imaging unit, a converted image generation unit configured to generate converted images having different imaging directions for each frame image that constitutes the moving image based on the frame image itself and preceding and succeeding frame images of the frame image, an evaluation value calculation unit configured to calculate an evaluation value for each converted moving image constituted by combining the converted image and the original frame image, the evaluation value being used to evaluate a blur between the converted images or between the original frame images, and a selection unit configured to select a converted moving image with less blur based on an evaluation value calculated by the evaluation value calculation unit.
    Type: Application
    Filed: December 12, 2013
    Publication date: December 3, 2015
    Applicant: SONY CORPORATION
    Inventors: Yasuhiro SUTOU, Hideki SHIMOMURA, Atsushi OKUBO, Kazumi AOYAMA, Akichika TANAKA
  • Publication number: 20150331486
    Abstract: Provided is an image processing device including an eye-gaze direction detection unit configured to detect an eye-gaze direction of a user toward an image, an estimation unit configured to estimate a gaze area in the image on the basis of the eye-gaze direction and the image, the eye-gaze direction being detected by the eye-gaze direction detection unit, a chased object detection unit configured to detect a chased object being eye-chased by the user in the image, on the basis of the time-series gaze areas estimated by the estimation unit, a tracking unit configured to search for and track the chased object detected by the chased object detection unit, and an image control unit configured to control an image of the chased object tracked by the tracking unit.
    Type: Application
    Filed: December 12, 2013
    Publication date: November 19, 2015
    Applicant: SONY CORPORATION
    Inventors: Atsushi OKUBO, Hideki SHIMOMURA, Kazumi AOYAMA, Yasuhiro SUTOU, Akichika TANAKA
  • Publication number: 20150227580
    Abstract: There is provided an information processing device including a storage unit configured to store identification data and attribute data of each of a plurality of pieces of content, the attribute data being associated with the identification data, and a retrieval unit configured to specify attribute data corresponding to a retrieval key and perform retrieval of identification data related to another attribute data associated with identification data related to the specified attribute data.
    Type: Application
    Filed: December 31, 2014
    Publication date: August 13, 2015
    Applicant: SONY CORPORATION
    Inventors: Michael SPRANGER, Kazumi AOYAMA, Mario TOKORO, Tetsu NATSUME, Katsuki MINAMINO
  • Patent number: 8560467
    Abstract: A data processing apparatus includes an obtaining unit for obtaining time-series data, an activity model learning unit for learning an activity model representing a user activity state as a stochastic state transition model from the obtained time-series data, a recognition unit for recognizing a current user activity state by using the learned activity model, and a prediction unit for predicting a user activity state after a predetermined time elapses from a current time from the recognized current user activity state, wherein the prediction unit predicts the user activity state as an occurrence probability, and calculates the occurrence probabilities of the respective states on the basis of the state transition probability of the stochastic state transition model to predict the user activity state, while it is presumed that observation probabilities of the respective states at the respective times of the stochastic state transition model are an equal probability.
    Type: Grant
    Filed: July 19, 2010
    Date of Patent: October 15, 2013
    Assignee: Sony Corporation
    Inventors: Masato Ito, Kohtaro Sabe, Hirotaka Suzuki, Jun Yokono, Kazumi Aoyama, Takashi Hasuo
  • Patent number: 8538750
    Abstract: This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.
    Type: Grant
    Filed: November 2, 2012
    Date of Patent: September 17, 2013
    Assignee: Sony Corporation
    Inventors: Kazumi Aoyama, Hideki Shimomura
  • Patent number: 8416998
    Abstract: An information processing device includes an imaging unit configured to perform imaging of one of the object person and a registrant, a first feature amount calculation unit configured to calculate a feature amount of a face of the registrant, a second feature amount calculation unit configured to calculate time series of feature amount of a lip of the registrant, a registration unit configured to register the time series of feature amount of the lip in a database to be associated with the feature amount of the face of the registrant, an identification unit configured to identify the face of the object person, a recognition unit configured to recognize speech content of the object person, and an authentication unit configured to perform personal authentication of the object person based on an identification result of the face and a recognition result of the speech content of the object person.
    Type: Grant
    Filed: February 2, 2011
    Date of Patent: April 9, 2013
    Assignee: Sony Corporation
    Inventors: Kiyoto Ichikawa, Kazumi Aoyama
  • Patent number: 8412715
    Abstract: An information processing apparatus for performing dialogue processing. The apparatus acquires text data described in a natural language and stores a plurality of examples each including an example statement and frame information described using a frame format and corresponding to the example statement. A similarity between the text data and the example statement is calculated. An example is selected corresponding to an example statement whose similarity to the text data is the highest from among the plurality of examples in accordance with the calculated similarity. Text data is converted into the frame format in accordance with the frame information corresponding to the example selected by the example selection. Dialogue processing is performed in accordance with the text data converted into the frame format.
    Type: Grant
    Filed: April 12, 2005
    Date of Patent: April 2, 2013
    Assignee: Sony Corporation
    Inventors: Yasuharu Asano, Keiichi Yamada, Seiichi Aoyagi, Kazumi Aoyama
  • Publication number: 20130060566
    Abstract: This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.
    Type: Application
    Filed: November 2, 2012
    Publication date: March 7, 2013
    Inventors: Kazumi AOYAMA, Hideki SHIMOMURA
  • Patent number: 8321221
    Abstract: This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.
    Type: Grant
    Filed: May 16, 2012
    Date of Patent: November 27, 2012
    Assignee: Sony Corporation
    Inventors: Kazumi Aoyama, Hideki Shimomura
  • Patent number: 8306930
    Abstract: A learning device, learning method, and program for learning a pattern are disclosed. A learning device includes: a plurality of learning modules, each of which performs update learning to update a plurality of model parameters of a pattern learning model that learns a pattern using input data; model parameter sharing means for causing two or more learning modules from among the plurality of learning modules to share the model parameters; and sharing strength updating means for updating sharing strengths between the learning modules so as to minimize learning errors when the plurality of model parameters are updated by the update learning.
    Type: Grant
    Filed: July 7, 2009
    Date of Patent: November 6, 2012
    Assignee: Sony Corporation
    Inventors: Masato Ito, Kazumi Aoyama, Kuniaki Noda
  • Patent number: 8290887
    Abstract: A device for implementing a pattern learning model, the device including a plurality of learning modules, each of which performs update learning to update a plurality of model parameters of the pattern learning model that learns a pattern using input data. The device further including a model parameter sharing means for causing two or more learning modules from among the plurality of learning modules to share the model parameters; and a classification means for classifying the plurality of learning modules on the basis of the plurality of model parameters of each of the learning modules after the update learning.
    Type: Grant
    Filed: July 7, 2009
    Date of Patent: October 16, 2012
    Assignee: Sony Corporation
    Inventors: Masato Ito, Kazumi Aoyama, Kuniaki Noda
  • Publication number: 20120232891
    Abstract: This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.
    Type: Application
    Filed: May 16, 2012
    Publication date: September 13, 2012
    Applicant: SONY CORPORATION
    Inventors: Kazumi AOYAMA, Hideki SHIMOMURA
  • Patent number: 8209179
    Abstract: This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.
    Type: Grant
    Filed: July 2, 2004
    Date of Patent: June 26, 2012
    Assignee: Sony Corporation
    Inventors: Kazumi Aoyama, Hideki Shimomura