Patents by Inventor Runyu Shi

Runyu Shi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200221146
    Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
    Type: Application
    Filed: March 23, 2020
    Publication date: July 9, 2020
    Applicant: Sony Corporation
    Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
  • Patent number: 10692511
    Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality. A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.
    Type: Grant
    Filed: December 12, 2014
    Date of Patent: June 23, 2020
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Runyu Shi
  • Publication number: 20200135216
    Abstract: There is provided a decoding device including at least one circuit configured to acquire one or more encoded audio signals including a plurality of channels and/or a plurality of objects and priority information for each of the plurality of channels and/or the plurality of objects, and to decode the one or more encoded audio signals according to the priority information.
    Type: Application
    Filed: December 24, 2019
    Publication date: April 30, 2020
    Applicant: Sony Corporation
    Inventors: Toru Chinen, Masayuki Nishiguchi, Runyu Shi, Mitsuyuki Hatanaka, Yuki Yamamoto
  • Patent number: 10631025
    Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
    Type: Grant
    Filed: September 28, 2015
    Date of Patent: April 21, 2020
    Assignee: Sony Corporation
    Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
  • Patent number: 10587976
    Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Grant
    Filed: January 15, 2019
    Date of Patent: March 10, 2020
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Patent number: 10573325
    Abstract: There is provided a decoding device comprising at least one buffer and at least one processor. The at least one processor is configured to select, based at least in part on a size of the at least one buffer, at least one audio element from among multiple audio elements in an input bit stream; and generate an audio signal by decoding the at least one audio element.
    Type: Grant
    Filed: June 16, 2015
    Date of Patent: February 25, 2020
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuhiro Hirabayashi
  • Patent number: 10523975
    Abstract: The present disclosure relates to an information processing device and information processing method capable of recognizing an acquisition position of voice data on an image. A web server transmits image frame size information indicating image frame size of image data and audio position information indicating acquisition position of voice data. The present disclosure is applicable to an information processing system or other like system including file generation device, web server, and video playback terminal to perform tiled streaming using a manner compliant with moving picture experts group phase-dynamic adaptive streaming over HTTP (MPEG-DASH).
    Type: Grant
    Filed: July 1, 2014
    Date of Patent: December 31, 2019
    Assignee: SONY CORPORATION
    Inventors: Shinobu Hattori, Mitsuhiro Hirabayashi, Ohji Nakagami, Toru Chinen, Runyu Shi, Minoru Tsuji, Yuki Yamamoto
  • Patent number: 10455345
    Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Grant
    Filed: February 16, 2018
    Date of Patent: October 22, 2019
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Publication number: 20190306648
    Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Application
    Filed: June 18, 2019
    Publication date: October 3, 2019
    Applicant: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Patent number: 10375439
    Abstract: The present disclosure relates to an information processing apparatus and an information processing method which are capable of improving an efficiency of acquiring a predetermined type of audio data among a plurality of types of audio data. Audio data of a predetermined track is acquired in one audio file in which audio data of 3D audio is divided into a plurality of tracks depending on the type of 3D audio and the tracks are arranged, the audio data of each track being successively arranged in the file for a predetermined length of time. The present disclosure is applicable to, for example, an information processing system including a file generation device that generates a file, a Web server that records a file generated by the file generation device, and a video playback terminal that plays back a file.
    Type: Grant
    Filed: May 22, 2015
    Date of Patent: August 6, 2019
    Assignee: SONY CORPORATION
    Inventors: Mitsuhiro Hirabayashi, Toru Chinen, Yuki Yamamoto, Runyu Shi
  • Publication number: 20190149935
    Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Application
    Filed: January 15, 2019
    Publication date: May 16, 2019
    Applicant: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Patent number: 10225677
    Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Grant
    Filed: May 10, 2017
    Date of Patent: March 5, 2019
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Patent number: 10171926
    Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Grant
    Filed: April 11, 2014
    Date of Patent: January 1, 2019
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Publication number: 20180242030
    Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
    Type: Application
    Filed: September 28, 2015
    Publication date: August 23, 2018
    Applicant: Sony Corporation
    Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
  • Publication number: 20180197555
    Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality. A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.
    Type: Application
    Filed: December 12, 2014
    Publication date: July 12, 2018
    Applicant: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Runyu Shi
  • Publication number: 20180184222
    Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
    Type: Application
    Filed: February 16, 2018
    Publication date: June 28, 2018
    Applicant: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
  • Publication number: 20180165358
    Abstract: The present disclosure relates to an information processing apparatus and an information processing method that enable easy reproduction of audio data of a predetermined kind, of audio data of a plurality of kinds. A file generation device generates an audio file in which audio streams of a plurality of groups is divided into tracks for each one or more of the groups and arranged, and information related to the plurality of groups is arranged. The present disclosure can be applied to an information processing system configured from the file generation device that generates a file, a web server that records the file generated by the file generation device, and a moving image reproduction terminal that reproduces the file, for example.
    Type: Application
    Filed: June 30, 2015
    Publication date: June 14, 2018
    Inventors: MITSUHIRO HIRABAYASHI, YUKI YAMAMOTO, TORU CHINEN, RUNYU SHI
  • Patent number: 9998845
    Abstract: The present technology relates to an information processing device and method for allowing a sound image to be localized with higher precision, and a program. When a target sound image is outside a mesh, the target sound image is moved in a vertical direction while a position in a horizontal direction of the target sound image remains fixed, so that the target sound image is present on a boundary of the mesh. Specifically, a mesh detection unit detects a mesh including a position in the horizontal direction of the target sound image. A candidate position calculation unit calculates a position that is a movement target of the target sound image, based on loudspeaker positions that are at opposite ends of an arc of the detected mesh that is a destination, and the position in the horizontal direction of the target sound image. As a result, the target sound image can be moved onto a boundary of the mesh. The present technology is applicable to a sound processing device.
    Type: Grant
    Filed: July 11, 2014
    Date of Patent: June 12, 2018
    Assignee: Sony Corporation
    Inventors: Runyu Shi, Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka
  • Patent number: 9966084
    Abstract: A method and a device for achieving object audio recording and an electronic apparatus are disclosed. The method includes performing a sound collection operation via a plurality of microphones simultaneously to obtain a mixed sound signal. The method also includes identifying the number of sound sources and position information of each sound source and separating out an object sound signal corresponding to each sound source from the mixed sound signal according to the mixed sound signal and set position information of each microphone. The method further includes combining the position information and the object sound signal of individual sound sources to obtain audio data in an object audio format.
    Type: Grant
    Filed: July 18, 2016
    Date of Patent: May 8, 2018
    Assignee: Xiaomi Inc.
    Inventors: Runyu Shi, Chiafu Yen, Hui Du
  • Patent number: 9930467
    Abstract: A sound recording method and device are provided in the field of multimedia processing. The method is applied in a mobile terminal including three microphones, including: acquiring three channels of sound signals collected by the three microphones; calculating a central channel signal, a left channel signal, a right channel signal, a rear left channel signal and a rear right channel signal in a multi-channel surround audio system according to the three channels of sound signals; calculating a bass channel signal in the multi-channel surround audio system according to the three channels of sound signals; and combining the above signals to obtain a sound signal of the multi-channel surround audio system.
    Type: Grant
    Filed: March 2, 2016
    Date of Patent: March 27, 2018
    Assignee: Xiaomi Inc.
    Inventors: Runyu Shi, Dawei Xiong, Weishan Li