Patents by Inventor Runyu Shi
Runyu Shi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20200221146
Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
Type: Application
Filed: March 23, 2020
Publication date: July 9, 2020
Applicant: Sony Corporation
Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
-
Patent number: 10692511
Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality. A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.
Type: Grant
Filed: December 12, 2014
Date of Patent: June 23, 2020
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Runyu Shi
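The interpolation step in this abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say which non-linear curve is used, so the cubic Hermite curve below is an assumption, and the names `interpolate_gain`, `apply_gain`, and the mode strings are hypothetical.

```python
def interpolate_gain(g0, g1, s0, s1, n, mode="linear"):
    """Return n + 1 gain values covering sample offsets 0..n inclusive.

    g0, g1: gain values at the two gain sample positions.
    s0, s1: gain inclination values (slope per sample) at those positions.
    mode:   "linear" ignores the inclinations; "hermite" is an ASSUMED
            non-linear mode that fits a cubic Hermite curve through both
            gain values with the given slopes.
    """
    if n <= 0:
        raise ValueError("n must be a positive number of samples")
    gains = []
    for k in range(n + 1):
        t = k / n  # normalized position between the two gain sample positions
        if mode == "linear":
            g = g0 + (g1 - g0) * t
        elif mode == "hermite":
            # Standard cubic Hermite basis functions on [0, 1].
            h00 = 2 * t**3 - 3 * t**2 + 1
            h10 = t**3 - 2 * t**2 + t
            h01 = -2 * t**3 + 3 * t**2
            h11 = t**3 - t**2
            # Slopes are per sample, so scale them by the interval length n.
            g = h00 * g0 + h10 * n * s0 + h01 * g1 + h11 * n * s1
        else:
            raise ValueError("unknown interpolation mode: " + mode)
        gains.append(g)
    return gains


def apply_gain(signal, gains):
    """Multiply each sample of the time series signal by its gain value."""
    return [x * g for x, g in zip(signal, gains)]
```

When both inclinations equal the straight-line slope `(g1 - g0) / n`, the Hermite curve in this sketch reduces to the linear case, which is a convenient sanity check.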
-
Publication number: 20200135216
Abstract: There is provided a decoding device including at least one circuit configured to acquire one or more encoded audio signals including a plurality of channels and/or a plurality of objects and priority information for each of the plurality of channels and/or the plurality of objects, and to decode the one or more encoded audio signals according to the priority information.
Type: Application
Filed: December 24, 2019
Publication date: April 30, 2020
Applicant: Sony Corporation
Inventors: Toru Chinen, Masayuki Nishiguchi, Runyu Shi, Mitsuyuki Hatanaka, Yuki Yamamoto
-
Patent number: 10631025
Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
Type: Grant
Filed: September 28, 2015
Date of Patent: April 21, 2020
Assignee: Sony Corporation
Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
-
Patent number: 10587976
Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Grant
Filed: January 15, 2019
Date of Patent: March 10, 2020
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
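The two-stage panning described in this abstract can be sketched as follows. This is an illustration under stated assumptions rather than the patented procedure: the virtual speaker is placed at the normalized midpoint of the two lower speakers (the abstract only says it lies on the lower side), positive azimuth is taken to mean "left", the final gains are power-normalized, and all function names are hypothetical.

```python
import math


def unit(az_deg, el_deg):
    """Unit vector for an azimuth/elevation pair given in degrees."""
    az, el = math.radians(az_deg), math.radians(el_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def det3(a, b, c):
    """Determinant of the 3x3 matrix whose rows are a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def vbap3(p, l1, l2, l3):
    """Three-dimensional VBAP: solve p = g1*l1 + g2*l2 + g3*l3 (Cramer's rule)."""
    d = det3(l1, l2, l3)
    return (det3(p, l2, l3) / d, det3(l1, p, l3) / d, det3(l1, l2, p) / d)


def vbap2(p, l1, l2):
    """Two-dimensional VBAP: solve p = a*l1 + b*l2 for p in the span of l1, l2."""
    c = dot(l1, l2)
    d = 1.0 - c * c  # Gram determinant for unit vectors
    return ((dot(l1, p) - c * dot(l2, p)) / d, (dot(l2, p) - c * dot(l1, p)) / d)


def pan_four_speakers(target, upper_left, upper_right, lower_left, lower_right):
    """Return gains ordered (upper_left, upper_right, lower_left, lower_right)."""
    # ASSUMPTION: virtual speaker at the normalized midpoint of the lower side.
    mid = tuple(a + b for a, b in zip(lower_left, lower_right))
    length = math.sqrt(dot(mid, mid))
    virtual = tuple(x / length for x in mid)
    # Stage 1: 3D VBAP over the two upper speakers and the virtual speaker.
    g_ul, g_ur, g_v = vbap3(target, upper_left, upper_right, virtual)
    # Stage 2: 2D VBAP that renders the virtual speaker with the lower pair.
    a, b = vbap2(virtual, lower_left, lower_right)
    gains = [g_ul, g_ur, g_v * a, g_v * b]
    norm = math.sqrt(sum(g * g for g in gains))
    return [g / norm for g in gains]


if __name__ == "__main__":
    speakers = [unit(30, 30), unit(-30, 30), unit(30, 0), unit(-30, 0)]
    print(pan_four_speakers(unit(0, 15), *speakers))
```

For a target centered between the speakers, the left and right gains in each row come out equal, which is the behavior expected from a symmetric layout.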
-
Patent number: 10573325
Abstract: There is provided a decoding device comprising at least one buffer and at least one processor. The at least one processor is configured to select, based at least in part on a size of the at least one buffer, at least one audio element from among multiple audio elements in an input bit stream; and generate an audio signal by decoding the at least one audio element.
Type: Grant
Filed: June 16, 2015
Date of Patent: February 25, 2020
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuhiro Hirabayashi
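One way to picture buffer-aware selection is a simple greedy pass, sketched below. The abstract does not describe the selection rule, so everything here is an assumption made for illustration: the `AudioElement` fields, the per-element `priority`, and the greedy strategy itself are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class AudioElement:
    name: str
    size_bytes: int  # hypothetical: encoded size of the element
    priority: int    # hypothetical: larger means more important


def select_elements(elements, buffer_size):
    """Greedily keep the most important elements that still fit in the buffer."""
    chosen = []
    remaining = buffer_size
    # sorted() is stable, so equal priorities keep their bit stream order.
    for element in sorted(elements, key=lambda e: -e.priority):
        if element.size_bytes <= remaining:
            chosen.append(element)
            remaining -= element.size_bytes
    return chosen


if __name__ == "__main__":
    stream = [
        AudioElement("dialog", 300, 3),
        AudioElement("music", 500, 2),
        AudioElement("effects", 400, 1),
    ]
    print([e.name for e in select_elements(stream, 700)])
```

A decoder following this sketch would then decode only the chosen elements; a real implementation would likely also account for decoder state and timing, which are left out here.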
-
Patent number: 10523975
Abstract: The present disclosure relates to an information processing device and information processing method capable of recognizing an acquisition position of voice data on an image. A web server transmits image frame size information indicating image frame size of image data and audio position information indicating acquisition position of voice data. The present disclosure is applicable to an information processing system or other like system including file generation device, web server, and video playback terminal to perform tiled streaming using a manner compliant with moving picture experts group phase-dynamic adaptive streaming over HTTP (MPEG-DASH).
Type: Grant
Filed: July 1, 2014
Date of Patent: December 31, 2019
Assignee: SONY CORPORATION
Inventors: Shinobu Hattori, Mitsuhiro Hirabayashi, Ohji Nakagami, Toru Chinen, Runyu Shi, Minoru Tsuji, Yuki Yamamoto
-
Patent number: 10455345
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Grant
Filed: February 16, 2018
Date of Patent: October 22, 2019
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Publication number: 20190306648
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Application
Filed: June 18, 2019
Publication date: October 3, 2019
Applicant: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Patent number: 10375439
Abstract: The present disclosure relates to an information processing apparatus and an information processing method which are capable of improving an efficiency of acquiring a predetermined type of audio data among a plurality of types of audio data. Audio data of a predetermined track is acquired in one audio file in which audio data of 3D audio is divided into a plurality of tracks depending on the type of 3D audio and the tracks are arranged, the audio data of each track being successively arranged in the file for a predetermined length of time. The present disclosure is applicable to, for example, an information processing system including a file generation device that generates a file, a Web server that records a file generated by the file generation device, and a video playback terminal that plays back a file.
Type: Grant
Filed: May 22, 2015
Date of Patent: August 6, 2019
Assignee: SONY CORPORATION
Inventors: Mitsuhiro Hirabayashi, Toru Chinen, Yuki Yamamoto, Runyu Shi
-
Publication number: 20190149935
Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Application
Filed: January 15, 2019
Publication date: May 16, 2019
Applicant: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Patent number: 10225677
Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Grant
Filed: May 10, 2017
Date of Patent: March 5, 2019
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Patent number: 10171926
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Grant
Filed: April 11, 2014
Date of Patent: January 1, 2019
Assignee: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Publication number: 20180242030
Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
Type: Application
Filed: September 28, 2015
Publication date: August 23, 2018
Applicant: Sony Corporation
Inventors: Minoru Tsuji, Toru Chinen, Runyu Shi, Masayuki Nishiguchi, Yuki Yamamoto
-
Publication number: 20180197555
Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality. A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.
Type: Application
Filed: December 12, 2014
Publication date: July 12, 2018
Applicant: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Hiroyuki Honma, Runyu Shi
-
Publication number: 20180184222
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image. A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker.
Type: Application
Filed: February 16, 2018
Publication date: June 28, 2018
Applicant: Sony Corporation
Inventors: Yuki Yamamoto, Toru Chinen, Runyu Shi, Mitsuyuki Hatanaka
-
Publication number: 20180165358
Abstract: The present disclosure relates to an information processing apparatus and an information processing method that enable easy reproduction of audio data of a predetermined kind, of audio data of a plurality of kinds. A file generation device generates an audio file in which audio streams of a plurality of groups is divided into tracks for each one or more of the groups and arranged, and information related to the plurality of groups is arranged. The present disclosure can be applied to an information processing system configured from the file generation device that generates a file, a web server that records the file generated by the file generation device, and a moving image reproduction terminal that reproduces the file, for example.
Type: Application
Filed: June 30, 2015
Publication date: June 14, 2018
Inventors: MITSUHIRO HIRABAYASHI, YUKI YAMAMOTO, TORU CHINEN, RUNYU SHI
-
Patent number: 9998845
Abstract: The present technology relates to an information processing device and method for allowing a sound image to be localized with higher precision, and a program. When a target sound image is outside a mesh, the target sound image is moved in a vertical direction while a position in a horizontal direction of the target sound image remains fixed, so that the target sound image is present on a boundary of the mesh. Specifically, a mesh detection unit detects a mesh including a position in the horizontal direction of the target sound image. A candidate position calculation unit calculates a position that is a movement target of the target sound image, based on loudspeaker positions that are at opposite ends of an arc of the detected mesh that is a destination, and the position in the horizontal direction of the target sound image. As a result, the target sound image can be moved onto a boundary of the mesh. The present technology is applicable to a sound processing device.
Type: Grant
Filed: July 11, 2014
Date of Patent: June 12, 2018
Assignee: Sony Corporation
Inventors: Runyu Shi, Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka
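The vertical move onto a mesh boundary can be sketched geometrically. The sketch assumes that the boundary arc is a great-circle arc between the two loudspeakers and that positions are azimuth/elevation pairs in degrees; neither detail is stated in the abstract, and the function names are hypothetical.

```python
import math


def unit(az_deg, el_deg):
    """Unit vector for an azimuth/elevation pair given in degrees."""
    az, el = math.radians(az_deg), math.radians(el_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def elevation_on_arc(az_deg, speaker_a, speaker_b):
    """Elevation (degrees) of the point on the arc that has azimuth az_deg.

    speaker_a and speaker_b are (azimuth, elevation) pairs at the ends of the
    arc. ASSUMPTION: the arc lies on the great circle through both speakers.
    """
    nx, ny, nz = cross(unit(*speaker_a), unit(*speaker_b))
    if abs(nz) < 1e-12:
        raise ValueError("arc is vertical; elevation is not unique for one azimuth")
    az = math.radians(az_deg)
    # A point p(az, el) lies on the great circle when n . p = 0, which gives
    # tan(el) = -(nx*cos(az) + ny*sin(az)) / nz.
    tan_el = -(nx * math.cos(az) + ny * math.sin(az)) / nz
    return math.degrees(math.atan(tan_el))


def move_onto_boundary(target, speaker_a, speaker_b):
    """Keep the target's azimuth and replace its elevation with the arc's."""
    az_deg, _ = target
    return (az_deg, elevation_on_arc(az_deg, speaker_a, speaker_b))
```

The caller is expected to have already chosen an arc whose azimuth range contains the target's azimuth, which corresponds to the mesh detection step in the abstract.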
-
Patent number: 9966084
Abstract: A method and a device for achieving object audio recording and an electronic apparatus are disclosed. The method includes performing a sound collection operation via a plurality of microphones simultaneously to obtain a mixed sound signal. The method also includes identifying the number of sound sources and position information of each sound source and separating out an object sound signal corresponding to each sound source from the mixed sound signal according to the mixed sound signal and set position information of each microphone. The method further includes combining the position information and the object sound signal of individual sound sources to obtain audio data in an object audio format.
Type: Grant
Filed: July 18, 2016
Date of Patent: May 8, 2018
Assignee: Xiaomi Inc.
Inventors: Runyu Shi, Chiafu Yen, Hui Du
-
Patent number: 9930467
Abstract: A sound recording method and device are provided in the field of multimedia processing. The method is applied in a mobile terminal including three microphones, including: acquiring three channels of sound signals collected by the three microphones; calculating a central channel signal, a left channel signal, a right channel signal, a rear left channel signal and a rear right channel signal in a multi-channel surround audio system according to the three channels of sound signals; calculating a bass channel signal in the multi-channel surround audio system according to the three channels of sound signals; and combining the above signals to obtain a sound signal of the multi-channel surround audio system.
Type: Grant
Filed: March 2, 2016
Date of Patent: March 27, 2018
Assignee: Xiaomi Inc.
Inventors: Runyu Shi, Dawei Xiong, Weishan Li