Patents by Inventor Masahiro Oshikiri

Masahiro Oshikiri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10685660
    Abstract: Provided are a voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method that efficiently perform bit distribution and improve sound quality. Dominant frequency band identification unit identifies a dominant frequency band having a norm factor value that is the maximum value within the spectrum of an input voice audio signal. Dominant group determination units and non-dominant group determination unit group all sub-bands into a dominant group that contains the dominant frequency band and a non-dominant group that contains no dominant frequency band. Group bit distribution unit distributes bits to each group on the basis of the energy and norm variance of each group. Sub-band bit distribution unit redistributes the bits that have been distributed to each group to each sub-band in accordance with the ratio of the norm to the energy of the groups.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: June 16, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Zongxian Liu, Srikanth Nagisetty, Masahiro Oshikiri
  • Patent number: 10629218
    Abstract: A coding apparatus includes a processor and a memory that stores instructions, which when executed causes the processor to perform operations, including encoding a first band of an input audio signal to be a first spectrum and dividing the first spectrum into a plurality of sub-bands. The operations also include searching a largest amplitude value of the divided first spectrum in each of the plurality of sub-bands, and normalizing the divided first spectrum in each of the plurality of sub-bands. The operations further include emphasizing a harmonic structure in the normalized first spectrum, and searching a best band that has a largest correlation value between each divided band of a second band spectrum and the emphasized first spectrum in which the harmonic structure is emphasized, and encoding the second band spectrum using lag information identifying the best band and transmitting the lag information to a decoder side.
    Type: Grant
    Filed: March 1, 2019
    Date of Patent: April 21, 2020
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Katsunori Daimou, Masahiro Oshikiri
  • Patent number: 10607617
    Abstract: A coding apparatus, including a memory and a processor that, when executing instructions stored in the memory, performs operations including encoding low-band transform coefficients in a first band and calculating, for each extension-band subband obtained by splitting an extension band, a threshold amplitude based on an analysis of statistics on extension-band transform coefficients included in the subband.
    Type: Grant
    Filed: November 19, 2018
    Date of Patent: March 31, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Takuya Kawashima, Masahiro Oshikiri
  • Patent number: 10515648
    Abstract: An audio/speech encoding method is provided that includes transforming a time domain input signal to a frequency spectrum, and dividing the frequency spectrum to a plural of bands. The method also includes calculating a level of energies for each band, quantizing the energies for the each band, and calculating differential indices. The method additionally includes modifying a range of the differential indices for the Nth band when N is an integer of 2 or more, and replacing the differential index with the modified differential index, and not modifying a range of the differential indices for the Nth band when N is an integer of 1. The method further includes encoding the differential indices using a Huffman table selected based on a minimum value and a maximum value of the differential indices, and transmitting the encoded differential indices and a flag signal for indicating the selected Huffman table.
    Type: Grant
    Filed: December 19, 2018
    Date of Patent: December 24, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Zongxian Liu, Kok Seng Chong, Masahiro Oshikiri
  • Patent number: 10510354
    Abstract: A speech/audio coding apparatus includes a receiver that receives a time-domain speech input signal. The apparatus also includes a processor that transforms a time-domain speech input signal into a frequency-domain spectrum, and divides a frequency region of the spectrum in an extended band into a plurality of bands. The processor sets a limited band for each divided band in the current frame, a width of the limited band in the current frame being narrower than the divided band and the limited band including a first frequency. The processor further encodes the spectrum in the limited band within each divided band in the current frame, wherein the width of the limited band is predetermined and is set to 31.
    Type: Grant
    Filed: January 9, 2019
    Date of Patent: December 17, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Masahiro Oshikiri
  • Patent number: 10446159
    Abstract: A speech/audio encoding device for selectively allocating bits for higher precision encoding. The speech/audio encoding device receives a time-domain speech/audio input signal, transforms the speech/audio input signal into a frequency domain, and quantizes an energy envelope corresponding to an energy level for a frequency spectrum of the speech/audio input signal. The speech/audio encoding device further groups quantized energy envelopes into a plurality of groups, determines a perceptual significant group including one or more significant bands and a local-peak frequency, and allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes, in which each of the subbands is obtained by splitting the frequency spectrum of the speech/audio input signal. The speech/audio encoding device encodes the frequency spectrum using the bits allocated to the subbands.
    Type: Grant
    Filed: November 22, 2016
    Date of Patent: October 15, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Masahiro Oshikiri
  • Publication number: 20190198035
    Abstract: A coding apparatus includes a processor and a memory that stores instructions, which when executed causes the processor to perform operations, including encoding a first band of an input audio signal to be a first spectrum and dividing the first spectrum into a plurality of sub-bands. The operations also include searching a largest amplitude value of the divided first spectrum in each of the plurality of sub-bands, and normalizing the divided first spectrum in each of the plurality of sub-bands. The operations further include emphasizing a harmonic structure in the normalized first spectrum, and searching a best band that has a largest correlation value between each divided band of a second band spectrum and the emphasized first spectrum in which the harmonic structure is emphasized, and encoding the second band spectrum using lag information identifying the best band and transmitting the lag information to a decoder side.
    Type: Application
    Filed: March 1, 2019
    Publication date: June 27, 2019
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya KAWASHIMA, Katsunori DAIMOU, Masahiro OSHIKIRI
  • Publication number: 20190147897
    Abstract: A speech/audio coding apparatus includes a receiver that receives a time-domain speech input signal. The apparatus also includes a processor that transforms a time-domain speech input signal into a frequency-domain spectrum, and divides a frequency region of the spectrum in an extended band into a plurality of bands. The processor sets a limited band for each divided band in the current frame, a width of the limited band in the current frame being narrower than the divided band and the limited band including a first frequency. The processor further encodes the spectrum in the limited band within each divided band in the current frame, wherein the width of the limited band is predetermined and is set to 31.
    Type: Application
    Filed: January 9, 2019
    Publication date: May 16, 2019
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya KAWASHIMA, Masahiro OSHIKIRI
  • Publication number: 20190130924
    Abstract: A coding apparatus, including a memory and a processor that, when executing instructions stored in the memory, performs operations including encoding low-band transform coefficients in a first band and calculating, for each extension-band subband obtained by splitting an extension band, a threshold amplitude based on an analysis of statistics on extension-band transform coefficients included in the subband.
    Type: Application
    Filed: November 19, 2018
    Publication date: May 2, 2019
    Inventors: Takuya KAWASHIMA, Masahiro OSHIKIRI
  • Publication number: 20190122682
    Abstract: An audio/speech encoding method is provided that includes transforming a time domain input signal to a frequency spectrum, and dividing the frequency spectrum to a plural of bands. The method also includes calculating a level of energies for each band, quantizing the energies for the each band, and calculating differential indices. The method additionally includes modifying a range of the differential indices for the Nth band when N is an integer of 2 or more, and replacing the differential index with the modified differential index, and not modifying a range of the differential indices for the Nth band when N is an integer of 1. The method further includes encoding the differential indices using a Huffman table selected based on a minimum value and a maximum value of the differential indices, and transmitting the encoded differential indices and a flag signal for indicating the selected Huffman table.
    Type: Application
    Filed: December 19, 2018
    Publication date: April 25, 2019
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Zongxian LIU, Kok Seng CHONG, Masahiro OSHIKIRI
  • Patent number: 10269367
    Abstract: A coding apparatus, including a processor that performs operations, including encoding a first band of an input audio signal to be a first spectrum and dividing the first spectrum into a plurality of subbands at equal intervals, each interval including a predetermined number of samples. The operations also include searching a largest amplitude value of the divided first spectrum in each of the subbands, and normalizing the divided first spectrum with the largest amplitude values searched in each of the subbands to obtain a flattened first spectrum. The operations further include searching a best band which has a largest correlation value between each divided band of a second band spectrum and the flattened first spectrum, the second spectrum being higher than a predetermined frequency, and encoding the second spectrum using lag information identifying the best bands for transmitting the lag information to a decoder side.
    Type: Grant
    Filed: December 15, 2017
    Date of Patent: April 23, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Katsunori Daimou, Masahiro Oshikiri
  • Patent number: 10210877
    Abstract: A speech/audio decoding apparatus is provided that includes a receiver that receives encoded data including a limited-band mode flag, and a memory that stores information on a position of a maximum amplitude spectrum frequency of a previous frame in a divided band. The speech/audio decoding apparatus also includes a processor that identifies whether a decoding band is encoded using a limited-band mode based on the decoded limited-band mode flag. Additionally, the processor decodes the spectrum in a limited band within each of the divided bands in a current frame using the stored information. Furthermore, the limited-band mode is set at an encoder side, when a difference between a first frequency with a first maximum amplitude in a spectrum of the divided band in a preceding frame and a second frequency with a second maximum amplitude in a spectrum of the divided band in the current frame is below a threshold.
    Type: Grant
    Filed: December 20, 2017
    Date of Patent: February 19, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Masahiro Oshikiri
  • Patent number: 10204632
    Abstract: An audio/speech encoding apparatus/method and an audio/speech decoding apparatus/method are provided. The audio/speech encoding apparatus includes a memory that stores instructions, and a processor that performs operations. The operations include transforming a time domain input audio/speech signal to a frequency spectrum, dividing the frequency spectrum to a plural of bands, calculating norm factors, and quantizing the norm factors. The operations also include calculating differential indices between an Nth band index and an (N?1)th band index, and modifying a range of the differential indices for the Nth band when N is 2 or more. The operations further include replacing the differential index with the modified differential index, and not modifying a range of the differential indices for the Nth band when N is 1. The apparatus encodes the differential indices using a selected Huffman table, and transmits the encoded differential indices and a flag signal over a communication network.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: February 12, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Zongxian Liu, Kok Seng Chong, Masahiro Oshikiri
  • Publication number: 20190027155
    Abstract: Provided are a voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method that efficiently perform bit distribution and improve sound quality. Dominant frequency band identification unit identifies a dominant frequency band having a norm factor value that is the maximum value within the spectrum of an input voice audio signal. Dominant group determination units and non-dominant group determination unit group all sub-bands into a dominant group that contains the dominant frequency band and a non-dominant group that contains no dominant frequency band. Group bit distribution unit distributes bits to each group on the basis of the energy and norm variance of each group. Sub-band bit distribution unit redistributes the bits that have been distributed to each group to each sub-band in accordance with the ratio of the norm to the energy of the groups.
    Type: Application
    Filed: September 25, 2018
    Publication date: January 24, 2019
    Inventors: Zongxian Liu, Srikanth Nagisetty, Masahiro Oshikiri
  • Patent number: 10134410
    Abstract: A coding apparatus, including a memory and a processor that, when executing instructions stored in the memory, performs operations including encoding low-band transform coefficients in a first band and calculating, for each extension-band subband obtained by splitting an extension band, a threshold amplitude based on an analysis of statistics on extension-band transform coefficients included in the subband.
    Type: Grant
    Filed: September 13, 2016
    Date of Patent: November 20, 2018
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Takuya Kawashima, Masahiro Oshikiri
  • Patent number: 10102865
    Abstract: Provided are a voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method that efficiently perform bit distribution and improve sound quality. Dominant frequency band identification unit identifies a dominant frequency band having a norm factor value that is the maximum value within the spectrum of an input voice audio signal. Dominant group determination units and non-dominant group determination unit group all sub-bands into a dominant group that contains the dominant frequency band and a non-dominant group that contains no dominant frequency band. Group bit distribution unit distributes bits to each group on the basis of the energy and norm variance of each group. Sub-band bit distribution unit redistributes the bits that have been distributed to each group to each sub-band in accordance with the ratio of the norm to the energy of the groups.
    Type: Grant
    Filed: August 10, 2017
    Date of Patent: October 16, 2018
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Zongxian Liu, Srikanth Nagisetty, Masahiro Oshikiri
  • Publication number: 20180166086
    Abstract: An audio/speech encoding apparatus/method and an audio/speech decoding apparatus/method are provided. The audio/speech encoding apparatus includes a memory that stores instructions, and a processor that performs operations. The operations include transforming a time domain input audio/speech signal to a frequency spectrum, dividing the frequency spectrum to a plural of bands, calculating norm factors, and quantizing the norm factors. The operations also include calculating differential indices between an Nth band index and an (N?1)th band index, and modifying a range of the differential indices for the Nth band when N is 2 or more. The operations further include replacing the differential index with the modified differential index, and not modifying a range of the differential indices for the Nth band when N is 1. The apparatus encodes the differential indices using a selected Huffman table, and transmits the encoded differential indices and a flag signal over a communication network.
    Type: Application
    Filed: December 12, 2017
    Publication date: June 14, 2018
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Zongxian Liu, Kok Seng Chong, Masahiro Oshikiri
  • Publication number: 20180158466
    Abstract: A coding apparatus, including a processor that performs operations including encoding a first band of an input audio signal to be a first spectrum, dividing the first spectrum into a plurality of subbands, at equal intervals each including a predetermined number of samples for flattening the first spectrum, searching a largest amplitude value of the divided first spectrum in each of the subbands, normalizing the divided first spectrum with the largest amplitude values searched in each of the subbands, searching best bands among each normalized divided first spectrum which has a largest correlation value between each divided band of a second band spectrum and each normalized divided first spectrum, the second spectrum being higher than a predetermined frequency, and encoding the second spectrum using lag information identifying the best bands for transmitting the lag information to a decoder side.
    Type: Application
    Filed: December 15, 2017
    Publication date: June 7, 2018
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya KAWASHIMA, Katsunori DAIMOU, Masahiro OSHIKIRI
  • Publication number: 20180114535
    Abstract: A speech/audio decoding apparatus is provided that includes a receiver that receives encoded data including a limited-band mode flag, and a memory that stores information on a position of a maximum amplitude spectrum frequency of a previous frame in a divided band. The speech/audio decoding apparatus also includes a processor that identifies whether a decoding band is encoded using a limited-band mode based on the decoded limited-band mode flag. Additionally, the processor decodes the spectrum in a limited band within each of the divided bands in a current frame using the stored information. Furthermore, the limited-band mode is set at an encoder side, when a difference between a first frequency with a first maximum amplitude in a spectrum of the divided band in a preceding frame and a second frequency with a second maximum amplitude in a spectrum of the divided band in the current frame is below a threshold.
    Type: Application
    Filed: December 20, 2017
    Publication date: April 26, 2018
    Applicant: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya KAWASHIMA, Masahiro OSHIKIRI
  • Patent number: 9892740
    Abstract: A speech/audio coding apparatus is provided that includes a receiver that receives a time-domain speech input signal and a processor. The processor transforms a time-domain speech input signal into a frequency-domain spectrum, and divides a frequency region of the spectrum in an extended band into a plurality of bands. The processor also sets a limited band for each divided band in the current frame, when a difference between a first frequency with a first maximum amplitude in a spectrum of the divided band in a preceding frame and a second frequency with a second maximum amplitude in a spectrum of the divided band in a current frame is below a threshold. The processor further encodes the spectrum in the limited band within each divided band in the current frame, and does not encode a spectrum outside the limited band within each divided band in the current frame.
    Type: Grant
    Filed: May 9, 2017
    Date of Patent: February 13, 2018
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventors: Takuya Kawashima, Masahiro Oshikiri