Patents by Inventor Naoyuki YOSHIOKA

Naoyuki YOSHIOKA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240185859
    Abstract: A hypothesis stitcher for speech recognition of long-form audio provides superior performance, such as higher accuracy and reduced computational cost. An example disclosed operation includes: segmenting the audio stream into a plurality of audio segments; identifying a plurality of speakers within each of the plurality of audio segments; performing automatic speech recognition (ASR) on each of the plurality of audio segments to generate a plurality of short-segment hypotheses; merging at least a portion of the short-segment hypotheses into a first merged hypothesis set; inserting stitching symbols into the first merged hypothesis set, the stitching symbols including a window change (WC) symbol; and consolidating, with a network-based hypothesis stitcher, the first merged hypothesis set into a first consolidated hypothesis.
    Type: Application
    Filed: February 13, 2024
    Publication date: June 6, 2024
    Inventors: Naoyuki KANDA, Xuankai CHANG, Yashesh GAUR, Xiaofei WANG, Zhong MENG, Takuya YOSHIOKA
  • Patent number: 11984127
    Abstract: The disclosure herein describes using a transcript generation model for generating a transcript from a multi-speaker audio stream. Audio data including overlapping speech of a plurality of speakers is obtained and a set of frame embeddings are generated from audio data frames of the obtained audio data using an audio data encoder. A set of words and channel change (CC) symbols are generated from the set of frame embeddings using a transcript generation model. The CC symbols are included between pairs of adjacent words that are spoken by different people at the same time. The set of words and CC symbols are transformed into a plurality of transcript lines, wherein words of the set of words are sorted into transcript lines based on the CC symbols, and a multi-speaker transcript is generated based on the plurality of transcript lines. The inclusion of CC symbols by the model enables efficient, accurate multi-speaker transcription.
    Type: Grant
    Filed: December 31, 2021
    Date of Patent: May 14, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Naoyuki Kanda, Takuya Yoshioka, Zhuo Chen, Jinyu Li, Yashesh Gaur, Zhong Meng, Xiaofei Wang, Xiong Xiao
  • Patent number: 11935542
    Abstract: A hypothesis stitcher for speech recognition of long-form audio provides superior performance, such as higher accuracy and reduced computational cost. An example disclosed operation includes: segmenting the audio stream into a plurality of audio segments; identifying a plurality of speakers within each of the plurality of audio segments; performing automatic speech recognition (ASR) on each of the plurality of audio segments to generate a plurality of short-segment hypotheses; merging at least a portion of the short-segment hypotheses into a first merged hypothesis set; inserting stitching symbols into the first merged hypothesis set, the stitching symbols including a window change (WC) symbol; and consolidating, with a network-based hypothesis stitcher, the first merged hypothesis set into a first consolidated hypothesis.
    Type: Grant
    Filed: January 19, 2023
    Date of Patent: March 19, 2024
    Assignee: Microsoft Technology Licensing, LLC.
    Inventors: Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka
  • Patent number: 10379170
    Abstract: Relatively inexpensive and practical cell deterioration diagnostic method and cell deterioration diagnostic device are provided. A cell deterioration diagnostic method diagnoses cell deterioration of a secondary cell having a transient characteristic. The method includes: a charging step of charging the secondary cell; a calculation step of calculating an integrated value of a potential difference obtained by subtracting a cell internal voltage V0 of the secondary cell from a cell inter-terminal voltage of the secondary cell by integrating the potential difference as the cell inter-terminal voltage converges to the cell internal voltage V0 after completion of charging; and a diagnosis step of diagnosing the cell deterioration of the secondary cell based on the integrated value.
    Type: Grant
    Filed: February 24, 2016
    Date of Patent: August 13, 2019
    Assignee: The Doshisha
    Inventors: Naoto Nagaoka, Naoyuki Yoshioka, Naoya Narita
  • Publication number: 20180038918
    Abstract: Relatively inexpensive and practical cell deterioration diagnostic method and cell deterioration diagnostic device are provided. A cell deterioration diagnostic method diagnoses cell deterioration of a secondary cell having a transient characteristic. The method includes: a charging step of charging the secondary cell; a calculation step of calculating an integrated value of a potential difference obtained by subtracting a cell internal voltage V0 of the secondary cell from a cell inter-terminal voltage of the secondary cell by integrating the potential difference as the cell inter-terminal voltage converges to the cell internal voltage V0 after completion of charging; and a diagnosis step of diagnosing the cell deterioration of the secondary cell based on the integrated value.
    Type: Application
    Filed: February 24, 2016
    Publication date: February 8, 2018
    Inventors: Naoto NAGAOKA, Naoyuki YOSHIOKA, Naoya NARITA, I