Patents by Inventor Zoltan ROMOCSA

Zoltan ROMOCSA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240144931
    Abstract: Some disclosed embodiments are directed to obtaining a decoded audio data including a spoken language utterance recognized in audio data and identifying a disfluency in the decoded audio data. Upon determining that correcting the disfluency would improve a readability score of the decoded audio data, the system generates a particular correction to correct the disfluency and applies the particular correction to the decoded audio data. Then, an updated decoded audio data is generated which reflects the particular correction. The updated decoded audio data has improved readability over the decoded audio data.
    Type: Application
    Filed: November 1, 2022
    Publication date: May 2, 2024
    Inventors: Sayan Dev PATHAK, Ayush VIKRAM, Zoltan ROMOCSA, Amy Parag SHAH, Piyush BEHRE, Sharman W TAN, Amit Kumar AGARWAL, Christopher Hakan BASOGLU
  • Publication number: 20240087572
    Abstract: Systems are configured to obtain streaming audio data comprising language utterances, continuously decode the streaming audio data in order to generate decoded streaming audio data and determine whether a linguistic boundary exists within an initial segment of decoded streaming audio data. When a linguistic boundary is determined to exist, the systems apply a punctuation at the linguistic boundary and output a first portion of the initial segment of the streaming audio data ending at the linguistic boundary while refraining from outputting a second portion of the initial segment which is located temporally subsequent to the first portion of the initial segment. Systems are also configured to delay the output until predetermined punctuation validation processes have been performed.
    Type: Application
    Filed: November 14, 2022
    Publication date: March 14, 2024
    Inventors: Sayan Dev PATHAK, Amit Kumar AGARWAL, Amy Parag SHAH, Sourish CHATTERJEE, Zoltan ROMOCSA, Christopher Hakan BASOGLU, Piyush BEHRE, Shuangyu CHANG, Emilian Yordanov STOIMENOV
  • Patent number: 11636854
    Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
    Type: Grant
    Filed: May 24, 2022
    Date of Patent: April 25, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
  • Patent number: 11562738
    Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.
    Type: Grant
    Filed: October 28, 2019
    Date of Patent: January 24, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Veljko Miljanic, Aadyot Bhatnagar, Hosam Khalil, Christopher Basoglu
  • Publication number: 20220358912
    Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
    Type: Application
    Filed: May 24, 2022
    Publication date: November 10, 2022
    Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Christopher H. BASOGLU, Nathan E. WOHLGEMUTH
  • Patent number: 11430433
    Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
    Type: Grant
    Filed: August 5, 2019
    Date of Patent: August 30, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
  • Patent number: 11348574
    Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
    Type: Grant
    Filed: August 5, 2019
    Date of Patent: May 31, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ziad Al Bawab, Anand U Desai, Shuangyu Chang, Amit K Agarwal, Zoltan Romocsa, Christopher H Basoglu, Nathan E Wohlgemuth
  • Patent number: 11069359
    Abstract: A context-aware transcription system includes a language model preparation service that retrieves meeting-specific data prior to or during a meeting. The language model preparation service utilizes the meeting-specific data to generate a meeting-specific statistical language model. A speech transcription service can utilize the meeting-specific statistical language model to generate a transcription of audio generated by attendees of a meeting. The system can transmit the transcription to computing devices associated with meeting attendees during the meeting for presentation in a user interface in real time. The language model preparation service can generate the meeting-specific statistical language model in response to receiving a pre-meeting signal. The pre-meeting signal can be generated a predetermined time prior to meetings according to a schedule.
    Type: Grant
    Filed: April 12, 2019
    Date of Patent: July 20, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Shalendra Chhabra, Michael J. Shelton, Amit K. Agarwal, Halley Weitzman, Mikhail Raer, Zoltan Romocsa, Rishi Girish, Skyler Michael Anderson, Tomas Bergl, Mykola Denysiuk, Andrii Matukhno
  • Publication number: 20200349931
    Abstract: A system includes acquisition of meeting data associated with a meeting, determination of a plurality of meeting participants based on the acquired meeting data, acquisition of e-mail data associated with each of the plurality of meeting participants, generation of a meeting language model based on the acquired e-mail data and the meeting data, and transcription of audio associated with the meeting based on the meeting language model.
    Type: Application
    Filed: August 5, 2019
    Publication date: November 5, 2020
    Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Christopher H. BASOGLU, Nathan E. WOHLGEMUTH
  • Publication number: 20200349930
    Abstract: A system includes acquisition of a domain grammar, determination of an interpolated grammar based on the domain grammar and a base grammar, determination of a delta domain grammar based on an augmented first grammar and the interpolated grammar, determination of an out-of-vocabulary class based on the domain grammar and the base grammar, insertion of the out-of-vocabulary class into a composed transducer composed of the augmented first grammar and one or more other transducers to generate an updated composed transducer, composition of the delta domain grammar and the updated composed transducer, and application of the composition of the delta domain grammar and the updated composed transducer to an output of an acoustic model.
    Type: Application
    Filed: October 28, 2019
    Publication date: November 5, 2020
    Inventors: Ziad AL BAWAB, Anand U. DESAI, Shuangyu CHANG, Amit K. AGARWAL, Zoltan ROMOCSA, Veljko MILJANIC, Aadyot BHATNAGAR, Hosam KHALIL, Christopher BASOGLU
  • Publication number: 20200327891
    Abstract: A context-aware transcription system includes a language model preparation service that retrieves meeting-specific data prior to or during a meeting. The language model preparation service utilizes the meeting-specific data to generate a meeting-specific statistical language model. A speech transcription service can utilize the meeting-specific statistical language model to generate a transcription of audio generated by attendees of a meeting. The system can transmit the transcription to computing devices associated with meeting attendees during the meeting for presentation in a user interface in real time. The language model preparation service can generate the meeting-specific statistical language model in response to receiving a pre-meeting signal. The pre-meeting signal can be generated a predetermined time prior to meetings according to a schedule.
    Type: Application
    Filed: April 12, 2019
    Publication date: October 15, 2020
    Inventors: Shalendra CHHABRA, Michael J. SHELTON, Amit K. AGARWAL, Halley WEITZMAN, Mikhail RAER, Zoltan ROMOCSA, Rishi GIRISH, Skyler Michael ANDERSON, Tomas BERGL, Mykola DENYSIUK, Andrii MATUKHNO