Patents by Inventor Weisi SHI

Weisi SHI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12211508
    Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.
    Type: Grant
    Filed: November 20, 2020
    Date of Patent: January 28, 2025
    Assignee: AI SPEECH CO., LTD.
    Inventors: Weisi Shi, Hongbo Song, Chengya Zhu, Shuai Fan
  • Patent number: 12131735
    Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: October 29, 2024
    Assignee: AI Speech Co., Ltd.
    Inventors: Hongbo Song, Weisi Shi, Chengya Zhu, Shuai Fan
  • Patent number: 11862150
    Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.
    Type: Grant
    Filed: November 18, 2020
    Date of Patent: January 2, 2024
    Assignee: AI SPEECH CO., LTD.
    Inventors: Chengya Zhu, Shuai Fan, Weisi Shi
  • Publication number: 20230133146
    Abstract: Disclosed is a method for determining a skill field of a dialogue text including determining a skill field hit by a dialogue text input by a user, and a name semantic slot and a character semantic slot in the skill field; when the dialogue text hits a first skill field, determining whether the name semantic slot and the character semantic slot match according to a knowledge base of the first skill field; determining, if not matched, whether the name semantic slot and the character semantic slot match according to a knowledge base of a second skill field; and determining, if matched, the second skill field as the skill field of the dialogue text. Also provided is an apparatus for determining a skill field of a dialogue text. The error rate of field classification is reduced, and the skill field can be hit by the user's voice dialogue more accurately.
    Type: Application
    Filed: November 17, 2020
    Publication date: May 4, 2023
    Applicant: AI SPEECH CO., LTD.
    Inventors: Chengya ZHU, Shuai FAN, Chun LI, Weisi SHI
  • Publication number: 20230077478
    Abstract: Disclosed are a method and apparatus for testing a full-duplex speech interaction system. The method includes: determining a scene mixed corpus set by mixing a valid corpus set related to a test scene with an invalid corpus set unrelated to the test scene; playing each corpus audio in the scene mixed corpus set to a speech interaction device under test equipped with the full-duplex speech interaction system; acquiring a work log of the speech interaction device under test, the work log including at least a first log and a second log; and obtaining number of false responses by counting number of log entries which have false response result in the second log, and determining a false response rate based on the number of false responses and a total number of corpus audios played. End-to-end testing of the full-duplex speech interaction system is realized.
    Type: Application
    Filed: November 18, 2022
    Publication date: March 16, 2023
    Applicant: AI SPEECH CO., LTD.
    Inventors: Weisi SHI, Shuai FAN, Hongbo SONG
  • Publication number: 20230066881
    Abstract: Disclosed are an information flow-based decision-making and scheduling customization method and apparatus. The method includes: instantiating a pipeline and modules customized by a developer in the pipeline; mounting the modules on the pipeline in a module sequence customized by the developer; inputting a data stream into an entry of the pipeline; and acquiring a decision result for the data stream from an exit of the pipeline. The solution reduces the coupling between modules. The modules are independent from each other and can be developed collaboratively by many developers. In case the overall design remains unchanged, the modification of the modules has less impact on the global situation. The configurability of scheduling and decision-making is improved. Since the modules are dynamically mounted on the pipeline, this provides extremely high configurability for large-scale customization, and module instances can be dynamically generated by reading configuration information at runtime.
    Type: Application
    Filed: November 18, 2020
    Publication date: March 2, 2023
    Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
  • Publication number: 20230037913
    Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.
    Type: Application
    Filed: November 20, 2020
    Publication date: February 9, 2023
    Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
  • Publication number: 20230044968
    Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.
    Type: Application
    Filed: November 18, 2020
    Publication date: February 9, 2023
    Applicant: AI SPEECH CO., LTD.
    Inventors: Chengya ZHU, Shuai FAN, Weisi SHI
  • Patent number: 11551693
    Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: January 10, 2023
    Assignee: AI SPEECH CO., LTD.
    Inventors: Hongbo Song, Chengya Zhu, Weisi Shi, Shuai Fan
  • Publication number: 20220399020
    Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.
    Type: Application
    Filed: November 25, 2019
    Publication date: December 15, 2022
    Applicant: AI Speech Co., Ltd.
    Inventors: Hongbo SONG, Weisi SHI, Chengya ZHU, Shuai FAN
  • Publication number: 20220165269
    Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.
    Type: Application
    Filed: November 25, 2019
    Publication date: May 26, 2022
    Applicant: AI SPEECH CO., LTD
    Inventors: Hongbo SONG, Chengya ZHU, Weisi SHI, Shuai FAN