Patents by Inventor Weisi SHI

Weisi SHI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for testing full-duplex speech interaction system

Patent number: 12327545

Abstract: Disclosed are a method and apparatus for testing a full-duplex speech interaction system. The method includes: determining a scene mixed corpus set by mixing a valid corpus set related to a test scene with an invalid corpus set unrelated to the test scene; playing each corpus audio in the scene mixed corpus set to a speech interaction device under test equipped with the full-duplex speech interaction system; acquiring a work log of the speech interaction device under test, the work log including at least a first log and a second log; and obtaining number of false responses by counting number of log entries which have false response result in the second log, and determining a false response rate based on the number of false responses and a total number of corpus audios played. End-to-end testing of the full-duplex speech interaction system is realized.

Type: Grant

Filed: November 18, 2022

Date of Patent: June 10, 2025

Assignee: AI SPEECH CO., LTD.

Inventors: Weisi Shi, Shuai Fan, Hongbo Song
Server-side processing method and server for actively initiating dialogue, and voice interaction system capable of initiating dialogue

Patent number: 12211508

Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.

Type: Grant

Filed: November 20, 2020

Date of Patent: January 28, 2025

Assignee: AI SPEECH CO., LTD.

Inventors: Weisi Shi, Hongbo Song, Chengya Zhu, Shuai Fan
Man-machine dialogue mode switching method

Patent number: 12131735

Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.

Type: Grant

Filed: November 25, 2019

Date of Patent: October 29, 2024

Assignee: AI Speech Co., Ltd.

Inventors: Hongbo Song, Weisi Shi, Chengya Zhu, Shuai Fan
Skill dispatching method and apparatus for speech dialogue platform

Patent number: 11862150

Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.

Type: Grant

Filed: November 18, 2020

Date of Patent: January 2, 2024

Assignee: AI SPEECH CO., LTD.

Inventors: Chengya Zhu, Shuai Fan, Weisi Shi
METHOD AND APPARATUS FOR DETERMINING SKILL FIELD OF DIALOGUE TEXT

Publication number: 20230133146

Abstract: Disclosed is a method for determining a skill field of a dialogue text including determining a skill field hit by a dialogue text input by a user, and a name semantic slot and a character semantic slot in the skill field; when the dialogue text hits a first skill field, determining whether the name semantic slot and the character semantic slot match according to a knowledge base of the first skill field; determining, if not matched, whether the name semantic slot and the character semantic slot match according to a knowledge base of a second skill field; and determining, if matched, the second skill field as the skill field of the dialogue text. Also provided is an apparatus for determining a skill field of a dialogue text. The error rate of field classification is reduced, and the skill field can be hit by the user's voice dialogue more accurately.

Type: Application

Filed: November 17, 2020

Publication date: May 4, 2023

Applicant: AI SPEECH CO., LTD.

Inventors: Chengya ZHU, Shuai FAN, Chun LI, Weisi SHI
METHOD AND APPARATUS FOR TESTING FULL-DUPLEX SPEECH INTERACTION SYSTEM

Publication number: 20230077478

Abstract: Disclosed are a method and apparatus for testing a full-duplex speech interaction system. The method includes: determining a scene mixed corpus set by mixing a valid corpus set related to a test scene with an invalid corpus set unrelated to the test scene; playing each corpus audio in the scene mixed corpus set to a speech interaction device under test equipped with the full-duplex speech interaction system; acquiring a work log of the speech interaction device under test, the work log including at least a first log and a second log; and obtaining number of false responses by counting number of log entries which have false response result in the second log, and determining a false response rate based on the number of false responses and a total number of corpus audios played. End-to-end testing of the full-duplex speech interaction system is realized.

Type: Application

Filed: November 18, 2022

Publication date: March 16, 2023

Applicant: AI SPEECH CO., LTD.

Inventors: Weisi SHI, Shuai FAN, Hongbo SONG
INFORMATION FLOW-BASED DECISION-MAKING AND SCHEDULING CUSTOMIZATION METHOD AND APPARATUS

Publication number: 20230066881

Abstract: Disclosed are an information flow-based decision-making and scheduling customization method and apparatus. The method includes: instantiating a pipeline and modules customized by a developer in the pipeline; mounting the modules on the pipeline in a module sequence customized by the developer; inputting a data stream into an entry of the pipeline; and acquiring a decision result for the data stream from an exit of the pipeline. The solution reduces the coupling between modules. The modules are independent from each other and can be developed collaboratively by many developers. In case the overall design remains unchanged, the modification of the modules has less impact on the global situation. The configurability of scheduling and decision-making is improved. Since the modules are dynamically mounted on the pipeline, this provides extremely high configurability for large-scale customization, and module instances can be dynamically generated by reading configuration information at runtime.

Type: Application

Filed: November 18, 2020

Publication date: March 2, 2023

Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
SERVER-SIDE PROCESSING METHOD AND SERVER FOR ACTIVELY INITIATING DIALOGUE, AND VOICE INTERACTION SYSTEM CAPABLE OF INITIATING DIALOGUE

Publication number: 20230037913

Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.

Type: Application

Filed: November 20, 2020

Publication date: February 9, 2023

Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
SKILL DISPATCHING METHOD AND APPARATUS FOR SPEECH DIALOGUE PLATFORM

Publication number: 20230044968

Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.

Type: Application

Filed: November 18, 2020

Publication date: February 9, 2023

Applicant: AI SPEECH CO., LTD.

Inventors: Chengya ZHU, Shuai FAN, Weisi SHI
Method of man-machine interaction and electronic device

Patent number: 11551693

Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.

Type: Grant

Filed: November 25, 2019

Date of Patent: January 10, 2023

Assignee: AI SPEECH CO., LTD.

Inventors: Hongbo Song, Chengya Zhu, Weisi Shi, Shuai Fan
MAN-MACHINE DIALOGUE MODE SWITCHING METHOD

Publication number: 20220399020

Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.

Type: Application

Filed: November 25, 2019

Publication date: December 15, 2022

Applicant: AI Speech Co., Ltd.

Inventors: Hongbo SONG, Weisi SHI, Chengya ZHU, Shuai FAN
METHOD OF MAN-MACHINE INTERACTION AND ELECTRONIC DEVICE

Publication number: 20220165269

Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.

Type: Application

Filed: November 25, 2019

Publication date: May 26, 2022

Applicant: AI SPEECH CO., LTD

Inventors: Hongbo SONG, Chengya ZHU, Weisi SHI, Shuai FAN