Patents by Inventor Weisi SHI
Weisi SHI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12211508
Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.
Type: Grant
Filed: November 20, 2020
Date of Patent: January 28, 2025
Assignee: AI SPEECH CO., LTD.
Inventors: Weisi Shi, Hongbo Song, Chengya Zhu, Shuai Fan
-
Patent number: 12131735
Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.
Type: Grant
Filed: November 25, 2019
Date of Patent: October 29, 2024
Assignee: AI Speech Co., Ltd.
Inventors: Hongbo Song, Weisi Shi, Chengya Zhu, Shuai Fan
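The mode-switching rule in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the preset fields, the keyword table, and the names `classify_field` and `select_mode` are hypothetical stand-ins, and a real system would presumably use a trained field classifier rather than keyword matching.

```python
from enum import Enum


class DialogueMode(Enum):
    FULL_DUPLEX = "full-duplex"
    HALF_DUPLEX = "half-duplex"


# Hypothetical preset dialogue fields that should trigger full-duplex mode.
PRESET_FIELDS = {"navigation", "music"}

# Hypothetical keyword table standing in for a real field classifier.
FIELD_KEYWORDS = {
    "navigation": ("navigate", "route"),
    "music": ("play", "song"),
    "weather": ("weather", "rain"),
}


def classify_field(sentence: str) -> str:
    """Return the first field whose keyword occurs in the sentence."""
    text = sentence.lower()
    for field, keywords in FIELD_KEYWORDS.items():
        if any(keyword in text for keyword in keywords):
            return field
    return "unknown"


def select_mode(sentence: str, preset_fields=PRESET_FIELDS) -> DialogueMode:
    """Full-duplex if the sentence's field is preset, otherwise half-duplex."""
    if classify_field(sentence) in preset_fields:
        return DialogueMode.FULL_DUPLEX
    return DialogueMode.HALF_DUPLEX
```

Under these assumptions, `select_mode("Play a song for me")` selects full-duplex mode, while a weather question falls back to half-duplex mode.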
-
Patent number: 11862150
Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.
Type: Grant
Filed: November 18, 2020
Date of Patent: January 2, 2024
Assignee: AI SPEECH CO., LTD.
Inventors: Chengya Zhu, Shuai Fan, Weisi Shi
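The dispatch flow in this abstract (parallel parsing, priority sorting, fallback when realization fails) could be sketched roughly as follows. The `SkillService` type, the `dispatch` function, the `can_realize` callback, and the convention that a larger number means higher priority are all assumptions made for illustration; the abstract does not specify them.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class SkillService:
    name: str
    priority: int  # assumption: a larger value means a higher priority
    parse: Callable[[str], Optional[str]]  # returns None if the skill cannot parse


def dispatch(
    semantic_result: str,
    skills: List[SkillService],
    can_realize: Callable[[str], bool],
) -> Optional[str]:
    """Query all skills in parallel, then try results in priority order."""
    with ThreadPoolExecutor(max_workers=max(1, len(skills))) as pool:
        parsed = list(pool.map(lambda s: (s, s.parse(semantic_result)), skills))

    # Keep only skills that produced a parsing result, highest priority first.
    candidates = [(skill, result) for skill, result in parsed if result is not None]
    candidates.sort(key=lambda pair: pair[0].priority, reverse=True)

    # Stand-in for the skill realization discrimination service: on failure,
    # fall through to the next-highest-priority result.
    for _skill, result in candidates:
        if can_realize(result):
            return result
    return None
```

Returning `None` when no result can be realized is a choice made for this sketch; the abstract does not say what happens in that case.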
-
Publication number: 20230133146
Abstract: Disclosed is a method for determining a skill field of a dialogue text including determining a skill field hit by a dialogue text input by a user, and a name semantic slot and a character semantic slot in the skill field; when the dialogue text hits a first skill field, determining whether the name semantic slot and the character semantic slot match according to a knowledge base of the first skill field; determining, if not matched, whether the name semantic slot and the character semantic slot match according to a knowledge base of a second skill field; and determining, if matched, the second skill field as the skill field of the dialogue text. Also provided is an apparatus for determining a skill field of a dialogue text. The error rate of field classification is reduced, and the skill field can be hit by the user's voice dialogue more accurately.
Type: Application
Filed: November 17, 2020
Publication date: May 4, 2023
Applicant: AI SPEECH CO., LTD.
Inventors: Chengya ZHU, Shuai FAN, Chun LI, Weisi SHI
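One way to picture the slot check in this abstract is the Python sketch below. It assumes, purely for illustration, that each skill field's knowledge base maps a name slot value to the set of character slot values known for it; the function name `resolve_skill_field` and the fallback to the originally hit field when nothing matches are also assumptions, not details from the application.

```python
from typing import Dict, Set

# Assumed layout: field -> {name slot value -> known character slot values}
KnowledgeBases = Dict[str, Dict[str, Set[str]]]


def resolve_skill_field(
    name: str,
    character: str,
    hit_field: str,
    knowledge_bases: KnowledgeBases,
) -> str:
    """Return the field whose knowledge base confirms the name/character pair."""
    first_kb = knowledge_bases.get(hit_field, {})
    if character in first_kb.get(name, set()):
        return hit_field  # slots match in the field that was hit first

    # Not matched: check the knowledge bases of the other skill fields.
    for field, kb in knowledge_bases.items():
        if field != hit_field and character in kb.get(name, set()):
            return field

    # Assumption: keep the original field when no knowledge base matches.
    return hit_field
```

For example, if a text hits a hypothetical "story" field but the name and character pair is only known to an "animation" knowledge base, the sketch reassigns the text to "animation".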
-
Publication number: 20230077478
Abstract: Disclosed are a method and apparatus for testing a full-duplex speech interaction system. The method includes: determining a scene mixed corpus set by mixing a valid corpus set related to a test scene with an invalid corpus set unrelated to the test scene; playing each corpus audio in the scene mixed corpus set to a speech interaction device under test equipped with the full-duplex speech interaction system; acquiring a work log of the speech interaction device under test, the work log including at least a first log and a second log; and obtaining number of false responses by counting number of log entries which have false response result in the second log, and determining a false response rate based on the number of false responses and a total number of corpus audios played. End-to-end testing of the full-duplex speech interaction system is realized.
Type: Application
Filed: November 18, 2022
Publication date: March 16, 2023
Applicant: AI SPEECH CO., LTD.
Inventors: Weisi SHI, Shuai FAN, Hongbo SONG
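The false response rate described here is the number of false-response entries in the second log divided by the total number of corpus audios played. A minimal Python sketch follows; the log entry layout (a dict with a `"result"` key and the value `"false_response"`) and both function names are hypothetical, since the abstract does not define the log format.

```python
import random
from typing import Dict, Iterable, List


def build_mixed_corpus(valid: Iterable[str], invalid: Iterable[str], seed: int = 0) -> List[str]:
    """Mix scene-related (valid) and unrelated (invalid) corpus items."""
    mixed = list(valid) + list(invalid)
    random.Random(seed).shuffle(mixed)  # seeded so a test run is repeatable
    return mixed


def false_response_rate(second_log: Iterable[Dict[str, str]], total_played: int) -> float:
    """Count false-response log entries and divide by audios played."""
    if total_played <= 0:
        raise ValueError("total_played must be positive")
    false_responses = sum(
        1 for entry in second_log if entry.get("result") == "false_response"
    )
    return false_responses / total_played
```

With four audios played and one false-response entry in the log, the rate is 1/4 = 0.25.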
-
Publication number: 20230066881
Abstract: Disclosed are an information flow-based decision-making and scheduling customization method and apparatus. The method includes: instantiating a pipeline and modules customized by a developer in the pipeline; mounting the modules on the pipeline in a module sequence customized by the developer; inputting a data stream into an entry of the pipeline; and acquiring a decision result for the data stream from an exit of the pipeline. The solution reduces the coupling between modules. The modules are independent from each other and can be developed collaboratively by many developers. In case the overall design remains unchanged, the modification of the modules has less impact on the global situation. The configurability of scheduling and decision-making is improved. Since the modules are dynamically mounted on the pipeline, this provides extremely high configurability for large-scale customization, and module instances can be dynamically generated by reading configuration information at runtime.
Type: Application
Filed: November 18, 2020
Publication date: March 2, 2023
Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
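The pipeline idea in this abstract (modules instantiated from configuration, mounted in a developer-chosen order, data entering at one end and a decision leaving at the other) might look like the Python sketch below. The `Pipeline` class, the two example modules, the `MODULE_REGISTRY` lookup, and the list-of-names configuration format are all invented for illustration and are not taken from the application.

```python
from typing import Any, List


class Pipeline:
    """Runs mounted modules in order; each module exposes process(data)."""

    def __init__(self) -> None:
        self._modules: List[Any] = []

    def mount(self, module: Any) -> None:
        self._modules.append(module)

    def run(self, data: Any) -> Any:
        for module in self._modules:  # entry -> module 1 -> ... -> exit
            data = module.process(data)
        return data


class Lowercase:
    def process(self, data: str) -> str:
        return data.lower()


class WakeDecision:
    def process(self, data: str) -> str:
        return "wake" if "hello" in data else "ignore"


# Hypothetical registry so module instances can be created from config names.
MODULE_REGISTRY = {"lowercase": Lowercase, "wake_decision": WakeDecision}


def build_pipeline(module_names: List[str]) -> Pipeline:
    """Instantiate and mount modules in the configured sequence."""
    pipeline = Pipeline()
    for name in module_names:
        pipeline.mount(MODULE_REGISTRY[name]())
    return pipeline
```

Because the modules only share the `process` interface, reordering or replacing one is a configuration change: `build_pipeline(["lowercase", "wake_decision"]).run("HELLO there")` yields `"wake"`, whereas omitting `"lowercase"` yields `"ignore"` for the same input.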
-
Publication number: 20230037913
Abstract: A server-side processing method for implementing an active initiation of a dialogue is disclosed, comprising: establishing a communication connection with a voice client, in response to a received request for establishing a connection from the voice client; receiving an information stream sent by the voice client through the communication connection; performing a dialogue decision-making process according to the information stream, obtaining and outputting an adapted dialogue content to the voice client upon determining that it is an active dialogue scenario. A server and a system for implementing an active initiation of a dialogue are also provided. The disclosed solutions realize intelligent decision-making for voice interaction, and can actively initiate a dialogue based on server-side decision-making, improving interaction experience and realizing intelligent interaction.
Type: Application
Filed: November 20, 2020
Publication date: February 9, 2023
Inventors: Weisi SHI, Hongbo SONG, Chengya ZHU, Shuai FAN
-
Publication number: 20230044968
Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.
Type: Application
Filed: November 18, 2020
Publication date: February 9, 2023
Applicant: AI SPEECH CO., LTD.
Inventors: Chengya ZHU, Shuai FAN, Weisi SHI
-
Patent number: 11551693
Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.
Type: Grant
Filed: November 25, 2019
Date of Patent: January 10, 2023
Assignee: AI SPEECH CO., LTD.
Inventors: Hongbo Song, Chengya Zhu, Weisi Shi, Shuai Fan
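The short-speech handling in this abstract can be sketched in Python under some loud assumptions: the duration threshold, the heartbeat window, the `Segment` type, and the `is_sentence` callback (a stand-in for the language prediction model) are all hypothetical, and the abstract gives no concrete values or interfaces for any of them.

```python
from dataclasses import dataclass
from typing import Callable, Optional

SHORT_SPEECH_MAX_SECONDS = 1.0  # hypothetical threshold for "short speech"
HEARTBEAT_WINDOW_SECONDS = 2.0  # hypothetical heartbeat protection time range


@dataclass
class Segment:
    start: float  # marked start time, in seconds
    end: float    # marked end time, in seconds
    text: str     # recognition result from the audio decoder


def is_short_speech(segment: Segment) -> bool:
    return segment.end - segment.start <= SHORT_SPEECH_MAX_SECONDS


def combine_if_sentence(
    first: Segment,
    second: Optional[Segment],
    is_sentence: Callable[[str], bool],
) -> Optional[str]:
    """Join two recognition results when the first is short, the second
    arrives within the heartbeat window, and the pair forms a sentence."""
    if not is_short_speech(first) or second is None:
        return None
    if second.start - first.end > HEARTBEAT_WINDOW_SECONDS:
        return None
    combined = f"{first.text} {second.text}"
    return combined if is_sentence(combined) else None
```

In this sketch the caller would generate the answering instruction only when `combine_if_sentence` returns a string; a `None` result means the two pieces are handled separately.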
-
Publication number: 20220399020
Abstract: The present disclosure discloses a man-machine dialogue mode switching method, which is applicable to an electronic device. The method includes receiving a current user sentence spoken by a current user; determining whether a dialogue field to which the current user sentence belongs is a preset dialogue field; if yes, switching the current dialogue mode to a full-duplex dialogue mode; and if not, switching the current dialogue mode to a half-duplex dialogue mode. In the present disclosure, the dialogue mode is switched by determining whether the dialogue field to which the current user sentence belongs is the preset dialogue field, and the dialogue mode can be automatically switched and adjusted according to the difference of the dialogue fields, such that the man-machine dialogue is always in the most suitable dialogue mode and can be realized smoothly.
Type: Application
Filed: November 25, 2019
Publication date: December 15, 2022
Applicant: AI Speech Co., Ltd.
Inventors: Hongbo SONG, Weisi SHI, Chengya ZHU, Shuai FAN
-
Publication number: 20220165269
Abstract: An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end.
Type: Application
Filed: November 25, 2019
Publication date: May 26, 2022
Applicant: AI SPEECH CO., LTD
Inventors: Hongbo SONG, Chengya ZHU, Weisi SHI, Shuai FAN