Patents by Inventor Yongkang Xie

Yongkang Xie has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250077780
    Abstract: A method for invoking a plugin of a large language model includes: acquiring natural language content; performing semantic understanding on the natural language content and detecting whether the natural language content hits a plugin to obtain a first plugin pointed to by the plugin hit result; comparing the first plugin with a second plugin corresponding to the current session understanding task to determine a to-be-executed session understanding task and a third plugin corresponding to the to-be-executed session understanding task; acquiring the language understanding content of the to-be-executed session understanding task and sending the language understanding content to the large language model to obtain the input parameter of the third plugin; and calling the third plugin according to the input parameter of the third plugin to obtain the calling result of the to-be-executed session understanding task.
    Type: Application
    Filed: June 20, 2024
    Publication date: March 6, 2025
    Inventors: Yongkang Xie, Guming Gao, Penghao Zhao, Xue Xiong, Qian Wang, Dongze Xu, En Shi, Yuxuan Li, Sheng Zhou, Shupeng Li, Yao Wang, Zhou Xin
  • Patent number: 12182546
    Abstract: A method for model production includes acquiring a related operation for model production from a user interface layer of a model production system, and determining a software platform of the model production system; acquiring a model service corresponding to the related operation by invoking an application programming interface (API) corresponding to the related operation, wherein the API is located between the user interface layer and other layer in the model production system; performing the model service by invoking local resources of the software platform with a tool of the software platform adapted to the model service, to generate a target model; and applying the target model in a target usage scene.
    Type: Grant
    Filed: August 16, 2022
    Date of Patent: December 31, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: En Shi, Yongkang Xie, Zihao Pan, Shupeng Li, Xiaoyu Chen, Zhengyu Qian, Jingqiu Li
  • Patent number: 11954011
    Abstract: An apparatus and a method for executing a customized production line using an artificial intelligence development platform, a computing device and a computer readable storage medium are provided. The apparatus includes: a production line executor configured to generate a native form of the artificial intelligence development platform based on a file set, the native form to be sent to a client accessing the artificial intelligence development platform so as to present a native interactive page of the artificial intelligence development platform; and a standardized platform interface configured to provide an interaction channel between the production line executor and the artificial intelligence development platform. The production line executor is further configured to generate an intermediate result by executing processing logic defined in the file set and to process the intermediate result by interacting with the artificial intelligence development platform via the standardized platform interface.
    Type: Grant
    Filed: October 28, 2020
    Date of Patent: April 9, 2024
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Yongkang Xie, Ruyue Ma, Zhou Xin, Hao Cao, Kuan Shi, Yu Zhou, Yashuai Li, En Shi, Zhiquan Wu, Zihao Pan, Shupeng Li, Mingren Hu, Tian Wu
  • Patent number: 11567103
    Abstract: A testing device is disclosed including a plurality of elastic members, a plurality of elastic terminals, and a plurality of terminal boards. Each elastic member is provided with an arc-shaped elastic deformation portion, at least one elastic terminal is arranged as one set and is clamped on one elastic member with an inner arc of the elastic deformation portion. Each terminal board is provided with a recess for accommodating one of the elastic members, the recess is provided with at least one arc-shaped groove each matched with a respective elastic terminal, an outer arc of the elastic deformation portion is embedded in a respective arc-shaped groove, each arc-shaped groove has an upper end extending to an upper surface of the terminal board, and a lower end extending to a lower surface of the terminal board.
    Type: Grant
    Filed: July 19, 2021
    Date of Patent: January 31, 2023
    Assignee: QUANWISE MICROELECTRONICS (ZHUHAI) CO., LTD.
    Inventors: Guangmin Huang, Wei Xie, Yongkang Xie
  • Publication number: 20220391182
    Abstract: A method for model production includes acquiring a related operation for model production from a user interface layer of a model production system, and determining a software platform of the model production system; acquiring a model service corresponding to the related operation by invoking an application programming interface (API) corresponding to the related operation, wherein the API is located between the user interface layer and other layer in the model production system; performing the model service by invoking local resources of the software platform with a tool of the software platform adapted to the model service, to generate a target model; and applying the target model in a target usage scene.
    Type: Application
    Filed: August 16, 2022
    Publication date: December 8, 2022
    Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
    Inventors: En Shi, Yongkang Xie, Zihao Pan, Shupeng Li, Xiaoyu Chen, Zhengyu Qian, Jingqiu Li
  • Publication number: 20220309395
    Abstract: The present disclosure discloses a method and an apparatus for adapting a deep learning model, an electronic device and a medium, which relate to the technical fields of artificial intelligence, deep learning, and cloud computing. A specific implementation is: obtaining model information of an original deep learning model and hardware information of a target hardware to be adapted; querying a conversion path table according to the model information and the hardware information to obtain a matched target conversion path; and converting, according to the target conversion path, the original deep learning model to an intermediate deep learning model in the conversion path, and converting the intermediate deep learning model to a target deep learning model.
    Type: Application
    Filed: September 16, 2020
    Publication date: September 29, 2022
    Inventors: Tuobang WU, En SHI, Yongkang XIE, Xiaoyu CHEN, Lianghuo ZHANG, Jie LIU, Binbin XU
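The conversion-path lookup described in the abstract above can be pictured with a minimal sketch. It is only an illustration under stated assumptions, not the method claimed in the application: the table contents, the format names, and the helpers `find_conversion_path` and `plan_conversion` are all hypothetical, and the "conversion" is reduced to listing the hops the model would take.

```python
from typing import Dict, List, Tuple

# Hypothetical conversion path table (assumption, not from the application):
# (source framework, target hardware) -> ordered model formats to pass through.
CONVERSION_PATHS: Dict[Tuple[str, str], List[str]] = {
    ("pytorch", "edge_npu"): ["pytorch", "onnx", "edge_npu_format"],
    ("tensorflow", "edge_npu"): ["tensorflow", "onnx", "edge_npu_format"],
}


def find_conversion_path(framework: str, hardware: str) -> List[str]:
    """Query the table with model information and hardware information."""
    key = (framework.lower(), hardware.lower())
    if key not in CONVERSION_PATHS:
        raise LookupError(f"no conversion path for {key}")
    return CONVERSION_PATHS[key]


def plan_conversion(model_name: str, framework: str, hardware: str) -> List[str]:
    """List each hop: original -> intermediate model(s) -> target model."""
    path = find_conversion_path(framework, hardware)
    return [f"{model_name}: {src} -> {dst}" for src, dst in zip(path, path[1:])]
```

Calling `plan_conversion("resnet", "PyTorch", "edge_npu")` yields two hops, one to the intermediate format and one to the target format; a real system would run an actual converter at each hop.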
  • Patent number: 11455173
    Abstract: A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one model service according to the calling information; and at least deleting all instances of the at least one model service in response to that the determined activity meets a first condition.
    Type: Grant
    Filed: March 19, 2021
    Date of Patent: September 27, 2022
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Zhengxiong Yuan, En Shi, Yongkang Xie, Mingren Hu, Zhengyu Qian, Zhenfang Chu
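As a rough illustration of the abstract above, the following sketch derives an activity value from calling information and deletes all instances of a service whose activity meets a condition. The patent does not specify how activity is computed or what the first condition is; the call-count-in-a-time-window measure, the `min_calls` threshold, and every name here are assumptions.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ModelService:
    name: str
    instances: List[str]
    # Calling information: timestamps (in seconds) of past calls (assumed form).
    call_timestamps: List[float] = field(default_factory=list)


def activity(service: ModelService, now: float, window_seconds: float) -> int:
    """Assumed activity measure: number of calls inside the recent window."""
    start = now - window_seconds
    return sum(1 for t in service.call_timestamps if start <= t <= now)


def reclaim_idle_services(
    services: List[ModelService],
    now: float,
    window_seconds: float = 3600.0,
    min_calls: int = 1,
) -> List[str]:
    """Delete all instances of services whose activity is below min_calls."""
    reclaimed = []
    for service in services:
        if activity(service, now, window_seconds) < min_calls:
            service.instances.clear()  # stand-in for real instance deletion
            reclaimed.append(service.name)
    return reclaimed
```

With `now=10000.0` and the default one-hour window, a service last called at `t=100.0` loses its instances while a service called at `t=9500.0` keeps them.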
  • Publication number: 20220276899
    Abstract: A resource scheduling method and apparatus, a device, and a storage medium are provided, relating to the field of computer technology, and in particular to the field of deep learning technology. The method includes: acquiring a graphics processing unit (GPU) topology relationship of a cluster according to GPU connection information of each of computing nodes in the cluster; and in a case where a task request, for applying for a GPU resource, for a target task is received, determining a target computing node of the target task and a target GPU in the target computing node according to the task request and the GPU topology relationship, to complete GPU resource scheduling of the target task. The present disclosure can optimize the resource scheduling.
    Type: Application
    Filed: May 13, 2022
    Publication date: September 1, 2022
    Inventors: Binbin XU, Liang TANG, Ying ZHAO, Shupeng LI, En SHI, Zhengyu QIAN, Yongkang XIE
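One way to picture the topology-aware scheduling in the abstract above is the sketch below. It is an assumption-laden illustration rather than the claimed method: the link types and their scores, the exhaustive search over GPU combinations, and the function names are all hypothetical.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Tuple

# Assumed scoring of GPU-to-GPU links; higher means a faster interconnect.
LINK_SCORE = {"nvlink": 2, "pcie": 1}

Topology = Dict[str, Dict[FrozenSet[int], int]]


def build_topology(connection_info: Dict[str, List[Tuple[int, int, str]]]) -> Topology:
    """Turn per-node GPU connection information into pairwise link scores."""
    return {
        node: {frozenset((a, b)): LINK_SCORE[link] for a, b, link in links}
        for node, links in connection_info.items()
    }


def schedule(
    topology: Topology, free_gpus: Dict[str, List[int]], num_gpus: int
) -> Tuple[str, List[int]]:
    """Pick the node and GPU set with the highest total pairwise link score."""
    best = None  # (score, node, gpu_combo)
    for node in sorted(topology):
        for combo in combinations(sorted(free_gpus.get(node, [])), num_gpus):
            score = sum(
                topology[node].get(frozenset(pair), 0)
                for pair in combinations(combo, 2)
            )
            if best is None or score > best[0]:
                best = (score, node, combo)
    if best is None:
        raise RuntimeError("no node has enough free GPUs")
    return best[1], list(best[2])
```

The exhaustive search keeps the sketch short; it grows combinatorially with the number of free GPUs per node, so a production scheduler would need a cheaper selection strategy.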
  • Publication number: 20220253372
    Abstract: An apparatus and a method for executing a customized production line using an artificial intelligence development platform, a computing device and a computer readable storage medium are provided. The apparatus includes: a production line executor configured to generate a native form of the artificial intelligence development platform based on a file set, the native form to be sent to a client accessing the artificial intelligence development platform so as to present a native interactive page of the artificial intelligence development platform; and a standardized platform interface configured to provide an interaction channel between the production line executor and the artificial intelligence development platform. The production line executor is further configured to generate an intermediate result by executing processing logic defined in the file set and to process the intermediate result by interacting with the artificial intelligence development platform via the standardized platform interface.
    Type: Application
    Filed: October 28, 2020
    Publication date: August 11, 2022
    Inventors: Yongkang XIE, Ruyue MA, Zhou XIN, Hao CAO, Kuan SHI, Yu ZHOU, Yashuai LI, En SHI, Zhiquan WU, Zihao PAN, Shupeng LI, Mingren HU, Tian WU
  • Publication number: 20220229089
    Abstract: A testing device is disclosed including a plurality of elastic members, a plurality of elastic terminals, and a plurality of terminal boards. Each elastic member is provided with an arc-shaped elastic deformation portion, at least one elastic terminal is arranged as one set and is clamped on one elastic member with an inner arc of the elastic deformation portion. Each terminal board is provided with a recess for accommodating one of the elastic members, the recess is provided with at least one arc-shaped groove each matched with a respective elastic terminal, an outer arc of the elastic deformation portion is embedded in a respective arc-shaped groove, each arc-shaped groove has an upper end extending to an upper surface of the terminal board, and a lower end extending to a lower surface of the terminal board.
    Type: Application
    Filed: July 19, 2021
    Publication date: July 21, 2022
    Applicant: Quanwise Microelectronics (Zhuhai) Co., Ltd.
    Inventors: Guangmin HUANG, Wei XIE, Yongkang XIE
  • Publication number: 20220067375
    Abstract: A method includes: determining at least one typical object ratio from a first training data set by counting ratios of objects in training pictures of the first training data set; determining at least one picture scaling size based at least on the at least one typical object ratio; scaling the training pictures of the first training data set according to the at least one picture scaling size; obtaining a second training data set by slicing the scaled training pictures; training an object detection model using the second training data set; and performing object detection on a to-be-detected picture using the trained object detection model. The object detection method according to the embodiments of the present disclosure can be used to complete, without manual intervention, a task of detecting an extremely small object.
    Type: Application
    Filed: March 12, 2021
    Publication date: March 3, 2022
    Inventors: Penghao ZHAO, Haibin ZHANG, Shupeng LI, En SHI, Yongkang XIE
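The scale-then-slice idea in the abstract above comes down to simple arithmetic, sketched here under assumptions of my own: the typical object ratio is taken as the median of per-object ratios (the application only says ratios are counted), the scaling size is chosen so that a typical object reaches a hypothetical target pixel size, and slices are square tiles with a fixed overlap.

```python
from statistics import median
from typing import List, Tuple


def typical_object_ratio(samples: List[Tuple[int, int]]) -> float:
    """samples: (object_side_px, image_side_px) pairs; median ratio is an assumption."""
    return median(obj / img for obj, img in samples)


def scaled_side(ratio: float, target_object_px: int) -> int:
    """Image side length at which an object of the given ratio spans target_object_px."""
    return round(target_object_px / ratio)


def slice_offsets(side: int, tile: int, overlap: int) -> List[int]:
    """Start offsets of overlapping tiles that together cover [0, side)."""
    step = tile - overlap
    offsets = list(range(0, max(side - tile, 0) + 1, step))
    if offsets[-1] + tile < side:
        offsets.append(side - tile)  # final tile flush with the image edge
    return offsets
```

For example, objects that typically occupy 2% of the image side need the image scaled to a 1600-pixel side to reach 32 pixels, and 640-pixel tiles with a 160-pixel overlap then start at offsets 0, 480, and 960 along each axis.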
  • Patent number: 11210608
    Abstract: A method and apparatus for generating a model, and a method and apparatus for recognizing information are provided. An implementation of the method for generating a model includes: acquiring a to-be-converted model, a topology description of the to-be-converted model, and device information of a target device; converting, based on the topology description and the device information, parameters and operators of the to-be-converted model to obtain a converted model applicable to the target device; and generating a deep learning prediction model based on the converted model. This embodiment enables the conversion of an existing model to a deep learning prediction model that can be applied to a target device.
    Type: Grant
    Filed: May 28, 2019
    Date of Patent: December 28, 2021
    Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.
    Inventors: Yongkang Xie, En Shi, Xiaoyu Chen, Shupeng Li, Shimin Ruan, Tuobang Wu, Ying Zhao, Lianghuo Zhang
  • Publication number: 20210216805
    Abstract: The present application discloses an image recognition method and apparatus, an electronic device and a storage medium, and relates to the field of neural networks and deep learning. An implementation solution may be as follows: loading a first image recognition model; inputting an image to be recognized into the first image recognition model; predicting the image to be recognized by using the first image recognition model to obtain an output result of a network layer of the first image recognition model; and performing post-processing on the output result of the network layer of the first image recognition model, to obtain an image recognition result.
    Type: Application
    Filed: March 18, 2021
    Publication date: July 15, 2021
    Inventors: Xiangxiang LV, En SHI, Yongkang XIE
  • Publication number: 20210211361
    Abstract: A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one model service according to the calling information; and at least deleting all instances of the at least one model service in response to that the determined activity meets a first condition.
    Type: Application
    Filed: March 19, 2021
    Publication date: July 8, 2021
    Inventors: Zhengxiong Yuan, En Shi, Yongkang Xie, Mingren Hu, Zhengyu Qian, Zhenfang Chu
  • Publication number: 20190370685
    Abstract: A method and apparatus for generating a model, and a method and apparatus for recognizing information are provided. An implementation of the method for generating a model includes: acquiring a to-be-converted model, a topology description of the to-be-converted model, and device information of a target device; converting, based on the topology description and the device information, parameters and operators of the to-be-converted model to obtain a converted model applicable to the target device; and generating a deep learning prediction model based on the converted model. This embodiment enables the conversion of an existing model to a deep learning prediction model that can be applied to a target device.
    Type: Application
    Filed: May 28, 2019
    Publication date: December 5, 2019
    Inventors: Yongkang Xie, En Shi, Xiaoyu Chen, Shupeng Li, Shimin Ruan, Tuobang Wu, Ying Zhao, Lianghuo Zhang