Patents Assigned to INSTITUTE OF AUTOMATION CHINESE ACADEMY OF SCIENCES
  • Publication number: 20230027645
    Abstract: Disclosed is a hierarchical generated audio detection system, comprising an audio preprocessing module, a CQCC feature extraction module, a LFCC feature extraction module, a first-stage lightweight coarse-level detection model and a second-stage fine-level deep identification model; the audio preprocessing module preprocesses collected audio or video data to obtain an audio clip with a length not exceeding the limit; inputting the audio clip into CQCC feature extraction module and LFCC feature extraction module respectively to obtain CQCC feature and LFCC feature; inputting CQCC feature or LFCC feature into the first-stage lightweight coarse-level detection model for first-stage screening to screen out the first-stage real audio and the first-stage generated audio; inputting the CQCC feature or LFCC feature of the first-stage generated audio into the second-stage fine-level deep identification model to identify the second-stage real audio and the second-stage generated audio, and the second-stage generated au
    Type: Application
    Filed: February 17, 2022
    Publication date: January 26, 2023
    Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua TAO, Zhengkun TIAN, Jiangyan YI
  • Patent number: 11528684
    Abstract: A resource allocation method for coexistence of multiple line topological industrial wireless networks is provided. It pertains to the coexistence problem of multiple TDMA-based line topological industrial wireless networks, including three parts: lower bound analysis of scheduling delay, allocation algorithm of inter-network resources and allocation algorithm of intra-network resources. The method uses overall scheduling delay and resource utilization ratio as measurement indexes when analyzing the lower bound of delay and designing resource allocation algorithms, and selects a best node combination in each time slot to occupy as many channel resources as possible to improve the resource utilization ratio and reduce the overall scheduling delay.
    Type: Grant
    Filed: November 20, 2019
    Date of Patent: December 13, 2022
    Assignee: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Wei Liang, Jialin Zhang, Meng Zheng, Sichao Zhang, Kai Wang, Shuai Liu
  • Patent number: 11521629
    Abstract: Disclosed is a digital audio tampering forensics method based on phase offset detection, comprising: multiplying a signal to be identified with a time label to obtain a modulation signal of the signal to be identified; then, performing a short-time Fourier transform on the signal to be identified and the modulation signal to obtain a signal power spectrum and a modulation signal power spectrum; computing group delay characteristics by using the signal power spectrum and the modulation signal power spectrum; computing a mean value of the group delay characteristics, and then using the mean value results for smoothing computation to obtain phase information of a current frame signal; computing a dynamic threshold by using the phase information of the current frame signal, and then deciding whether the signal is tampered by using the dynamic threshold and the phase information of the current frame signal.
    Type: Grant
    Filed: February 9, 2022
    Date of Patent: December 6, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Shan Liang, Shuai Nie, Jiangyan Yi
  • Publication number: 20220379490
    Abstract: Provided are a device, a system and a method for acquiring a force information based on a bionic structure, including: a force information acquisition layer and a magnetic field signal acquisition chip; wherein a permanent magnet is embedded in the force information acquisition layer; wherein the force information acquisition layer has an elastic structure configured to generate a deformation corresponding to a first force information of a force after being subjected to the force, so that the permanent magnet moves with the deformation to generate a magnetic field signal corresponding to the force information; wherein the magnetic field signal acquisition chip is arranged in parallel with the force information acquisition layer, and is configured to acquire the magnetic field signal and convert the magnetic field signal into an electrical signal.
    Type: Application
    Filed: May 24, 2022
    Publication date: December 1, 2022
    Applicant: Institute of Automation, Chinese Academy of Sciences
    Inventors: Xiaohu ZHOU, Zengguang HOU, Meijiang GUI, Xiaoliang XIE, Shiqi LIU, Zhenqiu FENG, Yanjie ZHOU, Lingwu MENG, Hao LI
  • Patent number: 11501759
    Abstract: Disclosed are a method and a system for speech recognition, an electronic device and a storage medium, which relates to the technical field of speech recognition. Embodiments of the application comprise performing encoded representation on an audio to be recognized to obtain an acoustic encoded state vector sequence of the audio to be recognized; performing sparse encoding on the acoustic encoded state vector sequence of the audio to be recognized to obtain an acoustic encoded sparse vector; determining a text prediction vector of each label in a preset vocabulary; recognizing the audio to be recognized and determining a text content corresponding to the audio to be recognized according to the acoustic encoded sparse vector and the text prediction vector. The acoustic encoded sparse vector of the audio to be recognized is obtained by performing sparse encoding on the acoustic encoded state vector of the audio to be recognized.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: November 15, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Zhengkun Tian, Jiangyan Yi
  • Patent number: 11487950
    Abstract: The method of the present disclosure includes: obtaining an image to be processed and a question text corresponding to the image; using an optimized dialogue model to encode the image into an image vector and encode the question text into a question vector; generating a state vector based on the image vector and the question vector; decoding the state vector to obtain and output an answer text. A discriminator needs to be introduced in an optimization process of the optimized dialogue model. The dialogue model and the discriminator are alternately optimized until a value of a hybrid loss function of the dialogue model and a value of a loss function of the discriminator do not decrease or fall below a preset value, thereby accomplishing the optimization process.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: November 1, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jiaming Xu, Yiqun Yao, Bo Xu
  • Patent number: 11488308
    Abstract: A three-dimensional object detection method includes: extracting a target in a two-dimensional image by a pre-trained deep convolutional neural network to obtain a plurality of target objects; determining a point cloud frustum in a corresponding three-dimensional point cloud space based on each target object; segmenting the point cloud in the frustum based on a point cloud segmentation network to obtain a point cloud of interest; and estimating parameters of a 3D box in the point cloud of interest based on a network with the weighted channel features to obtain the parameters of the 3D box for three-dimensional object detection. According to the present invention, the features of the image can be learned more accurately by the deep convolutional neural network and the parameters of the 3D box in the point cloud of interest are estimated based on the network with the weighted channel features.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: November 1, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Xin Zhao, Kaiqi Huang, Zhe Liu
  • Patent number: 11488586
    Abstract: Disclosed is a system for speech recognition text enhancement fusing multi-modal semantic invariance, the system includes an acoustic feature extraction module, an acoustic down-sampling module, an acoustic feature extraction module, an acoustic down-sampling module, an encoder and a decoder fusing multi-modal semantic invariance; the acoustic feature extraction module is configured for frame-dividing processing of speech data, dividing the speech data into short-term audio frames with a fixed length, extracting thank acoustic features from the short-term audio frames, and inputting the acoustic features into the acoustic down-sampling module for down-sampling to obtain an acoustic representation; inputting the speech data into an existing speech recognition module to obtain input text data, and inputting the input text data into the encoder to obtain an input text encoded representation; inputting the acoustic representation and the input text encoded representation into the decoder to fuse.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: November 1, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Shuai Zhang, Jiangyan Yi
  • Patent number: 11475877
    Abstract: Disclosed are an end-to-end system for speech recognition and speech translation and an electronic device. The system comprises an acoustic encoder and a multi-task decoder and a semantic invariance constraint module, and completes two tasks for speech recognition and speech translation. In addition, according to the characteristic of the semantic consistency of texts between different tasks, semantic constraints are imposed on the model to learn high-level semantic information, and the semantic information can effectively improve the performance of speech recognition and speech translation. The application has the following advantages that the error accumulation problem of serial system is avoided, and the calculation cost of the model is low and the real-time performance is very high.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: October 18, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Shuai Zhang, Jiangyan Yi
  • Patent number: 11469838
    Abstract: A method and device for implementing an FPGA-based large-scale radio frequency interference array correlator are provided. The method includes: obtaining the number of channels of data of a radio frequency interference array, and performing average division; calculating the total correlation of data group and the total correlation between the data group and other data groups respectively through corresponding correlation calculation modules, and performing an accumulation calculation in an integration period to complete the total correlation operation of the radio frequency interference array. By means of grouping division and time division multiplexing, the FPGA resource is effectively utilized, and the calculation process of FPGA is simplified. The new method is suitable for the operation process of the system with high parallelism and high real-time requirements, and provides a high-efficiency solution for the real-time calculation of massive data of the large-scale radio frequency interference array.
    Type: Grant
    Filed: May 25, 2020
    Date of Patent: October 11, 2022
    Assignees: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES, GUANGZHOU ARTIFICIAL INTELLIGENCE AND ADVANCED COMPUTING INSTITUTE OF CASIA
    Inventors: Yafang Song, Jie Hao, Jun Liang, Lin Shu, Liangtian Zhao, Qiuxiang Fan, Hui Feng, Wenqing Hu
  • Patent number: 11467562
    Abstract: An online monitoring device of 3D printing equipment includes a signal collection module, a signal processing module, a feature extraction module, a monitoring module and a knowledge base module. A vibration signal of a preset component of the 3D printing equipment is collected by a vibration sensor. The collected vibration signal of each preset component is converted from an analog signal to a digital signal and the spectrum characteristics are extracted. Based on the spectrum characteristics of each preset component, the operation state type of the preset component is obtained by a comparative analysis model. The knowledge base module is configured to store newly added samples and initial samples of the 3D printing equipment. The initial samples include spectrum characteristic information and corresponding fault category of known faults, and the newly added samples include spectrum characteristic information and corresponding fault category of new faults.
    Type: Grant
    Filed: July 7, 2020
    Date of Patent: October 11, 2022
    Assignees: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES, CLOUD COMPUTING CENTER CHINESE ACADEMY OF SCIENCES, DongGuan, Guangdong (CN)
    Inventors: Gang Xiong, Jiawei Liao, Zhen Shen, Xiuqin Shang, Chao Guo, Jun Yan, Can Luo, Xiao Wang, Feiyue Wang
  • Patent number: 11458599
    Abstract: The present invention relates to a quick clamping device.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: October 4, 2022
    Assignee: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jinguo Liu, Keli Chen, Haodong Chi, Cao Tong
  • Patent number: 11458638
    Abstract: A robot multi-degree-of-freedom clamper has a short stroke biaxial cylinder installed on the clamping jaw supporting frame and an output end connected with a pneumatic clamping jaw A. In addition, a clamping jaw finger A is connected with an output end of the pneumatic clamping jaw A. A long stroke biaxial cylinder is connected with a pneumatic clamping jaw B. A clamping jaw finger B is connected with the output end of the pneumatic clamping jaw B. A pneumatic clamping jaw C is positioned between the pneumatic clamping jaw A and the pneumatic clamping jaw B. A clamping jaw finger C is connected with the output end of the pneumatic clamping jaw C. The clamping jaw finger A and the pneumatic clamping jaw A are driven by the short stroke biaxial cylinder to move back and forth on the clamping jaw supporting frame.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: October 4, 2022
    Assignee: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jinguo Liu, Yunjun Liu
  • Patent number: 11462207
    Abstract: Disclosed are a method and an apparatus for editing audio, an electronic device and a storage medium. The method includes: acquiring a modified text obtained by modifying a known original text of an audio to be edited according to a known text for modification; predicting a duration of an audio corresponding to the text for modification; adjusting a region to be edited of the audio to be edited according to the duration of the audio corresponding to the text for modification, to obtain an adjusted audio to be edited; obtaining, based on a pre-trained audio editing model, an edited audio according to the adjusted audio to be edited and the modified text. In the present disclosure, the edited audio obtained by the audio editing model sounds natural in the context, and supports the function of synthesizing new words that do not appear in the corpus.
    Type: Grant
    Filed: May 5, 2022
    Date of Patent: October 4, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Tao Wang, Jiangyan Yi, Ruibo Fu
  • Patent number: 11455548
    Abstract: Disclosed is an acquisition method for domain rule knowledge of an industrial process. The method comprises the steps of: establishing a domain rule base, establishing a semantic knowledge base, and combining the domain rule base and the semantic knowledge base so as to realize an augmented update of a domain rule knowledge base; describing the domain knowledge of the industrial process by using weighted first-order logic rules so as to form a training sample set of the first-order logic rules; performing a weight learning by applying probability soft logic and the training sample set of the first-order logic rules so as to realize weight to non-weighted rules; performing rule learning through a machine learning algorithm so as to obtain a first-order logic rule on a change in optimization decision-making semantic when multi-source data semantic information changes.
    Type: Grant
    Filed: December 19, 2021
    Date of Patent: September 27, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jie Tan, Chengbao Liu
  • Patent number: 11456950
    Abstract: The present invention discloses a data forwarding unit based on a Handle identifier, comprising a dynamic configuration module, a Handle identifier data identification module and a matching-forwarding module. The system of the present invention is applied to network devices such as switches and routers, and supports dynamic configuration of data packet analysis, matching and forwarding rules through data interaction with network systems such as SDN managers, so that the network devices can identify data packets based on the Handle identifier and perform the specified operation on the designated data packets with the Handle identifier according to the rules of dynamic configuration.
    Type: Grant
    Filed: December 19, 2019
    Date of Patent: September 27, 2022
    Assignee: SHENYANG INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Haibin Yu, Peng Zeng, Dong Li, Zhibo Li, Jindi Liu, Xueting Yu, Ming Yang
  • Patent number: 11436745
    Abstract: A reconstruction method of a three-dimensional (3D) human body model includes: acquiring, by a fully convolutional network (FCN) module, a global UVI map and a local UVI map of a body part according to a human body image (S1); estimating, by a first neural network, a camera parameter and a shape parameter of the human body model based on the global UVI map (S2); extracting, by a second neural network, rotation features of joints of a human body based on the local UVI map (S3); refining, by using a position-aided feature refinement strategy, the rotation features of the joints of the human body to acquire refined rotation features (S4); and estimating, by a third neural network, a pose parameter of the human body model based on the refined rotation features (S5). The reconstruction method achieves accurate and efficient reconstruction of the human body model, and improves robustness of pose estimation.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: September 6, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Zhenan Sun, Hongwen Zhang, Wanli Ouyang, Jie Cao
  • Patent number: 11432749
    Abstract: A non-contact brain blood oxygen detecting system includes a mobile terminal device. The mobile terminal device includes a control module, a transmitting module, a receiving module and a display module. The control module is connected to the transmitting module, the receiving module and the display module, respectively. The transmitting module in the mobile terminal device is configured to emit dual-wavelength near-infrared light to a detected subject. The receiving module is configured to receive a light signal after propagation fed back by the detected subject, and to perform data conversion on the received light signal to obtain a digital signal containing blood oxygen information. The control module is configured to obtain the blood oxygen information of the detected subject according to the digital signal obtained by the receiving module. The display module is configured to display the blood oxygen information obtained by the control module.
    Type: Grant
    Filed: December 28, 2017
    Date of Patent: September 6, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Tianzi Jiang, Xin Zhang, Nianming Zuo
  • Patent number: 11429818
    Abstract: A multi-label object detection method based on an object detection network includes: selecting an image of an object to be detected as an input image; based on a trained object detection network, obtaining a class of the object to be detected, coordinates of a center of the object to be detected, and a length and a width of a detection rectangular box according to the input image; and outputting the class of the object to be detected, the coordinates of the center of the object to be detected, and the length and the width of the detection rectangular box. The method of the present invention can perform real-time and accurate object detection on different classes of objects with improved detection speed and accuracy, and can solve the problem of object overlapping and occlusion during the object detection.
    Type: Grant
    Filed: December 10, 2019
    Date of Patent: August 30, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Guodong Yang, Yunong Tian, En Li, Zize Liang, Min Tan, Fengshui Jing, Zishu Gao, Hao Wang, Yuansong Sun, Sixi Lu
  • Publication number: 20220265184
    Abstract: Disclosed is an automatic depression detection method using audio-video, including: acquiring original data containing two modalities of long-term audio file and long-term video file from an audio-video file; dividing the long-term audio file into several audio segments, and meanwhile dividing the long-term video file into a plurality of video segments; inputting each audio segment/each video segment into an audio feature extraction network/a video feature extraction network to obtain in-depth audio features/in-depth video features; calculating the in-depth audio features and the in-depth video features by using multi-head attention mechanism so as to obtain attention audio features and attention video features; aggregating the attention audio features and the attention video features into audio-video features; and inputting the audio-video features into a decision network to predict a depression level of an individual in the audio-video file.
    Type: Application
    Filed: September 10, 2021
    Publication date: August 25, 2022
    Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua TAO, Cong CAI, Bin LIU, Mingyue NIU